KR20190100506A

KR20190100506A - Method for forecasting of promising patents and apparatus thereof

Info

Publication number: KR20190100506A
Application number: KR1020180014775A
Authority: KR
Inventors: 이성주; 노희용
Original assignee: 아주대학교산학협력단
Priority date: 2018-02-06
Filing date: 2018-02-06
Publication date: 2019-08-29
Also published as: KR102029236B1

Abstract

Disclosed are a method for forecasting promising patents and an apparatus thereof. The method for forecasting promising patents to forecast promising of an analysis target patent by a promising patent predicting device comprises the following steps: obtaining bibliographic information of the analysis target patent; calculating a first variable value and a second variable value based on detailed technology elements of technology classification in the bibliographic information; and predicting the number of citations in the future for the analysis target patent by applying deep learning to the calculated first variable value and second variable value.

Description

Method for forecasting of promising patents and apparatus

본 발명은 특허 유망성 예측 방법 및 그 장치에 관한 것으로, 특히 특허의 발명·기술적 특징을 확인할 수 있는 사전지표를 이용하여 분석 대상 특허의 유망성을 예측할 수 있는 특허 유망성 예측 방법 및 그 장치에 관한 것이다. TECHNICAL FIELD The present invention relates to a method for predicting a patent and a device thereof, and more particularly, to a method for predicting a patent and a device for predicting the promising potential of a patent to be analyzed using prior indicators capable of confirming the invention and technical features of the patent.

기술 예측은 미래의 기술 변화에 대한 전망을 가능하게 한다. 기술 변화로 인한 영향력은 국가마다 다르지만, 일반적으로 기업들은 그들의 경쟁분야에서 기술 변화를 파악하고 선도함으로써, 이익 창출의 기회를 마련한다.Technology forecasts enable the prospect of future technology changes. While the impact of technological change varies from country to country, companies typically create opportunities for profit by identifying and leading technology changes in their competition.

한편, 많은 연구 결과를 통해 확인된 것처럼, 기술 변화에 부적절하게 대응하면 오래 지속되던 회사도 쇠퇴할 수 있다. 따라서 기술 변화에 대한 적절한 예상 및 예측은 우선순위 설정, 자원 할당 및 위험 감소와 같은 경영문제에 있어서 의사결정을 돕는데 매우 중요하다.On the other hand, as evidenced by many studies, improperly responding to technological changes can cause long-lasting companies to decline. Appropriate forecasts and forecasts for technological change are therefore critical to aid decision making in management issues such as prioritization, resource allocation and risk reduction.

최근에, 기술이 더 빠르고 복잡하게 발전하며 경쟁이 더욱 심화되면서, 기술 예측 또한 전략적으로 더욱 중요해졌다. 그 결과, 기술 예측 과정을 공식화하기 위해 적절한 모형, 방법 및 툴(tool)을 사용한 다양한 시도가 이루어지고 있다.In recent years, as technology advances faster and more complex and competition intensifies, technology prediction has become more strategically important. As a result, various attempts have been made using appropriate models, methods and tools to formulate the technology prediction process.

기술 변화와 혁신 과정은 종래 및 새로운 기술들을 결합하는 메커니즘에 기반한 프로세스로 여겨진다. 이는 분리된 기능보다는 상호 연결된 기술들의 복잡한 기능에 의존하는데, 이러한 점에서 현재 기술 중에서 미래의 기술 변화를 주도할 기술을 예측하는 것은 매우 중요하다.The process of technological change and innovation is seen as a process based on the mechanism of combining conventional and new technologies. It depends on the complex functions of interconnected technologies rather than separate functions, and it is very important to predict which of the current technologies will drive future technological change.

기술 예측에서 중요한 가정들은 크게 두 가지를 들 수 있는데, 현재의 기술 성장은 다가오는 기술 변화의 전조가 된다는 것과 이러한 기술 변화는 관련 기술에 의해 영향을 받을 것이라는 것이다. 현재 기술 성장이 미래 기술에 어떠한 영향을 미칠 것인가를 이해하는 것은 기술 예측에서 가장 중요한 작업 중 하나인데, 이는 다른 활동들에 직접적인 영향을 미치기 때문이다. 이러한 배경 하에서 미래 기술 영향을 평가하기 위한 시스템적 접근방법을 개발하는 것은 전략적 의사 결정에 도움을 주는 중요한 정보를 주기 위해 반드시 필요하다.There are two major assumptions in technology forecasting: current technology growth is a precursor to upcoming technology change and that technology change will be affected by the technology involved. Understanding how current technology growth will affect future technology is one of the most important tasks in technology forecasting because it directly affects other activities. Under this background, developing a systematic approach to assessing future technological impacts is essential to providing important information to aid strategic decision making.

미래 기술 영향을 평가하는 것은 미래에 대한 불확실성, 신뢰할 수 없는 데이터 및 현실적인 피드백의 복잡성 등과 같은 어려움 때문에, 본질적으로 매우 다루기 힘든 작업이다. 이러한 불확실성을 줄이고, 의사 결정 과정을 지원하기 위한 몇 가지 기법이 고안되어 왔지만, 종래의 방법들은 서로 다른 목적에서는 유용하지만, 분석 결과가 데이터 소스에 강하게 의존한다는 한계점을 가지고 있다.Evaluating future technology impacts is inherently difficult to handle because of difficulties such as uncertainty about the future, unreliable data, and the complexity of realistic feedback. While some techniques have been devised to reduce this uncertainty and support the decision making process, conventional methods are useful for different purposes, but have the limitation that analysis results are strongly dependent on the data source.

이러한 한계점을 극복하기 위해 기술 데이터베이스 및 혁신적인 계산 알고리즘 등이 도입되었다. 이는 환경 조사, 기술 로드맵, 계량서지학적 분석 및 창의적 문제 해결 이론과 같이 매우 강력하고 설득력 있는 기술 예측의 개발을 촉진하였는데, 이 가운데 특허 인용 분석은 기술 영향에 대한 개념적 및 경험적 분석을 위해 가장 많이 채택되는 방법 중에 하나이다.To overcome these limitations, technical databases and innovative calculation algorithms have been introduced. This has facilitated the development of very powerful and compelling technical predictions, such as environmental research, technical roadmaps, quantitative bibliographic analysis, and creative problem solving theory, of which patent citation analysis is the most adopted for conceptual and empirical analysis of technological impact. Is one of the ways.

특허 인용 분석의 기본적인 개념은 어떤 특허가 다른 특허를 인용하는 경우 두 특허간 기술적 연결 관계가 존재한다는 것이다. 전통적으로 특허 인용 분석은 기술 변화에 대한 개념적이고 질적인 분석 수요를 충족시켜줄 유일한 기회를 제공할 뿐만 아니라, 경험적으로도 기술 혁신의 대부분을 설명하고 있다고 인식되어 왔다. 즉, 특허 인용 정보는 기술들간의 결합 매커니즘을 분석하기 위한 대표적인 방법으로 미래의 기술 지식 흐름을 추정하고 이를 토대로 미래의 기술 관계를 예측함으로써 다양한 기술 기회를 획득할 수 있다.The basic concept of patent citation analysis is that when a patent cites another patent, there is a technical link between the two patents. Traditionally, patent citation analysis has been recognized not only to provide the only opportunity to meet the conceptual and qualitative demand for analysis of technological change, but to empirically explain most of the technological innovation. That is, the patent citation information is a representative method for analyzing a coupling mechanism between technologies, and thus, various technical opportunities can be obtained by estimating future technical knowledge flows and predicting future technical relationships.

이러한 특허 인용 분석은 기술 예측에 대한 시스템적이면서 경험적인 분석을 가능하게 한다는 측면에서 강점이 있지만, 과거 경향이 미래에도 같이 나타난다고 가정해야 하는 한계점을 가지고 있으며, 이러한 한계점은 결국 특허 인용 분석 결과가 기술 영향의 변화를 적절히 반영하지 못하게 한다.This patent citation analysis has strengths in terms of enabling a systematic and empirical analysis of technology predictions, but has the limitation of assuming that past trends also appear in the future. It does not adequately reflect changes in technological impact.

또한 종래에는 전체 특허를 대상으로 인용정보를 분석하여 미래 유망 특허를 추정하는 연구에만 집중하고 있어, 하나의 특허(예컨대, 출원시의 특허, 피인용 횟수가 없는 특허 등)에 대해 그 특허의 기술발명 특성을 이용하여 유망성을 예측하는 연구는 미흡한 실정이다.In addition, the prior art focuses only on researches for estimating future promising patents by analyzing citation information of all patents, and the technology of the patent for one patent (for example, a patent at the time of filing and a patent without a cited number). There is insufficient research to predict the prospects using the characteristics of the invention.

이에 관련하여, 발명의 명칭이 "특허정보를 이용한 기업성과 예측 장치 및 방법"인 한국공개특허 제2014-0146437호가 존재한다.In this regard, there is a Korean Patent Publication No. 2014-0146437 entitled "Business performance prediction apparatus and method using patent information".

본 발명이 해결하고자 하는 기술적 과제는 특허의 발명·기술적 특징을 확인할 수 있는 사전지표를 이용하여 사후 특허의 유망성을 예측할 수 있는 특허 유망성 예측 방법 및 그 장치를 제공하는 것이다. The technical problem to be solved by the present invention is to provide a patent promising predictive method and apparatus for predicting the promising of the post-patent using a prior indicator that can confirm the invention and technical features of the patent.

본 발명이 해결하고자 하는 다른 기술적 과제는 특허의 기술분류를 이용하여 미래 시점의 피인용횟수, 표준특허 여부, 기술이전 여부를 예측할 수 있는 특허 유망성 예측 방법 및 그 장치를 제공하는 것이다. Another technical problem to be solved by the present invention is to provide a patent promising prediction method and apparatus for predicting the number of citations, standard patents, and technology transfer at a future point of time using the technical classification of the patent.

본 발명의 기술적 과제들은 이상에서 언급한 기술적 과제들로 제한되지 않으며, 언급되지 않은 또 다른 기술적 과제들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.The technical problems of the present invention are not limited to the technical problems mentioned above, and other technical problems not mentioned will be clearly understood by those skilled in the art from the following description.

상기 기술적 과제를 해결하기 위한 본 발명의 일 실시예에 따른 특허 유망성 예측 방법은 특허 유망성 예측 장치가 분석 대상 특허의 유망성을 예측하는 방법에 있어서, 상기 분석 대상 특허의 서지정보를 획득하는 단계, 상기 서지정보내 기술분류의 상세기술요소에 기초하여 제1 변수값 및 제2 변수값을 산출하는 단계, 상기 산출된 제1 변수값 및 제2 변수값에 딥러닝을 적용하여 상기 분석 대상 특허에 대한 미래 시점의 인용횟수를 예측하는 단계를 포함한다. In accordance with an aspect of the present invention, there is provided a method for predicting patent promising, in which a method for predicting promising a patent of an analysis target patent is obtained by a device for predicting a patent, wherein the bibliographic information of the analysis target patent is obtained; Calculating a first variable value and a second variable value based on detailed description elements of the technical classification in the bibliographic information, and applying deep learning to the calculated first variable value and the second variable value, Predicting the number of citations in the future.

바람직하게는, 상기 기술분류는 섹션, 클래스, 서브 클래스, 메인 그룹, 서브 그룹 순으로 계층적으로 기술이 분류된 구조이고, 상기 상세기술요소는 서브클래스일 수 있다. Preferably, the description classification is a structure in which descriptions are classified hierarchically in the order of section, class, subclass, main group, and subgroup, and the detailed description element may be a subclass.

바람직하게는, 상기 제1 변수값을 산출하는 단계는, 상기 분석 대상 특허의 기술분류내 서브클래스들을 조합하여 서브클래스 쌍을 생성하는 단계, 상기 분석 대상 특허의 출원일 전에 출원된 과거 특허들의 기술분류내 서브클래스들을 조합하여 서브클래스 쌍을 생성하는 단계, 상기 분석 대상 특허의 서브클래스 쌍과 상기 과거 특허들의 서브클래스 쌍을 비교하여, 상기 분석 대상 특허에만 있는 서브클래스 쌍을 추출하는 단계, 상기 분석 대상 특허에만 있는 서브클래스 쌍과 상기 분석 대상 특허의 전체 서브클래스 쌍의 수에 기초하여, 제1 변수값을 산출하는 단계를 포함할 수 있다. Preferably, the calculating of the first variable value comprises: generating a subclass pair by combining subclasses in the technical classification of the analysis target patent, and technical classification of past patents filed before the filing date of the analysis target patent. Generating a subclass pair by combining the subclasses of the subclass, comparing the subclass pair of the target patent to a subclass pair of the past patents, and extracting a subclass pair only in the target patent; Calculating a first variable value based on the number of subclass pairs existing only in the target patent and the total number of subclass pairs of the analysis target patent.

바람직하게는, 상기 제2 변수값을 산출하는 단계는, 상기 분석 대상 특허의 출원일 전에 출원된 과거 특허들 중에서 상기 분석 대상 특허의 기술분류와 동일한 기술분류를 가지는 동일 기술분류 특허들을 확인하는 단계, 상기 분석 대상 특허와 각 동일 기술분류 특허간의 출원일 차이를 산출하는 단계, 상기 분석 대상 특허의 서브클래스 수, 상기 동일 기술분류 특허의 수, 상기 출원일 차이를 이용하여 제2 변수값을 산출하는 단계를 포함할 수 있다. Preferably, the calculating of the second variable value may include identifying the same technology classification patents having the same technology classification as that of the analysis target patent among the past patents applied before the application date of the analysis target patent, Calculating a difference in the filing date between the analysis target patent and each same technology classification patent, calculating a second variable value using the number of subclasses of the analysis target patent, the number of the same technology classification patent, and the difference in the filing date. It may include.

바람직하게는, 상기 제1 변수값 및 제2 변수값을 산출하는 단계 이후, 상기 분석 대상 특허의 기술분류의 개수, 후방 인용횟수, 청구항 수, 발명자 수 중 적어도 하나에 기초하여 기 정의된 변수들의 값을 산출하는 단계, 상기 제1 변수값, 제2 변수값, 기 정의된 변수들의 값 중 적어도 하나 이상을 로지스틱 회귀모형에 적용하여 상기 분석 대상 특허의 표준기술 여부 또는 기술이전 여부를 예측하는 단계를 더 포함할 수 있다. Preferably, after calculating the first variable value and the second variable value, based on at least one of the number of technical classification of the analysis target patent, the number of back citations, the number of claims, the number of inventors, Calculating a value, applying at least one or more of the first variable value, the second variable value, and values of predefined variables to a logistic regression model to predict whether the analysis target patent is standard technology or technology transfer It may further include.

바람직하게는, 상기 예측된 인용횟수, 표준기술특허 여부, 기술이전 여부 중 적어도 하나에 기초하여 상기 분석 대상 특허의 유망성을 판단하는 단계를 더 포함할 수 있다. Preferably, the method may further include determining a prospect of the analysis target patent based on at least one of the predicted number of citations, whether the standard technology patent is valid or whether the technology is transferred.

상기 기술적 과제를 해결하기 위한 본 발명의 다른 실시예에 따른 특허 유망성 예측 장치는 분석 대상 특허를 분석하여 서지정보를 획득하는 분석부, 상기 서지정보내 기술분류의 상세기술요소에 기초하여 제1 변수값 및 제2 변수값을 산출하고, 상기 산출된 제1 변수값 및 제2 변수값에 딥러닝을 적용하여 상기 분석 대상 특허에 대한 미래 시점의 인용횟수를 예측하는 인용횟수 예측부, 상기 분석 대상 특허의 기술분류의 개수, 후방 인용횟수, 청구항 수, 발명자 수 중 적어도 하나에 기초하여 기 정의된 변수들의 값을 산출하고, 상기 제1 변수값, 제2 변수값, 기 정의된 변수들의 값 중 적어도 하나 이상을 로지스틱 회귀모형에 적용하여 상기 분석 대상 특허의 표준기술 여부 또는 기술이전 여부를 예측하는 표준기술/기술이전 예측부, 상기 인용 횟수, 표준기술 여부, 기술이전 여부 중 적어도 하나에 기초하여 상기 분석 대상 특허에 대한 유망 특허 여부를 판단하는 유망성 예측부를 포함한다. According to another aspect of the present invention, there is provided an apparatus for predicting patent prospects for analyzing a patent to analyze analytical targets to obtain bibliographic information, and a first variable based on detailed technical elements of the technical classification in the bibliographic information. A citation count prediction unit configured to calculate a value and a second variable value, and predict a citation count of a future time point for the target patent by applying deep learning to the calculated first variable value and the second variable value, the analysis target Calculating the values of the predefined variables based on at least one of the number of technical classifications of the patent, the number of backward citations, the number of claims, and the number of inventors; and among the values of the first variable, the second variable, and the predefined variables Standard technology / technology transfer prediction unit for predicting whether the analysis target patent standard technology or technology transfer by applying at least one or more logistic regression model, the number of citations, Includes standard techniques or not, whether the previous technology based on at least one promise for determining whether the prospective patents for the target patent prediction of the unit.

본 발명에 따르면, 특허의 발명·기술적 특징을 확인할 수 있는 사전지표만을 이용하여 분석 대상 특허의 유망성을 쉽게 예측할 수 있다. According to the present invention, it is possible to easily predict the prospect of the analysis target patent by using only prior indicators that can confirm the invention and technical features of the patent.

또한, 분석 대상 특허의 기술분류를 이용하여 미래 시점의 피인용횟수, 표준특허 여부, 기술이전 여부를 예측할 수 있다. In addition, by using the technical classification of the analysis target patent, it is possible to predict the number of citations in the future, whether the standard patent and technology transfer.

본 발명의 효과들은 이상에서 언급한 효과들로 제한되지 않으며, 언급되지 않은 또 다른 효과들은 아래의 기재로부터 통상의 기술자에게 명확하게 이해될 수 있을 것이다.Effects of the present invention are not limited to the above-mentioned effects, and other effects not mentioned will be clearly understood by those skilled in the art from the following description.

도 1은 본 발명의 일실시예에 따른 특허 유망성 예측 방법을 설명하기 위한 도면이다.
도 2는 본 발명의 일 실시예에 따른 특허 유망성 예측 장치를 나타낸 블록도이다.
도 3은 IPC 코드를 설명하기 위한 도면이다.
도 4는 본 발명의 일 실시예에 따른 특허 유망성 예측 장치가 분석 대상 특허의 피인용 횟수를 예측하는 방법을 설명하기 위한 흐름도이다.
도 5는본 발명의 일 실시예에 따른 특허 유망성 예측 장치가 분석 대상 특허의 표준기술특허 여부 또는 기술이전 여부를 예측하는 방법을 설명하기 위한 흐름도이다. 1 is a view for explaining a patent prospect prediction method according to an embodiment of the present invention.
2 is a block diagram illustrating a patent prospective prediction apparatus according to an embodiment of the present invention.
3 is a diagram for explaining an IPC code.
4 is a flowchart illustrating a method of predicting the number of citations of an analysis target patent by a patent prospect prediction apparatus according to an embodiment of the present invention.
5 is a flowchart illustrating a method of predicting whether a patent prospective predicting device according to an embodiment of the present invention predicts whether an analysis target patent is a standard technology patent or a technology transfer.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세한 설명에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 유사한 구성요소에 대해 사용하였다.As the invention allows for various changes and numerous embodiments, particular embodiments will be illustrated in the drawings and described in detail in the written description. However, this is not intended to limit the present invention to specific embodiments, it should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention. In describing the drawings, similar reference numerals are used for similar elements.

제1, 제2, A, B 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다. 및/또는 이라는 용어는 복수의 관련된 기재된 항목들의 조합 또는 복수의 관련된 기재된 항목들 중의 어느 항목을 포함한다.Terms such as first, second, A, and B may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component. The term and / or includes a combination of a plurality of related items or any item of a plurality of related items.

어떤 구성요소가 다른 구성요소에 "연결되어" 있다거나 "접속되어" 있다고 언급된 때에는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해되어야 할 것이다. 반면에, 어떤 구성요소가 다른 구성요소에 "직접 연결되어" 있다거나 "직접 접속되어" 있다고 언급된 때에는, 중간에 다른 구성요소가 존재하지 않는 것으로 이해되어야 할 것이다.When a component is referred to as being "connected" or "connected" to another component, it may be directly connected to or connected to that other component, but it may be understood that other components may be present in between. Should be. On the other hand, when a component is said to be "directly connected" or "directly connected" to another component, it should be understood that there is no other component in between.

본 출원에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terminology used herein is for the purpose of describing particular example embodiments only and is not intended to be limiting of the present invention. Singular expressions include plural expressions unless the context clearly indicates otherwise. In this application, the terms "comprise" or "have" are intended to indicate that there is a feature, number, step, operation, component, part, or combination thereof described in the specification, and one or more other features. It is to be understood that the present invention does not exclude the possibility of the presence or the addition of numbers, steps, operations, components, components, or a combination thereof.

다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다.Unless defined otherwise, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art. Terms such as those defined in the commonly used dictionaries should be construed as having meanings consistent with the meanings in the context of the related art and shall not be construed in ideal or excessively formal meanings unless expressly defined in this application. Do not.

이하에서는 본 발명에 따른 바람직한 실시예를 첨부된 도면을 참조하여 상세하게 설명한다.Hereinafter, with reference to the accompanying drawings, preferred embodiments of the present invention will be described in detail.

도 1은 본 발명의 일실시예에 따른 특허 유망성 예측 방법을 설명하기 위한 도면이다. 1 is a view for explaining a patent prospect prediction method according to an embodiment of the present invention.

도 1을 참조하면, 특허 유망성 예측 장치(200)는 사용자 단말기(100)와 네트워크 망을 통해 연결될 수있다. 네트워크 망은 매체를 통해 통신을 수행하는 망을 말하는 것으로, 유선 통신망, 컴퓨터 네트워크 및 무선통신망 중 적어도 하나를 포함할 수 있다.Referring to FIG. 1, the patent prospective prediction apparatus 200 may be connected to the user terminal 100 through a network. The network network refers to a network that communicates through a medium, and may include at least one of a wired communication network, a computer network, and a wireless communication network.

특허 유망성 예측 장치(200)는 특허 DB(300)에 저장된 데이터를 추출할 수 있다. 특허 DB(300)는 특허청 데이터베이스 또는 특허청 데이터베이스와 연동되는 특정 데이터베이스를 포함할 수 있다.The patent prospective prediction apparatus 200 may extract data stored in the patent DB 300. The patent DB 300 may include a patent database or a specific database linked with the KIPO database.

특허 유망성 예측 장치(200)는 분석 대상 특허의 출원 또는 등록 시 발명·기술적 특징을 확인할 수 있는 지표로 그 특허의 유망성을 예측한다. The patent promising predicting device 200 predicts the promising of the patent as an index for confirming the invention and technical characteristics when the patent or the analysis target patent is applied or registered.

기술영향력이 높은 유망 특허는 비 유망특허에 비해 더 많은 피인용 횟수를 갖고 있고, 사회경제적 영향력이 높은 유망특허는 특허사용권 또는 소유권의 거래가 발생한다. 따라서, 유망 특허는 피인용 횟수가 많이 발생한 특허, 기술 표준 특허, 권리 이전 특허 중 적어도 하나일 수 있다. 먼저 특허의 전방인용 정보(즉, 피인용 정보)는 해당 특허가 후속 발명에 미친 영향력을 확인할 수 있는데, 전방인용이 다수 일어난 특허는 해당 기술영역에서 독창적이거나 어떤 기술기능을 구현함에 있어 핵심적인 역할을 수행하고 있는 것으로 이해할 수 있다. 또한, 특허의 판매와 라이선스 등을 포함한 특허 소유권 변동정보는 기술발명 자체가 실질적인 수익을 발생시켰다는 점에서 보다 직접적인 기술 유망성을 강조할 수 있다.Promising patents with high technical impacts have a higher number of citations than non-promising patents, and promising patents with high socioeconomic impacts generate patent licenses or proprietary rights transactions. Therefore, the promising patent may be at least one of a patent, a technology standard patent, and a right transfer patent, in which the number of citations is frequently generated. First, the citation information of a patent (ie, citation information) can confirm the influence of the patent on the subsequent invention. A patent with a large number of forward citations plays a key role in implementing an original or specific technical function in the technical field. It can be understood that you are performing. In addition, patent ownership change information, including the sale and licensing of patents, may highlight more direct technological promising in that the invention itself generates substantial profits.

이에, 특허 유망성 예측 장치(200)는 전방 인용 횟수, 기술표준, 기술이전 중 적어도 하나를 이용하여 특허의 유망성을 예측할 수 있다. 특허 유망성 판단을 위한 사전 지표는 분석 대상 특허의 출원 또는 등록 시 발명·기술적 특징을 확인할 수 있는 지표로, 발명특성지표와 기술특성지표를 포함한다. 발명특성지표는 재조합 신규성(Recombinant novelty)과 기술요소 친숙함(component familiarity) 등을 포함하고, 기술특성지표는 기술분류의 개수에 기초한 기술의 일반성(TG, technological generality), 후방 인용회수에 기초한 기술적 기반(TB, technological basis), 청구항 수에 기초한 기술발명의 보호범위(PS, protection scope), 기술발명자의 수(IC, inventor counts) 등을 포함할 수 있다.Thus, the patent prospect prediction apparatus 200 may predict the patent prospect using at least one of forward citation counts, technical standards, and technology transfers. Preliminary indicators for determining patent promising are indicators that can confirm invention and technical characteristics when applying or registering an analysis target patent and include invention characteristic indexes and technical characteristic indexes. Invention characteristic indicators include recombinant novelty and component familiarity, and the technical characteristic indicators are based on the technical generality (TG) based on the number of technical classifications and technical based on the number of backward citations. It may include a technical basis (TB), a protection scope (PS) based on the number of claims, the number of technology inventors (IC, inventor counts), and the like.

한편, 특허 유망성 예측 장치(200)는 단일의 연산 장치로 구현되거나 둘 이상의 연산 장치가 서로 연결된 집합 장치의 형태로 구현될 수도 있다. 예를 들어, 특허 유망성 예측 장치(200)는 단일의 서버로 구현되거나 둘 이상의 서버가 연결된 형태로 구현될 수 있다.Meanwhile, the patent prospective prediction apparatus 200 may be implemented as a single computing device or in the form of a collection device in which two or more computing devices are connected to each other. For example, the patent prospective prediction apparatus 200 may be implemented as a single server or in a form in which two or more servers are connected.

이러한 특허 유망성 예측 장치(200)에 대한 상세한 설명은 도 2를 참조하기로 한다. Detailed description of the patent prospective prediction apparatus 200 will be described with reference to FIG. 2.

사용자 단말기(100)는 특허 유망성 예측 장치(200)에 접속하여 사용자가 입력한 특허에 대한 특허 유망성에 대한 정보를 제공받을 수 있는 단말기로, PC, 스마트폰, 태블릿, 노트북, PDA, 네비게이션 등을 포함할 수 있다. 여기서, 특허 유망성에 대한 정보는 분석대상 특허의 인용 횟수 예측 정보, 표준 기술 여부, 기술 이전 여부 등을 포함할 수 있다. The user terminal 100 is a terminal that can be connected to the patent prospective prediction device 200 and receive information on the patent prospect for a patent input by a user. The user terminal 100 may be a PC, a smartphone, a tablet, a laptop, a PDA, a navigation device, or the like. It may include. Here, the information on the patent promising may include information on the number of citations of the analysis target patent, information on standard technology, technology transfer, and the like.

도 2는 본 발명의 일 실시예에 따른 특허 유망성 예측 장치를 나타낸 블록도, 도 3은 IPC 코드를 설명하기 위한 도면이다. FIG. 2 is a block diagram illustrating a patent prospective prediction apparatus according to an embodiment of the present invention, and FIG. 3 is a diagram for explaining an IPC code.

도 2를 참조하면, 본 발명의 일 실시예에 따른 특허 유망성 예측 장치(200)는 분석부(210), 인용횟수 예측부(220)를 포함한다.Referring to FIG. 2, the patent prospective prediction apparatus 200 according to an embodiment of the present invention includes an analysis unit 210 and a citation count prediction unit 220.

분석부(210)는 분석 대상 특허를 분석하여 서지정보를 획득한다. 이때, 분석 대상 특허가 특허 문서로 입력되면, 분석부(210)는 그 특허 문서를 텍스트 마이닝 등의 방법으로 분석하여 서지정보를 획득할 수 있다. 또한, 분석 대상 특허가 특허 관련 번호(예컨대, 출원번호 등)로 입력되면, 분석부(210)는 해당 특허청 DB 또는 사설기관에서 보유하는 특허 DB로부터 서지정보를 획득할 수 있다. 이때, 분석부(210)는 PatentsView API 등을 이용하여 서지정보를 획득할 수 있다. 서지정보는 출원번호, 출원일, 공개번호, 공개일, 출원인명, 기술분류, 전방인용횟수, 후방인용횟수, 청구항 수, 발명자 수 등이 포함될 수 있다. 기술분류는 특허문헌을 기술별로 분류해놓은 특허분류코드로, 예컨대, 국제특허분류(IPC, International Patent Classification)와 협력특허분류(CPC, Cooperative Patent Classification)같이 국제적으로 통용되는 분류시스템부터, 미국특허분류(USPC: United States Patent Classification), 유럽특허분류(ECLA: European Classification), 일본특허분류(F-term), 그리고 한국특허분류(KPC: Korean Patent Classification) 등과 같이 다양할 수 있다. 본 실시예에서는 IPC를 이용하는 경우에 대해 설명하기로 한다. IPC는 도 3에 도시된 바와 같이 섹션, 클래스, 서브 클래스, 메인 그룹, 서브 그룹으로 구성된다. 섹션은 특허가 속하는 최상위 기술분류이고, 클래스는 섹션의 하위 기술분류로서 해당 섹션에서의 세부기술 분야를 나타내는 기술 분류이며, 서브 클래스는 클래스의 하위 기술분류로서 해당 클래스를 용도, 목적, 재료 등의 관점에 따라 보다 세분화한 기술 분류이고, 메인 그룹은 서브클래스를 보다 세분화한 기술 분류로서 관점코드를 세부적으로 구분하여 설명하는 속성값을 가지고 있는 최하위기술분류이다. 섹션, 클래스, 서브 클래스, 메인 그룹, 서브 그룹은 순차적으로 계층구조를 가지게 되는데, 하나의 섹션은 복수의 클래스로 세분화되고, 복수의 클래스 각각은 복수의 서브클래스로 세분화되며, 복수의 서브클래스 각각은 복수의 메인 그룹으로 세분화되고, 복수의 메인그룹 각각은 복수의 서브그룹으로 세분화된다.The analysis unit 210 obtains bibliographic information by analyzing the analysis target patent. In this case, when the analysis target patent is input as a patent document, the analyzer 210 may analyze the patent document by text mining or the like to obtain bibliographic information. In addition, when the analysis target patent is input as a patent-related number (eg, application number, etc.), the analysis unit 210 may obtain bibliographic information from a patent DB held in the patent office DB or a private institution. At this time, the analysis unit 210 may obtain bibliographic information using the PatentsView API. Bibliographic information may include an application number, application date, publication number, publication date, applicant name, technical classification, number of forward citations, number of back citations, number of claims, number of inventors, and the like. The technology classification is a patent classification code that classifies patent documents by technology, for example, from internationally accepted classification systems such as International Patent Classification (IPC) and Cooperative Patent Classification (CPC), and US patent classification. (USPC: United States Patent Classification), European Patent Classification (ECLA), Japanese Patent Classification (F-term), and Korean Patent Classification (KPC). In this embodiment, a case of using IPC will be described. As illustrated in FIG. 3, the IPC is composed of a section, a class, a subclass, a main group, and a subgroup. A section is a top technical classification to which a patent belongs, and a class is a sub technical classification of a section, and a technical classification indicating a detailed technical field in the section. A sub class is a sub technical classification of a class. It is a more detailed technical classification according to the viewpoint, and the main group is a technical classification which is a more detailed subclass of the subclass, and has the lowest technical classification having attribute values that describe and describe the viewpoint codes in detail. Sections, classes, subclasses, main groups, and subgroups are sequentially hierarchical. One section is subdivided into a plurality of classes, and each of the plurality of classes is subdivided into a plurality of subclasses. Are subdivided into a plurality of main groups, and each of the plurality of main groups is subdivided into a plurality of subgroups.

인용횟수 예측부(220)는 분석부(210)에서 수집한 서지정보내 기술분류의 상세기술요소에 기초하여 제1 변수값 및 제2 변수값을 산출하고, 산출된 제1 변수값 및 제2 변수값에 딥러닝을 적용하여 분석 대상 특허에 대한 미래 시점의 피인용 횟수(전방인용 횟수)를 예측한다. 여기서, 기술분류는 섹션, 클래스, 서브 클래스, 메인 그룹, 서브 그룹 순으로 계층적으로 기술이 분류된 IPC일 수 있고, 상세기술요소는 서브클래스일 수 있다. 제1 변수와 제2 변수는 분석 대상 특허의 기술분류에 기초한 변수로, 제1 변수는 재조합 신규성(RN, recombinant novelty)일 수 있고, 제2 변수는 기술요소의 친숙함(CF, component familiarity)일 수 있다. 재조합 신규성은 과거 기술발명에서 등장하지 않았던 세부기술요소의 새로운 조합이 특허의 유망성에 영향력을 미칠 것이라는 가정 하에, IPC의 서브클래스 조합을 활용하여 제안된 지표일 수 있다. 기술요소의 친숙함(CF)은 분석 대상 특허의 기술요소가 과거에 얼마나 자주 그리고 얼마나 최근까지 사용되었는지를 확인하는 지표일 수 있다. The citation count prediction unit 220 calculates the first variable value and the second variable value based on the detailed description elements of the technical classification in the bibliographic information collected by the analysis unit 210, and calculates the calculated first variable value and the second variable value. Deep learning is applied to variable values to predict the number of citations (forward citations) at a future point in time for the target patent. Here, the technical classification may be an IPC in which descriptions are hierarchically classified into sections, classes, subclasses, main groups, and subgroups, and detailed description elements may be subclasses. The first variable and the second variable are variables based on the technical classification of the patent to be analyzed, the first variable may be recombinant novelty (RN), and the second variable is component familiarity (CF). Can be. Recombinant novelty may be a proposed indicator utilizing subclass combinations of IPC, assuming that new combinations of sub-technical elements that did not appear in the prior art invention will affect the promising of patents. The familiarity of the technology elements (CF) may be an indicator of how often and how recently the technology elements of the patent to be analyzed have been used in the past.

이러한, 인용횟수 예측부(220)는 제1 변수값 산출모듈(222), 제2 변수값 산출모듈(224), 피인용횟수 예측모듈(226)을 포함한다.The citation count predictor 220 includes a first variable value calculation module 222, a second variable value calculation module 224, and a citation count prediction module 226.

제1 변수값 산출모듈(222)은 분석 대상 특허의 서브클래스 조합쌍과 분석 대상 특허의 출원일 전에 출원된 과거 특허들의 서브클래스 조합쌍을 비교하여, 분석 대상 특허에만 있는 서브클래스 조합쌍과 분석 대상 특허의 전체 서브클래스 조합쌍의 수에 기초하여 제1 변수값을 산출한다.The first variable value calculating module 222 compares the subclass combination pair of the analysis target patent and the subclass combination pairs of past patents filed before the filing date of the analysis target patent, and compares the subclass combination pair existing only in the analysis target patent. The first variable value is calculated based on the total number of subclass combination pairs of the patent.

구체적으로, 제1 변수값 산출모듈(222)은 분석 대상 특허의 기술분류에서 서브클래스 조합쌍을 생성한다. 특허 출원시 하나 이상의 기술분류를 선택하므로, 분석 대상 특허에는 하나 이상의 기술분류를 포함할 수 있다. 따라서, 제1 변수값 산출모듈(222)은 IPC코드의 서브클래스들을 조합하여 하나 이상의 서브클래스 조합쌍을 생성할 수 있다. 예를 들어, 분석 대상 특허의 기술분류가 A01G 5/06, A01F 4/06, A01C 2/04인 경우, 서브 클래스는 G, F, C로 3개임을 알 수 있다. 따라서, 제1 변수값 산출모듈(222)은 서브클래스 조합쌍인 A01G-A01G, A01G-A01F, A01G-A01C, A01F-A01F, A01F-A01C, A01C-A01C 즉, 3개 기술분류 중 2개씩 묶은 쌍을 생성한다. 그러면, 제1 변수값 산출모듈(222)은 3P2 = 3! = 3*2 = 6 개의 서브클래스 조합쌍을 생성할 수 있다. Specifically, the first variable value calculation module 222 generates a subclass combination pair in the technical classification of the patent to be analyzed. Since one or more technical classifications are selected at the time of patent application, the analysis target patent may include one or more technical classifications. Accordingly, the first variable value calculation module 222 may generate one or more subclass combination pairs by combining the subclasses of the IPC code. For example, if the technical classification of the patent to be analyzed is A01G 5/06, A01F 4/06, and A01C 2/04, it can be seen that three subclasses are G, F, and C. Accordingly, the first variable value calculating module 222 is a subclass combination pair A01G-A01G, A01G-A01F, A01G-A01C, A01F-A01F, A01F-A01C, A01C-A01C. Create a pair. Then, the first variable value calculating module 222 is 3P2 = 3! = 3 * 2 = You can create 6 subclass combination pairs.

그런 후, 제1 변수값 산출모듈(222)은 분석 대상 특허의 출원일을 확인하고, 그 출원일 전에 출원된 과거 특허들을 특허청 DB 또는 사설기관에서 보유하는 특허 DB로부터 수집하고, 수집된 각 과거 특허들에 대해 서브클래스 조합쌍을 생성한다. 이때, 제1 변수값 산출모듈(222)은 분석 대상 특허의 섹션과 클래스가 동일한 과거 특허들을 수집하고, 수집된 각 과거 특허들의 서브클래스 조합쌍을 생성할 수 있다. Thereafter, the first variable value calculation module 222 confirms the filing date of the patent to be analyzed, collects past patents filed before the filing date from the patent DB held by the KIPO or a private organization, and collects each of the past patents. Create a subclass collation pair for. In this case, the first variable value calculation module 222 may collect past patents having the same class and the same class as the analysis target patent, and generate a subclass combination pair of the collected past patents.

그런 후, 제1 변수값 산출모듈(222)은 분석 대상 특허의 서브클래스 조합쌍과 과거 특허들의 서브클래스 조합쌍을 비교하여, 분석 대상 특허에만 있는 서브클래스 조합쌍을 확인하고, 분석 대상 특허에만 있는 서브클래스 조합쌍과 분석 대상 특허의 전체 서브클래스 조합쌍의 수에 기초하여 제1 변수값을 산출한다. 과거 특허들의 서브클래스 조합과 분석 대상 특허의 서브클래스 조합을 비교해 새롭게 등장하는 서브클래스 조합이 있을 경우, 그 분석 대상 특허는 재조합 신규성을 가진 것으로 할 수 있다. 따라서, 제1 변수값 산출모듈(222)은 분석 대상 특허의 전체 서브클래스 조합쌍의 수 대비 과거 특허들에는 관측되지 않았지만 분석 대상 특허에서 새롭게 관측된 서브클래스 조합쌍의 수를 나누어 0이상 1이하의 값을 가지는 제1 변수값(RN점수)를 산출할 수 있다. 즉, 제1 변수값 산출모듈(222)은 아래 수학식 1을 이용하여 제1 변수값(재조합 신규성(

))을 산출할 수 있다.Thereafter, the first variable value calculating module 222 compares the subclass combination pair of the analysis target patent and the subclass combination pair of the past patents to identify the subclass combination pair only in the analysis target patent, A first variable value is calculated based on the number of subclass combination pairs present and the total number of subclass combination pairs of the patent to be analyzed. When there is a newly introduced subclass combination by comparing subclass combinations of past patents and subclass combinations of the analysis target patent, the analysis target patent can be regarded as having recombination novelty. Accordingly, the first variable value calculating module 222 divides the number of newly-substantiated subclass combination pairs that are not observed in the past patents compared to the total number of subclass combination pairs of the patent to be analyzed, and divides the number of newly observed subclass combination pairs from the patent to be analyzed. A first variable value (RN score) having a value of may be calculated. That is, the first variable value calculating module 222 uses a first variable value (recombinant novelty) using Equation 1 below.

)) Can be calculated.

[수학식 1][Equation 1]

여기서, k는 분석 대상 특허이고,

는 과거 특허에서 등장한 적 없는 특허 k의 기술분류 서브클래스 조합의 수,

는 특허 k의 모든 기술분류 서브클래스 조합의 수를 의미한다. Where k is the patent to be analyzed,

Is the number of combinations of technical classification subclasses of patent k that have never appeared in past patents,

Denotes the number of combinations of all technical classification subclasses of patent k.

제2 변수값 산출모듈(224)은 분석 대상 특허와 동일한 기술분류를 가지고 있는 과거 특허의 출원년도와 분석 대상 특허의 출원년도 차이에 기초하여 제2 변수값을 산출한다. The second variable value calculation module 224 calculates the second variable value based on the difference between the filing year of the previous patent and the filing year of the analysis target patent having the same technical classification as the analysis target patent.

구체적으로, 제2 변수값 산출모듈(224)은 분석 대상 특허의 출원일 전에 출원된 과거 특허들 중에서 분석 대상 특허의 기술분류와 동일한 기술분류를 가지는 동일 기술분류 특허들을 확인한다. 분석 대상 특허는 하나 이상의 기술분류 코드를 포함하므로, 제2 변수값 산출모듈(224)은 분석 대상 특허의 기술분류 코드 중에서 적어도 하나 이상의 기술분류코드가 동일한 과거 특허들을 추출한다. 이때, 제2 변수값 산출모듈(224)은 분석 대상 특허의 기술분류와 완전히 일치하는 과거 특허들을 추출할 수도 있지만, 섹션, 클래스 및 서브클래스까지만 동일한 과거 특허들을 추출할 수도 있다.Specifically, the second variable value calculation module 224 identifies the same technology classification patents having the same technology classification as that of the analysis target patent among the past patents applied before the application date of the analysis target patent. Since the analysis target patent includes one or more technology classification codes, the second variable value calculation module 224 extracts past patents having the same technology classification code of at least one of the technology classification codes of the analysis target patent. In this case, the second variable value calculation module 224 may extract the past patents that completely match the technical classification of the patent to be analyzed, but may also extract the same past patents up to the section, class, and subclass.

그런 후, 제2 변수값 산출모듈(224)은 분석 대상 특허와 각 동일 기술분류 특허간의 출원일 차이를 산출한다. 이때, 제2 변수값 산출모듈(224)은 출원년도의 차이, 출원년도 및 출원월의 차이, 출원년도 및 출원 월/일의 차이 등을 출원일 차이로 산출할 수 있다. 그 후, 제2 변수값 산출모듈(224)은 분석 대상 특허의 서브클래스 수, 동일 기술분류 특허의 수, 출원일 차이를 이용하여 제2 변수값을 산출할 수 있다. 즉, 제2 변수값 산출모듈(224)은 아래 수학식 2를 이용하여 분석 대상 특허(k)의 제2 변수값(기술요소 친숙함(CF_k))을 산출할 수 있다. 이는 특정 기술영역의 지식이 매년 일정한 수준만큼 손실된다는 가정 하에 분석 대상 특허와 동일한 기술분류를 가지고 있는 과거 특허의 출원년도와 분석 대상 특허의 출원년도 차이에 따라 점수를 달리 부여한다.Thereafter, the second variable value calculation module 224 calculates the difference in the filing date between the analysis target patent and the same technology classification patent. In this case, the second variable value calculation module 224 may calculate the difference between the filing year, the difference between the filing year and the filing month, and the difference between the filing year and the filing month / day as the filing date difference. Thereafter, the second variable value calculation module 224 may calculate the second variable value using the number of subclasses of the analysis target patent, the number of the same technology classification patent, and the filing date. That is, the second variable value calculating module 224 may calculate the second variable value (technical element familiarity CF _k ) of the analysis target patent k using Equation 2 below. It assigns different scores according to the difference between the filing year of the previous patent and the filing year of the analysis target patent, which have the same technology classification as the analysis target patent, on the assumption that the knowledge of a specific technical area is lost by a certain level every year.

[수학식 2][Equation 2]

여기서, m은 특허 k의 모든 기술분류 서브 클래스 수,

는 기술분류 서브클래스 j를 가진 모든 과거 특허 i의 수,

은 특허 k의 출원년도와 과거 특허 i의 출원년도 차이,

는 지식 손실 시간 상수를 의미한다. Where m is the number of all technical classification subclasses of patent k,

Is the number of all past patents i with technical subclass j,

Is the difference between the filing year of patent k and the filing year of patent i,

Means the knowledge loss time constant.

피인용횟수 예측모듈(226)은 제1 변수값 산출모듈(222)에서 산출된 제1 변수값과 제2 변수값 산출모듈(224)에서 산출된 제2 변수값을 투입변수로하는 알고리즘에 적용하여 분석 대상 특허에 대한 미래 시점의 피인용횟수를 예측할 수 있다. 이때, 피인용횟수 예측모듈(226)은 딥 러닝(deep learning), 인용시차분포(CLD, citation-lag distribution), 중도절단회귀분석(censored regression) 등의 알고리즘을 이용할 수 있다. 바람직하게는 딥러닝을 이용할 수 있고, 딥 러닝 알고리즘 중 전방 신경회로망(feedforward neural network, FFNN)과 함께 인과관계 확인을 위한 비선형 회귀모형(non-linear regression model)을 이용할 수 있다. 즉, 피인용횟수 예측모듈(226)은 전방 신경회로망 기반 회귀분석을 이용할 수 있다. 딥 러닝은 분석자가 지정하는 모형세팅이 성능에 주효한 영향을 미치기 때문에 그리드 탐색(grid search)을 통해 적절한 하이퍼파라미터(hyperparameter) 튜닝을 수행한다. 또한, FFNN의 과적합(overfitting)을 방지하기 위기 정규화를 수행하고, 이러한 정규화기법으로는 L1/L2 정규화로 손실함수(loss function)에 패널티를 부여하는 방법, 뉴런의 연결을 랜덤하게 끊는 드롭아웃(dropout) 방법, 그리고 학습속도 담금질(learning rate annealing) 등을 포함할 수 있다. The citation count prediction module 226 is applied to an algorithm that uses the first variable value calculated by the first variable value calculation module 222 and the second variable value calculated by the second variable value calculation module 224 as input variables. Thus, the number of citations in the future for the target patent can be predicted. At this time, the citation frequency prediction module 226 may use algorithms such as deep learning, citation-lag distribution (CLD), and censored regression. Preferably, deep learning may be used, and a non-linear regression model for causality check may be used along with a forward neural network (FFNN) of deep learning algorithms. That is, the citation count prediction module 226 may use a forward neural network based regression analysis. Deep learning performs proper hyperparameter tuning through grid search because the model settings specified by the analyst have a significant impact on performance. In addition, crisis normalization is performed to prevent overfitting of FFNN, and such normalization method is a method of assigning a penalty to a loss function by L1 / L2 normalization, and dropout of randomly disconnecting neurons. dropout methods, and learning rate annealing.

상술한 바와 같이 인용횟수 예측부(220)는 분석 대상 특허의 발명적 특성을 반영한 지표인 RN과 CF만을 이용하여 분석 대상 특허의 피인용 횟수를 예측할 수 있다.As described above, the citation count prediction unit 220 may predict the number of citations of the analysis target patent using only RN and CF which are indicators reflecting the inventive characteristics of the analysis target patent.

또한, 인용횟수 예측부(220)는 분석 대상 특허의 출원시점 또는 현재시점의 RN과 CF는 물론, 사후 정보를 동시에 활용하여 피인용 횟수를 예측할 수도 있다. In addition, the citation count predictor 220 may predict the number of citations by simultaneously utilizing the post-information as well as the RN and CF of the application point or current point of the analysis target patent.

한편, 특허 유망성 예측 장치(200)는 인용횟수 예측부(220)에서 예측된 피인용횟수에 기초하여 분석 대상 특허의 유망성을 예측하는 유망성 예측부(230)를 더 포함할 수 있다. 유망성 예측부(230)는 분석 대상 특허의 피인용횟수가 기 설정된 기준 횟수 이상인 경우 유망성이 있다고 판단할 수 있다. 이때, 유망성 예측부(230)는 유망성이 있다고 판단된 특허를 돌팔구 특허로 도출할 수 있다. On the other hand, the patent prospect prediction apparatus 200 may further include a prospect prediction unit 230 for predicting the prospect of the analysis target patent based on the number of citations predicted by the citation count prediction unit 220. The promising predictor 230 may determine that the promising number is promising when the number of times of citation of the analysis target patent is greater than or equal to a predetermined reference number. In this case, the prospective predictor 230 may derive the patent determined to be promising as a patent.

또한, 특허 유망성 예측 장치(200)는 분석 대상 특허의 표준기술 여부를 예측하는 표준기술 예측부(240)를 더 포함할 수 있다. In addition, the patent prospective prediction apparatus 200 may further include a standard technology predictor 240 that predicts whether or not the standard technology of the analysis target patent.

표준기술 예측부(240)는 분석 대상 특허의 제1 변수값과 제2 변수값 외에, 기 설정된 변수들의 값을 산출하고, 산출된 변수들의 값 중 적어도 하나 이상을 로지스틱 회귀모형에 적용하여 분석대상 특허의 표준기술 여부를 예측할 수 있다. 여기서, 기 설정된 변수는 분석대상특허의 기술특성지표에 따른 변수로, 기술분류의 개수에 기초한 기술의 일반성(TG, technological generality), 후방 인용회수에 기초한 기술적 기반(TB, technological basis), 청구항 수에 기초한 기술발명의 보호범위(PS, protection scope), 기술발명자의 수(IC, inventor counts) 등을 포함할 수 있다.The standard technology prediction unit 240 calculates values of preset variables in addition to the first variable value and the second variable value of the analysis target patent, and applies at least one or more of the calculated values of the variable to the logistic regression model. Predict whether a patent is a standard technology. Here, the predetermined variable is a variable according to the technical characteristic index of the patent to be analyzed, the technical generality (TG) based on the number of technical classifications, the technical basis (TB, technical basis) based on the number of backward citations, the number of claims Protection scope (PS) based on the invention, the number of technical inventors (IC, inventor counts) and the like.

이러한 표준기술 예측부(240)는 표준기술 관련 변수값 산출모듈(242), 표준기술 예측모듈(244)을 포함한다. The standard technology prediction unit 240 includes a standard technology related variable value calculation module 242 and a standard technology prediction module 244.

표준기술 관련 변수값 산출모듈(242)은 제1 변수값과 제2 변수값 외에, 기술의 일반성(TG), 기술적 기반(TB), 기술발명의 보호범위(PS), 기술발명자의 수(IC)의 변수 값을 각각 산출한다.In addition to the first variable value and the second variable value, the standard technology-related variable value calculation module 242 includes, in addition to the first variable value and the second variable value, the generality of the technology (TG), the technology basis (TB), the protection scope (PS) of the technology invention, and the number of technology inventors (IC). Each variable value of) is calculated.

제1 변수에 해당하는 재조합 신규성은 기술진보와 긴밀한 연관관계가 있기 때문에, 신규성이 높은 특허는 기술표준에 유의미한 영향을 미칠 수 있다. 또한, 기술요소의 친숙함이 높은 특허는 경로 의존적인 기술발명일 가능성이 높기 때문에 양립성(compatibility)이 중요한 기술표준에 유의미한 영향을 미칠 수 있다. 따라서, 표준기술 여부 판단 시, 재조합 신규성과 기술요소의 친숙함을 변수로 이용할 수 있다. 재조합 신규성에 해당하는 제1 변수값과 기술요소의 친숙함에 해당하는 제2 변수값을 산출하는 방법은 수학식 1 및 수학식 2와 동일하므로 그 설명은 생략하기로 한다. Since the novelty of recombination corresponding to the first variable is closely related to technological progress, a highly novel patent may have a significant impact on technical standards. In addition, patents with high familiarity of technical elements are likely to be path dependent technology inventions, and therefore can have a significant impact on technical standards where compatibility is important. Therefore, when determining whether the standard technology, recombination novelty and familiarity of the technology elements can be used as variables. Since the method of calculating the second variable value corresponding to the familiarity between the first variable value corresponding to the recombinant novelty and the technical element is the same as in Equation 1 and Equation 2, description thereof will be omitted.

기술의 일반성은 기술분류의 개수로 정의할 수 있고, 특허에 할당된 기술분류의 수가 많을수록 다양한 기술에 적용될 수 있기 때문에 기술분류의 개수는 기술지식의 일반성을 의미한다고 볼 수 있다. 이러한 관점에서 IPC 개수를 특허 영향력 예측을 위한 지표로 활용할 수 있다. 따라서, 기술 일반성(

)은 아래 기재된 수학식 3을 이용하여 산출할 수 있다. The generality of a technology can be defined as the number of technical classifications, and since the number of technical classifications assigned to a patent can be applied to various technologies, the number of technical classifications can be regarded as the generality of technical knowledge. In this regard, the number of IPCs can be used as an indicator for predicting patent impact. Therefore, technical generality (

) Can be calculated using Equation 3 described below.

[수학식 3][Equation 3]

여기서,

는 분석 대상 특허 k의 기술분류일 수 있다. here,

May be the technical classification of the target patent k.

기술적 기반은 발명의 기술적 기초를 확인하기 위해서 특허 후방인용(backward citation)의 개수를 이용한 변수일 수 있다. 특허의 후방인용 수를 발명의 기술적 기반을 나타내는 지표로 간주하지만, 기술발명의 지엽적 탐색(local search) 관점에서 국내기술기반(DTB, domestic technological basis)과 해외기술기반(FTB, foreign technological basis)으로 분리될 수 있다. The technical basis may be a variable using the number of patent back citations to identify the technical basis of the invention. Although the number of citations of patents is regarded as an indicator of the technical basis of the invention, it can be regarded as a domestic technological basis (DTB) and a foreign technological basis (FTB) in terms of local search. Can be separated.

먼저, 국내기반기술 변수값(

)은 아래 수학식 4를 이용하여 산출할 수 있고, 해외기술기반 변수값(

)은 아래 수학식 5를 이용하여 산출할 수 있다. First, the variable value of domestic technology

) Can be calculated using Equation 4 below.

) Can be calculated using Equation 5 below.

[수학식 4][Equation 4]

여기서,

는 특허 k의 국내특허인용 수를 의미한다. here,

Denotes the number of domestic patent citations of patent k.

[수학식 5][Equation 5]

여기서,

는 특허 k의 해외특허인용 수를 의미한다.here,

Denotes the number of foreign patent citations of patent k.

기술발명의 보호범위(PS, protection scope)는 특허의 청구항 수를 이용하여 산출할 수 있다. 특허의 법적 권리는 기술발명의 청구항을 통해 판단되므로, 특허의 청구항 수는 기술발명의 보호범위를 확인할 수 있는 지표로 자주 활용된다. 일반적으로 보호범위가 넓을 경우 침해소송에 휘말릴 가능성이 함께 높아짐에도 불구하고, 실제로는 비용 때문에 대부분 기술적 필요성이 높을 경우만 소송이 발생할 뿐만 아니라, 본래 특허권자가 승소하는 경우의 특허 청구항 수는 패소하는 경우에 비해 통계적으로 유의미하게 많다. 또한 특허의 청구항 각각이 구분되고 독립된 발명으로 간주될 수 있다는 관점에서, 특허의 청구항 수가 기술적 역량을 평가하는데 잠재력을 가지고 있다. 따라서 기술발명의 보호범위는 특허의 영향력을 평가할 수 있는 잠재력이 있는 것으로 판단되므로, 특허의 청구항 수를 기술발명의 보호범위를 대변하는 지표로 활용할 수 있다. 기술발명의 보호범위 변수값(

)는 아래 수학식 6을 이용하여 산출할 수 있다. The protection scope (PS) of the present invention can be calculated using the number of claims of the patent. Since the legal rights of a patent are determined through the claims of the technical invention, the number of claims of the patent is frequently used as an indicator to confirm the protection scope of the technical invention. In general, even if the scope of protection is increased, the possibility of infringement litigation increases, but in practice, litigation only occurs when technical necessity is high due to cost, and the number of patent claims that the original patent holder wins is lost. Statistically significant compared to. Also, in view of the fact that each claim of the patent can be regarded as a separate and independent invention, the number of claims of the patent has the potential to evaluate the technical capability. Therefore, since the protection scope of the technical invention is considered to have the potential to evaluate the influence of the patent, the number of claims of the patent can be used as an indicator representing the protection scope of the technical invention. Protection range variable value

) Can be calculated using Equation 6 below.

[수학식 6][Equation 6]

여기서,

는 특허 k의 청구항 수를 의미할 수 있다. here,

May refer to the number of claims in patent k.

기술발명자의 수(IC, inventor counts)는 특허 발명자의 수를 특허 영향력을 예측하기 위한 사전지표 중 하나로 활용할 수 있다. 많은 수의 발명자 풀은 각각의 다양한 경험과 지식 그리고 창의력을 한데 모을 수 있기 때문에 기술발명에 있어 큰 이점을 기대할 수 있다. 이러한 관점에서 다수의 발명자가 개발하는 기술발명은 창조적 다양성(creative diversity)이 높아 지식창출에 영향을 미칠 뿐만 아니라 미래 전방인용의 발생에 있어 차이를 발생시킨다고 주장하였다. 또한 연구개발 측면에서 많은 수의 발명자가 하나의 특허에 포함된다는 것은 해당 기술발명에 투자되는 기업의 노력이 크고 관심이 많다는 것을 대변할 수 있기 때문에 많은 발명자가 관여된 특허의 영향력은 클 것으로 추측할 수 있다. 이러한 가정 하에 특허의 가치와 특허 발명자의 수는 유의미한 관계가 있다. 따라서 발명자의 수를 특허 영향력을 예측하기 위한 사전지표 중 하나로 활용할 수 있고, 기술 발명자의 수(

)는 아래 기재된 수학식 7을 이용하여 산출할 수 있다. The inventor counts (ICs) can be used as one of the prior indicators for predicting patent impact. A large number of inventor pools can bring together a variety of experiences, knowledge, and creativity, so you can expect great benefits in technology invention. From this point of view, the technical invention developed by many inventors argued that high creative diversity not only affects knowledge creation but also makes a difference in the generation of future forward citations. In addition, in the aspect of research and development, the inclusion of a large number of inventors in a single patent may represent that the efforts of the companies invested in the technology invention are large and interesting. Can be. Under this assumption, there is a significant relationship between the value of a patent and the number of patent inventors. Therefore, the number of inventors can be utilized as one of the prior indicators for predicting patent influence, and the number of technical inventors (

) Can be calculated using Equation 7 described below.

[수학식 7][Equation 7]

는 특허 k의 발명자를 의미한다.

Means inventor of patent k.

상술한 바와 같이, 표준기술 관련 변수값 산출모듈(242)은 분석 대상 특허의 발명특성지표에 따른 변수값과 기술특성지표에 따른 변수값을 각각 산출한다. As described above, the standard technology-related variable value calculation module 242 calculates the variable value according to the invention characteristic index of the patent to be analyzed and the variable value according to the technical characteristic index, respectively.

표준기술 예측모듈(244)은 표준기술 관련 변수값 산출모듈(242)에서 산출된 변수들의 값 중 적어도 하나 이상을 로지스틱 회귀모형에 적용하여 분석 대상 특허의 표준기술 여부를 예측할 수 있다. 즉, 표준기술 예측모듈(224)은 아래 수학식 8을 이용하여 분석 대상 특허의 표준기술 여부를 예측할 수 있다.The standard technology prediction module 244 may predict whether the analysis target patent is a standard technology by applying at least one or more of the values of the variables calculated by the standard technology related variable value calculation module 242 to the logistic regression model. That is, the standard technology prediction module 224 may predict whether the analysis target patent is the standard technology by using Equation 8 below.

[수학식 8][Equation 8]

여기서,

는 i번째 종속변수,

은 이분변수 Y가 발생할 확률(표준기술 여부),

는 상수,

는 i번째 독립변수에 대한 전체 계수 벡터,

는 i번째 독립변수 벡터(표준기술 관련 변수, 즉, 7개의 변수를 의미함)를 의미한다. 계수(

)는 데이터의 독립변인 값이 들어갔을 때 전체 데이터(즉 모형) 측면에서 가장 오차 값이 적은 계수 값으로 결정된다. 따라서, 회귀분석에서는 비용함수(cost function)을 통해 이를 최소화하는 방향으로 계수값을 산출하는데, 예컨대, 경사하강법(gradient descent) 등을 이용할 수 있다. here,

Is the i dependent variable,

Is the probability of occurrence of the binary variable Y (standard or not),

Is a constant,

Is the full coefficient vector for the i independent variable,

Denotes the i-th independent variable vector (meaning seven standard variables). Coefficient(

) Is determined as the coefficient value with the least error in terms of the overall data (ie model) when the independent variable value of the data is entered. Therefore, in the regression analysis, the coefficient value is calculated in a direction to minimize the cost through a cost function, for example, a gradient descent method may be used.

표준기술 예측모듈(244)은 독립변수인 변수들의 값에 각각 산출된 계수값을 곱연산하여 분석 대상 특허의 표준기술 여부를 예측할 수 있다. 여기서는 회귀분석을 이용하여 표준기술 여부를 예측하는 것으로 설명하였으나, 표준기술 예측모듈(224)는 딥러닝 등의 다양한 알고리즘을 이용하여 분석대상 특허의 표준기술 여부를 예측할 수 있다. The standard technology prediction module 244 may predict whether the analysis target patent is standard technology by multiplying the calculated coefficient values by the values of the independent variables. Here, the regression analysis is used to predict whether the standard technology is used. However, the standard technology prediction module 224 may predict whether the standard technology of the target patent is analyzed using various algorithms such as deep learning.

표준기술 예측부(230)에서 분석대상 특허가 표준기술로 예측된 경우, 유망성 예측부(230)는 분석 대상 특허를 유망 특허로 판단할 수 있다. When the analysis target patent is predicted as the standard technology by the standard technology prediction unit 230, the prospective prediction unit 230 may determine the analysis target patent as the promising patent.

또한, 특허 유망성 예측 장치(200)는 분석 대상 특허의 기술이전 여부를 예측하는 기술이전 예측부(250)를 더 포함할 수 있다. 기술이전 예측부(250)는 분석 대상 특허의 제1 변수값과 제2 변수값 외에, 기 설정된 변수들의 값을 산출하고, 산출된 변수들의 값 중 적어도 하나 이상을 로지스틱 회귀모형에 적용하여 분석대상 특허의 기술이전 여부를 예측할 수 있다. In addition, the patent prospective prediction apparatus 200 may further include a technology transfer predictor 250 predicting whether the analysis target patent transfers technology. The technology transfer prediction unit 250 calculates values of preset variables in addition to the first variable value and the second variable value of the analysis target patent, and applies at least one or more of the calculated variables to the logistic regression model. Predict whether or not a patent transfers technology.

이러한 기술이전 예측부(250)는 기술이전 관련 변수값 산출모듈(252), 표준기술 예측모듈(254)을 포함하고, 기술이전 예측부(250)가 기술이전 여부를 예측하는 방법은 표준기술 예측부(240)의 동작과 동일하므로 그 상세한 설명은 생략하기로 한다. The technology transfer prediction unit 250 includes a technology transfer related variable value calculation module 252 and a standard technology prediction module 254. The method of the technology transfer prediction unit 250 predicting technology transfer is a standard technology prediction. Since the operation of the unit 240 is the same, a detailed description thereof will be omitted.

기술이전 예측부(230)에서 분석대상 특허가 기술이전 특허로 예측된 경우, 유망성 예측부(230)는 분석 대상 특허를 유망 특허로 판단할 수 있다.When the analysis target patent is predicted as the technology transfer patent by the technology transfer predicting unit 230, the prospective predicting unit 230 may determine the analysis target patent as the promising patent.

유망성 예측부(230)는 분석대상 특허의 피인용 횟수, 표준기술 여부, 기술이전 여부 중 적어도 하나에 기초하여 분석대상 특허를 유망 특허로 판단할 수 있다. 유망 특허는 전방인용이 많이 일어난 특허, 표준기술특허, 그리고 기술이전이 발생한 특허 등을 의미하므로, 유망성 예측부(230)는 분석대상 특허의 피인용 횟수, 표준기술 여부, 기술이전 여부 등을 이용하여 분석대상 특허의 유망 특허 여부를 판단할 수 있다. The promising prediction unit 230 may determine the analysis target patent as a promising patent based on at least one of the number of citations of the analysis target patent, whether the standard technology is present or whether the technology is transferred. Promising patents mean patents with many forward citations, standard technology patents, and patents with technology transfers, and thus, the prospective prediction unit 230 uses the number of citations of the analysis target patent, whether it is a standard technology, whether a technology is transferred, etc. It is possible to determine whether the patent to be analyzed is a promising patent.

또한, 특허 유망성 예측 장치(200)는 특허의 발명·기술적 특징을 확인할 수 있는 사전지표를 이용하여 분석 대상 특허의 분쟁 여부 등을 예측할 수도 있다. In addition, the patent prospective prediction apparatus 200 may predict whether or not there is a dispute of an analysis target patent by using an advance index that can confirm the invention and technical features of the patent.

한편, 분석부(210), 인용횟수 예측부(220), 표준기술 예측부(230), 기술이전 예측부(240) 각각은 컴퓨팅 장치상에서 프로그램을 실행하기 위해 필요한 프로세서 등에 의해 각각 구현될 수 있다. 이처럼 분석부(210), 인용횟수 예측부(220), 표준기술 예측부(230), 기술이전 예측부(240) 각각은 물리적으로 독립된 각각의 구성에 의해 구현될 수도 있고, 하나의 프로세서 내에서 기능적으로 구분되는 형태로 구현될 수도 있다. Meanwhile, each of the analyzer 210, the citation count predictor 220, the standard technology predictor 230, and the technology transfer predictor 240 may be implemented by a processor required to execute a program on a computing device. . As such, each of the analysis unit 210, the citation count prediction unit 220, the standard technology prediction unit 230, and the technology transfer prediction unit 240 may be implemented by physically independent components, or within one processor. It may be implemented in a functionally distinct form.

제어부(260)는 분석부(210), 인용횟수 예측부(220), 표준기술 예측부(230), 기술이전 예측부(240)를 포함하는 특허 유망성 예측 장치(300)의 다양한 구성부들의 동작을 제어하는 구성이다. 이러한 제어부(260)는 적어도 하나의 연산 장치를 포함할 수 있는데, 여기서 상기 연산 장치는 범용적인 중앙연산장치(CPU), 특정 목적에 적합하게 구현된 프로그래머블 디바이스 소자(CPLD, FPGA), 주문형 반도체 연산장치(ASIC) 또는 마이크로 컨트롤러 칩일 수 있다.The controller 260 operates operations of various components of the patent prospect prediction apparatus 300 including an analyzer 210, a citation count predictor 220, a standard technology predictor 230, and a technology transfer predictor 240. To control the configuration. The controller 260 may include at least one arithmetic unit, wherein the arithmetic unit is a general purpose central arithmetic unit (CPU), programmable device elements (CPLDs, FPGAs), and on-demand semiconductor arithmetics suited for specific purposes. It may be an apparatus (ASIC) or a microcontroller chip.

도 4는 본 발명의 일 실시예에 따른 특허 유망성 예측 장치가 분석 대상 특허의 피인용 횟수를 예측하는 방법을 설명하기 위한 흐름도이다. 4 is a flowchart illustrating a method of predicting the number of citations of an analysis target patent by a patent prospect prediction apparatus according to an embodiment of the present invention.

도 4를 참조하면, 특허 유망성 예측 장치는 분석 대상 특허를 분석하여 서지정보를 획득한다(S410). 즉, 특허 유망성 예측 장치는 분석 대상 특허를 분석하여 출원번호, 출원일, 공개번호, 공개일, 출원인명, 기술분류, 전방인용횟수, 후방인용횟수, 청구항 수, 발명자 수 등을 포함하는 서지정보를 획득한다. Referring to FIG. 4, the patent prospect prediction apparatus analyzes an analysis target patent to obtain bibliographic information (S410). That is, the patent promising prediction device analyzes an analysis target patent to obtain bibliographic information including an application number, an application date, a publication number, a publication date, an applicant's name, a technology classification, a number of forward citations, a number of back citations, a number of claims, and a number of inventors. Acquire.

단계 S410의 수행 후, 특허 유망성 예측 장치는 서지정보내 기술분류의 서브클래스에 기초하여 제1 변수값을 산출한다(S420). 즉, 특허 유망성 예측 장치는 분석 대상 특허의 서브클래스 조합쌍과 분석 대상 특허의 출원일 전에 출원된 과거 특허들의 서브클래스 조합쌍을 비교하여, 분석 대상 특허에만 있는 서브클래스 조합쌍과 분석 대상 특허의 전체 서브클래스 조합쌍의 수에 기초하여 제1 변수값을 산출한다.After performing step S410, the patent prospective prediction apparatus calculates a first variable value based on the subclass of the technical classification in the bibliographic information (S420). That is, the patent prospect prediction apparatus compares a subclass combination pair of an analysis target patent and a subclass combination pair of past patents filed before the filing date of the analysis target patent, and compares the entire subclass combination pair and only the analysis target patent. A first variable value is calculated based on the number of subclass combination pairs.

단계 S420이 수행되면, 특허 유망성 예측 장치는 분석 대상 특허와 동일한 기술분류를 가지고 있는 과거 특허들을 이용하여 제2 변수값을 산출한다(S430). 즉, 특허 유망성 예측 장치는 분석 대상 특허와 동일한 기술분류를 가지고 있는 과거 특허의 출원년도와 분석 대상 특허의 출원년도 차이에 기초하여 제2 변수값을 산출한다. When step S420 is performed, the patent prospect prediction apparatus calculates a second variable value by using past patents having the same technology classification as the analysis target patent (S430). That is, the patent prospective prediction apparatus calculates a second variable value based on the difference between the filing year of the past patent and the filing year of the analysis target patent having the same technology classification as the analysis target patent.

단계 S430이 수행되면, 특허 유망성 예측 장치는 제1 변수값과 제2 변수값을 투입변수로 하는 알고리즘에 적용하여 분석 대상 특허에 대한 미래 시점의 피인용횟수를 예측한다(S440). 이때, 특허 유망성 예측 장치는 딥 러닝(deep learning), 회귀분석 등의 알고리즘을 이용하여 분석대상 특허의 피인용 횟수를 예측할 수 있다. When step S430 is performed, the apparatus for predicting patent prospects is applied to an algorithm using the first variable value and the second variable value as input variables to predict the number of citations of a future time point for the analysis target patent (S440). In this case, the patent prospective prediction apparatus may predict the number of citations of the analysis target patent by using algorithms such as deep learning and regression analysis.

특허 유망성 예측 장치는 피인용 횟수에 기초하여 분석 대상 특허의 유망 특허 여부를 판단할 수 있다. The patent promising prediction device may determine whether a patent to be analyzed is a promising patent based on the number of citations.

도 5는본 발명의 일 실시예에 따른 특허 유망성 예측 장치가 분석 대상 특허의 표준기술특허 여부 또는 기술이전 여부를 예측하는 방법을 설명하기 위한 흐름도이다. 5 is a flowchart illustrating a method of predicting whether a patent prospective predicting device according to an embodiment of the present invention predicts whether an analysis target patent is a standard technology patent or a technology transfer.

도 5를 참조하면, 특허 유망성 예측 장치는 분석 대상 특허를 분석하여 서지정보를 획득한다(S510). 즉, 특허 유망성 예측 장치는 분석 대상 특허를 분석하여 출원번호, 출원일, 공개번호, 공개일, 출원인명, 기술분류, 전방인용횟수, 후방인용횟수, 청구항 수, 발명자 수 등을 포함하는 서지정보를 획득한다. Referring to FIG. 5, the patent prospect prediction apparatus analyzes an analysis target patent to obtain bibliographic information (S510). That is, the patent promising prediction device analyzes an analysis target patent to obtain bibliographic information including an application number, an application date, a publication number, a publication date, an applicant's name, a technology classification, a number of forward citations, a number of back citations, a number of claims, and a number of inventors. Acquire.

단계 S510의 수행 후, 특허 유망성 예측 장치는 서지정보내 기술분류의 서브클래스에 기초하여 제1 변수값을 산출한다(S520). 즉, 특허 유망성 예측 장치는 분석 대상 특허의 서브클래스 조합쌍과 분석 대상 특허의 출원일 전에 출원된 과거 특허들의 서브클래스 조합쌍을 비교하여, 분석 대상 특허에만 있는 서브클래스 조합쌍과 분석 대상 특허의 전체 서브클래스 조합쌍의 수에 기초하여 제1 변수값을 산출한다.After performing step S510, the patent prospective prediction apparatus calculates a first variable value based on the subclass of the technical classification in the bibliographic information (S520). That is, the patent prospect prediction apparatus compares a subclass combination pair of an analysis target patent and a subclass combination pair of past patents filed before the filing date of the analysis target patent, and compares the entire subclass combination pair and only the analysis target patent. A first variable value is calculated based on the number of subclass combination pairs.

단계 S520이 수행되면, 특허 유망성 예측 장치는 분석 대상 특허와 동일한 기술분류를 가지고 있는 과거 특허들을 이용하여 제2 변수값을 산출한다(S530). 즉, 특허 유망성 예측 장치는 분석 대상 특허와 동일한 기술분류를 가지고 있는 과거 특허의 출원년도와 분석 대상 특허의 출원년도 차이에 기초하여 제2 변수값을 산출한다. When step S520 is performed, the patent prospect prediction apparatus calculates a second variable value by using past patents having the same technology classification as the analysis target patent (S530). That is, the patent prospective prediction apparatus calculates a second variable value based on the difference between the filing year of the past patent and the filing year of the analysis target patent having the same technology classification as the analysis target patent.

단계 S530이 수행되면, 특허 유망성 예측 장치는 분석 대상 특허의 기술특성지표에 따른 변수값들을 각각 산출한다(S540). 여기서, 기술특성지표에 따른 변수값들은 기술의 일반성(TG), 기술적 기반(TB), 기술발명의 보호범위(PS), 기술발명자의 수(IC)의 변수들의 값일 수 있다. When step S530 is performed, the patent prospect prediction apparatus calculates variable values according to technical characteristic indexes of the analysis target patent, respectively (S540). Here, the variable values according to the technical characteristic index may be values of variables of generality of technology (TG), technical basis (TB), protection scope (PS) of technology invention, and number of technology inventors (IC).

단계 S540이 수행되면, 특허 유망성 예측 장치는 제1 변수값, 제2 변수값외에 기술특성지표에 따른 변수값들을 투입변수로 하는 알고리즘에 적용하여 분석 대상 특허에 대한 표준기술 여부 또는 기술이전 여부를 예측한다(S550). 이때, 특허 유망성 예측 장치는 딥 러닝(deep learning), 회귀분석 등의 알고리즘을 이용하여 표준기술여부 또는 기술이전 여부를 예측할 수 있다. When step S540 is performed, the apparatus for predicting patent promising is applied to an algorithm having variable values according to technical characteristic indexes as input variables in addition to the first variable value and the second variable value to determine whether the standard technology or the technology transfer for the target patent is analyzed. It is predicted (S550). In this case, the patent prospective prediction apparatus may predict whether standard technology or technology transfer is performed by using algorithms such as deep learning and regression analysis.

분석 대상 특허의 제1 변수값과 제2 변수값 외에, 기 설정된 변수들의 값을 산출하고, 산출된 변수들의 값 중 적어도 하나 이상을 로지스틱 회귀모형에 적용하여 분석대상 특허의 표준기술 여부를 예측할 수 있다In addition to the first variable value and the second variable value of the patent to be analyzed, a value of preset variables may be calculated, and at least one or more of the calculated values of the variables may be applied to a logistic regression model to predict whether or not the standard technology of the patent to be analyzed is standard technology. have

상술한 특허 유망성 예측 방법은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터로 판독 가능한 기록 매체에 기록될 수 있다. 이때, 컴퓨터로 판독 가능한 기록매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 한편, 기록매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지된 것일 수도 있다.The above-described patent prospect prediction method is implemented in the form of program instructions that can be executed by various computer means, and can be recorded in a computer-readable recording medium. In this case, the computer-readable recording medium may include program instructions, data files, data structures, and the like, alone or in combination. On the other hand, program instructions recorded on the recording medium may be those specially designed and constructed for the present invention, or may be known to those skilled in computer software.

컴퓨터로 판독 가능한 기록매체에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(Magnetic Media), CD-ROM, DVD와 같은 광기록 매체(Optical Media), 플롭티컬 디스크(Floptical Disk)와 같은 자기-광매체(Magneto-Optical Media), 및 롬(ROM), 램(RAM), 플래시 메모리, SSD (Solid State Drive)와 같은 메모리 저장장치 등 프로그램 명령을 저장하고 수행하도록 특별히 구성된 하드웨어 장치가 포함된다.Computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape, optical media such as CD-ROMs, DVDs, and magnetic disks such as floppy disks. Included are hardware devices specifically configured to store and execute program instructions, such as magneto-optical media, and memory storage devices such as ROM, RAM, flash memory, and solid state drives (SSD).

이제까지 본 발명에 대하여 그 바람직한 실시예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. Those skilled in the art will appreciate that the present invention can be implemented in a modified form without departing from the essential features of the present invention. Therefore, the disclosed embodiments should be considered in descriptive sense only and not for purposes of limitation. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope will be construed as being included in the present invention.

100 : 사용자 단말기
200 : 특허 유망성 예측 장치
210 : 분석부
220 : 인용횟수 예측부
230 : 표준기술 예측부
240 : 기술이전 예측부100: user terminal
200: patent prospect prediction device
210: analysis unit
220: citation count prediction unit
230: standard technology prediction unit
240: technology transfer prediction unit

Claims

In the method of predicting the prospect of the patent of the analysis target patent patent prediction apparatus,
Obtaining bibliographic information of the analysis target patent;
Calculating a first variable value and a second variable value based on detailed description elements of the technical classification in the bibliographic information; And
Predicting the number of citations of a future time point for the analysis target patent by applying deep learning to the calculated first and second variable values
Patent prospective prediction method comprising a.

The method of claim 1,
The technical classification is a hierarchical structure in which descriptions are classified into sections, classes, subclasses, main groups, and subgroups.
The detailed description element patent prediction method, characterized in that the subclass.

The method of claim 2,
Computing the first variable value,
Generating a subclass pair by combining subclasses in the technical classification of the target patent;
Generating a subclass pair by combining subclasses in the technical classification of past patents filed before the filing date of the patent to be analyzed;
Comparing the subclass pair of the patent to be analyzed with the subclass pair of the past patents to extract a subclass pair only in the patent to be analyzed; And
Calculating a first variable value based on the number of subclass pairs existing only in the analysis target patent and the total number of subclass pairs of the analysis target patent.

The method of claim 3,
Computing the second variable value,
Identifying identical technical classification patents having the same technical classification as the technical classification of the analysis target patent among past patents applied before the application date of the analysis target patent;
Calculating a difference of an application date between the analysis target patent and each same technology classification patent; And
Calculating a second variable value using the number of subclasses of the analysis target patent, the number of the same technology classification patent, and the filing date difference.

The method of claim 4, wherein
After calculating the first variable value and the second variable value,
Calculating values of pre-defined variables based on at least one of the number of technical classifications, backward citations, claims, and inventors of the target patent; And
Applying the at least one or more of the first variable value, the second variable value, and values of predefined variables to a logistic regression model to predict whether the analysis target patent is standard technology or technology transfer. Forecast method.

The method of claim 5,
And determining the prospect of the analysis target patent based on at least one of the predicted number of citations, standard technology patent, and technology transfer.

An analysis unit for obtaining bibliographic information by analyzing the analysis target patent;
The first variable value and the second variable value are calculated based on the detailed description elements of the technical classification in the bibliographic information, and deep learning is applied to the calculated first variable value and the second variable value to determine the patent. A citation count prediction unit for predicting a citation count at a future time;
Calculating values of the predefined variables based on at least one of the number of technical classifications of the analysis target patent, the number of back citations, the number of claims, and the number of inventors; and the first variable value, the second variable value, and the predefined variable. A standard technology / technology transfer prediction unit for predicting whether the analysis target patent is standard technology or technology transfer by applying at least one of the values to the logistic regression model; And
Prospective prediction unit for determining whether the patent is a promising patent for the analysis target patent based on at least one of the number of citations, whether the standard technology, technology transfer
Patent prospective prediction device comprising a.