KR101534159B1

KR101534159B1 - Method and System on Patent Information Association Analysis System for Social Network Analysis Result

Info

Publication number: KR101534159B1
Application number: KR1020090082997A
Authority: KR
Inventors: 강민수
Original assignee: (주)광개토연구소; 강민수
Priority date: 2009-09-03
Filing date: 2009-09-03
Publication date: 2015-07-08
Also published as: KR20090111791A

Abstract

본 발명은 특허 정보를 대상으로 한 SNA(Social Network Analysis) 정보 제공을 위한 융합 분석 방법 및 그 시스템에 관한 것이다. The present invention relates to a fusion analysis method and system for providing SNA (Social Network Analysis) information on patent information.

본 발명을 실시하면, 특허분류기호간, 공동발명자간, 공동출원인간, 공동인용특허간, 핵심 기술 키워드간의 SNA에 대한 융합 분석 결과를 제공받게 되는 장점이 있다.According to the present invention, there is an advantage that fusion analysis results for SNA between patent classification symbols, between co-inventors, co-filed human, common patent, and core technology keywords are provided.

특허 정보, SNA(Social Network Analysis), 융합, 특허 분석 Patent information, SNA (Social Network Analysis), convergence, patent analysis

Description

TECHNICAL FIELD The present invention relates to a patent information fusion analysis method and system for generating a social network analysis result,

특허 정보는 기술 정보, 권리 정보 및 경영 정보의 성격을 가지며, 글로벌한 국제 경쟁에서 그 중요성이 증가 되고 있다. 기술 정보로서 기술 개발 동향, 개별 특허에 적용된 기술적 아이디어를 알 수 있고, 권리 정보로서 개별 특허의 권리 범위, 국내외 권리화 정도를 파악할 수 있으며, 경영 정보로서 경쟁 기업의 기술 개발 동향 등을 파악할 수 있다.Patent information has the characteristics of technical information, right information and management information, and its importance is increasing in global international competition. It is possible to know technical development trends as technical information, technical ideas applied to individual patents, understand the scope of rights of individual patents as rights information, the degree of domesticization and domesticization, and grasp technology development trends of competitors as management information.

특허 정보를 입수할 수 있는 방법은 다양하나, 크게 1) 각국 특허청이 제공해 주는 특허 정보 시스템을 이용하는 것과 2) 민간 기업이 개발한 특허 정보 시스템을 이용하는 것, 및 3) 개별 기업이 자사의 목적에 맞게 구축된 특허 정보 시스 템을 이용하는 것 등으로 대별할 수 있다. 1)의 방법의 대표적인 예가 대한민국 특허청(www.kipo.go.kr) 및 대한민국 특허정보원(www.kipris.or.kr)이 제공하는 특허 정보 시스템이 있으며, 2)의 대표적인 예로 미국의 www.delphion.com, 일본의 www.patolis.co.jp, 대한민국의 www.wips.co.kr, www.wisdomain.com 등 있다.There are many ways to obtain patent information, but 1) it uses the patent information system provided by each country's patent office, 2) uses patent information system developed by private enterprise, and 3) And the use of a patent information system that is tailored. 1) is a patent information system provided by the Korea Intellectual Property Office (www.kipo.go.kr) and the Korea Intellectual Property Office (www.kipris.or.kr), and 2) .com, www.patolis.co.jp in Japan, www.wips.co.kr in Korea, and www.wisdomain.com.

www.delphion.com 의 운영회사인 톰슨사이언티픽과 같은 각 국가의 민간 기업들은 특허 정보 데이터베이스를 구축해 놓고, 특허 검색 엔진을 통하여 입력된 검색식에 대해 다양한 방식으로 검색 결과를 제공해 주고 있다. 또한, 이들 특허 정보를 기반으로 한 다양한 종류의 분석 소프트웨어가 개발되어 있으며, www.micropatent.com의 운영사(톰슨사이언티픽에 인수 합병되었음)가 개발한 상표명 AUREKA로 보급되는 소프트웨어가 유명하다. 그리고, 세계 각국의 기술 선진 대기업들은 내부에 특허 검색 및 특허 관리 시스템을 구축하여 운영하고 있는 것으로 알려져 있으나, 외부에서 그 기업들의 특허 검색 및 특허 관리 시스템에 대해 접근하기가 어렵다.Companies in each country, such as Thomson Scientific, the operating company of www.delphion.com, set up a patent information database and provide search results in a variety of ways to the search terms entered through the patent search engine. In addition, various kinds of analysis software based on these patent information have been developed, and software that is popular with the brand name AUREKA developed by the operator of www.micropatent.com (merged with Thompson Scientific) is famous. It is known that large-scale advanced technology companies in the world have built and operated a patent search and patent management system in the inside, but it is difficult to access the patent search and patent management system of the companies from outside.

이러한 특허 검색 및 특허 관리 시스템을 구축하기 위해서는 수십억원 이상의 비용이 지출되는 것이 통상이어서, 국내외 대부분의 중견 및 중소 벤처 기업들은 자체 특허 검색 및 특허 관리 시스템을 구축하고 있지 못하고 있다. 이들 중견 및 중소 벤처 기업들은 특허 전담 조직을 두고 있는 경우도 다수 있으며, 이들이 자사의 특허에 대한 관리 및 자사를 위한 특허 정보를 수집하고 있다. 하지만, 기업 규모가 작거나, 특허에 대한 의식이 크지 않은 기업들은 이러한 특허 전담 조직을 갖추고 있지 못하는 경우가 많다. 이들 기업들은 특허법률사무소나 로펌 등에 의뢰하여 자신의 특허를 관리하고 있는 수준에 거치고 있다.In order to construct such a patent search and patent management system, it is common to spend more than several billions of dollars, and most domestic and foreign medium and small-sized venture companies are not building their own patent search and patent management system. These mid- to small-sized venture companies have a patent-dedicated organization, and they are collecting patent information for management and own company. However, companies that are small in size or not conscious of patents often do not have such a patented organization. These companies have been submitting to patent law offices or law firms to manage their own patents.

이들 특허 전담 조직은 www.delphion.com 등과 같은 외부에서 접근 가능한 유무료 온라인 특허 정보 서비스 업체들을 이용하여, 이들이 제공해 주는 플랫폼으로 특허 정보를 수집, 관리하고 있다. 그리고, 기업 명의로 된 특허 출원의 대부분은 그 기업 소속 발명자들에 의해서 이루어 지며, 그 발명자는 자신의 발명과 관련된 국내외의 특허 정보를 수집하는 것이 전세계적인 추세가 된다. 그 이유가 특허 공개 기간의 존재에도 불구하고 전세계의 신기술 정보는 70% 이상이 특허 정보의 형태로 공개되며, 특허 정보가 논문 정보 등에 비해서 상업적으로 이용가능한 아이디어가 많기 때문으로 풀이된다. 하지만, 이들 발명자들의 상당 부분은 특허 검색에 익숙해 있지 않거나, 검색에 익숙한 자라 하더라도 www.deljphion.com 등과 같은 외부 특허 정보 서비스 제공업체에 접속하여 특허 정보를 입수 받고 있다. These patents are collecting and managing patent information on a platform that they provide by using online and offline patent information service providers such as www.delphion.com. Most of the patents filed under the name of the company are made by the inventors belonging to the company, and the inventor becomes a global trend to collect patent information at home and abroad related to his invention. The reason is that despite the existence of the patent publication period, more than 70% of the new technology information in the world is disclosed in the form of patent information, and patent information is more commercially available than the paper information. However, many of these inventors are not familiar with patent searches, or even those who are familiar with the search are accessing patent information service providers such as www.deljphion.com to obtain patent information.

통상적으로 발명자들은 특정 기술 분야에 대하여 수년 내지 십수년간 지속적으로 연구하는 것이 일반적이므로, 자신 발명한 기술 분야가 크게 변동되지 않는 것이 일반적이다. 그러므로, 자신이 발명한 기술 분야와 직접 또는 간접적으로 관련되는 분야에 관한 국내 또는 해외 각국의 특허 기술 정보는 발명자에게 유익한 정보가 될 것이다. 그러므로, 개별 기업 단위 뿐만 아니라, 그 기업 소속의 발명자 단위로 운영되며, 발명자 단위까지 최적화되어 특정 발명자 만을 위한 차별화된 특허 정보 서비스 시스템의 공급이 요청되어 왔으며, 이와 같은 특허 정보 서비스 시스템은 자체적으로 개발하고 유지할 수 없는 중견 및 중소벤처 기업들에게는 특허 정보에 대한 접근성을 크게 향상시킬 것이며, 자체적으로 개발 유지할 수 있는 기 업들에게는 사내 특허 정보 유통의 새로운 플랫폼이 될 것이다.Generally, it is common for the inventors to continuously study for a certain technology field for several years to several decades, so that it is common that the inventive technology field does not greatly change. Therefore, the patent technical information of domestic or foreign countries concerning the field directly or indirectly related to the technical field invented by the inventor will be useful information to the inventor. Therefore, it has been demanded not only to individual companies but also to the inventor's unit, and to supply differentiated patent information service system for only the specific inventor by optimizing to the inventor's unit. Such a patent information service system is developed It will greatly improve the accessibility of patent information to medium and small-sized venture companies that can not be maintained, and it will become a new platform for in-house patent information distribution for companies that can develop and maintain themselves.

또한, 기업 또는 발명자들이 자신이 관리하는 특허를 자동으로 분석하여, 이들과 관련된 경쟁 기업 및 경쟁 기술에 관한 분석 정보, 경쟁 기업 및 경쟁 기술에 대한 감시 정보 등이 자동으로 정리되어 기업 또는 발명자에게 직접 제공된다면, 기업 또는 발명자에게 편리할 것이다.In addition, companies or inventors automatically analyze the patents they manage and analyze information about their competitors and competitive technologies, monitoring information about competitors and competitive technologies, etc., If provided, it would be convenient for the enterprise or inventor.

본 발명이 이루고자 하는 기술적 과제는 특허 문건 정보에 대한 한위 특허 분류 기호 정보 포함한 특허 분류 기호, 공동 출원인, 공동 발명자, 인용 문건, 핵심 기술 키워드 간의 융합 분석 방법 및 그 시스템을 제시하는 것이다.The present invention is directed to a method and system for analyzing a fusion between a patent classification symbol, a joint applicant, a co-inventor, a citation document, and a key technology keyword including a patent classification symbol information on a patent document information.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 사용자의 컴퓨터에 특허 정보를 제공하는 시스템이, 특허 문건 집합에 대한 선택 정보를 입수 받는 단계; 상기 선택 정보를 기초로 처리 대상 특허 문건 집합이 특정되는 단계; 상기 특정된 처리 대상 특허 문건 집합에 대하여 적어도 하나 이상의 기설정된 처리를 수행하여 가공 특허 정보가 생성되는 단계;를 더 포함하며, 상기 처리 대상 특허 문건 집합을 특정하는 단계 및 상기 기설정된 계산을 수행하여 가공 특허 정보를 생성하는 단계 중 어느 한 단계 이상에서 적어도 하나 이상의 특허 분류 기호를 활용하며, 상기 특허 분류 기호의 활용은 상기 선택 정보를 입수 받는 단계 및 상기 가공 특허 정보를 생성하는 단계 중 어느 한 단계 이상에서 제1 특허 분류 기호가 주어졌을 때, 상기 제1 특허 분류 기호가 속하는 특허 분류 기호 체계 상에서 상기 제1 특허 분류 기호가 기설정된 레벨에 해당하는 경우, 상기 제1 특허 분류 기호의 하위 특허 분류 기호에 해당하는 특허 정보를 포함하도록 정보 처리하는 것이며, 상기 특허 분류 기호는 국제 특허 분류(IPC), 미국 특허 분류(USPC), 일본의 파일 인덱스(FI), 일본의 F-term(FT), 유럽 특허 분류(ECLA) 중 어느 하나 이상인 것을 특징으로 하는 특허 정보 제공 방법을 제공한다. According to an aspect of the present invention, there is provided a system for providing patent information to a user's computer, the method comprising: receiving selection information on a set of patent documents; Specifying a set of patent documents to be processed based on the selection information; Further comprising the step of generating processed patent information by performing at least one predetermined process on the specified set of patent documents to be processed, the method comprising: specifying the set of patent documents to be processed; and performing the predetermined calculation At least one of the patent classification symbols is used in at least one of the steps of generating patented patent information, and the utilization of the patent classification symbol is performed at any one of the steps of obtaining the selection information and generating the processed patent information When the first patent classification symbol is given, if the first patent classification symbol corresponds to a predetermined level on the patent classification scheme to which the first patent classification symbol belongs, And the patent information corresponding to the symbol is included in the patent information, The symbol may be any one or more of the International Patent Classification (IPC), the US Patent Class (USPC), the Japanese File Index (FI), the Japanese F-term (FT) and the European Patent Classification (ECLA) &Lt; / RTI >

상기 기설정된 레벨은 도트 레벨(dot level)의 특허 분류 기호인 것이며, 상기 도트 레벨의 특허 분류 기호는 상기 특허 분류 기호 체계 상의 특허 분류 기호에 대응되는 설명인 타이틀(title) 정보에 도트(dot)가 포함되어 있는 것인 것이 바람직하다.The predetermined level is a patent classification symbol of a dot level and the patent classification symbol of the dot level includes a dot in a title information which is a description corresponding to a patent classification symbol in the patent classification system, Is preferably included.

상기 도트 레벨의 특허 분류 기호는 상기 특허 분류 기호가 국제 특허 분류(IPC)인 경우에는 서브 그룹 이하의 특허 분류 기호 중에서 특허 분류 기호에 대응되는 설명인 타이틀(title) 정보에 도트(dot)가 포함되어 있는 것이며, 미국 특허 분류(USPC)의 경우에는 서브클래스 이하의 특허 분류 기호 중에서 특허 분류 기호에 대응되는 설명인 타이틀(title) 정보에 도트(dot)가 포함되어 있는 것이며, 일본의 파일 인덱스(FI)의 경우에는 서브 그룹 이하의 특허 분류 기호 중에서 특허 분류 기호에 대응되는 설명인 타이틀(title) 정보에 도트(dot)가 포함되어 있는 것이며, 일본의 F-term(FT)일 경우에는 테마 코드 미만의 특허 분류 기호 중에서 특허 분류 기호에 대응되는 설명인 타이틀(title) 정보에 도트(dot)가 포함되어 있는 것이며, 유럽 특허 분류(ECLA)의 경우에는 서브 그룹 이하의 특허 분류 기호 중에서 특허 분류 기호에 대응되는 설명인 타이틀(title) 정보에 도트(dot)가 포함되어 있는 것인 것이 바람직하다.If the patent classification symbol is the international patent classification (IPC), the dot classification of the dot level includes a dot in the title information which is a description corresponding to the patent classification symbol among the patent classification symbols below the subgroup In the case of the United States Patent Class (USPC), a dot is included in title information, which is a description corresponding to a patent classification symbol among subclasses and the following patent classification symbols, and a Japanese file index In the case of the Japanese F-term (FT), the title code is a title code corresponding to the patent classification symbol. (Dot) is included in the title information, which is a description corresponding to the patent classification symbol among the patent classification symbols, and in the case of the European Patent Classification (ECLA) It is preferable that a dot is included in the title information which is a description corresponding to the patent classification symbol among the patent classification symbols below the group.

상기 제1 특허 분류 기호의 하위 특허 분류 기호에 해당하는 특허 정보를 포함하도록 정보 처리하는 것은, 상기 제1 특허 분류 기호가 도트 레벨의 특허 분류 기호일 경우, 상기 도트 레벨의 제1 특허 분류 기호의 모든 하위 특허 분류 기호 및 상기 제1 특허 분류 기호의 모든 하위 특허 분류 기호에 대응되는 특허 문건 집합 중 어느 하나 이상을 정보 처리의 대상으로 하는 것인 것이 바람직하다.The information processing to include the patent information corresponding to the sub-patent classification symbol of the first patent classification symbol is performed when the first patent classification symbol is a patent classification symbol of a dot level, It is preferable that at least one of all the sub-patent classification symbols and the set of patent documents corresponding to all the sub-patent classification symbols of the first patent classification symbol is subjected to information processing.

상기 선택 정보를 입수 받는 방법은 1) 사용자 컴퓨터로부터 검색식으로 입력 받는 방법, 2) 사용자 컴퓨터로부터 상기 사용자 컴퓨터에 제시된 기설정된 상기 특허 문건에 포함된 서지 사항을 구성하는 정보 중에서 선택되는 어느 하나 이상에 대한 선택 정보를 전송받 받는 방법, 3) 상기 사용자가 사전에 생성해 놓은 적어도 하나 이상의 특허 문건 집합 중에서 선택된 특정 특허 문건 집합에 대한 선택 정보를 입수 받는 방법 및 4) 상기 시스템으로부터 기설정된 적어도 하나 이상의 조건을 제공받는 방법 중 어느 하나의 방법 이상이 사용되는 것인 것인 것이 바람직하다.The method for receiving the selection information includes: 1) a method of receiving input from a user computer through a search formula; and 2) information constituting a bibliography included in the predetermined patent document presented on the user's computer from the user's computer A method for receiving selection information on a set of specific patent documents selected from among a set of at least one patent document previously generated by the user and 4) a method for receiving selection information for at least one It is preferable that at least one of the above-mentioned methods is used.

상기 처리 대상 특허 문건 집합이 특정되는 단계는 상기 1) 내지 4)의 방법 중 어느 하나가 실시되었을 때, 상기 1) 내지 4)의 방법의 실시되는 과정에서 적어도 하나 이상의 제1 특허 분류 기호가 포함되어 있고, 상기 제1 특허 분류 기호가 도트 레벨의 특허 분류 기호일 경우, 상기 도트 레벨의 제1 특허 분류 기호의 모든 하위 특허 분류 기호에 대응되는 특허 문건 집합이 처리 대상 특허 문건 집합으로 특정되게 되는 것인 것이 바람직하다.The step of specifying the set of patent documents to be processed includes at least one first patent classification symbol in the course of the implementation of the methods 1) to 4) when any one of the methods 1) to 4) And when the first patent classification symbol is a patent classification code of a dot level, a set of patent documents corresponding to all the lower patent classification symbols of the first patent classification symbol of the dot level is specified as a set of patent documents to be processed .

상기 도트 레벨의 상기 제1 특허 분류 기호의 모든 하위 특허 분류 기호에 대응되는 특허 문건 집합이 처리 대상 특허 문건 집합으로 특정되게 되는 방법은, 상기 도트 레벨의 상기 제1 특허 분류 기호의 모든 하위 특허 분류 기호를 입수하고, 상기 제1 특허 분류 기호 및 상기 모든 하위 특허 분류 기호를 포함하는 검색 식 또는 질의식으로 상기 처리 대상 특허 문건 집합이 특정되게 하는 방법인 것이 바람직하다.A method in which a set of patent documents corresponding to all the sub-patent classification symbols of the first patent classification symbol at the dot level is specified as a set of patent documents to be processed is characterized in that all the sub-patent classification , And a set of patent documents to be processed is specified by a search formula or a quality expression including the first patent classification symbol and all the sub-patent classification symbols.

상기 특허 문건 집합을 구성하는 개별 특허는 적어도 하나 이상의 기설정된 가중치 부여 정책에 따라 가중치가 부여된 특허를 하나 이상을 포함하며, 상기 가공 특허 정보는 상기 개별 특허에 부여된 가중치가 반영되어 생성되는 것인 것이 바람직하다.The individual patent constituting the patent document set includes at least one patent having weighted according to at least one predetermined weighting policy, and the processed patent information is generated by reflecting the weight given to the individual patent .

상기 가중치 부여 정책은 상기 개별 특허와 관련된 1) 기설정된 카운팅 정보 및 2) 기설정된 계산값 정보 중 어느 하나 이상을 포함하여 계산되는 것이며, 상기 기설정된 카운팅 정보의 대상이 되는 카운팅 값은 1) 제1국 및 제2국 중 어느 하나 이상의 국가에서의 패밀리수, 2) 청구항 수 및 3) 피인용수 중 어느 하나 이상을 포함하는 것이며, 상기 기설정된 계산값 정보에는 1) 상기 특허의 발명자에 대한 적어도 하나 이상의 평가 점수, 2) 상기 특허의 출원인에 대한 적어도 하나 이상의 평가 점수, 3) 상기 발명자의 상기 특허에 포함된 특허 분류 기호에 대한 적어도 하나 이상의 기설정된 계산값, 4) 상기 출원인의 상기 특허에 포함된 특허 분류 기호에 대한 적어도 하나 이상의 기설정된 계산값, 5) 상기 출원인의 재무 정보를 반영한 적어도 하나 이상의 기설정된 계산값 및 6) 상기 특허에 포함된 특허 분류 기호 자체에 대한 적어도 하나 이상의 기설정된 계산값 중 어느 하나 이상을 포함하여 계산되는 것인 것이 바람직하다.Wherein the weighting policy is calculated to include at least one of 1) predetermined counting information related to the individual patent, and 2) predetermined calculation value information, and the counting value to be a target of the preset counting information is 1) The number of families in at least one country of the first country and the second country, 2) the number of claims, and 3) the number of persons cited. The predetermined calculation value information includes 1) At least one evaluation score for at least one evaluation score for the applicant of the patent; 3) at least one predetermined calculation value for the patent classification symbol included in the patent of the inventor; 4) At least one predetermined calculation value for the patent classification symbol included in the applicant's financial information, That the specified calculated value and 6), which is calculated to include any one or more of the calculated value at least one pre-set for the patent classification codes themselves contained in the patent are preferred.

상기 가공 특허 정보가 상기 사용자의 컴퓨터에 제공될 때, 상기 특허 분류 기호 정보에 대응되는 가공 특허 정보가 제공되며, 상기 사용자의 컴퓨터로부터 상 기 특허 분류 기호의 적어도 하나 이상의 하위 특허 분류 기호에 대한 가공 특허 정보가 요청될 때, 상기 가공 특허 정보는 상기 특허 분류 기호가 포함된 특허 분류 기호 체계 상에서, 상기 특허 분류 기호의 적어도 하나 이상의 직하위 특허 분류 기호에 대응되는 가공 특허 정보를 제공하는 것이 바람직하다.When the processed patent information is provided to the user's computer, processing patent information corresponding to the patent classification symbol information is provided, and processing of at least one or more sub-patent classification symbols of the patent classification symbol When the patent information is requested, the processed patent information preferably provides the processed patent information corresponding to at least one or more sub-patent classification symbols of the patent classification symbol on the patent classification scheme including the patent classification symbol .

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 사용자의 컴퓨터에 특허 정보를 제공하는 시스템이, 특허 문건 집합에 대한 선택 정보를 입수 받는 단계; 상기 선택 정보를 기초로 처리 대상 특허 문건 집합이 특정되는 단계; 상기 특정된 처리 대상 특허 문건 집합에 대하여 적어도 하나 이상의 기설정된 처리를 수행하여 가공 특허 정보가 생성되는 단계;를 더 포함하며, 상기 선택 정보를 입수 받는 방법은 1) 사용자 컴퓨터로부터 검색식으로 입력 받는 방법, 2) 사용자 컴퓨터로부터 상기 사용자 컴퓨터에 제시된 기설정된 상기 특허 문건에 포함된 서지 사항을 구성하는 정보 중에서 선택되는 어느 하나 이상에 대한 선택 정보를 전송받 받는 방법, 3) 상기 사용자가 사전에 생성해 놓은 적어도 하나 이상의 특허 문건 집합 중에서 선택된 특정 특허 문건 집합에 대한 선택 정보를 입수 받는 방법 및 4) 상기 시스템으로부터 기설정된 적어도 하나 이상의 조건을 제공받는 방법 중 어느 하나의 방법 이상이 사용되는 것인 것인 것이며, 상기 가공 특허 정보는 상기 특정된 처리 대상 특허 문건 집합에 대한 적어도 하나 이상의 기설정된 분석 처리를 통하여 생성되는 적어도 하나 이상의 다차원 분석 정보인 것이며, 상기 다차원을 구성하는 개별적인 차원은 1)시간, 2)특허 분류 기호, 3) 출원인, 4) 발명자, 5) 대리인, 6) 국가, 7) 키워드 및 8) 융합 패턴 중에서 선택되는 어느 하나 이상 인 것이며, 상기 다차원 분석 정보는 상기 처리 대상 특허 문건 집합을 대상으로 하여 상기 적어도 하나 이상의 개별적 차원에 대한 적어도 하나 이상의 기설정된 카운팅 값, 적어도 하나 이상의 기설정된 계산식에 의한 계산값, 적어도 하나 이상의 질의식에 의한 질의 결과값 및 적어도 하나 이상의 분석 처리 알고리즘에 대한 처리 결과값 중 어느 하나 이상인 것을 특징으로 하는 특허 정보 제공 방법을 제공한다.According to an aspect of the present invention, there is provided a system for providing patent information to a user's computer, the method comprising: receiving selection information on a set of patent documents; Specifying a set of patent documents to be processed based on the selection information; And generating processed patent information by performing at least one predetermined process on the specified set of patent documents to be processed. The method for obtaining the selection information includes: 1) A method for receiving selection information on at least one selected from information constituting a bibliography included in a predetermined patent document presented in the user's computer from a user's computer, A method for obtaining selection information for a specific set of patent documents selected from at least one set of patent documents made by the system, and 4) a method for receiving at least one predetermined condition from the system is used , And the processed patent information is stored in the specified processing table 1) time, 2) a patent classification symbol, 3) an applicant, 4) an inventor, and 4) an inventor, , 5) an agent, 6) a country, 7) a keyword, and 8) a fusion pattern, and the multidimensional analysis information includes at least one of at least one individual dimension And a processing result value for at least one or more analysis processing algorithms, the at least one predetermined counting value, the at least one predetermined calculation expression, the at least one query result value, Provide a method of providing.

상기 분석 정보를 생성할 때, 상기 분석 정보는 상기 처리 대상 문건 집합에 포함된 특허 분류 기호에 대하여, 1) 모든 특허 분류를 포함하여 처리하는 방법 및 2) 주특허분류(main patent classification)만을 처리하는 방법 중에서 선택되는 어느 한 방법 이상을 사용하며, 상기 모든 특허 분류를 포함하여 처리하는 경우에는 a) 모든 특허 분류를 동등한 가중치로 처리하는 방법 및 b) 주특허분류(main patent classification)와 부특허분류(sub patent classification)에 대하여 다른 가중치로 처리하는 방법 중에서 선택되는 어느 하나 이상을 적용하여 분석 정보를 생성하는 것인 것이 바람직하다.When generating the analysis information, the analysis information may include information about (1) a method of processing all patent classes, (2) a method of processing only the main patent classification, and A method of treating all patent classes with equivalent weights, and b) a method of processing main patent classification and patent patent It is preferable that the analysis information is generated by applying at least one selected from among methods of processing the sub-patent classification with different weights.

상기 분석 처리는 특허 분류 기호간의 융합 분석 처리이며, 상기 특허 분류 기호간의 융합 분석 처리는 상기 처리 대상 특허 문건 집합에 포함된 특허 문건 중 동일 특허 분류 체계 상의 2개 이상의 특허 분류 기호를 포함하는 특허 문건들을 대상으로 하여, 상기 특허 문건들에 포함된 상기 2개 이상의 특허 분류들 간의 연관성(association) 분석, 빈발 패턴 분석, 상관성 분석 및 군집화 중 어느 하나 이상을 수행하는 것인 것이 바람직하다.The analysis processing is a fusion analysis processing between patent classification symbols. The fusion analysis processing between the patent classification symbols includes a patent document including two or more patent classification symbols on the same patent classification system among the patent documents included in the processing target patent set It is desirable to perform at least one of association analysis, frequent pattern analysis, correlation analysis, and clustering among the two or more patent classes included in the patent documents.

상기 특허 분류 기호 간의 융합 분석 처리에 있어서, 동일 특허 분류 체계 상에서 상기 2개 이상의 특허 분류 기호가 하나의 특허 문건에 포함되어 있을 경우, 상기 2개 이상의 특허 분류 기호가 상기 특허 분류 기호 체계 상의 최상위 특허 분류 기호가 아니거나 기설정된 레벨의 분류 기호가 아닌 경우, 상기 융합 분석 처리는 상기 2개 이상의 특허 분류 기호 각각에 대하여, 상기 2개 이상의 특허 분류 기호의 기설정된 레벨까지의 상위 특허 분류 기호를 포함하여 처리하는 것인 것이 바람직하다.When two or more patent classification symbols are included in one patent document in the same patent classification system in the fusion analysis processing between the patent classification symbols, the two or more patent classification symbols may be the highest patent The fusion analysis process includes a top patent classification symbol up to a predetermined level of the two or more patent classification symbols for each of the two or more patent classification symbols if the classification symbol is not a classification symbol or a classification symbol of a predetermined level And the like.

상기 융합 분석 처리는 상기 처리 대상 특허 문건 집합을 구성하는 개별 특허 문건에 대하여 1) 하나의 특허 문건의 기설정된 적어도 하나 이상의 필드에서 추출한 텍스트 정보를 기초로 생성되는 적어도 2 이상의 기술 키워드 간의 융합 분석 처리, 2) 공동 발명자가 있는 특허 문건에 대한 공동 발명자 간의 융합 분석 처리, 3) 공동 출원인이 있는 특허 문건에 대한 공동 출원인 간의 융합 분석 처리 및 4) 하나의 특허 문건의 인용 정보에 함께 포함되어 있는 인용 특허 문건들 간의 융합 분석 처리 중 어느 하나 이상을 포함하며, 상기 융합 분석 처리는 상기 특허 문건들에 포함된 상기 2개 이상의 기술 키워드, 공동 발명자, 공동 출원인, 공동 인용 특허 문건들 간의 연관성(association) 분석, 빈발 패턴 분석, 상관성 분석 및 군집화 중 어느 하나 이상을 수행하는 것인 것이 바람직하다.The convergence analysis processing may be performed for individual patent documents constituting the set of patent documents to be processed by 1) fusion analysis processing between at least two technical keywords generated based on text information extracted from at least one field of one patent document , 2) the analysis of the convergence between the co-inventors on the patent document with the co-inventor, 3) the analysis of the convergence between the co-applicants on the patent document with the co-applicant, and 4) the citation included in the citation information of one patent document And the fusion analysis processing between the patent documents. The fusion analysis processing is an association between the two or more technical keywords, the co-inventor, the co-applicant, and the joint patent documents included in the patent documents, Analysis, frequent pattern analysis, correlation analysis, and clustering. That the will is preferred.

상기 융합 분석 처리 결과를 적어도 2개 이상의 노드를 포함하는 네트워크 다이어그램으로 시각화 처리하는 단계;를 더 포함하며, 상기 시각화 처리에는 상기 노드에 대한 정보 및 상기 노드와 노드를 연결하는 선에 대한 정보 중 어느 하나 이상을 더 포함하고 있는 것인 것이 바람직하다.And visualizing the result of the fusion analysis processing into a network diagram including at least two nodes, wherein the visualization processing includes a step of displaying information on the node and information on a line connecting the node and the node And more preferably at least one.

상기 노드와 노드를 연결하는 선에 대한 정보에는 수치 정보가 더 포함되어 있으며, 상기 사용자 컴퓨터로부터 상기 수치 또는 상기 선에 대한 선택 정보가 입수받는 단계; 상기 상기 수치 정보에 대응되는 근거 특허 문건 정보를 상기 처리 대상 특허 문건 집합이 포함되는 특허 정보 데이터베이스에서 추출하는 단계; 및 상기 추출된 근거 특허 문건 정보를 상기 사용자 컴퓨터에 전송하는 단계;를 더 포함하는 것이 바람직하다.Wherein the information about the line connecting the node and the node further includes numerical information and the selection information for the numerical value or the line is received from the user computer; Extracting, from the patent information database including the set of patent documents to be processed, the found patent document information corresponding to the numerical information; And transmitting the extracted patent document information to the user computer.

상기 처리 대상 문건 집합에 대하여, 상기 사용자 컴퓨터로부터 상기 처리 대상 문건 집합에 대한 변경 정보를 입수 받는 단계; 및 상기 입수 받은 변경 정보를 기준으로 처리 대상 문건 집합을 특정하는 단계;를 더 포함하며, 상기 변경 정보의 입수는 상기 사용자 컴퓨터에 변경 정보의 입수를 위한 인터페이스를 제공하고, 상기 인터페이서에 대한 상기 사용자의 선택 정보를 입수받는 방식으로 수행되는 것인 것이며, 상기 인터페이스에는 1) 시간 종류 선택 인터페이스, 2) 시간 구간 선택 인터페이스, 3) 국가 선택 인터페이스 4) 특허 분류 기호 선택 인터페이스 5) 인용 특화 인터페이스, 6) 패밀리 정보 포함 선택 인터페이스, 7) 검색 후 선택 인터페이스 중 어느 하나 이상을 포함하며, 상기 시간 종류 인터페이스에는 출원일, 공개일 및 등록일 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 시간 구간 선택 인터페이스는 시작 구간과 종료 구간 각각에 대한 년/월/주/일 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 국가 선택 인터페이스는 적어도 2 이상의 국가에 대한 어느 하나 이상의 선택을 제 공하는 인터페이스를 포함하며, 상기 특허 분류 기호 선택 인터페이스는 특허 분류 기호의 종류 선택 인터페이스, 특허 분류 기호 레벨 선택 인터페이스, 메인 특허 분류 기호 및 서브 특허 분류 기호 선택 인터페이스 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 인용 특허 인터페이스에는 전방 인용과 후방 인용 중 어느 하나 이상의 선택에 관한 인용 방향 인터페이스, 인용 깊이의 선택에 관한 인용 깊이 인터페이스, 직접 인용과 간접 인용 중 어느 하나 이상의 인용의 선택에 관한 인터페이스 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 패밀리 정보 포함 선택 인터페이스에는 패밀리 정보 포함 여부 및 국가 범위의 선택에 관한 인터페이스 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 검색 후 선택 인터페이스는 상기 처리 대상 문건 집합을 검색 대상으로 하여 기설정된 필드 검색을 제시하고, 상기 필드에 사용자 컴퓨터로부터 검색어를 입수받고, 상기 검색어에 대응되는 검색 결과를 제공하고, 상기 검색 결과 중에서 적어도 하나 이상의 선택 정보를 입수 받아, 상기 선택 정보와 관련되는 특허 문건 집합만을 처리 대상 문건 집합으로 하도록 처리하는 인터페이스를 포함하는 것인 것이 바람직하다.Receiving change information about the set of documents to be processed from the user computer for the document set to be processed; And a step of specifying a set of documents to be processed based on the received change information, wherein the change information is obtained by providing an interface for obtaining change information to the user computer, 2) a time interval selection interface; 3) a country selection interface; 4) a patent classification symbol selection interface; 5) a citation specialized interface; and 6) ) Family information selection interface, and 7) a post-search selection interface, wherein the time type interface includes an interface for providing a selection of at least one of a filing date, a disclosure date, and a registration date, For each of the start and end sections / Month / week / day, wherein the country selection interface comprises an interface providing at least one selection for at least two countries, the patent classification selection interface comprising: An interface providing a selection of at least one of a class classification symbol selection interface, a patent classification symbol level selection interface, a main patent classification symbol, and a subpublic classification selection interface, wherein the cited patent interface includes a front citation and a rear citation An interface for providing a selection of one or more of a citation direction interface for any one or more selections, a citation depth interface for selection of a citation depth, and an interface for selection of at least one of direct citation and indirect citation, Wherein the selection interface includes an interface for providing a selection of at least one of whether or not to include family information and a selection of a country range, wherein the post-search selection interface is configured to select, as a search target, A search result corresponding to the search word is obtained, and at least one selection information is obtained from the search result, and only a set of patent documents related to the selection information It is preferable to include an interface for processing the document set to be processed.

상기 분석 처리는 적어도 2 이상의 특허 문건을 1개의 분석 단위로 하는 집합 단위의 인용 분석 처리이며, 상기 집합 단위의 인용 분석 처리는 1)전방 인용 특허 문건 집합 대상 분석 처리, 2) 후방 인용 특허 문건 집합 대상 분석 처리 및 3) 인용 발생 자기 특허 문건 집합 대상 분석 처리 중 어느 하나 이상을 포함하며, 상기 1) 내지 3)의 분석 처리는 상기 1) 내지 3)의 분석 처리의 대상이 되는 특허 문건 집합에서 상기 적어도 하나 이상의 기설정된 다차원 분석 정보를 생성해 내는 것인 것이며, 상기 전방 인용 특허 문건 집합 대상 분석 처리를 위한 전방 인용 특허 문건 집합의 특정은 처리 대상 특허 문건 집합에 포함되어 있는 적어도 1 인용 깊이 이상의 인용 정보로 구성되는 문건 집합을 포함하는 방식으로 특정되는 것이며.The analysis processing is a categorization analysis processing of a set unit that takes at least two patent documents as one analysis unit, and the citation analysis processing of the set unit is performed as follows: 1) analysis processing of a patent document set subject to a front citation; 2) Subject analysis processing, and citation generation self patent document set subject analysis processing, and the analysis processing of 1) to 3) above is performed in a set of patent documents to be subjected to the analysis processing of 1) to 3) And the at least one predetermined multidimensional analysis information is generated, and the specification of the set of patent documents for forward citation for the patent document set subject analysis processing for forward citation is at least one depth for at least one person included in the set of patent documents to be processed And a set of documents consisting of citations.

상기 후방 인용 특허 문건 집합 대상 분석 처리를 위한 후방 인용 특허 문건 집합의 특정은 처리 대상 특허 문건 집합을 인용하고 있는 적어도 1 인용 깊이 이상의 후출원 문건 집합을 포함하는 방식으로 특정되는 것이며.The specification of the set of patent documents for the backward citation for the patent document set subject analysis is specified in such a way that it includes a set of documents to be filed later than at least one person citing the set of patent documents to be processed.

상기 인용 발생 자기 특허 문건 집합 대상 분석 처리를 위한 인용 발생 자기 특허 문건 집합의 특정은 처리 대상 특허 문건 집합을 구성하는 특허 문건 중 적어도 1회 이상의 인용을 받은 특허 문건으로 구성되는 특허 문건 집합을 포함하는 방식으로 특정되는 것인 것이 바람직하다.Citation generation for self-patent document set subject analysis The specification of the self-patent document set for analysis includes a set of patent documents consisting of patent documents cited at least once among the patent documents constituting the set of patent documents to be processed It is preferable that the above-mentioned method is specified by a method.

상기 집합 단위의 인용 분석 처리는 전방 인용 특허 문건 집합의 특정 시에는 상기 특허 문건 집합에 대한 간접 전방 인용 문건 집합을 더 포함하는 방식, 상기 후방 인용 특허 문건 집합의 특정 시에는 간접 후방 인용 문건 집합 중 어느 하나 이상을 더 포함하여 집합 단위의 인용 분석 처리를 수행하는 것인 것이며, 상기 간접 전방 인용 문건 집합은, 상기 처리 대상 특허 문건 집합을 구성하는 처리 대상 특허 문건에 대하여, 상기 처리 대상 특허 문건보다 출원일이 앞서면서, 상기 처리 대상 특허 문건이 인용하는 특허 문건을 동시에 인용하고 있는 특허 문건들로 구성되는 것이며, 상기 간접 후방 인용 문건 집합은, 상기 처리 대상 특허 문건 집 합을 구성하는 처리 대상 특허 문건에 대하여, 상기 처리 대상 특허 문건보다 출원일이 늦으면서, 상기 처리 대상 특허 문건을 인용하는 특허 문건이 동시에 인용하고 있는 특허 문건들로 구성되는 것인 것이 바람직하다.The citation analysis process of the set unit includes a method of further including a set of indirectly quoted documents for the set of patent documents at the time of specifying the set of patent documents for the front cite, Wherein the set of indirectly quoted documents includes at least one of the set of patent documents, and the set of indirectly quoted documents includes at least one set of patent documents, Wherein the indirect posterior citation document set comprises a patent document citing the patent document cited by the patent document to be processed at the same time as the filing date is ahead of the filing date of the patent document, The filing date is later than the patent document to be processed, It would patent documents citing target patent document consisting of patent documents that are cited at the same time is preferred.

상기 다차원 분석 정보는 수치 정보가 포함되어 있는 표 형식으로 상기 사용자 컴퓨터에 전송되며, 상기 사용자 컴퓨터로부터 상기 수치에 대한 선택 정보가 입수받는 단계; 상기 상기 수치 정보에 대응되는 근거 특허 문건 정보를 상기 처리 대상 특허 문건 집합이 포함되는 특허 정보 데이터베이스에서 추출하는 단계; 및 상기 추출된 근거 특허 문건 정보를 상기 사용자 컴퓨터에 전송하는 단계;를 더 포함하는 것이 바람직하다.Wherein the multidimensional analysis information is transmitted to the user computer in a tabular form including numerical information, and the selection information on the numeric value is received from the user computer; Extracting, from the patent information database including the set of patent documents to be processed, the found patent document information corresponding to the numerical information; And transmitting the extracted patent document information to the user computer.

상기 처리 대상 특허 문건 집합을 특정하는 단계 및 상기 기설정된 계산을 수행하여 가공 특허 정보를 생성하는 단계 중 어느 한 단계 이상에서 적어도 하나 이상의 특허 분류 기호를 활용할 수 있으며, 상기 특허 분류 기호를 활용할 때의 상기 특허 분류 기호의 활용은 상기 선택 정보를 입수 받는 단계 및 상기 가공 특허 정보를 생성하는 단계 중 어느 한 단계 이상에서 제1 특허 분류 기호가 주어졌을 때, 상기 제1 특허 분류 기호가 속하는 특허 분류 기호 체계 상에서 상기 제1 특허 분류 기호가 기설정된 레벨에 해당하는 경우, 상기 제1 특허 분류 기호의 하위 특허 분류 기호에 해당하는 특허 정보를 포함하도록 정보 처리하는 것이며, 상기 특허 분류 기호는 국제 특허 분류(IPC), 미국 특허 분류(USPC), 일본의 파일 인덱스(FI), 일본의 F-term(FT), 유럽 특허 분류(ECLA) 중 어느 하나 이상인 것이 바람직하다.At least one or more of the patent classification symbols may be utilized in at least one of the steps of specifying the set of patent documents to be processed and generating the processed patent information by performing the predetermined calculation, Wherein the use of the patent classification symbol is performed when the first patent classification symbol is given at least one of the step of obtaining the selection information and the step of generating the processed patent information, Wherein the information processing unit includes information about the patent information corresponding to the sub-patent classification code of the first patent classification code when the first patent classification code corresponds to a predetermined level on the system, IPC), United States Patent Classification (USPC), Japan File Index (FI), Japan F-term (FT), European Patent (ECLA). &Lt; / RTI >

상기 사용자 컴퓨터로부터 상기 사용자 컴퓨터에 제시된 기설정된 상기 특허 문건에 포함된 서지 사항을 구성하는 정보 중에서 선택되는 어느 하나 이상에 대한 선택 정보를 전송받 받는 방법은 상기 사용자 컴퓨터에 적어도 하나 이상의 출원인을 포함하는 출원인 정보, 적어도 하나 이상의 특허 분류 기호를 포함하는 특허 분류 기호 정보, 적어도 하나 이상의 발명자를 포함하는 발명자 정보, 적어도 하나 이상의 대리인을 포함하는 대리인 정보 중 어느 하나 이상을 상기 사용자 컴퓨터에 제공하는 단계; 상기 사용자 컴퓨터로부터 상기 출원인, 상기 특허 분류 기호, 상기 발명자 정보, 상기 대리인 정보 중 어느 하나 이상에 대한 선택 정보를 입수 받는 단계; 및 상기 선택 정보로 특정되는 특허 문건 집합을 처리 대상 특허 문건 집합으로 처리하여, 기설정된 다차원 분석 결과를 생성하여 상기 사용자 컴퓨터에 제공하는 단계;를 더 포함하는 것이 바람직하다.A method for receiving selection information on at least one selected from information constituting a bibliography included in the predetermined patent document presented on the user computer from the user computer includes receiving at least one applicant from the user computer Providing at least one of applicant information, patent classification information including at least one patent classification symbol, inventor information including at least one inventor, and proxy information including at least one agent, to the user computer; Obtaining selection information on at least one of the applicant, the patent classification code, the inventor information, and the proxy information from the user computer; And processing the set of patent documents identified by the selection information as a set of patent documents to be processed to generate a predetermined multidimensional analysis result and providing the result to the user computer.

상기 시스템은 상기 출원인별, 상기 특허 분류 기호별, 상기 발명자별 또는 상기 대리인별 다차원 분석 결과를 용이하고 신속하게 생성하기 위한 적어도 하나 이상의 중간 다차원 분석 결과 테이블을 생성하는 것이며, 상기 사용자 컴퓨터에게 기설정된 적어도 하나 이상의 출원인별 다차원 분석 목록이나 메뉴, 기설정된 적어도 하나 이상의 특허 분류 기호별 다차원 분석 목록이나 메뉴, 기설정된 적어도 하나 이상의 발명자별 다차원 분석 목록이나 메뉴 및 기설정된 적어도 하나 이상의 대리인별 다차원 분석 목록이나 메뉴 중 어느 하나 이상을 제공하는 단계; 상기 사용자 컴퓨터로부터 상기 다차원 분석 목록이나 메뉴에 대한 선택 정보를 입수하는 단계; 및 상기 선택 정보를 입수 받은 분석 목록이 상기 중간 다차원 분석 결과 테 이블에서 입수할 수 있는 경우에는 상기 중간 다차원 분석 결과 테이블을 참조하여 상기 선택 정보를 입수 받은 분석 결과를 생성하여 상기 사용자 컴퓨터에 제공하는 단계;를 더 포함하는 것이 바람직하다.The system generates at least one intermediate multidimensional analysis result table for easily and quickly generating the multidimensional analysis results for each applicant, the patent classification symbol, the inventor or the agent, A multidimensional analysis list or menu for at least one applicant, at least one multidimensional analysis list or menu for at least one patent classification, a multidimensional analysis list or menu for each at least one inventor, Providing one or more of the menus; Obtaining selection information for the multidimensional analysis list or menu from the user computer; And if the analysis list obtained from the selection information is available from the intermediate multidimensional analysis result table, the analysis result obtaining unit obtains the selection information by referring to the intermediate multidimensional analysis result table and provides the analysis result to the user computer The method further comprising:

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 사용자의 컴퓨터에 특허 정보를 제공하는 시스템이, 제1국에서 공개된 적어도 하나 이상의 특허 문건 정보를 입수 받는 단계; 상기 제1국 출원일과 상기 선택 정보를 입수 받은 날짜가 1년이 넘은 경우, 상기 선택 정보를 입수 받은 특허 문건에 대하여 1) PCT 출원 정보와 2) 제2국에서의 상기 제1국의 특허 문건을 우선권으로 하는 제2국 패밀리 정보, 및 3) 상기 제2국 패밀리 정보에 해당하는 제2국 특허 문건의 현재 상태 정보 중 어느 하나 이상을 조회하는 단계; 상기 특허 문건에 대한 PCT 출원 정보가 없으며, 상기 제2국에서 상기 패밀리 정보가 없는 제1 경우, 상기 PCT 출원 정보가 있더라도 상기 제1국 출원일이 상기 선택 정보를 입수 받은 날짜로부터 기설정된 기관이 경과하고, 제2국 패밀리 정보가 없는 제2경우, 및 상기 제2국 패밀리 정보가 있더라도 상기 제2국 특허 문건의 현재 상태 정보가 취하, 소멸, 거절 결정 확정 및 무효 확정 중 어느 하나 이상이 된 제3의 경우 중 어느 하나에 해당하는지를 판단하는 단계; 및 상기 제1국의 특허 문건이 상기 제1 경우, 상기 제2 경우, 상기 제3 경우 중 어느 하나에 해당하는 경우에는 상기 제1국의 특허 문건 정보가 표시될 때, 상기 제1 경우, 상기 제2 경우, 상기 제3 경우 중 어느 하나에 해당함에 대응되는 기설정된 표시 정보를 상기 제1국의 특허 문건 정보의 기설정된 주변에 표시될 수 있도록 하여 상기 사용자의 컴퓨터에 상기 제1국의 특허 문건 정보와 상기 기설정된 표시 정보를 전송하는 단계;를 더 포함하며, 상기 제2국은 적어도 하나 이상인 것인 것을 특징으로 하는 특허 정보 제공 방법을 제공한다.According to an aspect of the present invention, there is provided a system for providing patent information to a user's computer, the method comprising: receiving at least one patent document information disclosed in a first country; If the first station filing date and the date of receiving the selection information are more than one year, the patent document having received the selection information includes 1) PCT application information and 2) patent document of the first country in the second country And 3) a current state information of a patent document of a second country corresponding to the second station family information; If there is no PCT application information for the patent document, and the second station does not have the family information, even if there is the PCT application information, A second case in which there is no second country family information, and a second case in which the current state information of the second country patent document is withdrawn even if there is the second country family information, 3; < / RTI > And when the patent document of the first country corresponds to any one of the first case, the second case and the third case, when the patent document information of the first country is displayed, The display information corresponding to one of the first case, the second case and the third case can be displayed in a predetermined vicinity of the patent document information of the first country, And transmitting the document information and the predetermined display information, wherein the second station is at least one or more than one.

본 발명의 효과는 다음과 같다.The effects of the present invention are as follows.

특허 분류 기호, 기술 키워드 정보에 대한 융합 정보를 처리함으로써, 특허 분류 기호간, 기술 키워드 간 융합 경향을 파악할 수 있으며, 융합 속성을 본 발명의 특허 분석에 활용할 수 있다.Patent classification symbols and technology keyword information are processed, and the fusion tendency between the patent classification symbols and the technology keywords can be grasped, and the fusion property can be utilized in the patent analysis of the present invention.

특허 문건 집합 단위로 인용 분석을 포함한 각종 분석으로 처리함으로써, 문건 집합의 크기, 종류 등에 독립적으로 특허 정보를 처리할 수 있게 된다. Patent information can be processed independently of the size and type of document sets by processing various analyzes including citation analysis in units of a patent document set.

공동 출원인, 공동 발명자, 함께 나온 공동 선행 인용 특허, 함께 나온 특허 분류 기호, 함께 나온 기술 키워드 등을 분석함으로써, 출원인, 발명자 등의 네트워크 관계 및 인용 특허들 상호 간의 네트워크 및 특허 분류 기호 간의 네트워크와 기술 키워드 간의 네트워크를 파악할 수 있게 되며, 이들 각각에 대한 분석 결과를 생성할 수 있게 된다.By analyzing joint applicants, co-inventors, co-leading patents together, patent classification codes together, and technology keywords together, network and technology between networks and patent classification symbols between applicants and inventors, The network between the keywords can be grasped, and analysis results for each of them can be generated.

전제 정보Premise information

공개되는 특허 문건의 형식 및 포함되어 있는 정보The format of the published patent document and the information it contains

각 국가의 특허청은 일정 조건을 갖춘 특허 출원 또는 특허 등록에 대하여 공개용 공보를 발행하고 있다. 공개되는 특허 정보에는 각 국가마다 차이가 있지만 크게 서지 사항에 관한 정보 및 특허 본문에 관한 정보를 포함하고 있다.The Patent Office of each country issues a public notice about patent application or patent registration with certain conditions. The patent information that is disclosed differs from country to country but includes information about the bibliography and information about the patent text.

서지 사항에 관한 정보로서는 특허 문건 발행국에 관한 정보, 발명의 명칭, 출원인(assignee를 포함한다, 이하 같다)/특허권자(assignee를 포함한다, 이하 같다)에 관한 정보, 발명자에 관한 정보, 특허 분류 기호(IPC, USPC, FI, FT, ECLA 중에서 선택되는 어느 하나 이상, 이하 같다)에 관한 정보, 출원일 등의 각종 날짜에 관한 정보, 출원번호 등의 각종 번호에 관한 정보가 필수적으로 포함되어 있다. 또한, 초록 정보, 대표도가 있을 경우 대표도에 관한 정보, 인용(reference)에 관한 정보(출원인이 개시한 선행 기술에 관한 reference 정보, 심사관이 심사하거나 조사한 문건 또는 특허 분류 기호에 관한 정보 등 중의 어느 하나 이상을 말한다.), 우선권에 관한 정보, 심사관에 관한 정보 등이 선택적으로 존재할 수 있다. 상기 서지 사항의 내용으로 초록(abstract)에 관한 정보, 대표 청구항 또는 특허 청구 범위 중 제1항에 관한 정보가 더 포함되어 있을 수 있다.The information on the bibliography includes information on the patent document issuing country, the name of the invention, information on the applicant (including assignee, same as follows) / patentee (including assignee, hereafter), information on the inventor, Information about various dates (such as IPC, USPC, FI, FT, ECLA, etc.), information on various dates such as the filing date, and application numbers are indispensably included. In addition, if there are abstract information, representative information, information about the representative figure, reference information (reference information about the prior art initiated by the applicant, information examined or examined by the examiner, information about the patent classification symbol, etc.) Information about the priority, information about the examiner, and the like may optionally exist. The contents of the bibliography may further include information relating to the abstract, the representative claim, or the information related to claim 1 of the claims.

본문에는 특허의 기술적 내용에 관한 정보들이 포함되어 있다. 통상적인 특허 문건의 본문에는 발명의 명칭에 관한 정보, 발명의 구성에 관한 정보, 특허 청구 범위에 관한 정보를 포함하고 있으며, 이들 이외에도 발명의 효과, 산업상 이용 가능성, 발명이 이루고자 하는 기술적 과제, 도면에 관한 설명, 종래의 기술 중 어느 하나 이상이 포함되어 있다. 또한, 발명자의 선택 또는 특허 기술의 종류에 따라 필수적 또는 선택적으로 도면에 관한 정보가 더 포함되어 있다. The text contains information on the technical content of the patent. The text of a typical patent document contains information on the name of the invention, information on the composition of the invention, and information on the scope of the patent claim. In addition to these, the effects of the invention, industrial applicability, Description of the drawings, and conventional techniques. In addition, information relating to drawings is essentially or optionally included depending on the inventor's choice or the type of patent technology.

특허 분류 기호 정보 및 특허 분류 기호 체계의 존재Patent Classification Symbol Information and Patent Classification Sign System exists

아울러, 각 특허 문건에는 적어도 하나 이상의 특허 분류 기호 체계 상의 적 어도 하나 이상의 특허 분류 기호가 부여되어 있다. 대한민국, 미국, 일본, 유럽(EPO) 등의 각 국가가 발행하는 문건에는 IPC(International Patent Classification,국제 특허 분류) 기호가 부여되어 있으며, 각 국가별로 자국만의 특별한 분류 체계(미국 특허청의 경우 USPC 또는 UPC, 일본 특허청의 경우 F-In addition, each patent document contains at least one patent classification symbol on at least one of the patent classification schemes. The IPC (International Patent Classification) symbols are assigned to documents issued by countries such as Korea, the United States, Japan, and Europe (EPO). Each country has its own unique classification system (USPC Or UPC, for the Japanese Patent Office F-

Term/FT, FI(File Index), 유럽 특허청의 경우 ECLA 등)를 가지고 있는 경우에는 그 특허 문건의 발행국 만의 특허 분류 기호가 부여되어 있다.Term / FT, FI (File Index), and European Patent Office (ECLA)), the patent classification code of the issuing country of the patent document is given.

특허 분류 기호 체계들은 각 분류 기호에 "타이틀"이라는 해당 특허 분류 기호의 내용을 설명하는 부분이 1:1로 대응되어 있다. 한편, 각 특허 분류 기호 체계 상에서 일정 깊이(depth) 이하의 경우에는 각 특허 분류 기호들의 타이틀 정보에 도트(dot)가 부여되어 있는 경우가 있다. 하기의 예로부터 특허 분류 기호 체계의 계층성, 타이틀 정보의 존재, 타이틀 정보 앞에 분류 체계 상의 상대적인 위치에 관한 정보를 제공하는 도트의 존재를 보여 준다. The patent classification schemes have a 1: 1 correspondence in each classification symbol that describes the content of the corresponding patent classification symbol "title". On the other hand, in a case where the depth is less than a predetermined depth on each patent classification system, dots are given to the title information of each patent classification symbol. From the following example, the presence of dots providing information on the hierarchical nature of the patent classification system, the presence of title information, and the relative position of the classification system in front of the title information is shown.

　　　　섹션　　　　　　　　　　　 : H　　　　　 전기 Section: H Electricity

　　　　클래스　　　　　　　　　 : H01　　 기본적 전기소자 Class: H01 Basic electrical element

　　　　서브클래스　　　　　 : H01F　 자석 Subclass: H01F Magnets

　　　　메인그룹　　　　　　　 : H01F 1/00　 자성재료를 특징으로 하는 자석 또는 자 성체 Main group: H01F 1/00 Magnet or magnetic body characterized by magnetic material

　　　　1-도트 서브그룹 :　　　　　　 1/01 *무기재료로 된 것 1-dot subgroups: 1/01 * Inorganic materials

　　　　2-도트 서브그룹 :　　　　　　　 1/03 **보자력에 의해 특징되는 것 2-dot subgroups: 1/03 ** characterized by coercivity

　　　　3-도트 서브그룹 :　　　　　　　 1/032 ***경질 자성재료의 것 3-dot subgroups: 1/032 *** of hard magnetic material

　　　　4-도트 서브그룹 :　　　　　　·　 1/04 **** 금속 또는 합금 4-dot subgroup: · 1/04 **** Metal or alloy

　　　　5-도트 서브그룹 :　　　　　　　 1/047 *****조성물로 특징되는 합금 5-dot subgroup: 1/047 ***** Alloy characterized by composition

　　　　6-도트 서브그룹 :　　　　　　　 1/053 ******희토류 금속을 포함하는 것6-dot subgroup: 1/053 ****** Containing rare earth metals

상기 IPC를 예로 들어 설명하면, 모든 서브그룹에 해당하는 특허 분류 기호는 "서브클래스" + "숫자/숫자" 구조로 되어 있는데, 이 특허 분류 기호에 대응되는 타이틀에는 도트(dot)가 결합되어 있는 것이 대부분이다.(숫자/00을 제외하고는 대부분 도트가 결합되어 있으며, 통상 숫자/00은 메인그룹에 대응되는 기호이다.) 타이틀에 결합되어 있는 도트의 개수가 적을수록 그 타이틀은 그 주변과 비교할 때 상대적으로 상위 개념의 기술 분류이며, 도트의 개수가 많을수록 그 타이틀은 그 주변과 비교할 때 상대적으로 하위/세부 개념의 기술 분류이다.Taking the IPC as an example, the patent classification symbols corresponding to all the subgroups have a structure of "subclass" + "number / number", and titles corresponding to the patent classification codes are combined with dots (Usually, the numbers / 00 are the symbols corresponding to the main group.) The smaller the number of dots combined in a title, the more the title is related to its surroundings In comparison, the technical classification is a relatively higher concept, and the higher the number of dots, the more the technical classification of the sub / detailed concept is as compared with the surrounding area.

상기의 예와 같이 IPC는 다단계 계층 구조(트리(tree)구조)로 되어 있다. 한편, USPC, FT 및 ECLA 등도 다단계 계층 구조를 가진다. 상기 각 특허 분류 기호의 다단계 계층 구조는 DB화 될 수 있을 것이다.As in the above example, the IPC has a multi-level hierarchical structure (tree structure). On the other hand, USPC, FT and ECLA also have a multi-level hierarchical structure. The multi-level hierarchical structure of each patent classification symbol may be DB.

본 발명의 특허 정보 시스템(1)의 구성 소개Introduction of the configuration of the patent information system (1) of the present invention

먼저 본 발명의 특허 정보 시스템(1)의 구조에 대해서 간략하게 설명한다. First, the structure of the patent information system 1 of the present invention will be briefly described.

도 1 및 도 13을 기준으로 설명한다.1 and Fig. 13.

본 발명의 특허 정보 시스템(1)은 유무선 네트워크(200)를 통하여 사용자 컴퓨터(300)와 연결되어 있다. 이때, 사용자 컴퓨터(300)는 PC, 노트북, 유무선 통신 단말기, 게임기 등의 사람을 전제로 한 컴퓨터와, 본 특허 정보 시스템(1)이 아닌 제3의 서버, 조직 또는 기관의 서버, 제3의 프로그램 모듈 등 상기 특허 정보 시스템(1)에 접속하는 임의의 컴퓨터를 말한다. 상기 특허 정보 시스템(1)이 웹서비스를 할 경우 상기 특허 정보 시스템(1)은 웹서비스 지원 모듈을 탑재하고 있어야 한다.The patent information system 1 of the present invention is connected to a user computer 300 through a wired / wireless network 200. At this time, the user computer 300 may be a computer based on a person such as a PC, a notebook computer, a wired / wireless communication terminal, or a game machine, a server of a third server, organization or institution other than the present patent information system 1, A program module, or the like, connected to the patent information system 1. When the patent information system 1 performs a web service, the patent information system 1 must include a web service support module.

상기 특허 정보 시스템(1)은 특허 데이터 등 각종 데이터와 관계있는 DB부(20), 각종 전처리를 수행하는 전처리 모듈부(30), 특허 정보를 처리 및 분석하는 특허 정보 처리 기본 모듈(40), 회원 관리, 정책 관리 등과 같은 각종 지원을 담당하는 지원 모듈(50)(회원 정보 처리 모듈(501), 다국어 처리 모듈(502), 번역 모듈(503) 등이 탑재되어 있다), 특허 정보의 고급 분석과 직접 관계되는 특허 인텔리전스 모듈(60)(상기 특허 인텔리전스 모듈은 주로 특허 분석을 통하여 고급 특허 정보를 생성하므로, 특허 분석 인텔리전스 모듈 또는 간략히 분석 모듈이라고 명명하기도 한다. 본 명세서에서는 이 양자 용어는 동등한 의미로 사용된다. 이하 같다.) 및 출원인 또는 발명자 또는 대리인 단위로 된 특허 정보 서비스 집합체를 생성하는 계층적으로 통합된 특허 정보 서비스 시스템이 있다.The patent information system 1 includes a DB unit 20 related to various data such as patent data, a preprocessing module 30 for performing various preprocessing, a patent information processing basic module 40 for processing and analyzing patent information, (A member information processing module 501, a multilingual processing module 502, a translation module 503, and the like are mounted) for carrying out various kinds of support such as membership management and policy management, The patent intelligence module 60 is referred to as a patent analysis intelligence module or a brief analysis module since the patent intelligence module generates advanced patent information mainly through patent analysis. In the present specification, the term " The same shall apply hereinafter) and a hierarchically integrated patent information record that generates a set of patent information services in the applicant or inventor or agent unit There are bus systems.

상기 특허 정보 시스템(1)은 크게 6가지 구성 요소를 포함하여 구성 되어 있다. The patent information system 1 includes six components.

첫째 구성 요소는 각종 전처리 모듈로 이들은 입수된 특허 문건에 대하여 각 목적별로의 각종 전처리를 수행하는 모듈들이다. The first component is various preprocessing modules, which are modules that perform various preprocessing for each purpose for patent documents obtained.

둘째 구성 요소는 특허 정보 처리 기본 모듈(40)로 이들에는 1) 검색 기능, 2) 분석 기능, 3) 감시 기능, 4) 분석 기능, 5) 특허 문건 집합 입수 기능, 6) 각종 다단계 디렉토리 생성 기능, 리포팅 기능을 수행하는 모듈을 포함한다. The second component is the patent information processing basic module 40. These functions include 1) search function, 2) analysis function, 3) monitoring function, 4) analysis function, 5) patent document collection function, 6) , And a module for performing a reporting function.

세번째 구성 요소는 특허 분석 인텔리전스 모듈(60)로 이들에는 1) 인용 분석, 비교 분석 등을 포함한 각종 심화된 특허 분석 정보를 생성하는 모듈이다. The third component is the patent analysis intelligence module 60, which is 1) a module for generating various advanced patent analysis information including citation analysis, comparative analysis and the like.

네번째 구성 요소는 특허 정보 서비스 시스템 생성 모듈로 출원인, 발명자, 대리인, 특허 분류 기호별 특허 정보 시스템(1)을 생성하는 모듈이다. The fourth component is a module for generating the patent information system (1) by the applicant, the inventor, the agent, and the patent classification by the patent information service system generation module.

다섯번째 구성 요소는 지원 모듈(50)로 1) 회원 정보 처리, 2) 다국어 처리, 3) 번역 처리, 4) 웹서비스 처리 및 기타 본 발명 사상이 각종 네트워크(200) 환경 에서 서비스 될 수 있도록 하는 지원을 담당하는 구성 요소로 되어 있다. The fifth component is that the support module 50 allows the user to process 1) member information processing, 2) multilingual processing, 3) translation processing, 4) web service processing and other aspects of the invention can be serviced in various network 200 environments It is the component responsible for the support.

여섯번째 구성 요소는 각종 DB로 이들 DB에는The sixth component is the various DBs.

1) 특허 문건 마스트 DB(202), 1) Patent document The mast DB 202,

2) 특허 분류 기호 마스트 DB(203), 2) Patent classification symbol mast DB 203,

3) 주체 마스트 DB(204) 등의 각종 마스트 DB, 3) various mast DBs such as the subject mast DB 204,

4) 분석과 관련된 각종 DB, 4) Various DB related to analysis,

5) 각종 지원 DB(회원 DB(206-2), 메뉴 DB(206-3), 정책 DB(대표 어구 추출 정책 DB(206-1-1), 가중치 부여 정책 DB(206-1-2), 특허 지표 DB(206-1-3), 분석 질의식 DB(206-1-4)) 등), 5) Various support DBs (member DB 206-2, menu DB 206-3, policy DB (representative phrase extraction DB 206-1-1, weighting policy DB 206-1-2, Patent index DB 206-1-3, and analysis quality DB 206-1-4)),

6) 각종 2차 가공 DB(텍스트 마이닝 DB(207-1), 대표 어구 DB(207-2), 6) Various secondary processing DBs (text mining DB 207-1, representative word DB 207-2,

7) 복수 특허 분류 기호 관계 DB(207-3)) 등이 있을 수 있다.7) multiple patent classification symbol relation DB 207-3) and the like.

상기 6가지 구성 요소는 대표적인 요소로 이외에도 본 발명의 명세서에 기재된 다른 구성은 본 발명의 시스템(1)에 포함됨은 당연할 것이며, 당업자에게 전산 시스템을 구성하기 위한 당연한 기능 및 모듈(방화벽, 회원 관리 등등)에 대해서는 별도로 언급하지 않는다.It should be understood that the above-mentioned six components are typical elements, and other configurations described in the present invention are included in the system 1 of the present invention, and those skilled in the art will understand the functions and modules (firewall, Etc.) are not mentioned separately.

상기 DB부(20)에는 DB를 관리하는 DBMS(201), 특허 문건을 통합하여 저장하고 있는 DB인 특허 문건 마스트 DB(202), 특허 분류 기호에 대한 정보를 저장하고 있는 특허 분류 기호 마스트 DB(203), 출원인, 발명자, 기업 등에 관한 정보를 저장하고 있는 주체 마스트 DB(204), 다차원 분석 결과를 저장하고 있는 다차원 분석용 연산 수행 결과 테이블 DB(205), 회원 및 각종 정책이나 옵션에 대한 정보를 저 장하고 있는 지원 DB부(206) 및/또는 각종 데이터를 가공한 결과물을 저장하고 있는 2차 가공 DB부(207) 등이 있다.The DB unit 20 is provided with a DBMS 201 for managing DBs, a patent document mast DB 202 which is a DB storing integrated patent documents, a patent classification code mast DB 203), a subject mast DB 204 storing information on applicants, inventors, companies, etc., a multidimensional analysis calculation execution result table DB 205 storing multidimensional analysis results, information on members and various policies and options A secondary DB part 207 storing the result of processing various kinds of data, and the like.

상기 메뉴 DB에는 각 국가 또는 각 언어별로 각 메뉴에 대한 표기법이 있으며, 본 발명의 특허 정보 시스(1)은 접속하는 사용자의 브라우저에 있는 로케일 정보를 읽어 그 로케일 정보에 대응되는 각 국가별/언어별 메뉴를 상기 메뉴 DB에서 추출하여 상기 추출된 언어로 된 메뉴로 화면을 생성하여 사용자에게 전송한다.In the menu DB, there is a notation for each menu for each country or each language. The patent information system 1 of the present invention reads the locale information in the browser of the connecting user and stores the locale information in each country / language Extracts a star menu from the menu DB, creates a screen with the menu of the extracted language, and transmits the screen to the user.

전처리 모듈부(30)The pre-

본 발명의 각종 전처리 모듈에 대해서 설명한다. 상기 전처리 모듈브(30)에는 1) 원천 특허 문건 처리 모듈(301-1), 2) 카운팅 전처리 모듈(3100), 3) 특허 분류 기호별 통계 전처리 모듈(3200), 4) 가중치 전처리 모듈(3300), 5) 인용 정보 전처리 모듈(3400), 6) 특허 분류 기호 전처리 모듈(301-3-1 또는 3500), 7) 출원인 대표명화 전처리 모듈(301-4-1-1 또는 3600), 8) 대표 어구 추출 전처리 모듈(3700), 9) 패밀리 정보 전처리 모듈(3800), 10) 복수 특허 분류 기호 관계성 전처리 모듈(3900), 11) 권리자 변동 정보 전처리 모듈(302), 12) 행정 처리 정보 입수 모듈(303) 중 어느 하나 이상이 더 포함되어 있을 수 있다.Various preprocessing modules of the present invention will be described. 2) a counting preprocessing module 3100; 3) a statistical preprocessing module 3200 according to a patent classification symbol; 4) a weight preprocessing module 3300 5) citation information preprocessing module 3400, 6) patent classification symbol preprocessing module 301-3-1 or 3500, 7) applicant representative name preprocessing module 301-4-1-1 or 3600, 8) A representative phrase extraction preprocessing module 3700, 9) a family information preprocessing module 3800, 10) a plurality of patent classification symbol relation preprocessing module 3900, 11) a holder change information preprocessing module 302, 12) Module 303 may be further included.

원천 특허 문건 처리 모듈(301-1)The original patent document processing module (301-1)

본 발명의 원천 특허 문건 처리 모듈(301-1)에 대해 설명한다. 상기 원천 특허 문건 처리 모듈(301-1)은 입수된 각 국가별(국제 출원된 특허 문건은 WIPO에서 공개한다. WIPO는 국가가 아니나, 기재의 편의상 국가별에 포함시켜서 설명한다.) 입수 특허 문건 데이터(XML, SGML 또는 기타 형식)을 처리한다. The original patent document processing module 301-1 of the present invention will be described. The original patent document processing module 301-1 is disclosed in each country (the internationally filed patent document is disclosed by WIPO). WIPO is not a country, but is included in each country for convenience of description. Process data (XML, SGML or other formats).

상기 원천 특허 문건 처리 모듈(301-1)은 상기 입수 특허 문건 데이터를 입수하여 기설정된 처리를 수행한 다음, 상기 입수 특허 문건 데이터를 본 발명의 특허 정보 시스템(1)에서 처리 가능한 형식에 부합하는 특허 문건 데이터를 생성한다 The original patent document processing module 301-1 obtains the acquired patent document data, performs predetermined processing, and then matches the acquired patent document data to a format that can be processed by the patent information system 1 of the present invention Generate patent document data

상기 기설정된 처리에는 1) 오류 필터링, 2) 패턴화된 오류 수정, 3) 수동 오류 수정, 4) SGML 문건 데이터를 XML 문건 데이터로의 변환, 5) 표준 형식 변환 등이 있을 수 있다. The predetermined processing may include 1) error filtering, 2) patterned error correction, 3) manual error correction, 4) conversion of SGML document data into XML document data, and 5) standard format conversion.

상기 원천 특허 문건 처리 모듈(301-1)은 입수되는 상기 입수 특허 문건 데이터를 기설정된 적어도 하나 이상의 오류 검증 필터(301-1-1)를 통과시켜 상기 입수 특허 문건 데이터가 본 발명의 특허 정보 시스템(1)에서 처리 가능한 형식에 부합하는지를 검증한다. The original patent document processing module 301-1 passes the acquired patent document data to at least one or more error verification filters 301-1-1, (1). &Lt; / RTI >

특허 문건 마스트 DB 생성 모듈(301-2)Patent document mast DB generation module (301-2)

본 발명의 특허 문건 마스트 DB 생성 모듈(301-2)은 상기 원천 특허 문건 처리 모듈(301-1)이 생성한 특허 문건 데이터 및 본 발명의 각종 전처리 모듈이 처리한 정보를 기초로 하여 특허 문건 마스트 DB(202)를 생성한다. 본 발명의 특허 문건 마스트 DB 생성 모듈(301-2)이 포함하고 있는 세부 모듈을 나열하면 다음과 같다. 1) 원천 서지사항 DB(202-1-1), 가공 서지사항 DB(202-1-2), 번역 서지사항 DB(202-1-3), 대표 도면 DB(202-1-4)를 생성하는 서지 사항 DB 생성 모듈(301-2-1)이 있다. 2)특허 문건 DB(202-2-1), 번역 특허 문건 DB(202-2-2), 가공 특허 문건 DB(202-2-3)을 생성하는 문건 DB 생성 모듈(301-2-2)이 있다. 3) 미국 assignee 변 동 DB(202-3-1), 권리 설정 변동 DB(202-3-2)를 생성하는 권리자 변동 DB 생성 모듈(301-2-3)이 있다. 4) 국가별 특허 상태 DB(202-4-1), 국가별 패밀리 DB(202-4-2)를 생성하는 패밀리 정보 DB 생성 모듈(301-2-4)이 있다. 5) 출원인 인용 정보 DB(202-5-1), 심사 인용 정보 DB(202-5-2)를 생성하는 인용 정보 DB 생성 모듈(301-2-5) 및 6) 행정 처리 정보 DB(202-6)를 생성하는 행정 처리 정보 DB 생성 모듈(301-2-6)이 있다.The patent document mast DB generation module 301-2 of the present invention generates the patent document mast 301-2 based on the patent document data generated by the original patent document processing module 301-1 and the information processed by the various preprocessing modules of the present invention DB 202 is generated. The detailed modules included in the patent document mast DB generation module 301-2 of the present invention are listed as follows. 1) generate a source bibliographic item DB 202-1-1, a bibliographic item DB 202-1-2, a translation bibliographic item DB 202-1-3, and a representative drawing DB 202-1-4 And a bibliography DB generation module 301-2-1. 2) a document DB creation module 301-2-2 for creating a patent document DB 202-2-1, a translation patent document DB 202-2-2, and a processed patent document DB 202-2-3; . 3) a US assignee change DB 202-3-1, and a rights holder change DB creation module 301-2-3 that generates a rights setup change DB 202-3-2. 4) a family information DB generation module 301-2-4 for generating a country-specific patent status DB 202-4-1 and a country-specific family DB 202-4-2. 5) a citation information DB generation module 301-2-5 for generating the applicant citation information DB 202-5-1 and the examination citation information DB 202-5-2, and 6) the citation information DB 202- 6) for generating the processing result information.

상기 특허 문건 마스트 DB(202)에는 서지 사항 DB, 문건 DB, 권리 변동 정보 DB, 패밀리 정보 DB, 인용 정보 DB, 행정 처리 정보 DB 등이 있다. The patent document master DB 202 includes a bibliographic database, a document database, a rights change information database, a family information DB, a citation information DB, and an administrative processing information DB.

상기 서지 사항 DB에는 특허 문건 데이터에서 직접 입수할 수 있는 원천 서지 사항 정보, 특허 문건 데이터 자체 또는 전체 특허 문건 데이터 집합을 대상으로 가공한 가공 서지 사항 정보, 각종 언어별로 상기 원천 서지 사항 및 가공 서지 사항 중 번역의 대상이 되어 번역된 언어별 번역 서지 사항 정보 및 기타 정보를 포함하고 있다. 상기 서지 사항 DB를 생성하는 것은 본 발명의 서지 사항 DB 생성 모듈이 담당한다.The bibliographic database includes bibliographic information directly obtained from the patent document data, processed bibliographic information processed for the patent document data itself or the whole patent document data set, the bibliographical references and bibliographic references And includes translation bibliographic information and other information translated by the translator. The bibliography DB generation module of the present invention is responsible for generating the bibliographic database.

상기 문건 DB에는 특허 문건 데이터 자체에 관한 특허 문건 데이터, 상기 특허 문건 데이터의 일부 또는 전부를 각종 언어로 번역한 언어별 번역 문건 데이터, 상기 특허 문건 데이터 또는 상기 언어별 번역 특허 문건 데이터를 pdf, doc, tiff, html 등과 같은 각종 파일 형식으로 변환한 가공 특허 문건 데이터를 포함하고 있다. 상기 각 특허 문건 데이터, 언어별 번역 특허 문건 데이터, 가공 특허 문건 데이터는 각각 특허 문건 DB, 번역 특허 문건 DB, 가공 특허 문건 DB의 형태로 존재할 수 있다. 상기 문건 DB를 생성하는 것은 본 발명의 문건 DB 생성 모듈이 담당한다.The document DB includes patent document data on the patent document data itself, translational document data for each language in which part or all of the patent document data is translated into various languages, data on the patent document data or translated patent document data on the language into pdf, doc , tiff, html, and so on. The patent document data, the translated patent document data and the processed patent document data by language may exist in the form of a patent document DB, a translation patent document DB, and a processed patent document DB, respectively. The document DB creation module of the present invention is responsible for creating the document DB.

상기 권리자 변동 정보 DB에는 미국 특허 문건을 대상으로 한 assignee 변동 정보 DB를 포함하고 있으며, 각국 특허청이 관리하는 각종 특허권자 및 특허출원인의 명의 정보의 변경 사항에 관한 정보 및 각종 실시권자의 설정 등록과 말소 등록에 관한 정보를 포함할 수 있다. 상기 권리자 변동 정보 DB를 생성하는 것은 본 발명의 권리자 변동 DB 생성 모듈이 담당한다.The rights holder variation information DB includes an assignee variation information database for US patent documents. Information on changes in the names of various patent owners and patent applicants managed by each country's patent office, registration and erasure registration of various licensees As shown in FIG. The owner variation DB generation module of the present invention generates the owner variation information DB.

상기 패밀리 정보 DB에는 개별 특허 문건과 관련된 각 국가별 패밀리 특허 문건에 관한 정보를 포함하고 있는 국가별 패밀리 데이터 및 각 패밀리 특허 문건의 각 국가에서의 상태(국가별 출원의 현 상태, 권리 취득 여부 등) 정보를 포함하고 있는 국가별 특허 상태 데이터를 포함할 수 있다. 상기 패밀리 정보 DB를 생성하는 것은 본 발명의 패밀리 정보 DB 생성 모듈이 담당한다.The family information DB includes the family data for each country and the status of each family patent document in each country (including the current state of the application by country, the right to obtain the rights, etc.) ) Information about the patent status of each country. The generation of the family information DB is performed by the family information DB generation module of the present invention.

상기 인용 정보 DB에는 특허 출원인이 인용한 인용 정보에 기초하여 생성되는 출원인 인용 정보 DB, 특정한 특허 출원에 대한 심사관의 심사 과정에서 인용된(주로 심사 대상 특허 출원의 신규성 및 진보성을 부정하기 위하여 인용되는 문건) 타 특허 문건심사 인용에 관한 정보에 기초하여 생성되는 심사 인용 정보 DB가 있다. 상기 인용 정보 DB를 생성하는 것은 본 발명의 인용 정보 DB 생성 모듈이 담당한다.The citation information DB includes an applicant's citation information DB generated based on the citation information cited by the patent applicant, a citation information DB cited in the examination process of the examiner for the specific patent application (mainly referred to as " Document) There is an examination citation information DB which is generated based on information on examination of other patent documents. The citation information DB generation module of the present invention is responsible for generating the citation information DB.

상기 행정 처리 정보 DB에는 각 특허 문건에 대하여 각국 특허청이 발생시키는 행정 처리 정보에 기초하여 생성되는 데이터를 포함하고 있다. 상기 행정 처리 정보 DB를 생성하는 것은 본 발명의 행정 처리 정보 DB 생성 모듈이 담당한다.The administrative processing information DB includes data generated based on the administrative processing information generated by each country's patent office for each patent document. The administrative processing information DB generation module of the present invention is responsible for generating the administrative processing information DB.

이어, 상기 서지 사항 DB 생성 모듈은 상기 특허 문건 데이터 및/또는 전체 특허 문건 데이터를 대상으로 각종 전처리 모듈 등이 가공한 정보를 입수하여 가공 서지 사항 데이터를 생성할 수 있으며, 상기 가공 서지 사항 데이터를 포함하고 있는 DB를 가공 서지 사항 DB라 한다. 상기 가공 서지 사항에는 1) 각종 카운팅 정보, 2) 각종 계산 및 평가 정보 등이 있을 수 있다. Then, the bibliographic database generation module can obtain the processed bibliographic item data by obtaining information processed by various preprocessing modules, etc., on the patent document data and / or the entire patent document data, The included DB is called the bibliographic DB. The processing bibliography may include 1) various counting information, and 2) various calculation and evaluation information.

각종 카운팅 정보를 예를 든다면, 1)출원인 수 및/또는 특허권자 수, 2) 발명자 수, 3) 출원 또는 등록 단계 등과 같은 각 단계별 청구항 수, 4) 명세서 페이지 수, 5) 도면 수, 6) 특허 분류 기호의 종류 수, 7) 각 종류의 특허 분류 기호의 수, 8) 레퍼런스(reference)수(=인용수)/피인용 수, 9) 심사 인용 수/피인용 수, 10) 심사관이 조사한 특허 분류 기호의 수, 11) 우선권 주장수, 12) 자국 포함 각 국가별 패밀리 특허 개수, 13) 전체 패밀리 개수, 14) 독립항 수, 15) 종속항 수, 16) 레퍼런스 중에서 국가 단위의 특허 수, 17) 레퍼런스 중에서 전체 특허 문건 수, 18) 레퍼런스 중에서 비특허 문건 수 등이 있을 수 있다. 상기와 같은 카운팅 정보를 처리하는 것은 본 발명의 카운팅 전처리 모듈이 담당한다. 상세한 것은 본 발명의 카운팅 전처리 모듈에서 설명한다.4) number of pages of specification pages, 5) number of drawings, 6) number of documents, 6) number of documents, 6) number of documents, 8) Number of references (= number of citations) / number of citations, 9) Number of examination citation / number of citations, 10) Number of patent classification codes The number of patent classification marks, 11) the number of priority claims, 12) the number of family patents in each country including its own country, 13) the total number of families, 14) 17) the total number of patent documents in the reference, and 18) the number of non-patent documents in the reference. The counting preprocessing module of the present invention processes such counting information. Details will be described in the counting preprocessing module of the present invention.

각종 계산 및 평가 정보를 예로 든다면, 1) 그 특허 문건에 포함된 출원인의 그 특허 문건에 포함된 특허 분류 기호(그 특허 분류 기호 자체, 그 특허 분류 기호의 하위 특허 분류 기호 전체, 또는 직상위/차상위 특허 분류 기호 또는 그 특허 분류 기호의 메인그룹까지의 모든 단계의 특허 분류 기호를 대상으로 할 수도 있다.)에 대한 총출원수, 총등록수, 점유율, 집중율, 활동율(AI)등과 같은 각종 특허 지표를 계산한 값, 2) 그 특허 문건에 포함된 발명자의 그 특허 문건에 포함된 특허 분류 기호에 대한 각종 특허 지표를 계산한 값, 3) 각종 분석 지표 등이 있을 수 있으며, 상세한 것은 본 발명의 각종 전처리 모듈 및 분석 모듈에서 설명한다.In the case of various calculation and evaluation information, for example, (1) the patent classification mark included in the patent document of the applicant included in the patent document (the patent classification mark itself, the entire subpopulation classification symbol of the patent classification mark, Such as the total number of applications, the total number of registrations, the percentage of shares, the concentration rate, the activity rate (AI), etc. for the patent classification code of the next higher level or the patent classification code of all stages up to the main group of the patent classification code) 2) a value obtained by calculating various patent indices of the patent classification mark included in the patent document of the inventor included in the patent document, and 3) various analysis indexes. For details, Various preprocessing modules and analysis modules according to the present invention will be described.

특허 분류 기호 마스트 DB 생성 모듈(301-3)Patent classification code mast DB generation module 301-3

본 발명의 상기 전처리 모듈에는 특허 분류 기호 마스트 DB 생성 모듈(301-3)을 포함하고 있는데 상기 특허 분류 기호 마스트 DB 생성 모듈(301-3)이 생성하는 DB는 다음과 같다. The preprocessing module of the present invention includes a patent classification code mast DB generation module 301-3. The DB generated by the patent classification code mast DB generation module 301-3 is as follows.

1) 각국의 특허청 또는 특허 원천 데이터 소스들로부터 입수되는 각종 특허 분류 기호의 raw data를 저장하고 있는 원천 특허 분류 기호 DB(203-1)이다.1) The original patent classification code database 203-1 storing raw data of various patent classification codes obtained from patent offices or patent source data sources of respective countries.

2) 각종 특허 분류 기호를 본 발명의 특허 정보 시스템(1)의 용도에 맞게 변형한 특허 분류 기호를 저장하고 있는 변형 특허 분류 기호 DB(203-2)가 있으며, 이는 변형 특허 분류 기호 생성 모듈(301-3-3)이 생성한다.2) a modified patent classification symbol DB 203-2 storing a patent classification code in which various patent classification codes are modified in accordance with the use of the patent information system 1 of the present invention, 301-3-3).

3) 임의의 특허 분류 기호에 대한 모든 상위 특허 분류 기호를 모아서 저장하고 있는 총상위 특허 분류 기호 집합 DB(203-3)가 있으며, 이는 총상위 특허 분류 기호 집합 생성 모듈(301-3-1-1)이 생성한다.3) a total upper patent classification symbol set DB 203-3 that collects and stores all upper patent classification symbols for an arbitrary patent classification symbol, which is a total upper patent classification symbol set generation module 301-3-1- 1).

4) 임의의 특허 분류 기호에 대한 직하위 또는 모든 하위 특허 분류 기호에 대한 정보를 저장하고 있는 하위 특허 분류 기호 집합 DB(203-4)가 있으며, 이는 특허 분류 기호 집합 생성 모듈(301-3-1-2)이 생성한다. 상기 하위 특허 분류 기호 집합 생성 모듈(301-3-1-2)에는 직하위 특허 분류 기호 집합만을 생성하는 직하위 특허 분류 기호 집합 생성 모듈(301-3-1-2-1)와 모든 하위 특허 분류 기호 집합을 생성하는 총하위 특허 분류 기호 집합 생성 모듈(301-3-1-2-2)이 있다. 4) a sub-patent classification symbol set DB 203-4 storing information on a sub-patent classification symbol or any sub-patent classification symbol for any patent classification symbol, and this patent classification symbol set generation module 301-3- 1-2). The sub-patent classification symbol set generation module 301-3-1-2 includes a sub-patent classification symbol set generation module 301-3-1-2-1 for generating only a sub-subset of patent classification codes, And a total sub-patent classification symbol set generation module 301-3-1-2-2 for generating a classification symbol set.

5) 특허 분류 기호 체계를 트리 구조로 저장하고 있는 특허 분류 기호 트리 테이블 DB(203-5) 있으며, 이는 특허 분류 기호 트리 테이블 생성 모듈(미도시)이 생성한다.5) Patent classification symbol tree table DB 203-5 storing the patent classification symbol system as a tree structure, which is generated by a patent classification symbol tree table creation module (not shown).

6) 임의의 특허 분류 기호에 대한 총상위 특허 분류 기호를 레벨별로 저장하고 있는 총상위 특허 분류 기호 테이블 DB(203-6)가 있으며, 이는 총상위 특허 분류 기호 집합 생성 모듈(301-3-1-1)이 생성한다. 한편, 특허 분류 기호에서 갱신이 있을 경우, 이는 본 발명의 특허 분류 갱신 모듈(301-3-5)이 처리하며, 갱신된 특허 분류 기호에 대하여 관련된 기설정된 모듈이 기설정된 처리를 수행한다.6) There is a total upper patent classification symbol table DB (203-6) storing the total upper patent classification symbols for any patent classification symbol by levels, which is the total upper patent classification symbol set generation module 301-3-1 -1). On the other hand, when there is an update in the patent classification symbol, it is processed by the patent classification update module 301-3-5 of the present invention, and the predetermined module related to the updated patent classification symbol performs predetermined processing.

이하, 특허 분류 기호에 대해서 통합적으로 설명한다. Hereinafter, patent classification symbols will be described collectively.

특허 분류 기호의 다단계성Multistage of patent classification codes

다단계 계층(트리, tree) 구조로 되어 있는 각 특허 분류 기호는 특허 분류 기호 vs. 그 특허 분류 기호의 타이틀 정보의 대응 관계를 갖고 있다. 본 특허 명세서의 곳곳에서 이러한 예를 다수 제시되어 있다. 계층 구조 상에서 주어진 특허 분류 기호의 하위 특허 분류 기호 모두에 대한 정보를 입수하기 위해서는 검색 엔진(검색 모듈(401)과 검색 엔진은 동의어로서 본 발명에서 사용된다.) 또는 DBMS(201)에 확장자/와일드카드 등을 사용할 수 있다. 예를 들어, H01F의 하위 특허 분류에 관한 모든 정보를 입수하려면, 검색어 또는 질의어에 H01F에 확장자/와일드카드(예를 들어?)를 결합하여 질의하면 H01F의 모든 하위 특허 분류 기호를 입 수할 수 있다. 마찬가지로 H01F1/00의 하위 특허 분류에 관한 모든 정보를 입수하려면 "H01F1?"을 입력하면 된다. 하지만, 만약 H01F1/01의 하위 특허 분류에 관한 모든 정보를 입수하기 위해서는 "H01F1/01?"를 입력할 수 없다. 그 이유는 IPC 등의 특허 분류 기호의 타이틀 정보에 도트(dot)가 포함되는 특허 분류 기호들은 동일한 표시 패턴(예를 들어 IPC의 경우 메인그룹 표시+ / + 숫자, USPC의 경우 클래스번호+숫자, FT의 경우에는 숫자1자+영문자1자+3자리숫자+영문자2자+숫자 등)을 가지면서 도트의 수로서 상대적인 부모 자식(parent child) 관계/상위 하위 관계를 표시하기 때문이다. 즉, 타이틀 정보에 도트가 포함되어 있지 않은 경우에는 특허 분류 기호의 표기 체계 만으로도 부모 자식 관계/상위 하위 관계를 구별할 수 있지만, 그렇지 않은 경우에는 특허 분류 기호의 표기 체계 만으로는 구별할 수 없기 때문이다.Each patent classification symbol in a multi-level hierarchical (tree) structure has a patent classification symbol. And the corresponding relationship of the title information of the patent classification symbol. A number of such examples have been presented elsewhere in this patent specification. (Search module 401 and search engine are used as synonyms in the present invention) or DBMS 201 to obtain information on all of the sub-patent classification codes of a given patent classification symbol on the hierarchical structure, Card or the like can be used. For example, to obtain all the information about sub-patent classification of H01F, all sub-patent classification codes of H01F can be obtained by querying the query or query by combining the extension / wildcard (eg?) With H01F . Similarly, "H01F1?" Can be input to obtain all information about the sub-patent classification of H01F1 / 00. However, it is not possible to enter "H01F1 / 01?" In order to obtain all information on the sub-patent classification of H01F1 / 01. The reason is that the patent classification symbols including the dot in the title information of the patent classification symbol such as IPC have the same display pattern (for example, in the case of IPC, the main group indication + / + number, in case of USPC, The number of dots, and the relative parent-child relationship / upper-level relationship are displayed while having the number of dots in the FT case. In other words, if the title information does not include a dot, it is possible to distinguish the parent-child relationship / parent-child relationship only by the notation system of the patent classification symbol, but otherwise it can not be distinguished only by the notation system of the patent classification symbol .

한편, 특허 분류 기호가 계층 구조를 가진다는 속성 상, 하위 특허 분류 기호에 해당하는 특허 문건은 상위 특허 분류 기호에도 해당되는 것이 마땅하다. 즉, 특정 특허 문건에 H01F1/04라는 특허 분류 기호가 할당되었다면, 이 특허 문건은 H01F1/032에도 해당되고, H01F1/03에도 해당되며, H01F1/01에도 해당되며, 당연히 H01F1/00에도 해당되게 된다. 이를 역으로 해석하면 H01F1/03과 관련되는 특허 정보에는 H01F1/03뿐만 아니라, 특허 분류 기호의 계층 구조 체계 상 H01F1/03의 하위에 있는 모든 특허 분류 기호에 관련된 특허 정보가 포함되어 있어야 한다. 즉, H01F1/03과 관련되는 특허 정보에는 당연히 H01F1/032, H01F1/04뿐만 아니라 특허 분류 기호 체계 상 H01F1/03의 하위에 있는 모든 특허 분류 기호에 관련된 특허 정 보가 포함되어야 한다.On the other hand, in the property that the patent classification symbol has a hierarchical structure, it is desirable that the patent document corresponding to the sub-patent classification code corresponds to the upper patent classification code. That is, if a patent classification code H01F1 / 04 is assigned to a specific patent document, this patent document also applies to H01F1 / 032, corresponds to H01F1 / 03, corresponds to H01F1 / 01, and also corresponds to H01F1 / 00 . In reverse, the patent information pertaining to H01F1 / 03 shall include not only H01F1 / 03, but also patent information related to all patent classification codes under H01F1 / 03 in the hierarchical structure of the patent classification symbol. In other words, patent information pertaining to H01F1 / 03 should include not only H01F1 / 032 and H01F1 / 04, but also patent information related to all patent classification codes under H01F1 / 03 in the patent classification system.

도 14에서는 이와 같은 본 발명의 발명 사상이 적용되지 않은 경우의 한 예를 보여주고 있다. 도 14에서 알 수 있듯이 A61B 3/00에 대응되는 문건보다 그 하위 분류인 A61B 3/02에 대응되는 문건이 많이 나오고 있다.FIG. 14 shows an example in which the inventive idea of the present invention is not applied. As can be seen from Fig. 14, there are many documents corresponding to the sub-category A61B 3/02 than the documents corresponding to A61B 3/00.

하위 특허 분류 기호의 처리 방법 3가지3 ways of processing sub-patent classification

상기와 같은 이유로, 타이틀 정보에 도트가 포함되어 있는 특허 분류 기호에 대해서는 그 특허 분류 기호의 모든 하위 특허 분류 기호에 관한 정보를 포함하도록 처리할 필요가 있으며, 이와 같은 처리를 담당하는 것은 본 발명의 특허 분류 기호 마스트 DB 생성 모듈(301-3)에 포함되어 있는 특허 분류 기호 전처리 모듈(301-3-1 또는 3500)이다.For the above reasons, it is necessary to process information on all the sub-patent classification symbols of the patent classification symbol in which the dot is included in the title information. The patent classification symbol preprocessing module 301-3-1 or 3500 included in the patent classification code mast DB generation module 301-3.

주어진 특허 분류 기호에 대한 정보 추출의 모든 과정에는 그 하위 특허 분류 기호에 관한 특허 정보를 포함시킬 필요가 있으며, 그러한 정보 추출 과정의 대표적인 경우로서 1) 검색, 2) 통계, 3) 분석, 4) 감시, 5) 디렉토리 표시 등을 들 수 있다. 확장자(와일드카드)를 사용하지 못하는 상황에서 주어진 특허 분류 기호에 대하여 모든 하위 특허 분류 기호에 관한 정보를 포함시키기 위해서는 본질적으로 다음과 같은 과정 중 어느 하나의 처리가 필요하게 된다.It is necessary to include patent information on the sub-patent classification in all the processes of information extraction for a given patent classification. 1) Search, 2) Statistics, 3) Analysis, 4) Monitoring, and 5) directory display. In the case where the extension (wildcard) can not be used, in order to include information on all the sub-patent classification symbols for a given patent classification, essentially one of the following processes is required.

첫째, 주어진 특허 분류 기호에 대하여 특허 분류 기호 체계(트리 구조)를 깊이 우선 탐색 등의 방식으로 탐색하여 모든 하위 특허 분류 기호에 관한 정보를 입수한다. 이러한 정보 입수 과정은 질의/검색마다 매번 일어날 수 있다. 한편, 주어진 특허 분류 기호 체계(트리 구조)에 대하여 모든 특허 분류 기호 또는 타이틀 정보에 도트를 포함하고 있는 특허 분류 기호에 대한 모든 하위 특허 분류 기호를 입수하여 저장해 놓고, 질의/검색 시 저장된 모든 하위 특허 분류 기호에 관한 정보를 입수하는 방식을 취할 수도 있다. 트리 구조로 된 데이터가 주어졌을 때, 이 트리 구조를 구성하는 각 노드(node)에 대하여 깊이 우선 탐색 등의 방식으로 각 노드의 직하위 노드 및/또는 모든 하위 노드를 찾아 내는 것은 전산학의 기초에 해당하는 것이므로, 이에 대한 설명은 생략한다. 또한, 입수한 직하위 노드 및/또는 모든 하위 노드의 정보를 특정 노드에 대응시켜 저장하는 것 또한 기초적인 것이므로, 이에 대한 설명도 생략한다.First, the patent classification system (tree structure) is searched for a given patent classification symbol in a manner such as a depth first search to obtain information about all sub-patent classification symbols. This information retrieval process can occur every time a query / retrieval occurs. On the other hand, all the sub-patent classification symbols for the patent classification symbols including dots in all patent classification symbols or title information are obtained and stored for a given patent classification scheme (tree structure), and all sub-patents stored in the query / It may be possible to obtain information on the classification symbol. Given the data in a tree structure, it is essential for each node constituting this tree structure to find a node directly below the node and / or all its descendants in a manner such as depth-first search, Therefore, a description thereof will be omitted. In addition, storing the information of the lower node and / or all of the lower nodes obtained in correspondence with the specific node is also a basic one, so that the explanation thereof is also omitted.

둘째, 모든 특허 분류 기호에 대하여 자신의 상위에 해당하는 특허 분류 기호에 관한 정보를 특허 분류 기호 체계(트리 구조)를 참조하여 단계별로 저장해 놓은 다음 주어진 특허 분류 기호가 있을 때, 이 특허 분류 기호가 최초로 나타나는 특정 단계를 찾아 내고, 그 특정 단계에 그 특허 분류 기호가 표기되어 있는 모든 특허 분류 기호에 관한 정보를 입수하는 방식을 취할 수도 있다.Second, for all patent classification codes, information about the patent classification code corresponding to the upper part of the patent classification code is stored in a stepwise manner with reference to the patent classification system (tree structure). Then, when there is a given patent classification code, It is possible to find a specific step that appears first, and to obtain information about all patent classification symbols for which the patent classification mark is indicated at the specific step.

셋째, 주어진 특허 분류 기호 체계(트리 구조)를 그 체계를 유지하면서 특허 분류 기호의 표기만을 변형시켜, 변형 특허 분류 기호를 생성하고, 변형 특허 분류 기호에 대하여 위 첫째와 둘째 방법을 실시할 수 있다. 한편, 특허 분류 기호 체계를 참조하여 변형하는 방식을 적절히 선택하면 범위(range) 검색/범위(range) 질의에 응답할 수 있도록 특허 분류 기호 표기를 변형할 수 있게 된다.Third, a given patent classification scheme (tree structure) can be modified to generate only the representation of the patent classification symbol, generating the modified patent classification symbol, and perform the first and second methods for the modified patent classification symbol . On the other hand, by appropriately selecting a method of transforming referring to the patent classification system, it is possible to modify the patent classification notation so as to respond to a range search / range query.

H04B 7/00를 통한 예시적 설명Illustrative explanation through H04B 7/00

본 발명에서는 IPC 7판 기준 메인그룹 H04B 7/00(타이틀은 무선전송시스템이 다.) 및 이의 하부에 해당하는 특허 분류 기호를 실시예로 들어 상기 특허 분류 기호들 간의 트리 구조를 설명하나, 이러한 방법 및 설명은 본 실시예에 한정되지 않고, 전 IPC 영역에 걸쳐 동일 또는 대등하게 적용됨은 물론이며, 타이틀에 도트 구조로서 계층의 상위와 하위를 표현하는 방식으로 포함하는 다른 특허 분류 기호 체계(예를 들면, USPC, FI, FT 또는 ECLA) 등에 대해서도 동일 또는 대등한 방식으로 적용된다.In the present invention, the tree structure between the patent classification symbols is described by taking the patent classification code corresponding to IPC 7th edition reference main group H04B 7/00 (title is a wireless transmission system) and its lower part as an embodiment, The method and description are not limited to the present embodiment, but may be applied to the same IPC area or the entire IPC domain, and may be applied to other patent classification schemes (for example, For example, USPC, FI, FT or ECLA) are applied in the same or equivalent manner.

메인그룹 H04B 7/00 및 이의 하부에 해당하는 특허 분류 기호 및 그 타이틀은 IPC 7판 기준으로 다음과 같은 구성을 가진다.The patent classification codes and titles corresponding to the main group H04B 7/00 and its lower part shall have the following composition on the basis of IPC 7 edition.

H04B 7/00 무선전송시스템, 즉, 방사전자계를 사용한 것H04B 7/00 Radio transmission systems, ie using radiated electromagnetic fields

H04B 7/005 .전송의 제어 ; 등화Control of transmission H04B 7/005. Equalization

H04B 7/01 .위상편이의 저감H04B 7/01 Reduction of phase shift

H04B 7/015 .반향효과의 저감H04B 7/015. Reduction of echo effect

H04B 7/02 .다이버시티(diversity)시스템Diversity systems H04B 7/02.

H04B 7/04 .. 떨어져 배치된 복수의 독립공중선을 사용한 것H04B 7/04 .. using a plurality of discrete aerials spaced apart

H04B 7/06 ...송신국에 있어서의 것H04B 7/06 ... in the sending station

H04B 7/08 ...수신국에 있어서의 것H04B 7/08 ... in recipient station

H04B 7/10 .. 편파 또는 방향특성에 의해서 특징지어지는 단일공중선시스 템을 사용한 것H04B 7/10 .. using a single aerial system characterized by polarization or directional characteristics

H04B 7/12 ..주파수다이버시티 시스템H04B 7/12 .. Frequency Diversity System

H04B 7/14 .릴레이(relay)시스템H04B 7/14 Relay system

H04B 7/145 ..수동 릴레이 시스템H04B 7/145 .. Manual relay system

H04B 7/15 ..능동 릴레이 시스템H04B 7/15 .. Active relay system

H04B 7/155 ...기지국(ground based stations)H04B 7/155 ... ground-based stations

H04B 7/165 ....각도변조를 사용하는 것Using angle modulation H04B 7/165 ....

H04B 7/17 ....펄스변조를 사용하는 것H04B 7/17 .... using pulse modulation

H04B 7/185 ...우주국 또는 항공기탑재국(space based or airborne stations)H04B 7/185 ... space-based or airborne stations

H04B 7/19 ....지구동기국(earth synchronization stations)H04B 7/19 .... earth synchronization stations

H04B 7/195 ....비동기국H04B 7/195 .... Asynchronous station

H04B 7/204 ...다중악세스(multiple access )H04B 7/204 ... multiple access

H04B 7/208 ....주파수분할 다중악세스H04B 7/208 .... frequency division multiple access

H04B 7/212 ....시분할 다중악세스H04B 7/212 .... Time division multiple access

H04B 7/216 ....코드분할 또는 스프레드-스펙트럼 다중 억세스H04B 7/216 .... Code division or spread-spectrum multiple access

H04B 7/22 .산란파 전파시스템H04B 7/22. Spawner wave propagation system

H04B 7/24 .둘 또는 그 이상의 지점간의 통신을 위한 것For communication between two or more points H04B 7/24.

H04B 7/26 .. 적어도 하나의 지점이 이동될 수 있는 것H04B 7/26 .. that at least one point can be moved

상기를 도트를 기준으로 트리 계층 구조를 더욱 쉽게 할 수 있도록 하면 다 음 표 1과 같은 표현될 수 있다.The tree hierarchy structure can be expressed as shown in Table 1 below.

[표 1][Table 1]

메인그룹Main group 1-도트서브그룹1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup 4-도트서브그룹4-dot subgroup 타이틀title H04B 7/00H04B 7/00 무선전송시스템Wireless transmission system H04B 7/005H04B 7/005 .전송의 제어 ; 등화Control of transmission; Equalization H04B 7/01H04B 7/01 .위상편이의 저감Reduction of phase shift H04B 7/015H04B 7/015 .반향효과의 저감Reduction of echo effect H04B 7/02H04B 7/02 .다이버시티(diversity)시스템. Diversity system H04B 7/04H04B 7/04 .. 떨어져 배치된 복수의 독립공중선을 사용한 것Using multiple independent antennae spaced apart H04B 7/06H04B 7/06 ...송신국에 있어서의 것... in the sending station H04B 7/08H04B 7/08 ...수신국에 있어서의 것... in the receiving station H04B 7/10H04B 7/10 .. 편파 또는 방향특성에 의해서 특징지어지는 단일공중선시스템을 사용한 것Using a single aerial system characterized by polarization or directional characteristics H04B 7/12H04B 7/12 ..주파수다이버시티 시스템.. Frequency Diversity System H04B 7/14H04B 7/14 .릴레이(relay)시스템Relay system H04B 7/145H04B 7/145 ..수동 릴레이 시스템.. Manual Relay System H04B 7/15H04B 7/15 ..능동 릴레이 시스템.. Active Relay System H04B 7/155H04B 7/155 ...기지국(ground based stations)... ground-based stations H04B 7/165H04B 7/165 ....각도변조를 사용하는 것.... using angle modulation H04B 7/17H04B 7/17 ....펄스변조를 사용하는 것.... using pulse modulation H04B 7/185H04B 7/185 ...우주국 또는 항공기탑재국... space station or aircraft carrier H04B 7/19H04B 7/19 ....지구동기국.... Earth Motive Bureau H04B 7/195H04B 7/195 ....비동기국.... asynchronous station H04B 7/204H04B 7/204 ...다중악세스... multiple access H04B 7/208H04B 7/208 ....주파수분할 다중악세스.... frequency division multiple access H04B 7/212H04B 7/212 ....시분할 다중악세스.... Time Division Multiple Access H04B 7/216H04B 7/216 코드분할 또는 스프레드-스펙트럼 다중 억세스Code division or spread-spectrum multiple access H04B 7/22H04B 7/22 .산란파 전파시스템.Shelter Wave Propagation System H04B 7/24H04B 7/24 .둘 또는 그 이상의 지점간의 통신을 위한 것For communication between two or more points H04B 7/26H04B 7/26 .. 적어도 하나의 지점이 이동될 수 있는 것... that at least one point can be moved

이때, 특허 분류 기호 체계(트리 구조)를 참조하여 자신의 상위에 있는 모든 특허 분류 기호를 입수하면 아래 표 2와 같은 총상위 특허 분류 기호 테이블을 생성할 수 있다. 하기 표 2와 같은 총상위 특허 분류 기호 테이블은 IPC 뿐만 아니라, USPC, FT, ECLA, FI 등에 대해서도 생성할 수 있음은 물론이다 하겠다. 한편, 하기 표 2와 같은 총상위 특허 분류 기호 테이블은 상기 특허 분류 기호들이 갱신될 때, 갱신된 특허 분류 기호 정보를 입수하여 상기 테이블도 갱신되는 것이 바람직하다. 하기 표 2와 같은 총상위 특허 분류 기호 테이블은 본 발명의 총상위 특허 분류 기호 테이블 생성 모듈이 특허 분류 기호 마스트 DB로부터 특허 분류 기호를 입수 받아 생성한다. 상기 총상위 특허 분류 기호 테이블 생성 모듈은 특허 분류 기호에 관한 정보가 갱신될 때마다 상기 총상위 특허 분류 기호 테이블을 생성하는 것이 바람직하다. 상기 총상위 특허 분류 기호 테이블 생성 모듈은 본 발명의 변형 특허 분류 기호 DB에 대해서도 동일한 형태의 총상위 특허 분류 기호 테이블을 생성해 놓을 수 있을 것이다.At this time, referring to the patent classification system (tree structure), if all the patent classification codes at the top of the patent classification system are obtained, a total top patent classification table as shown in Table 2 below can be generated. It will be appreciated that the table of total top patent classification symbols as shown in Table 2 below can be generated not only for IPC but also for USPC, FT, ECLA, FI and the like. On the other hand, when the patent classification symbols are updated, it is preferable that the upper general patent classification symbol table as shown in Table 2 below is updated and the table is updated as well. The total top patent classification symbol table as shown in Table 2 below is obtained by obtaining the patent classification symbol from the patent classification symbol mast DB of the total top patent classification symbol table generation module of the present invention. The total upper patent classification symbol table generation module preferably generates the total upper patent classification symbol table every time the information on the patent classification symbol is updated. The total upper patent classification symbol table generation module may generate a total upper patent classification symbol table of the same type for the modified patent classification symbol DB of the present invention.

하기 표 2에서 각 필드인, IPC_ID는 특정 IPC의 ID를, S는 섹션 레벨, C는 클래스 레벨, SC는 서브클래스, MG는 메인그룹, 1dot는 타이틀 정보에 1개의 도트가 있는 레벨을, 2dot는 타이틀 정보에 2개의 도트가 있는 레벨을, 3dot는 타이틀 정보에 4개의 도트가 있는 레벨을, (그러므로, n dot는 타이틀 정보에 n개의 도트가 있는 레벨을 나타낸다), self는 IPC 자기 자신을 나타낸다.In the following Table 2, IPC_ID indicates a specific IPC ID, S indicates a section level, C indicates a class level, SC indicates a subclass, MG indicates a main group, 1 dot indicates a level having one dot in the title information, (Therefore, n dot indicates a level having n dots in the title information), self indicates a level having two dots in the title information, and 3dot indicates a level having four dots in the title information. .

[표 2][Table 2]

IPC_IDIPC_ID SS CC SCSC MGMG 1dot1dot 2dot2dot 3dot3dot 4dot4dot SelfSelf 6996469964 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/00H04B7 / 00 6996569965 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/005H04B7 / 005 H04B7/005H04B7 / 005 6996669966 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/01H04B7 / 01 H04B7/01H04B7 / 01 6996769967 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/015H04B7 / 015 H04B7/015H04B7 / 015 6996869968 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/02H04B7 / 02 H04B7/02H04B7 / 02 6996969969 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/02H04B7 / 02 H04B7/04H04B7 / 04 H04B7/04H04B7 / 04 6997069970 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/02H04B7 / 02 H04B7/04H04B7 / 04 H04B7/06H04B7 / 06 H04B7/06H04B7 / 06 6997169971 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/02H04B7 / 02 H04B7/04H04B7 / 04 H04B7/08H04B7 / 08 H04B7/08H04B7 / 08 6997269972 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/02H04B7 / 02 H04B7/10H04B7 / 10 H04B7/10H04B7 / 10 6997369973 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/02H04B7 / 02 H04B7/12H04B7 / 12 H04B7/12H04B7 / 12 6997469974 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/14H04B7 / 14 6997569975 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/145H04B7 / 145 H04B7/145H04B7 / 145 6997669976 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/15H04B7 / 15 6997769977 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/155H04B7 / 155 H04B7/155H04B7 / 155 6997869978 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/155H04B7 / 155 H04B7/165H04B7 / 165 H04B7/165H04B7 / 165 6997969979 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/155H04B7 / 155 H04B7/17H04B7 / 17 H04B7/17H04B7 / 17 6998069980 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/185H04B7 / 185 H04B7/185H04B7 / 185 6998169981 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/185H04B7 / 185 H04B7/19H04B7 / 19 H04B7/19H04B7 / 19 6998269982 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/185H04B7 / 185 H04B7/195H04B7 / 195 H04B7/195H04B7 / 195 6998369983 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/204H04B7 / 204 H04B7/204H04B7 / 204 6998469984 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/204H04B7 / 204 H04B7/208H04B7 / 208 H04B7/208H04B7 / 208 6998569985 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/204H04B7 / 204 H04B7/212H04B7 / 212 H04B7/212H04B7 / 212 6998669986 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/14H04B7 / 14 H04B7/15H04B7 / 15 H04B7/204H04B7 / 204 H04B7/216H04B7 / 216 H04B7/216H04B7 / 216 6998769987 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/22H04B7 / 22 H04B7/22H04B7 / 22 6998869988 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/24H04B7 / 24 H04B7/24H04B7 / 24 6998969989 HH H04H04 H04BH04B H04B7/00H04B7 / 00 H04B7/24H04B7 / 24 H04B7/26H04B7 / 26 H04B7/26H04B7 / 26

즉, H04B7/06이라는 IPC 특허 분류 기호가 주어졌을 때, IPC 특허 분류 기호 체계를 참조하여 자신의 모든 부모 노드(트리 구조 상의 분기점 등을 노드(node)라 한다)를 찾으면, H, H04, H04B, H04B7/00, H04B7/02, H04B7/04, H04B7/06, H04B7/06이 되고, 이를 상기 필드 구조에 맞추어 정리하면, IPC_ID69970에 나와 있는 것과 같은 데이터를 생성할 수 있다.In other words, given the IPC patent classification code H04B7 / 06, if all parent nodes (branch points in the tree structure are referred to as nodes) are found by referring to the IPC patent classification system, H, H04, H04B , H04B7 / 00, H04B7 / 02, H04B7 / 04, H04B7 / 06, and H04B7 / 06, and the data as shown in IPC_ID69970 can be generated according to the field structure.

상기 표 2와 같은 총상위 특허 분류 기호 테이블이 주어졌을 때, 주어진 특허 분류 기호의 모든 상위 특허 분류 기호를 입수하는 것은 1) self 필드에서 주어진 특허 분류 기호를 찾고, 찾은 특허 분류 기호의 행을 따라 상위 단계로 올라가면서 각 단계별로 특허 분류 기호를 획득하면 될 것이다. 상기와 같은 주어진 특허 분류 기호의 총상위 특허 분류 기호를 모아서 DB화 시켜 놓은 것이 본 발명의 총상위 특허 분류 기호 집합 DB이며, 상기 총상위 특허 분류 기호 테이블을 활용하여 특정한 특허 분류 기호의 모든 하위 특허 분류 기호를 추출하여 DB화 시켜 놓은 것이 본 발명의 하위 특허 분류 기호 집합 DB이다. 이때, 상기 총상위 특허 분류 기호 테이블을 활용하여 특정한 특허 분류 기호의 직하위 특허 분류 기호만을 추출할 수 있는데, 이러한 정보를 DB와 시켜 놓은 것이 본 발명의 직하위 특허 분류 기호 집합 DB이다.Given the total top patent classification table as shown in Table 2 above, obtaining all the top patent classification symbols of a given patent classification symbol is accomplished by 1) searching for a given patent classification symbol in the self field, You will get the patent classification symbol for each step as you move up. The total top patent classification symbol sets of the given patent classification codes are collected and DBed together to form a database of the total top patent classification codes set of the present invention. The classification symbol is extracted and converted into a DB, which is a sub-patent classification symbol set DB of the present invention. At this time, it is possible to extract only the sub-classifications of a specific patent classification symbol using the above-mentioned upper patent classification symbol table. This information is DB, which is a subset of the patent classification set of the present invention.

상기 표 2와 같은 총상위 특허 분류 기호 테이블이 있을 때, 특정한 특허 분류 기호가 주어졌을 때, 그 특허 분류 기호 및 그 하위 모든 특허 분류 기호를 입수하는 방법에 대해서 H04B7/15를 예로써 설명한다. 1 단계로 먼저 Self에서 H04B7/15를 찾으면, IPC_ID69976에 있음을 알게 된다. 2단계로, 어느 레벨의 필드 에서 자기 자신과 동일한 특허 분류 기호가 대응되어 있는지를 찾으면 2 dot 레벨 필드임을 알게 된다. 3단계로, 2 dot 레벨 필드에서 H04B7/15가 있는 특허 분류 기호를 찾으면, 각각 H04B7/15, H04B7/155, H04B7/165, H04B7/17, H04B7/185, H04B7/19, H04B7/195, H04B7/204, H04B7/208, H04B7/212, H04B7/216이 된다. 이들이 H04B7/15 자신과 H04B7/15의 모든 하위 특허 분류 기호가 된다. 찾은 결과에서 자기 자신을 제외하면 자신의 모든 하위 특허 분류 기호를 자기 자신과 대응시킬 수 있음은 물론이다 하겠다. 이때, 특정한 특허 분류 기호에 대하여 그 특허 분류 기호 및 모든 하위 특허 분류 기호에 관한 정보로 하위 특허 분류 기호 집합 DB를 생성할 수 있다. 이때, 특정한 특허 분류 기호에 대하여 그 특허 분류 기호의 모든 하위 특허 분류 기호를 저장하면 총하위 특허 분류 기호 집합 DB를 생성할 수 있다. 그리고, 자기 자신을 포함한 모든 하위 특허 분류의 개수(여기서는 11개, 자기 자신을 제외하면 10개)를 카운팅할 수도 있을 것이다. 이러한 카운팅은 본 발명의 카운팅 전처리 모듈이 수행하는 것이 바람직하다.When there is a total top patent classification symbol table as shown in Table 2 above, and given a specific patent classification symbol, a description will be given by way of example of H04B7 / 15 as a method of obtaining the patent classification symbol and all the lower patent classification symbols. If you find H04B7 / 15 in Self first, you will find IPC_ID69976. In step 2, if it is found at which level field that the same patent classification code is associated with itself, it is found that it is a 2-dot level field. In step 3, if you find a patent classification code with H04B7 / 15 in the 2-dot level field, you will find the patent classification code H04B7 / 15, H04B7 / 155, H04B7 / 165, H04B7 / 17, H04B7 / 185, H04B7 / 19, H04B7 / 195, H04B7 / / 204, H04B7 / 208, H04B7 / 212, and H04B7 / 216. These are all sub-patent classification codes of H04B7 / 15 itself and H04B7 / 15. It is of course possible to match all sub-patent classification codes with oneself by excluding oneself from the found result. At this time, the sub-patent classification symbol set DB can be generated from the information about the patent classification symbol and all the sub-patent classification symbols for a specific patent classification symbol. At this time, if all the sub-patent classification symbols of the patent classification symbol are stored for a specific patent classification symbol, a total sub-patent classification symbol set DB can be generated. It may also count the number of all sub-patent categories (including 11 in this case and 10, excluding oneself), including oneself. Such counting is preferably performed by the counting preprocessing module of the present invention.

한편, 여기서 자신의 직하위 특허 분류 기호만을 추출하는 방법을 생각해 보면, 상기 3단계를 수행할 때 3 dot 레벨 필드값이 null이 아니며(직하위에 값이 있음), 4 dot 레벨 필드값이 null인(직하위의 하위값은 없음) 특허 분류 기호를 찾으면, 각각 H04B7/155, H04B7/185, H04B7/204를 얻게 된다. 이때, 특정한 특허 분류 기호에 대하여 그 특허 분류 기호의 직 하위 특허 분류 기호를 저장하면 직하위 특허 분류 기호 집합 DB를 생성할 수 있다.If the 3-dot level field value is not null (there is a value directly below) and the 4-dot level field value is null when the 3-step level value is null (No lower value directly below) If you find the patent classification, you will get H04B7 / 155, H04B7 / 185, H04B7 / 204 respectively. At this time, if a patent classification code is stored for a specific patent classification code, the partial patent classification code set DB can be generated.

이때, 자신의 직하위 특허 분류 기호의 개수(여기서는 3)을 카운팅해 놓을 수 있을 것이며, 이러한 카운팅은 본 발명의 카운팅 전처리 모듈이 수행하는 것이 바람직할 것이다. 이와 같은 직하위 특허 분류 기호가 필요한 경우에는 단계별 펼침의 필요성 때문이다. 단계별 펼침이란 IPC 디렉토리 구조를 표시할 때, 각 레벨(섹션, 클래스, 서브클래스, 메인그룹, 1 도트 서브그룹, 2 도트 서브그룹, 3 도트 서브그룹, 4 도트 서브그룹, ... n 도트 서브그룹 등으로 단계별로 펼치는 것을 말하며, 그 단계에 있는 특허 분류 기호만을 표시해 준다. 한편, 이 경우 표시는 단계별 펼침이 되더라도 그 각 단계에 대한 값(검색 결과, 통계값, 계산값, 분석값 등의 모든 값)은 그 특정한 특허 분류 기호 및 그 특정한 특허 분류 기호의 모든 하위 특허 분류 기호를 모두 고려한 값으로 하는 것이 바람직할 것이다. 예를 들면, H04B7/15의 직하위를 펼쳤을 때, H04B7/155, H04B7/185, H04B7/204이 그 다음 단계로 나오더라도, H04B7/155에 대한 값은 H04B7/155 및 H04B7/155의 하위인 H04B 7/165와 H04B 7/165을 모두 포함하여 고려한 값이 H04B 7/155의 값으로 표시되어야 한다는 것을 의미한다. 물론, 특수한 경우에는(사용자의 선택 등) 그 특정한 특허 분류 기호만에 대한 값을 표시해 줄 수 있을 것이다.At this time, it is possible to count the number (3 in this case) of the own sub-classification codes, and this counting is preferably performed by the counting preprocessing module of the present invention. This is because of the necessity of step-by-step unfolding when the subclause is required. Step-by-step unfolding means that each level (section, class, subclass, main group, one dot subgroup, two dot subgroup, three dot subgroup, four dot subgroup, ... n dot sub (Such as search results, statistical values, calculated values, analytical values, etc.) for each step even if the display is stepwise unfolded. All values) should be taken into account for all of the specific patent classification and all sub-patent classification codes of that particular patent classification. For example, when expanding under H04B7 / 15, H04B7 / 155, The values for H04B7 / 155 and H04B7 / 155, including both H04B 7/165 and H04B 7/165, which are lower than H04B7 / 155 and H04B7 / 155, Should be displayed as a value of / 155 It means, of course, in special cases will be able to display the values for only that particular patent classification symbols (such as the user's choice).

상기와 같은 테이블 구조(상기 3단계로 자신의 하위 특허 분류 기호를 찾는데 사용되는 테이블을 말한)는 후술하는 검색, SQL 명령어로 필요한 정보를 입수할 때, 및 통계 분석 처리 등에서 활용될 수 있으며, 특히 분석 등의 편의를 위해 사전에 각 특허 분류 기호에 대하여 기설정된 규칙에 따르는 값을 계산해 놓을 경우(뷰(View), 물리적인 뷰(Materialized View, 다차원 분석용 연산 수행 결과 테이블), 큐브(Cube) 등)에 활용될 수 있다. 이러한 계산값을 계산할 때 롤업 연 산(roll up, 자신에 대한 값에 자신의 하부에 있는 대상에 대한 값들을 포함시켜 자신의 값으로 하는 것을 포함하는 개념이다.)이 활용되며, 이러한 롤업 연산의 경우 상기 테이블이 활용될 수 있다. 롤업 연산이란 예를 들면 연도값에는 각 분기별 값을 더하여 연도값을 생성할 수 있으며, 각 분기별 값은 각 분기를 이루는 월들의 값을 더하여 분기별 값을 생성할 수 있는 방식을 말한다. 특정 특허 분류 기호에 대한 값(예를 들면 연도별 출원수)을 계산할 때, 자기 자신에 대한 값과 자신의 하위 특허 분류 기호에 대한 값을 합하여 그 특허 분류 기호에 대한 값을 생성할 수 있다. 상세한 것은 본 발명의 분석에서 상술한다.The above table structure (referred to as the table used for finding the sub-patent classification symbol in the above three steps) can be utilized in obtaining information necessary for a search, an SQL command, and statistical analysis processing described later. (For example, a view, a materialized view, and a cube) are calculated for each patent classification symbol in advance in order to facilitate the analysis, Etc.). When calculating these calculations, roll up (roll up) is a concept that includes the value of itself as its value by including the values for its underlying objects. The above table can be utilized. For example, a roll-up operation can generate a year value by adding a quarter value to a year value, and a quarter value can be generated by adding a month value of each branch. When calculating a value for a particular patent classification (for example, the number of applications per year), a value for the patent classification can be generated by adding a value for itself and a value for its sub-patent classification. Details will be described in detail in the analysis of the present invention.

상기와 같은 방법으로 모든 IPC 코드/IPC 특허 분류 기호에 대해서 동일하게 처리하면, 모든 IPC 특허 분류 기호를 대상으로 하는 상기와 같은 형식의 테이블을 생성할 수 있고, 이러한 테이블 및 상기에서 언급한 각 언어별 타이틀 정보를 포함하여 원천 특허 분류 기호 DB 중 IPC에 해당하는 원천 IPC 특허 분류 기호 DB를 생성할 수 있다. 상기와 같은 방식으로 USPC, FT, FI, ECLA 등도 마찬가지의 테이블을 생성할 수 있게 된다.If all the IPC codes / IPC patent classification codes are processed in the same manner as above, a table having the above-described format targeting all the IPC patent classification codes can be generated, and these tables and the respective languages It is possible to generate the source IPC patent classification symbol DB corresponding to the IPC among the original patent classification symbol DB including the star title information. In the same manner, USPC, FT, FI, and ECLA can also generate the same table.

H04B7/00이하에 있는 특허 분류 기호를 대상으로 하여, 특허 분류 기호 간의 child-parent 관계가 나타난 정보 구조의 구현예를 제시한다.H04B7 / 00 The following is an example of an information structure that shows the child-parent relationship between patent classification symbols.

먼저, child 특허 분류 기호에 그 분류 기호가 속한 직상위 분류 기호인 parent(child -> parent)를 대응시키는 방법에 따른 결과이다. 그 예가 하기 표 3에 있다.First, it is a result of a method of mapping a child patent classification symbol to parent (child -> parent), which is a parent classification code, to which the classification symbol belongs. Examples are shown in Table 3 below.

[표 3][Table 3]

Child(자신)Child (self) Parent(자신의 parent)Parent (your parent) H04B 7/005H04B 7/005 7/007/00 H04B 7/01H04B 7/01 7/007/00 H04B 7/015H04B 7/015 7/007/00 H04B 7/02H04B 7/02 7/007/00 H04B 7/04H04B 7/04 H04B 7/02H04B 7/02 H04B 7/06H04B 7/06 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 H04B 7/04H04B 7/04 H04B 7/10H04B 7/10 H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 H04B 7/02H04B 7/02 H04B 7/14H04B 7/14 7/007/00 H04B 7/145H04B 7/145 H04B 7/14H04B 7/14 H04B 7/15H04B 7/15 H04B 7/14H04B 7/14 H04B 7/155H04B 7/155 H04B 7/15H04B 7/15 H04B 7/165H04B 7/165 H04B 7/155H04B 7/155 H04B 7/17H04B 7/17 H04B 7/155H04B 7/155 H04B 7/185H04B 7/185 H04B 7/15H04B 7/15 H04B 7/19H04B 7/19 H04B 7/185H04B 7/185 H04B 7/195H04B 7/195 H04B 7/185H04B 7/185 H04B 7/204H04B 7/204 H04B 7/15H04B 7/15 H04B 7/208H04B 7/208 H04B 7/204H04B 7/204 H04B 7/212H04B 7/212 H04B 7/204H04B 7/204 H04B 7/216H04B 7/216 H04B 7/204H04B 7/204 H04B 7/22H04B 7/22 7/007/00 H04B 7/24H04B 7/24 7/007/00 H04B 7/26H04B 7/26 H04B 7/24H04B 7/24

이어서, 자기 자신의 직하위에 대한 모든 child 정보를 가지는 방법(parent -> child)에 대한 결과의 일실시예가 표 4에 있다.An example of the result for the method (parent - > child) having all child information on its own subordinate is shown in Table 4.

[표 4][Table 4]

Parent(자신)Parent (self) Child(자신의 직하위 child)Child (own sub child) H04B 7/00H04B 7/00 H04B 7/005
H04B 7/01
H04B 7/015
H04B 7/02
H04B 7/14
H04B 7/22
H04B 7/24H04B 7/005
H04B 7/01
H04B 7/015
H04B 7/02
H04B 7/14
H04B 7/22
H04B 7/24 H04B 7/005H04B 7/005 nullnull H04B 7/01H04B 7/01 nullnull H04B 7/015H04B 7/015 nullnull H04B 7/02H04B 7/02 H04B 7/04
H04B 7/10
H04B 7/12H04B 7/04
H04B 7/10
H04B 7/12 H04B 7/04H04B 7/04 H04B 7/06
H04B 7/08H04B 7/06
H04B 7/08 H04B 7/06H04B 7/06 nullnull H04B 7/08H04B 7/08 nullnull H04B 7/10H04B 7/10 nullnull H04B 7/12H04B 7/12 nullnull H04B 7/14H04B 7/14 H04B 7/145
H04B 7/15H04B 7/145
H04B 7/15 H04B 7/145H04B 7/145 nullnull H04B 7/15H04B 7/15 H04B 7/155
H04B 7/185
H04B 7/204H04B 7/155
H04B 7/185
H04B 7/204 H04B 7/155H04B 7/155 H04B 7/165
H04B 7/17H04B 7/165
H04B 7/17 H04B 7/165H04B 7/165 nullnull H04B 7/17H04B 7/17 nullnull H04B 7/185H04B 7/185 H04B 7/19
H04B 7/195H04B 7/19
H04B 7/195 H04B 7/19H04B 7/19 nullnull H04B 7/195H04B 7/195 nullnull H04B 7/204H04B 7/204 H04B 7/208
H04B 7/212
H04B 7/216H04B 7/208
H04B 7/212
H04B 7/216 H04B 7/208H04B 7/208 nullnull H04B 7/212H04B 7/212 nullnull H04B 7/216H04B 7/216 nullnull H04B 7/22H04B 7/22 nullnull H04B 7/24H04B 7/24 H04B 7/26H04B 7/26 H04B 7/26H04B 7/26 nullnull

디렉토리 생성 모듈(405)Directory creation module 405,

본 발명의 디렉토리 생성 모듈(405)은 특허 분류 기호별 직하위 특허 분류 기호를 제공하고, 상기 특허 분류 기호 및 상기 직하위 특허 분류 기호에 대한 카운팅 값을 상기 특허 분류 기호 및 상기 직하위 특허 분류 기호 옆에 표시해 줄 수 있다. 상기 카운팅 값에는 1) 모든 하위 분류의 개수, 2) 직하위 특허 분류의 개수, 3) 분류의 깊이, 4) 모든 하위 분류에 대응되는 특허 문건의 수, 5) 직하위 특허 분류 기호에 대응되는 특허 문건의 수 중에서 선택되는 어느 하나 이상이 될 수가 있는데, 가급적 1), 2), 5)가 함께 표시 있는 것이 바람직하다. 상기 디렉토리 생성 모듈(405)은 상기 1) ~5) 중 어느 하나 이상을 본 발명의 i) 카운팅 전처리 모듈이 수행해 놓은 카운팅 값에서 읽어 오거나, ii) 후술하는 총량 분석용 다차원 분석용 연산 수행 결과 테이블 iii) 특허 분류 기호 트리 테이블, 또는 iv) 총상위 특허 분류 기호 테이블 중 어느 하나 이상에서 읽어와서, 상기 1) ~5)를표시해 줄 수 있다.The directory generation module 405 of the present invention provides a subclass patent classification code for each patent classification symbol and sets the count value for the patent classification symbol and the subclass patent classification code to the patent classification symbol You can mark it next to it. The counting values include 1) the number of all subclasses, 2) the number of subclasses, 3) the depth of classification, 4) the number of patent documents corresponding to all subclasses, 5) And the number of patent documents. It is preferable that 1), 2), and 5) are displayed together as much as possible. The directory generation module 405 reads one or more of the above 1) through 5) from the count value of the counting preprocessing module of the present invention, or ii) iii) the patent classification tree table, or iv) the total upper patent classification symbol table, to display the above 1) to 5).

상기 특허 분류 기호 정보를 디렉토리 방식으로 배치할 때, 각 디렉토리에 색깔이나 아이콘 등의 기호 표시를 하여, 사용자의 접근성을 향상시킬 수 있다. 즉, 직하위 분류가 많은 경우, 모든 하위 분류가 많은 경우, 깊이가 깊은 경우, 해당 특허 문건이 많은 경우 등에 따라 다른 색깔을 부여하거나 기설정된 아이콘 등을 부여함으로써, 사용자가 그 디렉토리의 내부 속성(모든 하위 분류의 개수, 분류의 깊이, 특허 문건의 수 등)을 더 잘 알 수 있게 도울 수 있다. 특히, 특정 특허 분류 기호 디렉토리의 모든 하위 분류의 개수가 많거나 분류의 깊이가 많은 경우는 그 영역으로의 특허 출원이 많고, 기술이 다양하게 출원되고 있음을 방증하기 때문 에, 그렇지 못한 디렉토리보다 중요성이 더 클 경우가 많기 때문이다.When the above-described patent classification symbol information is arranged in a directory format, symbols and symbols such as a color or an icon are displayed in each directory, thereby improving the accessibility of the user. That is, by giving a different color or giving a predetermined icon or the like according to the case where there are many subclasses, a lot of subclasses, a deep depth, a lot of patent documents, etc., The number of subclasses, the depth of classification, the number of patent documents, etc.). In particular, if the number of all subclasses of a particular patent classification directory is large or the depth of classification is large, it indicates that there are many patent applications in the area, and that the applications are variously applied. Therefore, Is often larger.

도 15는 본 발명의 디렉토리 생성 모듈(405)의 일 구현예를 보여주고 있다. 상기 도 15에서 알 수 있듯이, 디렉토리의 기설정된 단계부터는 적어도 하나 이상의 특허 분류 기호에 체크 박스를 할 수 있고, 이 체크 박스로 선택된 특허 분류 기호에 대하여, 국가 단위로 검색을 하거나, 특허 분류 기호 기준으로 한 특허 분석을 수행할 수 있음을 알 수 있다.FIG. 15 shows an embodiment of the directory creation module 405 of the present invention. As can be seen from FIG. 15, at least one patent classification symbol can be checked-boxed from a predetermined stage of the directory, and a patent classification symbol selected by the check box can be searched for by country, The patent analysis can be carried out.

본 발명의 디렉토리 생성 모듈(405)은 적어도 하나 이상의 특허 분류 기호의 종류별로 다단계 특허 분류 기호 체계에 대응되는 특허 분류 기호에 대한 디렉토리를 생성하는 특허 분류 기호 활용 디렉토리 생성 모듈(405-1)과 본 발명의 명세서에서 소개하는 각종 분석 주제별 또는 기타 확정가능한 주제별 다단계 디렉토리를 생성하는 주제별 디렉토리 생성 모듈(405-2)이 있다. 상기 주제별 디렉토리 생성 모듈의 작동예는 도 ***에서 보여 주고 있다. 상기 디렉토리는 사용자 또는 상기 시스템(10)의 관리자가, 특정 주제에 맞게 다단계 계층 구조를 가지는 디렉토리를 생성하고, 상기 디렉토리에 검색식이나 질의식, 키워드, 특허 분류 기호 등과 같은 요소를 대응시켜 놓을 수 있고, 상기 시스템(10)은 상기 대응된 정보를 이용하여, 검색, 감시를 수행할 수 있고, 상기 검색 또는 감시된 결과에 대하여 본 명세서 상의 다양한 분석을 수행할 수 있다.The directory creation module 405 of the present invention includes a patent classification symbol utilization directory creation module 405-1 for creating a directory for a patent classification symbol corresponding to a multi-stage patent classification symbol system for each type of at least one patent classification symbol, There is a subject directory creation module 405-2 for creating a multi-level directory according to various analysis topics or other definable subject matter introduced in the specification of the present invention. An operation example of the above-mentioned directory creation module is shown in FIG. The directory may be created by a user or an administrator of the system 10 having a hierarchical hierarchical structure in accordance with a specific subject and may be associated with elements such as a search formula, a keyword, a patent classification symbol, , And the system 10 can perform searching and monitoring using the corresponding information, and can perform various analyzes herein on the retrieved or monitored results.

하위 계층 특허 분류 기호 처리 모듈Lower layer patent classification processing module

하위 계층 특허 분류 기호 처리 모듈은, 사용자가 입력하는 검색식 등에서 입수 받은 특허 분류 기호에 대하여, 상기 계층적 특허 분류 기호 데이터베이스를 통하여 상기 입수 받은 특허 분류 기호에 종속되는 하위 계층 특허 분류 기호를 찾아 내는 역할을 수행한다.The lower hierarchical patent classification symbol processing module finds a lower hierarchical patent classification symbol subordinate to the acquired patent classification symbol through the hierarchical patent classification symbol database with respect to the patent classification symbol received from a search formula inputted by the user Role.

입수 받은 특허 분류 기호에 대해 하위 계층 특허 분류 기호에 해당하는 특허 문건이 검색 결과에 포함되게 하기 위해서는 다음과 같은 3가지 중 어느 하나 이상이 필요하며, 이러한 일을 상기 하위 계층 특허 분류 기호 처리 모듈이 수행한다.In order for the patent document corresponding to the lower-level patent classification code to be included in the search result for the received patent classification code, at least one of the following three is required, and the lower-level patent classification code processing module .

첫째, 상기 계층적 특허 분류 기호 데이터베이스를 이용하여 입수 시점을 기준으로 하여 입수 받은 특허 분류 기호의 하위 계층 특허 분류 기호를 모두 찾아 낸다. 계층적 특허 분류 기호 데이터베이스 상의 특허 분류 기호들이 트리 구조를 가지고 있기 때문에 이를 찾아 낼 수 있게 된다. 입수 받은 검색식에 특정한 특허 분류 기호가 포함되어 있을 경우, 상기 트리 구조를 따라서 그 특허 분류 기호의 하위 계층의 모든 특허 분류 기호들을 찾아낼 수 있게 되고, 그 결과를 수정 검색식 생성 모듈에 전송하며, 수정 검색식 생성 모듈은 전송 받은 모든 특허 분류 기호들을 반영한 새로운 검색식을 생성하여 검색 엔진에 질의할 수 있다.First, the hierarchical patent classification database is used to find all the lower hierarchical patent classification codes of the patent classification symbols obtained based on the time of acquisition. Hierarchical Patent Classifiers Because patent classification symbols on a database have a tree structure, they can be found. When the acquired search formula includes a specific patent classification symbol, all patent classification symbols in the lower hierarchy of the patent classification symbol can be found according to the tree structure, and the result is transmitted to the modified search formula generation module , The modified search expression generation module can generate a new search expression reflecting all transmitted patent classification codes and query the search engine.

둘째, 사전에 모든 특허 분류 기호에 대하여 그 하위 계층 특허 분류 기호들을 찾아내어 매칭하여(매칭 테이블에) 저장해 놓고, 입수되는 검색식에서 특정 특허 분류 기호가 있을 경우, 그 특정 특허 분류 기호에 매칭된 하위 계층 특허 분류 기호들을 즉시 상기 수정 검색식 생성 모듈에 전송할 수 있다.Second, if the lower level patent classification codes are found and matched (stored in the matching table) with respect to all patent classification codes in advance, and there is a specific patent classification code in the retrieval formula obtained, Layer patent classification symbols to the modified search formula generation module immediately.

상기 하위 계층 특허 분류 기호 추출 모듈이 추출하는 하위 계층 특허 분류 기호의 범위는 크게 2가지가 있을 수 있다. 하나는 입수 받은 특허 분류 기호의 직 하위 특허 분류 기호만으로 추출하는 방법이며, 다른 하나는 입수 받은 특허 분류 기호에 종속되는 모든 하위 특허 분류 기호를 추출하는 방법이다. 이때, 직하위 특허 분류 기호만을 추출하는 경우에도, 상기 추출된 직하위 특허 분류 기호들 각각에 대하여 다시 이들의 직하위 특허 분류 기호를 추출해 내며, 이와 같은 방식을 반복함으로써 입수 받은 특허 분류 기호에 종속되는 모든 하위 특허 분류 기호를 추출해 낼 수 있다.The lower layer patent classification symbol extraction module extracts the lower layer patent classification symbol in two ways. One is a method of extracting only the subclasses of the acquired patent classification code, and the other is a method of extracting all the subclass classification codes that are dependent on the received patent classification code. In this case, even when only the sub-classifier is extracted, the sub-sub-classifier is extracted again for each extracted sub-classifier, and the sub- All sub-patent classification codes can be extracted.

예를 들어 설명하면, 검색식에서 포함되어 있는 특허 분류 기호가 H04B 7/15인 경우, 직하위 특허 분류 기호는 H04B 7/155, H04B 7/185, 및 H04B 7/204 이며, 하위의 모든 특허 분류 기호는 H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216가 된다.For example, if the patent classification code contained in the search formula is H04B 7/15, the sub-patent classification codes are H04B 7/155, H04B 7/185, and H04B 7/204, The symbols are H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7 / 216.

H04B7/15, H04B7/155, H04B7/165, H04B7/17, H04B7/185, H04B7/19, H04B7/195, H04B7/204, H04B7/208, H04B7/212, H04B7/216가 된다.H04B7 / 15, H04B7 / 155, H04B7 / 165, H04B7 / 17, H04B7 / 185, H04B7 / 19, H04B7 / 195, H04B7 / 204, H04B7 / 208, H04B7 / 212 and H04B7 / 216.

상기와 같은 직하위 특허 분류 기호 및 모든 하위 특허 분류 기호를 추출하는 것은 상기 child -> parent 대응 관계에서나, parent -> child 대응 관계에서나 직접 또는 반복적으로(recursively) 용이하게 추출해 낼 수 있음은 당업자에게는 당연할 것이다.It will be appreciated by those skilled in the art that the extraction of such sub-classifications and all sub-classifications may be easily and directly or recursively extracted from the parent-to-parent correspondence or the parent-to-child correspondence It will be natural.

상기 하위 계층 특허 분류 기호 추출 모듈은 추출된 하위 계층 특허 분류 기호를 주합하여 저장한다. 상기의 검색식에서 포함되어 있는 특허 분류 기호가 H04B 7/15인 경우에는 직하위 특허 분류 기호만을 저장할 경우에는 ^H04B 7/15, H04B 7/155, H04B 7/185, 및 H04B 7/204^ 을 주합하여 저장하며, 하위의 모든 특허 분류 기호를 함께 저장할 경우에는 ^H04B 7/15, H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216^를 주합하여 저장한다.The lower layer patent classification symbol extracting module stores the extracted lower layer patent classification symbol summingly. If the patent classification code included in the above search formula is H04B 7/15, then only H04B 7/15, H04B 7/155, H04B 7/185, and H04B 7/204 ^ H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/15, H04B 7/17, H04B 7/17, H04B 7/15, H04B 7/15, H04B 7/16, H04B 7/16, H04B 7/16, H04B 7/15, 195, H04B 7/204, H04B 7/208, H04B 7/212, and H04B 7/216 ^.

수정 검색식 생성 모듈Modified Search Expression Generation Module

수정 검색식 생성 모듈은 상기 주합된 특허 분류 기호를 원래의 기존 검색식과 통합하여 수정 검색식을 생성한다.The revised retrieval expression generation module generates the revised retrieval expression by integrating the categorized patent classification code with the original existing retrieval expression.

예를 들어 설명하면, 주어진 검색식이 ^{키워드=무선 and 능동} and {출원인 = 삼성전자} and {IPC = H04B 7/15}일 경우에 상기 수정 검색식 생성 모듈은 주어진 검색식에서 IPC H04B 7/15를 찾아서, 직하위 또는 모든 하위 특허 분류 기호를 입수 받고 주합하여, ^{키워드=무선 and 능동} and {출원인 = 삼성전자} and {IPC = H04B 7/15 OR H04B 7/155 OR H04B 7/185 OR H04B 7/204} 또는 ^{키워드=무선 and 능동} and {출원인 = 삼성전자} and {IPC = H04B 7/15 OR H04B 7/155 OR H04B 7/165 OR H04B 7/17 OR H04B 7/185 OR H04B 7/19 OR H04B 7/195 OR H04B 7/204 OR H04B 7/208 OR H04B 7/212 OR H04B 7/216}와 같은 분류 기호가 통합된 수정된 검색식을 생성해 낸다. 바람직하게는 후자의 검색식이 좋다.For example, in the case where a given search expression is ^ {keyword = wireless and active} and {applicant = Samsung Electronics} and {IPC = H04B 7/15}, the modified search expression generation module generates IPC H04B 7 / 15 Keyword = wireless and active} and {applicant = Samsung} and {IPC = H04B 7/15 OR H04B 7/155 OR H04B 7 / 185 OR H04B 7/204} or {Keyword = Wireless and Active} and {Applicant = Samsung} and {IPC = H04B 7/15 OR H04B 7/155 OR H04B 7/165 OR H04B 7/17 OR H04B 7 / 185 OR H04B 7/19 OR H04B 7/195 OR H04B 7/204 OR H04B 7/208 OR H04B 7/212 OR H04B 7/216}. Preferably, the latter search expression is preferable.

상기 하위 계층 특허 분류 기호 처리 모듈이 수행하는 역할을 요약하면, 입수되는 검색식에서 특허 분류 기호가 포함될 때, 그 특허 분류 기호의 하위 계층 특허 분류 기호에 관련된 정보를 찾아 내어 수정 검색식 생성 모듈이 처리할 수 있 도록 해 준다.In summary, when the patent classification code is included in the retrieval formula obtained, the information related to the lower class patent classification symbol of the patent classification symbol is found, and the modified retrieval expression generation module processes I can do it.

상기 검색 엔진은 사용자의 컴퓨터로부터 특허 분류 기호가 포함된 검색식을 입수 받는다 이때, 상기 입수된 검색식에 포함되어 있는 특허 분류 기호가 타이틀 정보에 도트를 포함하는 경우, 상기 특허 분류 기호 전처리 엔진을 통하여 특허 분류 기호 체계 상에서 하위 특허 분류 기호를 입수하고, 상기 입수한 특허 분류 기호를 OR로 묶고, 상기 OR로 묶은 하위 특허 분류 기호를 포함한 수정 검색식을 생성하고, 수정 검색식으로 검색을 수행한다. 상기 각 단계에 대해서는 상술한 바와 같다.If the patent classification symbol included in the obtained search formula includes a dot in the title information, the search engine may search the patent classification symbol preprocessing engine Obtains a sub-patent classification symbol through the patent classification system, binds the obtained patent classification symbol to OR, generates a modified search expression including the sub-patent classification symbol bound to the OR, and performs search using the modified search formula . The above steps are as described above.

한편 통상적인 검색 엔진은 범위 검색을 지원한다. 예를 들어 설명하면, H04B 7/15의 경우라면, H04B 7/15 및 H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216를 모두 or 관계로 묶어 검색 엔진에 질의하는 것 보다, From H04B 7/15 to H04B 7/216으로 질의하는 것이 훨씬 더 빨리 검색 결과를 응답 받을 수 있다. 이는 특히, 하위 계층 특허 분류 기호들이 많으면 많을수록 그 검색 속도의 차이는 커진다. Meanwhile, a typical search engine supports range search. For example, in the case of H04B 7/15, H04B 7/15 and H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B It is much faster to query from From H04B 7/15 to H04B 7/216 than to query the search engine by putting together all of the 7/204, H04B 7/208, H04B 7/212, and H04B 7/216 The results can be answered. This is especially true if the number of lower-level patent classification symbols is larger, the difference in the search speed becomes larger.

범위 검색을 위해서는 검색 엔진 또는 DBMS가이 범위에 대한 이형 범위 기호 처리를 지원하는가에 따라 생성되는 수정 검색식 구성 요소가 약간 상이하게 된다. 이형 범위 기호 처리란 ^A보다 크거나 같고, B 보다 작은 것^과 같이 범위의 양쪽에 쓰이는 기호가 다른 경우를 처리하는 것을 말한다. 이형 범위 기호 처리를 지원하는 경우에는, 입수된 특허 분류 기호에 대하여, ^(상기 입수된 특허 분류 기호보 다 크거나 같고, 상기 입수된 특허 분류 기호의 next sibling보다는 작은 범위의 특허 분류 기호들)^라는 수정된 검색식 구성 요소를 생성한다. 이형 범위 기호 처리를 지원하지 않는 경우에는 ^{(상기 입수된 분류 기호) or (상기 입수된 특허 분류 기호보다 크거나 같고, 상기 입수된 특허 분류 기호의 next sibling보다는 작은 범위의 특허 분류 기호들)}을 생성하게 된다. 물론, 사전식으로 배열되는 변형 특허 분류 기호를 사용하는 경우에는 그 변형 특허 분류 기호에 대해서 위와 같이 처리함은 물론이라 하겠다.For the range search, the modified search expression components generated are slightly different depending on whether the search engine or the DBMS supports the disparity range symbol processing for this range. Distinct Range Symbol processing refers to processing a case where symbols on both sides of the range are different, such as ^, which is greater than or equal to ^ A and less than B. In the case of supporting the heterogeneous range symbol processing, it is preferable that, for the obtained patent classification symbol, ^ (patent classification symbols larger than or equal to the obtained patent classification symbol and smaller than the next sibling of the acquired patent classification symbol) Create a modified search expression component ^. (The above-mentioned obtained classification symbol) or (the patent classification symbols that are equal to or larger than the obtained patent classification symbol and smaller than the next sibling of the acquired patent classification symbol) }. Of course, in the case of using a modified patent classification code arranged in a dictionary manner, it is needless to say that the modified patent classification code is processed as described above.

상기 입수되는 검색식이 ^{키워드=무선 and 능동} and {출원인 = 삼성전자} and {IPC = H04B 7/15}라면, 이형 범위 기호 처리가 되는 경우에는 ^{키워드=무선 and 능동} and {출원인 = 삼성전자} and {IPC >= H04B 7/15 and IPC <H04B 7/22}^이 된다. 반면, 이형 범위 기호 처리가 되지 않는 경우에는 ^{키워드=무선 and 능동} and {출원인 = 삼성전자} and {IPC = H04B 7/15 or (IPC > H04B 7/15 and IPC <H04B 7/22)}가 되게 된다.If the retrieval formula obtained is ^ {keyword = wireless and active} and {applicant = Samsung Electronics} and {IPC = H04B 7/15} = Samsung Electronics} and {IPC> = H04B 7/15 and IPC <H04B 7/22}. On the other hand, if the disparity range symbol processing is not possible, ^ {keyword = wireless and active} and {applicant = Samsung Electronics} and {IPC = H04B 7/15 or (IPC> H04B 7/15 and IPC <H04B 7/22) }.

이상과 같은 본 발명의 특허 분류 기호 전처리 엔진의 핵심은 주어진 특허 분류 기호가 있을 때, 그 특허 분류 기호의 모든 하위 계층(child) 또는 직하위 계층에 있는 하위 특허 분류도 자동으로 처리하여 검색, 분석, 감시 등의 모든 검색식 및 질의식에 반영될 수 있게 하는 기능에 있다.The core of the patent classification symbol preprocessing engine of the present invention is that when there is a given patent classification symbol, all sub-classes (child) of the patent classification symbol or sub-patent classification in the sub-hierarchy are automatically processed, , And surveillance, and the ability to be reflected in the quality consciousness.

수정된 검색식으로 검색 결과가 나오는 경우, 검색 결과의 디스플레이는 검색자의 선택에 따라 클러스터링(clustering)하여 제공될 수도 있다. 클러스터링이란, 검색 결과를 묶어서 그룹단위로 표현하는 방식으로, 본 발명에 적용하는 경우 다음과 같다. 클러스트링 되는 구조를 표현하는 클러스터링 창이 있거나, 클러스터링 레벨을 선택할 수 있을 때, 검색 결과를 기설정되거나 기선택된 클러스터링 레벨에 상응하는 도트 서브그룹 및 그 이하의 서브그룹의 검색 결과만 표현하며, 검색자로부터 하위 또는 상위 클러스터링 레벨이 선택되는 경우(상기 클러스터링 레벨에는 도트 레벨이 대응되어 있다) 상기 선택된 클러스터링 레벨에 상응하는 검색 결과를 표현한다. When a search result is displayed in a modified search expression, the display of the search result may be provided by clustering according to the searcher's selection. Clustering refers to a method of grouping search results and grouping them into groups, and the present invention is applied as follows. When there is a clustering window representing a clustered structure or when a clustering level can be selected, only the search results of the dot subgroup and subgroups corresponding to the pre-set or pre-selected clustering levels are retrieved, And expresses the search result corresponding to the selected clustering level when the lower or higher clustering level is selected (the clustering level corresponds to the dot level).

상기의 H04B 7/00이 관련된 검색 결과를 통해 예를 들면, 1-도트 서브그룹 레벨의 클러스터링이 선택된 경우에는 상기 클러스터링 창에는 1-도트 서브그룹에 해당하는 7/005, H04B 7/01, H04B 7/015, H04B 7/02, H04B 7/14, H04B 7/22, H04B 7/24이 H04B 7/00의 하위 디렉토리 폴더 형태로 나타나고, 검색 결과는 H04B 7/00에 해당하는 모든 결과가 나온다. 이때, 검색자가 H04B 7/14를 선택했을 때는 그 직하위인 H04B 7/145, H04B 7/15가 하위 디렉토리 폴더 형태로 나타나고, 이 경우 검색 결과는 H04B 7/14 및 그 하부 특허 분류 기호에 해당하는 검색 결과만 표시된다. 다시 검색자가 H04B 7/15를 선택했을 경우에는 그 하위인 H04B 7/155, H04B 7/185, H04B 7/204가 표시되며, 검색 결과는 H04B 7/15 및 그 하위 특허 분류 기호에 해당하는 검색 결과만이 표시된다.For example, when clustering at the 1-dot subgroup level is selected through the search result related to H04B 7/00, the clustering window includes 7/005, H04B 7/01, H04B 7/015, H04B 7/02, H04B 7/14, H04B 7/22, H04B 7/24 appear in the form of a subdirectory folder in H04B 7/00, and the search results are all results that correspond to H04B 7/00 . At this time, when the searcher selects H04B 7/14, H04B 7/145 and H04B 7/15 are displayed in the form of a subdirectory folder. In this case, the search result corresponds to H04B 7/14, Only search results are displayed. If the searcher again selects H04B 7/15, the subordinates H04B 7/155, H04B 7/185, and H04B 7/204 are displayed, and the search results are retrieved according to H04B 7/15 and its subpopulation classification code Only results are displayed.

이때, 검색자의 편의를 위해서 클러스터링 창의 각 클러스트링 항목(디렉토리의 폴더명) 옆에 자신의 직하위에 몇 개의 특허 분류 기호가 있는 지와, 모든 하부 특허 분류 기호의 개수가 몇 개 있는지를 함께 또는 어느 하나만을 표시해 줄 수 있다. 그 특허 분류 기호 및 그 특허 분류 기호의 모든 하위 특허 분류 기호에 해당하는 특허 문헌의 개수를 표시해 줄 수도 있다. 이때, 출원 문헌수와 등록 문헌 수 2개 모두를 표시해 줄 수도 있으며, 출원 문헌수나 등록 문헌 수 1개만 표시해 줄 수도 있다. 예를 들면, 특허 분류 기호가 H04B 7/15인 경우에는 직하위 특허 분류 기호는 H04B 7/155, H04B 7/185, 및 H04B 7/204 이므로 ^3^을 표시해 주거나, 모든 하위의 특허 분류 기호가 H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216이므로, ^10^을 표시해 줄 수 있다. 양자를 모두 표시하는 주는 것이 바람직한데, 그 이유는 직하위에는 분류가 적지만, 그 직하위의 하부에 엄청난 크기의 분류가 있을 수 있기 때문이다. 이는 특히 새롭게 떠오르는 신기술 분야 또는 기술 분화 속도 또는 세부 분류 증가 속도가 큰 첨단 기술 분야에서 더욱 그러할 것이다. 만약 직하위 분류의 개수만 표시해 주는 경우, 그 수치값이 적을 때 그 기술 분야의 중요성이 실제함에도 불구하고, 이를 파악할 수 없어 그 분류를 무시하고 넘어가는 사용자가 있을 수 있기 때문이다.At this time, for the sake of the searcher's convenience, next to each clustering item (folder name of the directory) in the clustering window, there are several patent classification symbols on its lower right and how many Can be displayed. The number of patent documents corresponding to the patent classification mark and all sub-patent classification marks of the patent classification mark may be displayed. At this time, both the number of documents to be filed and the number of documents to be registered may be displayed, and only the number of documents filed or registered documents may be displayed. For example, if the patent classification code is H04B 7/15, the subtype patent classification code is H04B 7/155, H04B 7/185, and H04B 7/204, so indicate ^ 3 ^ Are H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216 So, you can display ^ 10 ^. It is preferable to give both quantities, because there are few classifications directly above and below, but there can be huge classifications at the bottom. This is especially true in the emerging new technology sector or in high-tech areas where the rate of technology differentiation or the speed with which detailed classification increases. If only the number of subclassifications is shown, there may be users who ignore the classification because the technical value is less important when the numerical value is small.

패밀리 정보 전처리 모듈Family information preprocessing module

특허 패밀리란 특정 특허 문건과 국내, 또는 해외적으로 직접적으로 관련 있는 문건으로 구성된 문건 정보의 집합을 말한다. 통상적으로 하나의 국가 내에서, 1) 분할출원, 2) 변경 출원이나 이중출원, 3) 국내 우선권 주장 출원(미국에서의 일부 계속 출원, 재발행 특허 등과 같이 각 국가마다 패밀리의 범위를 규정하는 조건이 조금씩 다를 수 있다) 등이 국내 특허 패밀리를 이룬다. 통상적으로 하나의 출원을 기초로 한 1) 국제출원(PCT), 2) 조약 우선권 출원, 3) 상기 1) 또는 2)가 특정 국가의 특허청에 출원되었을 때의 각 국가별 출원 및 상기 3)의 출원에 관한 (1) 분할출원, (2) 변경 출원이나 이중출원, (3) 국내 우선권 주장 출원(미국에서의 일부 계속 출원, 재발행 특허 등과 같이 각 국가마다 패밀리의 범위를 규정하는 조건이 조금씩 다를 수 있다) 등이 국제 특허 패밀리를 이룬다. 본 발명의 특허 패밀리 정보를 전처리 하는 패밀리 정보 전처리 모듈에 대한 구성은 도 9에 잘 나타나 있다. 패밀리 정보를 처리하기 위해서는 그 정보를 처리하는 패밀리 정보 전처리 엔진(3810)이 필요하며, 패밀리 정보를 입수하기 위한 각종 DB가 필요하다. 필요한 DB의 예로는 조약 우선권 정보 DB(3671), PAJ DB(3673), KPA DB(3675), Inpadoc DB(3677), 기타 패밀리정보 DB(3830) 등이 있을 수 있다. 특히 Inpadoc DB(3677)가 중요한 역할을 한다. A patent family is a collection of document information consisting of documents directly related to a specific patent document, domestic or overseas. In general, in a single country, the following conditions apply: 1) a split application; 2) an application for change or duplication; and 3) a claim for domestic priority claim (such as some continuation applications and reissue patents in the United States) Which may be slightly different) constitute the domestic patent family. (1) International application (PCT), (2) Treaty priority application, and (3) When the above 1) or 2) has been filed with the Patent Office of a specific country, (1) application for divisional application, (2) application for change or double application, (3) application for domestic priority claim (for some countries such as continuation application and reissue patent in the United States, Etc.) constitute the international patent family. The configuration of the family information preprocessing module for preprocessing the patent family information of the present invention is shown in FIG. In order to process the family information, a family information preprocessing engine 3810 for processing the information is required, and various DBs for obtaining family information are required. Examples of the necessary DBs may include treaty priority information DB 3671, PAJ DB 3673, KPA DB 3675, Inpadoc DB 3677, and other family information DB 3830. In particular, Inpadoc DB (3677) plays an important role.

상기 패밀리 정보 전처리 모듈은 특허 문건 마스트 DB의 가공과 업데이트와 밀접하게 관련이 있다. 그 이유는 특허 정보는 1) 신규한 문건이 지속적으로 발행되고, 2) 국가 단위로 입수되는 것이 대부분이며, 상기 각 국가별로 신규한 문건이 지속적을 발행되므로, 상기 패밀리 정보는 상기 1) 및/또는 2)에 따라 언제든지 변경될 가능성이 있기 때문이다. 즉, 상기 1) 또는 2)의 문건 입수 시 상기 패밀리 정보는 갱신되어야 할 필요가 있으며, 상기 1) 또는 2)의 문건 입수 시 패밀리 정보의 갱신은 본 발명의 패밀리 정보 전처리 모듈이 수행한다. 상기 패밀리 정보는 기존 문건 또는 신규 문건을 대상으로 원출원 정보 내지 우선권 정보에 특정 출원 번호가 존재하는 지를 통해서 발견할 수 있다.The family information preprocessing module is closely related to the processing and updating of the patent document mast database. The reason for this is that most of the patent information is 1) a new document is continuously issued, 2) a document is obtained nationwide, and new documents are continuously issued for each country, Or 2) at any time. That is, the family information needs to be updated at the time of obtaining the document of 1) or 2), and the update of the family information at the time of obtaining the document of 1) or 2) is performed by the family information preprocessing module of the present invention. The family information can be found through the existence of a specific application number in the source information or priority information for existing or new documents.

상기 패밀리 정보 전처리 모듈의 상기 패밀리 정보 전처리 엔진이 패밀리 정 보를 처리하는 방법은 도 87에 나와 있다. 상기 패밀리 정보 전처리 모듈은 적어도 하나 이상의 특허 문건 정보를 입수하고(S3720), 상기 특허 문건 정보에 우선권 주장 번호, 분할 출원, 변경 출원 또는 일부 계속 출원 등의 패밀리 정보가 있는 지를 파악하며(S3730), 패밀리 정보가 있는 경우, 상기 특허 문건의 정보를 패밀리 정보에 대응되는 패밀리 특허 문건의 고유 문건 번호와 관련 지어 특허 DB 또는 검색 인덱스에 저장하며(S3740), 선택적으로 입수된 특허 문건에 국가별 패밀리 정보의 개수를 카운팅하여 저장하거나 패밀리 특허 문건의 고유 문건 번호에 패밀리 정보의 개수에 관한 정보가 있을 때 그 개수에 가산한다(S3750).A method of processing the family information by the family information preprocessing engine of the family information preprocessing module is shown in FIG. The family information preprocessing module obtains at least one or more patent document information (S3720), determines whether the patent document information includes family information such as a priority claim number, a split application, a change application, or a continuation application (S3730) If the family information is present, the information of the patent document is stored in the patent database or the search index in association with the unique document number of the family patent document corresponding to the family information (S3740), and the family document (S3750). If there is information about the number of family information in the unique document number of the family patent document, the number is added to the number of the family information (S3750).

상기 패밀리 정보 전처리 모듈이 처리한 패밀리 정보는 본 발명의 패밀리 정보 DB에 저장된다.The family information processed by the family information preprocessing module is stored in the family information DB of the present invention.

인용 정보 전처리 모듈Citation information preprocessing module

본 발명의 인용 정보 전처리 모듈에는 인용과 관련된 정보를 획득하는 인용 정보 획득 모듈(3400-1)과 인용 정보 기반하여 피인용 정보를 갱신하는 피인용 정보 갱신 모듈(3400-2) 중 어느 하나 이상을 포함하고 있다. 상기 인용 정보 획득 모듈은 출원인의 특허 문건에서 출원인이 인용한 정보에서 인용 관련 정보를 획득하는 출원인 인용 정보 획득 모듈(3400-1-1)과 심사관이 심사 과정에서 인용한 정보를 획득하는 심사 인용 정보 획득 모듈(3400-1-2)이 있다. 한편, 상기 출원인 인용 정보 획득 모듈(3400-1-1)에는 다시, 출원인 인용 특허 문건 정보 획득 모듈(3400-1-1-1)과 출원인 인용 비특허 문건 정보 획득 모듈(3400-1-1-2)이 있다. 상기 출원인 인용 특허 문건 정보 획득 모듈(3400-1-1)은 인용 정보가 특허 정보일 경우에 작동한다. 그리고, 심사 인용 정보 획득 모듈(3400-1-2)에는 심사관 인용 특허 문건 정보 획득 모듈(3400-1-1-1)과 심사관 인용 비특허 문건 정보 획득 모듈(3400-1-2)이 있다. 상기 인용 정보 전처리 모듈은 상기 인용 정보 획득 모듈이 획득한 인용 정보를 기설정된 방식으로 처리한다.In the citation information preprocessing module of the present invention, any one or more of the citation information acquisition module 3400-1 for acquiring information related to the quotation and the cited information update module 3400-2 for updating the citation information based on the citation information . The citation information acquisition module includes an applicant citation information acquisition module 3400-1-1 for acquiring citation related information from the information cited by the applicant in the applicant's patent document and examination citation information There is an acquisition module 3400-1-2. The applicant's citation information acquisition module 3400-1-1 is again connected to the applicant's patent document information acquisition module 3400-1-1-1 and the applicant's non-patent document information acquisition module 3400-1-1- 2). The patent document information acquisition module 3400-1-1 for citing applicant operates when the citation information is patent information. The examination citation information acquisition module 3400-1-2 includes an examiner citation patent document information acquisition module 3400-1-1-1 and an examiner citation non-patent document information acquisition module 3400-1-2. The citation information preprocessing module processes the citation information acquired by the citation information acquisition module in a preset manner.

특허 인용 정보의 종류에는 1) 출원인이 선행 기술로서 타 특허를 인용하는 선행 기술 인용 정보와 2) 심사관이 특정 특허 문건을 심사할 때 인용하는 인용 참증으로서의 특허를 인용하는 참증 인용 정보 및 3) 간접 인용 정보가 있을 수 있다. 물론, 상기 인용은 국내외적으로 일어남은 당연할 것이다. 통상적으로 상기 첫번째 선행 기술로 타특허를 인용하는 것은 대한민국이나 일본에서는 종래 기술 문헌 정보에 있고, 미국에서는 레퍼런스(reference) 정보에 있다. 이러한 선행 기술 인용 정보는 특허 문건의 서지 사항 또는 본문에서 획득 가능하다. 또한, 많은 국가들이 참증 인용 정보를 공개하는데, 이는 특허청에서 공개하는 행정 정보 등에서 확인할 수도 있다. 본 명세서에서는 상기 선행 기술 인용 정보를 중점적으로 설명하나, 획득된 참증 인용 정보도 선행 기술 인용 정보처럼 다루어 질 수 있는 경우에는 상기 설명이 동등하게 준용 될 수 있을 것이다. 상기 간접 인용은 간접 전방 인용과 간접 후방 인용이 있다. 하나 이상의 처리 대상 특허 문건 집합이 있을 때, 이들 특허 문건 집합의 간접 전방 인용 문건 집합은, 상기 처리 대상 특허 문건 집합을 구성하는 처리 대상 특허 문건에 대하여, 상기 처리 대상 특허 문건보다 출원일이 앞서면서, 상기 처리 대상 특허 문건이 인용하는 특허 문건을 동시에 인용하고 있는 특허 문건들로 구성되는 것이다. 이들 특허 문건 집합의 상기 간접 후방 인용 문건 집합은, 상기 처리 대상 특허 문건 집합을 구성하는 처리 대상 특허 문건에 대하여, 상기 처리 대상 특허 문건보다 출원일이 늦으면서, 상기 처리 대상 특허 문건을 인용하는 특허 문건이 동시에 인용하고 있는 특허 문건들로 구성되는 것인 것이다. 이들 간접 인용은 주로 특허 출원일과 특허 공개 기간의 차이에서 기인하는 경우가 많다.2) the information of the patents citing the patents as quotation marks cited when the examiner examines specific patent documents; and 3) the indirect reference information citing the patents as indirect references There may be citation information. Of course, it is natural that the above quotation occurs both domestically and internationally. Normally, citing other patents with the above-mentioned prior art is in the prior art literature information in Korea or Japan, and in reference information in the United States. Such prior art citations may be obtained in the bibliography or text of the patent document. In addition, many countries disclose citations for citations, which can be found in administrative information published by the Patent Office. In the present specification, the prior art citation information will be mainly described, but if the obtained historical citation information can be treated as prior art citation information, the above description can be equally applied. The indirect citation includes indirect citation and indirect citation. When there is at least one set of patent documents to be processed, the set of indirectly quoted documents of these patent document sets is a set of indirectly quoted documents of the set of patent documents to be processed, with respect to the patent document to be processed constituting the set of patent documents to be processed, And the patent documents citing the patent document cited by the patent document to be processed simultaneously. The set of indirectly quoted documents of the set of patent documents includes a patent document for citing the patent document to be processed while the filing date of the patent document to be processed is later than the patent document to be processed, Is a combination of patent documents cited at the same time. These indirect citation are often caused by the difference between the patent application date and the patent disclosure period.

상기 선행 기술 인용 정보 및 상기 참증 인용 정보를 전처리하는 것은 본 발명의 인용 정보 전처리 모듈이 수행하며, 도 5에서 알 수 있듯이 상기 인용 정보 전처리 모듈에는 선행 기술 인용 정보를 전처리하는 선행 기술 인용 정보 전처리 모듈과 참증 인용 정보를 전처리하는 참증 인용 정보 전처리 모듈이 포함되어 있다. 통상적으로 특정 문건의 서지 사항 또는 본문에 전방 인용(backward citation)은 포함되어 있기 때문에, 전방 인용에 관한 문건 정보는 상기 특정 문건의 검색 인덱스 또는 특정 문건 정보에 저장되어 있다. 하지만, 상기 특정 문건에 관한 후방 인용(forward citation)은 그 특정 문건 정보에 저장되지 않고, 상기 특정 문건을 인용하는 문건의 검색 인덱스 또는 상기 인용하는 문건 정보에 저장되어 있다. 그러므로, 상기 특정 문건에 관한 후방 인용을 찾아서 상기 특정 문건과 관련 지어 저장할 필요가 있게 되고, 이러한 기능은 상기 선행 기술 인용 정보 전처리 모듈이 수행한다. As shown in FIG. 5, the citation information preprocessing module includes a preprocessing information preprocessing module for preprocessing the prior art citation information. The preprocessing information preprocessing module preprocesses the prior art citation information and the pre- And preprocessing information preprocessing module for preprocessing the information of the reference citation. Normally, the bibliographic information of the specific document or the backward citation is included in the text, so the document information about the forward reference is stored in the search index or the specific document information of the specific document. However, the forward citation relating to the specific document is not stored in the specific document information but is stored in the search index of the document to which the specific document is cited or in the cited document information. Therefore, it is necessary to search for the backward quotation related to the specific document and store it in association with the specific document, and this function is performed by the preliminary technical citation information preprocessing module.

선행 기술 인용 정보 전처리 모듈Prior art citation information preprocessing module

상기 선행 기술 인용 정보 전처리 모듈은 상기 특정 문건의 출원 번호 내지 상기 특정 문건의 고유 번호값을 상기 검색 인덱스 또는 DBMS에 질의함으로써, 상 기 특정 문건을 인용하고 있는 상기 특정 문건의 후방 인용 정보를 획득할 수 있다. 상기 선행 기술 인용 정보 전처리 모듈은 상기 획득한 상기 특정 문건의 후방 인용 정보를 상기 특정 문건의 인덱싱 과정에서 상기 인덱스에 포함시켜 놓을 수도 있고, 상기 특허 문건 마스트 DB에 상기 특정 문건의 후방 인용 정보로서 저장할 수 있다. 전방 인용 정보는 거의 변화가 없는 반면(출원인이 출월할 때 기재하는 것이며, 간혹 추가 등이 있을 수 있으나 전방 인용의 추가는 거의 발생하지 않는 것이 통상이다.) 후방 인용 정보는 특허 정보의 갱신 시 항상 모니터링 해야 하는 정보이다. 그러므로, 신규한 특허 문건이 발생해서, 검색 인덱스가 갱신되거나, 특허 문건 마스트 DB가 갱신되는 경우, 상기 선행 기술 인용 정보 전처리 모듈은 상기 특정 문건을 인용하고 있는 갱신 특허 문건이 있는 지를 질의하여, 후방 인용 정보를 갱신해야 한다.The preliminary technical citation information preprocessing module obtains the rear citation information of the specific document citing the specific document by querying the search index or the DBMS with the unique number value of the specific document or the application number of the specific document . The prior art citation information preprocessing module may include the acquired rear citation information of the specific document in the index in the indexing process of the specific document or may be stored in the patent document master DB as rear citation information of the specific document . While forward citation information is almost unchanged (it is usually stated when the applicant is going to be released, and there may be occasional additions, but the addition of forward citation rarely occurs). The backward citation information is always updated This is the information you need to monitor. Therefore, when a new patent document is generated and the search index is updated or the patent document mast DB is updated, the prior art citation information preprocessing module inquires whether there is an updated patent document citing the specific document, The citation information must be updated.

인용 번호를 대표화하는 인용 정보 전처리 방법에 대해서는 도 88에 상세히 나와 있다. 상기 인용 정보 전처리 모듈은 특정 문건에 존재하는 적어도 하나 이상의 인용 번호를 입수 받고(S3820), 상기 인용 번호의 속성이 기설정된 인용 번호 속성이 아닌 인용 번호를 입수하고(S3830), 상기 입수된 인용 번호를 검색 엔진 또는 DBMS에 질의하여 상기 기설정된 속성의 인용 번호를 입수 받고(S3840), 상기 입수된 기설정된 속성의 인용 번호로 상기 입수 받은 인용 번호를 대표화한다(S3850).The citation information preprocessing method for representing the citation number is described in detail in FIG. In step S3820, the preprocessing information preprocessing module obtains at least one quotation number existing in a specific document (S3820), obtains a quotation number that is not a predetermined quotation number attribute in the attribute of the quotation number (S3830) In step S3840, the query number of the predetermined attribute is inquired to the search engine or DBMS in step S3840. In step S3850, the obtained quoted number is represented by the obtained quotation number of the predetermined attribute.

특정 문건을 인용하고 있는 후출원 문건에 대한 정보인 후방 인용 문건 정보의 획득한 후 상기 후방 인용 문건 정보를 상기 특정 문건의 문건 정보에 포함시킬 필요가 있으며, 이러한 과정의 일 실시예적 방법을 도 89가 도시하고 있다.It is necessary to incorporate the backward reference document information into the document information of the specific document after acquiring the backward reference document information which is information on the application document after citing the specific document. Respectively.

상기 인용 정보 전처리 모듈은 특정 문건 데이터를 입수하고(S3920), 상기 특정 문건 데이터에 포함된 출원번호, 등록번호, 공개번호 중 적어도 어느 하나 이상을 입수하며 (S3940), 상기 입수한 번호를 검색 엔진의 인용 필드에 질의하거나, DBMS로 인용 필드를 한정하여 질의하고(S3950), 상기 질의 결과로서 후방 인용 정보를 입수 받으며(S3960), 상기 후방 인용 정보를 상기 문건 데이터의 서지 사항에 포함시킨다(S3970).The citation information preprocessing module obtains specific document data (S3920), obtains at least one of an application number, a registration number, and a public number included in the specific document data (S3940) (S3950), the back quote information is obtained as the query result (S3960), and the back quote information is included in the bibliographic information of the document data (S3970 ).

참증 인용 정보 전처리 모듈Information module for pre-

참증 인용 정보 전처리 모듈은 상기 선행 기술 인용 정보 전처리 모듈이 선행 기술 인용 정보를 대상으로 수행하는 역할을 참증 인용 정보를 대상으로 완전히 동일하게 수행한다.The pre-citation information preprocessing module performs the same functions as the citation information for the prior art citation information preprocessing module on the prior art citation information.

카운팅 전처리 모듈Counting preprocessing module

카운팅 전처리 모듈에 대해서 설명한다. 상기 카운팅 전처리 모듈에는 개별 문건 단위에서 카운팅 처리를 수행하는 개별 문건 단위 카운팅 전처리 모듈과 2이상의 문건 단위에서 카운팅 처리를 수행하는 복수 문건 단위 카운팅 전처리 모듈이 있다.The counting preprocessing module will be described. The counting preprocessing module includes a preprocessing module for counting individual document units for performing a counting process in individual document units and a preprocessing module for counting multiple document units for counting processes in two or more document units.

개별 문건 단위 카운팅 전처리 모듈Individual document unit counting preprocessing module

상기 개별 문건 단위 카운팅 전처리 모듈은 개별 특허 문건에 나와 있는 1)출원인 수 및/또는 특허권자 수, 2) 발명자 수, 3) 출원 또는 등록 단계 등과 같은 각 단계별 청구항 수, 4) 명세서 페이지 수, 5) 도면 수, 6) 특허 분류 기호의 종류 수, 7) 각 종류의 특허 분류 기호의 수, 8) 레퍼런스(reference)수, 9) 심사 인용 수, 10) 심사관이 조사한 특허 분류 기호의 수, 11) 우선권 주장수, 12) 국가별 패밀리 특허 개수, 13) 전체 패밀리 개수, 14) 독립항 수, 15) 종속항 수, 16) 레퍼런스 중에서 국가 단위의 특허 수, 17) 레퍼런스 중에서 전체 특허 문건 수, 18) 레퍼런스 중에서 비특허 문건 수 중에서 적어도 어느 하나 이상의 수치적 정보를 계산할 수 있다.The pre-processing module for counting the individual document unit counts the number of claims, 4) the number of specification pages, 5) the number of documents, 5) the number of patent applicants, The number of patent classification marks, 7) the number of patent classification codes of each kind, 8) the number of references, 9) the number of patent citations, 10) the number of patent classification codes examined by the examiner, The number of patent claims in the country in the reference, 17) the total number of patent documents in the reference, 18) the number of patent claims in the country, At least one numerical information among the number of non-patent documents in the reference can be calculated.

또한, 상기 개별 문건 단위 카운팅 전처리 모듈은 상기 개별 특허 문건 또는 상기 개별 특허 문건을 발행하는 특허청 등의 행정 처리 정보 등으로부터 1) 출원일로부터 등록일까지의 기간, 2) 우선일부터 출원일까지의 기간, 3) 출원일부터 심사청구일까지의 기간, 4) 의견제출통지일부터 의견서 제출일까지의 기간 정보 중 적어도 어느 하나 이상을 계산할 수 있다.In addition, the above-mentioned individual document unit counting preprocessing module can be used for a period from the filing date to the registration date, (2) the period from the priority date to the filing date, (3) the period from the filing date to the filing date, ) Period from the filing date to the date of the examination request, and 4) period information from the date of notification of the opinion submission to the date of submission of the opinion.

또한, 상기 개별 문건 단위 카운팅 전처리 모듈은 상기 개별 특허 문건 또는 행정 정보에서 개별 국가별로의 특정한 제도의 활용 또는 적용 여부에 관한 정보를 입수할 수 있는데, 그 정보로서는 1) 심사청구 제도의 활용 여부, 2) 분할 출원의 활용 여부, 3) 일부 계속 출원의 활용 여부, 4) 재발행 특허 제도의 활용 여부, 5) 우선심사 신청 여부, 5) 결정계 심판 활용 여부, 6) 비당사자계 소송 발생 여부 중 어느 하나 이상을 알아 낼 수 있으며, 상기 제도의 활용 또는 적용이 있는 경우에는 카운팅 값을 1로 하고, 그렇지 않을 경우에는 0으로 할 수 있을 것이다.In addition, the individual document count pre-processing module can obtain information on the utilization or application of a specific system for each individual country in the individual patent document or administrative information. The information includes 1) whether or not to use the examination request system, 2) whether or not to use split applications, 3) whether to apply some of the continuation applications, 4) whether to use the reissue patent system, 5) One or more can be identified, and if the system is used or applied, the count value can be set to 1, otherwise, it can be set to zero.

또한, 상기 개별 문건 단위 카운팅 전처리 모듈은 상기 개별 특허 문건과 관 련해서 상기 특허 문건 마스트 DB를 조사함으로써(검색 엔진 또는 DBMS를 사용하여) 적어도 하나 이상의 제2국에서의 상기 개별 특허 문건과 관련된 정보를 입수하고 카운팅할 수 있다. 상기와 같은 정보로는 1) 출원 국가수, 2) 등록 국가수, 3) 국제출원 여부, 4) 각 국가별 패밀리 수, 5) 전체 패밀리수 중 적어도 어느 하나 이상에 관한 정보를 알아 내어 카운팅할 수 있다.In addition, the individual document unit counting pre-processing module may include information related to the individual patent document in at least one second country (using a search engine or a DBMS) by examining the patent document mast database in association with the individual patent document Can be obtained and counted. Such information may include information about at least one of 1) the number of countries to be filed, 2) the number of countries registered, 3) the number of international applications, 4) the number of families in each country, and 5) .

또한, 상기 개별 문건 단위 카운팅 전처리 모듈은 상기 검색 엔진 또는 상기 DBMS를 통하여 1) forward citation(나를 인용하고 있는 국내외 다른 특허 문건들), 2) backward citation(내가 인용하는 이전 문건들), 3) 간접 인용 중 어느 하나 이상을 알아내어 카운팅할 수 있다.Also, the pre-processing module for counting individual document units can be classified into 1) forward citation (other domestic and foreign patent documents citing me), 2) backward citation (previous documents that I cite), 3) indirect One or more of the citations can be found and counted.

또한 상기 개별 문건 단위 카운팅 전처리 모듈은 상기 문건에 대한 1) 이의신청 여부, 2) 정보제공 여부, 3) 당사자계 심판 발생 여부 등에 대한 정보를 입수할 수 있다. 상기 1) 내지 3)에 관한 정보는 대한민국의 경우 대한민국 특허청 또는 특허심판원으로부터 직접 또는 조회 방식 등으로 입수할 수 있다.In addition, the individual document count pre-processing module can obtain information on 1) whether or not to file an objection, 2) whether information is provided, and 3) whether the judgment of a party has occurred. The information on 1) to 3) above can be obtained from the Korean Intellectual Property Office or the Patent Tribunal, either directly or through inquiry methods.

상기 개별 문건 단위 카운팅 전처리 모듈이 카운팅한 값은 상기 특허 정보 마스트 DB 중 가공 서지 사항 DB에 저장되는 것이 바람직하다. 아울러, 상기 카운팅한 값은 특허 문건을 검색 엔진이 인덱싱할 때 생성되는 인덱스에 포함시키는 방식으로 저장되는 것이 더욱 바람직하다. 상기 카운팅 값이 상기 인덱스에 포함시키는 방법은 1) 상기 검색 엔진이 인덱싱을 할 때, 상기 개별 문건 단위 카운팅 전처리 모듈을 가동하여 카운팅 값을 획득하거나, 2) 상기 특허 정보 마스트 DB에 저장 된 카운팅 값을 입수하는 방식 중 어느 하나 이상이 사용될 수 있다.The value counted by the individual document unit counting preprocessing module is preferably stored in the bibliographic database of the patent information mast database. It is further preferable that the counted value is stored in such a manner that the patent document is included in an index generated when the search engine indexes the patent document. The method of including the counting value in the index may include: 1) when the search engine performs indexing, activating the individual document unit counting preprocessing module to obtain a counting value; or 2) counting the count value stored in the patent information mast DB Or a method of acquiring the above information can be used.

상기 카운팅 전처리 모듈은 상기 특허 분류 기호 마스트 DB를 참조하여 각 특허 분류 기호에 대하여, 1) 총 하위 특허 분류 기호의 개수, 2) 직하위 특허 분류 기호의 개수 카운팅하여 그 카운팅 값을 상기 특허 분류 기호 마스트 DB에 저장해 놓을 수 있다.The counting preprocessing module refers to the patent classification code mast DB to calculate the number of patent classification codes for each patent classification symbol by 1) the total number of sub-patent classification codes, 2) the number of sub-patent classification codes, It can be stored in the mast database.

카운팅 전처리 모듈이 카운팅을 처리하는 방식은 도 78에 잘 나타나 있다. 먼저 카운팅 전처리 모듈이 적어도 하나 이상의 특허 문건 정보를 입수하며(S2820), 입수된 특허 문건 별로 적어도 하나 이상의 상기 개별 문건 단위 카운팅 전처리 모듈을 호출하며(S2830), 상기 개별 문건 단위 카운팅 전처리 모듈이 특허 문건 정보를 대상으로 카운팅 처리를 수행하며(S2840), 카운팅 처리 결과를 특허 문건 정보의 문건 고유 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S2850)The manner in which the counting preprocessing module handles counting is well illustrated in FIG. First, the counting preprocessing module obtains at least one patent document information (S2820), calls at least one or more individual document unit pre-processing modules for each patent document acquired (S2830), and the individual document unit count pre- (S2840). The counting result is stored in the patent DB or the search index together with the document unique number of the patent document information (S2850)

복수 문건 단위 카운팅 전처리 모듈Multi-document unit counting preprocessing module

이하, 상기 복수 문건 단위로 계산된 점수를 입수하는 모듈, 즉 복수 문건 단위 카운팅 전처리 모듈을 설명한다. 상기 복수 문건 단위 카운팅 전처리 모듈은 상기 대상 집합이 되는 복수 개의 문건에서 상기 개별 문건 단위 카운팅 전처리 모듈이 카운팅 해 놓은 정보를 취합하는 방식으로 점수를 입수한다. 즉, 상기 복수 문건 단위 카운팅 전처리 모듈은 적어도 2 이상의 문건 집합에 대하여 카운팅 데이 터를 생성한다.Hereinafter, a module for acquiring the score calculated in units of the plurality of documents, i.e., a multi-document unit counting preprocessing module will be described. The preprocessing module for counting multiple document units obtains a score by collecting information counted by the individual document unit pre-processing module in a plurality of documents constituting the target set. That is, the multi-document unit counting preprocessing module generates counting data for at least two document sets.

가중치 전처리 모듈Weight pre-processing module

문건 단위 가중치 전처리 모듈Document unit weight preprocessing module

하나의 특허 문건에 포함된 발명 사상의 가치를 시스템적으로 정확하게 측정한다는 것은 불가능하다. 하지만, 정확한 가치 측정이 불가능하다고 해서 모든 특허 문건의 가치를 동일하게 취급한다는 것도 비합리적일 수 있다. 그러므로, 본 발명에서는 시스템이 데이터로 파악할 수 있는 정보를 기반으로 하여 특허 문건의 가중치를 부여하는 방법을 개시한다. 상기 특허 문건의 가중치를 부여하는 기능은 본 발명의 문건 단위 가중치 전처리 모듈이 담당한다.It is impossible to systematically measure the value of inventive ideas contained in a patent document. However, it would be irrational to treat all patent documents equally, as it is impossible to measure the value accurately. Therefore, the present invention discloses a method for weighting patent documents based on information that the system can grasp as data. The document unit weight pre-processing module of the present invention is responsible for the weighting function of the patent document.

본 발명의 가중치 전처리 모듈은 문건 단위 가중치 전처리 모듈(3310)과 주체 단위 가중치 전처리 모듈(3330)이 있다. 상기 문건 단위 가중치 전처리 모듈(3310)에는 비용 지출 관점 가중치 전처리 모듈(3311), 인용 관점 가중치 전처리 모듈(3313), 분쟁 관점 가중치 전처리 모듈(3315), 및/또는 집중 관점 가중치 전처리 모듈(3317) 등을 포함하고 있을 수 있다. 그리고, 상기 주체 단위 가중치 전처리 모듈(3330)에는 출원인 단위 가중치 전처리 모듈(3331), 발명자 단위 가중치 전처리 모듈(3333), 및/또는 대리인 단위 가중치 전처리 모듈(3335) 등을 포함하고 있을 수 있다.The weight pre-processing module of the present invention includes a document unit weight pre-processing module 3310 and a subject unit weight pre-processing module 3330. The document unit weight preprocessing module 3310 includes a cost expenditure perspective weight preprocessing module 3311, a citation perspective weight preprocessing module 3313, a conflict perspective weight preprocessing module 3315, and / or a central perspective weight preprocessing module 3317 . &Lt; / RTI > The subject-based weight pre-processing module 3330 may include an applicant unit weight pre-processing module 3331, an inventor-based weight pre-processing module 3333, and / or a proxy unit weight pre-processing module 3335.

비용 지출 관점 가중치 전처리 모듈Expenditure perspective weighted pre-processing module

확률적, 통계적 또는 사회적 관점에서, 더 많은 비용을 지불한 특허가 더 중요한 특허일 가능성이 높을 것이다. 동일 주체에서 비용의 합리적 배분의 관점에서 볼 때에도 이러한 가정이 성립할 가능성이 높으며, 경쟁 관계에 있는 다른 주체들 간에서도 이러한 가정이 성립할 가능성이 높다. 하나의 특허에 관하여 비용이 지불되는 구조를 보면 다음과 같은 가중치 부여 요소를 생각할 수 있다. From a probabilistic, statistical or societal point of view, it is likely that more costly patents are more important. This assumption is highly likely to be established from the perspective of rational allocation of costs in the same entity, and this assumption is likely to be established among other competitors. The following weighting factors can be considered in the structure where a patent is paid for a patent.

1) 양적인 관점으로 (1) 청구항의 수, 독립항의 수, 명세서의 양(페이지 수), 패밀리의 수 또는 특허 분류 기호의 개수 2) 상태적인 관점으로 출원 중인 특허, 등록된 특허, 거절된 특허 또는 포기된 특허인 경우, 3) 과정적인 관점으로 심사청구 여부, 우선심사 활용 여부, 심판 청구 여부, 국내 우선권 주장 여부 또는 분할 출원 여부, 4) 주체적 관점에서 출원인의 수 또는 발명자의 수, 5) 해외적 관점에서 국제 출원 활용 여부, 해외 국내 단계 진입된 나라 수 등과 같은 요소들이 특허 문건의 가치에 영향을 줄 수 있을 것이다. 이러한 가중치 부여 요소들은 개별 문건의 서지 사항 정보, 명세서 내용 정보 또는 행정 처리 정보 등에서 입수할 수 있다. (1) The number of claims, the number of independent claims, the number of pages (number of pages), the number of families, or the number of patent classification symbols in quantitative terms. 2) Patents pending, registered, rejected 3) the number of applicants or inventors from a subjective point of view; and 5) the number of applicants from a subjective point of view. 5) Factors such as the use of international applications from an international perspective and the number of countries entering the foreign country stage may affect the value of patent documents. These weighting factors can be obtained from bibliographic information of individual documents, specification content information or administrative processing information.

통상적으로 청구항의 수가 많거나 명세서의 양이 많으면 발명의 내용이 넓고 다양할 가능성이 있어 발명에 시간과 비용이 많이 들었을 가능성이 높으며, 대리인들도 더 많은 비용을 청구할 가능성이 높다. 또한, 패밀리 수가 많거나, 특허 분류 기호의 개수가 많을수록 발명의 포괄적일 가능성이 높을 것이다. 그리고, 출원 중인 특허보다는 등록된 특허가 더 비중이 높을 가능성이 높으며, 심사 청구를 한 특허 출원 또는 우선 심사 청구를 한 출원이 그렇지 않은 특허 출원보다 중요할 가능성이 더 높을 것이다. 출원인이 2이 이상일 경우에는 공동 출원이므로 2 이상의 다른 주체간의 협동 작업의 결과이므로, 그렇지 않은 경우보다 통계적으로 중요할 가 능성이 있으며, 발명자가 여러 명일 경우에도 여러 사람이 함께 머리를 맞대어 발명한 것이므로 통계적으로 중요할 가능성이 높을 것이다. 또한, 통상적으로 해외 출원은 국내 출원보다 비용 지출이 월등히 크기 때문에, 해외 출원이 있거나, 해외 출원이 많을 경우에는 그 출원인이 상당한 재정적 지출을 하게 되기 때문에, 적어도 그 출원인에게 그 특허는 상대적으로 중요성이 높다고 판단하는 것이 합리적이다.In general, if the number of claims is large or the amount of specification is large, the contents of the invention may be wide and diverse, and it is highly likely that the inventions are time-consuming and costly, and agents are more likely to charge more. In addition, the greater the number of families or the greater the number of patent classification symbols, the more likely it is that the invention is comprehensive. In addition, it is more likely that the registered patent is more important than the pending patent, and it is more likely that the patent application filed for the examination request or the application filed for the priority examination is more important than the patent application filed for the non-patent application. If the applicant is more than 2, it is a joint application. Therefore, it is more likely to be statistically significant than the other cases because it is the result of cooperation between two or more subjects. Even if there are several inventors, It is likely to be statistically significant. In addition, since the overseas application is usually much more expensive than the domestic application, the applicant has a considerable financial expenditure when there is an overseas application or an overseas application, so that the patent is relatively important to the applicant It is reasonable to judge it to be high.

상기와 같은 관점에서 하나의 문건에 대해서 가중치 부여 요소의 존재 및 수치를 파악(존부만이 문제시 될 경우에는 존재하지 않을 경우 수치는 0, 존재하면 1이 될 것이며, 청구항 개수와 같은 요소들은 1이상의 수치값으로 나올 것이다.)할 수 있을 것이며, 그 존재 및/또는 수치에 따라 상기 가중치 부여 요소별로 가중치 점수를 부여할 수 있을 것이다. 하나의 문건에 대해 상기 1) 내지 5)의 각 관점을 이루는 세부적인 가중치 부여 요소의 존재 및 수치를 파악하고, 각 가중치 부여 요소별로 가중치 점수를 부여하는 것은 본 발명의 비용 지출 관점 가중치 전처리 모듈이 수행한다.From the above viewpoint, it is possible to grasp the existence and numerical value of a weighting factor for one document (if there is no problem only if there is no problem, the numerical value will be 0, if there is 1, And a weight score may be assigned to each of the weighting factors according to the existence and / or the numerical value. In order to grasp the existence and numerical values of the detailed weighting factors constituting each aspect of the above 1) to 5) with respect to one document and to assign a weighting score to each weighting factor, the cost expenditure perspective weighting pre- .

상기 비용 지출 관점 가중치 전처리 모듈이 가중치를 처리하는 방법은 도 79에 잘 나타나 있다. 상기 비용 지출 관점 가중치 전처리 모듈은 적어도 하나 이상의 특허 문건 정보를 입수하고(S2920), 카운팅 전처리 모듈을 이용하거나 특허 DB 또는 검색 인덱스를 통하여 입수된 특허 문건 정보에서 비용 지출과 관련성이 있는 적어도 하나 이상의 기설정된 카운팅 기준별로의 카운팅 결과값을 입수하며(S2930), 상기 특허 정보 처리 정책 DB에서 상기 특허 정보 상기 카운팅 기준별 가중치를 입수 한다(S2940). 이때, 상기 기설정된 카운팅 기준별에 대한 정보가 특허 정보 처리 정책 DB에 저장된 특허 정보 처리 정책을 참조한다. 예를 들면, 상기 비용 지출 관점의 상기 특허 정보 처리 정책 DB의 청구항의 관점에서는 (청구항 수*0.1 + 독립항수*0.3) 등과 같은 각 관점별에서 각 대상별로 가중치를 처리하는 정책을 저장하고 있다. 그리고, 상기 비용 지출 관점 가중치 전처리 모듈은 카운팅 기준별 가중치와 카운팅 기준별로의 카운팅 결과값으로 비용 관점 가중치값을 생성하며(S2940), 생성된 가중치 값을 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S2950).The manner in which the cost-expenditure-based weighting pre-processing module processes the weights is well illustrated in FIG. The cost expenditure perspective weight pre-processing module obtains at least one or more patent document information (S2920), and uses the counting preprocessing module or the patent document information obtained through the patent DB or the search index, A counting result value for each counting criterion is obtained (S2930), and a weight for each counting criterion is obtained from the patent information processing policy DB (S2940). At this time, information on the predetermined counting criteria is referred to the patent information processing policy stored in the patent information processing policy DB. For example, in view of the claims of the patent information processing policy DB from the viewpoint of cost expenditure, the policy for processing weights is stored for each subject in each viewpoint (claim count * 0.1 + independent count * 0.3). The cost-expenditure-based perspective weighting preprocessing module generates a cost-based weighted value as a result of counting for each counting criterion and each counting criterion (S2940), compares the generated weight value with the unique document number of the patent document information, In the search index (S2950).

인용 관점 가중치 전처리 모듈Citation perspective weighted pre-processing module

논문과 마찬가지로 다수에게 많이 인용되는 특허가 중요한 특허일 가능성이 높다. 이러한 인용 관점에서 개별 문건의 가중치를 처리하는 모듈을 인용 관점 가중치 전처리 모듈이라 한다. 인용의 관점을 세부적으로 나누면 1) 후방 인용(forward citation)의 관점에서 전체 후방 인용의 수, 후방 인용의 깊이 수, 1단계 후방 인용의 수, 후방 인용의 평균 기간, 후방 인용의 시간 대비 집중도, 2) 심사관 인용의 관점에서 심사관의 인용 여부 등과 같은 가중치 부여 요소가 있을 수 있다. 즉, 자신의 특허에 대한 인용이 직접 또는 간접적(자신을 인용한 그 특허를 타 특허가 인용함)으로 많을수록, 직접적으로 인용하는 1단계 인용이 많을수록, 후방 인용의 평균 기간이 적절한 범위 내에 들수록, 자신의 특허를 인용하는 자가 시기적으로 최근 기단에 집중될수록, 그 특허의 중요성이 클 가능성이 높을 것이다. 또한, 심사관이 심사 과정에서 인용된 특허가 그렇지 않은 특허보다 중요성이 높을 수 있다. 상기 인용 관점 가중치 전처리 모듈이 인용 관점에서 가중치를 전처리하는 방식은 상기 비용 지출 관점 가중치 전처리 모듈이 처리하는 방법과 대동소이하며, 도 80에 잘 나와 있다. As with the paper, it is highly probable that many patents cited by the majority are important patents. A module that processes the weights of individual documents from this citation perspective is referred to as a citation perspective weighted preprocessing module. The point of view of the quotation is divided in detail: 1) the number of total rear citation, the number of rear citation, the number of backward citation in the first stage, the average period of rear citation, 2) There may be weighting factors such as whether the examiner is cited in terms of the examiner's citation. In other words, the more the citation of the patent is directly or indirectly quoted (the patent cited by the patent is quoted by the other patent), the more the first step citation is directly cited, The more the authors cite their own patents, the more likely it is that they will become more important. In addition, patents quoted by the examiner during the review process may be of greater importance than those that are not. The way in which the citation-based view weight pre-processing module preprocesses the weights in terms of citation is similar to that of the above-mentioned cost expenditure perspective weight pre-processing module, as shown in FIG.

상기 인용 관점 가중치 전처리 모듈은 적어도 하나 이상의 특허 문건 정보를 입수하여(S3020), 카운팅 전처리 모듈을 이용하거나 특허 DB 또는 검색 인덱스를 통하여 입수된 특허 문건 정보에서 전방 인용 및/또는 후방 인용, 또는 심사 인용에 관련성이 있는 적어도 하나 이상의 기설정된 카운팅 기준별로의 카운팅 결과값을 입수하며(S3030), 상기 특허 정보 처리 정책 DB를 참조하여 카운팅 기준별 가중치를 입수하고(S3040), 카운팅 기준별 가중치와 카운팅 기준별로의 카운팅 결과값으로 인용 관점 가중치값을 생성하며(S3050), 생성된 가중치 값을 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S3060) 이때, 인용에 관한 상기 기설정된 카운팅 기준별에 대한 정보가 특허 정보 처리 정책 DB에 저장된 특허 정보 처리 정책을 참조한다. 예를 들면, 상기 인용 관점의 상기 특허 정보 처리 정책 DB에는 "후방 인용 카운팅 수*0.3"등과 같은 각 관점별에서 각 대상별로 가중치를 처리하는 정책을 저장하고 있다. The citation-based view weight pre-processing module obtains at least one or more patent document information (S3020), and uses the counting preprocessing module, the forward citation and / or the rear citation or the examination citation in the patent document information obtained through the patent DB or the search index (S3030), obtains a weight value for each counting criterion by referring to the patent information processing policy DB (S3040), and calculates a weight for each counting criterion and a counting criterion (Step S3050). The generated weight value is stored in the patent DB or the search index together with the unique document number of the patent document information (S3060). At this time, Information about the counting criteria is referred to the patent information processing policy stored in the patent information processing policy DB. For example, the patent information processing policy DB from the above cited view stores a policy for processing weights for each subject in each view such as "count of rear citation count * 0.3 ".

분쟁 관점 가중치 전처리 모듈Dispute perspective weighted pre-processing module

하나의 특허 문건과 관련해서 분쟁이 많을수록 그 특허의 중요성이 클 가능성이 높다. 특허 정보에서 파악할 수 있는 분쟁적 요소는 1) 무효 심판, 2) 이의신청, 3) 정보 제공, 4) 소극적 또는 적극적 권리범위 확인 심판 등이 있을 수 있다. 상기 분쟁적 관점에서의 가중치 부여 요소의 존재 및 수치를 파악하고, 각 가중치 부여 요소별로 가중치 점수를 부여하는 것은 본 발명의 분쟁 관점 가중치 전처리 모듈이 수행한다. The more disputes a patent has, the greater the importance of the patent. The disputed elements that can be identified in patent information are: 1) invalidation, 2) objection, 3) information provision, and 4) passive or active rights judgment. The dispute perspective weighting pre-processing module of the present invention performs the determination of the presence and the numerical value of the weighting factor from the disputed viewpoint and assigns a weighting score to each weighting factor.

상기 분쟁 관점 가중치 전처리 모듈이 가중치를 처리하는 방식은 도 81에 나와 있으며, 상기 비용 지출 관점 가중치 전처리 모듈 및 상기 인용 관점 가중치 전처리 모듈이 가중치를 처리하는 방식과 대동소이하다. The manner in which the dispute perspective weighting pre-processing module processes the weights is illustrated in FIG. 81, which is similar to the manner in which the cost expenditure perspective weight pre-processing module and the citation perspective weight pre-processing module process weights.

상기 분쟁 관점 가중치 전처리 모듈은 적어도 하나 이상의 특허 문건 정보를 입수하며(S3120), 카운팅 전처리 모듈을 이용하거나 특허 DB 또는 검색 인덱스를 통하여 입수된 특허 문건 정보에서 분쟁 관련성이 있는 적어도 하나 이상의 기설정된 카운팅 기준별로의 카운팅 결과값을 입수하고(S3130), 상기 특허 정보 처리 정책 DB를 참조하여 카운팅 기준별 가중치를 입수하며(S3140), 카운팅 기준별 가중치와 카운팅 기준별로의 카운팅 결과값으로 분쟁 관점 가중치값을 생성하고(S3150), 생성된 가중치 값을 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S3160).The dispute perspective weight pre-processing module obtains at least one patent document information (S3120), and uses the counting preprocessing module or the patent document information obtained through the patent DB or the search index to generate at least one predetermined counting criterion (S3130). Then, the weighting value for each counting criterion is obtained by referring to the patent information processing policy DB (S3140). The weighting value for each counting criterion and the counting result value for each counting criterion (S3150), and stores the generated weight value in the patent DB or the search index together with the unique document number of the patent document information (S3160).

즉, 상기 분쟁 관점 가중치 전처리 모듈이 가중치를 처리하는 방식은 분쟁의 수에 대한 카운팅을 수행하고, 각 카운팅 수에 대한 가중치를 특허 정보 처리 정책 DB로부터 입수하여 분쟁 관점 가중치값을 생성하는 것을 본질로 한다.That is, the dispute perspective weight pre-processing module processes the weights by counting the number of disputes and obtaining the weight for each count number from the patent information processing policy database to generate a dispute point weight value do.

집중 관점 가중치 전처리 모듈Concentration perspective weighting preprocessing module

특정 시점을 기준으로 여러 주체가 집중하고 있는 기술 분야가 중요한 기술 분야일 가능성이 높을 것이다. 하나의 출원인이 여러 가지 기술 분야에 골고루 출원하는 경우는 거의 없으며, 자신이 중요하다고 생각하는 곳, 기술 투자가 집중되 는 곳에 다수를 출원하고, 중요도가 떨어진다고 생각되는 곳에서는 방어적으로 소수를 출원하는 것이 대부분이다. 그러므로, 하나의 출원이 속하는 기술 분야가 여러 출원인이 출원을 많이 하여 다른 기술 분야 대비 상대적으로 1) 출원수가 많거나, 출원의 증가율 또는 증가 속도 가 높거나, 3) 출원인의 수가 많거나, 4) 점유율의 변동이 기설정된 범위를 벗어나거나 하는 경우라면 그 기술 분야는 중요한 기술 분야일 가능성이 높을 것이다. 여러 출원인이 집중하는 분야는 상기 기술 매력도(technology attractiveness), 증가율, 증가 속도, 점유율의 변동 등과 같은 분석 요소로 측정 가능할 것이다. 이때 본 문단에서의 기술 분야는 IPC, USPC, FI, FT, ECLA 등과 같은 특허 문건에 포함되어 있는 계층화된 특허 분류 기호를 각 계층 단위에서 파악한 것이 바람직할 것이다. 물론, 하위 특허 분류 기호를 자동으로 포함하여 각 분석 요소별 수치값을 계산하는 것이 더욱 바람직할 것이다. 즉, IPC로 H04B 7/26가 있을 때, 이 IPC에 대한 각종 분석 요소별 수치값을 계산할 수도 있으며, H04B 7/00의 관점에서 각종 분석 요소별 수치값을 계산할 수도 있을 것이다. 이때, 상기 H04B 7/26의 계층적으로 하위 특허 분류 기호에 속하는 다른 특허 분류 기호들도 포함되어 계산되는 것이 더욱 바람직할 것이다. 집중의 관점에서 가중치 부여 요소의 존재 및 수치를 파악하고, 각 가중치 부여 요소별로 가중치 점수를 부여하는 것은 본 발명의 집중 관점 가중치 전처리 모듈이 수행한다.It is highly probable that the technology field in which various subjects are concentrated based on a specific point in time is an important technology field. One applicant rarely evenly applies to various technical fields. He or she applies to a place where he or she thinks it is important, places where technology investment is concentrated, and places a defensive minority in defenseless places Most of them do. Therefore, the technical field to which one application belongs is that many applicants have a lot of applications compared to other technical fields, 1) the number of applications is high, the rate of increase or increase of application is high, 3) If the change in market share is out of the predetermined range, then the technology field is likely to be an important technology field. The areas in which the various applicants concentrate can be measured by such analytical factors as technology attractiveness, rate of increase, rate of increase, change in market share. At this time, it is desirable that the technical field in this paragraph grasp the layered patent classification symbols included in patent documents such as IPC, USPC, FI, FT, ECLA, etc. in each layer unit. Of course, it would be more desirable to automatically include the subpopulation classifier to calculate the numerical value for each analysis element. In other words, when IPC is H04B 7/26, it is possible to calculate the numerical values of various analysis elements for this IPC, and also to calculate the numerical values of various analysis elements from the viewpoint of H04B 7/00. At this time, it is more preferable that other patent classification symbols belonging to the sub-patent classification symbol H04B 7/26 are hierarchically included. From the viewpoint of concentration, the concentration-weighted weight pre-processing module of the present invention performs the identification of the presence and the numerical value of the weighting factor and assigns a weight score to each weighting factor.

상기 집중 관점 가중치 전처리 모듈이 가중치를 처리하는 방식은 상기 비용 지출 관점 가중치 전처리 모듈 및 상기 인용 관점 가중치 전처리 모듈이 가중치를 처리하는 방식과 대동소이하다. 즉, 각 개별 문건에 대한 서지 정보로 출원인/특허 권자의 그 기술분야에 대한 집중도/활동도/점유율 등과 같은 집중의 의미를 포함하는 특허 정보 분석 지표의 값을 측정하고, 각 값에 가중치를 특허 정보 처리 정책 DB로부터 입수하여 집중 관점 가중치값을 생성한다.The manner in which the convergence-based weighting pre-processing module processes the weights is contrary to the manner in which the cost-expenditure-based weighting preprocessing module and the citation-based weighting preprocessing module process the weights. That is, the bibliographic information for each individual document measures the value of the patent information analysis index including the concentration of the applicant / patentee in terms of concentration / activity / market share in the technical field, and assigns a weight to each value And obtains the concentrated view weight value from the information processing policy DB.

상기 집중 관점 가중치 전처리 모듈이 가중치를 처리하는 방식은 도 82에 나와 있으며, 상기 비용 지출 관점 가중치 전처리 모듈 및 상기 인용 관점 가중치 전처리 모듈이 가중치를 처리하는 방식과 대동소이하다. The manner in which the centralized point weighting pre-processing module processes the weights is illustrated in FIG. 82, which is similar to the manner in which the cost-expenditure-based weighting preprocessing module and the citation-based weighting preprocessing module process the weights.

상기 집중 관점 가중치 전처리 모듈이 적어도 하나 이상의 특허 문건 정보를 입수하며(S3220), 특허 문건 DB에서 출원인 정보, 특허 기술 분류 정보를 입수하며(S3230), 입수된 출원인 정보와 특허 기술 분류 정보를 값으로 하여 특허 기술 분류 기호 DB를 참조하여 특허 기술 분류 기호 체계상에서 입수된 특허 기술 분류 또는 적어도 하나 이상의 상위 특허 기술 분류 기호에 대한 출원인의 집중도/활동도/점유율 중 적어도 하나 이상의 기준으로 그 값을 계산하며(S3240), 각 기준별 집중도/활동도/점유율에 대한 가중치 정보를 상기 특허 정보 처리 정책 DB로부터 입수하며(S3250), 상기 기준별 가중치와 집중도 결과값으로 집중 관점 가중치값을 생성하며(S3260), 생성된 가중치 값을 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S3270).The centralized point weighting preprocessing module obtains at least one patent document information (S3220), obtains the applicant information and the patent technical classification information from the patent document DB (S3230), and obtains the applicant information and the patent technical classification information as values Referring to the patent technical classification code DB to calculate the value based on at least one of the patent technical classification obtained on the patent technical classification code system or the applicant's concentration / activity / occupancy rate of at least one higher patent technical classification code (S3240), weight information on the concentration / activity / occupancy ratio of each criterion is obtained from the patent information processing policy DB (S3250), and a concentrated perspective weight value is generated from the criterion weight value and the concentration result value (S3260) , The generated weight value is stored in the patent DB or the search index together with the unique document number of the patent document information (S3270).

상기 집중 관점 가중치 전처리 모듈은 출원인이 아닌 발명자 단위에서 가중치를 처리할 수도 있으며, 본 가중치를 처리하는 방식은 도 83에 나와 있다. 상기 집중 관점 가중치 전처리 모듈이 적어도 하나 이상의 특허 문건 정보를 입수하며(3320), 특허 문건 정보에서 발명자 정보, 특허 기술 분류 정보를 입수하 며(S3330), 입수된 발명자 정보와 특허 기술 분류 정보를 값으로 하여 특허 기술 분류 기호 DB를 참조하여 특허 기술 분류 기호 체계상에서 입수된 특허 기술 분류 또는 적어도 하나 이상의 상위 특허 기술 분류 기호에 대한 발명자의 집중도를 적어도 하나 이상의 기준으로 계산하고(S3340), 각 기준별 집중도에 대한 가중치 정보를 입수하며(S3350), 기준별 가중치와 집중도 결과값으로 집중 관점 가중치값을 생성하고(S3360), 생성된 가중치 값을 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S3370).The concentrated-point weighting preprocessing module may process weights in units of inventors rather than applicants, and the manner in which the weights are processed is shown in FIG. The centralized point weighting preprocessing module obtains at least one or more patent document information (3320), obtains the inventor information and the patent technology classification information from the patent document information (S3330), obtains the inventor information and the patent technology classification information , Referring to the patent technical classification code DB to calculate the concentration of the inventor on the patent technology classification or at least one or more higher patent technical classification codes obtained on the patent technical classification code system in step S3340, (Step S3350). The centralized weight value is generated from the weight value of the reference and the concentration result value (S3360), and the generated weight value is added to the patent database or the search index (S3370).

주체 단위 가중치 전처리 모듈Subject-unit weight pre-processing module

하나의 특허 문건이 있을 때, 그 특허 문건에 개입되는 주체가 중요한 주체일 경우, 그 주체가 출원한 특허가 중요한 특허일 가능성이 높다. 이때 어떤 주체가 더 중요한 주체일 수 있을 가를 판단하는 것이 관건이다. 하나의 특허 문건 포함되는 주체는 출원인, 발명자 및 대리인이 있을 수 있다. 하나의 특허 문건에 대하여 상기 주체 관점에서 그 문건의 가중치를 전처리하는 기능을 수행하는 것을 주체 단위 가중치 전처리 모듈이라 한다. 도 4에서 알 수 있듯이 상기 주체 단위 가중치 전처리 모듈은 주체의 종류에 따라 출원인 단위 가중치 전처리 모듈, 발명자 단위 가중치 전처리 모듈, 대리인 단위 가중치 전처리 모듈이 있다.When there is a patent document, if the subject involved in the patent document is an important subject, the patent applied by the subject is likely to be an important patent. The key is to judge which subject is more important. A subject that includes a patent document may be an applicant, an inventor, and an agent. A preprocessing module for subjecting a patent document to the function of preprocessing the weight of the document from the subject point of view. As shown in FIG. 4, the subject-based weight pre-processing module includes an applicant unit weight pre-processing module, an inventor unit weight pre-processing module, and an agent unit weight pre-processing module, depending on the subject.

출원인 단위 가중치 전처리 모듈Applicant unit weight pre-processing module

특정 문건에 포함되어 있는 기술 분야가 상기 문건의 출원인이 집중하는 기술 분야인 경우라면 그 출원의 중요성이 클 가능성이 높을 것이다. 특정 문건의 출원인이 집중하는 분야는 그 기술 분야로의 집중율, 점유율, AI 등의 분석 요소로 측정 가능할 것이다. 상기와 같은 분석 요소의 관점에서 가중치 부여 요소의 수치값을 파악하고, 각 가중치 부여 요소별로 가중치 점수를 부여하는 것은 본 발명의 출원인 단위 가중치 전처리 모듈이 수행한다.If the technical field included in the specific document is the technical field to which the applicant of the document is concentrated, the importance of the application is likely to be significant. The field in which the applicant of a specific document focuses can be measured by analysis factors such as concentration rate, market share, and AI in the technology field. The unit weight pre-processing module, which is an applicant of the present invention, performs numerical values of the weighting elements from the viewpoint of the above-described analysis elements and assigns weighting scores to the respective weighting elements.

상기 출원인 단위 가중치 전처리 모듈이 가중치를 처리하는 방식은 도 84에 나와 있다. 상기 출원인 단위 가중치 전처리 모듈이 출원인 단위로 가중치를 처리하기 위한 기준 정보를 입수하고(S3420), 기준에 부합하는 출원인 명의의 문건을 대상으로 적어도 하나 이상의 문건 단위 가중치 전처리 모듈이 계산한 결과값을 입수하며(S3430), 문건 단위 가중치 전처리 모듈별 가중치 정보를 상기 특허 정보 처리 정책 DB로부터 입수하고(S3440), 각 문건 단위 가중치 전처리 모듈의 가중치와 각 결과값으로 입수한 기준별 출원인 단위 가중치값을 생성하며(S3450), 생성된 가중치 값을 기준과 함께 또는 독립적으로 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S3460).The manner in which the Applicant's unit weight pre-processing module processes the weights is shown in FIG. The applicant's unit weight pre-processing module obtains the reference information for processing the weights in units of applicants (S3420), and obtains the result values calculated by at least one document unit weight preprocessing module for the documents of applicant's name conforming to the standard (S3430), weight information for each document unit weight preprocessing module is obtained from the patent information processing policy DB (S3440), and a weight value of each document unit weight preprocessing module and an applicant unit weight value obtained by each result value are generated (S3450). The created weight value is stored in the patent DB or the search index together with the reference or independently of the patent document information (S3460).

발명자 단위 가중치 전처리 모듈Inventor unit weight pre-processing module

특정 문건에 포함되어 있는 기술 분야가 상기 문건의 발명자가 집중하는 기술 분야인 경우라면 그 출원의 중요성이 클 가능성이 높을 것이다. 특정 문건의 발명자가 집중하는 분야는 그 기술 분야로의 그 발명자 기준에서의 집중율, 점유율, AI(공식은 출원인 기준 공식에서 발명자로 대체했을 때와 동일하다) 등의 분석 요소로 측정 가능할 것이다. 상기와 같은 분석 요소의 관점에서 가중치 부여 요소의 수치값을 파악하고, 각 가중치 부여 요소별로 가중치 점수를 부여하는 것은 본 발 명의 발명자 단위 가중치 전처리 모듈이 수행한다.If the technical field included in a specific document is a technical field concentrated by the inventor of the document, the importance of the application will be high. The field in which the inventor of a particular document focuses may be measured by an analysis factor such as concentration ratio, occupancy, and AI in the inventor's standard in the technology field (the formula is the same as when the inventor was substituted in the applicant reference formula). The inventor's unit weight preprocessing module of the present invention grasps the numerical value of the weighting factor from the viewpoint of the above-described analysis factor and assigns a weighting score to each weighting factor.

상기 발명자 단위 가중치 전처리 모듈이 가중치를 처리하는 방식은 도 85에 나와 있다. 상기 발명자 단위 가중치 전처리 모듈이 발명자 단위로 가중치를 처리하기 위한 기준 정보를 입수하고(S3520), 기준에 부합하는 발명자 명의의 문건을 대상으로 적어도 하나 이상의 문건 단위 가중치 전처리 모듈이 계산한 결과값을 입수하며(S3530), 문건 단위 가중치 전처리 모듈별 가중치 정보를 상기 특허 정보 처리 정책 DB로부터 입수하며(S3540), 각 문건 단위 가중치 전처리 모듈의 가중치와 각 결과값으로 입수한 기준별 발명자 단위 가중치값을 생성하며(S3550), 생성된 가중치 값을 기준과 함께 또는 독립적으로 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S3560). 상기 가중치 정보에는 "과거 전체 출원대비 등록율*0.1+해외 출원 패밀리 수*0.5"과 같은 정책이 포함되어 있을 수 있다.The manner in which the inventor's unit weight pre-processing module processes the weights is shown in FIG. The inventor's unit weight pre-processing module obtains the reference information for processing the weights in units of inventors (S3520), obtains the result values calculated by at least one document unit weight preprocessing module for documents of the inventor's name conforming to the standard (S3530), weight information for each document unit weight preprocessing module is obtained from the patent information processing policy DB (S3540), and a weight value of each document unit weight preprocessing module and a criterion value (S3550). The created weight value is stored in the patent database or the search index together with the reference or the unique document number of the patent document information (S3560). The weight information may include a policy such as "registration rate of past total application * 0.1 + number of overseas filing families * 0.5".

대리인 단위 가중치 전처리 모듈Agent unit weight pre-processing module

특정 문건에 포함되어 있는 기술 분야가 상기 문건의 대리인이 집중하는 기술 분야인 경우라면 그 특허 명세서의 완성도는 그렇지 않은 대리인이 작성하는 경우보다 높을 것이다. 그러므로, 특정 문건의 대리인이 집중하는 분야는 그 기술 분야로의 그 대리인 기준에서의 집중율, 점유율, AI(공식은 출원인 기준 공식에서 대리인으로 대체했을 때와 동일하다) 등의 분석 요소로 측정 가능할 것이다. 상기와 같은 분석 요소의 관점에서 가중치 부여 요소의 수치값을 파악하고, 각 가중치 부여 요소별로 가중치 점수를 부여하는 것은 본 발명의 대리인 단위 가중치 전처리 모듈이 수행한다. 상기 대리인 단위의 가중치는 출원인 단위의 가중치나 상기 발명자 단위 가중치보다 중요도가 낮을 것이다.If the technical field included in a specific document is a technical field that the agent of the document focuses on, the completeness of the patent specification will be higher than that of a non-representative. Therefore, the field in which the agent of a particular document focuses can be measured by analysis factors such as concentration, share, and AI (the formula is the same as when it was replaced by the agent in the applicant reference formula) will be. The agent-unit weight pre-processing module of the present invention performs the numerical value of the weighting element from the viewpoint of the above-described analysis element and assigns a weighting score to each weighting element. The weight of the agent unit may be lower than the weight of the applicant unit or the weight of the inventor unit.

상기 대리인 단위 가중치 전처리 모듈이 가중치를 처리하는 방식은 도 86에 나와 있다. 상기 대리인 단위 가중치 전처리 모듈이 대리인 단위로 가중치를 처리하기 위한 기준 정보를 입수하며(S3620), 기준에 부합하는 대리인 명의의 문건을 대상으로 적어도 하나 이상의 문건 단위 가중치 전처리 모듈이 계산한 결과값을 입수하며(S3630), 문건 단위 가중치 전처리 모듈별 가중치 정보를 상기 특허 정보 처리 정책 DB로부터 입수하며(S3640), 각 문건 단위 가중치 전처리 모듈의 가중치와 각 결과값으로 입수한 기준별 대리인 단위 가중치값을 생성하며(S3650), 생성된 가중치 값을 기준과 함께 또는 독립적으로 특허 문건 정보의 고유 문건 번호와 함께 특허 DB나 검색 인덱스에 저장한다(S3660). 상기 가중치 정보에는 "출원대비 등록율*0.3"과 같은 정책이 포함되어 있을 수 있다.The manner in which the proxy unit weight pre-processing module processes the weights is shown in FIG. The proxy unit weight pre-processing module obtains the reference information for processing the weight in the proxy unit (S3620), and obtains the result value calculated by at least one document unit weight preprocessing module for the document of the proxy name matching the standard (S3630), weight information for each document unit weight preprocessing module is obtained from the patent information processing policy DB (S3640), and weight values of each document unit weight preprocessing module and a representative agent unit weight value obtained as each result value are generated (S3650). The created weight value is stored in the patent DB or the search index together with the reference or the unique document number of the patent document information (S3660). The weight information may include a policy such as "registration rate * 0.3 relative to application ".

본 발명의 문건 단위 가중치 전처리 모듈 및 주체 단위 가중치 전처리 모듈은 상기와 같은 적어도 하나 이상의 각 가중치 전처리 모듈이 획득한 가중치 부여 요소에 대하여 기설정된 가중치를 부여하여 문건 단위의 가중치를 계산해 낸다. 이때, 각 관점의 가중치 부여 모듈 각각 또는 각 주체별 가중치 부여 모듈 각각은 자신이 획득한 가중치 부여 요소에 대하여 기설정된 가중치를 부여하여 자신의 관점에서의 문건 단위의 가중치를 계산해 낼 수 있다. 상기의 각 가중치들은 관리자 또 는 사용자의 입력에 따라 다르게 부여될 수 있음은 물론이다 하겠다.The document unit weight pre-processing module and the subject unit weight pre-processing module according to the present invention assign a predetermined weight to the weighting factors acquired by at least one or more weight pre-processing modules as described above, and calculate the weights of document units. At this time, each of the weighting modules of each viewpoint or each of the weighting modules for each subject can give a predetermined weight to the weighting factor acquired by itself, and calculate the weight of the document unit from the viewpoint of the user. It will be appreciated that each of the above weights may be assigned differently depending on the input of an administrator or a user.

주체 마스트 DB 생성 모듈(301-4)The subject mast DB generation module 301-4

주체의 종류Type of Subject

특허 문건에 대하여 주체는 크게 1) 출원인, 2) 발명자, 3) 대리인이 있을 수 있다. 출원인에는 기업 등과 같은 조직과 개인이 있을 수 있다. 본 발명에서 상기 주체를 저장하고 관리하는 DB를 주체 마스트 DB(204)라고 하며, 상기와 같은 주체 마스트 DB(204)를 생성하는 모듈을 주체 마스트 DB 생성 모듈(301-4)이라 한다.As for the patent document, the subject may be 1) an applicant, 2) an inventor, and 3) an agent. The applicant may be an organization and an individual such as an enterprise. In the present invention, a DB for storing and managing the subject is referred to as a subject mast DB 204, and a module for generating the subject mast DB 204 as described above is referred to as a subject mast DB generation module 301-4.

상기 주체 마스트 DB(204)에는 대표명화 된 출원인 명칭 DB, 대표명화 된 발명자 명칭 DB, 대표명화 된 대리인 명칭 DB를 더 포함할 수 있으며, 이와는 독립적으로 기업 정보 DB를 더 포함하고 있을 수 있다. 이들에 대해서 각각 설명한다.The subject matter mast DB 204 may further include a representative name DB, a well-known inventor name DB, and a representative name of a representative agent DB, and may further include an enterprise information DB independently. These will be described separately.

주체 마스트 DB 생성 모듈(301-4)의 구성The configuration of the subject mast DB generation module 301-4

본 발명의 주체 마스트 DB 생성 모듈(301-4)에는 대표명화 전처리 모듈(301-4-1) 포함되어 있다. 상기 대표명화 전처리 모듈은 출원인 대표명화를 처리하는 출원인 대표명화 전처리 모듈(301-4-1-1), 발명자를 대표명화 처리하는 발명자 대표명화 전처리 모듈(301-4-1-2), 대리인을 대표명화 처리하는 대리인 대표명화 전처리 모듈(301-4-1-3)중 어느 하나 이상을 포함하고 있다. 한편, 본 발명의 주체 마스트 DB 생성 모듈(301-4)에는 기업 정보 DB 생성 모듈이 더 포함되어 있을 수 있다. 기업 정보 DB에는 기업의 재무 정보, 회사 현황 정보, 회계 정보, 주생산품, 대표자, 법인등록번호, 사업자 번호, 웹사이트, 전화번호, 팩스 번호 중 어느 하나 이상이 더 포함되어 있을 수 있다. 상기 주체 마스트 DB(204)에는 세부적으로 기업 정보 DB(204-1), 대표명화된 출원인 명칭 DB(204-2), 대표명화된 발명자 명칭 DB(204-3), 대표명화된 대리인 명칭 DB(204-4), 및/또는 조직 정보 DB(204-5)을 더 포함하고 있을 수 있다.The subject mast DB generating module 301-4 of the present invention includes the representative name pre-processing module 301-4-1. The representative linguistic pre-processing module includes a representative linguistic pre-processing module (301-4-1-1), an inventor representative linguistic preprocessing module (301-4-1-2) for processing the inventor's representative linguistic name, an agent And representative representative linguistic pre-processing module 301-4-1-3 for performing representative linguistic processing. Meanwhile, the subject mast DB generating module 301-4 of the present invention may further include a company information DB generating module. The corporate information DB may further include at least one of financial information of company, information of company status, accounting information, main product, representative, corporation registration number, business number, website, telephone number and fax number. The subject matter mast DB 204 is further provided with an enterprise information DB 204-1, a representative name DB 204-2, a representative name DB 203-3, a representative name DB 204-4, and / or an organization information DB 204-5.

대표 어구 추출 전처리 모듈Representative phrase extraction pre-processing module

대표 어구의 개념Concept of representative phrases

이어서, 본 발명의 대표 어구 추출 모듈에 대해서 설명한다. 먼저 대표 어구의 개념에 대해서 설명한다. 대표 어구(Representing Phrase)란 상기 대표 어구를 통해서, 특정한 문건 전체 또는 상기 특정한 문건의 일부분을 대표할 수 있는 단어 또는 2 이상의 단어로 구성되는 어구를 말한다. 본 발명의 대표 어구가 가지는 기본적인 속성은 주어지는 기설정된 대표 어구 추출 전체 문건 집합에서는 상당히 희박하게 나오면서, 주어지는 기설정된 대표 어구 추출 목표 문건 집합(1개의 대표 어구 추출 목적 문건을 포함한다) 또는 대표 어구 추출 목적 문건의 일부분 또는 기설정된 대표 어구 추출 목표 문건 집합의 각 문건의 공통된 일부분에서는 기설정된 기준 이상으로 빈번하게 나오는 것이다. Next, the representative word extraction module of the present invention will be described. First, the concept of representative phrases is explained. A Representing Phrase is a word consisting of two or more words that can represent a whole or a part of a specific document through the representative phrase. The basic attributes of the representative phrases of the present invention are a set of predetermined representative phrases extracted from a given set of representative phrases and a set of representative phrases (including one representative phrase extraction objective document) A part of the objective document or a common representative part of each document of the predetermined representative phrase extraction target document set frequently comes out above the predetermined standard.

상기 기설정된 대표 어구 추출 전체 문건 집합의 종류에는 예시적으로 1) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호로 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 2) 특정 국가 DB에서 특정 출원인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 3) 특정 국가 DB에서 특정 출원인의 특허 문건에 발명자로 포함된 특정 발명자 명의(즉, 출원인 명의 and 발명자 명의)에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 4) 특정 국가 DB에서 특정 대리인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 5) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호 및 특정 출원인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 6) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호, 특정 출원인 명의 및 특정 발명자 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 7) 상기 특정 국가 DB에서 특정 출원인 명의 및 특정 대리인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 8) 특정 국가의 모든 특허 문건 집합, 9) 적어도 2 이상의 국가의 모든 특허 문건 집합, 10) 상기 1) 내지 9)에 기설정된 기간 단위로 특정되는 특허 문건 집합 등이 그 예가 될 것이다.The set of predefined representative phrase extraction whole document sets includes, for example, 1) a set of patent documents generated by a method of specifying a set of patent documents with a specific patent classification symbol on a specific patent classification system in a specific country DB, 2) (3) the name of the specific inventor (ie, the name of the applicant and the inventor) included in the patent document of the specific applicant in the specific country DB as the inventor 4) a set of patent documents generated by a method of specifying a set of patent documents by a specific agent name in a specific country DB, 5) a set of patent documents generated by a method of specifying a set of patent documents by a specific country DB A specific patent classification symbol on a specific patent classification system and a patent document set by a specific applicant 6) a patent document generated by a method of specifying a set of patent documents by the name of a specific applicant and a specific inventor by a specific patent classification symbol in a specific patent classification system in a specific country DB, 8) a set of patent documents generated by a method of specifying a set of patent documents by the name of a specific applicant and a specific agent in the specific country DB, 9) a set of all patent documents of a specific country, 9) A set of all patent documents, and 10) a set of patent documents specified in units of a predetermined period in the above 1) to 9).

한편, 상기 대표 어구 추출 목표 문건 집합은 상기 대표 어구 추출 전체 문건 집합의 임의의 부분 집합(문건 1개에서부터 적어도 하나 이상의 특정한 속성(예를 들면 IPC 등의 특정한 특허 분류 기호, 출원인, 기간, 국가, 대리인 또는 발명자 또는 이들의 하나 이상의 조합 등)이 공통되는 문건들로 이루어진 부분 집합)이 될 수 있다.The representative phrase extraction target document set includes at least one or more specific attributes (for example, a specific patent classification code such as IPC, applicant, period, country, etc.) from an arbitrary subset of the representative phrase extraction whole document set An agent or inventor, or a combination of one or more of these) may be a subset of documents in common.

다.All.

상기 대표 어구 추출 목표 문건 집합(타겟 집합을 포함하는 super set)은 상기 대표 어구 추출 전체 문건 집합(타겟 집합)과 상대적인 개념으로, 상기 대표 어구 추출 전체 문건 집합의 부분 집합이면 충분하나, 가급적 그 부분 집합의 크기는 작을수록 좋다. 상기 대표 어구 추출 목표 문건 집합의 예로는 1) 특정 출원 1개, 특정 기업의 모든 출원, 특정 발명자의 출원, 특정 발명자의 특정 IPC의 출원, 특정 기업의 특정 IPC의 모든 출원, 특정 IPC의 특정 연도의 모든 출원 등이 그 예가 될 것이다.The representative set extraction target document set (the super set including the target set) is a concept relative to the representative document extraction whole document set (target set), and the subset of the representative document extraction whole document set is sufficient. Preferably, The smaller the size of the set, the better. Examples of the representative phrase extraction target document set include 1) a specific application, all applications of a specific company, an application by a specific inventor, a specific IPC application by a specific inventor, all applications of a specific IPC of a specific company, All of which are examples.

한편, 대표 어구 추출 목적 문건의 일부분의 예로는 특허 청구 범위, 특허 청구 범위 중의 독립항, 특허 청구 범위 중의 종속항, 발명의 효과나 산업상 이용 가능성, 요약 등이 그 예가 될 것이다. 한편, 기설정된 대표 어구 추출 목표 문건 집합의 각 문건의 공통된 일부분은 상기 대표 어구 추출 목표 문건 집합을 구성하는 모든 문건의 전술한 특허 청구 범위 등이 될 것이다.Examples of a part of a representative phrase extraction objective document include an example of a patent claim, an independent item in the claims, a subordinate item in the claims, an effect of the invention, industrial applicability, and a summary. On the other hand, a common part of each document of the predetermined representative phrase extraction target document set will be the above-mentioned patent claim range of all the documents constituting the representative phrase extraction target document set.

상기 대표 어구는 1) 명사만으로 구성되는 어구, 2) 명사와 동사만으로 구성되는 어구, 3) 형용사나 부사까지 포함되어 형성되는 어구가 있을 수 있다. 상기 대표 어구의 길이는 단어수를 기준으로 할 때 1~5개가 바람직하며, 2~3개가 더욱 바람직하다. 통상적으로 기술적 개념이 1개의 단어로 생성되기도 하지만, 많은 경우 2~3개의 단어로 기술적 개념 또는 기술적 작용 또는 효과를 구성하는 것이 많기 때문이다.The representative phrases include 1) a phrase consisting of only nouns, 2) a phrase consisting of only nouns and verbs, and 3) a phrase consisting of adjectives and adverbs. The length of the representative phrase is preferably 1 to 5, more preferably 2 to 3, based on the number of words. Typically, a technical concept is created in one word, but in many cases it is often the case that a technical concept or a technical action or effect consists of two or three words.

그리고, 상기 대표 어구 추출 목표 문건 집합은 상기 대표 어구 추출 전체 문건 집합과 상대적인 개념으로, 상기 대표 어구 추출 전체 문건 집합의 부분 집합이면 충분하나, 가급적 그 부분 집합의 크기는 작을수록 좋다. 상기 대표 어구 추출 목표 문건 집합의 예로는 1) 특정 출원 1개, 특정 기업의 모든 출원, 특정 발명자의 출원, 특정 발명자의 특정 IPC의 출원, 특정 기업의 특정 IPC의 모든 출원, 특정 IPC의 특정 연도의 모든 출원 등이 그 예가 될 것이다.The representative phrase extraction target document set is a relative concept to the representative phrase extraction whole document set. The subset of the representative phrase extraction whole document set is sufficient. Preferably, the smaller the subset size is, the better. Examples of the representative phrase extraction target document set include 1) a specific application, all applications of a specific company, an application by a specific inventor, a specific IPC application by a specific inventor, all applications of a specific IPC of a specific company, All of which are examples.

이어, 상기 대표 어구 추출 과정을 상세히 설명한다. 상기 대표 어구 추출은 본 발명의 대표 어구 추출 전처리 모듈이 수행한다. 상기 본 발명의 대표 어구 추출 전처리 모듈에는 1) 각 언어별 형태소 분석 엔진(analyzer), 2) 어구 생성 엔진, 3) 어구별 카운팅 엔진, 4) 시소러스 사전/특허 기술 용어 사전 등을 비롯한 각종 사전 데이터베이스, 5) 대표 어구 추출 엔진을 포함한다. 상기 대표 어구 추출 전처리 모듈의 구성은 도 8에 잘 나타나 있다. 대표 어구를 추출해 내는 알고리즘을 포함하고 있는 대표 어구 추출 전처리 엔진(3710), 대표 어구를 추출하는 기준이 되는 정책에 관한 정보를 포함하고 있는 대표 어구 추출 정책 DB(3730), 대표 어구가 추출되었을 때 각 대표어구에 대한 빈도값 등의 각종 카운팅 또는 계산 정보 및 대표 어구가 어느 문건의 어느 위치(필드 등)에서 나왔는지에 관한 정보를 포함하고 있는 대표어구-빈도-출원번호 대응 DB(3750)가 포함되어 있다. 한편, 동의어나 유의어의 처리를 위한 시소러스 DB(3770) 및 2종 이상의 언어의 처리를 위한 번역 시스템을 활용하거나, 기번역된 사전을 이용하여 대표 어구에 대한 번역 정보를 포함하고 있는 대표 어구 번역 DB(3790)가 더 포함되어 있을 수 있다. 이하, 본 발명의 대표 어구 추출 전처리 엔진에 대해 더욱 상세하게 설명한다. 상기 대표 어구 추출 전처리 엔진은 문장이나 특정 필드의 문단 등이 입력되었을 때, 용어(terminology)를 추출(extraction)하는 기능을 수행한다.Next, the representative phrase extraction process will be described in detail. The representative phrase extraction is performed by the representative word extraction preprocessing module of the present invention. The representative word extraction preprocessing module of the present invention includes various dictionary databases including 1) a morphological analysis engine for each language, 2) a phrase generation engine, 3) a classification counting engine, and 4) a thesaurus dictionary / , 5) a representative phrase extraction engine. The configuration of the representative phrase extraction preprocessing module is well illustrated in FIG. A representative phrase extraction preprocessing engine 3710 including an algorithm for extracting representative phrases, a representative phrase extraction policy DB 3730 including information on a policy as a basis for extracting representative phrases, A representative word-frequency-application number correspondence DB 3750 including information on various counting or calculation information such as the frequency value for each representative word, and information on which position (field, etc.) of the document the representative word comes from . On the other hand, a thesaurus DB 3770 for processing synonyms and synonyms, and a translation system for processing two or more languages, or a representative phrase translation DB including translation information for representative phrases using a previously translated dictionary (3790) may be further included. Hereinafter, the representative phrase extraction preprocessing engine of the present invention will be described in more detail. The representative phrase extraction preprocessing engine performs a function of extracting a terminology when a sentence or a paragraph of a specific field is input.

상기 본 발명의 어구 생성 엔진은 어구 단위로 1~5개(바람직하게는 2~3개)의 어구 배열을 생성한다. 예를 들어, 하나의 특허 문건에 포함된 하나의 문장이 입력되었을 때, 상기 출력되는 어구들을 순서대로 a b c d e f, g라고 하자. 이때, 상기 어구 생성 엔진의 어구 생성 단위가 3단어일 경우, 상기 어구 생성 엔진은 abc, bcd, cde, def, efg 라는 어구별 어구를 생성한다. 상기 어구 생성 엔진은 주어지는 문건 또는 문건 부분에 대하여 문장 단위로 어구를 생성한다. 문건 부분에 대하여 어구를 생성하는 것의 일례로는 특허 청구 범위에 나오는 문장들을 대상으로 하여 상기 어구를 생성할 수 있을 것이다. 상기 배열의 생성 방법은 1) 하나의 문장 단위로 처리하되, 하나의 문장에서 추출된 어구가 n개 있을 때, 상기 n개의 어구에 대한 nC2, nC3, nC4, nC5로 하여 어구를 생성하는 방법(예를 들면, nC2가 적용될 때, 상기 어두들은 ab, ac, ad, ae, af, ag, bc, ...fg가 생성된다.), 2) 문건 단위에서 m개의 어구가 있을 경우, 문건 단위에서 상기 1)의 방법을 적용하는 방법, 3) 순차적으로 2~5개의 어구로 배열하는 방법(예를 들면, 2개의 어구가 순차적으로 배열된다고 할 때는 ab, bc, cd, de, ef, fg가 되며, 3개의 어구가 배열된다고 할 때는 abc, bcd, cde, def, efg가 된다. 물론, 2개 및 3개의 어구의 배열 모두를 사용할 수도 있을 것이다.)The phrase generation engine of the present invention generates 1 to 5 (preferably 2 to 3) phrases in units of phrases. For example, when a sentence included in a patent document is input, let the output phrases be a b c d e f, g in order. At this time, if the phrase generation unit of the phrase generation engine is three words, the phrase generation engine generates abc, abd, bcd, cde, def, and efg. The phrase generation engine generates a phrase on a sentence unit basis with respect to a given document or document portion. An example of creating a phrase for a document part may be to generate the phrase for the sentences in the claims. The method of generating the arrangement is as follows: 1) a phrase is generated by using nC2, nC3, nC4, and nC5 for the n phrases when n words are extracted from one sentence, For example, when nC2 is applied, the digits ab, ac, ad, ae, af, ag, bc, ... fg are generated.) 2) B, cd, de, ef, fg, and so on) when the two arrangements are sequentially arranged. , And abc, bcd, cde, def, and efg when three phrases are arranged.) Of course, you can use both arrays of two and three phrases.

한편, 상기 형태소 분석기를 통과한 상기 어구는 텀(term)이라고도 명명하기도 한다. 상기 텀은 형태소 분석기에서의 세팅값에 따라서 달라질 수 있다. 이때, 상기 세팅값의 예로는 1) 체언만 분리함, 2) 용언만 분리하되, 대표형만 추출함, 3) 형용사나 부사의 경우 대표형만 추출함 등이 그러한 예가 될 수 있다.On the other hand, the phrase that has passed through the morpheme analyzer may also be called a term. The term may vary depending on the setting value in the morpheme analyzer. In this case, examples of the setting values include 1) separating only the vowel, 2) extracting only the representative word, but extracting only the representative word, and 3) extracting only the representative word in the case of the adjective or adverb.

이어, 상기 어구별 카운팅 엔진은 상기 생성된 각 어구별로, (어구, 문건번호) 또는 (어구, 카운팅값) 정보를 필수적으로 생성하며, 바람직하게는 (어구, 카운팅값, 문건 번호) 또는 (어구, 문건 번호, 문건 번호상의 위치 필드명(예를 들면 특허 청구 범위)) 정보를 생성하며, 더욱 바람직하게는 (어구, 카운팅값, 문건 번호, 문건 상의 위치필드명(예를 들면 특허 청구 범위)) 정보를 생성한다. 상기 어구별 카운팅 엔진은 대표 어구 DB(207-2)에 상기 생성한 정보를 저장한다. 하기 표 5은 상기 대표 어구 DB(207-2)에 포함되는 데이터의 일예를 나타낸다. 하기 표 5에 나타난 예는 국가별로 특정 어구에 대해서 생성되는 데이터의 일예를 보여주고 있다.Then, the erroneous counting engine essentially generates information (a phrase, a document number) or (a phrase, a counting value) for each of the generated phrases, and preferably generates information (a phrase, a counting value, a document number) (E.g., a phrase, a counting value, a document number, a location field name on the document (for example, a patent claim), a location field name on the document number ) Information. The distinguishing counting engine stores the generated information in the representative word DB 207-2. Table 5 shows an example of data included in the representative word DB 207-2. The example shown in Table 5 below shows an example of data generated for specific phrases by country.

[표 5][Table 5]

어구
IDFish
ID 어구Fish 어구 단어수Word count 문서내 누적 카운팅값(문건 번호 기준)Cumulative count value in document (based on document number) 국가 코드country code 문건 번호Article number 어구 위치Fishing location ...... "어구 절대 ID""False absolute ID" 1One abcabc 33 1One KRKR 10-2003-001234510-2003-0012345 특허청구범위Patent Claims 1One 1One abcabc 33 22 KRKR 10-2003-001234510-2003-0012345 특허청구범위Patent Claims 22 1One abcabc 33 33 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 33 1One abcabc 33 44 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 44 1One abcabc 33 55 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 55 1One abcabc 33 66 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 66 1One abcabc 33 77 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 77 22 bcdbcd 33 1One KRKR 10-2003-001234510-2003-0012345 특허청구범위Patent Claims 88 22 bcdbcd 33 22 KRKR 10-2003-001234510-2003-0012345 특허청구범위Patent Claims 99 22 bcdbcd 33 33 KRKR 10-2003-001234510-2003-0012345 특허청구범위Patent Claims 1010 22 bcdbcd 33 44 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 1111 22 bcdbcd 33 55 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 1212 22 bcdbcd 33 66 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 1313 22 bcdbcd 33 77 KRKR 10-2003-001234510-2003-0012345 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 1414 ...... ...... ...... ...... ...... ...... ...... ...... 1One abcabc 33 1One KRKR 10-2003-005678910-2003-0056789 특허청구범위Patent Claims 150150 1One abcabc 33 22 KRKR 10-2003-005678910-2003-0056789 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 151151 1One abcabc 33 33 KRKR 10-2003-005678910-2003-0056789 발명의 상세한 설명DETAILED DESCRIPTION OF THE INVENTION 152152 ...... ...... ...... ...... ...... ...... ...... ......

상기 표 5에 나와 있는 데이터는 특허 문건 10-2003-0012345 및 10-2003-0056789으로 생성된 것임을 알 수 있으며, 어구 생성 엔진은 3개의 단위로 어구를 생성했음을 알 수 있다. 그리고, 상기 10-2003-0012345에서 상기 어구 abc는 특허 청구 범위에서 2회, 발명의 상세한 설명에서 5회가 나왔음을 알 수 있다. 또한, 어구 bcd는 특허 청구 범위에서 3회, 발명의 상세한 설명에서 4회가 나왔음을 알 수 있다. 또한, 어구 abc는 특허 문건 10-2003-0056789에서 특허청구범위에 1회, 발명의 상세한 설명에서 2회가 나왔음을 알 수 있다. 그러므로, 본 발명의 어구별 카운팅 엔진은 반복적으로 또는 재귀적으로 입수되는 모든 특허 문건에 대하여 상기 표 5과 같은 데이터를 생성할 수 있다. 아울러, 상기 어구 생성 엔진은 2개의 단위로 어구를 생성할 수 있을 것이며, 상기 2개의 단위로 생성되는 어구에 대하여, 본 발명의 어구별 카운팅 엔진은 상기 표11과 같은 데이터를 생성할 수 있음은 자명하다 할 것이다. 나아가 1개 또는 4~5개의 단위로도 상기 어구별 카운팅 엔진은 상기 표 5과 같은 데이터를 생성할 수 있을 것이다.It can be seen that the data shown in Table 5 is generated by the patent documents 10-2003-0012345 and 10-2003-0056789, and it can be seen that the phrase generation engine has generated the phrases in three units. In the above-mentioned 10-2003-0012345, it can be seen that the phrase abc has been issued twice in the claims and five times in the detailed description of the invention. Also, it can be seen that the phrase bcd is three times in the claims and four times in the detailed description of the invention. In addition, the phrase abc can be found in the patent document 10-2003-0056789 once in the claims and twice in the detailed description of the invention. Therefore, the categorical counting engine of the present invention can generate data as shown in Table 5 for all patent documents repeatedly or recursively obtained. In addition, the phrase generation engine may generate a phrase in two units, and the word counting engine of the present invention can generate the data as shown in Table 11 for phrases generated in the two units It will be self-evident. Further, even in one or four or five units, the above-mentioned counting engine will be able to generate the data shown in Table 5 above.

이때, 상기 어구별 ID를 생성하는 것이 문제시 될 수 있는데, 상기 어구별 ID를 생성하는 방법은 통상적인 DB 기술에서 자명한 것이나, 몇 가지 예시적인 방법을 제시한다. 첫째 방법은, 최초로 생성되는 어구에 대하여 순차적으로 ID를 부여하고, 다음으로 생성되는 어구에 대하여 기존에 있던 어구를 검색하여 없으면 새로운 ID를 부여하고, 있다면 기존의 ID를 부여한다. 이러한 과정을 모든 입수되는 어구마다 반복 또는 재귀적으로 수행한다. 둘째 방법은 어구에 대한 ID 부여 없이 입수되는 모든 어구에 대하여 순차적으로 임시 ID를 부여하여 상기 표11과 같은 데이터를 생성한 다음, 각 어구마다 하나씩 읽으면서 동일 어구에 대해서 동일한 정식 ID를 부여하며, 이러한 과정을 모든 정식 ID가 부여되지 않은 어구들을 대상으로 반복적으로 또는 재귀적으로 수행한다.At this time, the generation of the distinction ID may be a problem. The method of generating the distinction ID is obvious in the conventional DB technology, but presents some exemplary methods. In the first method, the IDs are sequentially given to the first generated phrases, the existing phrases are searched for the next generated phrases, the new IDs are assigned if there is no phrases, and the existing IDs are assigned if they exist. This process is repeated or recursively performed for every available phrase. In the second method, temporary IDs are sequentially assigned to all the phrases obtained without the IDs for the phrases to generate the data as shown in Table 11, and the same formal IDs are assigned to the same phrases while reading one for each phrase, This process is repeatedly or recursively performed on phrases for which no formal ID is assigned.

그리고, 본 발명의 상기 어구별 카운팅 엔진은 매 어구가 생성되어 처리될 때마다 어구 절대 ID를 생성하여 모든 어구마다 부여할 수 있다. The discrimination counting engine of the present invention can generate a phrase absolute ID every time a phrase is generated and processed and assign it to every phrase.

또한, 본 발명의 어구별 카운팅 엔진은 목표 문건 집합을 기준으로 한 각 어구별 누적 카운팅 값을 생성할 수 있다. 상기 목표 문건 집합에는 상기 표와 같이 하나의 문건을 기준으로 누적 카운팅 값을 계산할 수도 있으며, 하나의 문건 내의 특정한 필드(예를 들면 특허 청구 범위)로도 누적 카운팅이 가능함은 자명할 것이다.In addition, the inventive counting engine of the present invention can generate each cumulative counting value based on the target document set. The target document set may be counted based on one document as shown in the above table, and it is obvious that cumulative counting can be performed even in a specific field (for example, patent claim) in one document.

또한, 본 발명의 어구별 카운팅 엔진은 전체 문건 집합을 기준으로 한 각 어구별 누적 카운팅 값을 생성할 수 있으며 상기 표 6와 같은 방식으로 저장할 수도 있을 것이다.(상기 예에서 전체 문건 집합을 특허 문건 10-2003-0012345 및 10-2003-0056789로 가정하고, 수치를 상기 표 기준으로 입력하였으며, 실제로는 훨씬 더 큰 전체 문건 집합을 대상으로 본 발명 사상이 적용된다.)The erroneous counting engine of the present invention can generate a cumulative counting value based on the entire document set and store it in the manner shown in Table 6. In the above example, 10-2003-0012345, and 10-2003-0056789, the numerical values are entered based on the above table, and the present invention is actually applied to a much larger overall document set.)

[표 6][Table 6]

어구
IDFish
ID 어구Fish 누적 카운팅값(전체 문건 집합 기준)Cumulative counting value (based on total document set) 관련 어구 절대 IDAbsolute ID of related phrase ...... ...... ...... ...... ...... ...... ...... ...... 1One abcabc 1010 1, 2, 3, 4, 5, 6, 7, 150, 151, 1521, 2, 3, 4, 5, 6, 7, 150, 151, 152 22 bcdbcd 77 8, 9, 10, 11, 12, 13, 148, 9, 10, 11, 12, 13, 14 ...... ...... ...... ...... ...... ...... ...... ......

또한, 본 발명의 어구별 카운팅 엔진은 문건 상의 특정 필드(예를 들면 특허 청구 범위 등)를 기준으로 하기 표 7과 같이 한 각 어구별 누적 카운팅 값을 생성할 수 있다.In addition, the segregated counting engine of the present invention can generate a counted cumulative counting value as shown in Table 7 on the basis of a specific field on the document (for example, claims).

[표 7][Table 7]

어구
IDFish
ID 어구Fish 누적 카운팅값(특허청구범위기준)Cumulative counting value (based on claims) 관련 어구 절대 IDAbsolute ID of related phrase ...... ...... ...... ...... ...... ...... ...... ...... 1One abcabc 33 1, 2, 1501, 2, 150 22 bcdbcd 33 8, 9, 108, 9, 10 ...... ...... ...... ...... ...... ...... ......

.........어, 본 발명의 대표 어구 추출 엔진에 대해서 설명한다. 대표 어구 추출 전체 문건 집합에서 추출한서 (어구, 문건 번호) 데이터가 있는 경우, 상기 데이터로 하기 표 8와 같은 (어구, 카운팅값, 문건 번호) 데이터가 생성하는 것은 용이할 것이다. 상기 (어구, 카운팅값, 문건 번호) 데이터는 동일한 어구가 있을 경우, 카운팅 값을 1씩 증가시켜 주는 방법으로 생성될 수 있을 것이다. 되었을 때, 본 발명의 대표 어구 추출 엔진은 (어구, 문건 번호) 데이터를 기초로 하여, (어구, 카운팅값, 문건 번호)을 생성할 수 있으며, (어구, 문건 번호, 문건 번호상의 위치 필드명) 데이터로, (어구, 카운팅값, 문건 번호, 문건 번호상의 위치 필드명) 데이터를 생성할 수 있을 것이다. 상기와 같은 과정은 적어도 하나 이상의 대표 어구 추출 목표 문건 집합에 속하는 모든 문건에 대하여 수행할 수 있을 뿐만 아니라, 상기 대표 어구 추출 목표 문건 집합의 모집합이 되는 적어도 하나 이상의 대표 어구 추출 전체 문건 집합에 대해서도 수행할 수 있을 것이다.......... The representative phrase extraction engine of the present invention will be described. Representative Phrase Extraction If there is data (phrase, document number) data extracted from the entire document set, it will be easy to generate data (phrase, counting value, document number) data as shown in Table 8 below. The above (phrase, counting value, document number) data may be generated by increasing the counting value by 1 when there is the same phrase. The representative word extraction engine of the present invention can generate (a phrase, a counting value, a document number) based on data (a phrase, a document number) and a position field name on a phrase, a document number, ) Data, (phrase, counting value, document number, location field name on document number) data. The above process can be performed not only for all documents belonging to at least one representative phrase extraction target document set but also for at least one representative phrase extraction whole document set that is the sum of the representative phrase extraction target document set You can do it.

대표 어구 추출 엔진은 대표 어구 추출 전체 문건 집합의 예로서 1) 1개 국가 DB에 속하는 모든 문건, 2) 주어진 적어도 하나 이상의 특허 분류 기호에 대응되는 모든 문건, 3) 특정한 출원인에 대한 모든 문건, 4) 특정한 발명자에 대한 모든 문건 또는 5) 이들 각각에 기간을 한정하여 생성되는 모든 문건, 또는 6) 이들 각각을 조합(합집합, 차집합, 교집합 등의 임의의 집합 연산을 포함한다)하여 생성되는 모든 문건에 대해서도 (어구, 문건 번호) 데이터 또는 (어구, 문건 번호, 문건 번호상의 위치 필드명) 데이터를 생성할 수 있을 것이다. 한편, 상기 대표 어구 추출 전체 문건 집합의 임의의 부분 집합에 대응되는 모든 문건에 대해서도 (어구, 문건 번호) 데이터 또는 (어구, 문건 번호, 문건 번호상의 위치 필드명) 데이터를 생성할 수 있을 것이다.The representative phrase extraction engine is an example of a representative phrase extraction set of all documents: 1) all documents belonging to one national DB, 2) all documents corresponding to at least one given patent classification, 3) all documents for a specific applicant, 4 ) All documents for a specific inventor, or (5) all documents produced by limiting the duration of each of these, or (6) all of the documents generated by combining them (including any aggregate operations such as union, difference or intersection) (Eg, phrase, document number) data or a (location field name on the phrase, document number, and document number) data for the document. On the other hand, data (a phrase, a document number) or data (a phrase, a document number, and a location field name on the document number) data may be generated for all documents corresponding to an arbitrary subset of the representative document set.

이때, 상기 대표 어구 추출 엔진은 적어도 하나 이상의 대표 어구 추출 전체 문건 집합 각각에 대하여 기설정된 적어도 하나 이상의 대표 어구 추출 목표 문건 집합에 대하여 각 어구 별로 1) 대표 어구 추출 목표 문건 집합에서의 빈도(frequency F) 및 2) 상기 대표 어구 추출 목표 문건 집합의 모집합이 되는 대표 어구 추출 전체 문건 집합에서의 빈도(total frequency, T)를 계산할 수 있을 것이다. 한편, 상기 대표 어구 추출 전체 문건 집합에서 어구 총수(All frequency, A)를 계산할 수 있음은 물론이며, 상기 대표 어구 추출 목표 문건 집합에서의 어구 총수(All frequency of target set, AT)를 계산할 수 있음도 물론이다 하겠다.At this time, the representative phrase extraction engine extracts at least one representative phrase from the at least one or more representative phrase extraction sets for each set of at least one representative phrase extraction target document set for each of the phrases 1) frequency in the representative phrase extraction target document set (frequency F ) And 2) the total frequency (T) in the representative document set extracted from the representative phrase extraction set which is the sum of the collection of the representative document extraction target document set. In addition, it is possible to calculate the total frequency (A) of the representative text in the representative text extraction set, and calculate the total frequency of the AT in the representative text extraction target document set Of course I will.

이때, 상기 대표 어구 추출 엔진은 대표 어구 추출 정책 DB를 참조하여 주어진 대표 어구 추출 전체 문건 집합에서 주어진 대표 어구 추출 목표 문건 집합의 대표 어구를 추출한다. 이때, 상기 대표 어구 추출 정책은 기설정된 조건 하에서 (어구, 어구의 확률값)에서 상기 어구의 확률값이 기설정된 기준 범위에 포함되는 지의 여부가 될 수 있다. 상기 확률값의 예로는 특정 어구별로의 F/T, F/A, F/AT값이 될 수 있을 것이다. 한편, 상기 대표 어구 추출 정책은 1) F, T, A, AT 값의 변동값(증감율, 증감속도), 2) F/T, F/A, F/AT의 변동(증감율, 증감속도)값이 기설정된 기준 범위를 만족하는 지도 될 수 있을 것이다. 이때, 상기 어구 추출 정책의 기설정된 조건은 기간별, 국가별, 출원인별, 발명자별, 또는 적어도 하나 이상의 특허 분류 기호 별 또는 기설정된 속성을 공유하는 문건 집합별로 상기 (어구, 어구의 확률값)에서 상기 어구의 확률값에 다른 기준 범위의 값을 대응시킬 수 있을 것이다. 상기 예를 들면, 특허 출원이 많은 IPC H 섹션에서 추출된 어구에 대한 확률값과 특허 출원인 적은 IPC D 섹션에서 추출된 어구의 확률값은 다르게 적용될 수도 있다는 것이다. 물론, 1) F, T, A, AT 값의 변동값(증감율, 증감속도), 2) F/T, F/A, F/AT의 변동(증감율, 증감속도)값에도 상기 어구가 추출된 문건 집합(대표 어구 추출 전체 문건 집합, 대표 어구 추출 목표 문건 집합)의 속성에 따라 유동적으로 다른 기준이 적용될 수 있을 것이다.At this time, the representative phrase extraction engine refers to the representative phrase extraction policy DB and extracts a representative phrase of the representative target phrase extraction target document set from the given representative phrase extraction whole document set. At this time, the representative phrase extraction policy may be whether or not the probability value of the phrase is included in a preset reference range under a predetermined condition (probability value of the phrase, phrase). Examples of the probabilities may be F / T, F / A, F / AT values for specific phrases. The representative phrase extraction policy is defined as follows: 1) Variation value (increase / decrease rate) of F, T, A, and AT values; 2) Variation (increase / decrease rate, increase / decrease rate) of F / T, F / It can be instructed to satisfy the predetermined reference range. At this time, the predetermined condition of the phrase extraction policy may be determined in accordance with the above (the probability value of the phrase and the phrase) by the period, the country, the applicant, the inventor, or at least one patent classification symbol, The probability value of the phrase may correspond to a value of another reference range. For example, the probabilities for the phrases extracted in the IPC H section with many patent applications and the probabilities of the phrases extracted in the IPC D section may be applied differently. Of course, the above-mentioned phrases are also extracted in the values of F / T, F / A, F / AT fluctuation (increase / decrease rate, increase / decrease rate) Different criteria may be applied flexibly according to the attributes of the document set (representative phrase extraction whole document set, representative phrase extraction target document set).

이하, 간략한 표로써 본 발명의 대표 어구 추출의 개념을 설명한다. 본 발명의 대표 어구 추출 엔진은 1) 전체 어구들의 출현 개수(frequency) 기준으로, 어구 ID별로 확률값(=특정 어구의 출현 개수/전체 어구 수)을 대응시키고, 기설정된 확률값 범위에 해당하는 어구들을 후보 대표 어구로 선정하고, 기설정된 대표 어구 선발 규칙을 참조하여, 목표 문건 집합별로의 대표 어구를 선정한다. 상기 출현 개수는 상기 각 특정 필드별로 계산될 수 있음은 자명할 것이다. 하기 표 8와 같은 예에서는 문건 번호별로 각 어구별 출현 개수를 카운팅하여 기설정된 대표 어구 추출 정책에 부합하는 상기 문건 번호별로 대표 어구를 추출할 수 있을 것이다.Hereinafter, the concept of representative word extraction of the present invention will be described with a brief table. The representative phrase extraction engine of the present invention is configured such that 1) a probability value (= number of occurrences of a specific phrase / total number of phrases) is associated with each phrase ID on the basis of frequency of occurrence of all phrases, and phrases corresponding to a predetermined probability value range And the representative phrase for each target document set is selected by referring to the predetermined representative phrase selection rule. It will be appreciated that the number of occurrences can be calculated for each specific field. In the example shown in Table 8 below, the number of occurrences of each word can be counted according to the document number, and a representative word can be extracted for each document number conforming to the predetermined representative phrase extraction policy.

[표 8][Table 8]

구I어구Old I fish 대표 어구 추출 목표 문건 집합문건에서의 출현개수(F)(frequency)The number of occurrences in the representative document extraction target document set (F) (frequency) 문건 번호Article number 대표어구추출 전체 문건 집합에서의 출현개수(T)Representative phrase extraction Number of occurrences in total document set (T) F/T(%)출현개수/전체 출현개수(%)F / T (%) Number of occurrences / Total number of occurrences (%) T/A전체에서의 출현개수/전체 어구 총수(10억)(%)Number of occurrences in total T / A / Total number of total words (1 billion) (%) 어구 총수(A)Total number of fishes (A) abcabc 4040 #1#One 8000080000 0.050.05 0.0080.008 1,000,000,0001,000,000,000 abcabc 22 #2#2 8000080000 0.00250.0025 0.0080.008 abcabc 200200 #3# 3 8000080000 0.250.25 0.0080.008 bcdbcd 88 #1#One 200200 44 0.000020.00002 bcdbcd 100100 #2#2 200200 5050 0.000020.00002 bcdbcd 22 #4#4 200200 0.10.1 0.000020.00002 cdecde 1515 #1#One 30003000 0.50.5 0.00030.0003 cdecde 100100 #4#4 30003000 3.33.3 0.00030.0003

상기 표 8에서 알 수 있듯이, 모든 어구마다 이와 1:1로 대응되는 어구 절대 ID가 있을 것이다. 그러므로고, (어구, 문건 번호) 데이터는 (어구 절대 ID, 어구, 문건 번호) 데이터와 본질적으로 대등하다. 각 어구 절대 ID마다 어구 ID가 대응되며, 각 어구 절대 ID마다 상기 어구가 나온 문건 번호가 대응되어 있으며, 각 문건 번호 마다 서지 사항이 대응되어 있다. 그러므로, 상기 표 8의 문건 번호 및 문건 번호별 상기 어구 ID별 출현 개수를 입수하는 것은 목표 문건 집합의 한 예를 문건 번호로 하는 것일 수 있다. 마찬가지로, 상기 문건 번호 자리에 기설정된 레벨의 IPC가 대응될 수도 있으며, 특정한 출원인 또는 특정한 발명자 등이 대응될 수 있으며, 각 경우마다 각 목표 문건 집합별 어구 ID별 출현 개수를 카운팅할 수 있을 것이다. 상기 목표 문건 집합별 어구 ID별 출현 개수를 카운팅하는 것은 DBMMS에서는 테이블별 조인(join) 명령을 통해서 할 수 있을 것이다.As shown in Table 8, there will be a phrase absolute ID corresponding to 1: 1 for every phrase. Therefore, the data (phrase, document number) is essentially equivalent to the data (absolute ID, phrase, document number). A phrase ID corresponds to each phrase absolute ID, a document number from which the phrase is associated for each phrase absolute ID, and a bibliography is associated with each document number. Therefore, obtaining the number of occurrences of the document ID and the document number of each of the document IDs in Table 8 above may be an example of the target document set as the document number. Similarly, a predetermined level IPC may correspond to the document number place, a specific applicant or a specific inventor may be corresponded, and in each case, the number of occurrences of each phrase ID per each target document set may be counted. The DBMMS can count the number of occurrences by the ID of each target document set through the table-specific join command.

이때, 예를 들어, 상기 대표 어구 추출 정책 DB에 있는 상기 대표 어구 추출 정책을 이루는 한선발 규칙이 1) 전체 어구 개수 대비(예를 들면 10억 개) 전체에서의 출현개수(T)가 1/1000% 미만이고, 2) 목적하는 문건 번호에서의 출현 개수가 1/1,000,000% 이상이며, 3) 출현개수/전체 출현 개수(%)가 10%에서 1%이라고 할 때, 각 어구로 각 문건 번호의 대표 어구가 될 수 있는지를 검토한다. 어구 abc는 문건 번호 #1 내지 #4 어느 것에도 대표어구가 될 수 없고(조건 1) 위반), 어구 bcd는 문건 번호 #2에서만 상기 문건 번호를 대표하는 어구가 될 수 있으며,(문건 번호 1 및 문건 번호 4에서는 조건 2)위반), 어구 cde는 문건 번호 #4에서만 대표어구가 될 수 있다(문건 번호 1에서는 조건 3) 위반) 그러므로, 문건 번호 #1의 대 표 어구에는 상기, abc, bcd, cde는 포함되지 않으며, 문건 번호 #2의 대표 어구에는 bcd가 포함될 수 있으며, 문건 번호 #3에 대해서는 상기 어구 abc는 포함되지 않으며, 문건 번호 #4에 대해서는 어구 cde가 포함될 수 있게 된다. In this case, for example, if the one-way rule constituting the representative phrase extraction policy in the representative phrase extraction policy DB is 1) the number of occurrences (T) in the total number of phrases (for example, 1 billion) 2) the number of occurrences in the target document number is 1 / 1,000,000% or more; and 3) the number of occurrences / total number of occurrences (%) is 10% to 1% Of the total number of words. The phrase abc can not be a representative word in any of document numbers # 1 to # 4 (violation of condition 1), and the phrase bcd can be a phrase representing the document number only in document number # 2, And abbreviation for document number 4), the word cde can only be a representative word for document number # 4 (document number 1, condition 3). Therefore, bcd and cde are not included, and the representative word of the document number # 2 includes bcd. The document abc is not included in the document number # 3, and the word cde can be included in the document number # 4.

상기와 같은 방법으로 본 발명의 대표 어구 추출 전처리 모듈은 검색 엔진의 인덱서(401-3) 및 상기 인덱스를 활용하여 어구별 카운팅 데이터를 생성할 수 있으며, 상기 어구별 카운팅 데이터를 바탕으로 하여 전술한 바와 같은 방식으로 문건별, 기설정된 문건 집합별, 특정 필드별 내용, 또는 기설정된 문건 집합에서 추출된 모든 특정 필드별 내용에서 대표 어구를 추출할 수 있게 된다. 예를 들면, 윗 문단에서 상기 문건 번호 #2에 대표 어구에는 bcd를 대응시킬 수 있게 된다. 이어, 대표 어구 추출 목표 문건 집합별로 대표 어구가 추출되었다면, 상기 대표 어구는 상기 대표 어구 추출 목표 문건 집합 단위 또는 상기 대표 어구 추출 목표 문건 집합을 이루는 개별 문건 단위로 저장될 수 있다. 즉, 하나의 대표 어구 추출 목표 문건 집합 단위 또는 상기 개별 문건에는 적어도 하나 이상의 대표 어구가 대응될 수 있다. 상기 대표 어구는 상기 대표 어구 추출 목표 문건 집합의 개별 필드 단위 또는 상기 대표 어구 추출 목표 문건 집합을 이루는 개별 문건의 개별 필드 단위로 적어도 하나 이상이 대응될 수 있을 것이다. 예를 들면, 상기 문건 번호 #2에 어구 bcd를 대응시켜 저장할 수 있게 된다. 이때, 상기 #2를 호출하였을 때, 상기 어구 bcd를 표시하여 줄 수 있게 된다.In this way, the representative word extraction preprocessing module of the present invention can generate categorical counting data by using the indexer 401-3 of the search engine and the index, and based on the categorical counting data, The representative phrases can be extracted from the contents of each specific field extracted from documents, predetermined document sets, specific field contents, or a predetermined document set in the same manner as in FIG. For example, in the upper paragraph, the document number # 2 can correspond to the bcd in the representative phrase. If representative phrases are extracted for each set of representative phrase extraction target documents, the representative phrases may be stored in units of the representative phrase extraction target document set or individual document units constituting the representative phrase extraction target document set. That is, at least one representative phrase may correspond to one representative phrase extraction target document set unit or the individual document. The representative word may be corresponded to at least one of individual field units of the representative word extraction target document set or individual field units of the individual documents constituting the representative word extraction target document set. For example, the phrase bcd can be stored in correspondence with the document number # 2. At this time, when the # 2 is called, the phrase bcd can be displayed.

이때, 특정한 문건 번호에 대하여 너무 많은 대표 어구가 할당될 수 있게 된다. 즉, 특정한 문건에 대하여는 상기 예로 든 조건을 충족하는 대표 어구가 100개 가 될 수도 있으며, 이 경우, 4) 각 대표 어구마다 계산되는 출현 빈도(frequency)가 높은 순으로 10개 내지 30개의 대표 어구를 추출하는 규칙을 적용하여, 대표 어구의 개수를 제한할 수 있다. 한편, 특정한 문건에 대해서는 대표 어구가 1개 내지 없을 경우가 발생할 수 있는데, 이 경우에는 상기 예로든 조건에서 조건 3)을 완화하여 적용하여, 대표 어구의 숫자를 10개 내지 30개로 유지할 수 있을 것이다.At this time, too many representative phrases can be assigned to specific document numbers. That is, for a specific document, there may be 100 representative phrases satisfying the above-mentioned conditions, and in this case, 4) 10 to 30 representative phrases in the order of frequency in which each representative phrase is calculated, , The number of representative phrases can be limited. On the other hand, there may be a case where there is not one representative phrase for a specific document. In this case, condition 3) may be relaxed in the above-mentioned condition so that the number of representative phrases can be maintained at 10 to 30 .

상기와 같은 과정을 모든 문건 내지는 모든 문건 집합에 대하여 반복적으로 수행하면, 모든 문건에 대하여 기설정된 개수 범위(예로 들면 10개 내지 30개)의 대표 어구를 대응시켜 놓을 수 있을 것이다. 그러므로, 상기와 같은 과정을 통하여 상기 문건 번호에 대응되는 대표 어구를 획득할 수 있게 된다. 그리고, 상기 대표 어구 및 상기 대표 어구에 대한 계산값(예로 들면, 상기 문건 번호의 문건에서의 출현개수, 전체에서의 출현개수 등)은 상기 문건 번호에 대한 서지 사항의 한 예로 취급되어 각종 분석에 활용될 수 있다.If the above process is repeatedly performed on all documents or all sets of documents, a representative phrase of a predetermined number range (for example, 10 to 30) may be associated with all documents. Therefore, the representative word corresponding to the document number can be obtained through the process described above. The calculation values (for example, the number of occurrences of the document number in the document, the number of occurrences in the whole document, and the like) for the representative word and the representative word are treated as an example of the bibliography for the document number, Can be utilized.

상기 실시예에서는 출현 빈도의 계산을 문건 단위로 하였는데, 상기 출현 빈도는 다음과 같은 1) 문건 집합 또는 2) 문건 내부의 필드 단위(특허 청구 범위 등) 또는 3) 문건 집합의 모든 문건 내부의 필드(예를 들면 2000년부터 2005년까지 출원된 대한민국의 IPC H04B 7/02에 모든 문건에 포함된 특허청구범위 중 제 1항)에 포함된 내용을 단위로 하여 계산할 수도 있을 것이다. 이때, 하나의 문건 내부인 경우에는 하나의 문건과 동일하게 취급(문건 번호의 대응 관점)할 수 있을 것이다. 하지만, 2개 이상의 문건 집합의 경우에는 문건 번호의 대응 관계가 없어지는 것처럼 보일 수 있으나 아래 3 가지 방법 중 어느 하나의 방법을 사용하면 이 문제 를 용이하게 해결할 수 있을 것이다. 이는 1) 문건 집합에 속하는 개별 문건의 단위로 대표 어구를 추출한 다음, 상기 대표 어구들을 합집합 연산을 수행한 다음, 기설정된 범위(예로 들면 10개 내지 30개)로 대표 어구의 수를 한정하는 방법, 2) 만약 문건 번호별로 대표 어구가 이미 추출되어 있다면 상기 문건 집합에 속하는 문건 번호에 대응되는 대표 어구를 입수해서 합집합 연산을 수행한 다음, 기설정된 범위로 대표 어구의 수를 한정하는 방법, 3) 문건 집합에 속하는 모든 문건을 처리한 다음에 각 어구를 문건 번호가 아닌 문건 집합의 고유 ID에 대응하여 대표 어구를 추출하는 방법 중 어느 하나를 사용할 수 있을 것이다. 상기 3) 문건 집합의 모든 문건 내부의 필드에 포함된 내용 단위일 경우에도 상기 1) 합산 처리 내지 2) 문건 집합 고유 ID 대응 처리 방법이 사용될 수 있을 것이며, 만약 필드 단위로 대표 어구가 마련되어 있다면, 상기 3) 입수 후 합산 처리 방법이 사용될 수 있을 것이다.In the above embodiment, the appearance frequency is calculated in units of documents. The appearance frequency is the following: 1) a document set or 2) a field unit within the document (claim scope etc.) or 3) (Eg, paragraph 1 of the patent claims included in all documents in IPC H04B 7/02 of the Republic of Korea filed between 2000 and 2005). At this time, in the case of one document, it can be handled in the same way as one document (the corresponding point of the document number). However, in the case of two or more sets of documents, the correspondence of document numbers may seem to disappear, but one of the following three methods can easily solve this problem. (1) a method of extracting representative phrases as a unit of individual documents belonging to a document set, then performing a union operation of the representative phrases, and then limiting the number of representative phrases to a predetermined range (for example, 10 to 30) , 2) if representative phrases have already been extracted for each document number, obtaining a representative phrase corresponding to the document number belonging to the document set, performing a union operation, and then limiting the number of representative phrases to a predetermined range; ) After processing all the documents belonging to the document set, one of the methods of extracting the representative words corresponding to the unique ID of the document set other than the document number may be used for each of the words. If the content unit is included in the field of all the documents in the above 3) document set, the above 1) summation processing and 2) document unique ID correspondence processing method may be used. If a representative phrase is provided in the field unit, The above-mentioned 3) method of collecting after acquisition may be used.

이하, 형태소 분석 엔진과 연결된 검색 엔진의 인덱서(401-3)(indexer)을 활용하여 상기 대표 어구를 추출하는 방법을 설명한다. 검색 엔진을 사용하는 이유는 상기 검색 엔진은 통상적으로 검색어 값에 대한 개수를 카운팅하는데 뛰어난 성능을 가지고 있기 때문이며, 인덱서(401-3)가 인덱싱해 놓은 데이터를 DB용 데이터로 용이하게 변환할 수 있기 때문이다. 인덱서(401-3)는 입수되는 텀과 상기 텀이 나온 문건 번호 또는 경우에 따라서는 상기 텀이 나온 필드명을 저장한다. 예로 들면, (term 1, #1)은 상기 term 1이 문건 #1에서 입수되었다는 뜻이며, (CL:term 1, #1)은 상기 term 1이 상기 문건 #1의 특허 청구 범위(Claim, CL)라는 필드에서 입 수되었다는 뜻이다. 검색 엔진의 인덱스에는 상기 (term 1, #1)이나 (CL:term 1, #1)과 같은 데이터들이 엄청나게 많이 저장되어 있으며, 검색어로 term 1을 넣거나, 검색 필드를 특허 청구 범위로 한정하여 term 1을 입력할 경우, 검색 결과로서 문건 번호 #1을 출력해 준다. 한편, 검색 엔진은 상기 term 1이 전체 문건에서 몇 개가 들어가 있는지, 또는 특허 청구 범위에서 몇 개가 들어가 있는지를 아주 빠른 시간에 처리해 준다.(통상적으로 검색 엔진은 검색 결과의 전체 개수를 먼저 리턴해 주고, 이어서, 전체 개수 중 앞쪽 일부만을 먼저 보여 주는 형태를 취한다.)Hereinafter, a method of extracting the representative word using an indexer 401-3 (indexer) of the search engine connected to the morpheme analysis engine will be described. The reason why the search engine is used is that the search engine typically has excellent performance in counting the number of search word values, and can easily convert data indexed by the indexer 401-3 into data for DB Because. The indexer 401-3 stores the obtained term and the document number from which the term comes, or, in some cases, the field name from which the term appears. For example, (term 1, # 1) means that term 1 is obtained from document # 1, and (CL: term 1, # 1) means that term 1 is a claim ) In the field. The index of the search engine contains a large amount of data such as (term 1, # 1) or (CL: term 1, # 1), and term 1 is inserted into the search term, If you enter 1, it outputs document number # 1 as the search result. On the other hand, the search engine is able to process very quickly the number of entries in the entire document or the number of claims in the claims. (Typically, the search engine returns the total number of search results first , Then take the form of first showing only the front part of the total number.)

이때, 본 발명의 어구 생성 엔진은 상기 형태소 분석 엔진(analyzer)를 통과한 n개의 /텀들을 입수 받고, 상기 입수 받은 n개의 텀들을 기설정된 방식으로 조합하여 조합된 텀들을 상기 문건 번호와 함께 검색 엔진의 상기 인덱서(401-3)에 전송할 수 있다. (이때, 상기 문건 번호를 이미 상기 인덱스가 알고 있는 경우에는 굳이 문건 번호까지 전송할 필요가 없다. 이해의 편의상 전송되는 조합된 텀의 출처로서의 문건 번호를 포함하여 전송한다고 한다.) 이때, 상기 텀들을 조합하는 방법은 전술한 바과 같이, 1) 1개 문장 단위로 처리하여 1개 문장에서 입수되는 모든 텀을 대상으로 2~5개의 텀별로 모든 가능한 방법으로 조합하여 처리하는 방법, 2) 전체 문건 단위에 포함된 모든 텀을 대상으로 2~5개의 텀별로 모든 가능한 방법으로 조합하여 처리하는 방법(이 경우에는 텀의 조합이 굉장히 커서 처리 시간이 길어지는 문제가 있으나, 가장 정밀한 어구 집합을 생성할 수 있다.), 3) 하나의 문장에서 2~5개의 텀별로 순차적으로 조합하는 방법 중 어느 하나 이상을 사용할 수 있다. 물론, 상기 전체 문건 단위의 처리 방법에는 문단 단위, 특정 필드 단위(특 허 청구 범위 등)의 처리 방식도 전술한 바와 같은 대응한 방식으로 취급될 수 있을 것이다.At this time, the phrase generation engine of the present invention obtains n / terms that have passed through the morpheme analysis engine, combines the received n terms in a preset manner, and searches the combined terms together with the document number To the indexer 401-3 of the engine. (In this case, if the document number is already known by the index, it is not necessary to transmit it to the document number.) For convenience of explanation, it is assumed that a document number as a source of a combined combined term is transmitted. The method of combining is as follows: 1) a process is performed in units of one sentence, and all the terms obtained in one sentence are processed in a combination of all possible methods by 2 to 5 terms, 2) (In this case, the combination of terms is so large that the processing time is prolonged, but the most precise phrase set can be generated). ), And 3) a method of sequentially combining two to five terms in one sentence. Of course, the processing method of the entire document unit may be handled in a corresponding manner as described above, such as the processing of a paragraph unit or a specific field unit (a claim range, etc.).

예를 들어 설명하면 다음과 같다. 상기 본 발명의 어구 생성 엔진은 상기 출력된 를 입수 받아, 상기 단위로 1~5개(바람직하게는 2~3개)의 배열을 생성한다. 예를 들어, 하나의 특허 문건에 포함된 하나의 문장이 입력되었을 때, 상기 출력되는 들을 형태소 별로 순서대로 a b c d e f, g라고 하자. 이때, 상기 어구 생성 엔진의 어구 생성 단위가 3단어일 경우, 상기 어구 생성 엔진은 순차적으로 abc, bcd, cde, def, efg 라는 별 어구를 생성하거나 ab, ac, ad, ae, af, ag, bc,...fg와 같은 2개의 단위로 어구를 생성할 수 있다.(이에 대해서는 전술한 바와 같다). 이때, 상기 어구 생성 엔진은 상기 abc, bcd 나 ab, ac 등과 같은 어구(=복수 개의 텀들(multiple terms))를 상기 인덱서(401-3)에 전송한다. 인덱서(401-3)는 전송받은 어구(복수개의 텀들)들을 (abc, #1), (bcd, #1), (ab, #1), (ac, #1) 등과 같이 저장한다. 이때. 만약 필드들이 한정되는 경우라면, (CL : abc, #1), (CL : bcd, #1), (CL : ab, #1), (CL : ac, #1)과 같이 상기 어구(복수 개의 텀들)이 나온 구체적인 필드명까지 포함하여 인덱스(index)로 저장한다. 하기 표 9는 인덱스의 일 예를 보여 개념적인 테이블이다.For example: The phrase generation engine of the present invention receives the output, and generates 1 to 5 (preferably 2 to 3) arrays in the unit. For example, when one sentence included in a patent document is input, let the output be a b c d e f, g in order of morphemes. If the word generation unit of the word generation engine is 3 words, the word generation engine sequentially generates abc, bcd, cde, def, efg or ab, ac, ad, ae, af, ag, bc, ... fg (which is described above). At this time, the phrase generation engine transmits a phrase (= multiple terms) such as abc, bcd, ab, ac, etc. to the indexer 401-3. The indexer 401-3 stores the sentences (a plurality of terms) such as (abc, # 1), (bcd, # 1), (ab, # 1), (ac, # 1) At this time. If the fields are limited, the words (CL: abc, # 1), (CL: bcd, # 1), (CL: ab, # 1) (Fields) including the specific field names from which they are stored. Table 9 below is a conceptual table showing an example of an index.

[표 9][Table 9]

어구Fish 문건 번호Article number 필드field abcabc #1#One DD abcabc #1#One DD abcabc #1#One DD abcabc #1#One CC abcabc #1#One CC bcdbcd #1#One DD bcdbcd #1#One CC acac #1#One DD abcabc #2#2 DD abcabc #2#2 DD abcabc #2#2 CC

D는 발명의 상세한 설명(description), C는 특허청구범위(claim)을 나타낸다.D is a description of the invention, and C is a claim.

상기 표 9에서 어구 abc가 #1번 문건의 발명의 상세한 설명에서 3회가 나타났고, 특허 청구 범위에서 2회가 나타났으며, 상기 어구 bcd가 1번 문건의 발명의 상세한 설명에서 1회가 나타났고, 특허 청구 범위에서 1회가 나타났으며, 상기 어구 ac가 1번 문건의 발명의 상세한 설명에서 1회가 나타났음을 보여 주고 있다. 한편, 어구 abc가 #2번 문건의 발명의 상세한 설명에서 2회가 나타났고, 특허 청구 범위에서 1회가 나타났음을 보여 준다.In Table 9, the phrase abc has appeared three times in the detailed description of the invention of the document # 1, twice in the claims, and the phrase bcd is once in the detailed description of the invention of the document No. 1 And once in the claims of the patent, the phrase ac appears once in the detailed description of the invention of No. 1. On the other hand, it is shown that the phrase abc appears twice in the detailed description of the invention of item # 2, and once in the claims.

이때, 상기 본 발명의 어구별 카운팅 엔진은 상기 인덱스(index)의 데이터를 입수 받아, 상기 어구(복수개의 텀들)별의 개수 및 상기 어구가 나온 출처로서의 문건 번호에 관한 데이터를 생성한다. 상기 생성된 데이터는 전술한 방식으로 생성한 데이터와 동일하거나 대응한 것이며, 상기 인덱스를 대상으로 예를 들면 다음 표 10과 같을 수 있다. At this time, the sorting engine of the present invention receives data of the index and generates data about the number of the phrases (the plurality of terms) and the document number as the source from which the phrase is derived. The generated data is the same as or corresponding to the data generated in the above-described manner. For example, the index may be as shown in Table 10 below.

[표 10][Table 10]

어구 IDPhrase ID 어구Fish 문건 번호Article number 필드field 출현회수Number of times of appearance 1One abcabc #1#One DD 33 1One abcabc #1#One CC 22 1One abcabc #2#2 DD 22 1One abcabc #2#2 CC 1One 22 bcdbcd #1#One DD 1One 22 bcdbcd #1#One CC 1One 33 acac #1#One DD 1One

상기와 같은 방법으로 본 발명의 대표 어구 추출 전처리 모듈은 검색 엔진의 인덱서(401-3) 및 상기 인덱스를 활용하여 어구별 카운팅 데이터를 생성할 수 있으며, 상기 어구별 카운팅 데이터를 바탕으로 하여 전술한 바와 같은 방식으로 문건별, 기설정된 문건 집합별, 특정 필드별 내용, 또는 기설정된 문건 집합에서 추출된 모든 특정 필드별 내용에서 대표 어구를 추출할 수 있게 된다.In this way, the representative word extraction preprocessing module of the present invention can generate categorical counting data by using the indexer 401-3 of the search engine and the index, and based on the categorical counting data, The representative phrases can be extracted from the contents of each specific field extracted from documents, predetermined document sets, specific field contents, or a predetermined document set in the same manner as in FIG.

이어, 대표 어구 추출 목표 문건 집합별로 대표 어구가 추출되었다면, 상기 대표 어구는 상기 대표 어구 추출 목표 문건 집합 단위 또는 상기 대표 어구 추출 목표 문건 집합을 이루는 개별 문건 단위로 저장될 수 있다. 즉, 하나의 대표 어구 추출 목표 문건 집합 단위 또는 상기 개별 문건에는 적어도 하나 이상의 대표 어구가 대응될 수 있다. 상기 대표 어구는 상기 대표 어구 추출 목표 문건 집합의 개별 필드 단위 또는 상기 대표 어구 추출 목표 문건 집합을 이루는 개별 문건의 개별 필드 단위로 적어도 하나 이상이 대응될 수 있을 것이다.If representative phrases are extracted for each set of representative phrase extraction target documents, the representative phrases may be stored in units of the representative phrase extraction target document set or individual document units constituting the representative phrase extraction target document set. That is, at least one representative phrase may correspond to one representative phrase extraction target document set unit or the individual document. The representative word may be corresponded to at least one of individual field units of the representative word extraction target document set or individual field units of the individual documents constituting the representative word extraction target document set.

상기 본 발명의 대표 어구 추출 엔진은 어구에 대해서 문건 번호 정보를 함께 생성하는 것으로 설명하였다. 이는 문건 번호가 대응되어 있는 경우, 상기 어구를 대표 어구로 하는 문건 집합을 추출하기 용이하게 하기 위한 목적이 있다. 상기 어구를 대표 어구로 하는 문건 집합을 용이하게 추출할 수 있으면, 상기 문건 집합을 분석 대상 문건 집합으로 하여 다양한 분석을 수행할 수 있다. 예를 들어, "radio frequency identification tag"라는 어구를 대표 어구로 하는 문건 집합에 대하여, 1) 국가별/연도별/출원인별로 출원수/등록수/등록율/점유율/집중율/활동율 등과 같은 다양한 분석 지표 2) 국가별/연도별/출원인별 IPC 또는 기타 특허 분류 기호의 다단계별(depth)별 분포 분석, 3) 이 문건 집합의 세부 문건 집합별(출원인별, 특허 분류 기호별) 인용, 피인용 분석 등과 같이 하나의 문건 집합이 주어졌을 때 본 발명의 분석 모듈 및 특허 인텔리전스 모듈(60)이 지원하는 각종 분석을 수행할 수 있다. 그리고, 어구에 대하여 문건 번호를 대응시켜 놓지 않고, 검색 엔진을 통해서는 특정 어구를 포함하는 문건은 추출하는 경우, 그 특정 어구가 그 문건의 대표 어구인지를 판별할 수 없는 문제가 있다.The representative phrase extraction engine of the present invention has been described as generating the document number information together with the phrase. This is for the purpose of making it easy to extract a document set having the phrase as a representative phrase when document numbers are associated. If it is possible to easily extract a document set having the representative phrase as the phrase, the document set can be subjected to various analyzes using the document set to be analyzed. For example, for a document set with the phrase "radio frequency identification tag" as a representative phrase, 1) various analyzes such as number of applications / registration number / registration rate / market share / concentration ratio / activity rate by country / (2) analysis of the distribution of IPC or other patent classification by country, year, and applicant by depth, (3) citation of this document set by detailed document set (by applicant and patent classification) Analysis and the like, it is possible to perform various analyzes supported by the analysis module and the patent intelligence module 60 of the present invention. There is a problem in that, when a document including a specific phrase is extracted through a search engine without associating a document number with the phrase, it can not be determined whether the specific phrase is a representative word of the document.

상기와 같은 대표 어구 정보 전처리 방법에 대해서는 도 105 내지 도 107에 잘 나타나 있다.The above-described representative word information preprocessing method is well illustrated in FIG. 105 to FIG.

도 105를 기준으로 상기 대표 어구 추출 전처리 모듈을 통한 대표 어구 추출을 위한 를 조합을 생성하는 방법을 간략히 설명한다.105, a method for generating a combination for extracting representative phrases through the representative phrase extraction preprocessing module will be briefly described.

상기 대표 어구 추출 전처리 모듈은 적어도 2이상의 특허 문건을 포함하는 문건 집합을 입수하고(S5520), 상기 문건 집합에서 추출한 개별 특허 문건 전체 또는 각종 필드별 내용에 대하여 형태소 분석기로 처리하여 2 이상의 어구(텀, term 또는 terminology)을 추출하며(S5530), 상기 추출된 에 대하여 기설정된 기준에 의한 2개 이상의 의 기설정된 방식으로의 가능한 모든 조합을 생성하며(S5540), 상기 생성된 의 조합을 상기 특허 문건 정보를 포함하여 DB에 저장하거나, 검색 인덱서(401-3)에 전송하는(S5550) 방식으로 대표 어구 추출을 위한 를 조합을 생성한다.The representative phrase extraction preprocessing module obtains a document set including at least two patent documents (S5520), processes the entire contents of the individual patent documents extracted from the document set or the contents of various fields by a morpheme analyzer, , term or terminology) is extracted (S5530), and all possible combinations of two or more predetermined methods in accordance with a predetermined criterion are generated for the extracted information (S5540) Information is stored in the DB or transmitted to the search indexer 401-3 (S5550).

도 106을 기준으로 상기 대표 어구 추출 전처리 모듈을 통한 필드별/부분 문건 집합별 대표 어구 추출을 위한 를 조합을 생성하고, 기설정된 조건을 충족하는 의 조합을 추출하는 방법을 간략히 설명한다.A method for generating a combination for extracting representative phrases per field / partial document set through the representative word extraction preprocessing module based on FIG. 106, and extracting a combination of combinations satisfying predetermined conditions will be briefly described.

상기 대표 어구 추출 전처리 모듈은 적어도 2이상의 특허 문건을 포함하는 문건 집합을 입수하고(S5620), The representative phrase extraction preprocessing module obtains a document set including at least two patent documents (S5620)

상기 문건 집합에서 추출한 개별 특허 문건 전체 또는 각종 필드별 내용에 대하여 형태소 분석기로 처리하여 2 이상의 (텀)을 추출하며(S5630), 상기 추출된 에 대하여 기설정된 기준에 의한 2개 이상의 의 가능한 모든 조합을 생성하고(S5640), 상기 생성된 의 조합을 상기 특허 문건 정보를 포함하여 DB에 저장하거나, 검색 인덱서(401-3)에 전송하며(S5650), 상기 문건 집합을 구성하는 각 문건의 서지 사항을 입수하여 적어도 하나 이상의 기준으로 문건 부분 집합을 생성하며(S5660), 상기 문건 부분 집합에 한정된 상기 의 조합의 개수와 상기 문건 (전체) 집합에 속한 상기 의 조합의 개수를 입수하여, 문서 전체 또는 각종 필드별로 기설정된 조건을 충족하는 의 조합을 추출한다(S5670),The morpheme analyzer processes all the individual patent documents extracted from the document set or contents of various fields to extract two or more terms (S5630), and extracts two or more possible combinations (S5640). The generated combination is stored in the DB including the patent document information or transmitted to the search indexer 401-3 (S5650), and the bibliography of each document constituting the document set (S5660), and obtains the number of the combinations defined in the document subset and the number of the combinations included in the document (total) set, A combination of a plurality of fields satisfying predetermined conditions is extracted (S5670)

도 107을 기준으로 상기 대표 어구 추출 전처리 모듈을 통한 대표 어구 추출을 위한 를 조합을 생성하고, 각 문건 집합별로 비교하는 방법을 간략히 설명한다.A method for generating a combination for extracting representative phrases through the representative phrase extraction preprocessing module based on FIG. 107, and comparing the phrases according to each document set will be briefly described.

상기 대표 어구 추출 전처리 모듈은 적어도 2이상의 특허 문건을 포함하는 복수개의 문건 집합을 입수하고(S5720), 각 문건 집합별로 상기 문건 집합에서 추출한 개별 특허 문건 전체 또는 각종 필드별 내용에 대하여 형태소 분석기로 처리하여 2 이상의 (텀)을 추출하며(S5730), 상기 추출된 의 조합을 생성하고((S5740), 상기 생성된 의 조합을 상기 특허 문건 정보를 포함하여 DB에 저장하거나, 검색 인덱서(401-3)에 전송하며(S5750), 상기 의 조합을 기설정된 기준으로 상기 복수개의 문건 집합별로 비교한다(S5760)The representative phrase extraction preprocessing module obtains a plurality of document sets including at least two patent documents (S5720). Then, the representative phrase extraction preprocessing module processes the entire individual patent documents extracted from the document set or the contents of various fields by a morpheme analyzer (S5730), the extracted combination is generated (S5740), the generated combination is stored in the DB including the patent document information, or the retrieval indexer 401-3 (S5750), and the combination is compared on the basis of the plurality of document sets on a predetermined basis (S5760)

복수 특허 분류 기호 관계성 전처리 모듈Multiple Patent Classification Relation Preprocessing Module

하나의 특허 문건에 하나의 특허 분류 기호 체계에 속하는 2개 이상의 특허 분류 기호가 부여되어 있는 문건이 다수 있다. 다. 이러한 경향그리고, 은 1) 1특허 출원 범위의 확대, 2) 발명의 고도화, 3) 기술 융합화의 경향, 4) 특허 분류 기호 부여의 정밀화 등에 따라 하나의 특허 문건에 대하여 복수개의 특허 분류 기호가 부여되는 경향은 더욱 커지고 있다. 하나의 특허 문건에 2개 이상의 특허 분류 기호가 부여 되었다는 것은 그 특허 문건에는 2개 이상의 각 특허 분류 기호에 대응되는 2개 이상의 기술적 관점, 기술적 구성, 기술적 내용, 기술적 측면, 기술적 속성, 기술적 특징 요소와 같은 것들이 포함되어 있다는 것을 의미할 수 있다. 그러므로, 2개 이상의 특허 분류 기호가 부여 되어 있는 문건은 1개의 특허 분류 기호가 부여된 문건과 달리 취급할 필요가 있게 된다.There are many documents in which one patent document has more than one patent classification symbol belonging to one patent classification system. All. This tendency can be attributed to a plurality of patent classification symbols for one patent document due to 1) enlargement of a patent application scope, 2) improvement of an invention, 3) tendency of technical convergence, and 4) Is becoming more and more popular. The fact that two or more patent classification marks are assigned to a patent document means that the patent document contains at least two technical views, technical constructions, technical contents, technical aspects, technical properties, technical features And the like may be included. Therefore, documents with two or more patent classification marks need to be treated differently from documents with one patent classification symbol.

본 발명의 복수 특허 분류 기호 관계성 전처리 모듈은 적어도 2개 이상의 특허 분류 기호가 부여된 문건만을 재료로 하여, 복수개의 특허 분류 기호간에 숨어 있는 관계성을 발견하여 활용하는 것을 목적의 하나로으로 한다. 상기 복수 특허 분류 기호 관계성 전처리 모듈은 IPC/UPC/FT 중의 어느 하나만을 기준으로 한(즉, 동일한 종류의 특허 분류 기호를 기준으로 한) 동종의 특허 분류 기호간의 숨어 있는 관계성을 발견하여, 기술의 융합에 대한 정보를 생성하는 것을 목적으로 하는 동종 복수 특허 분류 기호 관계성 전처리 모듈과, 하나의 문건에 IPC, UPC, FT 등 중에서 선택되는 2 이상의 특허 분류 기호가 포함되어 있을 때, 상기 2 이상의 특허 분류 기호간의 관계성을 파악하는 이종 복수 특허 분류 기호 관계성 전처리 모듈을 포함한다. 도 60은 상기 복수 특허 분류 기호 관계성 전처리 모듈의 구성, 동종 복수 특허 분류 기호 관계성 전처리 모듈의 구성 및 이종 복수 특허 분류 기호 관계성 전처리 모듈의 구성을 보여주고 있다.The purpose of the present invention is to discover and utilize the hidden relationships among a plurality of patent classification symbols, using only documents to which at least two patent classification symbols are assigned as a material. The plurality of patent classification symbol preliminary relations preliminary processing modules find hidden relationships among the same kind of patent classification symbols based on only one of IPC / UPC / FT (that is, based on the same kind of patent classification symbol) When a patent includes two or more patent classification symbols selected from among IPC, UPC, FT and the like in the same document, And a heterogeneous plural patent classification related relationship pre-processing module for grasping the relationship between the above patent classification symbols. FIG. 60 shows the configuration of the above-mentioned plural patent classification symbol related relation preprocessing module, the configuration of the same type plural patent classification symbol relation pre-processing module, and the configuration of the different plural patent classification symbol relation preprocessing module.

동종 복수 특허 분류 기호 관계성 전처리 모듈Equivalent multiple patent classification symbol relationship pre-processing module

이하, 본 발명의 동종 복수 특허 분류 기호 관계성 전처리 모듈에 대해서 설명한다. 어느 하나의 문건에 한 종류의 복수개의 특허 분류 기호가 부여되어 있거나, 2종 이상의 특허 분류 기호가 부여되어 있더라도, 선택된 하나의 특허 분류 기호 체계 상의 복수개의 특허 분류 기호들을 처리하는 것을 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈이 수행한다. 상기 복수개의 특허 분류 기호에 관한 처리를 담당하는 것은 본 발명의 동종 복수 특허 분류 기호 관계성 전처리 엔진이 수행하며, 상기 수행의 결과는 본 발명의 동종 대표 어구 복수 특허 분류 기호 관계 DB(207-3)에 저장된다.Hereinafter, the same kind of multiple patent classification symbol relation pre-processing module of the present invention will be described. Processing a plurality of patent classification symbols on a selected one of the patent classification schemes, even if a plurality of patent classification symbols of one kind are given to any one document or two or more kinds of patent classification symbols are given, The classifier relationship pre-processing module performs. The same kind of multiple patent classification symbol relationship preprocessing engine of the present invention is responsible for the processes related to the plurality of patent classification symbols, and the result of the above execution is the same as that of the present invention, ).

이론적으로 먼저 설명하고, 이어 구체적인 실례를 들어서 설명한다. 먼저, 가장 단순한 모델로 하나의 문건에 2개의 특허 분류 기호가 포함된 경우에 대한 복수 개의 특허 분류 기호에 대한 관계성 처리 방법을 제시한다. 하나의 특허 문건에 특허 분류 기호 A1와 특허 분류 기호 B1이 부여되어 있다고 가정하자. 이때, 특허 분류 기호 체계 상에서 A1의 직상위 노드(부모 특허 분류 기호)가 A2, A2의 직상위 노드가 A3라고 하며, 이러한 규칙이 계속될 때, A1의 상위 노드는 A2, A3, A4, ... An이 될 수 있다. 마찬가지로 B1에 대해서도, B2, B3, B4, ,Bn을 할당할 수 있게 된다. 이들 An, Bn들 중에서 가장 상위에 있는 것들은 섹션이 된다. 그리고, 섹션 위에는 그냥 IPC 등과 같은 특허 분류 기호 체계명이 있다고 하자. 이렇게 될 때, 상기 An들과 상기 Bn들은 적어도 섹션 또는 IPC라는 특허 분류 기호 체계명 하에서 적어도 하나 이상의 공통 노드를 가지게 된다.(섹션이 다른 등 가장 멀리 떨어져 있어도, IPC라는 동일 분류 기호 체계를 공유한다는 점에서는 일치한다.) 상기 공통 노드 중 상기 특허 분류 기호 체계 상에서 최하위 레벨에 있는 것을 최하위 공통 특허 분류 기호라 명명한다. 이때, A1과 B1의 최하위 공통 특허 분류 기호가 각각 Ai+1와 Bj+1라 하면, Ai+1 = Bj+1가 된다. 상기 주어진 하나의 특허 문건의 내용 상 A1과 B1이 함께 있는 경우, 상기 A1과 B1은 상기 특허 문건에 대하여 관계성을 가진다.(즉, 하나의 특허 문건에 나와 있는 발명 또는 발명들이 A1 및 B1의 속성을 독자적으로 또는 전체적으로 함께 갖고 있다는 뜻으로서의 관계성을 말한다.) 이때, 특허 분류 기호 체계의 본질 상 A1은 상기 B1의 모든 부모 노드인 B2, B3, ... Bj+)와 관계성을 가진다. 마찬가지로, B1은 A1의 모든 부모 노드인 A2, A3, ...A(i+1)와 관계성을 가진다. 이때, A1과 Bj+1의 관계성은 Bj+1와 Ai+1가 동일하 므로, 어짜피 부모 노드이므로, 그 관계성을 논할 필요가 없으며, 모든 A들과 Bj+1(=Ai+1)들의 관계성도 어짜피 부모 노드에 존재하는 것들이므로 별도로 논할 필요가 없다. 즉, 최하위 공통 부모 노드는 관계성 계산 테이블에서 고려하지 않는 것이 바람직할 것이다.Theoretically, it will be explained first, followed by concrete examples. First, we propose a relationship processing method for a plurality of patent classification codes in the case where two patent classification codes are included in one document as the simplest model. Suppose that a patent document has a patent classification code A1 and a patent classification code B1. In this case, A2, A3, A4, and A3 are the parent nodes of A1, A2, and A2, respectively, in the patent classification system. ... An. Likewise, B2, B3, B4, and Bn can be assigned to B1. The top of these An and Bn are the sections. And suppose that there is a patent classification system name just like IPC on the section. When this happens, the An and Bn have at least one common node under at least a section or a patent classification scheme scheme name IPC. (Even if the sections are far apart, they share the same classification scheme called IPC. The common node among the common nodes at the lowest level in the patent classification scheme is called the lowest common patent classification symbol. At this time, Ai + 1 = Bj + 1, where Ai + 1 and Bj + 1 are the lowest common patent classification symbols of A1 and B1, respectively. If A1 and B1 are present together in the content of the given patent document, A1 and B1 are related to the patent document. (That is, (B 1, B 3,..., B j +), which are all parent nodes of the above B1, in the nature of the patent classification system. Similarly, B1 has a relationship with all the parent nodes of A1, A2, A3, ..., A (i + 1). In this case, since the relationship between A1 and Bj + 1 is the same as Bj + 1 and Ai + 1, it is not necessary to discuss the relationship between them and since Bj + 1 (= Ai + 1) Since relationships exist in parent nodes, there is no need to discuss them separately. That is, it is desirable that the lowest common parent node is not considered in the relationship calculation table.

이때, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 복수개의 특허 문건 기호를 처리할 때, 상기 특허 분류 기호 체계(트리 구조) 데이터를 참조하여, 하기 표 11과 같은 데이터를 만들 수 있다. 이때, Ai+1과 Bj+1은 실제 테이블에서는 필요치 않는 셀들(Ai까지와 Bj까지만이 관계성 테이블에 필요함)이나, 설명의 편의(그 셀들에는 값이 없음을 보여 주기 위함)를 위해서 테이블에 표시한다. 즉, 실제 표에서는 Ai, Bj까지만으로 테이블이 구성된다.At this time, when processing the plurality of patent document symbols, the pre-processing module for related species plural patent classification symbol may refer to the patent classification scheme (tree structure) data to produce data as shown in Table 11 below. In this case, Ai + 1 and Bj + 1 do not need cells in the actual table (only Ai and Bj are required for the relationship table) or for convenience of explanation (to show that the cells have no value) Display. That is, in the actual table, the table is composed of only Ai and Bj.

[표 11][Table 11]

A1A1 A2A2 AiAi Ai+1Ai + 1 B1B1 1One 1One 1One 1One B2B2 1One 1One 1One 1One 1One 1One 1One 1One BjBj 1One 1One 1One 1One Bj+1=Ai+1Bj + 1 = Ai + 1

즉, 상기 A1 ~ Ai와 B1 ~ Bj사이의 모든 교차되는 쌍(예를 들면 (A1, B1), ... (Ai, Bj) 등)은 상기 주어진 특허 문건을 매개로 하여 관련성 가진다고 할 수 있다.That is, all the intersecting pairs (A1, B1, ..., Ai, Bj, etc.) between the A1-Ai and B1-Bj can be said to have relevance through the given patent document .

대한민국 특허출원 제10-2005-0111868호에는 2006년 01월 자로 H04B 7/26 및 H04B 7/15가 부여되어 있다. 이를 통해서 예시적으로 설명한다. H04B 7/26의 부모는 차례로 H04B 7/24, H04B 7/00, H04B, H04, H가 된다. H04B 7/15의 부모는 차례 로, H04B 7/14, H04B 7/00이 된다. 이 경우, 최하위 공통 특허 분류 기호는 H04B 7/00이 된다. 그러므로, 이를 표로 만들면 하기 표 12과 같다. H04B 7/00과 관련된 셀이 그 셀값이 아무 값도 없음은 위에서 설명한 바와 같다.Korean Patent Application No. 10-2005-0111868 discloses H04B 7/26 and H04B 7/15 as January 2006. This is illustrated by way of example. The parent of H04B 7/26 is H04B 7/24, H04B 7/00, H04B, H04, H in turn. The parent of H04B 7/15 is, in turn, H04B 7/14, H04B 7/00. In this case, the lowest common patent classification code is H04B 7/00. Therefore, the table is shown in Table 12 below. The cell associated with H04B 7/00 has no cell value as described above.

[표 12][Table 12]

H04B 7/26H04B 7/26 H04B 7/24H04B 7/24 H04B 7/00H04B 7/00 H04B 7/15H04B 7/15 1One 1One H04B 7/14H04B 7/14 1One 1One H04B 7/00H04B 7/00

그러므로, 대한민국 특허출원 제10-2005-0111868호를 기준으로, 9개의 교차되는 특허 분류 기호의 관련성 쌍이 생성되게 된다. 이때, 하나의 문건에 대하여 2 이상의 특허 분류 기호가 존재할 때, 맨 앞에 나오는 것을 주 특허 분류 기호라 하므로, 본 문건에서는 H04B 726이 주 특허 분류 기호가 되고, 상기 주 특허 분류 기호의 부모 노드에 있는 다른 상위 특허 분류 기호도 주어진 특허 문건에서 역시 주 특허 분류 기호로 취급된다. 상기 주 특허 분류 기호가 문제가 되는 것은 하나의 문건에 특허 분류 기호가 3개 이상 있을 때의 취급과 관련하여 중요한 역할을 한다. 특허 분류 기호가 3개 이상 있을 때는 아래와 같이 특허 분류 기호를 처리할 수 있다.Therefore, based on Korean Patent Application No. 10-2005-0111868, nine pairs of cross-related patent classification symbols are generated. In this case, when there are two or more patent classification symbols for one document, the first one is the main patent classification symbol. Therefore, in this document, H04B 726 becomes the main patent classification symbol, Other top patent classification codes are also treated as main patent classification codes in the given patent documents. The problem of the main patent classification symbol plays an important role in relation to handling when there are three or more patent classification symbols in one document. When there are three or more patent classification symbols, the following patent classification symbols can be processed.

먼저, 모든 특허 분류 기호를 동등하게 취급하는 방법이다. 특허 분류 기호가 n개가 있을 경우, 상기 n개의 특허 분류 기호들 중에서 2개의 특허 분류 기호의 조합을 취하는 방법은 수학적으로 조합 이론에 따라 nC2개가 된다. 이들 nC2개의 각 특허 분류 기호의 조합에 대하여, 상기 특허 분류 기호가 2개일 때의 처리를 수행하고, 그 결과로서 nC2개의 테이블(table)을 얻으며, 각 테이블에는 모든 교차되 는 쌍에 관한 정보를 갖고 있다.First, it treats all patent classification equally. When there are n patent classification symbols, the method of taking the combination of two patent classification symbols out of the n patent classification symbols mathematically becomes nC2 according to the combination theory. For each combination of the two nC2 patent classification symbols, the processing when the patent classification symbol is two is performed, and as a result, nC2 tables are obtained. In each table, information about all intersected pairs is obtained I have.

다음으로, 주 특허 분류 기호를 정하고, 상기 주 특허 분류 기호를 취하고, 다른 1개는 n-1개의 부 특허 분류 기호에서 취하여 n-1개의 조합을 취한 다음에 각 조합에 대하여 n-1개의 테이블을 얻고, 각 테이블에는 먼저, 모든 특허 분류 기호를 동등하게 취급하는 방법이다. 특허 분류 기호가 n개가 있을 경우, 상기 n개의 특허 분류 기호들 중에서 2개의 특허 분류 기호의 조합을 취하는 방법은 수학적으로 조합 이론에 따라 nC2개가 된다. 이들 nC2개의 각 특허 분류 기호의 조합에 대하여, 상기 특허 분류 기호가 2개일 때의 처리를 수행하고, 그 결과로서 nC2개의 테이블(table)을 얻으며, 각 테이블에는 모든 교차되는 쌍에 관한 정보를 갖고 있다. 이때, 주 특허 분류 기호와 관련되는 교차 쌍에 대해서는 주 특허 분류 기호와 관련된 가중치를 부여할 수 있을 것이다. 그리고, n-1개의 부 특허 분류 기호에 대하여 n-1C2개의 조합을 얻고, 각 조합에 대하여 상기와 같은 처리를 하여 모든 교차되는 쌍에 관한 정보를 얻는다. 이때, 부 특허 분류 기호와 관련되는 교차 쌍에 대해서는 부 특허 분류 기호와 관련된 가중치를 부여할 수 있을 것이다.Next, the main patent classification symbol is determined, the above-mentioned main patent classification symbol is taken, the other one is taken from n-1 patent classification symbols to obtain n-1 combinations, and then n-1 tables , And each table is first treated as an equivalence of all patent classification codes. When there are n patent classification symbols, the method of taking the combination of two patent classification symbols out of the n patent classification symbols mathematically becomes nC2 according to the combination theory. As for the combination of the two nC2 patent classification symbols, the processing when two patent classification symbols are performed, and as a result, nC2 tables are obtained, and each table has information on all intersecting pairs have. At this time, weights related to the main patent classification symbol may be assigned to the crossing pair related to the main patent classification symbol. Then, n-1C2 combinations are obtained for n-1 patent classification symbols, and the same process is performed for each combination to obtain information on all intersecting pairs. At this time, weights related to the patent classification code may be assigned to the crossing pair related to the patent classification code.

대한민국 특허출원 제10-2006-0012606호에는 2006년 01월 자로 부여된 H04B 7/04, H04B 7/155, H04Q 7/30 특허 분류 기호가 있다. 상기 H04B 7/04의 부모 노드는 각각 H04B 7/02, H04B 7/00, H04B, H04, H가 있으며, H04B 7/155의 부모 노드는 각각 H04B 7/15, H04B 7/14, H04B 7/00, H04B, H04, H가 있으며, H04Q 7/30의 부모 노드는 각각 H04Q 7/20, H04Q 7/00, H04Q, H04, H가 있다. 이때, 최하위 공통 특허 분류 기호는 H04B 7/04와 H04B 7/155의 조합에 대한 H04B 7/00이며, H04B 7/155와 H04Q 7/30의 조합에 대해서는 H04이며, H04B 7/04와 H04Q 7/30에 대해서는 H04가 된다.Korean Patent Application No. 10-2006-0012606 discloses H04B 7/04, H04B 7/155, and H04Q 7/30 patent classification codes given as January, 2006. H04B 7/02, H04B 7/00, H04B, H04, and H are the parent nodes of H04B 7/04 and H04B 7/15, H04B 7/14 and H04B 7 / 00, H04B, H04, and H, and the parent nodes of H04Q 7/30 are H04Q 7/20, H04Q 7/00, H04Q, H04, and H, respectively. At this time, the lowest common patent classification code is H04B 7/00 for the combination of H04B 7/04 and H04B 7/155, H04 for the combination of H04B 7/155 and H04Q 7/30, H04B 7/04 and H04Q 7/15 / 30 is H04.

이때, 모든 특허 분류 기호에 대해 동등하게 취급하면, 아래와 같은 3C2 개수의 테이블이 나온다. 먼저 표 13는 H04B 7/04와 H04B 7/155의 관계성 테이블이다.At this time, if all the patent classification symbols are treated equally, the following 3C2 number of tables are obtained. Table 13 is the relationship table between H04B 7/04 and H04B 7/155.

[표 13][Table 13]

H04B 7/04H04B 7/04 H04B 7/02H04B 7/02 H04B 7/00H04B 7/00 H04B 7/155H04B 7/155 1One 1One H04B 7/15H04B 7/15 1One 1One H04B 7/14H04B 7/14 1One 1One H04B 7/00H04B 7/00

다음으로 표 14은 H04B 7/04와 H04Q 7/30의 관계성 테이블이다.Table 14 is the relationship table between H04B 7/04 and H04Q 7/30.

[표 14][Table 14]

H04B 7/04H04B 7/04 H04B 7/02H04B 7/02 H04B 7/00H04B 7/00 H04BH04B H04H04 H04Q 7/30H04Q 7/30 1One 1One 1One 1One H04Q 7/20H04Q 7/20 1One 1One 1One 1One H04Q 7/00H04Q 7/00 1One 1One 1One 1One H04QH04Q 1One 1One 1One 1One H04H04

다음으로, 표 15은 H04B 7/155와 H04Q 7/30의 관계성 테이블이다.Next, Table 15 is a relationship table between H04B 7/155 and H04Q 7/30.

[표 15][Table 15]

H04B 7/155H04B 7/155 H04B 7/15H04B 7/15 H04B 7/14H04B 7/14 H04B 7/00H04B 7/00 H04BH04B H04H04 H04Q 7/30H04Q 7/30 1One 1One 1One 1One 1One H04Q 7/20H04Q 7/20 1One 1One 1One 1One 1One H04Q 7/00H04Q 7/00 1One 1One 1One 1One 1One H04QH04Q 1One 1One 1One 1One 1One H04H04

이때, 이어 가중치를 부여하는 방법에 대해서 설명한다. 가중치의 부여는 크게 2가지가 있다. 첫번째가 위에서 설명한 주 특허 분류 기호와의 관련성 테이블에 더 큰 가중치를 부여하고, 부 특허 분류 기호들 간의 관련성 테이블에 더 적은 가중치를 부여하는 방법이 있다. 예를 들면 주 특허 분류 기호와의 관련성 테이블에는 1을 부여하고, 부 특허 분류 기호와의 관련성 테이블에는 0.5를 부여하는 방법이 있을 것이다.At this time, a method of assigning the following weights will be described. There are two types of weighting. There is a first way to assign a larger weight to the relevance table with the main patent classification symbol described above, and to give less weight to the relevance table between the patent classifications. For example, there is a method of assigning 1 to the relevance table with the main patent classification symbol and 0.5 to the relevance table with the patent classification symbol.

이렇게 되면 다음으로, H04B 7/155와 H04Q 7/30의 관계성 테이블은 다음 표 16와 같이 될 것이다.Next, the relationship tables of H04B 7/155 and H04Q 7/30 will be as shown in Table 16 below.

[표 16][Table 16]

H04B 7/155H04B 7/155 H04B 7/15H04B 7/15 H04B 7/14H04B 7/14 H04B 7/00H04B 7/00 H04BH04B H04H04 H04Q 7/30H04Q 7/30 0.50.5 0.50.5 0.50.5 0.50.5 0.50.5 H04Q 7/20H04Q 7/20 0.50.5 0.50.5 0.50.5 0.50.5 0.50.5 H04Q 7/00H04Q 7/00 0.50.5 0.50.5 0.50.5 0.50.5 0.50.5 H04QH04Q 0.50.5 0.50.5 0.50.5 0.50.5 0.50.5 H04H04

두번째는, 복수개의 특허 분류 기호를 가지고 있는 특허 문건에 대한 가중치를 균등하게 부여하는 방법이 있을 것이다. 예를 들면 2개의 특허 분류 기호를 가지고 있을 때는 각 특허 분류 기호의 관련성 쌍의 값이 1이 된다고 할 때, n개의 특허 분류 기호를 가지고 있는 특허 분류 기호의 쌍에 대해서는 그 관련성의 쌍의 값을 가능한 조합의 개수인 nC2를 나눈 값을 균등하게 할당할 수도 있다. 즉, 가중치를 1/(nC2)로 할 수도 있을 것이다. 예를 들면, 이와 같이 할 때, 상기, H04B 7/155와 H04Q 7/30의 관계성 테이블은 다음 표 17과 같은 것이다.Secondly, there will be a way to equally weight the patent documents with multiple patent classification symbols. For example, if we have two patent classification codes, and the value of the pair of relevance of each patent classification is 1, then for pairs of patent classification codes having n patent classification codes, The value obtained by dividing nC2, which is the number of possible combinations, may be evenly distributed. That is, the weight may be 1 / (nC2). For example, when doing so, the relationship tables of H04B 7/155 and H04Q 7/30 are as shown in Table 17 below.

[표 17][Table 17]

H04B 7/155H04B 7/155 H04B 7/15H04B 7/15 H04B 7/14H04B 7/14 H04B 7/00H04B 7/00 H04BH04B H04H04 H04Q 7/30H04Q 7/30 1/31/3 1/31/3 1/31/3 1/31/3 1/31/3 H04Q 7/20H04Q 7/20 1/31/3 1/31/3 1/31/3 1/31/3 1/31/3 H04Q 7/00H04Q 7/00 1/31/3 1/31/3 1/31/3 1/31/3 1/31/3 H04QH04Q 1/31/3 1/31/3 1/31/3 1/31/3 1/31/3 H04H04

세번째는 3개 이상의 특허 분류 기호를 가지는 문건에 대하여 주 특허 분류 기호와 관련된 쌍과, 부 특허 분류 기호와 관련된 쌍에 다른 가중치를 부여하는 방법이 있을 것이다. 예를 들어 n개의 특허 분류 기호가 있을 때, 주 분류와 관계되는 조합의 개수는 (n-1)이 되므로, 주 분류 기호와의 관련된 조합에 전체의 75%를 할당하고, 이들과 관련된 관련 쌍에는 0.75*1/(n-1)의 가중치를 부여할 수 있다. 부 특허 분류 기호들과 관련되는 모든 조합에 25%의 가중치를 부여할 때, 부 특허 분류 기호들과 관련되는 모든 쌍에는 0.25*1/((n-1)C2)의 가중치를 부여할 수 있을 것이다. 이때, 상기 %의 배분은 기설정된 값으로 조절할 수 있을 것이다. 예를 들면 n> 5일 때, 주 특허 분류 기호와 관련된 것들에 50%를, 부 특허 분류 기호와 관련된 것들에 50%를 배당할 수 있을 것이며, n= 4일 때는 60% 대 40%를, n=3일 때는 75% 대 25%를 할당할 수 있을 것이다. 즉, IPC 의 개수가 3보다 클 때에는 (sub IPC, Sub IPC) 특허 분류 기호 조합 쌍이 생기므로, (main IPC, Sub IPC) 특허 분류 기호 조합 쌍과 (Sub IPC, Sub IPC) 특허 분류 기호 조합 쌍에 대해 다른 가중치를 줄 수 있을 것이다.Third, there will be a way to assign different weightings to the pair associated with the main patent classification symbol and the pair associated with the patent classification symbol for documents with more than three patent classification symbols. For example, when there are n patent classification symbols, the number of combinations associated with the main classification is (n-1), so 75% of the total is assigned to the relevant combination with the main classifier, A weight of 0.75 * 1 / (n-1) can be given. When assigning a weight of 25% to all combinations associated with the patent classifiers, all pairs associated with patent classifiers may be weighted 0.25 * 1 / ((n-1) C2) will be. At this time, the distribution of the percentage may be adjusted to a predetermined value. For example, when n> 5, we could allocate 50% to those related to the main patent classification and 50% to those related to the patent classification, 60% to 40% when n = 4, When n = 3, you could allocate 75% to 25%. That is, when the number of IPCs is larger than 3 (sub IPC, Sub IPC), pairs of patent classification codes are formed. Therefore, the combination of (main IPC, Sub IPC) patent classification code pair and (Sub IPC, Sub IPC) We can give different weights to

n=3일 때는 75% 대 25%를 할당하면 다음과 같은 테이블이 생성될 수 있다. 먼저 하기 표 18는 H04B 7/04와 H04B 7/155의 관계성 테이블이다.When n = 3, assigning 75% to 25%, the following table can be created. Table 18 below is the relationship table between H04B 7/04 and H04B 7/155.

[표 18][Table 18]

H04B 7/04H04B 7/04 H04B 7/02H04B 7/02 H04B 7/00H04B 7/00 H04B 7/155H04B 7/155 0.75/20.75 / 2 0.75/20.75 / 2 H04B 7/15H04B 7/15 0.75/20.75 / 2 0.75/20.75 / 2 H04B 7/14H04B 7/14 0.75/20.75 / 2 0.75/20.75 / 2 H04Q 7/00H04Q 7/00

다음으로 하기 표 19는 H04B 7/04와 H04Q 7/30의 관계성 테이블이다.Table 19 below is the relationship table between H04B 7/04 and H04Q 7/30.

[표 19][Table 19]

H04B 7/04H04B 7/04 H04B 7/02H04B 7/02 H04B 7/00H04B 7/00 H04BH04B H04H04 H04Q 7/30H04Q 7/30 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 H04Q 7/20H04Q 7/20 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 H04Q 7/00H04Q 7/00 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 H04QH04Q 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 0.75/20.75 / 2 H04H04

다음으로, 하기 표 20은 H04B 7/155와 H04Q 7/30의 관계성 테이블이다.Next, Table 20 is a relationship table between H04B 7/155 and H04Q 7/30.

[표 20][Table 20]

H04B 7/155H04B 7/155 H04B 7/15H04B 7/15 H04B 7/14H04B 7/14 H04B 7/00H04B 7/00 H04BH04B H04H04 H04Q 7/30H04Q 7/30 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 H04Q 7/20H04Q 7/20 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 H04Q 7/00H04Q 7/00 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 H04QH04Q 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 0.25/10.25 / 1 H04H04

한편, 복수개의 특허 분류 기호를 가지는 개별 문건의 기여도를 문건 단위로 동일(가령 1)이라고 했을 때, 다음과 같은 가중치 부여 방법을 생각할 수 있다. 극단적인 예로써, 대한민국 특허 출원 제 10-2005-0042032호 문건에 H04B 7/02 및 H04B 7/14와 같이 2개의 특허 분류 기호가 나와 있고, 이들이 next sibling 관계일 때(즉, H04B 7/02 및 H04B 7/14 부모 노드가 H04B 7/00으로 동일하며, 그 부모 노드가 직상위 특허 분류 기호이다) 상기 H04B 7/02 및 H04B 7/14의 관계성 테이블은 오직 1개가 나오며, 그 테이블의 셀(cell) 또한 1개이다. 상기 대한민국 특허 출원 제 10-2005-0042032호 문건에 대한 테이블은 하기 표 21과 같다.On the other hand, when the contribution of individual documents having a plurality of patent classification symbols is the same as document unit (for example, 1), the following weighting method can be considered. As an extreme example, Korean Patent Application No. 10-2005-0042032 discloses two patent classification symbols such as H04B 7/02 and H04B 7/14, and when they are in the next sibling relationship (i.e. H04B 7/02 And the H04B 7/14 parent node is the same as H04B 7/00 and its parent node is the immediate parent patent classification symbol), the relationship tables of H04B 7/02 and H04B 7/14 are only one, There is also one cell. The table for the Korean Patent Application No. 10-2005-0042032 is shown in Table 21 below.

[표 21][Table 21]

H04B 7/02H04B 7/02 H04B 7/00H04B 7/00 H04B 7/14H04B 7/14 1One H04B 7/00H04B 7/00

이때, 상기 특정한 특허 문건에 대해서 H04B 7/02 및 H04B 7/14는 강력하게 관계되어 있다고 볼 수 있다. 한편, 상기 대한민국 특허출원 제10-2006-0012606호를 기준으로 했을 경우, 3개의 테이블에, 각 테이블마다 6개, 16개 및 20개의 셀을 가지게 된다. 이때, 상기 (H04B 7/02, H04B 7/14)라는 하나의 특허 분류 기호 조합을 기준으로 할 때, 대한민국 특허 출원 제 10-2005-0042032호 문건의 기여도와 상기 대한민국 특허출원 제10-2006-0012606호의 기여도를 어떻게 평가할 것인가의 문제가 발생한다. 상기 기여도의 가중치 처리는 다음과 같은 방법이 있을 것이다.At this time, H04B 7/02 and H04B 7/14 are strongly related to the specific patent document. On the other hand, according to Korean Patent Application No. 10-2006-0012606, three tables have 6, 16, and 20 cells for each table. At this time, when one patent classification symbol combination (H04B 7/02, H04B 7/14) is referred to, the contribution of the Korean Patent Application No. 10-2005-0042032 and the Korean Patent Application No. 10-2006- There arises a problem of how to evaluate the contribution of 0012606. The weighting process of the contribution may be performed as follows.

첫째, 관계성의 존재 내지는 발생을 기준으로 테이블이나 상기 테이블에서 발생하는 셀의 개수에 무관하게 어느 문건에서나 특정 특허 분류 기호의 조합 (Ai, Bj)(예를 들면, 상기 (H04B 7/02, H04B 7/14))를 동일한 값의 기여도 가중치(예를 들면 1)로 처리할 수 있다. First, a combination of specific patent classification symbols (Ai, Bj) (for example, the above-mentioned (H04B 7/02, H04B 7/14) can be treated with the same value of contribution weight (e.g., 1).

둘째, 상기 테이블의 개수 및 셀의 개수를 고려하여, 특정 특허 분류 기호의 조합 (Ai, Bj)에 대한 각 특허 문건의 기여도 가중치를 다르게 하여 부여할 수 있을 것이다. 이때, 특정 특허 분류 기호의 조합 (Ai, Bj)에 대한 기여도 가중치를 각 특허 문건에서 발생하는 모든 테이블에 존재하는 모든 셀의 총수로 나눈 것으로 할 수 있다. 예를 들면 (H04B 7/02, H04B 7/14)에 대하여, 대한민국 특허 출원 제 10-2005-0042032호 문건에 의한 기여도 가중치는 1(=테이블 1개, 셀의 총수도 1개)이며, 대한민국 특허출원 제10-2006-0012606호의 기여도 가중치는 1/(6+16+20)(=3개의 테이블에 있는 모든 셀의 수)가 된다.Second, considering the number of the tables and the number of cells, contribution of each patent document to the combination (Ai, Bj) of specific patent classification symbols may be given different weights. At this time, the contribution weight for the combination (Ai, Bj) of the specific patent classification symbol can be obtained by dividing the contribution weight by the total number of all the cells existing in all tables generated in each patent document. For example, the contribution weight according to the Korean Patent Application No. 10-2005-0042032 is 1 (= 1 table, 1 cell) for (H04B 7/02, H04B 7/14) The contribution weight of the patent application No. 10-2006-0012606 is 1 / (6 + 16 + 20) (= the number of all cells in three tables).

셋째, 상기 기여도의 가중치를 상기 특정 특허 분류 기호의 조합 (Ai, Bj)가 나온 테이블에서의 셀의 총수로 나눈 값으로 할 수 있을 것이다. 예를 들면, 예를 들면 (H04B 7/02, H04B 7/14)에 대하여, 대한민국 특허 출원 제 10-2005-0042032호 문건에 의한 기여도 가중치는 역시 1(=테이블 1개, 셀의 총수도 1개)이며, 대한민국 특허출원 제10-2006-0012606호의 기여도 가중치는 1/(6)(=3개의 테이블에 있는 모든 셀의 수)가 된다Third, the weight of the contribution may be a value obtained by dividing the weight of the contribution by the total number of cells in the table in which the combination (Ai, Bj) of the specific patent classification symbol appears. For example, with respect to (H04B 7/02, H04B 7/14), for example, the contribution weight according to Korean Patent Application No. 10-2005-0042032 is 1 (= 1 table, And the contribution weight of Korean Patent Application No. 10-2006-0012606 is 1 / (6) (= the number of all cells in 3 tables)

넷째, 상기 둘째 또는 셋째의 기여도 가중치를 역으로 계산하는 방법이 있다. 즉, 상기 기여도의 가중치를 상기 특정 특허 분류 기호의 조합 (Ai, Bj)가 나온 테이블에서의 셀의 총수로 곱한 값으로 할 수 있을 것이다. 예를 들면, 예를 들면 (H04B 7/02, H04B 7/14)에 대하여, 대한민국 특허 출원 제 10-2005-0042032호 문건에 의한 기여도 가중치는 역시 1(=테이블 1개, 셀의 총수도 1개)이며, 대한민국 특허출원 제10-2006-0012606호의 기여도 가중치는 6(=3개의 테이블에 있는 모든 셀의 수)이 된다. 이러한 기여도 가중치 부여 방법은 이종 기술의 융합도를 계산하 는데 더욱 유망할 것이다. 즉, 테이블에 존재하는 셀의 숫자가 적은 경우에는 그 기술은 기술 분류 체계 자체에서 연관성이 강한 기술로 묶어 놓은 것일 수 있기 때문에, 하나의 테이블에서 테이블의 크기가 클수록 그 기술은 기술 분류 체계 상 멀리 떨어져 있는 기술이기 때문이다.Fourth, there is a method of calculating the second or third contribution weights inversely. That is, the weight of the contribution may be a value obtained by multiplying the weight of the contribution by the total number of cells in the table in which the combination (Ai, Bj) of the specific patent classification symbol appears. For example, with respect to (H04B 7/02, H04B 7/14), for example, the contribution weight according to Korean Patent Application No. 10-2005-0042032 is 1 (= 1 table, And the contribution weight of Korean Patent Application No. 10-2006-0012606 is 6 (= the number of all cells in 3 tables). This contribution weighting method will be more promising for calculating the convergence of heterogeneous technologies. In other words, when the number of cells existing in the table is small, the technology may be bundled with a strong association technique in the technology classification system itself. Therefore, the larger the size of a table in one table, This is because it is a technology apart.

상기와 같은 방법으로 특허 분류 기호 조합 (Ai, Bj)에 대하여 다음과 같은 정보를 획득할 수 있을 것이다.The following information about the patent classification symbol combination (Ai, Bj) can be obtained by the above method.

첫째, (H04B 7/02, H04B 7/14) (Ai, Bj)에 대한 카운팅 정보이다. 상기 카운팅 시 본 발명에서 언급하거나, 본 발명 사상에 부합하는 가중치를 부여할 경우, 상기 특허 분류 기호 조합 (Ai, Bj)에 대하여 적어도 하나 이상의 가중치가 적용된 가중치별 카운팅 정보를 획득할 수 있을 것이다. First, it is the counting information for (Ai, Bj) (H04B 7/02, H04B 7/14). When weights are given in the present invention or in accordance with the present invention at the time of counting, it is possible to obtain the weighting-based counting information to which at least one weight is applied to the patent classification symbol combination (Ai, Bj).

둘째, 모든 특허 분류 기호 조합은 그 조합이 생성될 때, 그 조합이 어느 문건에 의해서 생성되었는지에 대한 정보가 저장되어 있다. 즉, 상기 (H04B 7/02, H04B 7/14)라는 특허 분류 기호 조합은 대한민국 특허출원 제10-2006-0012606호, 및 대한민국 특허 출원 제 10-2005-0042032호 문건에 의해서 생성된 것이며, (H04B 7/15, H04B 7/24)라는 특허 분류 기호 조합은 대한민국 특허출원 제10-2005-0111868호라는 문건에 의해서 생성된 것이다. 그러므로, 특허 분류 기호 조합 (Ai, Bj)에 대해 문건 고유 번호가 대응되며, 상기 문건 고유 번호에는 그 문건 고유 번호를 가지는 문건에 대한 모든 서지 사항을 포함한 그 문건에 대한 모든 정보가 대응될 수 있다. 상기 모든 테이블에는 관련된 2개의 특허 분류 기호 조합이 존재하며, 그 특허 분류 기호 조합들이 나온 특허 문건이 대응되며, 상기 특허 문건에는 그 특허 문건의 서지 사항이 대응될 수 있다. 즉, 상기 H04B 7/155와 H04Q 7/30의 관계성 테이블에 존재하는 모든 특허 분류 기호 조합 (Ai, Bj)에 대하여, 문건 고유 번호로 대한민국 특허출원 제10-2005-0111868호가 대응된다. 그리고, 상기 문건 고유 번호에 대응되는 문건의 서지 사항 등에서 상기 문건에 대하여 출원번호는 10-2006-0012606이며, 출원인은 삼성전자이며, 발명자는 최도인과 황성택이며, 출원일은 2006년02월09일인 것과 같이 서지 사항 및 상기 문건 고유 번호에 해당하는 문건에 관한 모든 정보를 대응시킬 수 있게 된다. 물론, 상기 문건 고유 번호는 출원 번호일 수 있으며, 본 특허 출원이 미국 등에 출원되었다면 그 미국 출원의 서지 사항에도 상기 수행된 결과값을 대응(매핑)해 놓을 수 있을 것이다.Second, all patent classification combinations contain information about when the combination was created, by which document the combination was created. That is, the combination of the above-mentioned patent classification symbols (H04B 7/02, H04B 7/14) is generated by Korean Patent Application No. 10-2006-0012606 and Korean Patent Application No. 10-2005-0042032 H04B 7/15, H04B 7/24) are produced by the Korean Patent Application No. 10-2005-0111868. Therefore, a document unique number is corresponded to a patent classification symbol combination (Ai, Bj), and all the information on the document including all bibliographic references to the document having the document unique number can be corresponded to the document unique number . In all of the above tables, there are two related patent classification symbol combinations, corresponding to the patent documents from which the patent classification symbols are derived, and the bibliography of the patent documents can be corresponded to the patent documents. That is, Korean Patent Application No. 10-2005-0111868 corresponds to document identification number for all patent classification symbol combinations (Ai, Bj) existing in the relationship tables of H04B 7/155 and H04Q 7/30. In the bibliography of the document corresponding to the document identification number, the application number is 10-2006-0012606, the applicant is Samsung Electronics, the inventor is Choi and Hwang Sung Taek, and the filing date is February 09, 2006 The bibliographic information and all information related to the document corresponding to the document unique number can be corresponded. Of course, the document unique number may be an application number, and if the present patent application is filed in the US or the like, the resultant value may be mapped to the bibliography of the US application.

상기와 같은 모든 특허 분류 기호 조합 (Ai, Bj)과 문건 번호의 대응은 상기 특허 분류 기호 조합 (Ai, Bj)의 생성 시에 상기 카운팅 값과 함께 DB에 저장되는 것이 바람직할 것이다.The correspondence between all the patent classification symbol combinations (Ai, Bj) and document numbers as described above is preferably stored in the DB together with the count value at the time of generating the patent classification symbol combination (Ai, Bj).

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 입수되는 특허 문건에 대하여 특허 분류 기호가 2개 이상일 경우, 각 특허 분류 기호의 조합에 대하여 특허 분류 기호 체계(트리 구조) 데이터를 참조하여, 가 특허 분류 기호의 조합별로 기설정된 개수의 테이블을 만든다. 이때, 각 테이블에 들어가는 수치값은 기설정된 기준에 의한 가중치를 반영한 값일 수 있다. 이어, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 각 테이블에서 추출된 특허 분류 기호의 조합 (Ai, Bj)에 대하여 기설정된 기여도 가중치값을 적용하여 특허 분류 기호의 조합 (Ai, Bj)별 하기 표 22과 같은 특허 분류 기호의 조합 (Ai, Bj)별 관계성 결과 데이터를 생 성할 수 있다.The above-mentioned homogeneous plural patent classification symbol relationship pre-processing module refers to the patent classification symbol system (tree structure) data for the combination of each patent classification symbol when the patent document obtained is two or more patent classification symbols, Create a predefined number of tables for each combination of symbols. At this time, the numerical value entered into each table may be a value reflecting the weight based on the predetermined reference. Then, the related-kind plural patent classification symbol preliminary processing module applies predefined contribution weight values to the combination (Ai, Bj) of the patent classification symbols extracted from the respective tables to determine the combination of patent classification symbols (Ai, Bj) (Ai, Bj) of the patent classification symbols as shown in Table 22 below.

본 모델을 설명하기 위하여 특허 문건이 대한민국 특허출원 제10-2005-0111868호, 대한민국 특허출원 제10-2006-0012606호, 및 대한민국 특허 출원 제 10-2005-0042032호 3개만이 있다고 가정하자.(물론, 입수 되는 모든 문건에 대해서 반복적으로 처리하면 모든 문건 집합을 기준으로 한 모든 특허 분류 기호 조합 (Ai, Bj)에 대한 하기와 같은 데이터의 생성이 가능할 것이다.) 이때, 특허 분류 기호 조합 (Ai, Bj)에 다음과 같은 테이블의 생성이 가능할 것이다.To explain this model, suppose that there are only three patent documents: Korean Patent Application No. 10-2005-0111868, Korean Patent Application No. 10-2006-0012606, and Korean Patent Application No. 10-2005-0042032. Of course, it is possible to generate the following data for all patent classification combinations (Ai, Bj) based on all document sets by repeatedly processing all the documents obtained. , Bj) will be able to generate the following table.

[표 22][Table 22]

특허 분류 기호 조합 (Ai, Bj)Patent classification symbol combination (Ai, Bj) 문건 번호Article number Frequency
(단순빈도)Frequency
(Simple frequency) 1/nC2 균등할당1 / nC2 Equal allocation (M,S), (S,S) 차등할당(M, S), (S, S) differential allocation (H04B 7/15 ,H04B 7/26)(H04B 7/15, H04B 7/26) 10-2005-011186810-2005-0111868 1One 1One 1One (H04B 7/15,H04B 7/24)(H04B 7/15, H04B 7/24) 10-2005-011186810-2005-0111868 1One 1One 1One (H04B 7/14,H04B 7/26)(H04B 7/14, H04B 7/26) 10-2005-011186810-2005-0111868 1One 1One 1One (H04B 7/14,H04B 7/24)(H04B 7/14, H04B 7/24) 10-2005-011186810-2005-0111868 1One 1One 1One (H04B 7/155,H04B 7/04)(H04B 7/155, H04B 7/04) 10-2006-001260610-2006-0012606 1One 1/31/3 0.75/20.75 / 2 (H04B 7/155,H04B 7/02)(H04B 7/155, H04B 7/02) 10-2006-001260610-2006-0012606 1One 1/31/3 0.75/20.75 / 2 (H04B 7/15,H04B 7/04)(H04B 7/15, H04B 7/04) 10-2006-001260610-2006-0012606 1One 1/31/3 0.75/20.75 / 2 (H04B 7/15,H04B 7/02)(H04B 7/15, H04B 7/02) 10-2006-001260610-2006-0012606 1One 1/31/3 0.75/20.75 / 2 (H04B 7/14,H04B 7/04)(H04B 7/14, H04B 7/04) 10-2006-001260610-2006-0012606 1One 1/31/3 0.75/20.75 / 2 (H04B 7/14,H04B 7/02)(H04B 7/14, H04B 7/02) 10-2006-001260610-2006-0012606 22 1/31/3 0.75/20.75 / 2 (H04B 7/14,H04B 7/02)(H04B 7/14, H04B 7/02) 10-2005-004203210-2005-0042032 22 1One 1One 1One 1/31/3 .... 1One 1/31/3 .... (H04Q 7/00, H04B 7/00)(H04Q 7/00, H04B 7/00) 10-2006-001260610-2006-0012606 1One 1/31/3 0.25/10.25 / 1 (H04Q 7/00, H04B)(H04Q 7/00, H04B) 10-2006-001260610-2006-0012606 1One 1/31/3 0.25/10.25 / 1 (H04Q,H04B)(H04Q, H04B) 10-2006-001260610-2006-0012606 1One 1/31/3 0.25/10.25 / 1

(n은 특허분류 기호 수), (M,S)는 (main IPC, Sub IPC)를 (S,S)는 (sub IPC, Sub IPC) 를 말하며, n>2 이상일 경우, (M,S), (S,S) 차등할당은 (M,S)에 대해서는 75%/(n-1)의 가중치를, (S,S)에 대해서는 25%/(n-1)C2의 가중치를 부여하였다. n=2 일 때는 그냥 1을 부여하였다. S is the sub IPC and S is the sub IPC. If n> 2, (M, S), (M, S) , The (S, S) differential allocation is given a weight of 75% / (n-1) for (M, S) and a weight of 25% / (n-1) C2 for (S, S). When n = 2, 1 was assigned.

이때, 상기 (H04B 7/14,H04B 7/02) 특허 분류 기호 조합 쌍에 주목하면, 문건 번호가 2개가 대응되어 있음을 알 수 있다. 즉, 상기 (H04B 7/14,H04B 7/02) 특허 분류 기호 조합 쌍은 2개의 문건에서 동시에 출현했음을 의미하며, Frequency(단순 빈도)의 값은 2가 된다. 1/nC2 균등할당과 (M,S), (S,S) 차등할당 등 기설정된 방식의 가중치 또는 기여도 가중치를 부여할 수 있을 것이다. At this time, paying attention to the pair of patent classification symbols (H04B 7/14, H04B 7/02), it can be seen that two document numbers correspond to each other. That is, the pair (H04B 7/14, H04B 7/02) of the patent classification symbol pair appeared at the same time in two documents, and the value of frequency (simple frequency) becomes 2. 1 / nC2 equalization, and (M, S), (S, S) differential allocation.

상기 표 22는 특허 문건이 대한민국 특허출원 제10-2005-0111868호, 대한민국 특허출원 제10-2006-0012606호, 및 대한민국 특허 출원 제 10-2005-0042032호 3개에 대해서 특허 분류 기호의 조합 (Ai, Bj)에 대하여 관계성 데이터를 생성한 것이다. 상기 방식을 입수되는 모든 문건에 대하여 반복적 또는 재귀적으로 적용할 수 있음은 당업자에게 자명할 것이며, 상기 기설정된 가중치 또는 기여도 가중치 중 어느 하나 이상을 기준으로, 각 특허 분류 기호의 조합 (Ai, Bj)에 대하여 관계성 데이터를 생성할 수 있음도 자명할 것이다.Table 22 shows the combinations of the patent classification symbols for the three patent documents, Korean Patent Application No. 10-2005-0111868, Korean Patent Application No. 10-2006-0012606, and Korean Patent Application No. 10-2005-0042032 Ai, Bj). It will be apparent to those skilled in the art that it is possible to apply it repeatedly or recursively to all documents obtained by the method, and based on any one or more of the preset weights or contribution weights, the combination of each patent classification symbol (Ai, Bj It will be appreciated that it is possible to generate the relationship data for the < RTI ID = 0.0 >

이상과 같이 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 임의의 (Ai, Bj)에 대하여 1) 이 (Ai, Bj)값이 나온 문건 번호, 2) 특정한 상황i별 가중치 정책j에 따르는 가중치(Wij), 3) (M,S)인지 (S,S)인지의 여부, 4) 기타 (Ai, Bj)에 대한 속성 정보 중 어느 하나 이상을 DB에 저장할 수 있다. As described above, the homogeneous plural patent classification symbol relation pre-processing module is a module for calculating a weighted value according to a weighting policy j for each specific situation i (1), (2), Wij), 3) whether it is (M, S) or not (S, S), and 4) other attribute information about Ai and Bj.

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈이 복수 개의 특허 분류 기호를 처리하는 방법은 IPC에 한정되지 않음은 당연하다 할 것이다. 즉, 상기 IPC에서 수행된 것과 완전히 동일하게 미국 특허 문건에 대해서는 USPC를, 일본 특허 문건에 대해서는 FT와 FI, ECLA를 대상으로 하여 복수 개의 특허 분류 기호를 처리할 수 있을 것이다.It will be appreciated that the method of processing the plurality of patent classification symbols by the same kind of multiple patent classification symbol relationship preprocessing module is not limited to IPC. That is, it is possible to process a plurality of patent classification symbols targeting USPC for US patent documents and FT, FI, and ECLA for Japanese patent documents exactly the same as those performed at the IPC.

기술 융합 패턴 분석 모듈Technology convergence pattern analysis module

이때, 상기 특허 분류 기호의 조합 (Ai, Bj)에 대하여 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈이 저장한 1) (Ai, Bj)값이 나온 문건 번호, 2) 특정한 상황i별 가중치 정책j에 따르는 가중치(Wij), 3) (M,S)인지 (S,S)인지의 여부, 4) 기타 (Ai, Bj)에 대한 속성 정보 중 어느 하나 이상을 처리하면 다음과 같은 다음과 같은 정보들을 계산, 입수 또는 추출할 수 있을 것이다. 하기와 같은 정보는 본 발명의 기술 융합 패턴 분석 모듈이 수행한다.(Ai, Bj) values stored by the pre-processing module of the same kind of multiple patent classification symbol relation with respect to the combination (Ai, Bj) of the patent classification symbols, (2) a weight number policy j 4) Whether or not at least one of the weight information (Ai, Bj) and the weight information (Ai, Bj) according to the following information , Or extract them. The following information is performed by the technology fusion pattern analysis module of the present invention.

상기 기술 융합 패턴 분석 모듈이 처리할 수 있는 정보는 본질적으로 2가지이다.There are essentially two pieces of information that the technology fusion pattern analysis module can process.

첫째, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 각 특허 분류 기호 쌍의 출현 회수를 입수할 수 있다. 예를 들면, (H04B 7/26, H04B 7/15)는 대한민국 특허출원 제10-2005-0111868호에 1회 출현한 반면, (H04B 7/14,H04B 7/02)는 10-2006-0012606와 10-2005-0042032양 특허 공보에 모두 출현하였다. (주 특허 분류 기호가 아닐 시 또는 모든 특허 분류 기호를 동등하게 취급할 경우에는 (Ai, Bj) = (Bj, Ai)로 취급될 수 있다. 어느 하나가 주특허 분류 기호이면 (Ai, Bj)와 (Bj, Ai)를 다르게 취급할 수도 있으며, 동일하게 취급할 수도 있을 것이다. ) 그러므로, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 모든 특허 분류의 쌍에 대하여 그 쌍이 나온 회수를 기록하여 놓을 수 있다. 또한, 상기 쌍이 나온 특허 문건들에 대한 특허 문건 번호를 저장해 놓을 수 있을 것이다. First, the homogeneous multiple patent classification symbol relationship pre-processing module can obtain the number of appearance of each patent classification symbol pair. For example, (H04B 7/26, H04B 7/15) appeared once in Korean patent application No. 10-2005-0111868, while (H04B 7/14, H04B 7/02) appeared in 10-2006-0012606 And 10-2005-0042032, respectively. (Ai, Bj) = (Bj, Ai) if not the main patent classification symbol or treat all patent classification symbols equally. If one of them is the main patent classification symbol (Ai, Bj) And (Bj, Ai) may be handled differently and treated the same.) Therefore, the above-mentioned homogeneous plural patent classification related relationship pre-processing module records the number of pairs of all patent classification pairs . In addition, the patent document numbers for the patent documents that are derived from the pair will be stored.

둘째, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 특정 특허 분류 기호가 주어질 때, 상기 특정 특허 분류 기호와 가장 빈번하게 결합하는 특허 분류 기호를 찾을 수 있을 것이다. 이는 상기 주어진 특정 특허 분류 기호가 들어가는 모든 쌍과, 그 쌍이 나온 회수를 카운팅함으로써 용이하게 획득할 수 있을 것이다. 이때, 기설정된 가중치 또는 기여도 가중치를 적용하여, 상기 특정 특허 분류 기호와 가장 빈번하게 결합하는 특허 분류 기호를 찾을 수 있을 것이며, 아울러, 상기 특허 분류 기호 체계(트리 구조)를 참조하여, 기설정된 레벨 단위에서 가장 빈번하게 결합하는 특허 분류 기호를 찾을 수 있을 것이다. 즉, 예를 들면, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 H04B 7/26와 가장 빈번하게 결합하는 특허 분류 기호를 찾을 수 있을 것이며, 상기 IPC sub class, IPC main group, 또는 IPC subgroup 수준에서 상기 H04B 7/26와 가장 빈번하게 결합하는 특허 분류 기호를 찾을 수 있을 것이다.Second, the homogeneous plural patent classification symbol preliminary processing module may find the patent classification symbol most frequently combined with the specific patent classification symbol when the specific patent classification symbol is given. This may be easily achieved by counting all the pairs into which the given patent classification symbol is entered and the number of times the pair has appeared. At this time, it is possible to find a patent classification symbol that most frequently combines with the specific patent classification symbol by applying a predetermined weight or contribution weight. In addition, referring to the patent classification scheme (tree structure) You will find the patent classification code that most frequently combines in units. That is, for example, the homogeneous multiple patent classification symbol relationship pre-processing module will find the patent classification symbol most frequently associated with H04B 7/26, and it can be found at the IPC sub class, IPC main group, or IPC subgroup level You will find the patent classification code most frequently associated with H04B 7/26.

셋째, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 특허 분류 기호 쌍과 상기 특허 문건 번호의 대응 관계를 통하여, 상기 특허 분류 기호 쌍과 상기 특허 문건 번호에 대응되는 서지 사항을 대응시켜 다양한 분석 결과를 생성할 수 있다. 상기 분석 결과에는 상기 특허 분류 기호 쌍과 1) 국가, 2) 기간, 3) 출원인, 4) 발명자 및 5) 대리인 중에서 선택되는 적어도 하나 이상의 조합과의 분석 결과를 획득할 수 있을 것이다. 예를 들면, 삼성전자의 대한민국 특허출원 2000년도부터의 2005년도까지에서 가장 많이 결합된 특허 분류 기호 쌍을 알 수 있을 것 이다. 나아가, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 특허 분류 기호가 주어질 때, 상기 특허 분류 기호와 가장 잘 결합하는 특허 분류 기호를 찾을 수 있을 것이다. 아울러, 상기 특허 분류 기호가 주어질 때, 상기 특허 분류 기호와 가장 잘 결합하는 특허 분류 기호를 상기 1) 국가, 2) 기간, 3) 출원인, 4) 발명자 및 5) 대리인 중에서 선택되는 적어도 하나 이상의 조합으로 이루어지는 문건 집합에서도 찾을 수 있을 것이다. 예를 들면, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 H04B 7/26와 가장 다수가 결합하는 특허 분류 기호 체계 상에서의 각 단계별 특허 분류 기호를 찾을 수 있을 것이며, H04B 7/26에 대하여 삼성전자 주식회사의 미국 특허출원 2003년도부터의 2005년도까지에서도 찾을 수 있을 것이다. 이를 통해, 상기 특정 기간 동안의 특정 주체에 대한 특허 명세서에 나타난 기술 융합의 정도를 파악할 수 있을 것이다.Third, the homogeneous plural patent classification symbol relation pre-processing module associates the pair of patent classification symbols with the bibliography corresponding to the patent document number through the correspondence relationship between the pair of patent classification symbols and the patent document number, Lt; / RTI > The analysis result may be obtained by analyzing at least one or more combinations selected from the pair of patent classification symbols, 1) country, 2) period, 3) applicant, 4) inventor and 5) agent. For example, Samsung Electronics' patent application in Korea from 2000 to 2005 will be able to identify the most combined pairs of patent classification codes. Further, the homogeneous multiple patent classification related relationship pre-processing module may find a patent classification symbol best associated with the patent classification symbol given the patent classification symbol. In addition, given the patent classification symbol, the patent classification symbol that best combines the patent classification symbol with the patent classification symbol is referred to as 1) country, 2) period, 3) applicant, 4) inventor and 5) Can be found in the document set. For example, the homologous multiple patent classification symbol relationship preprocessing module will find the patent classification symbol for each step on the patent classification system which is most combined with H04B 7/26, and H04B 7/26, Samsung Electronics The patent application filed by the corporation from 2003 to 2005 can be found. Through this, the degree of the technical convergence shown in the patent specification for a specific subject during the specific period can be grasped.

넷째, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 주어진 임의의 특허 분류 기호의 조합 (Ai, Bj)에 대하여, 상기 특허 분류 기호의 조합 (Ai, Bj)에 관계된 특별한 특허 문건 집합을 대응시킬 수 있을 것이다. 이때, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈이 상기 대응되는 특허 문건 집합에 대한 각종 분석을 수행하면, 상기 수행된 분석 결과는 상기 특허 분류 기호의 조합 (Ai, Bj)에 대한 분석 결과가 된다. 상기 특별한 특허 문건 집합에는 예시적으로 1) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호로 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 2) 특정 국가 DB에서 특정 출원인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 3) 특정 국가 DB에서 특정 출원인의 특허 문건에 발명자로 포함된 특정 발명자 명의(즉, 출원인 명의 and 발명자 명의)에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 4) 특정 국가 DB에서 특정 대리인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 5) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호 및 특정 출원인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 6) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호, 특정 출원인 명의 및 특정 발명자 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 7) 상기 특정 국가 DB에서 특정 출원인 명의 및 특정 대리인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 8) 특정 국가의 모든 특허 문건 집합, 9) 적어도 2 이상의 국가의 모든 특허 문건 집합, 10) 상기 1) 내지 9)에 기설정된 기간 단위로 특정되는 특허 문건 집합, 11) 상기 1) 내지 9)에 등록 여부, 심사청구 여부 등과 특정 조건을 결부하는 것으로 생성되는 특허 문건 집합에 대하여 사전에 계산해 놓을 수 있을 것이다. 물론, 사용자가 특정하거나 생성한 문건 집합에 대해서도 상기와 같은 계산을 수행해 놓을 수 있을 것이다.Fourth, the homogeneous plural patent classification symbol preliminary processing module can correspond to a set of patent documents related to the combination (Ai, Bj) of the patent classification symbol for a given arbitrary combination of patent classification symbols (Ai, Bj) There will be. At this time, if the related-kind plural patent classification symbol preliminary processing module performs various analyzes on the corresponding set of patent documents, the analysis result is an analysis result on the combination (Ai, Bj) of the patent classification symbols . The set of special patent documents includes, for example, 1) a set of patent documents generated by a method of specifying a set of patent documents with a specific patent classification symbol in a specific patent classification system in a specific country DB, 2) (3) a set of patent documents by the name of a specific inventor (ie, applicant's name and inventor's name) included in the patent document of a specific applicant in a specific country DB as an inventor 4) a set of patent documents generated by a method of specifying a set of patent documents by a specific agent name in a specific country DB, 5) a set of patent documents generated by a specific country DB By a method of specifying a set of patent documents by a specific patent classification mark and a specific applicant's name 6) a set of patent documents generated by a method of specifying a set of patent documents by a specific patent classification mark on the specific patent classification system, a specific applicant's name and a specific inventor's name in a specific country DB, 7) 8) a set of patent documents generated by a method of specifying a set of patent documents by a specific applicant's name and a specific agent's name in a particular country DB, 9) a set of all patent documents in a particular country, 9) 10) a set of patent documents specified in the above-mentioned 1) to 9), 11) a set of patent documents generated by associating the specific conditions with the registration of the patent application in 1) to 9) . Of course, the same calculations can be performed on the set of documents that the user has specified or created.

또한, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 1) 내지 상기 11)로 특정되는 어느 하나 이상의 문건 집합에 대하여, 상기 문건 집합을 구성하는 특허 문건 중에서, 특허 분류 기호가 2 이상이 부여되어 있는 문건들만을 추출해서, 그 추출된 문건 집합을 대상으로만, 상기 특허 분류 기호의 조합 (Ai, Bj)을 입수할 수 있다. 상기 입수된 특허 분류 기호의 조합 (Ai, Bj)에 대하여, 상기 표와 같은 관계성 데이터를 생성하고, 상기 생성된 관계성 데이터를 정렬 또는 분석(빈도, 최빈값, 최대값, 최소값, 연도별 개수, 출원인, 발명자 등의 주체별 개수 등의 분석이 그 예가 된다.)한다. 상기와 같은 분석을 마치면, 상기 1) 내지 11)로 특정되는 어느 하나 이상의 문건 집합 또는 상기 추출된 문건 집합 대상별로의 상기 특허 분류 기호의 조합 (Ai, Bj)의 관점에서 파악되는 기술 융합에 관한 정보를 획득할 수 있게 된다.In addition, the above-mentioned homogeneous plural patent classification symbol preliminary processing module is characterized in that, among the patent documents constituting the document set, at least two patent classification symbols are assigned to at least one document set specified by 1) to 11) (Ai, Bj) of the patent classification symbol can be obtained only by extracting the documents having the patent classification symbol. (Ai, Bj) of the obtained patent classification symbols, and the generated relationship data is sorted or analyzed (frequency, mode, maximum value, minimum value, number of years , The applicant, the inventor, etc.). Upon completion of the analysis as described above, it is possible to determine whether any one or more document sets specified by 1) through 11) or the technical convergence grasped from the viewpoint of the combination (Ai, Bj) of the patent classification symbol for each extracted document set subject Information can be acquired.

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈이 수행하는 상기와 같은 반복 수행 방법은 본 수행의 결과로 생성되는 값을 찾기 위한 일실시예에 불과할 것이며, 당업자라면 전산 기술을 이용하여 본 발명 사상에 대한 다양한 계산 방법을 시도할 수 있을 것이다. 상기 특허 분류 기호가 2 이상 부여된 모든 특허 문건의 모든 특허 분류 기호에 대하여 상기 수행된 결과(예를 들면 각 특허 분류 기호 쌍에 대한 값(가중치 부여 또는 미부여 등을 포괄한다)을 가지는 테이블 등)는 DB로 저장되거나, 실시간으로 생성되는 뷰(view) 내지는 물리적인 뷰(materialized view)로 제공될 수 있음은 당연할 것이다.The above-mentioned iterative execution method performed by the homogeneous plural patent classification symbol preliminary processing module will be merely an embodiment for searching for a value generated as a result of the present execution, and those skilled in the art will be able to understand the present invention You can try different calculation methods. (For example, a table having a value (including weighting or non-granting) for each pair of patent classification symbols) for all patent classification symbols of all patent documents to which at least two patent classification symbols are assigned ) May be stored in a DB, or may be provided in a real-time generated view or a materialized view.

상기와 같은 과정을 실시하는 일 실시예적 과정을 도 103에서 보여주고 있다. 이하, 도 103을 기준으로 설명한다. 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 적어도 하나의 특허 문건을 포함하는 문건 집합을 입수하며(S5320), 상기 문건 집합에 포함되는 특허 문건 중에서 특허 분류 기호가 2개 이상인 복수 특허 분류 기호 포함 문건을 추출하며(S5330), 주 특허 분류 기호 및 부 특허 분류 기호의 구분 처리에 관한 기설정된 기준에 따라 복수 특허 문건 기호 문건에 포함되는 복수개의 특허 분류 기호의 조합을 생성하며(S5340), 상기 조합별로 상기 조합에 포함된 각각의 특허 분류 기호에 대하여 특허 분류 기호 체계 상의 부모 노드에 관한 정보를 입수하며(S5350), 상기 부모 노드에 대하여 최하위 공통 특허 분류 기호를 입수하며(S5360), 상기 최하위 공통 특허 분류 기호 직전까지의 자신을 포함한 각 부모 노드 특허 분류 기호의 쌍을 생성하며(S5370), 상기 각 쌍을 균등 또는 기설정된 기준을 적용한 가중치를 부여하여 저장하며(S5380), 상기 저장된 각 쌍을 대상으로 하여 기설정된 기준에 따른 통계값, 파라미터 또는 계산값을 생성한다(S5390)FIG. 103 shows an exemplary process of performing the above process. The following description will be made with reference to Fig. The similarity multiple patent classification symbol relationship preprocessing module obtains a document set including at least one patent document (S5320), and among the patent documents included in the document set, a document including a plurality of patent classification symbols having at least two patent classification symbols (S5330). In step S5330, a combination of a plurality of patent classification symbols included in the plurality of patent document documents is generated in accordance with predetermined criteria regarding classification processing of the main patent classification symbol and the patent classification symbol (S5340) Information on the parent node in the patent classification symbol system is obtained for each patent classification symbol included in the combination (S5350), the lowest common patent classification symbol is obtained for the parent node (S5360) A pair of each parent node patent classification code including itself up to the patent classification symbol is generated (S5370), and each pair Stores the weighting applied uniformly or pre-set criteria, and (S5380), and generates the statistical values, parameters or values calculated according to predetermined criteria to the destination for each pair of the stored (S5390)

이어, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈을 통해서 적어도 2 이상의 문건 집합에 대하여 복수개의 특허 분류 기호를 전처리하여 비교하는 방법에 대해서 도 104를 기준으로 설명한다. Next, a method for preprocessing and comparing a plurality of patent classification symbols with respect to at least two document sets through the homogeneous plural patent classification symbol preliminary processing module will be described with reference to FIG.

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 적어도 하나의 특허 문건을 포함하는 문건 집합을 2 이상 입수하며(S5410), 상기 각 문건 집합별로 상기 문건 집합에 포함되는 특허 문건 중에서 특허 분류 기호가 2개 이상인 복수 특허 분류 기호 포함 문건을 추출하며(S5430), 주 특허 분류 기호 및 부 특허 분류 기호의 구분 처리에 관한 기설정된 기준에 따라 복수 특허 문건 기호 문건에 포함되는 복수개의 특허 분류 기호의 조합을 생성하며(S5440), 상기 조합별로 상기 조합에 포함된 각각의 특허 분류 기호에 대하여 특허 분류 기호 체계 상의 부모 노드에 관한 정보를 입수하고(S5450), 상기 부모 노드에 대하여 최하위 공통 특허 분류 기호를 입수하며(S5460), 상기 최하위 공통 특허 분류 기호 직전까지의 자신을 포함한 각 부모 노드 특허 분류 기호의 쌍을 생성하고(S5470), 상기 각 쌍을 균등 또는 기설정된 기준을 적용한 가중치를 부여하여 저장하고(S5480), 상기 저장된 각 쌍을 대상으로 하여 기설정된 기준에 따른 통계값, 파라미터 또는 계산값을 생성하고(S5490), 상기 통계값, 파라미터 또는 계산값을 상기 문건 집합별로 비교한다(S5495).The same kind of multiple patent classification symbol relationship pre-processing module obtains at least two sets of documents including at least one patent document (S5410), and, among the patent documents included in the document set for each document set, two patent classification symbols (S5430), and generates a combination of a plurality of patent classification symbols included in the plurality of patent document documents according to predetermined criteria regarding classification processing of the main patent classification symbol and the patent classification symbol (S5440), information on the parent node in the patent classification symbol system is acquired for each patent classification symbol included in the combination for each combination (S5450), the lowest common patent classification symbol is obtained for the parent node (S5460), a pair of each parent node patent classification symbol including itself up to the lowest common patent classification symbol (S5470). Each of the pairs is weighted by applying an equal or preset reference (S5480), and a statistical value, a parameter or a calculation value according to a preset reference is generated (S5490), and compares the statistical values, parameters, or calculated values on the document set (S5495).

도 115내지 도 117에는 복수 특허 분류 기호 분석 방법에 대한 예시적인 실시예를 보여주고 있다.Figures 115 to 117 illustrate an exemplary embodiment of a plurality of patent classification symbol analysis methods.

도 115는 특허 분류 기호를 입수 받고, 상기 입수 받은 특허 분류 기호를 기준으로 최빈 융합 특허 분류 기호들에 대한 분석 정보를 제시하는 실시예적 방법이다.115 is an exemplary method for obtaining patent classification codes and presenting analytical information on the most-recent fusion patent classification symbols based on the acquired patent classification codes.

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 적어도 하나 이상의 특허 분류 기호를 입수 받고(S6520), 입수된 특허 분류 기호에 대한 최빈 융합 특허 분류 기호 정보를 입수하여(S6530), 최빈 융합 특허 분류 기호에 대하여 특허 분류 기호 체계를 반영하여 적어도 하나 이상의 단계로 디스플레이하거나, 최빈 융합 특허 분류 기호에 대응되는 특허 문건 집합에 대하여 기설정된 적어도 하나 이상의 분석을 수행한다(S6540).The same kind multiple patent classification symbol preliminary processing module receives at least one or more patent classification symbols (S6520), obtains the most-fused patent classification symbol information for the acquired patent classification symbols (S6530) At least one step is displayed in accordance with the patent classification scheme, or at least one analysis is performed on the set of patent documents corresponding to the best-fit patent classification symbol in operation S6540.

도 116는 문건 집합을 입수 받고, 상기 입수 받은 문건 집합에서 최빈 특허 분류 기호를 추출하고, 상기 추출된 특허 분류 기호를 기준으로 최빈 융합 특허 분 류 기호들에 대한 분석 정보를 제시하는 실시예적 방법이다.116 is an exemplary method for acquiring a document set, extracting a most-favored patent classification symbol from the acquired document set, and presenting analytical information on the least-fused patent classification symbols based on the extracted patent classification symbol .

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 적어도 하나 이상의 문건 집합을 입수 받고(S6620), 문건 집합을 구성하는 개별 문건별로 특허 분류 기호를 입수하고(S6630), 개별 문건별로 입수된 특허 분류 기호를 문건 집합 수준에서 최빈값을 기준으로 랭킹을 부여하고(S6640), 랭킹이 부여된 개별 특허 분류 기호별로 최빈 융합 특허 분류 기호 정보를 입수하고(S6650), 최빈 융합 특허 분류 기호에 대하여 특허 분류 기호 체계를 반영하여 적어도 하나 이상의 단계로 디스플레이하거나, 최빈 융합 특허 분류 기호에 대응되는 특허 문건 집합에 대하여 기설정된 적어도 하나 이상의 분석을 수행한다(S6660).The similarity multiple patent classification symbol relation pre-processing module obtains at least one document set (S6620), obtains a patent classification symbol for individual documents constituting the document set (S6630), obtains the patent classification symbol obtained for each document (S6640), the most-convergent patent classification symbol information is obtained for each individual patent classification symbol ranked (S6650), and the patent classification symbol system for the least-convergent patent classification symbol is obtained (S6660), or at least one analysis is performed on the set of patent documents corresponding to the least-fused patent classification symbol.

도 117는 문건 집합을 입수 받고, 상기 입수 받은 문건 집합에서 개별 문건 단위에서 추출한 특허 분류 기호를 기준으로 최빈 융합 특허 분류 기호들을 추출하고, 추출된 최빈 융합 특허 분류 기호들을 통합하는 방식으로 분석 정보를 제시하는 실시예적 방법이다.FIG. 117 is a view for explaining the method of extracting the most-fused patent classification symbols based on the patent classification code extracted from the document unit in the acquired document set, integrating the extracted most-used patent classification symbols, Which is an exemplary method.

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 적어도 하나 이상의 문건 집합을 입수 받고(S670), 문건 집합을 구성하는 개별 문건별로 특허 분류 기호를 입수하며(S670), 개별 문건별로 입수된 특허 분류 기호를 기준으로 최빈 융합 특허 분류 기호 정보를 입수하고(S670), 개별 문건별로 입수된 최빈 융합 특허 분류 기호 정보를 통합하며(S670), 통합된 최빈 융합 특허 분류 기호에 대하여 특허 분류 기호 체계를 반영하여 적어도 하나 이상의 단계로 디스플레이하거나, 통합된 최빈 융합 특허 분류 기호에 대응되는 특허 문건 집합에 대하여 기설정된 적어도 하나 이상의 분석을 수행한다(S670).In step S670, the pre-processing module acquires at least one document set (S670), acquires a patent classification symbol for each document constituting the document set (S670), obtains the patent classification symbol obtained for each document (S670), the most-convergent patent classification symbol information obtained for each document is integrated (S670), and the integrated most-favored fusion classification symbol is reflected at least (S670) at least one step or at least one analysis for the set of patent documents corresponding to the integrated best-fit patent classification code is performed.

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 본 발명 사상의 특징의 하위 특허 분류 기호 포함 사상을 적용하지 않고, 단순하게 각 문건에 포함되어 있는 특허 분류(patent classification, PC) 기호쌍(PC1, ..., PCi, PCj,..., PCn)에 대해서도 작동될 수 있다. 즉, 상기에서는 주어진 PCi에 대하여 주어진 특허 분류 기호 체계 상의 상위 특허 분류 기호를 고려하지 않고, 문건에 나타난 (PCi, PCj)만을 대상으로 특허 분류 기호 조합쌍을 구성할 수 있다. 이때, PC1이 주(main) 특허 분류 기호이므로, 상기에서 설명한 main vs. sub 간의 가중치 부여는 그대로 적용할 수 있다.The above-mentioned homogeneous multiple patent classification symbol relation pre-processing module may be applied to a patent classification (PC) symbol pair (PC1, PC2) included in each document without applying the subpopulation symbol included in the feature of the present invention. ..., PCi, PCj, ..., PCn. That is, in the above description, it is possible to construct a pair of patent classification symbols for only (PCi, PCj) shown in the document without considering the upper patent classification symbol on the given patent classification scheme for a given PCi. At this time, since PC1 is the main patent classification symbol, the main < RTI ID = 0.0 > The weighting between sub can be applied as it is.

이종 복수 특허 분류 기호 관계성 전처리 모듈Heterogeneous multiple patent classification symbol relationship pre-processing module

상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 동종의 특허 분류 기호에 대한 처리를 설명하였다. 이어서, 상기 이종 복수 특허 분류 기호 관계성 전처리 모듈에 대해서 설명한다. 상기 이종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈의 작동 방식과 완전히 유사하다. 미국에서 발행되는 특허 문건에는 하나의 특허 문건에 IPC와 USPC가 동시에 결합되어 있는 것이 거의 전부이며, 일본에서 발행되는 특허 문건에도 IPC와 FT가 동시에 부여되어 있다. 하나의 특허 문건에 IPC1,… IPCn이, USPC1,…,USPCm 부여되어 있다고 가정하자. 이 경우, 이종 복수 특허 분류 기호 관계성 처리의 관점에서는 상기 IPC1,… IPCn과 USPC1,…,USPCm사이에는 4가지 종류의 관계성의 조합이 생긴다. 먼저 IPC1과 USPC1의 관계로, 이들은 각각 main IPC와 main USPC의 쌍이 된다. 그리고, IPC1와 USPC2,…,USPCm의 쌍으로 이는 main IPC와 sub USPC와의 쌍이 된다. 그리고, IPC2,… IPCn와, USPC1의 쌍으로, 이는 sub IPC와 main USPC의 쌍이 된다. 그리고, , IPC2,… IPCn각각과 USPC2,…,USPCm각각에 대한 쌍으로, 이는 sub IPC와 sub USPC의 쌍이 된다.The homogeneous multiple patent classification related pre-processing module described the processing of the same kind of patent classification code. Next, the heterogeneous plural patent classification symbol relation pre-processing module will be described. The heterogeneous plural patent classification related relationship pre-processing module is completely similar to that of the homogeneous plural patent classification related pre-processing module. In the United States, patent documents contain almost all IPC and USPC combinations in one patent document, and both IPC and FT are granted to patent documents issued in Japan. One patent document on IPC1, ... IPCn, USPC1, ... , And USPCm. In this case, in view of the heterogeneous plural patent classification relation processing, the IPC1, ... IPCn and USPC1, ... , There are four kinds of relationship combinations between USPCm. First, in relation to IPC1 and USPC1, they are a pair of main IPC and main USPC, respectively. And, IPC1 and USPC2, ... , A pair of USPCm, which is a pair of a main IPC and a sub USPC. Then, IPC2, ... A pair of IPCn and USPC1, which is a pair of sub IPC and main USPC. Then, IPC2, ... Each IPCn and USPC2, ... , And USPCm, which is a pair of sub IPC and sub USPC.

상기 이종 복수 특허 분류 기호 관계성 전처리 모듈은 각 4가지 종류별로, 각 종류 하에 있는 이종 복수 특허 분류 기호들 간에서 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈이 수행한 바와 동일한 처리를 할 수 있다. 이때, 이종 특허 분류 기호를 대상으로 하므로, 상기 최소 공통 특허 분류 기호는 없기 때문에 기설정된 값으로 한정할 수 있다. 즉, IPC의 경우에는 subclass에서, USPC에서는 class까지만을 대상으로 하여, 주어진 IPC와 주어진 USPC 쌍에 대하여 상위 특허 분류 기호를 한정 지을 수 있다. 즉, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 주어진 특허 분류 기호 조합(Ai, Bj)(단, Ai, Bj는 동일한 체계의 특허 분류 기호이다.)에 대하여 상기와 같은 관계성 테이블을 생성한 바와 같이, 상기 이종 복수 특허 분류 기호 관계성 전처리 모듈은 특허 분류 기호 조합(Ai, Bj) (단, Ai, Bj는 다른 체계의 특허 분류 기호이다.)에 대하여 상기와 같은 관계성 테이블을 생성할 수 있다. 상기 생성된 관계성 테이블에 대하여 가중치 또는 기여도 가중치를 부여할 수 있을 것이다. 또한, 각 이종 특허 분류 기호 조합(Ai, Bj) (단, Ai, Bj는 다른 체계의 특허 분류 기호이다.)에 대하여, 특허 문건 번호를 대응시킬 수 있음과, 상기 특허 문건 번호로부터 입수되는 서지사항을 반영한 카운팅 및 계산을 수행할 수 있음도 자명할 것이다. 아울러, 특정한 문건 집합에 대하여, 이종 특허 분류 기호 조합(Ai, Bj)를 획득할 수 있을 것이며, 이들 조합에 대한 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈과 동등한 분석을 수행할 수 있을 것이다.The above-mentioned heterogeneous plural patent classification symbol relationship pre-processing module can perform the same processing as that performed by the homogeneous plural patent classification symbol relationship pre-processing module among different heterogeneous patent classification symbols under each type for each four kinds. At this time, because the heterogeneous patent classification symbol is targeted, since there is no minimum common patent classification symbol, it can be limited to a predetermined value. That is, the upper patent classification code can be defined for a given IPC and a given USPC pair, targeting only the subclass in the case of IPC and the class in the USPC. That is, the above-mentioned homogeneous plural patent classification symbol relation preprocessing module generates the above relationship table for a given patent classification symbol combination (Ai, Bj) (where Ai and Bj are patent classification codes of the same system) As described above, the heterogeneous plural patent classification related relationship preprocessing module generates the above relationship table for the patent classification symbol combination (Ai, Bj) (where Ai and Bj are patent classification codes of different systems) . A weight or a contribution weight may be given to the generated relationship table. In addition, it is possible to correspond the patent document number to each of the heterogeneous patent classification symbol combinations (Ai, Bj) (where Ai and Bj are patent classification codes of different systems), and the bibliography It will also be appreciated that counting and calculation may be performed to reflect the matter. In addition, for a particular set of documents, a heterogeneous patent classification combination (Ai, Bj) may be obtained and an equivalent analysis of the homogeneous multiple patent classification related preprocessing module for these combinations may be performed.

특허 분류 기호별 통계 전처리 모듈Statistical preprocessing module by patent classification symbol

특허 분류 기호의 다단계성Multistage of patent classification codes

본 발명의 특허 분류 기호별 통계 전처리 모듈은 적어도 하나 이상의 특허 분류 기호 체계 상의 특허 분류 기호에 대하여, 적어도 하나 이상의 국가별 특허 DB를 대상으로 기설정된 통계값을 찾아내는 전처리를 수행한다. The statistical preprocessing module for each patent classification symbol according to the present invention performs preprocessing for finding a predetermined statistical value for at least one patent database of at least one country with respect to a patent classification symbol on at least one patent classification system.

도 61은 상기 특허 분류 기호별 통계 전처리 모듈의 구조에 대해서 설명하고 있다. 상기 특허 분류 기호별 통계 전처리 모듈은 특허 분류 기호별로 통계값을 전처리하는 특허 분류 기호별 통계 전처리 엔진(3210)과 상기 전처리된 특허 분류 기호별 통계값을 저장하고 있는 뷰(view), 테이블(table) 또는 정보 조직체로서의 전처리된 특허 분류 기호별 통계 DB(3230)를 포함하고 있다.FIG. 61 illustrates the structure of the statistical preprocessing module for each patent classification symbol. The statistical preprocessing module for each patent classification symbol includes a statistical preprocessing engine 3210 for each patent classification symbol for preprocessing statistical values for each patent classification symbol, a view storing statistical values for each preprocessed patent classification symbol, ) Or a preprocessed patent classification symbol-specific statistical DB 3230 as an information organization.

상기 특허 분류 기호별 통계 전처리 모듈에서 취급하는 특허 분류 기호는 IPC, USPC, FT, FI, ECLA 중 어느 하나 이상이다. 상기 특허 분류 기호별 통계 전처리 모듈에서 국가별 특허 DB는 대한민국, 미국, 일본, 유럽(EPO)가 발행하는 특허 DB를 포함하며, 그 이외의 국가의 특허청이 발행하는 특허 DB도 포함될 수 있음은 자명하다. 각 국가는 IPC를 공통으로 하고, 자국별로 특별한 분류 체계(미국 특허청의 경우 USPC 또는 UPC, 일본 특허청의 경우 FT, FI, 유럽 특허청의 경우 ECLA 등)를 가지고 있는 경우도 있다. 2이상의 특허 분류 체계를 가지고 있는 국가의 특 허 DB를 대상으로 한 기설정된 통계값을 찾아내는 것은 상기 2이상의 분류 체계별로 독립적으로 수행되는 것이 바람직하다. 즉, 미국의 경우라면, IPC 와 USPC에 대해서 각각 전처리를 할 필요가 있게 된다. 또한, 각 특허 분류 기호 체계에는 각 체계 마다의 레벨이 있다. IPC의 경우에는 section, subsection, class, subclass, group, subgroup이 있으며, subgroup이하의 세부 분류에는 그 특허 분류 기호에 대응되는 타이틀 정보에 도트(dot)의 수로써 계층 구조를 표현하고 있다. 상기 계층 구조에 대한 예시는 특허 분류 기호 전처리 모듈(301-3-1 또는 3500)에 상세히 설명되어 있다. 그러므로, 본 발명에서의 특허 분류 기호의 레벨은 IPC section, class, subclass, main group, 1dot subgroup, 2 dot subgroup, ...n dot subgroup으로 레벨별로 구분될 수 있다. 상기 레벨은 상기 main group까지는 분류 기호 체계에서 상하위 계층 구조를 확인할 수 있으나, subgroup부터는 타이틀에 포함된 dot 정보를 활용해야만 상하위 계층 구조를 확인할 수 있다. 상기와 같은 dot의 수로써 계층 구조를 확인하는 것은 USPC의 경우 subclass부터이며, FT의 경우 전체 FT 기호에서 전7자리("테마 코드 + 영문자 2자"까지)이후부터이다.The patent classification symbol handled by the statistical preprocessing module according to the patent classification symbol is at least one of IPC, USPC, FT, FI, and ECLA. In the statistical preprocessing module for each patent classification symbol, the patent database of each country includes the patent database issued by Korea, USA, Japan, Europe (EPO), and the patent database issued by the patent office of other countries Do. Each country has a common IPC and may have a special classification system (USPC or UPC for US Patent Office, FT, FI for European Patent Office, ECLA for European Patent Office) in each country. It is desirable that the pre-set statistical value for a patent database of a country having two or more patent classification systems is independently performed for each of the above two or more classification systems. That is, in the case of the United States, it is necessary to preprocess IPC and USPC respectively. In addition, each patent classification system has a level for each system. In the case of IPC, there are section, subsection, class, subclass, group, and subgroup. In subcategories below, the hierarchical structure is expressed by the number of dots in the title information corresponding to the patent classification symbol. An example of the hierarchical structure is described in detail in the patent classification pre-processing module 301-3-1 or 3500. FIG. Therefore, the level of the patent classification symbol in the present invention can be divided into IPC section, class, subclass, main group, 1 dot subgroup, 2 dot subgroup, ... n dot subgroup. The level can identify the upper and lower hierarchical structures in the classification symbol system up to the main group, but from the subgroup, the dot information contained in the title must be utilized to confirm the hierarchical structure of the upper and lower levels. It is from the subclass in the case of USPC to check the hierarchical structure as the number of dots as above, and from FT in the case of FT after all 7 digits (from "theme code + two letters of English alphabet") to the whole FT symbol.

상기 특허 분류 기호별 통계 전처리 모듈이 특허 분류 기호별로 통계값을 전처리하는 방법의 일례는 도 100에 나와있다. 상기 특허 분류 기호별 통계 전처리 모듈은 제1국에서의 특정 특허 분류 기호 체계에 대하여, 주어지는 특허 분류 기호의 하위 분류 기호가 자동으로 포함될 수 있도록 특허 분류 기호의 트리 구조를 이용하여 특허 분류 기호를 재조정하여 DB에 저장하며(S5020), 특허 분류 기호를 포함하는 검색식 또는 검색 쿼리를 입수하며(S5030), 검색식 또는 검색 쿼리에 포함 된 주어진 특허 분류 기호에 대하여 재조정된 특허 분류 기호로 변환하며(S5040), 재조정된 특허 분류 기호를 포함하는 검색식 또는 검색 쿼리를 대상으로 검색 엔진 또는 DBMS(201)로 검색을 수행하며(S5050), 상기 검색 결과로서 특허 문건 정보를 입수하여 계량화 하며(S5060), 상기 검색 결과에 대하여 기설정된 시간 단위 또는 통합된 시간 단위로, 출원인, 발명자, 또는 대리인 등 주체에 대하여 기설정된 통계값, 파라미터 또는 계산값을 기준으로 랭킹 정보를 계산하는 기능을 수행한다.An example of a method of preprocessing statistical values for each patent classification symbol by the statistical preprocessing module for each patent classification symbol is shown in FIG. The statistical preprocessing module for each patent classification symbol may be used to readjust the patent classification symbol by using the tree structure of the patent classification symbol so that the sub classification code of the given patent classification symbol is automatically included in the specific patent classification system in the first country (S5020). Then, the retrieval expression or the retrieval query including the patent classification symbol is obtained (S5030), and the retrieved expression or the retrieval query is converted into the re-adjusted patent classification symbol for the given patent classification symbol included in the retrieval expression or the retrieval query S5040), the search engine or the DBMS 201 searches for a retrieval expression or a retrieval query including the re-adjusted patent classification symbol (S5050), obtains and quantifies the patent document information as the retrieval result (S5060) , An applicant, an inventor, or a representative of the applicant, on a predetermined time unit or an integrated time unit with respect to the search result The statistical value, and performs a function of calculating the ranking information based on the parameters or calculations.

또한 상기 특허 분류 기호별 통계 전처리 모듈은 주어진 특허 분류 기호에 대하여 하위 특허 분류 기호를 자동으로 포함시켜 통계값, 파라미터 또는 계산값을 생성할 수도 있으며, 이에 대한 일실시예적 방법이 도 101에 나타나 있다.In addition, the statistical preprocessing module for each patent classification symbol may automatically generate a statistical value, a parameter or a calculated value by automatically including a sub-patent classification symbol for a given patent classification symbol, and an exemplary method thereof is shown in FIG. 101 .

상기 특허 분류 기호별 통계 전처리 모듈은 특허 분류 기호가 포함된 검색식을 입수하고(S5120), 상기 검색식에 포함된 특허 분류 기호에 대하여 하위 특허 분류 기호에 관한 특허 정보를 포함하여 관계되는 특허 문건 정보를 입수하며(S5130), 상기 특허 문건에 포함된 인용 정보 및 피인용 정보를 기설정된 통계값, 파라미터 내지 계산값를 처리한다(S5140).The statistical preprocessing module for each patent classification symbol obtains a search formula including a patent classification symbol (S5120). The patent classification information includes patent information related to the sub-patent classification symbol for the patent classification symbol included in the search formula, (S5130), and processes the statistical values, parameters, and calculation values of the citation information and the cited information included in the patent document (S5140).

아울러, 상기 특허 분류 기호별 통계 전처리 모듈은 특정한 문건 집합의 문건 부분 집합에 대하여 인용 또는 피인용 정보를 하위 특허 분류 기호를 포함하는 관점에서 특허 분류 기호별 통계값, 파라미터값, 또는 계산값을 생성할 수도 있으며, 이에 대한 일실시예적 방법이 도 102에 나타나 있다.In addition, the statistical preprocessing module for each patent classification symbol generates a statistical value, a parameter value, or a calculation value according to a patent classification symbol from the viewpoint of including a sub-patent classification symbol for a citation or a cited information for a document subset of a specific document set An exemplary method for this is shown in FIG.

상기 특허 분류 기호별 통계 전처리 모듈은 적어도 하나의 문건 집합을 입수하고(S5220), 상기 문건 집합에 포함된 모든 문건에 대하여 출원인, 발명자, 대리 인, 적어도 하나 이상의 특허 기술 분류 기호를 추출하며(S5230), 상기 문건에 대하여 상기 추출된 기준 중 어느 하나 이상을 적용하여 문건 부분 집합을 생성하고(S5240), 상시 문건 부분 집합을 구성하는 모든 문건에 대하여, 상기 문건이 인용하는 선출원 문건 정보를 입수하거나(S5250), 상기 문건 부분 집합을 구성하는 모든 문건에 대하여, 상기 문건을 인용하는 후출원 문건 정보를 입수하며(S5260), 상기 입수된 선출원 문건 및/또는 상기 입수된 후출원 문건에 대하여 기설정된 인용, 피인용 관련 통계값, 파라미터 또는 계산값을 생성할 수 있다(S5270)The statistical preprocessing module for each patent classification symbol obtains at least one document set (S5220), extracts at least one patent technical classification symbol from the applicant, the inventor, and the proxy for all documents included in the document set (S5230 ), A document subset is generated by applying one or more of the extracted criteria to the document (S5240), and for all the documents constituting the regular document subset, the first-party document information cited by the document is obtained (S5250). Then, for all the documents constituting the document subset, the application document information after citing the document is obtained (S5260). Then, the obtained document and / or the document Quot ;, a related statistic value for a citation, a parameter or a calculation value (S5270)

이하, 본 발명의 특허 분류 기호별 통계 전처리 모듈을 특허 분류 기호별 통계 전처리 엔진의 관점에서 더욱 상세하게 설명한다. Hereinafter, the statistical preprocessing module according to the patent classification symbol of the present invention will be described in more detail from the viewpoint of the statistical preprocessing engine for each patent classification symbol.

옵션 사항Options

상기 특허 분류 기호별 통계 전처리 모듈이 상기 기설정된 통계값을 처리할 때는 다음과 같은 옵션이 고려되며, 각 옵션별 교차 선택이 가능할 것이다.When the statistical preprocessing module according to the patent classification symbol processes the preset statistical value, the following options are considered, and each option can be cross-selected.

첫째, 먼저 각 특허 분류 기호가 주어졌을 때, 그 특허 분류 기호의 하위 분류 기호에 관한 정보를 자동적으로 포함할 것인가의 문제이다. 이는 특히 타이틀 정보에 나온 dot수로부터 계층 구조를 확인해야 하는 레벨일 경우 더욱 그러하다. 예를 들면, H, H04, H04B, H04B 7/00레벨까지는 확장자를 사용하여 용이하게 주어진 레벨의 IPC하부에 있는 특허 분류 기호를 포함하는 특허 문건을 찾아 낼 수 있으나, subgroup레벨로 가면 본 발명의 하위 분류 포함 검색 방법을 활용해야만 그 하위 분류에 속하는 특허 분류 기호를 포함하는 특허 문건을 찾아 낼 수 있다. 예를 들면, H04B 7/15라는 IPC가 있을 때 하위 분류 기호를 포함하지 않고 통계값을 생성할 때는 각 국가별 특허 문건 중에서 H04B 7/15이 포함되어 있는 특허 문건을 입수하면 된다. 하지만, 하위 분류 기호 포함하여 통계값을 생성할 때는 H04B 7/15와, IPC 계층 구조상 그 하위에 있는 H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216가 포함된 특허 문건들도 입수해야 한다. 이는 USPC, FT 등에서도 마찬가지이며, 공통된 설명은 생략한다. First, it is a matter of whether information on the subclassification symbol of the patent classification symbol is automatically included when each patent classification symbol is given. This is especially true if you need to check the hierarchy from the number of dots in the title information. For example, using the extension to H, H04, H04B, H04B 7/00 levels, it is easy to find patent documents containing patent classification symbols at the bottom of a given level of IPC, but at the subgroup level, Only the sub-classification search method can be used to find patent documents containing patent classification codes belonging to the sub-category. For example, if you have an IPC of H04B 7/15 and you want to generate statistical values without subcategorizations, you can obtain a patent document containing H04B 7/15 from each country's patent documents. However, when generating statistical values, including subclassifiers, H04B 7/15 and H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19 under the IPC hierarchy , H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216. This also applies to the USPC, FT, and the like, and a common description will be omitted.

하위 분류 기호를 자동으로 포함하여 통계값을 처리할 때에는 본 발명의 범위 검색(검색 엔진 이용 또는 DBMS(201) 이용 시 모두 해당)의 방법을 활용할 수도 있을 것이다. 한편, 본 목적을 달성하기 위해서 범위 검색이 아니더라도, 주어진 특허 분류 기호를 처리할 때, 그 하위에 있는 모든 특허 분류 기호를 저장해 놓고, 그 특허 분류 기호에 대한 통계값을 처리할 때, 그 특허 분류 기호에 대해서만 통계처리를 하는 것이 아니라, 하위에 있는 모든 특허 분류 기호를 대상으로 통계값을 처리할 수도 있을 것이다. 위의 H04B 7/15를 통해서 예를 들면, H04B 7/15의 하부에 H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216가 있음을 특허 분류 기호 계층 구조 정보에서 입수한 다음, H04B 7/15의 하부에 H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216가 있음을 저장한 다음, H04B 7/15에 대한 기설정된 통계값을 구하라는 명령이 입수될 때, 자동적으로 하부에 존재하는 H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216를 포함하여 기설정된 통계값을 구하는 방식을 채용할 수도 있다는 것이다. 이는 USPC, FT 등의 다른 특허 분류 기호 체계에서도 마찬가지이다.When processing the statistical value including the sub classifier automatically, the range search of the present invention (both using the search engine or using the DBMS 201) may be utilized. In order to achieve this object, when processing a given patent classification symbol, not all of the range search, all the patent classification symbols below the patent classification symbol are stored, and when processing the statistical value for the patent classification symbol, It is possible to process statistical values for all patented classifiers in the subordinate, rather than only statistical processing for symbols. H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B at the bottom of H04B 7/15, for example, 7/204, H04B 7/208, H04B 7/212 and H04B 7/216 from the Patent Classification Hierarchy Information, then H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216, H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/195, H04B 7/195, H04B 7/195, H04B 7/195, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, and H04B 7/216 may be adopted. This is true of other patent classification schemes such as USPC and FT.

주어진 레벨의 특허 분류 기호가 있을 때, 단순히 그 특허 분류 기호를 포함하는 문건을 대상으로 상기 기설정된 통계값을 생성하는 것보다 상기 특허 분류 기호의 하위 특허 분류 기호들을 자동으로 포함하여 기설정된 통계값을 생성하는 것이 바람직하다.When there is a patent classification code of a given level, it is preferable to automatically include the sub-patent classification codes of the patent classification symbol rather than generating the predetermined statistical value for a document containing the patent classification code, . &Lt; / RTI >

둘째, 복수개의 특허 분류 기호가 존재할 경우, 이를 어떻게 처리할 것인가의 문제이다. 이 경우, 1) 주(main) 특허 분류 기호(통상 제일 첫번째 나오는 것이 주 특허 분류 기호이다.)만을 통계 처리의 대상으로 하는 방법, 2) 주 특허 분류 기호와 종(sub) 특허 분류 기호에 대해 균등한 가중치를 두면서 처리하는 방법, 3) 주 특허 분류 기호와 종 특허 분류 기호에 대해 다른 가중치를 주는 방식으로(예를 들면, 특허 분류 기호가 n+1개가 있을 때, 주 특허 분류 기호 50%, 나머지 종 특허 분류 기호는 50%*1/n으로 가중치를 주는 방법) 처리할 수 있을 것이다. 상기 1) 내지 3) 중에서 1)과 3)이 바람직하며, 그 중에서 3)이 더욱 바람직할 것이다.Second, if there are multiple patent classification codes, how to handle them is a problem. In this case, 1) the main patent classification code (usually the first one is the main patent classification code) is used as the statistical processing target, 2) the main patent classification code and the sub patent classification code (Eg, when there are n + 1 patent classification codes, the main patent classification code is 50% or more) , And the rest of the patent classification code is weighted by 50% * 1 / n). Of the above 1) to 3), 1) and 3) are preferable, and 3) among them is more preferable.

셋째, 복수개의 출원인과 발명자가 있을 때의 처리 방법이다. 이 경우, 1) 매출원인 또는 발명자마다 100%의 가중치를 주는 방법과, 2) 출원인 또는 발명자수에 대해 균분하는 방법(n명의 발명자가 있을 때 각 발명자마다 100%*1/n로 처리함)이 있을 것이다. 출원인의 경우에는 실시 독립의 관점에서 출원인마다 100%의 가중치를 주는 것이, 발명자는 1/n으로 처리하는 것이 바람직할 것이다. 대리인의 경우에는 위의 출원인이나 발명자의 처리 방법을 채용할 수 있을 것이며, 대리인마다 100%로 하는 것이 바람직할 것이다.Third, it is a processing method when there are a plurality of applicants and inventors. In this case, 1) a method of giving a weight of 100% to each person who made a sale or an inventor, 2) a method of balancing the number of applicants or inventors (100% * 1 / n for each inventor when there are n inventors) . In the case of the applicant, it would be desirable for the inventor to treat it as 1 / n to give a weight of 100% for each applicant in terms of implementation independence. In the case of an agent, the processing method of the applicant or the inventor as described above may be adopted, and it is preferable that the agent is 100%.

상기 특허 분류 기호별 통계 전처리 모듈은 주어진 특허 분류 기호에 대하여 상기 특허 분류 기호 체계 상에서 자신의 직하위 특허분류 기호의 개수를 카운트해 놓을 수 있으며, 자신의 모든 하위 특허 분류 기호의 개수를 카운트해 놓을 수 있을 것이다.The statistical preprocessing module for each patent classification symbol may count the number of its own sub-classifications on the patent classification system for a given patent classification symbol and count the number of all its sub-patent classification symbols It will be possible.

상기 특허 분류 기호별 통계 전처리 모듈은 주어진 특정 특허 분류 기호에 대하여 지정된 국가별 특허 DB를 대상으로 하여, 기설정된 기간별로, 출원 기준 및/또는 등록 기준으로 하기와 같은 정보를 계산해 놓을 수 있다. 상기와 같은 계산이 되는 이유는 상기 특허 분류 기호, 지정 국가, 설정 시간, 출원 또는 등록 여부 등의 조건을 만족하는 특허 문건 집합이 확정되기 때문이다. 상기 확정된 특허 문건 집합에 관한 카운팅으로 하기와 같은 각종 파라미터를 계산할 수 있게 된다.The statistical preprocessing module for each patent classification symbol may calculate the following information on the basis of the application standard and / or registration standard for each predetermined period of time, with respect to the specified patent DB for the given patent classification symbol. The reason for this calculation is that a set of patent documents satisfying the conditions such as the patent classification symbol, the designated country, the set time, the application or the registration is determined. By counting the determined set of patent documents, various parameters such as the following can be calculated.

예를 들면, 출원 기준으로는 연도별 또는 분기별 출원수, 출원수증가율, 출원수증가속도, 출원수증가가속도를 계산해 놓을 수 있을 것이다. 연도별 출원수는 특정 연도에 출원된 특허 문건의 전체 개수를 구하는 방식으로 계산될 수 있을 것이다. 증가율의 계산은 {(현기간 개수-과거기간개수)/과거기간 개수}*100%가 되며, 증가속도는 계산은 {(현기간 개수-과거기간개수)/시간간격}이 되며, 증가가속도는 증가속도의 시간간격에 대한 변화율로 계산가능할 것이다. For example, the number of applications, the number of applications, the rate of increase in the number of applications, and the acceleration of the number of applications can be calculated by the year or quarter. The number of applications per year may be calculated in such a manner that the total number of patent documents filed in a particular year is obtained. The calculation of the growth rate is {(current period number - past period number) / past period number} * 100%, and the increase rate is {(current period number - past period number) / time interval} The rate of change with respect to the time interval of the increase rate.

또한, 연도별 출원자 수를 구할 수 있을 것이며(이때, 공동 출원일 경우, 상기 옵션 사항을 적용할 수 있을 것이다.), 출원자 수가 계산되면 출원자수증가율을 계산해 놓을 수 있으며, 나아가 1개 출원당 평균출원자수 및 평균출원자수증가율도 계산할 수 있을 것이다. 그리고, 연도별 발명자수, 발명자수증가율, 평균발명자수, 평균발명자수증가율도 상기의 연도별 출원자 수에 준하여 계산할 수 있을 것이다.In addition, the number of applicants per year may be obtained (in this case, in the case of a joint application, the above options may be applied), the number of applicants can be calculated when the number of applicants is calculated, The rate of increase in embroidery and average number of applications will also be calculated. The number of inventors, the number of inventors, the average number of inventors, and the average number of inventors can be calculated based on the number of applicants per year by the year.

그리고, 개별 출원마다 출원 청구항 수를 계산할 수 있으므로, 연도별 출원청구항수, 출원청구항수증가율, 1개 출원당 평균출원청구항수, 평균출원청구항수증가율을 계산할 수 있을 것이다. 이때, 출원 청구항을 독립항과 종속항으로 구분지을 경우, 독립항 수와 종속항 수를 계산할 수 있으므로, 이들에 대한 증가율 등을 계산할 수 있을 것이다.In addition, since the number of patent claims can be calculated for each individual application, the number of claims for patent application, the number of claims for patent application, the average number of patent applications per application, and the average patent application claim increase rate can be calculated. In this case, if the claim is divided into independent and dependent clauses, the number of independent clauses and the number of dependent clauses can be calculated.

또한, 개별 출원마다 특허 분류 기호의 개수를 계산할 수 있을 것이므로, 연도별 특허분류기호수, 특허분류기호수증가율을 계산할 수 있을 것이며, 1개 출원당 평균특허분류기호수, 평균특허분류기호수증가율도 계산할 수 있을 것이다. 또한, 미국이나 일본 등과 같이 2개 이상의 특허 분류 기호가 사용되는 특허 DB의 경우에는 각 특허 분류 기호마다 이를 계산해 놓을 수 있을 것이다. 개별 출원마다 그 출원에 관한 패밀리 정보가 있을 때, 상기 패밀리 정보를 기준으로 연도별 패밀리수, 매밀리수증가율 및 1개 출원당 평균패밀리국가수, 평균패밀리국가수증가율도 계산할 수 있을 것이다.In addition, since the number of patent classification symbols can be calculated for each individual application, it will be possible to calculate the rate of increase of the Patent Classifier Lake and Patent Classifier Lake by year, and the average Patent Classifier Lake and the average Patent Classifier Lake Growth Rate per application can be calculated will be. In the case of a patent database in which two or more patent classification codes are used, such as the United States or Japan, it may be calculated for each patent classification code. When there is family information about the application for each individual application, the number of families by year, the number of milliseconds to increase, the average number of family countries per application, and the average family number increase rate can also be calculated based on the family information.

상기는 출원 기준으로 계산된 것이며, 이와 동일한 파라미터들을 등록 기준으로 계산할 수도 있을 것이다. 등록 기준으로 계산할 수 있는 파라미터들은 등록수, 등록수증가율, 등록수증가속도, 등록수증가가속도, 등록자수, 등록자수증가율, 평균등록자수, 평균등록자수증가율, 발명자수, 발명자수증가율, 평균발명자수, 평균발명자수증가율, 등록청구항수, 등록청구항수증가율, 평균등록청구항수, 평균등 록청구항수증가율, 특허분류기호수, 특허분류기호수증가율, 평균특허분류기호수, 평균특허분류기호수증가율, 패밀리수, 매밀리수증가율, 평균패밀리국가수, 평균패밀리국가수증가율 등이 있을 수 있다.The above is calculated based on the application standard, and the same parameters may be calculated on the basis of registration. The parameters that can be calculated based on the registration criteria include the number of registrations, the number of registrations, the number of registrations, the number of registrants, the number of registrants, the number of registrants, the average number of registrants, the average number of registrants, Patent Classifier Lake Growth Rate, Average Patent Classifier Lake, Average Patent Classifier Lake Growth Rate, Number of Families, Number of Mills, Number of Claims, Number of Registered Claims, Growth rate, average number of family countries, and the average number of family countries.

상기와 같은 파라미터들은 예시적인 것이며, 그 이외에도 출원 또는 등록 특허 정보에 포함된 각종 수치 정보 및 본 발명의 사전 카운팅된 값들을 재료로 하여 다양한 파라미터 값도 계산될 수 있을 것이다. 상기 파라미터 값들은 본질적으로 카운트값, 카운트값의 변화율과 변화값, 상기 카운트값의 통계량(평균, 표준편차), 2 이상의 카운트값에 대한 함수값(예를 들면 등록율 등이 그 예가 된다.)The above parameters are exemplary, and various parameter values may also be calculated using various numerical information included in the application or registered patent information and the pre-counted values of the present invention. The parameter values are essentially a count value, a change rate and a change value of the count value, a statistical amount (average, standard deviation) of the count value, and a function value (e.g., a registration rate) for two or more count values.

수량 기준 주체의 발견Discovery of quantity subject

상기 특허 분류 기호별 통계 전처리 모듈은 주어진 특정 특허 분류 기호에 대하여 지정된 국가별 특허 DB를 대상으로 하여, 기설정된 기간별로, 출원 기준 및/또는 등록 기준으로 하기와 같은 정보를 계산해 놓을 수 있다. 상기와 같은 계산이 되는 이유는 상기 특허 분류 기호, 지정 국가, 설정 시간, 출원 또는 등록 여부 등의 조건을 만족하는 특허 문건 집합이 특정되기 때문이다. 상기 특정된 특허 문건 집합에 관한 카운팅으로 하기와 같은 각종 파라미터를 계산할 수 있게 된다.The statistical preprocessing module for each patent classification symbol may calculate the following information on the basis of the application standard and / or registration standard for each predetermined period of time, with respect to the specified patent DB for the given patent classification symbol. The reason for the above calculation is that a set of patent documents satisfying the conditions such as the patent classification symbol, the designated country, the set time, the filing or the registration is specified. The counting of the specified set of patent documents makes it possible to calculate various parameters as follows.

예를 들면, 상기 특정된 특허 문건 집합에 대한 카운팅으로 연도별 또는 연도 통합 최다 출원인, 발명자, 대리인을 추출하여 각 출원인, 발명자, 또는 대리인별로 랭킹을 매겨 놓을 수 있을 것이다.For example, by counting the set of specified patent documents, the inventor or agent who is the most applicant of the year or the year integrated can be extracted and ranked by each applicant, inventor, or agent.

또한, 위에서 추출한 출원인들, 발명자들, 또는 대리인별로 집중도, 활동도(Activity Index)등을 계산해 놓을 수 있으며, 상기 계산의 결과로 집중도를 기 준으로 한 출원인, 발명자 또는 대리인의 랭킹을 구할 수 있을 것이며, 활동도를 기준으로 한 출원인, 발명자, 또는 대리인의 랭킹을 구할 수 있을 것이다. 참고로, 출원수 기준 집중도란 {(특정 기간 동안의 특정 주체의 특정 특허 분류 기호에 대한 모든 출원/ 특정 기간 동안의 특정 주체의 모든 출원)*100%}로 구할 수 있을 것이다. 또한, 활동도는 [{(특정 기간 동안의 특정 주체의 특정 특허 분류 기호에 대한 모든 출원/ 특정 기간 동안의 특정 주체의 모든 출원)}/{(특정 기간 동안의 특정 특허 분류 기호에 대한 모든 출원/ 특정 기간 동안의 모든 출원)}*100%]으로 계산할 수 있다. 이때, 검색 엔진 또는 상기 DBMS(201)를 통하여 1) 특정 기간 동안의 특정 주체의 특정 특허 분류 기호에 대한 모든 출원, 2) 특정 기간 동안의 특정 주체의 모든 출원, 3) 특정 기간 동안의 특정 특허 분류 기호에 대한 모든 출원 및 4) 특정 기간 동안의 모든 출원 각각의 개수를 입수할 수 있기 때문에, 위의 집중도나 활동도를 계산할 수 있게 된다. 한편, 상기 집중도나 활동도는 특허 지표의 일 예에 불과하며, 상기 검색 엔진 또는 상기 DBMS(201)로 직접 또는 사전 계산을 통하여 간접적으로 입수할 수 있는 수치값들을 입력값으로 하는 정의된 함수가 있는 경우, 그 함수의 값을 계산할 수 있을 것이며, 그 계산된 함수값을 통하여 출원인, 발명자, 또는 대리인의 그 함수값을 기준으로 한 랭킹을 계산할 수 있을 것이다. 본 발명의 상기 함수에는 특허 분석 지표로 활용되는 정의될 수 있는 종래의 모든 특허 분석 지표 중 특허 DB를 기준으로 검색 엔진 또는 상기 DBMS(201)가 입수할 수 있는 값으로 정의되는 모든 특허 분석 지표가 해당될 수 있다. In addition, it is possible to calculate the concentration, activity index, etc. for the applicants, inventors, or agents extracted from the above, and the ranking of the applicant, inventor, or agent based on the degree of concentration as a result of the calculation , And the ranking of the applicant, inventor, or agent based on the activity level may be obtained. For reference, the concentration based on the number of applications will be calculated as {(100%)} (all applications / specific applications for a particular subject for a particular time period). In addition, the activity level is defined as [({(all applications for a specific subject's specific patent classification for a particular time period / all applications for a particular subject during a specified period)} / { / All applications for a specific period)} * 100%]. 1) All applications for specific patent classification codes of a specific subject during a specific period, 2) All applications of a specific subject during a specific period, 3) All of the applications for the classifier, and 4) the number of each of the applications for a particular period of time, so that the above concentration or activity can be calculated. Meanwhile, the concentration or activity is only an example of a patent index, and a defined function that takes as input values numerical values indirectly obtainable directly or through pre-calculation to the search engine or the DBMS 201 If so, it will be able to calculate the value of the function and calculate the ranking based on the function value of the applicant, inventor, or agent through the calculated function value. In the function of the present invention, all patent analysis indicators defined as values obtainable by the search engine or the DBMS 201 on the basis of the patent DB among all the conventional patent analysis indicators that can be defined as the patent analysis index .

아울러, 상기에서는 출원수를 기준으로 해서 출원인, 발명자 또는 대리인별 랭킹을 계산했으나, 출원수 증가율, 출원수 증가속도 등과 같은 다른 파라미터들을 기준으로 상기 특허 지표 내지는 함수에 대해서 출원인, 발명자 또는 대리인별 랭킹을 계산해 놓을 수 있을 것이다. 상기 파라미터로는 출원청구항수, 출원청구항수증가율, 1개 출원당 평균출원청구항수, 1개 출원당 평균출원청구항수증가율, 특허분류기호수, 특허분류기호수증가율, 1개 출원당 평균특허분류기호수, 1개 출원당 평균특허분류기호수증가율, 패밀리수, 매밀리수증가율, 1개 출원당 평균패밀리국가수, 1개 출원당 평균패밀리국가수증가율 등이 있을 수 있을 것이다.In addition, in the above description, the rankings of applicants, inventors, or agents are calculated based on the number of applications, but the rankings of applicants, inventors, or agents are determined for the patent indexes or functions based on other parameters such as the number of applications, . The parameters include the number of claims to be filed, the number of claims to be filed, the average number of claims per patent application, the average number of patent claims per patent application, the patent classifier lakes, the patent classifier lake growth rate, There may be an average patented classifier rate per filing, the number of families, the number of millions, the average number of family countries per application, and the average number of family members per application.

상기 각종 특허 지표 내지는 함수는 출원 기준으로 설명하였으나, 등록을 기준으로도 정의될 수 있고, 계산될 수 있음은 자명할 것이다. 상기 출원 또는 등록의 한정은 특허 문건 집합의 특정 내지 확정의 문제이기 때문이다. 그러므로, 출원 중에서 출원 단계에서 심사청구가 있는 출원만을 대상으로 상기 특허 문건 집합을 특정 내지 확정할 수도 있을 것이다. 아울러, 실용신안제도를 도입하고 있는 경우, 실용신안등록출원 또는 실용신안등록을 각각 출원 또는 등록에 포함시킬 지의 여부 또한 상기 특허 문건 집합의 특정 내지 확정에 관한 문제로 이러한 특정 내지 확정의 방법과 기준은 당업자의 수준에서 다양하게 정할 수 있을 것이다.Although various patent indices or functions have been described with reference to the application, it will be obvious that they can also be defined and calculated based on registration. This is because the limitation of the application or registration is a matter of specifying or determining the set of patent documents. Therefore, the patent document set may be specified or determined only for the application for which the examination request is made at the application stage among the applications. In addition, in the case of introducing the utility model system, whether or not to include the utility model registration application or the utility model registration in each application or registration is also a problem concerning the specification or determination of the set of patent documents, May be variously determined at the level of those skilled in the art.

인용 관련Citation related

상기 특허 분류 기호별 통계 전처리 모듈은 주어진 특정 특허 분류 기호에 대하여 인용 정보를 포함하고 있는 국가별 특허 DB(예를 들면 미국 특허청 발행 특허 문건으로 구축한 특허 DB)를 대상으로 하여, 기설정된 기간별로, 출원 기준 및/또는 등록 기준으로 하기와 같은 인용 관련 정보를 생성해 놓을 수 있다. 상기와 같은 계산이 되는 이유는 상기 특허 분류 기호, 지정 국가, 설정 시간, 출원 또는 등록 여부 등의 조건을 만족하는 특허 문건 집합이 특정되기 때문이다. 예를 들면, USPC=002/456(Apparrel 중에서도 MISCELLANEOUS 중에서도 GUARD OR PROTECTOR 중에서도 Body cover)가 주어졌을 때, 자신의 특허 문건의 특허 분류 기호로 USPC=002/456를 포함하고 있는 특허 문건 집합을 특정할 수 있게 된다. 상기 특정된 특허 문건 집합에 관한 카운팅으로 하기와 같은 각종 파라미터를 계산할 수 있게 된다.The statistical preprocessing module for each patent classification symbol is a patent database (for example, a patent database established by the United States Patent and Trademark Office patent document) containing citation information for a given patent classification symbol, , The citation related information may be generated based on the application standard and / or the registration standard. The reason for the above calculation is that a set of patent documents satisfying the conditions such as the patent classification symbol, the designated country, the set time, the filing or the registration is specified. For example, when USPC = 002/456 (body cover among GUARD OR PROTECTOR among MISCELLANEOUS among Apparrel) is given, a set of patent documents containing USPC = 002/456 as a patent classification code of own patent document . The counting of the specified set of patent documents makes it possible to calculate various parameters as follows.

미국 특허 문건의 서지 사항에는 자기 문건이 인용한 타인의 특허 등의 인용 정보가 존재한다. 즉, B 문건이 A 문건 및 a 문건을 인용하고 있을 때, A 문건 및 a 문건은 인용당한 것, 즉 피인용 특허이며, B 문건은 A 및 a 문건을 인용한 특허이다. 이때, B 문건과 관계된 출원인(asignee를 포함한다), 발명자, 대리인, 특허 분류 기호는 각각 인용 출원인, 인용 발명자, 인용 대리인, 인용 특허 분류 기호가 되며, A 문건 및 a 문건과 관계된 출원인(asignee를 포함한다), 발명자, 대리인, 특허 분류 기호는 각각 피인용 출원인, 피인용 발명자, 피인용 대리인, 피인용 특허 분류 기호가 된다.In the bibliography of US patent documents, there is citation information such as patent of other person quoted by his / her own document. In other words, when document B is citing documents A and a, documents A and a are cited documents, ie patents are patents, and documents B are patents citing documents A and a. In this case, the applicant (including the asignee), the inventor, the agent and the patent classification code related to the document B become the patent applicant, the patentee, the patent agent, the patent agent, and the patent classification code, respectively. The inventor, the agent, and the patent classification code shall be patent categorizations for the patentee, the patentee, the patent agent, and the patentee, respectively.

B 문건이 A 문건 및 a 문건을 인용하고 있을 때, B 문건의 서지사항에는 A 문건 및 a 문건의 번호가 들어있다. B 문건이 A 문건 및 a 문건을 인용하는 관계가 있을 때, 전산적으로는 인용 관점에서의 A->B, a->B의 맵핑(mapping) 및 B->A, B->a로의 맵핑이 있게 되고, 상기 맵핑 관계를 통하여 A<->B, a<->B의 상호 관계를 성립시킬 수 있다. 그러므로, B가 인용하고 있는 문건은 B의 서지사항에 있어 용이 하게 데이터화 될 수 있게 되고, 상기 맵핑을 통하여, A가 인용되고 있는 문건들의 집합 및 a가 인용되고 있는 문건 집합도 용이하게 특정할 수 있다. 즉, 상기 A 문건이 인용되고 있는 문건 집합 및 a 문건이 인용되고 있는 문건 집합 모두에는 B 문건이 포함되어 있을 것이다. When the B document is citing the A document and the a document, the bibliography of the document B contains the numbers of the A document and the a document. When the B document has a relationship citing the A document and the a document, mapping from A-> B, a-> B and B-> A, B-> a are computationally cited , And the relationship of A <-> B, a <-> B can be established through the mapping relation. Therefore, a document that B is citing can be easily dataized in B's bibliography, and through the mapping, a set of documents to which A is being cited and a set of documents to which a is cited can easily be specified have. That is, the set of documents for which the A document is cited and the document for which the a document is cited will include the B document.

상기 과정은 확정 또는 특정된 모든 문건에 대하여 수행될 수 있을 것이다. 즉, 특정된 모든 특허 문건에 대하여 하나씩(one by one)으로 처리될 때, 하거나, 기타 전산적인 방법으로 모든 특허 문건 간의 맵핑 관계를 설정할 수 있을 것이다. 하나씩 처리될 때는 각 처리의 대상이 되는 문건은 B 문건의 입장에 서게 됨은 자명할 것이다. 이때, A 문건 및 a 문건이 모두 미국 특허 DB라는 전체 집합에 포함될 경우에는 A 문건 및 a 문건과 관련된 정보에 B 문건과의 매핍 정보가 기록될 수 있을 것이며, 어느 하나가 다른 국가의 특허 DB에서 관리되는 문건일 경우에도 그 국가의 특허 DB의 그 문건과 관련된 정보에 B 문건과의 매핍 정보가 기록될 수 있을 것이다.The process may be performed on all documents that have been confirmed or specified. That is, you can set mapping relationships between all patent documents when they are processed one by one for all specified patent documents, or in other computational ways. When processed one by one, it will be obvious that the document to be processed is placed in the position of document B. In this case, if both the A and the a documents are included in the overall set of US patent DB, the information related to the document A and the document a can be recorded in the information about the document B, Even in the case of a managed document, the information related to the document in the patent database of the country may be recorded in the information related to the document.

인용 정보를 포함하고 있는 특허 문건 집합(B가 있는 문건 집합)이 특정되면, 상기 특허 분류 기호별 통계 전처리 모듈은 상기 특허 문건 집합의 각각의 특허 문건에 포함되어 있는 피인용 특허 문건 번호(문건 A의 번호, 문건 a의 번호)를 취합할 수 있게 되고, 상기 취합된 피인용 특허 문건 번호에는 대응하는 피인용 특허 문건이 있으며, 상기 피인용 특허 문건에는 서지 사항이 포함되어 있음은 자명하다. 따라서, 상기 특정된 특허 문건 집합을 구성하는 모든 특허 문건이 인용한 피인용 특허 문건 집합 또한 특정한 계산 시점 기준으로 특정되게 된다. 그러므로, 상기 특정된 피인용 특허 문건 집합이 분석, 카운팅 또는 계산 대상이 될 수 있다. 이때, 중복 카운팅은 당연히 허용된다. 즉, 하나의 피인용 문건이 상기 인용 정보를 포함하고 있는 특허 문건 집합의 다수 문건에 의해 인용될 때, 상기 피인용 문건은 그 인용되는 회수만큼의 가중치 또는 카운팅 값을 가져야 한다. 즉, 각 피인용 문건 번호에는 상기 인용 회수만큼의 가중치가 곱해져 있다고 취급되면 타당할 것이다. 즉, 문건 A가 5번 인용되고, 문건 a가 3번 인용되었다고 할 때, 모든 인용출원수 등과 같은 하기 파라미터값을 계산할 때, A문건에는 5의 가중치를, a문건에는 3의 가중치를 두는 방식으로 계산하는 것이 바람직하다. 인용 정보를 포함하고 있는 특허 문건 집합을 기준으로 할 때, 상기 피인용 특허 문건 집합을 대상으로 한 모든 계산값은 "인용"이라는 말이 붙을 수 있다. 상기 중복 카운팅이 될 때, 가장 많은 회수로 인용된 피인용 문건 번호의 랭킹을 계산할 수 있을 것이다.If a set of patent documents containing citation information (a set of documents with B) is specified, the statistical preprocessing module for each patent classification symbol is a patent document number (refer to document A The number of the document a, and the number of the document a) can be collected. The collected patent document number includes the corresponding patent document, and it is obvious that the patent document for citing includes the bibliography. Accordingly, a set of patent documents for citing documents cited by all patent documents constituting the specified patent document set is also specified at a specific calculation time reference. Therefore, the specified set of patent documents for citing can be analyzed, counted, or calculated. At this time, redundant counting is, of course, allowed. That is, when a cited document is cited by a plurality of documents of a set of patent documents containing the citation information, the cited document must have a weight or a count value as many times as the cited document. That is, it is valid if each cited document number is deemed to be multiplied by the weight of the number of citations. That is, when document A is cited five times and document a is cited three times, when calculating the following parameter values such as the number of cited applications, a weight of 5 for the A document and a weight of 3 for the a document . Based on a set of patent documents containing citation information, all calculated values for the cited patent document set may be referred to as "quoted ". When the duplicate counting is performed, the rankings of cited document numbers quoted the greatest number of times will be calculated.

상기 특허 분류 기호별 통계 전처리 모듈은 상기 피인용 특허 문건 집합(문건 A, 문건 a가 있는 문건 집합) 또는 임의의 특정된 특허 문건 집합을 대상으로 하여 다음과 같은 정보들을 계산할 수 있을 것이다 그리고, 먼저 양적인 기준으로 인용출원수, 인용출원수 증가율, 인용출원수 증가속도(이들은 피인용 특허 문건 집합을 구성하는 문건의 개수로 알 수 있다.)를 알 수 있으며, 인용출원수증가가속도, 인용출원자수, 인용출원자수증가율, 인용평균출원자수, 인용평균출원자수증가율, 인용발명자수, 인용발명자수증가율, 인용평균발명자수, 인용평균발명자수증가율, 인용출원청구항수, 인용출원청구항수증가율, 인용평균출원청구항수, 인용평균출원청구항수증가율, 인용특허분류기호수, 인용특허분류기호수증가율, 인용평균특 허분류기호수, 인용평균특허분류기호수증가율도 계산될 수 있을 것이다. 상기 인용출원수, 인용출원수증가율, 인용출원수증가속도, 인용출원수증가가속도, 인용출원자수, 인용출원자수증가율, 인용평균출원자수, 인용평균출원자수증가율, 인용발명자수, 인용발명자수증가율, 인용평균발명자수, 인용평균발명자수증가율, 인용출원청구항수, 인용출원청구항수증가율, 인용평균출원청구항수, 인용평균출원청구항수증가율, 인용특허분류기호수, 인용특허분류기호수증가율, 인용평균특허분류기호수, 인용평균특허분류기호수증가율 등의 카운팅 방법은 상기 파라미터 계산 방법에서 설명한 바와 동일하다. 즉, 피인용 특허 문건 집합을 대상으로 하여, 출원수, 출원수증가율, 출원수증가속도, 출원수증가가속도, 출원자수, 출원자수증가율, 평균출원자수, 평균출원자수증가율, 발명자수, 발명자수증가율, 평균발명자수, 평균발명자수증가율, 출원청구항수, 출원청구항수증가율, 평균출원청구항수, 평균출원청구항수증가율, 특허분류기호수, 특허분류기호수증가율, 평균특허분류기호수, 평균특허분류기호수증가율을 계산하면, 그 값이 각각 상기 인용출원수, 인용출원수증가율, 인용출원수증가속도, 인용출원수증가가속도, 인용출원자수, 인용출원자수증가율, 인용평균출원자수, 인용평균출원자수증가율, 인용발명자수, 인용발명자수증가율, 인용평균발명자수, 인용평균발명자수증가율, 인용출원청구항수, 인용출원청구항수증가율, 인용평균출원청구항수, 인용평균출원청구항수증가율, 인용특허분류기호수, 인용특허분류기호수증가율, 인용평균특허분류기호수, 인용평균특허분류기호수증가율이 된게 된다.The statistical preprocessing module according to the patent classification symbol may calculate the following information about a set of patent documents for the cited document (document A, a document set with document a) or a specific set of patent documents, and The number of cited applications, the number of cited applications, the rate of increase in the number of cited applications (these can be known by the number of documents constituting the set of patent documents for cited persons) can be known, and the increase in the number of cited applications, The number of patent applicants, the number of patent applications, the average number of patent applications, the average number of patent applications, the number of patent inventors, the number of patent inventors, the average number of inventors, the number of patent inventors, the number of patent applications, Number of patent claims, number of filed patent applications, number of claims filed, patent categorizer lake, patent categorizer lake growth rate, citation average patent classifier lake, The growth rate of the patent classifier lake could also be calculated. The number of cited applications increased, the number of cited applications increased, the number of cited applications increased, the number of applications cited, the number of applications cited, the average number of applications cited, the average number of applications increased, the number of cited inventors, , The average number of inventors cited, the average number of inventors increased, the number of claims filed, the number of claims filed, the number of patent applications filed, the average patent filings filed, the number of filed patent claims, the patented classifier lake, The method of counting the classifier lakes, the average patent classifier lake growth rate, and the like is the same as described in the above parameter calculation method. In other words, with respect to a set of patent documents for citing, the number of applications, the number of applications, the rate of increase in the number of applications, the acceleration of the number of applications, the number of applicants, the number of applicants, the average number of applicants, Patent Classifier Lake Growth Rate, Average Patent Classifier Lake, Average Patent Classifier Lake Growth Rate, Number of Average Inventors, Average Number of Inventors, Number of Claims, Number of Claims, The rate of increase in the number of cited applications, the rate of increase in the number of cited applications, the rate of increase in the number of cited applicants, the rate of increase in the number of cited applicants, the number of cited average applications, the rate of increase in the number of cited average applications, The number of inventors, the number of inventors, the average number of inventors, the average number of inventors, the number of patent applications, the number of patent applications, the number of patent applications, , The rate of increase in the number of patent application claims, the rate of patent categorizer lake, the rate of patent categorizer lake increase, the average patent classifier lake, and the average patent classifier lake rate of citation.

마찬가지로 상기 특허 분류 기호별 통계 전처리 모듈은 상기 피인용 특허 문 건 집합을 대상으로 하여, 상기 각 파라미터별로 출원인, 발명자, 대리인 또는 특허 분류 기호의 랭킹을 계산할 수 있을 것이다. 즉, 출원수를 기준으로 한 최다 피인용 출원인별 랭킹, 최다 피인용 발명자별 랭킹을 계산할 수 있을 것이다. 이때, 상기 특허 분류 기호별 랭킹을 계산할 때, 본 발명의 하위 특허 분류 기호 체계를 활용하여 각 특허 분류 기호 체계 상의 각 단계별로 최다 인용 특허 분류 기호의 랭킹을 계산할 수 있을 것이다. 이때, 상기 피인용 특허 문건 집합에 포함된 피인용 특허 문건에 포함된 적어도 하나 이상의 특허 분류 기호가 포함되어 있을 것이며, 상기 포함된 특허 분류 기호의 (특허 분류 기호 체계 상의) 모든 상위 특허 분류 기호 각각도 인용되었다고 보는 것이 타당할 것이다. 그러므로, 이러한 관점이 상기 각 단계별 최다 인용 특허 분류 기호의 랭캥의 계산에 포함되는 것이 바람직할 것이다. 상기 피인용 특허 문건 집합에 포함된 미국 특허 문건을 대상으로 할 때에는 상기 특허 분류 기호는 USPC와 IPC 둘 다가 될 수도 있을 것이다.Similarly, the statistical preprocessing module for each patent classification symbol may calculate the ranking of the applicant, the inventor, the agent, or the patent classification symbol for each of the parameters for the patent document set for the cited person. That is, it is possible to calculate the ranking of the applicant for the highest number of applicants based on the number of applications, and the ranking for each inventor by the highest number of applicants. At this time, when calculating the rankings according to the patent classification symbols, the ranking of the most cited patent classification symbols may be calculated for each stage of each patent classification system using the sub-patent classification scheme of the present invention. At this time, at least one or more patent classification symbols included in the patent document for citing included in the set of cited patent documents will be included, and all the upper patent classification codes (in the patent classification scheme) It would be reasonable to assume that it was quoted. Therefore, it would be desirable for this view to be included in the calculation of the quotation of the most cited patent classification symbols for each step. When the US patent documents included in the set of patent documents for citing are targeted, the patent classification codes may be both USPC and IPC.

상기 특허 분류 기호별 통계 전처리 모듈은 상기 인용 정보를 포함하고 있는 특허 문건 집합(B가 있는 문건 집합)을 대상으로 하여, 출원인 정보를 획득할 수 있을 것이며, 상기 인용 정보를 포함하고 있는 출원인별 특허 문건 집합(상기 모든 출원인 중의 어느 하나의 출원인별 특허 문건 집합에는 상기 B 문건이 포함되어 있을 것이다.)을 대상으로 하여, 상기 출원인별 특허 문건 집합을 대상으로도 상기 피인용 특허 문건 집합을 특정할 수 있을 것이며, 특정된 상기 피인용 특허 문건 집합에 대하여 상기의 인용출원수 등과 같은 각종 파라미터를 계산할 수 있을 것이다. 이때, 상기 파라미터별로, 상기 출원인별 랭킹을 계산할 수 있을 것이다. 예를 들면, 연도별로 USPC=002/456(Apparrel 중에서도 MISCELLANEOUS 중에서도 GUARD OR PROTECTOR 중에서도 Body cover)로 특정된 특허 문건 집합에서 출원인들을 추출할 수 있으며, 상기 추출된 출원인별로의 문건 집합을 생성할 수 있게 된다.(예를 들면 미국 등록 번호 06401262호, assignee Benetton Group S.p.A., US Cl. 2/456; 2/411인 문건이 있을 때, 상기 Benetton Group S.p.A.,별로 USPC=002/456에 대응되는 상기 문건 집합을 생성할 수 있다.) 상기 특허 분류 기호별 통계 전처리 모듈은 상기 출원인별로 생성된 문건 집합을 대상으로 하여 동일한 방법으로 상기 피인용 문건 집합을 생성할 수 있게 되고, 상기 피인용 문건 집합에 대하여 상기의 각종 파라미터들을 계산할 수 있다.The statistical preprocessing module for each patent classification symbol can acquire applicant information on a patent document set (document set with B) containing the quotation information, and it is possible to obtain applicant information A set of documents (a set of patent documents per applicant of any one of the above-mentioned applicants will include the document B), and the set of patent documents for the cited persons is also specified for the set of patent documents for each applicant And it will be possible to calculate various parameters such as the number of cited applications for the specified set of cited patent documents. At this time, the applicant's ranking can be calculated for each parameter. For example, it is possible to extract applicants from a set of patent documents identified by USPC = 002/456 (body cover among GUARD OR PROTECTOR among MISCELLANEOUS among Apparrel, by year), and to generate a document set for each extracted applicant (For example, US Patent No. 06401262, assignee Benetton Group SpA, US Cl. 2/456; 2/411), the document set corresponding to the above Benetton Group SpA, USPC = 002 / The statistical preprocessing module for each patent classification symbol can generate the citation document set in the same manner for the document set generated for each applicant, Can be calculated.

상기 맵핑이 관점에서, 상기 특정된 특허 문건 집합에 포함된 각 특허 문건을 인용하고 있는 문건 집합, 즉 인용 문건 집합을 생성할 수 있을 것이다. 예를 들어 설명하면 B 문건을 C 문건 및 c 문건이 인용하고 있을 때, 상기 맵팽 관계를 통하여, B 문건은 C 문건 및, c 문건에 의해 인용당했다는 것을 알 수 있게 된다. 이러한 C 문건 및 c 문건을 찾기 위해서는 검색 엔진의 인용 문건 번호 필드에 B 문건의 번호를 입력하면, 검색 결과로서 B 문건이 인용 정보에 포함된 C 문건 등이 나오게 된다. 물론, DBMS(201)에 B 문건의 번호를 인용 번호 필드를 지정하여 select 등의 쿼리를 질의하여서도 C 문건 등을 검색 결과로 입수할 수 있게 된다. B->C, B->c의 맵핑 관계가 데이터로 정리되어 있다면, 상기 B 문건을 인용하고 있는 C 문건이 포함된 문건 집합을 용이하게 특정할 수 있을 것이다.From the above mapping point of view, a set of documents citing each patent document included in the specified set of patent documents, that is, a set of documents cited, may be generated. For example, when a B document is cited by a C document and a c document, it can be seen that the B document is cited by the C document and the c document. To find these C and c documents, enter the number of the B document in the citation document number field of the search engine. Then, the C document containing the B document in the citation information appears as the search result. Of course, it is possible to obtain a C document or the like as a search result by querying a query such as select by designating a quotation number field for the number of the B document in the DBMS 201. If the mapping relations of B-> C and B-> c are summarized as data, it is possible to easily specify a document set including the C document citing the B document.

그러므로, 상기 특허 분류 기호별 통계 전처리 모듈은 특정된 특허 문건 집 합에 대하여, 상기 특정된 특허 문건 집합의 개별 특허 문건을 인용하고 있는 인용 문건 집합을 대상으로 하여 하기와 같은 피인용 정보(인용 당한 B의 입장에서는 인용한 C가 포함된 문건들은 피인용 정보가 된다)를 계산할 수 있다.Therefore, the statistical preprocessing module for each patent classification symbol may be used for a set of patent documents citing the individual patent documents of the specified patent document set for a specified set of patent documents, In the case of B, the documents containing the quoted C are the information for reference).

특허 문건 집합(B가 있는 문건 집합)이 특정되면, 상기 특허 분류 기호별 통계 전처리 모듈은 검색 엔진 또는 DBMS(201)에 질의함으로써, 상기 특허 문건 집합의 각각의 특허 문건을 인용하고 있는 인용 특허 문건 번호(문건 C의 번호, 문건 c의 번호)를 취합할 수 있게 된다. 상기 취합된 인용 특허 문건 번호에는 서지 사항이 대응되어 있음은 자명하므로, 상기 특정된 특허 문건 집합을 구성하는 모든 특허 문건을 인용한 인용 특허 문건 집합 또한 특정한 계산 시점 기준으로 특정되게 된다. 그러므로, 상기 특정된 인용 특허 문건 집합이 분석, 카운팅 또는 계산 대상이 될 수 있다. If a set of patent documents (a set of documents with B) is specified, the statistical preprocessing module for each patent classification symbol queries the search engine or the DBMS 201 to obtain a patent document citing each patent document of the patent document set The number (the number of document C, the number of document c). Since it is obvious that the bibliographic references correspond to the collected patent document numbers, the set of patent documents citing all the patent documents constituting the specified patent document set is also specified at a specific calculation time reference. Therefore, the specified set of cited patent documents can be analyzed, counted, or calculated.

이때, 중복 카운팅은 허용되는 것이 바람직하다. 상기 특정된 특허 문건 집합에, B 문건과 b문건이 속한다고 할 때, 상기 B 문건과 b 문건이 모두 C 문건에 의해 인용된다고 할 때, 문건 C의 가중치는 2가 되어야 한다. 즉, C문건은 B 문건으로 검색해도 검색 결과에 포함되며, b 문건으로 검색해도 검색 결과에 포함된다. At this time, it is preferable that duplicate counting is allowed. If both the B document and the B document are cited by the C document, the weight of the document C should be two. That is, the C document is included in the search result even if the B document is searched.

상기 특정된 특허 문건 집합을 기준으로 할 때, 상기 인용 특허 문건 집합을 대상으로 한 모든 계산값은 "피인용"이라는 말이 붙을 수 있다. 상기 중복 카운팅이 될 때, 상기 특정된 특허 문건 집합에 속하는 특허 문건을 가장 많이 인용한 인용 문건 번호의 랭킹을 계산할 수 있을 것이다.Based on the specified set of patent documents, all calculated values for the cited patent document set may be referred to as "for citation. &Quot; When the duplicate counting is performed, the ranking of the citation document number that cites the most patent documents belonging to the specified set of patent documents will be calculated.

상기 특허 분류 기호별 통계 전처리 모듈은 상기 인용 특허 문건 집합(문건 C, 문건 c가 있는 문건 집합)을 대상으로 하여 상기와 같은 정보들을 계산할 수 있을 것이다.The statistical preprocessing module according to the patent classification symbol may calculate the above information on the cited patent document set (document C, document c with document c).

상기 특허 분류 기호별 통계 전처리 모듈은 마찬가지로 상기 인용 특허 문건 집합을 대상으로 하여, 상기 각 파라미터별로 출원인, 발명자, 대리인 또는 특허 분류 기호의 랭킹을 계산할 수 있을 것이다. 즉, 출원수를 기준으로 한 최다 인용 출원인별 랭킹, 최다 인용 발명자별 랭킹을 계산할 수 있을 것이다. 이때, 상기 특허 분류 기호별 랭킹을 계산할 때, 본 발명의 하위 특허 분류 기호 체계를 활용하여 각 특허 분류 기호 체계 상의 각 단계별로 최다 피인용 특허 분류 기호의 랭킹을 계산할 수 있을 것이다. 이때, 상기 인용 특허 문건 집합에 포함된 인용 특허 문건에 포함된 적어도 하나 이상의 특허 분류 기호가 포함되어 있을 것이며, 상기 포함된 특허 분류 기호의 (특허 분류 기호 체계 상의) 모든 상위 특허 분류 기호 각각도 인용과 관계되었다고 보는 것이 타당할 것이다. 그러므로, 이러한 관점이 상기 각 단계별 최다 인용 특허 분류 기호의 랭캥의 계산에 포함되는 것이 바람직할 것이다. 상기 인용 특허 문건 집합에 포함된 미국 특허 문건을 대상으로 할 때에는 상기 특허 분류 기호는 USPC와 IPC 둘 다가 될 수도 있을 것이다.The statistical preprocessing module for each patent classification symbol may similarly calculate the ranking of the applicant, inventor, agent, or patent classification symbol for each of the above-mentioned parameters for the cited patent document set. That is, it is possible to calculate the ranking for the most cited applicant and the ranking for the most cited inventor based on the number of applications. At this time, when calculating the rankings according to the patent classification symbols, the ranking of the patent classification symbols for each of the plurality of patent classification schemes can be calculated by using the sub-patent classification scheme of the present invention. At this time, at least one or more patent classification symbols included in the cited patent documents included in the cited patent document set will be included, and each of the upper patent classification codes (in the patent classification scheme) It would be reasonable to assume that Therefore, it would be desirable for this view to be included in the calculation of the quotation of the most cited patent classification symbols for each step. When the US patent documents included in the cited patent document set are targeted, the patent classification code may be both USPC and IPC.

이때, 상기 특허 분류 기호별 통계 전처리 모듈은 상기 특정된 특허 문건 집합(B와 b가 있는 문건 집합) 자체를 분석할 수 있다. 이 때도 중복 카운팅이 허용되어 중복된 회수만큼 가중치를 부여하여 취급하는 것이 바람직하다. 예를 들면, 문건 B가 5번 인용되고, 문건 b가 3번 인용되었다고 할 때, 모든 상기 특정된 특허 문건 집합을 대상으로 피인용출원수 등과 같은 하기 파라미터값을 계산할 때, B문 건에는 5의 가중치를, b문건에는 3의 가중치를 두는 방식으로 계산하는 것이 바람직하다. 상기 특허 분류 기호별 통계 전처리 모듈은 상기 가중치를 반영하여 상기 특정된 특허 문건 집합을 대상으로 하여 가장 많이 인용된 특허 문건의 랭킹을 계산할 수 있을 것이다. 나아가, 상기 특허 분류 기호별 통계 전처리 모듈은 상기 특정된 특허 문건 집합(문건 B, 문건 b가 있는 문건 집합)을 대상으로 하여 먼저 양적인 기준으로 피인용출원수(B는 C에 의해서 인용되었으므로, B의 입장에서는 피인용이라는 말을 사용할 수 있다.) 등의 값을 카운팅하거나 계산할 수 있을 것이다.At this time, the statistical preprocessing module according to the patent classification symbol can analyze the specified patent document set (the document set having B and b) itself. Even at this time, it is preferable that duplicate counting is allowed, and weights are assigned to the duplicated number of times. For example, when document B is cited five times and document b is cited three times, when calculating the following parameter values, such as the number of applications for culling, for all the specified set of patent documents, , And a weight of 3, and a weight of 3. The statistical preprocessing module for each patent classification symbol may calculate the ranking of the most cited patent documents on the specified set of patent documents by reflecting the weight. Further, the statistical preprocessing module according to the patent classification symbol is used for the number of cited patent applications (C is cited by C) on a quantitative basis, with respect to the specified set of patent documents (documents B and B) ) Can be used to count or compute values such as.

상기 특허 분류 기호별 통계 전처리 모듈은 상기 특정된 특허 문건 집합(B가 있는 문건 집합)을 대상으로 하여, 출원인 정보를 획득할 수 있을 것이며, 상기 출원인별 특허 문건 집합(상기 모든 최빈 출원인 중의 어느 하나의 출원인별 특허 문건 집합에는 상기 B 문건이 포함되어 있을 것이다.)을 대상으로 하여, 상기 출원인별 특허 문건 집합을 대상으로도 상기 인용 특허 문건 집합을 특정할 수 있을 것이며, 특정된 상기 인용 특허 문건 집합에 대하여 상기의 피인용출원수 등과 같은 각종 파라미터를 계산할 수 있을 것이다. 이때, 상기 파라미터별로, 상기 출원인별 랭킹을 계산할 수 있을 것이다. 예를 들면, 연도별로 USPC=002/456(Apparrel 중에서도 MISCELLANEOUS 중에서도 GUARD OR PROTECTOR 중에서도 Body cover)로 특정된 특허 문건 집합에서 출원인들을 추출할 수 있으며, 상기 추출된 출원인별로의 문건 집합을 생성할 수 있게 된다.(예를 들면 미국 등록 번호 06401262호, assignee Benetton Group S.p.A., US Cl. 2/456; 2/411인 문건이 있을 때, 상기 Benetton Group S.p.A.,별로 USPC=002/456에 대응되는 상기 문건 집합을 생성할 수 있다.) 상기 특허 분류 기호별 통계 전처리 모듈은 상기 출원인별로 생성된 문건 집합을 대상으로 하여 동일한 방법으로 상기 인용 문건 집합을 생성할 수 있게 되고, 상기 인용 문건 집합에 대하여 상기의 각종 파라미터들을 계산할 수 있다.The statistical preprocessing module for each patent classification symbol may acquire the applicant information on the specified set of patent documents (set of documents with B), and the set of patent documents for each applicant (any one of the above-mentioned best applicants The set of patent documents for each applicant will be included in the set of patent documents for each applicant), and the set of patent documents for the citation It is possible to calculate various parameters such as the above-mentioned number of cited applications for the set. At this time, the applicant's ranking can be calculated for each parameter. For example, it is possible to extract applicants from a set of patent documents identified by USPC = 002/456 (body cover among GUARD OR PROTECTOR among MISCELLANEOUS among Apparrel, by year), and to generate a document set for each extracted applicant (For example, US Patent No. 06401262, assignee Benetton Group SpA, US Cl. 2/456; 2/411), the document set corresponding to the above Benetton Group SpA, USPC = 002 / The statistical preprocessing module for each patent classification symbol can generate the citation document set in the same manner with respect to the document set generated for each applicant, Parameters can be calculated.

본 발명의 상기 특허 분류 기호별 통계 전처리 모듈이 상기 각종 파라미터들을 계산하는 방법은 확정 또는 특정된 특허 문건 집합별로 계산값을 생성하는 것을 특징으로 한다. 그러므로, 상기 특허 분류 기호별 통계 전처리 모듈의 입장에서는 사기 특허 분류 기호별 통계 전처리 모듈에 입력되는 특정된 특허 문건 집합의 존재가 중요하며, 그 특허 문건 집합의 속성과 크기는 문제되지 않는다.(다만, 인용과 관련된 계산을 하기 위해는 상기 특정된 특허 문건 집합에 인용 정보가 포함되어 있어야 한다.) The method for calculating the various parameters by the statistical preprocessing module according to the patent classification symbol according to the present invention is characterized in that calculation values are generated for each defined or set patent document set. Therefore, in the view of the statistical preprocessing module according to the patent classification symbol, it is important that the set of the specified patent document inputted to the statistical preprocessing module according to the fraud patent classification symbol is important, and the property and the size of the patent document set do not matter , Citation information must be included in the specified set of patent documents in order to perform calculations related to citations.)

아울러, 본 발명의 상기 특허 분류 기호별 통계 전처리 모듈은 상기 특허 분류 기호별 통계 전처리 모듈을 사용하는 사용자가 지정한 특허 문건 집합에 대해서도 상기 파라미터 중 적어도 어느 하나 이상을 계산할 수 있을 것이다.In addition, the statistical preprocessing module for each patent classification symbol of the present invention may calculate at least one of the above parameters for a set of patent documents designated by the user using the statistical preprocessing module for each patent classification symbol.

특허 정보 처리 기본 모듈(40)Patent Information Processing Base Module (40)

이어서, 본 발명의 특허 정보 처리 기본 모듈(40)에 대해서 설명한다. 상기 특허 정보 처리 기본 모듈(40)에는 1) 검색 엔진 모듈, 2) 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402), 3) 분석 모듈, 4) 감시 모듈(403), 5) 특허 문건 집합 입수 모듈, 6) 디렉토리 생성 모듈(405), 7) 리포팅 모듈(406), 8) 간이 분석 모듈(407) 등이 있다.Next, the patent information processing basic module 40 of the present invention will be described. The patent information processing basic module 40 includes a search engine module 2) an operation execution result table generation module 402 for multidimensional analysis, 3) an analysis module, 4) a monitoring module 403, and 5) Module 6) a directory creation module 405, 7) a reporting module 406, 8) a simple analysis module 407, and the like.

상기 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 동종의 특허 분류 기호에 대한 처리를 설명하였다. 이어서, 상기 이종 복수 특허 분류 기호 관계성 전처리 모듈에 대해서 설명한다. 상기 이종 복수 특허 분류 기호 관계성 전처리 모듈은 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈의 작동 방식과 완전히 유사하다. 미국에서 발행되는 특허 문건에는 하나의 특허 문건에 IPC와 USPC가 동시에 결합되어 있는 것이 거의 전부이며, 일본에서 발행되는 특허 문건에도 IPC와 FT가 동시에 부여되어 있다.된다. 하나의 특허 문건에 IPC1, IPCn이, USPC1, ...USPCm 부여되어 있다고 가정하자. The above-mentioned homogeneous plural patent classification related pre-processing module described the processing of the same type of patent classification symbol. Next, the heterogeneous plural patent classification symbol relation pre-processing module will be described. The heterogeneous plural patent classification related relationship pre-processing module is completely similar to that of the homogeneous plural patent classification related pre-processing module. In the United States, patent documents include almost all IPCs and USPCs combined in a single patent document, and IPCs and FTs are assigned to patent documents issued in Japan at the same time. Suppose IPC1, IPCn, USPC1, ... USPCm are assigned to one patent document.

이 경우, 이종 복수 특허 분류 기호 관계성 처리의 관점에서는 상기 IPC1, IPCn과 In this case, in view of the heterogeneous plural patent classification related processing, the IPC1, IPCn,

USPC1, ... , USPCm사이에는 4가지 종류의 관계성의 조합이 생긴다. 먼저 IPC1과 There are four kinds of relationship combinations between USPC1, ..., and USPCm. First,

USPC1의 관계로, 이들은 각각 main IPC와 main USPC의 쌍이 된다. 그리고, IPC1와 In relation to USPC1, they are each a pair of main IPC and main USPC. Then,

USPC2,...USPCm의 쌍으로 이는 main IPC와 sub USPC와의 쌍이 된다. 그리고, IPC2, IPCn와, USPC1의 쌍으로, 이는 sub IPC와 main USPC의 쌍이 된다. 그리고, IPC2, ..., IPCn각각과 USPC2, ... USPCm각각에 대한 쌍으로, 이는 sub IPC와 sub USPC의 쌍이 된다.A pair of USPC2, ... USPCm, which is a pair of a main IPC and a sub USPC. Then, in the pair of IPC2, IPCn and USPC1, this is a pair of sub IPC and main USPC. Then, pairs of IPC2, ..., IPCn and USPC2, ..., USPCm, respectively, are a pair of sub IPC and sub USPC.

상기 이종 복수 특허 분류 기호 관계성 전처리 모듈은 각 4가지 종류별로, 각 종류 하에 있는 이종 복수 특허 분류 기호들 간에서 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈이 수행한 바와 동일한 처리를 할 수 있다. 이때, 이종 특허 분류 기호를 대상으로 하므로, 상기 최소 공통 특허 분류 기호는 없기 때문에 기설정된 값으로 한정할 수 있다. 즉, IPC의 경우에는 subclass에서, USPC에서는 class까지만을 대상으로 하여, 주어진 IPC와 주어진 USPC 쌍에 대하여 상위 특허 분류 기호를 한정지을 수 있다. 즉, 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈은 주어진 특허 분류 기호 조합(Ai, Bj)(단, Ai, Bj는 동일한 체계의 특허 분류 기호이다.)에 대하여 상기와 같은 관계성 테이블을 생성한 바와 같이, 상기 이종 복수 특허 분류 기호 관계성 전처리 모듈은 특허 분류 기호 조합(Ai, Bj) (단, Ai, Bj는 다른 체계의 특허 분류 기호이다.)에 대하여 상기와 같은 관계성 테이블을 생성할 수 있다. 상기 생성된 관계성 테이블에 대하여 가중치 또는 기여도 가중치를 부여할 수 있을 것이다. 또한, 각 이종 특허 분류 기호 조합(Ai, Bj) (단, Ai, Bj는 다른 체계의 특허 분류 기호이다.)에 대하여, 특허 문건 번호를 대응시킬 수 있음과, 상기 특허 문건 번호로부터 입수되는 서지사항을 반영한 카운팅 및 계산을 수행할 수 있음도 자명할 것이다. 아울러, 특정한 문건 집합에 대하여, 이종 특허 분류 기호 조합(Ai, Bj)를 획득할 수 있을 것이며, 이들 조합에 대한 상기 동종 복수 특허 분류 기호 관계성 전처리 모듈과 동등한 분석을 수행할 수 있을 것이 다.The above-mentioned heterogeneous plural patent classification symbol relationship pre-processing module can perform the same processing as that performed by the homogeneous plural patent classification symbol relationship pre-processing module among different heterogeneous patent classification symbols under each type for each four kinds. At this time, because the heterogeneous patent classification symbol is targeted, since there is no minimum common patent classification symbol, it can be limited to a predetermined value. That is, the upper patent classification code can be defined for a given IPC and a given USPC pair, targeting only the subclass in the case of IPC and the class in the USPC. That is, the above-mentioned homogeneous plural patent classification symbol relation preprocessing module generates the above relationship table for a given patent classification symbol combination (Ai, Bj) (where Ai and Bj are patent classification codes of the same system) As described above, the heterogeneous plural patent classification related relationship preprocessing module generates the above relationship table for the patent classification symbol combination (Ai, Bj) (where Ai and Bj are patent classification codes of different systems) . A weight or a contribution weight may be given to the generated relationship table. In addition, it is possible to correspond the patent document number to each of the heterogeneous patent classification symbol combinations (Ai, Bj) (where Ai and Bj are patent classification codes of different systems), and the bibliography It will also be appreciated that counting and calculation may be performed to reflect the matter. In addition, for a particular set of documents, a heterogeneous patent classification combination (Ai, Bj) may be obtained and an equivalent analysis of the homogeneous multiple patent classification related preprocessing module for these combinations may be performed.

특허 분류 기호 검색 모듈(401)Patent Classification Search Module (401)

이어, 특허 분류 기호 검색 모듈(401)에 대해서 설명한다. 특허 분류 기호의 검색이란 검색어로서 기술 키워드를 입력했을 때, 상기 기술 키워드를 타이틀 정보로 포함하고 있는 특허 분류 기호를 검색 결과로 출력해 주는 것을 말한다.Next, the patent classification code search module 401 will be described. The search of a patent classification symbol refers to outputting, as a search result, a patent classification symbol including the technical keyword as title information when a technical keyword is inputted as a search term.

이어서, 하기의 예로써 본 발명의 특징적인 특허 분류 기호 검색을 설명한다.Next, the characteristic patent classification search of the present invention will be described with the following examples.

　　　　섹션　　　　　　　　　　　 : H　　　　　 전기 Section: H Electricity

　　　　클래스　　　　　　　　　 : H01　　 기본적 전기소자 Class: H01 Basic electrical element

　　　　서브클래스　　　　　 : H01F　 자석 Subclass: H01F Magnets

　　　　메인그룹　　　　　　　 : H01F 1/00　 자성재료를 특징으로 하는 자석 또는 자성체 Main group: H01F 1/00 Magnet or magnetic body characterized by magnetic material

　　　　1-도트 서브그룹 : H01F 1/01 *무기재료로 된 것 1-dot subgroup: H01F 1/01 * Inorganic materials

　　　　2-도트 서브그룹 :　H01F 1/03 ** 보자력에 의해 특징되는 것 2-dot subgroup: H01F 1/03 ** characterized by coercivity

　　　　3-도트 서브그룹 :　H01F 1/032 *** 경질 자성재료의 것 3-dot subgroup: H01F 1/032 *** Of hard magnetic material

　　　　4-도트 서브그룹 :　H01F 1/04 **** 금속 또는 합금 4-dot subgroup: H01F 1/04 **** Metal or alloy

　　　　5-도트 서브그룹 :　H01F 1/047 ***** 조성물로 특징되는 합금 5-dot subgroup: H01F 1/047 ***** Alloy characterized by composition

　　　　6-도트 서브그룹 :　H01F 1/053 ****** 희토류 금속을 포함하는 것6-dot subgroup: H01F 1/053 ****** Containing rare-earth metals

본 발명의 특허 분류 기호 검색의 첫번째 특징은 하위 특허 분류 기호가 검색되었을 때, 그 검색된 하위 특허 분류 기호의 상위 특허 분류 기호를 함께 그 결 과로 제공해 주는 것이다. 예를 들어, 검색어로 "경질 자성재료"가 입력되었을 때, "H01F 1/032 ·. 경질 자성재료의 것"을 검색결과로 제시함과 아울러, 상기 H01F 1/032를 특허 분류 기호 마스트 DB(203)에 조회하여, 상기 특허 분류 기호 체계 트리에서 상기 H01F 1/032의 기설정된 레벨까지의 상위 특허 분류 기호를 검색 결과로 제공해 주는 것이 바람직할 것이다. 즉, H01F 1/032를 검색어로 입력되었을 때, 하기와 같은 결과가 검색 결과로 나오는 것이 바람직할 것이다. 상기 기설정된 레벨은 IPC의 경우에는 서브클래스 USPC의 경우에는 클래스, FT의 경우에는 테마, ECLA나 FI의 경우에는 서브클래스인 것이 바람직할 것이나, 그 상위나 그 하위 레벨까지 검색 결과로 제시해 주어도 무방할 것이다.The first feature of the patent classification search of the present invention is that when a sub-patent classification code is searched, the top patent classification code of the searched sub-patent classification is provided together as a result thereof. For example, when a "hard magnetic material" is input as a search term, "H01F 1/032. Of a hard magnetic material" is presented as a search result, and H01F 1/032 is referred to as a patent classification symbol mast DB 203), and it is preferable to provide the upper patent classification symbol from the patent classification symbol system tree up to the preset level of H01F 1/032 as a search result. In other words, when H01F 1/032 is input as a search term, it is preferable that the following result is displayed as a search result. It is preferable that the predetermined level is a subclass in case of IPC, a class in case of subclass USPC, a theme in case of FT, or a subclass in case of ECLA or FI, something to do.

　　　　서브클래스　　　　　 : H01F　 자석 Subclass: H01F Magnets

　　　　1-도트 서브그룹 : H01F 1/01 * 무기재료로 된 것 1-dot subgroup: H01F 1/01 * Inorganic materials

상기와 같은 결과를 출력해 주기 위해서는 다음과 같은 단계를 거친다. 첫째, 검색어로 입력된 키워드를 특허 분류 기호를 타이틀 정보와 함께 인덱싱 해 놓은 특허 분류 기호 인덱스에 질의하여, 상기 검색어를 특허 분류 기호의 타이틀 정보에 포함되어 있는 하나 이상의 검색 결과로서의 특허 분류 기호를 찾아 낸다.("경질 자성 재료"를 입력했을 때, H01F 1/032을 찾아낸다.) 둘째, 찾아낸 특허 분류 기호를 특허 분류 기호 마스트 DB(203)에 질의하여, 상기 특허 분류 기호의 기설정된 단계까지의 상위 특허 분류 기호를 찾아낸다.( H01F 1/03, H01F 1/01, H01F 1/00, H01F를 찾아낸다.) 셋째, 상기 찾아낸 상위 특허 분류 기호를 상기 특허 분류 기호 마스트 DB(203)에 질의하여 상기 찾아낸 상위 특허 분류 기호의 타이틀 정보를 입수한다. 넷째, 상기 첫째부터 셋째 단계까지의 입수한 특허 분류 기호 및 상기 특허 분류 기호의 타이틀 정보를 도트 구조 등의 계층을 표시하는 정보와 함께 결합하여 표시해 준다.In order to output the above result, the following steps are performed. First, a keyword entered as a search term is compared with a patent classification symbol index indexed with title information, and the search term is searched for a patent classification symbol as one or more search results included in the title information of the patent classification symbol (When the "hard magnetic material" is inputted, H01F 1/032 is found.) Second, the patent classification code found is referred to the patent classification code mast DB 203, (H01F 1/03, H01F 1/01, H01F 1/00, H01F are found). Third, the above-mentioned top patent classification symbol is found in the patent classification code mast DB 203 And obtains the title information of the found patent classification symbol. Fourth, the patent classification symbol obtained from the first to the third steps and the title information of the patent classification symbol are combined and displayed together with information indicating a layer such as a dot structure.

한편, 상기 특허 분류 기호 검색에서 "경질 자성 재료" AND "희토류"로 검색 질의를 했을 경우, 상기 어느 행에서도 "자석"과 "희토류"가 동시에 들어간 행은 없기 때문에 검색 결과를 제공해 줄 수 없다. 하지만, 특허 분류 기호 체계를 고려할 때는 상기 "H01F 1/053 ******희토류 금속을 포함하는 것"가 검색 결과로 출력되어야만 한다. 그러므로, 상기와 같은 문제를 해결하기 위하여, 단순히 행 단위로만 인덱싱을 하면 특허 분류 기호의 체계성을 반영하는 검색 결과를 제공해 주지 못하게 된다. 이를 위해서는 다음과 같이 특허 분류 기호 및 그 타이틀 정보를 변형한다. 변형의 핵심은 각 타이틀 정보에 자신의 상위 특허 분류 기호의 타이틀 정보를 모두 병합시켜 포함시켜 놓는 것이다. 하기 표 23는 그 일 형태를 보여 주고 있다.On the other hand, when a search query is made with "hard magnetic material" AND "rare earths " in the patent classification search, no rows in which" magnet "and" rare earth "are simultaneously present in any of the above rows can not provide search results. However, when considering the patent classification system, "H01F 1/053 ****** containing rare earth metal" should be output as a search result. Therefore, in order to solve the above problems, indexing only on a row-by-row basis does not provide a search result reflecting the systematicity of the patent classification symbol. To do this, we modify the patent classification symbol and title information as follows. The core of the modification is to incorporate all the title information of the upper patent classification symbol of each title into each title information. Table 23 below shows the work type.

[표 23][Table 23]

IPC 기호IPC symbol 병합된 타이틀 정보Merged title information H　H 전기Electricity H01H01 전기; 기본적 전기소자;Electricity; Basic electric device; H01FH01F 전기; 기본적 전기소자; 자석Electricity; Basic electric device; magnet H01F 1/00　H01F 1/00 전기; 기본적 전기소자; 자석; 자성재료를 특징으로 하는 자석 또는 자성체Electricity; Basic electric device; magnet; A magnet or a magnetic substance characterized by a magnetic material H01F 1/01H01F 1/01 전기; 기본적 전기소자; 자석; 자성재료를 특징으로 하는 자석 또는 자성체; 무기재료로 된 것Electricity; Basic electric device; magnet; A magnet or a magnetic material characterized by a magnetic material; Of inorganic material H01F 1/03H01F 1/03 전기; 기본적 전기소자; 자석; 자성재료를 특징으로 하는 자석 또는 자성체; 무기재료로 된 것; 보자력에 의해 특징되는 것Electricity; Basic electric device; magnet; A magnet or a magnetic material characterized by a magnetic material; Made of inorganic materials; What is characterized by coercivity H01F 1/032H01F 1/032 전기; 기본적 전기소자; 자석; 자성재료를 특징으로 하는 자석 또는 자성체; 무기재료로 된 것; 보자력에 의해 특징되는 것; 경질 자성재료의 것Electricity; Basic electric device; magnet; A magnet or a magnetic material characterized by a magnetic material; Made of inorganic materials; Characterized by coercivity; Of hard magnetic material H01F 1/04　H01F 1/04 전기; 기본적 전기소자; 자석; 자성재료를 특징으로 하는 자석 또는 자성체; 무기재료로 된 것; 보자력에 의해 특징되는 것; 경질 자성재료의 것; 금속 또는 합금Electricity; Basic electric device; magnet; A magnet or a magnetic material characterized by a magnetic material; Made of inorganic materials; Characterized by coercivity; Of a hard magnetic material; Metal or alloy H01F 1/047H01F 1/047 전기; 기본적 전기소자; 자석; 자성재료를 특징으로 하는 자석 또는 자성체; 무기재료로 된 것; 보자력에 의해 특징되는 것; 경질 자성재료의 것; 금속 또는 합금; 조성물로 특징되는 합금Electricity; Basic electric device; magnet; A magnet or a magnetic material characterized by a magnetic material; Made of inorganic materials; Characterized by coercivity; Of a hard magnetic material; Metal or alloy; Alloy characterized by composition H01F 1/053H01F 1/053 전기; 기본적 전기소자; 자석; 자성재료를 특징으로 하는 자석 또는 자성체; 무기재료로 된 것; 보자력에 의해 특징되는 것; 경질 자성재료의 것; 금속 또는 합금; 조성물로 특징되는 합금; 희토류 금속을 포함하는 것Electricity; Basic electric device; magnet; A magnet or a magnetic material characterized by a magnetic material; Made of inorganic materials; Characterized by coercivity; Of a hard magnetic material; Metal or alloy; An alloy characterized by a composition; Containing rare earth metals

한편, 상기는 최상위 특허 분류 기호까지 그 타이틀 정보를 병합시켜 놓았으나, 기설정된 레벨까만 병합시켜 놓을 수 있다. 상기 병합시켜 놓는 기설정된 레벨은 IPC의 경우에는 서브클래스 USPC의 경우에는 클래스, FT의 경우에는 테마, ECLA나 FI의 경우에는 서브클래스인 것이 바람직하다.Meanwhile, the title information is merged up to the highest patent classification symbol, but it can be merged with a predetermined level black. The predetermined level to be merged is preferably a class in case of IPC, a class in case of subclass USPC, a theme in case of FT, and a subclass in case of ECLA or FI.

상기 특허 분류 기호와 상기 병합된 타이틀 정보를 행 단위로 인덱싱을 해 놓은 경우, 검색어로 "경질 자성 재료" AND "희토류"를 입수 받으면, 검색 결과로 H01F 1/053을 검색 결과로 찾아내 줄 수 있다. 한편, 하나의 특허 분류 기호가 검색 결과로 나왔을 때(예를 들면 상기 H01F 1/053가 나왔을 때), 상기 검색 결과의 모든 또는 기설정된 상위 특허 분류 기호 및 그 특허 분류 기호의 타이틀 정보를 함께 검색 결과로 제공해 줄 수 있음은 전술한 바와 같다. 즉, "경질 자성 재료" AND "희토류"라는 검색어가 입력되었을 때, 하기와 같은 검색 데이터가 검색 결과에 포함되어야 한다.When the patent classification symbol and the merged title information are indexed on a row-by-row basis, when "hard magnetic material" AND "rare earth" is obtained as a search term, H01F 1/053 can be found as a search result have. On the other hand, when one patent classification symbol comes out as a search result (for example, when H01F 1/053 comes out), all or a predetermined top patent classification symbol of the search result and title information of the patent classification symbol are searched together The results can be provided as described above. That is, when the search term "hard magnetic material" AND "rare earth" is input, the following search data should be included in the search result.

만약, "금속 AND 보자력"이 검색어로 입력되었을 경우, 하기와 같은 검색 결과로 최종적으로 나오는 것이 바람직하다.If "metal AND coercive force" is entered as a search term, it is preferable that the search result is as follows.

　　　　서브클래스　　　　　 : H01F　 자석 Subclass: H01F Magnets

이대, H01F 1/03 주위에 이 특허 분류의 하위 특허 분류를 확장하여 볼 수 있는 수단(예를 들면 (+) 기호/마크 등)이 있고, 사용자가 이 수단을 선택하였을 경우, 하위 특허 분류를 계층적으로 표시하여 주는 것이 바람직할 것이다.There are means (eg, (+) symbol / mark, etc.) extending around the sub-patent classification of this patent classification around H01F 1/03, and if the user selects this means, It would be desirable to display them hierarchically.

도 16은 본 발명의 특허 분류 기호 검색 모듈(401)이 작동되는 일 구현예에 관한 일실시예적 도면이다. 검색은 특허 분류 기호를 선택받는 단계, 특허 분류 기호를 검색할 검색 언어를 선택받는 단계 및 적어도 하나 이상의 검색어를 입력받는 단계 및 입력 받은 검색식으로 검색을 수행하여, 검색 결과를 생성하는 단계로 이루어진다.16 is an exemplary diagram of an embodiment in which the patent classification code search module 401 of the present invention is operated. The search includes a step of selecting a patent classification symbol, a step of selecting a search language to search for a patent classification symbol, a step of inputting at least one search word, and a step of performing a search using an inputted search expression to generate a search result .

한편, 상기 특허 분류 기호 검색 모듈(401)은 각 특허 분류 기호간의 계층 구조(child-patent) 정보를 활용하여, 상기 병합된 타이틀 정보를 활용하지 않고도 본 발명의 목적을 달성할 수도 있다. 검색어/질의어 입력 결과로 나오는 2 이상의 특허 분류 기호가 있을 때, 이들이 특허 분류 기호 체계 상에서 child-parent 관계가 있는지를 조사함으로써 동일한 목적을 달성할 수 있다.Meanwhile, the patent classification symbol search module 401 may use the child-patent information between each patent classification symbol to achieve the object of the present invention without utilizing the merged title information. When two or more patent classification symbols appear as a result of query / query entry, they can achieve the same purpose by examining whether there is a child-parent relationship in the patent classification system.

특허 문건 집합 입수 모듈(404)Patent document collection module (404)

이어 본 발명의 특허 문건 집합 입수 모듈(404)에 대해 설명한다. 상기 특허 문건 집합 입수 모듈(404)에는 자동 선정 문건 집합 입수 모듈(404-1)과 사용자 직접 생성 문건 집합 입수 모듈(404-2)이 있으며, 상기 사용자 직접 생성 문건 집합 입수 모듈(404-2)에는 검색식을 통한 문건 입수 모듈(404-2-1)과 IPC 디렉토리 등과 같이 문건 집합이 특정될 수 있는 디렉토리 상에서의 선택을 통해서 문건 집합을 입수할 수 있는 디렉토리 상에서의 선택을 통한 문건 입수 모듈(404-2-2)이 있다. 한편, 특허 문건 집합 입수 모듈(404)은 자신 또는 타인이 생성한 특허 문건 집합을 입수하거나 선택할 수도 있으며, 상기 특허 문건 집합 입수 모듈(404)은 임의의 2개 이상의 특허 문건 집합의 집합 연산(합집합, 차집합, NOT 연산 등 임의의 집합 연산)을 통하여 새로운 특허 문건 집합을 생성하도록 할 수 있다. 상기 검색식을 통한 문건 입수 모듈(404-2-1)에는 검색식의 질의 대상에 따라, 검색 엔진에 질의하는 검색 엔진을 통한 문건 집합 입수 모듈(404-2-1-1)과 DBMS(201)에 질의하는 DBMS(201)를 통한 문건 집합 입수 모듈(404-2-1-2)이 있다. 상기 자동 선정 문건 집합 입수 모듈(404-1)은 본 발명의 명세서 상에서 설명한 특허 문건 집합의 확정에서 소개한 각종 확정된 특허 문건 집합 중 사용자에 의해서 생성된 것을 제외 하고는 자동적으로 입수할 수 있다.Next, the patent document set acquisition module 404 of the present invention will be described. The document collection set access module 404 includes an automatically selected document collection access module 404-1 and a user generated document collection access module 404-2. A document retrieval module 404 through a selection on a directory in which a document set can be obtained through a selection on a directory in which a document set can be specified such as a document retrieval module 404-2-1 through a retrieval expression and an IPC directory 404-2-2). Meanwhile, the patent document set acquisition module 404 may obtain or select a patent document set generated by itself or others, and the patent document set acquisition module 404 may perform a set operation (combination of two or more patent document sets) , A set of differences, a NOT operation, and the like) to generate a new set of patent documents. The document retrieval module 404-2-1 through the retrieval expression is provided with a document collection module 404-2-1-1 through a search engine that queries a search engine and a DBMS 201 The document collection module 404-2-1-2 via the DBMS 201 for querying the document collection module 404-2-1-2. The automatically selected document set acquisition module 404-1 can be automatically obtained from a set of various confirmed patent documents introduced in the determination of the set of patent documents described in the specification of the present invention, except for those generated by the user.

다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)The multi-dimensional analysis operation result table generation module 402

이어 본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)에 대해서 설명한다. 특허 정보에 관한 데이터가 DB로 구축되어 있을 경우, 통상적으로 SQL 쿼리문을 사용하여 원하는 결과를 출력해 낸다. 이때, DB 구조가 잘 설계되어 있더라도, 하나의 결과를 도출해 내기 위해서 다수의 테이블을 조인(join)하여야 하고, 각 테이블의 크기가 클 경우, select문 등을 처리하는데 상당히 많은 시간이 걸린다. Next, the multi-dimensional analysis operation result table generation module 402 of the present invention will be described. If the data on patent information is constructed in DB, usually the SQL query statement is used to output the desired result. At this time, even if the DB structure is well designed, it is necessary to join a plurality of tables in order to derive a single result, and when each table is large, it takes considerable time to process a select statement.

예를 들어, 삼성전자주식회사의 대한민국에서 다출원 IPC 1 dot subgroup별로 연도별 점유율를 구할 경우, 이를 단순 select문으로 구성할 경우, 여러 종류의 테이블(DB 스키마의 설계에 따라 다를 수 있지만 통상적으로 수개 이상의 테이블(예를 들면, 국가별 출원인 테이블, IPC 테이블, 문건 테이블 등)이 join되어야 한다. 먼저 삼성전자의 IPC 1 dot subgroup 레벨에서의 다출원 IPC를 추출해야 하며, 이때, 하위 IPC가 자동적으로 포함되어야 하면 각 IPC별로 하위 IPC를 추출해 내야 한다. 이어, 추출한 1 dot subgroup IPC 에 대하여 삼성전자의 대한민국의 모든 특허 출원 데이터를 기준으로 연도별 출원의 개수를 찾아 낸 다음, 다출원 1 dot subgroup IPC 들을 찾아 낸다. 이어, 찾아낸 다출원 1 dot subgroup IPC 각각에 대한 삼성전자의 출원의 개수와 전체 출원의 개수의 비를 구해 다출원 1 dot subgroup IPC 각각에 대한 삼성전자의 점유율을 구한다. 이를 하나의 SQL 쿼리문으로 작성하면 SQL 쿼리문의 길이가 길 뿐만 아니라, 이를 처리하는데도 오랜 시간이 걸린다. 특히, 본 발명 사상에 도입된 하위 특허 분류 기호에 해당하는 문건을 자동적으로 포함시켜 상기 목적하는 바를 구하기 위해서는 상당히 많은 연산을 수행해야 하며, 이는 응답 속도의 심각한 저해를 가져 온다. 한편, 엘지전자에 대해서 동일한 내용을 처리하거나, IPC subgroup 레벨에서 엘지전자에 대한 점유율을 처리하는 것과 같이 다양하게 변형된 요구 사항이 반복적으로 발생할 가능성이 있게 된다. For example, if the share of the IPC 1 dot subgroup per year is calculated by Samsung Electronics Co., Ltd., and it is composed of a simple select statement, it is possible to use various types of tables (which may differ depending on the design of the DB schema, (Eg, applicant table, IPC table, document table, etc.) should be joined. First, multiple application IPCs should be extracted from SEC's IPC 1-dot subgroup level. Next, we extract the number of submissions per year based on all the patent application data of Samsung Electronics of Korea for the 1 dot subgroup IPC extracted, and then apply the 1 dot subgroup IPCs Find the ratio between the number of applications filed by Samsung Electronics and the total number of applications for each 1 dot subgroup IPC filed 1 dot subgroup of the original 1 dot subgroup Samsung Electronics' share of each IPC is calculated by using one SQL query statement, the length of the SQL query is long, and it takes a long time to process it. On the other hand, in the case of processing the same contents for the LG Electronics, or for the IPC subgroups, it is necessary to perform a large number of calculations in order to obtain the above- There is a possibility that repeatedly varying requirements may be encountered, such as handling the share of the LG Electronics at the level.

이러한 경우, 상기 다양하게 변형된 요구 사항에 부합하도록 특허 정보를 사전에 처리하여 분석 목적에 부합하는 다차원 분석을 위한 연산을 수행하고, 그 결과를 저장한 테이블 데이터가 있는 경우, 그 테이블에 질의하면 응답 속도의 획기적 향상을 가져오게 된다. 다차원 연산에는 롤업 연산이나 큐브 연산 등을 포함하고 있으며, 특허 데이터를 대상으로 한 이러한 연산 수행의 결과는 DB의 테이블에 저장된다. 이때, 상기 테이블을 큐브, 물리적인 뷰(materialized view, 다차원 분석용 연산 수행 결과 테이블), 뷰(view) 등의 다양한 이름으로 불릴 수 있으나, 이름과는 독립적으로 이름이 지칭하는 내용은 동일하다. 상기와 같은 테이블에 대하여 SQL 쿼리를 사용하여 목적하는 데이터를 추출한다. 큐브 연산을 수행한 다차원 결과 테이블을 대상으로 목적하는 데이터를 추출하는 것은 통상 MDX (multidimensional expression) 쿼리를 사용하나, 본 명세서에는 설명의 편의상 모두 SQL로 통칭한다. 즉, 다차원 분석을 위한 롤업 연산을 수행한 결과 테이블에 접근하는 것은 SQL 쿼리로 질의하는 것이 일반적이며, 다차원 분석을 위한 큐브 연산을 수행한 결과 테이블에 접근하는 것은 MDX 쿼리로 질의하는 것이 일반적이나, 본 명세서에서는 양자 모두 다차원 분석을 위한 연산 수행 결과 테이블에 접근한다는 본질은 같고, 또한 SQL 과 MDX가 그 질의 형식 등에서 대동소이함으로, 기술(description)의 편의상 SQL로 질의한다고 한다. 즉, 다차원 분석용 연산 수행 결과 테이블에 SQL로 질의한다는 말의 의미는 1) 상기 테이블이 롤업 연산을 수행해 놓은 것일 경우, SQL 쿼리로 질의한다는 것이며, 2) 상기 테이블이 큐브 연산을 수행해 놓은 것이면 MDX 쿼리로 질의한다는 것으로 분설하여 이해하여야 한다. 한편, 상기에서 다차원(dimension)이란 1개 차원 이상의 차원으로 분석을 수행한다는 것을 의미한다.(1개 차원의 분석도 당연히 포함하는 개념이다.) 본 발명의 다차원 분석용 연산 수행 결과 테이블에는 사용자가 목적하는 분석 결과가 논리적으로 n 단계의 정보 처리를 거쳐 생성된다고 할 때, 1 내지 n-1 단계 또는 n 단계까지의 연산 수행 결과를 저장하고 있게 된다. 통상적인 SQL 질의식 또는 DBMS는 연산 기능이 있기 때문에, 상기 다차원 분석용 연산 수행 결과 테이블에는 중간 단계의 연산 수행 결과까지만 생성해 놓고, 최종적인 분석 결과는 DBMS 또는 SQL 질의식을 통하여 생성할 수 있을 것이다. 이하, 같다.In such a case, the patent information is processed in advance so as to meet the various modified requirements, and an operation for multi-dimensional analysis corresponding to the analysis purpose is performed. If table data storing the result is stored, And the response speed is remarkably improved. The multidimensional operation includes a roll-up operation and a cube operation, and the results of such operation performed on the patent data are stored in a DB table. At this time, the table may be referred to as various names such as a cube, a materialized view, and a view, but the contents referred to by the name are the same regardless of the name. The target data is extracted using the SQL query for the above table. In general, multidimensional expression (MDX) queries are used to extract desired data from a multidimensional result table subjected to a cube operation, but they are collectively referred to as SQL for convenience of description. In other words, it is common to query a SQL query to access a table as a result of performing a roll-up operation for multi-dimensional analysis. Accessing a table resulting from performing a cube operation for multidimensional analysis is generally performed using an MDX query, In this specification, both of them have the same nature of accessing the operation result table for multidimensional analysis, and also SQL and MDX query in SQL for convenience of description because they are very similar in the query format. That is, the meaning of querying the operation result table of the multi-dimensional analysis operation result table is as follows: 1) If the table is a roll-up operation, the query is performed by an SQL query; 2) If the table is a cube operation, It should be understood that the query is a query. In the meantime, the term " dimension " means that the analysis is performed on one or more dimensions. (The analysis also includes analysis of one dimension naturally.) In the operation result table for multidimensional analysis of the present invention, When the desired analysis result is logically generated through the n-step information processing, the result of the arithmetic operation from 1 to n-1 or n is stored. Since a conventional SQL query or DBMS has an arithmetic function, it is possible to generate only the result of the intermediate operation in the operation result table for the multi-dimensional analysis, and the final analysis result can be generated through the DBMS or SQL query will be. Hereinafter,

본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 목적하는 정보를 신속하게 출력해 주기 위해, 특허 정보를 롤업 연산 또는 큐브 연산 또는 기타 목적하는 연산 또는 데이터 처리를 수행하여 그 결과를 테이블로 생성해 놓는 엔진을 말한다. 특히, 상기 테이블을 사용하는 본 발명의 분석 모듈은 롤업(roll-up)이나 드릴다운(drill down) 또는 드릴스루(drill through) 등이 발생할 때, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성해 놓은 테 이블에 간략한 형태의 SQL 질의를 함으로써 신속하게 목적하는 데이터를 생성할 수 있게 된다.The multidimensional analysis operation result table generation module 402 of the present invention performs a roll-up operation, a cube operation, or other desired operation or data processing on the patent information in order to quickly output desired information, It is an engine that is created as a table. Particularly, when the roll-up, drill down, drill through, or the like occurs, the analysis module of the present invention using the above-mentioned table is configured to generate the multi-dimensional analysis operation result table generation module 402 ) To generate the desired data quickly by performing a simple SQL query on the generated table.

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 특허 문건 마스트 DB(202), 특허 분류 기호 마스트 DB(203), 주체 마스트 DB(204)를 대상으로 하여, 1개 이상의 분석 주제 카테고리별로 롤업 연산 및/또는 큐브 연산을 수행하여 그 결과를 테이블로 생성해 놓는다. 전술한 바와 같이 상기 특허 문건 마스트 DB(202)에는 서지 사항 마스트 DB 등이 있으며, 특허 분류 기호 마스트 DB(203)에는 특허 분류 기호 체계의 종류별로 된 특허 분류 기호 DB 등이 있으며, 주체 마스트 DB(204)에는 출원인 명칭 DB, 발명자 명칭 DB 등이 있으며, 상기 출원인 명칭 DB는 각 국가별로 대표명화 되어 있는 것이 바람직하다.The multidimensional analysis calculation execution result table generation module 402 performs a rollup process for each of the one or more analysis subject categories with respect to the patent document mast DB 202, the patent classification symbol mast DB 203, and the subject mast DB 204, Performs an operation and / or a cube operation, and creates a table as a result of the operation. As described above, the patent document mast DB 202 includes a bibliographic mast DB, and the patent classification symbol mast DB 203 includes a patent classification symbol DB for each type of patent classification symbol system. 204 includes an applicant's name DB, an inventor's name DB and the like, and the applicant's name DB is preferably represented by each country.

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 특허 문건 마스트 DB(202), 특허 분류 기호 마스트 DB(203), 주체 마스트 DB(204) 등에 포함된 데이터로 적어도 하나 이상의 분석 주제에 대응하여 상기 분석 주제에 적합한 분석 결과를 생성해 주기 위하여 다차원 분석용 연산을 수행하고 그 결과를 테이블로 생성해 둔다. 상기 테이블은 복수개가 있을 수 있으며, 상기 복수개의 테이블들은 분석 DW(data warehouse)를 구성한다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 재료로 사용하는 특허 정보 테이블의 DB 스키마는 스타스키마로 구축되는 것이 바람직하다.The multidimensional analysis calculation execution result table generation module 402 generates the multidimensional analysis calculation execution result table generation module 402 corresponding to at least one analysis subject with data included in the patent document mast DB 202, the patent classification symbol mast DB 203, the subject mast DB 204, In order to generate an analysis result suitable for the analysis subject, a multi-dimensional analysis calculation is performed and the result is generated as a table. There may be a plurality of tables, and the plurality of tables constitute an analysis DW (data warehouse). Preferably, the DB schema of the patent information table used as the material by the multi-dimensional analysis operation result table generation module 402 is constructed with a star schema.

스타스키마로 재구축된 테이블들의 E-R 다이어그램(Entity Relationship Diagram)은 중앙에 FACT 테이블이 있고, 상기 FACT 테이블을 참조하는 기준 정보 테이블로서의 차원 테이블(Dimension Table)이 있다. 상기 FACT 테이블에는 특허 문건에 대한 서지사항 데이터(차원 테이블에 있는 데이터는 제외됨, 다만 차원 테이블의 각 차원의 ID가 있음, 예를 들면 특허 문건에 포함된 특정 IPC(들)를 가리키는 IPC_ID, 특정 출원인(들)을 가리키는 출원인ID 등, 날짜에 대응되는 날짜ID들이 적어도 하나 이상씩 포함되어 있음)가 필드별로 포함되어 있는 것이 바람직하다. 상기 차원 테이블에는 특허 분류 기호(IPC는 필수, 국가별 FACT 테이블일 경우, 각 국가별 특허 분류 기호) 테이블, 연도 등의 날짜 테이블, 출원인/대리인/발명자 등의 주체 테이블, 국가/지역/소속 등과 같은 위치 테이블 등, 차원(dimension)에 속할 수 있는 기타 대상들의 테이블이 있다. 한편, 상기 차원 테이블에는 상기 FACT 테이블에 포함된 각종 ID에 상응하는 ID가 포함되어 있는 것은 당연할 것이다. 예를 들면, IPC 차원 테이블에는 IPC_ID가 있으며, 상기 IPC_ID에는 구체적인 IPC가 1개 대응되어 있다. 상기 스타스키마로 재구축된 테이블들은 기본적으로 국가별로 구축되거나, 국가를 통합하여 구축될 수 있을 것이다. 국가를 통합하여 구축할 경우나 기타의 경우에 패밀리 정보 데이터나 INPADOC/DocDB 데이터 등과 같이 여러 국가가 함께 관련되는 것은 별도의 테이블로 관리되는 것이 바람직하다.The E-R diagram (Entity Relationship Diagram) of the tables reconstructed with the star schema includes a FACT table in the center and a dimension table as a reference information table that refers to the FACT table. The FACT table includes bibliographic data on the patent document (data in the dimension table is excluded, but there is an ID of each dimension of the dimension table, for example IPC_ID indicating the specific IPC (s) included in the patent document) (At least one date ID corresponding to the date, such as the applicant ID indicating the date (s)) is included in each field. The dimension table includes a table of patent classification symbols (IPC is required, a patent classification symbol for each country in the case of a country-specific FACT table), a date table such as a year, a subject table such as an applicant / agent / inventor, a country / There is a table of other objects that can belong to a dimension, such as the same location table. It should be appreciated that the dimension table includes IDs corresponding to the various IDs included in the FACT table. For example, the IPC dimension table has an IPC_ID, and the IPC_ID is associated with one specific IPC. The tables reconstructed with the above star schema can be basically constructed by country or integrated with countries. It is desirable that the countries that are related to each other, such as family information data, INPADOC / DocDB data, etc., should be managed in a separate table.

본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 스타스키마로 재구축된 특허 정보 테이블의 데이터를 재료로 하여 본 발명의 각종 분석 목적을 위한 다차원 분석용 연산 수행 결과 테이블을 생성한다. 한편, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 다른 분석 주제 또는 복잡한 분석 주제에 대한 분석을 위하여 상기 생성해 놓은 다차원 분석용 연산 수행 결과 테이블을 재료로 하여 다른 다차원 분석용 연산 수행 결과 테이블을 생성할 수도 있다. 즉, 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 재료로 하는 테이블은 1) 스타스키마로 재구축된 특허 정보 테이블, 2) 특허 문건 마스트 DB(202), 특허 분류 기호 마스트 DB(203), 주체 마스트 DB(204), 3) 다차원 분석용 연산 수행 결과 테이블 중 어느 하나 이상이다. 한편, 설명의 편의상 스타스키마로 상기 1) ~3)을 재구축된 특허 정보 테이블로 통칭하여 설명하나, 2)와 3)이 배제되는 것이 아님은 당연하다 할 것이다.The multidimensional analysis operation result table generation module 402 of the present invention generates a multidimensional analysis operation result table for various analysis purposes of the present invention using the data of the patent information table reconstructed as a star schema as a material. Meanwhile, the multidimensional analysis calculation execution result table generation module 402 performs a multidimensional analysis calculation result table creation process using the generated multidimensional analysis calculation execution result table as a material for analysis on another analysis subject or a complex analysis subject You can also create a table. That is, the table of the material for the multidimensional analysis calculation execution result table generation module 402 of the present invention includes 1) a patent information table reconstructed as a star schema, 2) a patent document master database 202, A DB 203, a subject mast DB 204, and 3) a multidimensional analysis calculation execution result table. On the other hand, for convenience of explanation, the above-mentioned 1) to 3) are described as a reconstructed patent information table with the star schema, but it is needless to say that 2) and 3) are not excluded.

다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)과 분석 모듈 간의 관계The relationship between the operation-result table generation module 402 and the analysis module for multi-dimensional analysis

본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 분석 DW를 생성하는 분석 DW 생성 모듈(402-1)과 분석 주제별 다차원 분석용 연산 수행 결과를 생성하는 각종 모듈 및 기타 본 명세서의 다차원 분석용 연산 수행 결과를 생성하는데 필요한 기능을 수행하는 모듈이 있다. 상기 분석 주제별 다차원 분석용 연산 수행 결과를 생성하는 각종 모듈에는 총량 분석용 다차원 분석용 연산 수행 결과 테이블 생성 모듈 (402-2), 인용 분석용 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402-3), 경쟁 분석용 다차원 분석용 연산 수행 결과 테이블 생성 모듈 (402-4), 발명자 분석용 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402-5), 특허 기술 분류별 분석용 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402-6), 융합 분석용 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402- 7), 대표 어구 분석용 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402-8) 등이 있다. 상기 각 모듈들은 각각 분석 DW(205 -1), 총량 분석용 다차원 분석용 연산 수행 결과 테이블(205 -2), 인용 분석용 다차원 분석용 연산 수행 결과 테이블(205 -3), 경쟁 분석용 다차원 분석용 연산 수행 결과 테이블(205 -4), 발명자 분석용 다차원 분석용 연산 수행 결과 테이블(205 -5), 특허 기술 분류별 분석용 다차원 분석용 연산 수행 결과 테이블(205 -6), 융합 분석용 다차원 분석용 연산 수행 결과 테이블(205 -7), 대표 어구 분석용 다차원 분석용 연산 수행 결과 테이블(205 -8) 들을 생성한다.The multidimensional analysis operation result table generation module 402 of the present invention includes an analysis DW generation module 402-1 for generating the analysis DW, various modules for generating a multidimensional analysis operation result for each analysis subject, There is a module that performs the functions necessary to generate the result of the operation for multidimensional analysis. Various modules for generating a multidimensional analysis operation result for each of the analysis subjects include a multidimensional analysis operation result table generation module 402-2 for total amount analysis, a multidimensional analysis operation result table generation module 402-3 for citation analysis, An operation result table generation module 402-4 for multidimensional analysis for competition analysis, an operation result table generation module 402-5 for multidimensional analysis for inventor analysis, an operation result table for multidimensional analysis for patent technology classification, Module 402-6, a multi-dimensional analysis operation result table generation module 402-7 for fusion analysis, and a multi-dimensional analysis operation result table generation module 402-8 for representative phrase analysis. Each of the modules includes an analysis DW (205-1) for the total amount analysis, a calculation result table 205-2 for the multidimensional analysis for the total amount analysis, a calculation result table 205-3 for the multidimensional analysis for citation analysis, a multidimensional analysis (205-4), inventor analysis multidimensional analysis operation result table (205-5), patent technology classification analysis multidimensional analysis operation result table (205-6), multidimensional analysis for fusion analysis A calculation execution result table 205 -7 for the representative phrase analysis, and an operation execution result table 205 -8 for the multidimensional analysis for the representative phrase analysis.

이어, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)과 본 발명의 분석 모듈의 관계에 대하여 설명한다. 상기 분석 모듈은 적어도 하나 이상의 분석 주제별에 대응되는 분석식(SQL 쿼리식)을 포함하고 있으며, 상기 SQL 쿼리식으로 상기 다차원 분석용 연산 수행 결과 테이블에 질의하여 분석 주제별로 목적하는 분석 결과를 입수해 내고, 이를 본 발명의 시스템(1)이 제공하는 인터페이스에 맞게 가공 처리하여 본 시스템의 사용자에게 제공해 준다.Next, the relationship between the multi-dimensional analysis operation result table generation module 402 and the analysis module of the present invention will be described. The analysis module includes an analytic expression (SQL query expression) corresponding to at least one analytic subject. The analysis module inquires the operation result table for the multi-dimensional analysis using the SQL query expression, And processes it according to the interface provided by the system 1 of the present invention and provides it to the user of the present system.

한편, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 분석 모듈의 성능(목적하는 데이터 추출/계산/입수에 걸리는 시간 감축 및 전산 자원 소모 절약 등)을 감안하여 기설정된 규모/단계 이상의 분석 목적별로 적어도 하나 이상의 테이블을 생성해 놓을 수도 있다. 즉, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 사용자의 화면에 제공되는 최종 분석 결과 화면 데이터까지 생성해 놓을 수도 있지만, 각 분석 주제별로 일정 단계까지만 생성해 놓고(중간 단계까지의 데이터만 생성해 놓고) SQL 쿼리 자체에서 제공하는 각종 명령어들을 사용하여 상기 최종 분석 결과 화면 데이터를 생성하도록 하는 것이 더욱 바람직하다. 이는 상기 최종 분석 결과 화면의 종류가 많을 경우에는 엄청난 비효율(전산 자원의 낭비)를 초래하기 때문에, 후자가 바람직하나, 전자의 방법도 전산 자원의 최적화를 포기한다면(응답 속도의 극대화를 위한다면) 허용될 수도 있을 것이다.Meanwhile, the multidimensional analysis operation execution result table generation module 402 may perform a multidimensional analysis operation execution result table generation process in consideration of the performance of the analysis module (time reduction for obtaining / calculating / acquiring desired data, At least one table may be created for each analysis purpose. That is, the multidimensional analysis calculation execution result table generation module 402 may generate the final analysis result screen data to be provided on the screen of the user, but it is possible to generate only the predetermined level for each analysis subject It is more preferable to generate the final analysis result screen data by using various commands provided by the SQL query itself. This is because, if the final analysis results in a large number of kinds of screens, the latter causes a great inefficiency (waste of computational resources), but if the former method gives up optimization of computational resources (if it is to maximize the response speed) It may be allowed.

롤업과 드릴 다운Rollups and drilldowns

특허 정보 데이터를 상기 물리적인 뷰 데이터로 가공하기 전에 롤업(roll up)과 드릴다운(drill down)의 개념을 설명한다. 롤업과 드릴다운은 데이터 연산에서 기본적인 것이며, 이해의 편의를 위해 예를 들어 설명한다. 다음 표 24과 같은 2006년 출원 데이터가 있다고 가정하자.The concepts of roll up and drill down are described before processing patent information data into the physical view data. Rollups and drilldowns are fundamental to data operations, and are described as examples for ease of understanding. Assume that there is application data for 2006 as shown in Table 24 below.

[표 24][Table 24]

2006년2006 1월January 2월February 3월In March 4월April 5월In May 6월June 7월In July 8월August 9월September 10월October 11월November 12월December 월별출원수Monthly number of applications 1One 22 33 44 55 66 77 88 99 1010 1One 22

상기와 같은 데이터를 아래와 같이 분기별 및 연도별로 합칠 수 있다. 하기 표 25는 개념적 설명을 위한 것이며, 이러한 롤업에 대한 데이터 구조는 실제의 개발예마다 다를 수 있다.The above data can be combined by quarter and year as follows. Table 25 below is for conceptual explanations, and the data structure for this roll-up may differ from actual development examples.

[표 25][Table 25]

2006년2006 1월January 2월February 3월In March 4월April 5월In May 6월June 7월In July 8월August 9월September 10월October 11월November 12월December 월별출원수Monthly number of applications 1One 22 33 44 55 66 77 88 99 1010 1One 22 분기quarter 1분기First quarter 2분기Q2 3분기Third quarter 4분기Quarter 분기별출원수Quarterly applications 66 1515 2424 1313 연도year 20062006 연도별출원수Number of applications by year 5858

이러한 방식으로 한번에 하나의 차원을 기준으로 한 방향으로만 작은 단위에서 더 큰 단위로 합치는 것을 롤업 연산이라 한다. 즉, 월 단위값을 분기 단위값으로, 분기 단위값을 연도별 단위값으로 합치는 것이 롤업 연산의 한 예가 되며, , 하나의 차원에서 상위 단위로 합쳐진 결과에 접근하는 것/입수하는 것/추출하는 것을 롤업이라 한다.In this way, combining small units to larger units in one direction only at one time is called a roll-up operation. That is, an example of a roll-up operation is to combine the monthly unit value with the branch unit value and the branch unit value with the unit value per year, and to access / obtain / extract the result combined into a higher unit in one dimension Called roll-up.

한편, 2006년의 58건의 데이터는 1분기 6건, 2분기 15건, 3분기 24건, 4분기 13건의 데이터의 합으로 구성된 것이라는 것을 알 수 있으며2006년의 58건을 1분기 6건, 2분기 15건, 3분기 24건, 4분기 13건으로 나눌 수 있고(왜냐하면 원래부터 나누어진 것을 합쳤으므로), 각 분기의 건수를 월별로 나눌 수 있게 된다. 이처럼 하나의 차원에서 더 큰 단위에서 작은 단위로 세분화하여 접근하는 것/입수하는 것/추출하는 것/을 드릴다운이라 한다. 그러므로, 롤업과 드릴다운은 동전의 양면과 같은 것이며, 가장 작은 단위에서 가장 큰 단위까지 데이터를 롤업 연산 등의 다차원 연산을 수행해 놓은 다차원 연산된 데이터가 있다면, 가장 큰 상위 단위에서 하위 단위로 드릴다운하여 데이터를 볼 수도 있게 된다.On the other hand, we can see that 58 data in 2006 is composed of 6 data in 1Q, 15 data in 2Q, 24 data in 3Q and 13 data in 4Q. 15 branches, 24 branches in the third quarter, and 13 branches in the fourth quarter (because the original divisions are combined), and the number of branches in each quarter can be divided by month. This approach, called subdivision of a larger unit to a smaller unit in one dimension, to obtain / extract / is called drilldown. Thus, rollups and drilldowns are like two sides of a coin, and if you have multidimensional computed data that performs multidimensional operations such as rollup operations from the smallest unit to the largest unit, you can drill down So that data can be viewed.

통상적으로 큐브 연산된 다차원 분석용 연산 수행 결과 테이블에 접근하여 목적하는 데이터를 입수하는 것을 OLAP(online analytic process)이라 하며, 통상적인 OLAP 엔진은 드릴 다운과 드릴 스루 등의 기능을 지원한다.Typically, OLAP (online analytic process) is used to access a cube-operated multidimensional analysis operation result table to obtain desired data. A typical OLAP engine supports drill-down and drill-through functions.

롤업과 드릴다운Rollups and drilldowns

하기 표를 보면 IPC 등의 특허 분류 기호 체계상에서 주어진 특허 분류 기호에 대하여 직하위 특허 분류 기호에 관한 정보를 확보하는 것의 중요성을 알 수 있다. 특히, 특허 분류 기호의 표기만으로는 상위 및 하위를 구분할 수 있는 특허 분류 기호(예를 들면 IPC의 경우 1dot subgroup 이하의 특허 분류 기호)에 대하여 직하위 특허 분류 기호 정보 입수의 필요성을 역설하는 본 발명의 하위 특허 분류 기호 자동 포함이라는 정보 처리 방법의 중요성에 대해서 실감할 수 있게 된다.The following table shows the importance of securing information on subclasses of a patent classification symbol given on IPC and other patent classification schemes. In particular, the present invention, which emphasizes the necessity of obtaining the information of the sub-classifications of the patent classification symbols (for example, patent classification codes of 1 sub subgroups in case of IPC) It becomes possible to realize the importance of the information processing method called automatic inclusion of the sub-patent classification symbol.

롤업과 드릴 다운의 설명을 하기 표 26를 기준으로 예시적으로 설명한다. 하기 표33은 H04B 7/02를 중심으로 한 특정 기업 A의 특허 출원 수를 나타낸다. 괄호, () 안의 숫자는 A 기업의 출원 특허 문건 중에서 특정 IPC가 나온 문건의 수이다. 한편, 브라켓, {}안의 숫자는 다차원 연산된 숫자이다. 타이틀 정보는 참고를 위하여 부가되었으며, 롤업과는 무관하나, 드릴다운 시 사용자들이 특허 분류 기호만으로는 그 분류 기호가 의미하는 바를 알기 어려우므로, 타이틀 정보를 제공해 주는 것이 타당하며, 특허 분류 기호의 다단계성(하위 특허 분류 기호가 왜 포함되 어야 하는 지)의 이해의 편의를 위해서 부가하였다. 4 dot 서브그룹은 H04B 7/02 밑에 없지만, 특허 분류 기호의 다단계성을 보여 주는 차원에서 참고로 추가하였다. 하기 표를 제외하고는 4-도트 서브그룹과 타이틀 정보는 없앤 표로서 설명할 것이다.Roll-up and drill-down will be described with reference to Table 26 below. Table 33 below shows the number of patent applications of a particular company A centering on H04B 7/02. Parentheses and numbers in parentheses are the number of documents of a specific IPC from patent applications filed by the A company. On the other hand, brackets, numbers in {} are multidimensional numbers. Title information is added for reference, and it is not related to roll-up. However, since it is difficult for users to know what the classification symbol means when they drill down, it is appropriate to provide title information, and multi- (Why the subpopulation classification should be included). The 4-dot subgroup is not listed under H04B 7/02, but is added as a reference to show the multistage nature of the patent classification symbol. Except for the table below, the 4-dot subgroup and title information will be described as a missing table.

[표 26][Table 26]

메인그룹Main group 1-도트서브그룹1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup 4-도트서브그룹4-dot subgroup 타이틀/IIPC 설명/특허 분류기호 설명Title / IIPC Description / Patent Classification Symbol Description H04B 7/00(2000){2605}H04B 7/00 (2000) {2605} 무선전송시스템Wireless transmission system H04B 7/005(64){64}H04B 7/005 (64) {64} .전송의 제어 ; 등화Control of transmission; Equalization H04B 7/01(4){4}H04B 7/01 (4) {4} .위상편이의 저감Reduction of phase shift H04B 7/015(3){3}H04B 7/015 (3) {3} .반향효과의 저감Reduction of echo effect H04B 7/02(88){207}H04B 7/02 (88) {207} .다이버시티(diversity)시스템. Diversity system H04B 7/04(57){114}H04B 7/04 (57) {114} .. 떨어져 배치된 복수의 독립공중선을 사용한 것Using multiple independent antennae spaced apart H04B 7/06(36){36}H04B 7/06 (36) {36} ...송신국에 있어서의 것... in the sending station H04B 7/08(21){21}H04B 7/08 (21) {21} ...수신국에 있어서의 것... in the receiving station H04B 7/10(3){3}H04B 7/10 (3) {3} .. 편파 또는 방향특성에 의해서 특징지어지는 단일공중선시스템을 사용한 것Using a single aerial system characterized by polarization or directional characteristics H04B 7/12(2){2}H04B 7/12 (2) {2} ..주파수다이버시티 시스템.. Frequency Diversity System

롤업을 이해하기 위하여 H04B 7/02를 보자. H04B 7/02의 다차원 연산된 숫자는 207(=88+114+3+2)인데, 이 중 88은 특허 문건 중에서 H04B 7/02를 포함하고 있는 문건의 수치이며, 114는 H04B 7/04 및 그 하위 특허 분류에서 기인한 것이며, 3과 2는 각각 H04B 7/06 및 H04B 7/08에서 기인한 것이다. 한편, H04B 7/04의 다차 원 연산된 숫자 114(=57+36+21)인데, 이 중 57은 특허 문건 중에서 H04B 7/04를 포함하고 있는 문건의 수치이며, 36과 21은 각각 H04B 7/06과 H04B 7/08에 기인한 것이다. 한편, H04B7/00에 다차원 연산된 문건은 2650인데, 이는 특허 문건 중에서 H04B7/00를 포함하고 있는 문건의 수치와 이 특허 분류 기호의 하부에 있는 다른 특허 분류 기호에서 다차원 연산된 수치이며, 표 26에 포함된 "..."은 H04B7/00의 하부 특허 분류 기호 중에서 상기 표33에서 표시된 것 이외의 다른 것들이 더 있음을 의미한다.See H04B 7/02 to understand the rollup. The multidimensional computed number of H04B 7/02 is 207 (= 88 + 114 + 3 + 2), of which 88 is the value of the document containing H04B 7/02 in the patent documents, 114 is the H04B 7/04 and And 3 and 2 are due to H04B 7/06 and H04B 7/08, respectively. On the other hand, the multidimensional computed number 114 (= 57 + 36 + 21) of H04B 7/04, 57 of which is the number of documents containing H04B 7/04 in the patent documents, and 36 and 21 are H04B 7 / 06 and H04B 7/08. On the other hand, the number of documents that are computed multidimensionally in H04B7 / 00 is 2650, which is the numerical value of the documents containing H04B7 / 00 in the patent documents and the multi- ... means that there are more than those shown in Table 33 above among the sub-patent classification symbols of H04B7 / 00.

만약 H04B7/02에 대한 특허 문건 정보를 가져 올 때(검색, 또는 카운팅 또는 기타 여타의 통계, 분석, 계산 등에서) H04B7/02으로 표기된 정보만을 가져와서는 안됨을 알 수 있다. 왜냐하면, H04B7/02에 대한 특허 정보는 특허 분류 기호 체계의 계층 구조상 H04B7/02 이하에 있는 모든 특허 분류 기호에 대한 특허 정보를 포함해야 함이 더 타당할 것이다.(예를 들면, H04B7/02에 해당하는 기술 영역에 속해 있는 모든 문건을 요청할 경우, H04B7/02가 표기되어 있는 문건 뿐만 아니라, H04B7/02 하위에 있는 모든 특허 분류 기호에 해당되는 문건을 포함하여 출력해 주는 것이 타당하기 때문이다.) 그러므로, 본 발명의 하위 특허 분류 포함 정보 처리(검색, 또는 카운팅 또는 기타 여타의 통계, 분석, 계산 등)가 필요한 이유이다. 상기 표33에서 알 수 있듯이, 만약 특허 문건의 IPC 정보에 H04B 7/08가 있는 경우, 이 문건은 H04B 7/04에도 해당되며, H04B 7/02에도 해당되게 된다. 그러므로, 출원수나 등록수 등의 카운팅의 관점에서 보면, H04B 7/04에 대한 카운팅값은 H04B 7/04 그 자신이 표기된 값과 H04B 7/06가 표기된 값, H04B 7/08이 표기된 값을 합 한 값이 된다.When retrieving patent document information for H04B7 / 02 (search, counting, or any other statistical, analytical, calculation, etc.), it should be noted that it should not contain only the information marked H04B7 / 02. Because patent information on H04B7 / 02 is more appropriate, it should include patent information on all patent classifications below H04B7 / 02 in the hierarchy of the patent classification system (for example, H04B7 / 02 If you request all documents belonging to the relevant technical domain, it is appropriate to include the documents corresponding to all patent classification under H04B7 / 02, as well as the documents marked H04B7 / 02. ) Therefore, the sub-patent classification of the present invention is the reason why information processing (search, or counting or other statistics, analysis, calculation, etc.) is required. As can be seen in Table 33 above, if there is H04B 7/08 in the IPC information of the patent document, this document also applies to H04B 7/04 and H04B 7/02. Therefore, from the point of view of counting applications or registrations, the counting value for H04B 7/04 is the sum of the values marked H04B 7/04 itself, the values H04B 7/06 and H04B 7/08 Lt; / RTI >

상기와 같은 롤업 및 드릴다운 모든 IPC 및 모든 기타 특허 분류 기호의 모든 레벨에 대해서도 적용될 수 있을 것임은 자명하다 할 것이다. 각 IPC별 드릴 다운의 예를 다음과 같은 형식이 그 일례가 될 수 있을 것이다.It will be appreciated that the invention may be applied to all levels of roll-up and drill down all IPCs and all other patent classifications as described above. An example of drill down by each IPC would be the following format:

이어, 하기 표들은 연도별로 드릴 다운하는 경우의 일 실시예를 보여 준다. 바로 위의 표33에 나와 있는 정보를 연도를 기준 방향으로 하여 드릴다운 한다면, 아래 표 27와 같은 데이터가 보여질 수 있을 것이다. 표기의 편의상 다차원 연산된 숫자만 남기며, 데이터가 없는 4-도트 서브그룹도 표시하지 않는다.The following tables show one embodiment of drill down by year. If you drill down to the information in Table 33 directly above the year as a year, you can see the data shown in Table 27 below. For the sake of convenience, only the multidimensional computed numbers are left, and the 4-dot subgroup without data is also not displayed.

[표 27][Table 27]

메인그룹Main group 1-도트서브그룹1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup H04B 7/00(200){2605}H04B 7/00 (200) {2605}

상기에서 H04B 7/00을 하위 특허 분류를 기준 방향으로 하여 드릴다운 한다면, 아래 표 28와 같은 데이터가 보여질 수 있을 것이다.If we drill down to H04B 7/00 with the subpath classification as the reference direction, the data shown in Table 28 below can be seen.

[표 28][Table 28]

메인그룹Main group 1-도트서브그룹1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup H04B 7/00(200){2605}H04B 7/00 (200) {2605} H04B 7/005(64){64}H04B 7/005 (64) {64} H04B 7/01(4){4}H04B 7/01 (4) {4} H04B 7/015(3){3}H04B 7/015 (3) {3} H04B 7/02(88){207}H04B 7/02 (88) {207}

이때, H04B 7/015에 대하여 드릴다운한다면 H04B 7/015에는 하위 특허 분류 기호 정보가 없으므로, 추가적으로 변하는 것은 아무것도 없을 것이다. 이때, H04B 7/02에 대하여 드릴다운한다면 아래와 같은 표 29와 같은 데이터가 보여질 수 있을 것이다.At this time, if you drill down on H04B 7/015, H04B 7/015 will have no sub-patent classification information, so there will be no further changes. At this time, if you drill down on H04B 7/02, you can see the data as shown in Table 29 below.

[표 29][Table 29]

메인그룹Main group 1-도트서브그룹1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup 4-도트서브그룹4-dot subgroup 타이틀title H04B 7/00(7){37}H04B 7/00 (7) {37} 무선전송시스템Wireless transmission system H04B 7/005(64){64}H04B 7/005 (64) {64} .전송의 제어 ; 등화Control of transmission; Equalization H04B 7/01(4){4}H04B 7/01 (4) {4} .위상편이의 저감Reduction of phase shift H04B 7/015(3){3}H04B 7/015 (3) {3} .반향효과의 저감Reduction of echo effect H04B 7/02(88){207}H04B 7/02 (88) {207} .다이버시티(diversity)시스템. Diversity system H04B 7/04(2){9}H04B 7/04 (2) {9} .. 떨어져 배치된 복수의 독립공중선을 사용한 것Using multiple independent antennae spaced apart H04B 7/10(2){2}H04B 7/10 (2) {2} .. 편파 또는 방향특성에 의해서 특징지어지는 단일공중선시스템을 사용한 것Using a single aerial system characterized by polarization or directional characteristics H04B 7/12(3){3}H04B 7/12 (3) {3} ..주파수다이버시티 시스템.. Frequency Diversity System

이어, H04B 7/04를 드릴다운 한다면 아래의 표 30와 같은 데이터가 보여 질 수 있을 것이다.If you drill down on H04B 7/04, you can see the data shown in Table 30 below.

[표 30][Table 30]

메인그룹Main group 1-도트서브그룹1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup H04B 7/00(200){2605}H04B 7/00 (200) {2605} H04B 7/005(64){64}H04B 7/005 (64) {64} H04B 7/01(4){4}H04B 7/01 (4) {4} H04B 7/015(3){3}H04B 7/015 (3) {3} H04B 7/02(88){207}H04B 7/02 (88) {207} H04B 7/04(57){114}H04B 7/04 (57) {114} H04B 7/06(36){36}H04B 7/06 (36) {36} H04B 7/08(21){21}H04B 7/08 (21) {21} H04B 7/10(3){3}H04B 7/10 (3) {3} H04B 7/12(2){2}H04B 7/12 (2) {2}

상기 표 27~표37는 A 기업의 특정 특허 분류 기호에 대한 출원일/출원연도가 롤업되어 있는 결과이다. 즉, 각 다차원 연산된 결과는 과거부터 2000년까지, 2001년, 2002년, 2003년, 2004년, 2005년 및 현재일로부터 1년 6개월 이전이 속하는 연도부터 현재까지 연도까지(약칭으로 "최근"이라 한다.)로 구분 지어 드릴다운 될 수 있을 것이며, 이를 위해 사전에 A 기업의 특허 문건 집합에 대하여 각 연도 단위로 각 특허 분류 기호에 대응되는 문건의 수치값을 롤업해 놓아야 하며, 하기 표 31은 그 일예를 보여 준다.Tables 27 to 37 are the results of roll-up of the filing date / filing year for the specific patent classification symbol of Company A. In other words, the results of each multidimensional computation are from year 1 to year 6 from the past to the year 2000, 2001, 2002, 2003, 2004, 2005 and the present day to the current year (abbreviated as " "). In order to do so, the numerical value of the document corresponding to each patent classification symbol must be rolled up in advance for each set of patent documents of company A, and the following table 31 shows the example.

설명의 편의상 H04B 7/02를 중심으로 연도별 롤업과 드릴다운을 설명한다.For the sake of clarity, we describe yearly rollups and drilldowns around H04B 7/02.

[표 31][Table 31]

메인그룹Main group 1-도트서브그룹1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup 2000이전Before 2000 2001(+)2001 (+) 2002(+)2002 (+) 2003(+)2003 (+) 2004(+)2004 (+) 2005(+)2005 (+) 최근(2006-현재) (+)Recent (2006-present) (+) 합계(+)Total (+) H04B 7/02(88){207}H04B 7/02 (88) {207} 2525 1515 77 3030 6060 6464 66 207207 H04B 7/04(57){114}H04B 7/04 (57) {114} 1010 66 33 1010 4242 3939 44 114114 H04B 7/06(36){36}H04B 7/06 (36) {36} 33 00 00 33 1717 1212 1One 3636 H04B 7/08(21){21}H04B 7/08 (21) {21} 55 33 1One 55 44 33 00 2121 H04B 7/10(3){3}H04B 7/10 (3) {3} 33 00 00 00 00 00 00 33 H04B 7/12(2){2}H04B 7/12 (2) {2} 00 00 1One 00 00 1One 00 22

예를 들어, 상기 2001 옆에 있는 (+) 표시(설명의 편의상 드릴 다운이 가능함을 의미하는 표시로 사용되는 기호를 의미함)에 사용자가 클릭 등의 행위를 하면, 본 시스템(1)은 그 행위를 받아들여, 2001년의 하위 차원(예를 들면 분기(quarter)) 별 데이터를 추출하여 사용자에게 제공해 줄 수 있으며, 이는 하기 표 32에 표현되어 있다.For example, when the user performs a click or the like on the (2001) next to the 2001 (meaning a symbol used as an indication that drilling down is possible for the sake of explanation), the present system 1 (Eg, quarter) data for 2001 and provide it to the user, as shown in Table 32 below.

[표 32][Table 32]

인룹1-도트서브그룹Inroup 1-dot subgroup 2-도트서브그룹2-dot subgroup 3-도트서브그룹3-dot subgroup 2000이전Before 2000 2001(+)2001 (+) 01/1Q01 / 1Q 01/2Q01 / 2Q 01/3Q01 / 3Q 01/4Q01 / 4Q 2002(+)2002 (+) 합계(+)Total (+) H04B 7/02(88){207}H04B 7/02 (88) {207} 2525 1515 1One 22 44 88 77 207207 H04B 7/04(57){114}H04B 7/04 (57) {114} 1010 66 00 00 66 00 33 114114 H04B 7/06(36){36}H04B 7/06 (36) {36} 33 00 00 00 00 00 00 3636 H04B 7/08(21){21}H04B 7/08 (21) {21} 55 33 33 00 00 00 1One 2121 H04B 7/10(3){3}H04B 7/10 (3) {3} 33 00 00 00 00 00 00 33 H04B 7/12(2){2}H04B 7/12 (2) {2} 00 00 00 00 00 00 1One 22

롤업의 기준 방향과 차원Base Direction and Dimension of Rollup

이어, 본 발명에서 사용되는 차원(dimension)에 대해 더욱 상세하게 설명한다. 통상적으로 시간(time)은 일-주-월-분기-연도 등과 같은 다양한 단위로 롤업 또는 드릴다운의 방향축이 되는 차원이 될 수 있다. 한편, 특허 분류 기호 체계 상의 특허 분류 기호도 롤업 또는 드릴다운의 방향축이 되는 차원이 될 수 있다. Next, the dimensions used in the present invention will be described in more detail. Typically, the time may be a dimension that is the directional axis of roll-up or drill-down in various units such as one-week-week-month-quarter-year. On the other hand, the patent classification symbol on the patent classification system can also be a dimension that serves as a directional axis of roll-up or drill-down.

먼저 가장 기본적인 시간 차원으로, 일-주-월-분기-연도-복수 연도 단위 기간에서 선택되는 어느 하나 이상이 될 수 있으며, 출원인, 공개일, 등록일 등과 같은 특허 문건에 포함된 모든 종류의 시간 속성 필드에 상기와 같은 다단계 시간 차원을 적용할 수 있다. First, it is the most basic time dimension. It can be any one or more selected from one-week-week-month-quarter-year-multiple-year unit period and all types of time attributes included in patent documents such as applicant, Field can be applied to the field.

다음으로 특허 분류 기호 차원이 있을 수 있는데, 각 국가별로 적어도 하나 이상의 특허 분류 기호 체계를 도입하고 있는 바, 상기 특허 분류 기호 차원은 특허 분류 기호 체계 자체의 다단계성을 그대로 반영하여 차원을 설정할 수 있다. 한편, 상기 적어도 하나 이상의 선택된 특허 분류 기호들의 묶음을 하나의 단위로 하여, 별도의 차원을 설정할 수도 있다. 예를 들면 RFID라는 큰 주제에 대하여 다단계로 상기 주제에 대응되는 다단계 소주제를 설정하고, 상기 소주제에 상기 특허 분류 기호들의 묶음을 대응시키는 방법으로 별도의 차원을 설정할 수도 있다. 이러한 별도의 차원은 개인화된 다차원 분석용 연산 수행 결과 테이블/개인화된 큐브(personalized cube)-사용자 개개인이 자신의 분석 목적을 달성하기 위하여 만드는 테이블/큐브에 특히 유용하다.Next, there may be a dimension of the patent classification symbol. At least one patent classification symbol system is introduced for each country, and the dimension of the patent classification symbol can be set to reflect the multistage property of the patent classification symbol system itself . Meanwhile, a bundle of the at least one selected patent classification symbol may be set as a unit, and another dimension may be set. For example, a multi-level subchapier corresponding to the subject can be set in a multistage manner on a large topic of RFID, and a separate dimension can be set by a method of associating the bundles of the patent classification symbols with the subchapters. These separate dimensions are particularly useful for personalized multidimensional analytical results-based performance tables / personalized cubes - tables / cubes that each individual user makes to achieve his or her analytical purpose.

지역도 하나의 차원이 될 수 있다. 지역 단위 중에서 특허 문건으로부터 가장 용이하게 입수할 수 있는 단위는 국가이며, 주소 정보로부터 하나의 국가 내에서의 지역별로 분할하여 다단계 계층을 가지는 차원을 생성해 놓을 수 있다.The region can also be a dimension. Among the regional units, the most easily obtainable unit from the patent document is the country, and from the address information, it is possible to create a dimension having a multi-level hierarchy by dividing into regions within one country.

주체도 하나의 차원이 될 수 있다. 주체에는 출원인, 발명자, 대리인 등이 있으며, 출원인의 속성(기업, 연구소, 대학 등의 속성), 규모(대, 중, 소 등)와 같은 양적 속성, 그룹-계열사 등의 출원인 간의 속성 등도 차원이 될 수 있다. 한편, 기업 등의 주가, 매출, 이익율 등과 같은 기업 재무 정보나 기업 금융 정보 등도 차원이 될 수 있다. 한편, 글로벌 기업, 로컬 기업 등과 같이 임의로 각 기업마다 대응시킬 수 있는 메타 속성 또한 차원이 될 수 있다.The subject can also be a dimension. The subject includes applicants, inventors, and agents, and quantitative attributes such as applicant's attributes (enterprise, research institute, university, etc.), size (large, medium, small, etc.), and attributes between applicants such as group- . On the other hand, corporate financial information such as stock prices, sales, and profit rate of companies and corporate financial information can be a dimension. On the other hand, meta-attributes such as global companies and local companies, which can be arbitrarily assigned to each company, can also be a dimension.

한편, 출원, 등록, 거절 등과 같은 문건의 현황에 속성도 차원이 될 수 있다. 한편, 각종 카운팅 값을 그룹화시킬 때, 각 카운팅 값이 속하는 그룹도 차원이 될 수 있다. 이러한 그룹의 예로, 청구항 1~5개, 6~10개, 10개~15개, 15개 이상 등과 같은 것이 그 한 예가 되며, 공동 출원인 수, 공동 발명자 수, 패밀리 수(국내 패밀리 수, 해외 패밀리 수 등) 등도 수 그 자체는 차원이 될 수 없지만, 그 수가 그룹화될 때, 그 수가 속하는 그룹은 차원이 될 수 있다.On the other hand, attributes can also be a dimension to the status of documents such as application, registration, rejection, etc. On the other hand, when grouping various counting values, the group to which each counting value belongs may also be a dimension. Examples of such groups include claims 1 to 5, 6 to 10, 10 to 15, 15 or more, and the number of co-applicants, the number of co-inventors, the number of families (the number of domestic families, Number, etc.) The number itself can not be a dimension, but when the number is grouped, the group to which it belongs can be a dimension.

본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 선택되는 적어도 하나 이상의 차원별로 다차원 연산을 수행하여 다차원 분석용 연산 수행 결과 테이블을 생성해 놓는다. 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기 다차원 분석용 연산 수행 결과 테이블을 생성할 때, 특허 분류 기호가 차원에 포함되어 있을 경우, 주어진 상기 특허 분류 기호에 대한 다차원 연산 값이 생성될 때, 상기 특허 분류 기호 및 상기 특허 분류 기호의 상위 특허 분류 기호를 함께 고려하여 다차원 연산값을 생성하는 것이다. 이렇게 다차원 연산값이 생성되면, 임의의 특허 분류 기호에 대한 다차원 연산값을 입수할 때, 그 특허 분류 기호 및 그의 하위 특허 분류 기호들에 대한 모든 값들이 다차원 연산된 수치값들이 반영되어 생성된 값을 입수하게 된다. 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 주어진 특허 문건에 포함된 특허 분류 기호가 있을 때, 상기 특허 분류 기호 뿐만 아니라, 상기 특허 분류 기호의 상위 특허 분류 기호에도 상기 특허 분류 기호에 대한 다차원 연산을 수행한 값을 반영한다. 예를 들면, 문건 번호 #1에 H04B 7/06이라는 IPC가 부여되어 있을 경우, 이 문건으로 다차원 연산 데이터를 생성할 때, H04B 7/06에도 카운팅 값 1을 부여하고, 상기 H04B 7/06은 직상위인 H04B 7/04, H04B 7/02에도 카운팅값을 부여해야 한다. 물론, H04B 7/00이상의 상위 레벨에도 카운팅값이 부여됨은 상식적일 것이다.The multidimensional analysis operation result table generation module 402 of the present invention generates a multidimensional analysis operation result table by performing a multidimensional operation for each selected at least one dimension. When the multidimensional analysis calculation execution result table generation module 402 of the present invention generates the multidimensional analysis calculation execution result table, if the patent classification symbol is included in the dimension, the multidimensional calculation operation for the given patent classification symbol Value is generated, the multilevel calculation value is generated by considering the patent classification symbol and the upper patent classification symbol of the patent classification symbol together. When the multidimensional operation value is generated, when a multidimensional operation value for an arbitrary patent classification symbol is obtained, all values of the patent classification symbol and its subpublic classification symbols are multiplied by the multidimensional computed numerical values, . The multidimensional analysis operation result table generation module 402 of the present invention generates the multidimensional analysis operation result table generation module 402 in accordance with the patent classification notation And reflects a value obtained by performing a multidimensional operation on the symbol. For example, when document number # 1 is assigned IPC H04B 7/06, when the multidimensional operation data is generated by this document, H04B 7/06 is also given a counting value of 1, and H04B 7/06 Count values must also be assigned to H04B 7/04 and H04B 7/02 immediately above. Of course, it is common sense that counting values are also given to higher levels of H04B 7/00 and higher.

이와 같이 롤업과 드릴다운의 방향 축이 되는 것들이 주로 차원(dimension)이 되는데, 이러한 차원으로는 IPC 등의 특허 분류 기호, 시간 등이 대표적이며, 국가, 지역, 출원인, 발명자, 상태, 인용, 패밀리 정보 등도 차원축이 될 수 있다. 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 선택된 적어도 하나 이상의 각 차원을 롤업의 대상 축으로 하여 사전에 총량, 점유율, 집중율, 활동율 등과 같은 분석 지표값을 계산해 놓는다.As such, dimensions that are the axis of roll-up and drill-down are mainly dimensions. Patent categorization symbols such as IPC, time, etc. are typical examples of such dimensions, and include countries, regions, applicants, inventors, Information can also be a dimension axis. The multidimensional analysis calculation execution result table generation module 402 of the present invention calculates the analysis index values such as the total amount, the occupancy rate, the concentration rate, the activity rate, and the like, using the selected at least one dimension as the target axis of the rollup.

먼저 공간적으로 볼 때, 전체 국가-개별국가-개별국가 내 지역(특허 정보에는 출원인 및/또는 발명자의 국가 및/또는 주소 정보가 포함되어 있다) 예를 들면, 전체 국가-대한민국-서울특별시-강남구...등과 같이 단계별로 세분화할 수 있다. 이는 시간-도-기--...동일한 방법이 될 것이다. In the first place, the whole country - the individual country - the area within the individual country (the patent information includes the country and / or address information of the applicant and / or the inventor) ..., and so on. This will be the same way - time-period-period.

출원인을 세분화 한다면, 출원인(국가 통합)-각 국가별 출원인으로 세분화할 수 있을 것이며, 모든 국가 통합과 각 국가별 사이에 중간 규모의 지역(북미, 아시아, 유럽, 중동, 남미, 아프리카 등)을 도입할 수 있을 것이다. 중간 규모를 도입하기 위해서는 반드시 북미에 해당하는 국가명(예를 들면, 미국, 캐나다, 멕시코 등)을 북미라는 중간 카테고리에 맵핑(mapping)해 놓아야 한다. If the applicant is subdivided, it will be possible to subdivide it into applicant (national integration) - applicant for each country, and it will be possible to integrate all national integration and intermediate regions (North America, Asia, Europe, Middle East, South America, Africa, etc.) . In order to introduce medium scale, it is necessary to map a country name corresponding to North America (for example, USA, Canada, Mexico, etc.) to an intermediate category called North America.

또한, 출원인에서 발명자까지를 롤업/드릴다운해 놓을 수 있는데, 방향은 크게 다음과 같은 것들이 예가 될 수 있다. 1) 출원인(국가 통합)-출원인-각 국가별 발명자로 롤업/드릴다운하거나 2) 출원인(국가통합)-발명자(국가통합)-국가별 발명자로 롤업/드릴다운할 수 있을 것이다. 한편, 하나의 국가 내에서도 출원인의 속성 별로 롤업/드릴다운할 수 있을 것이다. 예를 들면, 출원인을 기업, 대학교, 연구소 등의 조직 유형단위로 나누고, 각각 기업을 대기업, 중견기업, 중소기업으로 나누고, 대학교를 국공립대, 사립대, 전문 기술 대학 등으로 나누고, 연구소를 공공 연구소, 민간 연구소 등으로 세분화시킨 다음 이들에 대해서 롤업/드릴다운할 수 있을 것이다. 이때, 각 출원인에 대하여 맵핑 정보가 있어야 함은 당연할 것이다. 예를 들면, 출원인 A가 기업이고, 그 규모는 중소기업에 속한다는 정보가 맵핑되어 있어야 한다. In addition, the applicant to the inventor can be rolled up / drilled down. The direction may be as follows. 1) Applicant (National Integration) - Applicant - Rollup / Drill down to each country's inventor or 2) Applicant (National Integration) - Inventor (National Integration) On the other hand, it will be possible to roll-up / drill down according to the attribute of the applicant in one country. For example, applicants are divided into organizational type units such as corporations, universities, and research institutes. The companies are divided into large corporations, mid-sized corporations, and small and medium-sized enterprises, and universities are divided into national and public universities, private universities, And then roll-up / drill down on them. At this time, it is natural that there should be mapping information for each applicant. For example, information must be mapped that applicant A is a corporation and its size belongs to a small business.

이때, 기업 등에 대한 재무제표 데이터가 있는 경우, 보다 다양한 롤업/드릴다운을 할 수 있다. 예를 들면 기업 재무제표로 매출, 이익, 주가 및 이들 각각의 증감율 등에 관한 정보가 있다고 했을 때, 각 기업-매출규모-매출규모 범위별, 기업-이익-이익증감율, 또는 기업-주가-주가증감율 범위 등 다양한 카테고리별로 롤업/드릴다운 할 수 있을 것이다. 즉, 롤업/드릴다운의 본질은 하나의 대상에 대한 복수의 카테고리 정보가 있을 때, 그 카테고리별로 정보 처리 결과를 배치하는 것이다. 하나의 특허 문건에 국가, 시간, 출원인, 발명자, 특허 분류 기호 정보가 있고, 출원인 정보에 그 출원인의 지역 정보가 있다. 그리고, 출원인이 기업일 경우 그 기업의 재무제표 정보 등이 있을 기업 정보 DB로 구축되어 있을 수 있다. 이때, 전체 또는 설정된 임의의 특허 문건 집합 단위로 1) 특허 분류 기호 정보와 같이 하나의 선택된 카테고리가 다단계 계층 구조를 지닐 경우, 그 카테고리 내에서 선택된 특허 분류 기호에 대응되는 각종 정보 처리(검색, 또는 카운팅 또는 기타 여타의 통계, 분석, 계산 등)를 하고 그 결과를 출력해 줄 수 있게 되며, 2) 2종 이 상의 카테고리에 대한 다양한 교집합(카테고리의 종류를 n이라 할 때, 가능한 교집합의 수는 nCr,(n combination r, r은 r은 1 이상이나 n보다 작거나 같다.)이나, 이 모든 조합이 다 선택가능하지는 않을 수 있으며, 특별한 분석적 의미를 가지지 조합도 나올 수 있음은 당연할 것이다)을 기준으로 롤업/드릴다운할 수 있을 것이다. At this time, if there is financial statement data for a company, more variety of roll-up / drill-down can be done. For example, if there is information on sales, profit, share price, and each increase / decrease ratio in company financial statements, each company - sales scale - revenue scale, company - profit - profit increase rate, or company - stock price - You can roll-down / drill down into various categories such as scope. That is, the essence of roll-up / drill-down is to arrange the information processing results for each category when there is a plurality of category information for one object. In one patent document, there is country, time, applicant, inventor, patent classification information, and applicant information has local information of the applicant. If the applicant is a corporation, it may be constructed as an enterprise information DB in which the company's financial statement information may be included. At this time, if a selected category has a multi-level hierarchical structure such as 1) patent classification symbol information in whole or set arbitrary set of patent documents, various information processing (search, or the like) corresponding to the patent classification symbol selected in the category Counting, or any other statistic, analysis, calculation, etc.), and output the result. 2) Various intersections of two or more categories. (When the category type is n, the number of possible intersections is nCr, (n combination r, r is at least 1 but less than or equal to n), but not all of these combinations may be selectable, and it is natural that they have special analytical meaning and combinations. Roll-down / drill-down on the basis of "

교집합을 통해 롤업/드릴다운할 경우에는 다음과 같은 점을 주의해야 한다. The following points should be noted when rolling / drilling down through an intersection.

첫째는 2개의 카테고리를 사용할 때이다. 예를 들면 A 카테고리(예를 들면 시간), B카테고리(예를 들면 IPC)를 대상으로 롤업/드릴다운할 경우, 정보 처리(검색, 또는 카운팅 또는 기타 여타의 통계, 분석, 계산 등)의 결과 표현의 편의상 1개 방향(예를 들면 세부 IPC로 시간축은 드릴다운하지 않고, IPC만 펼쳐지거나(이에 대해서는 위의 롤업/드릴다운 개념을 설명할 때 예시하였다)으로 드릴 다운하는 것이 바람직하다. 물론, IPC도 드릴다운하고, 시간축도 드릴다운 할 수 있지만, 이 경우에는 결과가 너무 복잡해 보일 수 있으므로, 특별한 경우가 아니면 지양하는 것이 바람직하다. 물론, 롤업은 양향방 모두에 대하여 수행해 놓아야 한다. 즉, 롤업은 카테고리 A의 기설정된 단위와 카테고리 B의 설정된 단위의 모든 조합으로 구성되는 셀에 대한 정보값을 기초로, 각 셀의 A 카테고리 방향으로도 롤업 정보 처리를 해 놓아야 하며, B 카테고리 방향으로도 롤업 정보 처리를 해 놓아야 한다. 상기 바로 위의 표는 그 일례를 보여 주고 있다. 셀 안의 각 수치는 상기 셀에 대한 정보값에 해당하고, {}안의 숫자는 각 특허 분류 기호 레벨에서의 2000년부터 2005년까지의 연도별 특허 건수를 롤업해 놓은 것에 해당하며, H04B 7/02{17}에서 의 17은 H04B 7/02 및 자신의 하위 특허 분류 기호까지의 특허 문건 수를 롤업해 놓은 값이 된다. 위 표에서는 연도를 기준으로 롤업(예를 들어, 2005년의 모든 특허 문건 수가 몇 개다, H04B 7/02에 2005년도에 몇 개다 등은 표시되어 있지 않지만, 이 또한, 롤업되어 표시 될 수 있다는 것은 당연할 것이며, 2005년을 구성하는 분기별, 월별로 다차원 연산된다는 것도 당연할 것이다.)The first is when you use two categories. For example, if you rolled up / drill down to category A (for example time) and category B (for example IPC), you can see the results of information processing (search or counting or other statistics, analysis, For convenience of presentation, it is desirable to drill down in one direction (for example, in the detail IPC, the time axis is not drilled down, only the IPC is unfolded (this is exemplified when explaining the roll-up / drill down concept above) , IPC can also be drilled down and the time axis can be drilled down, but in this case the result may seem too complex, so it is desirable to avoid it unless it is a special case. , The roll-up can be performed in the A category direction of each cell based on the information value of the cell composed of all the combinations of the predetermined unit of category A and the set unit of category B The table immediately above shows an example. Each value in the cell corresponds to the information value of the cell, and the { Corresponds to rolling up the number of patents per year from 2000 to 2005 at each patent classification level, 17 in H04B 7/02 {17} corresponds to H04B 7/02 and its sub-patent classification The number of patent documents up to the symbol is rolled up. In the table above, roll-up based on the year (for example, how many patent documents are in 2005, how many are in H04B 7/02 in 2005, etc. , But it would also be natural that it could also be rolled up and displayed, and it would of course be multidimensional computed on a quarterly and monthly basis).

둘째는 3개 이상의 카테고리를 사용할 때이다. 이 경우에도 드릴다운은 선택되는 한 방향으로 펼쳐지는 것이 바람직하며, 2 방향으로 펼쳐지는 것도 허용될 수 있으나, 3 방향으로 펼쳐지는 것은 시각적으로 표현이 불가능하거나 극히 어려우므로 지양하는 것이 바람직하다. 이 경우에도 롤업은 모든 카테고리에 대해서 계산되어 있는 것이 바람직하나, 롤업 계산량과 사용 빈도에 따라서 사전에 롤업할 필요가 없는 차원축에 대해서는 롤업해 놓지 않고, 사용이 발생하는 시점에서 처리하는 것도 좋을 것이다. 롤업을 해 놓지 않아도 필요로 하는 정보를 그 발생 시점에서 롤업하여 처리할 수 있으나, 롤업 처리를 위하여 상대적으로 긴 시간이 소모될 수 있다. 예를 들어, 연도별로 롤업 처리를 해 놓지 않은 경우라면, 대상이 되는 모든 문건의 서지 사항에서 연도 정보를 입수하고, 연도별로 분리하여 계산하고, 연도별로 출력해 주면 연도별 드릴다운의 효과를 낼 수 있다. 다만, 이를 사전에 연도별로 분리하여 계산하고, 이를 롤업해 놓으면, 나중에 연도별로 드릴다운할 때, 롤업해 놓은 연도별 정보를 단지 읽어와서 디스플레이 해 주면 되어 응답 속도면에서 유리하다.The second is when using more than three categories. In this case, it is preferable that the drill-down is expanded in one direction to be selected, and it is acceptable to expand in two directions. However, it is preferable to expand in three directions because it is impossible to express visually or extremely difficult. In this case, it is preferable that the roll-up is calculated for all categories, but it is also preferable to process the dimension axis that does not need to be rolled up beforehand according to the roll-up calculation amount and frequency of use, . The necessary information can be rolled up and processed at the time of occurrence even without roll-up, but a relatively long time may be consumed for roll-up processing. For example, if you do not roll-up by year, you can obtain year information from bibliographies of all the documents you are interested in, calculate separately for each year, and print the year by year, . However, it is advantageous in terms of response speed when it is calculated by dividing it in advance by the year and rolling up the roll-up by the year.

총량 분석용 다차원 분석용 연산 수행 결과 테이블데이터Total amount analysis Multidimensional analysis Operation result table data

이하, 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성하는 다차원 분석용 연산 수행 결과 테이블에 대해 더욱 상세하게 설명한다. 아울러, 상기 다차원 분석용 연산 수행 결과 테이블에 상기 분석 모듈이 어떻게 접근하여 어떠한 데이터를 생성하는지에 대해서도 아울러 설명한다.Hereinafter, the multi-dimensional analysis operation result table generated by the multi-dimensional analysis operation result table generation module 402 of the present invention will be described in more detail. In addition, how the analysis module accesses the multidimensional analysis operation result table and what data is generated will also be described.

통상적으로 특허 문건에 표기되는 출원일 표기가 연월일의 표기가 있으므로, 일단위로 각 특허 분류 기호별로 카운팅이 되어 있는 경우, 이를 월단위, 분기단위, 연단위로 다차원 연산을 수행할 수 있을 것이다.In general, since the date of the filing date indicated in the patent document is denoted by the date of the month, if the number is counted by each patent classification symbol, the multi-dimensional operation can be performed on a month unit, a branch unit or a podium.

하기 표 33은 다차원 연산이 수행된 결과 테이블의 일 실시예적 일부분을 보여 준다. 하기 표 33과 같은 데이터는 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기 스타스키마로 재구축된 특허 정보 테이블에서 출원인과 IPC 및 연도별 날짜 정보를 기준으로 사전에 출원인별, 연도별 다단계 IPC의 각 계층별로 특허 문건의 개수를 카운팅하는 방법으로 생성해 놓은 데이터이다. 표 33과 같은 데이터는 테이블/뷰/물리적인 뷰(materialized view) 등의 임의의 형식으로 저장되어 있을 수 있으며, 저장된 테이블에 포함된 데이터는 적절한 SQL 쿼리문을 사용하여 추출할 수 있다.(예를 들면, 출원인 A의 IPC 1 dot 서브그룹 기준(하기에서는 C5 레벨)에서의 다출원 IPC별 (다차원 연산된)연도별 출원수를 가져오는 것이 그 한 예가 된다. 아울러 출원인별 연도별 출원수를 생성하는 것도 당연할 것이 다.(이를 위해 연도별 총합계 데이터가 있으면 더욱 좋다(하기 표 33에서는 공간 제약상 표시하지 않았음, 없더라도 연도별로 합하면 된다.)) 특히, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 IPC 등의 특허 분류 기호 DB를 참조하여, 와일드카드(*, ? 등)등으로 접근할 수 없는 자신의 특허 분류 기호의 하위 특허 분류 기호에 해당하는 정보를 다차원 연산을 수행해 놓아야 한다.(이는 특히 IPC의 경우 subgroup이하, USPC의 경우에는 class 레벨 미만 FT의 경우에도 타이틀 정보에 dot가 있는 레벨에서 더욱 더 중요하다)Table 33 below shows a portion of one embodiment of a result table in which a multidimensional operation is performed. The data as shown in the following Table 33 is generated by the multidimensional analysis calculation execution result table generation module 402 in the patent information table reconstructed by the star schema based on the date of the IPC and the year of the applicant, It is the data created by counting the number of patent documents for each layer of multi-level IPC. Data such as Table 33 can be stored in any format such as table / view / materialized view, and the data contained in the stored table can be extracted using the appropriate SQL query statement. For example, the number of applications for each IPC (multidimensional computed year) for each IPC 1-dot subgroup reference (C5 level in the following) of Applicant A is taken as an example. (It is better to have the grand total data for the year for this purpose.) (In Table 33, the space is not displayed due to space constraints, The generation module 402 refers to the patent classification symbol DB of the IPC and the like and selects the patent classification code corresponding to the sub-patent classification code of the patent classification code that can not be accessed by using a wild card (*,? (This is especially important at the level of the dot in the title information, even for sub-groups for IPC, and for FT below class level for USPC)

[표 33][Table 33]

GC GC C5C5 C6C6 C7C7 C..C .. 0101 0202 0303 0404 0505 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1212 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1212 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 33 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 33 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 2424 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 2424 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 66 33 1010 4242 3939 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 1515 77 3030 6060 6464

상기 테이블을 설명하면 다음과 같다. AppName은 출원인명을, C밑의 숫자는 IPC의 루트(root) 노드로부터의 노드 계층(C1은 섹션, C2는 클래스, C3는 서브클래스, C4는 메인그룹, C5는 1 도트 서브그룹, C6은 2 도트 서브그룹, C7은 3 도트 서브그룹을 말한다. 이하 C8부터 C20까지가 배정되어 있을 수 있다. 통상적으로 C15 까지면 충분하다.)을, 01, 02 등은 연도를 말한다. 연도는 00년 이전에도 있으며, 이에 대한 숫자는 생략하였다. 현재를 07년으로 볼 경우, 현재까지 공개된 문건을 기준으로 한 출원일 기준의 수치값이 각 셀의 데이터값이다. The table will be described as follows. AppName is the name of the application, the number under C is the node layer from the root node of IPC (C1 is the section, C2 is the class, C3 is the subclass, C4 is the main group, C5 is the 1 dot subgroup, 2 dot subgroup, and C7 is a 3-dot subgroup, which may be assigned to C8 through C20, usually up to C15), and 01, 02, and so on. The year is before 00, and the numbers are omitted. If the present data is 07, the numerical value based on the filing date based on the documents published so far is the data value of each cell.

한편, 상기 테이블에서 값을 생성할 때, 합계 필드의 각 셀에 들어가는 값은 각 연도별 값의 합산값을 취하면 되고, 회색 기간은 현재일을 기준으로 1년 6개월(통상적인 출원일 이후 강제 공개 기간임) 이전일이 속한 연도부터 현재일까지를 단위 기간으로 하여 그 기간에 속하는 각 연도별 값의 합산값을 취하면 된다. On the other hand, when generating a value in the table, the value of each cell of the sum field may be calculated by summing up the value of each year, and the gray period may be one year and six months The period from the previous day to the present day as a unit period and take the sum of the values of each year belonging to the period as the unit period.

GID는 어느 레벨까지 다차원 연산된 결과인지를 보여주는데 사용된다. GID 7의 H04B 7/02의 경우는 자기 자신(C5)까지 모두 다차원 연산된 결과(예를 들어 64 = 24+39+0+1)를, 이보다 한단계 낮은 GID 3의 H04B 7/02는 C6레벨까지(자기 자신 제외) 다차원 연산된 결과를 보여 준다. 24는 GID 1에서 나온 값이며, 이 24는 H04B 7/02라는 특허 분류 기호가 표기되어 있는 특허 문건의 숫자를 말한다. GID 값의 표기는 임의 방식일 수 있으나(임의의 약속임) 본 명세서에서는 "2의 n좌승-1"의 수열을 이루는 방식으로 표기했다. The GID is used to show to what level the result of the multidimensional operation. In the case of GID 7 H04B 7/02, the result of multidimensional operation (for example, 64 = 24 + 39 + 0 + 1) (Except oneself) to show the result of multi-dimensional computation. 24 is the value from GID 1, and 24 is the number of the patent document with the patent classification symbol H04B 7/02. The representation of the GID value may be in any manner (arbitrary), but in this specification it is denoted by a sequence of "

GID는 롤업의 단계를 말하는 것으로 C8 레벨까지의 롤업된 것을 GID 0, C7 레벨까지의 롤업된 것을 GID 1, C6 레벨까지의 롤업된 것을 GID 3, C5 레벨까지의 롤업된 것을 GID 7, C4 레벨까지의 롤업된 것을 GID 15, C3 레벨까지의 롤업된 것을 GID31, C2 레벨까지의 롤업된 것을 GID 63라 표시한다. 이때, C8 레벨 이전 및 C1레벨에서도 GID를 할당할 수 있음은 자명하나, 크게 실효성이 없어 시스템(1)에 반영하지 않아도 무방할 것이다. H04B 7/06에 GID 0은 C8 레벨까지 롤업하였다는 것이며, GID 1은 C7레벨까지 롤업하였다는 것이다.(상기 예에서는 C8에 값이 없으므로 C7까지 롤업된 것과 C8까지 롤업된 값은 같다.)GID is the rollup stage. Roll up to C8 level is rolled up to GID 0, C7 level is rolled up to GID 1, Roll up rolled up to C6 level is rolled up to GID 3, Roll up rolled up to C5 level is rolled up to GID 7, Is rolled up to the level GID 15, and the roll up to the C3 level is denoted as GID 31, and the roll up to the level C 2 is denoted as GID 63. At this time, it is obvious that the GID can be allocated even before the C8 level and at the C1 level, but it is not necessary to reflect the GID to the system 1 because it is not very effective. In H04B 7/06, GID 0 rolls up to C8 level and GID 1 rolls up to C7 level. (In the example above, rolled up to C7 and rolled up to C8 are the same because there is no value in C8.)

본 명세서에서 사용하는 GID 및 특허 분류 기호의 레벨과의 대응 관계에 대한 일 실시예이며, 하기 표 34을 중심으로 이에 대해 설명한다.The correspondence relationship between the GID and the level of the patent classification symbol used in this specification is described as one example, and the description will be centered on Table 34 below.

[표 34][Table 34]

GIDGID 대상 범위Target range IPC 레벨IPC level USPC 레벨 USPC level FT 레벨FT level ECLA 레벨ECLA Level 00 C8C8 4 dot4 dot 4 dot4 dot 4 dot4 dot 4 dot4 dot 1One C7C7 3dot3dot 3dot3dot 3dot3dot 3dot3dot 33 C6C6 2dot2dot 2dot2dot 2dot2dot 2dot2dot 77 C5C5 1dot1dot 1dot1dot 1dot1dot 1dot1dot 1515 C4C4 main groupmain group 0dot0dot 00 레벨00 Level main groupmain group 3131 C3C3 subclasssubclass classclass 테마코드Theme code subclasssubclass 6363 C2C2 classclass super categorysuper category super categorysuper category classclass 127127 출원인Applicant 출원인Applicant 출원인Applicant 출원인Applicant 출원인Applicant 255255 전체all 전체all 전체all 전체all 전체all

상기 GID는 그 대응 레벨까지의 롤업한다는 것을 의미하는 코드이다. 예를 들면, GID 15는 C4레벨이며, 이는 IPC로 메인그룹, USPC는 0dot, FT로는 00레벨, ECLA로는 메인그룹까지 롤업(자기 자신 및 자신의 하위 모든 특허 분류까지 롤업)한 정보가 그 행에 기록되어 있음을 의미한다. 그러므로, 어느 레벨에서 정보를 보기를 원하는 경우, 그 레벨에 해당하는 GID를 지정하면, 그 레벨까지의 다차원 연산된 정보를 볼 수 있다. 본 명세서에서는 GID라는 약칭을 사용하나, 이는 의미상 "롤업 레벨 코드"라고 할 수 있다. 이상과 이하에서, 롤업 레벨 코드와 GID는 같은 의미로 사용된다. FT에서 00레벨은 통상 AA 등과 같이 뒤에 숫자가 없고 영문자 2자로 된 레벨을 의미한다. 수퍼카테고리란, USPC나 FT 같은 경우에는 각각 class, 테마코드 레벨의 항목이 너무 많아 이를 카테고리별로 그룹핑한 것을 말한다.The GID is a code that means that it rolls up to its corresponding level. For example, GID 15 is at the C4 level, which means that rollups (roll up to themselves and all subclasses of their own) from the main group to the main group, the USPC to 0dot, the FT to the 00 level, and the ECLA to the main group, As shown in FIG. Therefore, if you want to view information at a certain level, you can specify the GID corresponding to that level, and you can see the multidimensional computed information up to that level. In the present specification, the abbreviation GID is used, but it can be referred to as a "roll-up level code" In the above and below, the rollup level code and the GID are used in the same sense. In the FT, the 00 level usually refers to a level with two alphabetic characters, such as AA, which is not followed by a number. In the case of USPC or FT, the super category means that there are too many items of the class and the theme code level, and these are grouped by category.

상기 표 33에서 알 수 있듯이, 상기 테이블에서 각 행의 각 특허 분류 기호 는 좌측에 자신의 직상위 특허 분류 기호를 갖고 있음을 알 수 있다. 본 발명의 다차원 분석용 연산 수행 결과 생성 모듈은 상기 표 33과 같은 데이터로서 상기 각 IPC 기호마다의 다차원 연산된 수치값을 생성해 낸다. 이러한 것은 특정 출원인 A의 국가별 모든 특허 문건 집합에 대하여 각 특허 문건 집합에 포함되어 있는 모든 특허 분류 기호별로 생성해 놓을 수 있다. 한편, 상기와 표 33과 같은 데이터를 모든 출원 문건을 기준으로 생성해 놓을 수도 있으며, 모든 등록된 문건만을 대상으로 하여 생성해 놓을 수도 있다. As can be seen from Table 33, it can be seen that each patent classification symbol in each row in the table has its own top patent classification symbol on the left side. The multidimensional analysis operation result generation module of the present invention generates multidimensional computed numeric values for each of the IPC symbols as data shown in Table 33. [ This can be created for every patent classification set in each patent document set for a set of all patent documents for a particular applicant A by country. On the other hand, data as shown in Table 33 and Table 33 may be generated based on all application documents, or may be generated based on all registered documents.

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 표 33과 같은 다차원 분석용 연산 수행 결과 테이블은 특정한 기설정된 속성을 지니는 문건 집합별로 생성해 놓을 수 있다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 1) 상기 기설정된 속성을 지니는 문건 집합에 포함되어 있는 특허 문건의 출원번호 등과 같은 특허 문건을 특정 지을 수 있는 키(key, 문건 고유 정보)값을 입수하는 단계, 2) 상기 스타스키마 구조로 재구축된 테이블의 FACT 테이블에서 특허 문건을 특정 지을 수 있는 키값이 있는 키값들만을 추출하는 단계, 3) 상기 추출된 키값들에 해당하는 특허 문건만을 대상으로 적어도 하나 이상의 기설정된 분석 주제별로 다차원 분석용 연산 수행 결과 테이블을 생성하는 단계를 거쳐서 이루어진다. 상기 1) ~2) 단계를 거치면, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 기설정된 처리의 대상이 되는 처리 대상 문건 집합이 생성된다. 상기 FACT 테이블은 모든 문건 전체 집합에 대한 FACT 정보가 포함되어 있다. 상기 FACT 정보에는 문건 고유 정보가 포함되어 있음은 당연하며, 각종 서지 사항 정보가 더 포함되어 있을 수 있다. 상기 문건 고유 정보는 출원번호 또는 문건 고유 코드 또는 문건 일련 코드인 것이 바람직하다.The multidimensional analysis operation result table generation module 402 may generate a multidimensional analysis operation result table as shown in Table 33 for each document set having a predetermined property. The multidimensional analysis operation result table generation module 402 generates a multidimensional analysis operation result table 301 by using a key for identifying a patent document such as an application number of a patent document included in a document set having the predetermined property, Extracting only the key values having the key value that can specify the patent document in the FACT table of the table reconstructed with the star schema structure, 3) extracting the patent document corresponding to the extracted key values And generating a multidimensional analysis operation execution result table according to at least one predetermined analysis subject with respect to only the predetermined analysis subject. After the steps 1) and 2) are performed, the multi-dimensional analysis operation result table generation module 402 generates a target document set to be subjected to predetermined processing. The FACT table includes FACT information for the entire set of documents. The FACT information naturally includes document specific information, and may include various bibliographic information. Preferably, the document specific information is an application number or a document specific code or a document serial code.

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 기설정된 분석 주제별로 국가별 또는 국가 통합별 모든 문건에 대하여 적어도 하나 이상의 기설정된 분석 주제별로 적어도 하나 이상의 상기 다차원 분석용 연산 수행 결과 테이블을 생성하는 것이 바람직하다. 상기 임의의 문건 집합에는 사전에 규정 가능한 적어도 하나 이상의 속성을 공유하는 임의의 문건 집합일 수 있다. 이러한 임의의 문건 집합 종류에는 예시적으로 1) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호로 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 2) 특정 국가 DB에서 특정 출원인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 3) 특정 국가 DB에서 특정 출원인의 특허 문건에 발명자로 포함된 특정 발명자 명의(즉, 출원인 명의 and 발명자 명의)에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 4) 특정 국가 DB에서 특정 대리인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 5) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호 및 특정 출원인 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 6) 특정 국가 DB에서 특정 특허 분류 기호 체계상의 특정 특허 분류 기호, 특정 출원인 명의 및 특정 발명자 명의에 의해 특허 문건 집합을 특정하는 방법에 의해서 생성되는 특허 문건 집합, 7) 상기 특정 국가 DB에서 특정 출원인 명의 및 특정 대리인 명의에 의해 특허 문건 집합을 특정 하는 방법에 의해서 생성되는 특허 문건 집합, 8) 특정 국가의 모든 특허 문건 집합, 9) 적어도 2 이상의 국가의 모든 특허 문건 집합, 10) 상기 1) 내지 9)에 기설정된 기간 단위로 특정되는 특허 문건 집합 등이 그 예가 될 것이다. 한편, 상기 임의의 문건 집합에는 제1국 이외의 해외 국가에 패밀리 정보가 있는 것에 대해서만 생성해 놓을 수도 있고, 미국의 경우 재발행 특허 문건(reissued document)으로만 이루어지는 문건 집합일 수 있다. 본 문단에서 언급하는 문건 집합은 예시적인 것이며, 이러한 예시적인 것들에 대해서, 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 기설정된 분석 주제별로 상기 문건 집합에 대하여 적어도 하나 이상의 기설정된 분석 주제별로 적어도 하나 이상의 상기 다차원 분석용 연산 수행 결과 테이블을 생성하는 것이 바람직하다.The multidimensional analysis calculation execution result table generation module 402 generates at least one multidimensional analysis calculation execution result table for each document by country or country integration according to a predetermined analysis theme by at least one predetermined analysis subject . The arbitrary document set may be any set of documents sharing at least one attribute that can be defined in advance. Examples of such arbitrary document set types include: 1) a collection of patent documents generated by a method of specifying a set of patent documents as a specific patent classification symbol on a specific patent classification system in a specific country DB, 2) (3) the name of a specific inventor (ie, applicant's name and inventor's name) included as an inventor in the patent document of a specific applicant in a specific country DB, 4) a set of patent documents generated by a method of specifying a set of patent documents by a specific agent name in a specific country DB, 5) a set of patent documents generated by a method of specifying a set of patent documents, By specifying a specific patent classification on the patent and by specifying a set of patent documents by the specific applicant's name 6) a set of patent documents generated by a method of specifying a set of patent documents by a specific patent classification mark on the specific patent classification system, a specific applicant's name and a specific inventor's name in a specific country DB, 7) 8) a set of patent documents generated by a method of specifying a set of patent documents by a specific applicant's name and a specific agent's name in a particular country DB, 9) a set of all patent documents in a particular country, 9) 10) An example of a set of patent documents specified in the above-mentioned 1) to 9) in units of a predetermined period. On the other hand, the above-mentioned arbitrary document set may be created only for the family information in the foreign countries other than the first country, or in the case of the United States, a document set consisting only of the reissued document. The document set referred to in this paragraph is exemplary, and with respect to these illustrative examples, the multi-dimensional analysis operation result table generation module 402 of the present invention may include at least one It is preferable that at least one or more of the multidimensional analysis operation execution result tables are generated for each set of analysis topics.

한편, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 사용자가 생성한 임의의 문건 집합에 대해서도 상기 표 33과 같은 다차원 분석용 연산 수행 결과 테이블을 생성할 수 있을 것이다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 1) 상기 사용자가 생성한 임의의 문건 집합에 포함되어 있는 특허 문건의 출원번호 등과 같은 특허 문건을 특정 지을 수 있는 키(key, 문건 고유 정보)값을 입수하는 단계, 2) 상기 스타스키마 구조로 재구축된 테이블의 FACT 테이블에서 특허 문건을 특정 지을 수 있는 키값이 있는 키값들만을 추출하는 단계, 3) 상기 추출된 키값들에 해당하는 특허 문건만을 대상으로 적어도 하나 이상의 기설정된 분석 주제별로 다차원 분석용 연산 수행 결과 테이블을 생성하는 단계를 거쳐서 이루어진다.The multidimensional analysis operation result table generation module 402 may generate a multidimensional analysis operation result table as shown in Table 33 for any document set generated by the user. The multidimensional analysis operation result table generation module 402 generates a multidimensional analysis result table generation module 402 for generating a multidimensional analysis result table based on a key for identifying a patent document such as an application number of a patent document included in an arbitrary document set generated by the user, 2) extracting only key values having a key value that can specify a patent document in the FACT table of the table reconstructed with the star schema structure, 3) extracting key values corresponding to the extracted key values And a step of generating an operation result table for multidimensional analysis for at least one or more predetermined analysis topics only for documents.

총량 데이터의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)의 처리 방법A processing method of an operation execution result table generation module 402 for multidimensional analysis of the total amount data

상기 표와 같은 다차원 분석용 연산 수행 결과 테이블을 생성하기 위하여 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 기설정된/입수된 문건 집합(예를 들면 특정 국가의 출원 또는 등록된 모든 문건들의 집합)에 대하여 다음과 같은 단계를 처리한다.In order to generate a multidimensional analysis operation result table as shown in the above table, the multidimensional analysis operation result table generation module 402 generates the multidimensional analysis result table for generating a multidimensional analysis operation result table (e.g., Lt; / RTI > the following steps are performed.

첫째, 상기 기설정된 속성을 지니는 문건 집합에 포함되어 있는 특허 문건의 출원번호 등과 같은 특허 문건을 특정 지을 수 있는 키(key, 문건 고유 정보)값을 입수하고, 상기 스타스키마 구조로 재구축된 테이블의 FACT 테이블에서 특허 문건을 특정 지을 수 있는 키값이 있는 키값들만을 추출하고, 상기 추출된 키값들에 해당하는 특허 문건만을 대상으로 적어도 하나 이상의 기설정된 분석 주제별 다차원 연산용 기초 테이블용 명령어 집합(상기 명령어 집합은 script일 수 있다.)을 입수한다.(이때, 상기 다차원 연산용 기초 테이블을 대상으로 한 분석 주제별 다차원 연산 실행 명령어 집합도 동시에 입수할 수 있을 수 있다.) 만약 특정 국가의 출원 또는 등록된 모든 문건들로 이루어진 특허 문건 전체 집합일 경우, 다른 과정 없이 상기 적어도 하나 이상의 기설정된 분석 주제별로 다차원 분석용 연산 수행 결과 테이블을 생성하기 위한 분석 주제별 명령어 집합만을 입수한다. 즉, 본 단계에서는 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 기설정된 처리를 하기 위한 처리 대상 문건 집합이 확정되고, 기설정된 처리를 하기 위한 분석 주제 별 명령어 집합이 확정되게 된다.First, a key (key unique information) value that can specify a patent document such as an application number of a patent document included in the document set having the predetermined property is obtained, and a table reconstructed with the star schema structure A key table having a key value for specifying a patent document is extracted from the FACT table of the key table, and at least one set of instructions for a basic table for a multidimensional calculation by a predetermined analysis subject (only the patent documents corresponding to the extracted key values (The command set may be a script.) (At this time, a multidimensional operation execution instruction set for each analysis subject for the multidimensional operation base table may also be obtained at the same time.) If an application or registration The entire set of patent documents consisting of all the documents, Obtain only a set of instructional instructions for the analysis subject to generate the operation result table for the multi-dimensional analysis according to the predetermined analysis topic. That is, in this step, the multidimensional analysis operation execution result table generation module 402 determines a processing target document set for performing the predetermined processing, and the instruction set for each analysis topic for the predetermined processing is determined.

둘째, 상기 처리 대상 문건 집합의 개개의 문건을 대상으로 분석 주제별 명령어 집합을 실행하여 다차원 연산을 위한 기초 테이블을 생성한다. 상기 다차원 연산을 위한 기초 테이블에는 적어도 하나 이상의 차원에 관한 정보를 포함하며, 상기 각 레코드에는 다차원 연산을 수행하기 위한 각 차원별 기초 데이터가 입력되어 있어야 한다. 상기 기초 데이터는 존재 유무(1 또는 null)에 관한 정보(출원/등록이 발생했다, 없었다)에 관한 정보와 각종 카운팅 정보(청구항 수 등 본 명세서에서 카운팅 정보로 언급한 정보들 중 어느 하나 이상)가 포함될 수 있다. 상기 차원은 1) 특허 분류 기호의 종류별 특허 분류 기호 차원 2) 날짜 차원, 3) 위치 차원(국가, 지역 등), 4) 주체의 속성 차원(출원인, 발명자, 대리인 중 선택된 어느 하나 이상에 대한 규모, 속성 등에서 구분되는 차원(예를 들면, 출원인을 기업, 대학, 연구기관, 개인 및 기타로 나누고, 기업을 다국적 기업, 대기업, 중소기업, 등으로 나눌 수 있을 것이다. - 이는 주체 마스트 DB(204)에서 각 출원인의 속성을 참조하여 입수될 수 있다.) 중에서 선택되는 어느 하나 이상 또는 이들 중에서 조합되어 생기는 복합 차원 중 어느 하나 이상이 될 수 있다. Second, a base table for the multidimensional operation is generated by executing an instruction set according to an analysis topic for each document of the target document set. The base table for the multi-dimensional operation includes information on at least one dimension, and basic data for each dimension for performing a multi-dimensional operation is input to each of the records. The basic data includes information on the presence or absence (1 or null) of information (application / registration has occurred, was absent) and various counting information (any one or more of the information referred to herein as counting information, May be included. The dimensions include: 1) a patent classification symbol for each type of patent classification symbol, 2) a date dimension, 3) a location dimension (country, region, etc.), 4) an attribute dimension of a subject It is possible to divide the applicant into companies, universities, research institutes, individuals and others, and divide the enterprise into multinational corporations, large corporations, small and medium enterprises, (Refer to the attributes of each applicant in the above), or a composite dimension resulting from combination of any of the above.

이때, 각 차원에 필요한 서지 사항을 입력하기 위하여 상기 다차원 연산을 위한 기초 테이블에는 필드가 설계되어 있어야 한다. 필드에는 특허 분류 기호 차원에는 IPC일 경우, 섹션부터 n dot 서브그룹, USPC일 경우 (수퍼카테고리부터)class부터 n dot 레벨까지 등과 같이 특허 분류 기호 체계 상의 모든 상위 및 하위 특허 분류 기호를 수용할 수 있도록 각 레벨별 필드가 있어야 한다. 한편, 시 간 차원을 위해서는 분석을 위한 가장 작은 단위의 필드가 있으면 좋다. 특허 분석의 경우, 일 또는 주 단위 분석까지 필요없는 경우라면 월 필드까지만 있으면 충분할 것이다. 한편, 출원인 속성 차원이 있다면, 기업(다국적 기업, 대기업, 중소기업), 대학, 연구원, 개인, 기타에 해당하는 필드가 있어야 할 것이다.At this time, a field must be designed in the base table for the multidimensional operation in order to input a bibliography necessary for each dimension. The field contains a patent classification symbol dimension that can accommodate all upper and lower patent classification symbols in the patent classification scheme, such as from IPT to n dot subgroups, from USPC (from super category) to n dot levels There should be a field for each level. On the other hand, for the time dimension, it is sufficient to have the smallest unit field for analysis. In the case of a patent analysis, it would suffice for a month field only if a day or week analysis is not required. On the other hand, if there is an applicant attribute dimension, there should be fields that correspond to companies (multinational corporations, large corporations, small and medium enterprises), universities, researchers, individuals,

예시로 상기 다차원 연산을 위한 기초 테이블에, 출원인을 기준으로 특허 분류 기호 차원과 출원일 차원만이 포함되어도 충분할 경우, 출원인 A의 2005년 1월 3일에 출원하고, IPC가 IPC가 H04B 7/06인 문건에 대해 하기 표 35와 같은 데이터가 생성되면 된다.For example, if it is sufficient that only the patent classification symbol dimension and the filing date dimension based on the applicant is sufficient in the basis table for the above-mentioned multi-dimensional operation, the applicant A filed on January 3, 2005, and the IPC has applied IPC to H04B 7/06 The data as shown in Table 35 below should be generated for the document.

[표 35][Table 35]

국C1Station C1 C2C2 C3C3 C4C4 C5C5 C6C6 C7C7 05/0105/01 05/0205/02 05/0305/03 HH H04H04 H04BH04B H04B 7/00H04B 7/00 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 1One

한편, 기간과 출원인 속성에 다음과 같이 입력되어도 무방할 것이다. 하기 표 36과 같은 데이터는 연도 및 출원인 속성에서 부분적으로 다차원 연산을 수행해 놓은 것으로 후술하는 다차원 연산 시에 처리될 수도 있지만, 사전에 생성해 놓아도 문제될 것은 없다.On the other hand, the period and applicant attributes may be entered as follows. The data as shown in Table 36 below is a partial multidimensional operation performed in the year and the applicant attributes, and may be processed in the multidimensional operation described later, but it is not a problem even if it is created in advance.

[표 36][Table 36]

C1C1 C2C2 C3C3 C4C4 C5C5 C6C6 C7C7 0505 05/0105/01 05/0205/02 05/0305/03 HH H04H04 H04BH04B H04B 7/00H04B 7/00 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 1One 1One

한편, 출원인의 위치나 출원인의 재무 레벨 등의 기업 평가, 패밀리 수의 범위를 기준으로 한 패밀리 수 차원, 청구항 수 범위를 기준으로 한 청구항 수 차원 등이 더 포함되어 있을 수 있는데, 이는 하기 표 37와 같이 추가적인 데이터가 포함되어 있으면 된다.On the other hand, there may be further included corporate evaluations such as the applicant's position or applicant's financial level, the number of families based on the number of families, and the number of claims based on the number of claims. As well as additional data.

[표 37][Table 37]

국가country StateState CityCity 패밀리 수 범주 1Number of families Category 1 패밀리 수 범주 2Number of families Category 2 기업 평가 n-1Company Ratings n-1 기업 평가 n-2Company Ratings n-2 청구항수 범위 1Claim range 1 청구항 수범위 2Claim range 2 KrKr 경기도Gyeonggi-do 수원시Suwon 1One 1One 1One

상기와 같은 데이터를 생성하는 방법은 다음과 같다. 1) 문건에 포함된 특허 분류 기호를 특허 분류 기호 마스트 DB(203)에 조회하여, 상기 특허 분류 기호의 모든 상위 특허 분류 기호를 입수하고, 상기 입수된 모든 상위 특허 분류 기호를 각각의 레벨에 맞게 입력한다. 예를 들면 IPC가 H04B 7/06일 때는 3 dot 서브그룹(C7) 레벨에는 H04B 7/06, 2 dot 서브그룹(C6) 레벨에는 H04B 7/04, 1 dot 서브그룹(C5)레벨에는 H04B 7/02, 메인그룹(C4)레벨에는 H04B 7/00, 서브클래스(C3)레벨에는 H04B을, 클래스(C2) 레벨에는 H04를, 섹션(C1) 레벨에는 H를 입력한다. 2) 출원일/등록인 등의 각종 날짜 정보 중에서 필요한 기준날짜(여기서는 출원일) 기준으로 시간 차원에 필요한 값을 넣는다. 만약 출원일이 기준 날짜이고, 2005. 1월 3일이고, 시간 차원이 월, 분기, 년의 차원으로 나뉘어 있다면, 2005년에 1월에 1을 입력한다. 3) 기타 서지 사항을 기타 차원에 입력한다. 그리고, 대기업에 1을 입력한다. 주소 정보를 이용하여 출원인의 위치와 같은 필드값을 채우고, 특허 문건 마스트 DB(202)를 조회하여 대상이 되는 특허 문건의 패밀리 수를 조회하고, 그 조회값이 어느 범주에 들어가는지를 판단하여 그 범주에 1을 기록한다. 청구항 수 범위도 패밀리 수와 마찬가지로 처리한다.A method of generating the above data is as follows. 1) The patent classification symbol included in the document is referred to the patent classification code mast DB 203 to obtain all the upper patent classification codes of the patent classification codes, . For example, when IPC is H04B 7/06, H04B 7/06 is assigned to the 3-dot subgroup (C7) level, H04B 7/04 is assigned to the 2-dot subgroup (C6) / 02, H04B 7/00 in the main group C4 level, H04B in the subclass C3 level, H04 in the class C2 level, and H in the section C1 level. 2) Enter the required value in the time dimension based on the required reference date (here, filing date) from various date information such as filing date / registrant. If the filing date is the base date, January 3, 2005, and the time dimension is divided into dimensions of month, quarter, and year, enter 1 in January of 2005. 3) Enter other bibliographies in the other dimension. Then, enter 1 into a large corporation. The field value such as the applicant's position is filled using the address information, the patent document master DB 202 is inquired to inquire the number of the family of the patent document to be a subject, the category to which the inquiry value falls, &Lt; / RTI > The claims range is treated in the same way as the number of families.

하나의 문건에 특허 분류 기호가 2가지 종류 이상이 있을 경우(예를 들면 IPC와 USPC), 각 종류별로 상기 다차원 연산을 위한 기초 테이블을 생성해 놓을 수도 있으며, 하나의 테이블 내에 2가지 종류 이상의 특허 분류 기호를 처리해 놓을 수도 있을 것이다. 한편, 동일한 종류의 특허 분류 기호가 복수개가 문건에 포함되어 있을 경우, 개개의 특허 분류 기호에 대하여 독립적인 레코드를 생성하는 것이 바람직할 것이다. 한편, 만약 출원인이 2 이상의 공동출원인 경우에는 각 출원인별로 동일한 정보를 가지는 레코드를 출원인의 수만큼 생성한다.(만약 주체가 발명자라면 발명자별로 레코드를 생성하고, 대리인 분석을 위한 경우, 주체가 대리인이라면 대리인별로 레코드를 생성한다.)When there are two or more kinds of patent classification symbols in one document (for example, IPC and USPC), a base table for the above-mentioned multi-dimensional calculation may be generated for each type, and two types or more of patents You may also want to process the classifier. On the other hand, when a plurality of patent classification codes of the same kind are included in the document, it is desirable to generate an independent record for each patent classification code. On the other hand, if the applicant is two or more co-applicants, a record having the same information for each applicant is generated for the number of applicants (if the subject is an inventor, a record is generated for each inventor, and for agent analysis, Create records for each agent.)

셋째, 상기 다차원 연산을 위한 기초 테이블을 대상으로 다차원 연산을 수행하고, 다차원 분석용 연산 수행 결과 테이블을 생성한다. 하기 표 38와 같은 테이블의 데이터는 IPC 차원을 기준으로 시간 차원은 연도 차원까지 롤업 연산을 수행한 결과를 보여 주고 있다.(표기의 편의 상 04년도에 대한 것만 분기-연도로 롤업 연산됨을 표시한다.)Third, a multidimensional operation is performed on the basis table for the multidimensional operation, and a multidimensional analysis operation execution result table is generated. The data of the table as shown in Table 38 below shows the result of performing the roll-up operation from the IPC dimension to the time dimension up to the year dimension. (For the sake of convenience, it is shown that only the year 2004 is rolled up to the quarter- .)

[표 38][Table 38]

C5C5 C6C6 C7C7 C..C .. 0101 0202 0303 0404 04/1Q04 / 1Q 04/2Q04 / 2Q 04/3Q04 / 3Q 04/4Q04 / 4Q H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1One 22 33 1111 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1One 22 33 1111 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 1One 22 1One 00 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 1One 22 1One 00 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 44 44 44 99 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 44 44 44 99 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 66 33 1010 4242 66 88 88 2020 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 H04B 7/02H04B 7/02 99 33 2020 1818 66 66 66 00 H04B 7/02H04B 7/02 99 33 2020 1818 66 66 66 00 H04B 7/02H04B 7/02 99 33 2020 1818 66 66 66 00 H04B 7/02H04B 7/02 1515 77 3030 6060 1212 1414 1414 2020

상기 다차원 연산을 수행하는 방법은 롤업 연산을 수행하는 방법과 큐브 연산을 수행하는 방법으로 나뉜다. 차원인 IPC, 날짜, 출원인 기준으로 롤업 연산과 큐브 연산에 대해 각각 설명한다. IPC, 날짜, 출원인 차원이 있을 때, 롤업 연산은 상기 3가지 차원 중 선택된 한 방향으로만 진행되는 반면, 큐브 연산은 3P3(3가지 중 3가지를 선택했을 때 발생하는 순열의 수, 여기서는 6) 방향으로 진행된다. 즉, IPC 차원을 1, 날짜 차원을 2, 출원인 차원은 3이라 했을 때, 롤업 연산은 1->2->3과 같은 한 방향으로만 진행된 것이나, 큐브 연산은 1->2->3뿐만 아니라, 1->3->2, 2->1->3, 2->3->1, 3->1->2, 3->2->1 등 6가지 방향으로 모두 또는 선택된 1이상의 방향으로 진행될 수 있다. 그러므로, 롤업 연산을 할 경우, 1->2->3까지 연산한 결과가 필요없는 경우(최종적으로 출원인의 종류 단위로된 롤업 연산된 결과값이 생성됨), 즉 IPC만 롤업 연산된 결과가 필요하거나, 상기 표 38와 같이 IPC와 날짜 차원까지만 롤업된 연산 결과가 필요할 경우 선택되는 차원 중 어느 하나 이상에 대해서만 목적하는 롤업 방향을 지정하여 롤업 연산을 수행하면 된다. The multidimensional operation is divided into a roll-up operation and a cube operation. The roll-up operation and the cube operation are described based on the IPC, the date, and the applicant, respectively. When the IPC, date, and applicant dimensions are present, the rollup operation proceeds only in one of the three dimensions, while the cube operation is 3P3 (the number of permutations generated when three of the three are selected, here 6) Lt; / RTI > That is, when the IPC dimension is 1, the date dimension is 2, and the applicant dimension is 3, the roll-up operation proceeds only in one direction such as 1-> 2-> 3, But not in all six directions, such as 1-> 3-> 2, 2-> 1-> 3, 2-> 3-> 1, 3-> 1-> 2, 3-> 2-> Or more. Therefore, when the roll-up operation is performed, if the result of the operation up to 1-> 2-> 3 is not needed (the final result of the rolled up operation in the applicant's type unit is generated) Alternatively, the roll-up operation may be performed by designating a desired roll-up direction only for at least one of dimensions selected when IPC and date dimensions are rolled up to the date dimension as shown in Table 38. [

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 주어지는 특허 문건 집합에서, 출원일 정보와 특허 분류 기호 정보를 입수한 다음, 특허 분류 기호 체계가 데이터화 되어 있는 특허 분류 기호 마스트 DB(203) 또는 스타스키마 구조로 재구축된 테이블을 참조하여 상기 입수한 특허 분류 기호의 상위 특허 분류 기호를 추출해 낸다. 상기 추출된 상위 특허 분류 기호 및 출원일의 연도 정보로 상기 표와 같은 정보가 포함되어 있는 다차원 분석용 연산 수행 결과 테이블의 데이터를 완성해 낸다. 이때, 하나의 특허 문건에 특허 분류 기호가 2종류 이상이 있는 경우(예를 들면 미국의 경우, IPC와 USPC가 있음) 각 특허 분류 기호의 종류별로 독립적으로 처리하면 된다. 그리고, 하나의 특허 문건에 단일 종류의 2개 이상의 특허 분류 기호가 들어 있는 경우, 1) 맨 앞의 메인 특허 분류 기호에 대해서만 처리하거나, 2) 모든 특허 분류 기호에 대해서 처리하거나, 3) 메인 특허 분류 기호와 서브 특허 분류 기호에 대한 가중치가 있는 경우, 그 가중치를 반영하여 처리할 수 있다. 이때, 3)의 경우, 수치에 소수점이 나올 수 있으며, 이 경우, 반올림을 하는 것이 타당한 표기 방법일 것이다. 이때, 상기 1), 2) 및 3) 중에서 어느 정책을 취할 지는 선택의 문제이다. 한편, 상기 1), 2) 및 3) 이외에도 다른 정책이 가능할 수도 있음은 물론이다 하겠다. 상기 2)의 방법을 취할 경우, 상기 다차원 분석용 연산 수행 결과 테이블의 라인 수가 늘어나게 되는데, 이는 각 특허 분류 기호마다 적어도 하나 이상의 새로운 데이터 값이 생성되기 때문이다(즉, 서브 IPC에 대해서도 롤업 연산을 해 놓아야 하므로). 한편, 상기 3)의 방법을 취할 경우 수치값이 소수점이 나올 수 있게 된다.The multidimensional analysis calculation execution result table generation module 402 obtains the filing date information and the patent classification symbol information from the given set of patent documents and then obtains the patent classification symbol mast DB 203 or star And extracts the upper patent classification symbol of the obtained patent classification symbol with reference to the table reconstructed by the schema structure. The data of the multidimensional analysis operation execution result table including information such as the above table is extracted from the extracted upper patent classification symbol and year information of the filing date. At this time, if there are two or more kinds of patent classification symbols in one patent document (for example, in the case of the United States, there are IPC and USPC), the classification of each patent classification may be independently performed. If a patent document contains two or more patent classification codes of a single kind, it may be necessary to 1) process only the first main patent classification code, 2) process all patent classification codes, or 3) If weights for classifiers and subpublic classifiers are present, they can be processed to reflect the weights. In this case, in the case of 3), a decimal point may appear in the numerical value, and in this case, rounding is a valid notation. At this time, it is a matter of choice to take any one of the above 1), 2) and 3). It should be noted that other policies other than the above 1), 2) and 3) may be possible. When the method 2) is adopted, the number of lines of the operation result table for multidimensional analysis increases, because at least one new data value is generated for each patent classification symbol (that is, for the sub IPC, It should be done). On the other hand, when the method 3) is adopted, the numerical value can be expressed as a decimal point.

상기 테이블은 특허 지표 중 출원수 또는 등록수와 같은 수량 기준의 지표이다. 본 발명의 하위 특허 분류 기호 포함 롤업/드릴다운과 같은 데이터 처리는 점유율, 집중율, 활동율(AI, Activity Index) 등과 같은 각종 특허 정보 분석 지표에도 활용될 수 있을 것이다.The table is an indicator of the number of patents, such as the number of applications or the number of registrations. The data processing such as roll-up / drill-down including the sub-patent classification symbol of the present invention may be utilized in various patent information analysis indexes such as occupancy rate, concentration rate, activity index (AI)

분석 주제별 다차원 연산용 기초 테이블 없이 다차원 분석용 연산 수행 결과 테이블 생성Perform multidimensional analysis operations without a base table for multidimensional operations by analysis topic.

상기에서는 분석 주제별 다차원 연산용 기초 테이블을 사용하여 다차원 분석용 연산 수행 결과 테이블을 생성하는 것에 대해 설명하였다. 이때, 상기 분석 주제별 다차원 연산용 기초 테이블은 필수적인 것이 아니며, 스타스키마 구조로 재구축된 테이블이 있는 경우, 본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 다차원 분석용 연산 수행 결과 테이블을 생성할 수도 있다. 상기 분석 주제별 다차원 연산용 기초 테이블은 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)의 계산량/정보 처리량을 줄이는 역할을 하기 때문이며, 하나의 상기 분석 주제별 다차원 연산용 기초 테이블이 다수의 테이블 생성에 사용될 수 있기 때문이다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 분석 주제별 다차원 연산용 기초 테이블이 없는 경우에도 상기 스타스키마 구조로 재추축된 테이블로부터 필요한 정보를 입수하고, 이를 가공 처리하여 다차원 연산을 수행할 수 있게 된다.In the above description, generation of a multidimensional analysis operation execution result table using the basis table for multidimensional operation by the analysis subject has been described. The multidimensional analysis operation result table generation module 402 of the present invention generates the multidimensional analysis result table generation module 402 for the multidimensional analysis operation result You can also create a table. The base table for multi-dimensional arithmetic operations according to the analysis subject functions to reduce the amount of computation / information processing of the arithmetic operation result table generation module 402 for multi-dimensional arithmetic operations. One of the base tables for multi- As shown in FIG. The multidimensional analysis operation execution result table generation module 402 obtains necessary information from a table re-aggregated into the star schema structure even when there is no base table for multidimensional operation by the analysis subject, processes the multidimensional analysis result table, .

스타스키마 구조로 재구축된 테이블 없이 다차원 분석용 연산 수행 결과 테이블을 생성Performs multidimensional analysis operations without table reconstructed with star schema structure Generates result table

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 스타스키마 구조로 재구축된 테이블이 아닌 상기 특허 문건 마스트 DB(202), 특허 분류 기호 마스트 DB(203) 및/또는 주체 마스트 DB(204)로부터 필요한 데이터를 읽어와 상기 분석 주제별 다차원 연산용 기초 테이블을 생성할 수도 있다. 이경우, 상기 분석 주제별 다차원 연산용 기초 테이블을 생성하는 프로그램(스크립트, script 등)이 상대적으로 조금 복잡할 수 있을 것이며, 그 프로그램 전체 또는 그 프로그램을 구성하는 각 모듈의 재활용성이 상대적으로 떨어질 수는 있을 것이다. 예를 들면, 상기 분석 주제별 다차원 연산용 기초 테이블에는 출원번호별로의 자신 및 자신의 상위 모든 IPC들에 대한 정보가 저장되어 있는데, 이러한 상기 정보들을 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 출원번호에 포함된 IPC를 찾고, 상기 찾은 IPC를 상기 특허 분류 기호 마스트 DB(203) 또는 기타 총상위 특허 분류 기호 테이블 등을 조회하여, 상기 찾은 IPC의 상위 모든 IPC들에 대한 정보를 입수해 온 다음, 이를 토대로 상기 분석 주제별 다차원 연산용 기초 테이블을 생성할 수 있을 것이다.The multidimensional analysis operation result table generation module 402 generates the multidimensional analysis operation result table for generating the multidimensional analysis operation result table 402 by using the patent document mast DB 202, the patent classification symbol mast DB 203 and / or the subject mast DB 204 ) And generate a basis table for the multidimensional operation by the analysis subject. In this case, the program (script, script, etc.) for generating the basic table for the multi-dimensional calculation according to the analysis subject may be relatively complicated and the recyclability of the entire program or each module constituting the program may be relatively low There will be. For example, in the base table for the multi-dimensional calculation according to the analysis subject, information on the IPC itself and all IPCs of the IPC itself is stored for each application number. The information is stored in the multi- (402) finds the IPC included in the application number, inquires the IPC found in the patent classification code mast DB (203) or other top patent classification code table and the like, The base table for the multi-dimensional calculation according to the analysis subject can be generated based on the obtained information.

특허 문건 마스트 DB(202)로부터 분석 주제별 다차원 분석용 연산 수행 결과 테이블 생성Patent document Creation of operation result table for multidimensional analysis by analytic subject from mast DB (202)

상기 다차원 분석용 연산 수행 결과 테이블을 생성하기 위하여 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 기설정된/입수된 문건 집합(예를 들면 특정 국가의 출원 또는 등록된 모든 문건들의 집합)에 대하여 다음과 같은 단계를 처리한다.In order to generate the multidimensional analysis operation result table, the multidimensional analysis operation result table generation module 402 may generate the multidimensional analysis operation result table by using the predetermined / acquired document set (for example, a set of all documents registered or registered in a specific country) The following steps are performed.

첫째, 상기 기설정된 속성을 지니는 문건 집합에 포함되어 있는 특허 문건의 출원번호 등과 같은 특허 문건을 특정 지을 수 있는 키(key, 문건 고유 정보)값을 입수하고 이를 저장한다. 한편, 상기 표 38와 같은 상기 분석 주제별 다차원 분석용 연산 수행 결과 테이블을 생성하기 위하여, 재료가 되는 표 35 내지 표 37의 데이터 중 적어도 하나 이상의 데이터를 개별 특허 문건 마다를 대상으로 상기 특허 문건 마스트 DB(202), 특허 분류 기호 마스트 DB(203) 및/또는 주체 마스트 DB(204)에서 입수하여 생성하고, 생성된 테이터를 통합하여 이를 메모리에 저장한다. 상기 메모리에 저장된 데이터들은 상기 다차원 연산을 위한 기초 테이블에 저장된 데이터와 동등한 데이터일 수 있다. 즉, 상기 메모리에 저장된 데이터들은 적어도 하나 이상의 차원에 관한 정보를 포함하며, 상기 각 레코드에는 다차원 연산을 수행하기 위한 각 차원별 기초 데이터가 입력되어 있어야 한다. 상기 기초 데이터는 존재 유무(1 또는 null)에 관한 정보(출원/등록이 발생했다, 없었다)에 관한 정보와 각종 카운팅 정보(청구항 수 등 본 명세서에서 카운팅 정보로 언급한 정보들 중 어느 하나 이상)가 포함될 수 있다. 상기 차원은 1) 특허 분류 기호의 종류별 특허 분류 기호 차원 2) 날짜 차원, 3) 위치 차원(국가, 지역 등), 4) 주체의 속성 차원(출원인, 발명자, 대리인 중 선택된 어느 하나 이상에 대한 규모, 속성 등에서 구분되는 차원(예를 들면, 출원인을 기업, 대학, 연구기관, 개인 및 기타로 나누고, 기업을 다국적 기업, 대기업, 중소기업, 등으로 나눌 수 있을 것이다. - 이는 주체 마스트 DB(204)에서 각 출원인의 속성을 참조하여 입수될 수 있다.) 중에서 선택되는 어느 하나 이상 또는 이들 중에서 조합되어 생기는 복합 차원 중 어느 하나 이상이 될 수 있다. First, a key (document unique information) value for specifying a patent document such as an application number of a patent document included in the document set having the predetermined attribute is obtained and stored. Meanwhile, in order to generate the multidimensional analysis operation result table for the analysis subject as shown in Table 38, at least one data among the data of Tables 35 to 37 as a material is stored in the patent document mast DB (202), the patent classification symbol mast DB (203) and / or the subject mast DB (204), integrates the generated data, and stores it in the memory. The data stored in the memory may be data equivalent to the data stored in the base table for the multi-dimensional operation. That is, the data stored in the memory includes information on at least one dimension, and basic data for each dimension for performing a multidimensional operation is input to each of the records. The basic data includes information on the presence or absence (1 or null) of information (application / registration has occurred, was absent) and various counting information (any one or more of the information referred to herein as counting information, May be included. The dimensions include: 1) a patent classification symbol for each type of patent classification symbol, 2) a date dimension, 3) a location dimension (country, region, etc.), 4) an attribute dimension of a subject It is possible to divide the applicant into companies, universities, research institutes, individuals and others, and divide the enterprise into multinational corporations, large corporations, small and medium enterprises, (Refer to the attributes of each applicant in the above), or a composite dimension resulting from combination of any of the above.

둘째, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 개별 특허 문건 마다의 표 35 내지 표44의 정보 중 적어도 하나 이상이 통합된 상기 메모리에 저장된 데이터를 대상으로 기설정된 분석 주제별로 다차원 연산을 수행하여 상기 표 38와 같은 분석 주제별 다차원 분석용 연산 수행 결과 테이블을 생성한다.The multidimensional analysis calculation execution result table generation module 402 analyzes the data stored in the memory in which at least one of the information in Tables 35 to 44 for each patent document is integrated, And generates an operation result table for the multi-dimensional analysis according to the analysis subject as shown in Table 38. [

출원인별 IPC 레벨별 랭킹 정보Ranking information by IPC level by applicant

상기 표 38와 같은 다차원 분석용 연산 수행 결과 테이블이 주어질 때, 상기 분석 모듈은 하기와 같은 분석 결과를 생성할 수 있게 된다.Given the multidimensional analysis operation result table as shown in Table 38, the analysis module can generate the following analysis result.

첫째, 출원인이 입수될 때, 상기 입수된 출원인에 대하여, 특허 분류 기호의 각 레벨별 출원수/등록수 기준 랭킹 정보를 생성할 수 있다. 상기 랭킹 정보는 입수된 출원인의 적어도 하나 이상의 특허 분류 기호의 각 레벨별로 상기 다차원 연산된 출원 또는 등록 문건의 카운트 값을 비교하는 방법으로 랭킹 정보를 생성할 수 있다. 예를 들면, 상기 다차원 분석용 연산 수행 결과 테이블데이터에서 출원인 A의 1dot 서브그룹 레벨(C5 레벨)에서 H04B 7/02의 다차원 연산된 문건의 개수 카운팅값과, 출원인 A의 동일한 1dot 서브그룹 레벨에서의 다른 특허 분류 기호에 대응되는 다차원 연산된 문건의 개수를 카운팅값 등을 비교함으로써, 출원인 A의 1 dot 서브그룹 레벨에서의 각 특허 분류 기호별의 다출원/다등록 랭킹을 구할 수 있 게 된다.First, when the applicant is acquired, the applicant can generate ranking information based on the number of applications / registration number per level of the patent classification symbol for the applicant. The ranking information may generate ranking information by a method of comparing count values of the multidimensional calculated application or registered documents for each level of at least one patent classification symbol of the acquired applicant. For example, in the multidimensional analysis operation result table data, the count value of the multi-dimensional calculated document of H04B 7/02 at the 1dot subgroup level (C5 level) of applicant A and the count value of the multidimensional document of the applicant A at the same 1dot subgroup level The multi-application / multi-registration ranking of each patent classification symbol at the 1-dot subgroup level of applicant A can be obtained by comparing the number of multi-dimensional calculated documents corresponding to the other patent classification symbols of patent A .

총량 분석 모듈Total amount analysis module

하기 표 39과 같은 각 출원인별, IPC 레벨별, 연도별 출원(또는 등록) 카운트해 놓은 다차원 분석용 연산 수행 결과 테이블이 있다고 하자.Assume that there is a multidimensional analysis operation execution result table for each applicant, IPC level, and year as filed (or registration) count as shown in Table 39 below.

[표 39][Table 39]

GC GC C4C4 C5C5 C6C6 C7C7 C..C .. 0101 0202 0303 0404 0505 H04B 7/00H04B 7/00 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1212 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1212 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 33 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 33 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 2424 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 2424 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 66 33 1010 4242 3939 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 1515 77 3030 6060 6464 B29C/100B29C / 100 .... .... H04BH04B ....

상기 다차원 분석용 연산 수행 결과 테이블에는 특정 국가 단위로 모든 출원 인에 대해서 출원인을 기준으로 상기와 같은 다차원 분석용 연산 수행 결과 테이블을 생성할 수 있을 것이다. 예를 들어 출원인 C의 특정 특허 문건에 포함된 특허 분류 기호 및 출원일(날짜)이 있는 가 있으면, 그 특허 문건의 특허 분류 기호 및 출원일 데이터로 특허 분류 기호 체계를 참조하여 상기 표에 대응되는 특정 셀의 카운트 값을 신설하거나(특허 분류 기호, 출원인(AppName) 등), 숫자의 경우 증가시킬 수 있을 것이다. 상기 표와 같이 특정 국가 단위로 모든 출원인에 대하여 다차원 분석용 연산 수행 결과 테이블을 생성하는 경우, 상기 분석 모듈은 상기 다차원 분석용 연산 수행 결과 테이블로부터 하기와 같은 분석 결과들을 생성할 수 있을 것이다.The multidimensional analysis operation result table may be generated based on the applicant for all applicants in a specific country. For example, if there is a patent classification symbol and a filing date (date) included in a specific patent document of the applicant C, the patent classification system and the filing date data of the patent document are referred to, (Such as a patent classification code, an applicant (AppName), etc.), or increase the number. As described above, when generating multidimensional analysis operation execution result tables for all applicants in a specific country, the analysis module can generate the following analysis results from the multidimensional analysis operation execution result table.

먼저 출원인 A의 IPC subclass level 그룹핑된 정보를 보여주기 위해서, 이 레벨에서의 연도별 카운트값을 생성할 수 있을 것이다. 최근 필드의 값은 기설정된 기준일(다차원 분석용 연산 수행 결과 테이블 조회일이 그 예가 될 수 있다.)이 속하는 날짜보다 1년 6개월(18개월)전이 속하는 날)의 해당 연도부터 다차원 분석용 연산 수행 결과 테이블 조회일까지 입수된 특허 문건에서 기인한 카운트 값인 것이 바람직하다. (...해당 카운팅 값을 말한다) 하기 표 40은 그 일례를 보여 준다.To show applicant A's IPC subclass level grouped information in the first place, a yearly count value at this level may be generated. The value of the most recent field is the day of the year in which one year and six months (18 months) of transition belongs to the preset reference date (the date of inquiry of the operation result table for multidimensional analysis) It is preferable that the count value is derived from the patent document obtained up to the inquiry result table inquiry date. (... refers to the corresponding counting value). Table 40 shows an example thereof.

[표 40][Table 40]

랭킹ranking IPC subclassIPC subclass 0101 0202 0303 0404 0505 최근lately 1One H04B(+)H04B (+) 22 B29C(+)B29C (+)

물론 상기 표 40의 H04B를 드릴다운하면, 출원인 A의 특허 문건 중 특허 분 류 기호가 H04B의 직하위에 있는 특허 분류 기호에 해당되는 문건들에 대한 카운트값이 하기 표 41과 같이 나올 것이다. 이는 상기 표와 같은 다차원 분석용 연산 수행 결과 테이블에서 "AppName = A and IPC level = C4"라는 조건으로 카운팅하면 될 것이며, 정열을 C4를 기준으로 하면 될 것이다.(order by IPC level = C4, C4는 IPC subclass를 의미한다.)Of course, if we drill down on H04B of Table 40 above, the count value for the documents corresponding to the patent classification symbol that is above the patent classification symbol of H04B among the patent documents of applicant A will be as shown in Table 41 below. This is done by counting on the condition of "AppName = A and IPC level = C4" in the operation result table for multidimensional analysis as shown in the above table, and ordering will be based on C4 (order by IPC level = C4, C4 Means the IPC subclass.)

[표 41][Table 41]

랭킹ranking IPC subclassIPC subclass IPC main groupIPC main group 0101 0202 0303 0404 0505 최근lately 1One H04B(+)H04B (+) H04B 1/00(+)H04B 1/00 (+) H04B 3/00(+)H04B 3/00 (+) B29C(+)B29C (+)

상기 표 41에서 다시 드릴다운하면 1dot subgroup기준에서의 다차원 연산된 카운팅 값이 나오는 것이 직전에서 설명한 바와 같을 것이다. (물론 상기 H04B를 드릴다운하면, 출원인 A의 특허 문건 중 특허 분류 기호가 H04B의 직하위에 있는 특허 분류 기호에 해당되는 문건들에 대한 카운트값이 하기와 같이 나올 것이다. 이는 상기 표와 같은 다차원 분석용 연산 수행 결과 테이블에서 "AppName = A and IPC level = C5"라는 조건으로 카운팅하면 될 것이며, 합산하여 정열하는 것은 C5를 기준으로 하면 될 것이다.(order by IPC level = C5, C5는 IPC main group 를 의미한다.) 상기 각 셀에 있는 카운팅값은 자신의 하위 특허 분류 기호에 해당되는 모든 값들이 다차원 연산된 값이다.If drilling down again in the above Table 41, the multidimensional computed count value on the 1-dot subgroup basis will be as described immediately before. (Of course, if you drill down on H04B, the count value for the documents that fall under the patent classification mark that is above the patent classification mark H04B among the patent documents of Applicant A will be as follows: (IPC level = C5, C5 is the IPC main group), and the IPC level (C5, C5) will be counted as "AppName = A and IPC level = C5" ). The counting value in each cell is a value obtained by multi-dimensional calculation of all values corresponding to the sub-patent classification code of the cell.

상기에서 H04B 1/00(+)를 드릴 다운하면, 그 하위에 있는 특허 분류 기호가 나올 것이며, 각 셀의 값은 "AppName = A and IPC level = C6"라는 조건으로 카운팅하면 될 것이며, 정열을 C6를 기준으로 하면 될 것이다.(order by IPC level = C6, C6는 IPC 1 dot group 를 의미한다.) 상기 각 셀에 있는 카운팅값은 자신의 하위 특허 분류 기호에 해당되는 모든 값들이 다차원 연산된 값이다. 이와 같이 계속하여 더 이상의 하위 특허 분류 기호가 없을 때까지 드릴 다운할 수 있으며, 추가적으로 드릴 다운할 수 없을 때까지 상기와 같은 조건의 입력으로 필요한 셀 값을 생성할 수 있을 것이다.If you drill down on H04B 1/00 (+) above, you will get the patent classification code below it, and the value of each cell will be counted under the condition "AppName = A and IPC level = C6" C6 (order by IPC level = C6, C6 means IPC 1 dot group). The counting value in each cell is calculated by multiplying all the values corresponding to the sub-patent classification code of the cell Value. In this way, it is possible to continue to drill down until there are no more sub-patent classification symbols, and the necessary cell values can be generated by inputting the above conditions until further drill down is not possible.

상기와 같은 방식은 IPC에 대해서 설명했으나, 이는 USPC, FT, FI, ECLA에 대해서도 마찬가지임은 자명하다 할 것이다.Although the above method has been described for IPC, it will be obvious that the same applies to USPC, FT, FI, and ECLA.

마찬가지로 상기는 출원(또는 등록 총량)기준으로 다차원 연산된 다차원 분석용 연산 수행 결과 테이블의 생성 방법, 드릴 다운 시의 셀값의 계산 방법 및 계산 값의 표현 방식을 설명하였으나, 이는 점유율, 집중율, 및/또는 활동율에 대해서도 당업자에게는 자명하게 다차원 분석용 연산 수행 결과 테이블을 생성할 수 있을 것이다.Similarly, the above describes a method of generating a multi-dimensional arithmetic operation result table for multidimensional analysis, a method of calculating a cell value at the time of drilling down, and a method of expressing a calculated value based on the application (or the total amount of registration) / RTI > and / or the activity rate, as will be apparent to those skilled in the art.

한편, 상기와 같은 방식으로 특허 분류 기호가 개입되어 있는 기설정된 정의식에 대응되는 다른 분석 지표에 대해서도 동일한 방식으로 다차원 연산된 계산값을 생성할 수 있을 것이며, 드릴 다운 시의 셀값을 계산할 수 있을 것이며, 계산된 값을 표현할 수 있을 것이다.On the other hand, it is possible to generate a multidimensional calculation value in the same manner for another analysis index corresponding to the predetermined defining formula in which the patent classification symbol is intervened in the above manner, and the cell value at the time of drilling down can be calculated , And the calculated value can be expressed.

상기와 같은 다차원 분석용 연산 수행 결과 테이블이 있을 때, 적어도 하나 이상의 특정 특허 분류 기호가 주어졌을 때, 카운트 값이 많은 순위로 출원인들의 랭킹 및 그 출원인들의 연도별 카운트값을 생성할 수도 있을 것이다. 예를 들어 H04B가 주어졌다고 가정했을 때, "IPC level = C4 and C4 = H04B"라는 조건을 주고 출원인 별로 합산하여 정열하는 것은 출원인 별로 수행하면 될 것이다.(order by AppName) When there is an operation result table for the multidimensional analysis as described above, when at least one specific patent classification symbol is given, the rank value of the applicants and the applicant's yearly count value may be generated with a high count value. For example, assuming that H04B is given, it will be done by applicant to order by IPC level = C4 and C4 = H04B.

점유율 및 기타 데이터 생성 예Share and other data generation examples

예를 들어 점유율을 "특정 속성을 가지는 목적 문건 집합의 문건 수를 특정 속성을 가지는 전체 문건 집합의 문건 수로 나눈 값"으로 정의한다면 점유율에 대해서도 상기 표 39과 같은 테이블 형식의 데이터를 생성할 수 있을 것이다. 즉, 특정 기업 A의 특정 특허 분류 기호를 대상으로 한 점유율을 구할 경우, 목적 문건 집합은 A의 문건 집합 중 특정 특허 분류 기호를 포함하는 문건 집합이며, 전체 문건 집합은 특정 특허 분류 기호를 포함하는 전체 문건 집합이 될 것이다. 이때, 상기 특허 분류 기호를 포함하는 문건 집합을 구할 때, 그 특허 분류 기호의 하위 특허 분류 기호를 포함하는 문건들도 당연히 포함되어야 함이 마땅할 것이다.For example, if the occupancy rate is defined as "a value obtained by dividing the number of documents of a set of objective documents having a specific attribute by the number of documents of a total document set having a specific attribute", data of a table format as shown in Table 39 can be generated will be. That is, if the occupancy rate of the specific patent classification code of the specific company A is obtained, the target document set is a document set including a specific patent classification symbol among A document sets, and the entire document set includes a specific patent classification symbol It will be a complete set of documents. At this time, when a document set containing the patent classification symbol is obtained, documents containing the sub-patent classification symbol of the patent classification symbol should naturally also be included.

한편, 본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 점유율 뿐만 아니라 다른 수식으로 정의될 수 있는 특허 지표에 대해서도 총량 분석과 동일한 방법으로 연산 수행 결과를 저장해 놓을 수 있을 것이다. Meanwhile, the multidimensional analysis calculation execution result table generation module 402 of the present invention may store the calculation execution results in the same manner as the total amount analysis for the patent index that can be defined by other equations as well as the occupancy rate.

이하, 분석 지표를 소개한다. 1.기술혁신 활동의 집중도 분석 지표에는 1)현시기술우위 지수(Revealed Technological Advantage), 2)현시특허우위 지수(Revealed Patent Advantage), 3) 집중률 지수(CRn, Concentration Ratio n), 4) 허핀달 지수(HHI, Herfindahl Index) 등이 있다.Hereinafter, an analysis index is introduced. (1) Revealed Technological Advantage, (2) Revealed Patent Advantage, (3) Concentration Ratio, and (4) (HHI, Herfindahl Index).

2.기술수준 분석 지표 에는 1) 피인용 가중 특허 수(Patent Count Weighted by Citations), 2) 특허 당 피인용 수(CPP, Cites per Patent), 3) 특허영향 지수(PII, Patent Impact Index), 4) 현재영향 지수(CII, Current Impact Index), 2. Technology level analysis indicators include 1) Patent Count Weighted by Citations, 2) Cites per patent, 3) Patent Impact Index (PII) 4) Current Impact Index (CII)

5) 기술력 지수(TS, Technology Strength), 6) 기술순환주기 지수(TCT, Technology Cycle Time), 7) 과학연계 지수(SL, Science Linkage), 8) 특허 당 평균 청구항 수(Average Claims per Patent), 9) 패밀리 규모(Family Size) 등이 있다.5) TS, Technology Strength, 6) Technology Cycle Time, 7) SL, Science Linkage, 8) Average Claims per Patent, , And 9) Family Size.

3.협력관계 및 지식흐름 분석 지표에는 1) 공동출원 · 발명 특허 수(Number of Patents with co-applicants, with co-inventors), 2) 샐턴 지수(Salton's Index), 3) 인력 유입률, 유출률(Brain Gain, Brain Drain) , 4) 인용관계를 이용한 지식흐름 분석 지수(Index for Knowledge Flow with Patent Citations)3. Cooperation Relationships and Knowledge Flow Analysis Indicators include 1) Number of Patents with co-inventors, 2) Salton's Index, 3) Workforce inflow rate, Brain Gain, Brain Drain), 4) Index for Knowledge Flow with Patent Citations,

상기의 특허 지표보다 좀더 응용성이 뛰어난 분석 지표를 추가적으로 더 도입할 수 있으며, 이들 모두는 수식화시킬 수 있을 것이며, 모든 수식은 질의식으로 대응시킬 수 있음은 당업자에게는 자명할 것이다.It will be apparent to those skilled in the art that additional analytical indices that are more applicable than the above-mentioned patent indicators may be additionally introduced, all of which may be formulated, and that all formulas can be matched with a quality consciousness.

기준 랭킹 생성 방법How to create a baseline ranking

상기 랭킹의 생성 방법에 관하여 더욱 상세하게 설명한다. 상기 출원인 A가 입수(확정)되면, 제1국 또는 제2국에서의 상기 출원인 A의 문건 집합이 입수(확정)될 수 있고, 그 문건 집합에서 IPC 등을 추출할 수 있으며, 상기 추출된 각 IPC에 서 각 단계별로(섹션에서 n dot 서브그룹까지 모두) 그 각 단계의 IPC별로 해당 문건 수를 카운팅할 수 있게 되고, 상기 카운팅된 값으로 각 IPC 레벨별로 다출원/다등록 총량, 집중, 활동 분야의 랭킹을 계산할 수 있다.(특허 지표 기준 출원인 A의 특허 분류 기호 레벨별의 랭킹 정보 생성에 대해서는 전술한 바가 있다.) 상기 계산된 랭킹으로 랭킹이 높은 분야를 추출할 수 있게 되며, 그 추출된 적어도 하나 이상의 IPC 등의 특허 분류 기호가 상위 랭킹 기술 분야가 될 수 있다. 상기에서 설명한 방법은 하나의 주체(또는 확장하면 속성)가 주어졌을 때, 그 주체에 대한 적어도 하나 이상의 해당 특허 분류 기호의 각 단계별 최빈 특허 분류 기호의 랭킹을 추출하는 방법이 된다.(예를 들면, 발명자, 대리인 등에 대해서도 자신의 명의가 포함된 문건 집합 기준으로 특허 분류 기호 레벨별 랭킹을 동일한 방법으로 생성할 수 있다.) The method of generating the ranking will be described in more detail. When the applicant A is obtained (confirmed), the document set of the applicant A in the first or second station can be obtained (confirmed), and the IPC or the like can be extracted from the document set. The IPC can count the number of documents by IPC at each step (from section to n dot subgroup) at each step, and the counted value is the total number of applications / It is possible to calculate the rank of the field of activity. (Generation of ranking information by the patent classification symbol level of the applicant A of the patent index standard has been described above.) The field with high ranking can be extracted by the calculated ranking, At least one extracted patent classification symbol such as IPC may be an upper ranking technology field. The method described above is a method of extracting rankings of the most favorable patent classification symbols for each stage of at least one corresponding patent classification symbol for the subject when given one subject (or extended attribute) (for example, , Inventors, agents, etc.) can also be created in the same way by the patent classification symbol level based on the document set containing their names.

도 19에서는 출원 기준 또는 등록 기준으로 대한민국, 미국, 일본, 유럽 등의 국가 단위로, 출원/등록수, 점유율, 집중율 및/또는 활동율에 대하여 상기 출원인 단위로 IPC 등과 같은 특허 분류 기호 체계의 각 레벨별로 상기 기준 랭킹이 생성되었다는 것을 보여 주고 있다. 도 19에서 삼성전자의 경우, 대한민국 출원문건 기준으로 IPC 서브클래스 레벨에서는 H01L이 1위, H04N이 2위임을 알 수 있다.In FIG. 19, the number of patent classification schemes such as IPC and the like in units of applicants for application / registration number, occupancy rate, concentration rate, and / or activity rate in each country such as Korea, USA, And that the reference ranking is generated for each level. In FIG. 19, it can be seen that H01L is ranked first in the IPC subclass level and H04N is ranked second in the case of Samsung Electronics based on the Korean application document.

이를 더욱 상세하게 상술하면 다음과 같다. 1) 속성을 공유하는 적어도 하나 이상의 문건 집합을 확정하는 단계(예를 들어 출원인이 A, 또는 출원인 A의 발명자 a 또는 대리인 B, 또는 특정 키워드가 공통적으로 포함됨, 특정 기간이 공통됨, 특정 속성을 가지는 문건 집합의 문건을 인용하거나 인용 당함 등과 같이 적어도 하 나 이상의 속성을 공유하는 경우, 전체 문건 집합에서 그 속성을 공유하는 부분 문건 집합을 SQL 쿼리문 또는 검색 엔진으로의 질의를 통해서 생성할 수 있게 된다.) , 2) 문건 집합을 구성하는 개별 문건에 대하여 적어도 하나 이상의 해당 특허 분류 기호를 추출하는 단계, 3) 추출된 특허 분류 기호를 상기 특허 분류 기호 체계를 참조하여 모든 상위 특허 분류 기호를 입수하는 단계, 4) 상기 모든 상위 특허 분류 기호를 특허 분류 기호의 레벨별(예를 들어, IPC의 경우라면 섹션에서 n dot 서브그룹까지) 저장하는 단계, 5) 저장된 특허 분류 기호를 포함하고 있는 개별 문건을 레벨별로 롤업 연산하여 카운팅하는 단계, 6) 카운팅 결과를 참조하여 기설정된 분석 지표별로 계산을 수행하는 단계(총량 계산, 집중율 계산, 활동율 계산 등 기설정된 분석 지표별로의 계산을 수행하는 단계), 7) 각 분석 지표별로의 특허 분류 기호의 각 레벨별로 롤업을 고려하여 랭킹을 계산하는 단계(SQL 명령문 중에서 rank 명령을 활용하면 된다.)를 거치면, 제1국 또는 제2국에서 적어도 하나 이상의 해당 특허 분류 기호의 각 단계별 최빈 특허 분류 기호의 랭킹을 추출할 수 있게 된다. 이때, 문건 집합이 특정 속성(예를 들어 출원인 A(복수의 출원인 일 수도 있다. 이하 같다)와 같은 출원인 속성 , 최근 5년 등과 같은 기간 한정, 발명자 C와 같은 발명자 한정 및/또는 이들 각 속성의 조합된 속성(출원인 A의 발명자 C 등) 공유하는 것일 수 있음은 당연할 것이다. 출원인별 문건 집합일 경우, 제1국에 속하는 모든 출원인별로, 상기 1)~7)을 계산해 놓을 수도 있을 것임도 자명하다 할 것이다. The details will be described in detail as follows. 1) determining a set of at least one document that shares attributes (e.g., applicant A, or inventor A or agent B of applicant A, or a particular keyword common to a particular period of time, If you share at least one attribute such as quoting or quoting a document in a set of documents, you can create a set of partial documents that share that attribute in the entire document set through a query to the SQL query or search engine 2) extracting at least one corresponding patent classification symbol for each document constituting the document set, 3) extracting the extracted patent classification symbol by referring to the patent classification system, 4) All of the above patent classification symbols are classified by the level of the patent classification symbol (for example, in the case of IPC, n dot group), (5) counting individual documents including the stored patent classification symbols by roll-up by level, (6) performing calculation for each predetermined index by referring to the counting result 7) Calculating the ranking by considering the roll-up of each level of the patent classification symbol for each analysis index (the step of calculating the ranking, It is possible to extract rankings of the most favorable patent classification symbols for each step of at least one of the patent classification symbols in the first or second station. At this time, if the set of documents is limited to a certain attribute (eg applicant A (such as applicant A), applicant attributes such as last five years, inventor limitations such as inventor C, and / It may be possible to share the combined attributes (such as inventor C of applicant A). In the case of a set of documents by applicant, it may be possible to calculate 1) to 7) above for all applicants belonging to the first country It will be self-evident.

본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 1) ~7) 단계 중 일련의 어느 한 단계 이상을 수행하여 다차원 분석용 연산 수행 결과 테이블 데이터로 생성해 놓을 수 있을 것이다.The multidimensional analysis operation result table generation module 402 of the present invention may generate at least one of the steps 1) to 7) to perform multidimensional analysis operation result table data.

도 19에서 왼쪽 칼럼에 IPC 서브클래스 레벨의 다출원 기준 IPC별 랭킹이 생성되어 있음에 주목하기 바란다. 본 구현된 실시예적 도면에 나와 있는 수치값들은 상기 분석 모듈에 입력된 값을 전산적인 방법으로 처리한 것이며, 데이터의 보충과 삭제, 변경 등에 따라 가변적이며, 따라서 이는 특정 시점을 기준으로의 실제값과 차이가 있을 수 있다. 그러므로, 본 발명의 발명 사상은 본 명세서에서 각종 표 등에 나온 수치값에 있는 것이 아니라, 그 수치값을 생성하는 구조, 구성, 도구, 방법, 정보 처리 절차, 시스템, 활용 방법 등에 있음을 분명히 한다. 이하, 동일하다.It should be noted that in FIG. 19, a multi-filing reference IPC ranking is generated in the left column at the IPC subclass level. The numerical values shown in the embodiments of the present invention are obtained by processing the values input to the analysis module by a computational method and are variable depending on data supplement, deletion, modification, and so on. Therefore, There may be a difference. Therefore, it is to be understood that the inventive concept of the present invention is not limited to the numerical values shown in the tables and the like in the present specification, but is a structure, a configuration, an apparatus, a method, an information processing procedure, a system, and a utilization method for generating the numerical value. Hereinafter, the same applies.

도 19은 H01L에 계속하여 드릴다운했을 때 생성되는 분석 결과에 대한 일실시예적 도면이다. 드릴 다운 한 경우, 드릴 다운한 것에 대해서만 분석 정보값을 생성한다. 이때, AJAX 기술을 사용하면, 전체 페이지의 리로딩(reloading) 없이 신속하게 변경(드릴 다운)이 일어난 부분에 대한 수치값만을 생성하여 사용자에게 제공해 줄 수 있다. 본 발명에서 사용자는 본 발명의 시스템과의 관계에서는 사용자 컴퓨터(300)로 해석되어야 된다. 즉, 시스템의 입장에서는 1) 사용자에게 제공한다는 것은 실제로는 사용자의 컴퓨터에게 전송한다는 것이며, 2) 사용자가 입력한다는 것은 실제로는 사용자의 컴퓨터로부터 사용자의 입력값을 전송 받는다는 것이며, 사용자의 지정, 특정, 선택 등도 모두 실제로는 사용자의 컴퓨터로부터 사용자가 지정, 선특정, 선택한 정보를 전송받는 것이 되므로, 본 명세서에서는 상기 사용자와 관계된 표현은 모두 이와 같이 해석되어야 함이 타당하다. 다만, 사용자를 중심으로 기술한 이유는 기재의 간편성 때문이다.19 is an exemplary diagram of an analysis result generated when drilling down to H01L. When drilling down, the analysis information value is generated only for the drill down. At this time, using AJAX technology, it is possible to generate only a numerical value for a part where a quick change (drilldown) occurs without reloading the entire page and provide it to a user. In the present invention, the user should be interpreted as the user computer 300 in relation to the system of the present invention. That is, in the system, 1) providing to the user is actually transmitted to the user's computer, 2) input by the user is actually receiving the input value of the user from the user's computer, Selection, and the like are all actually received from the user's computer by the user's designation, pre-selection, and selected information, it is appropriate that all expressions related to the user should be interpreted as described above. However, the reason for the description based on the user is due to the simplicity of description.

도 19는 H01L 21/00 및 기타의 하위 특허 분류 기호에 드릴다운했을 때 생성되는 분석 결과에 대한 일실시예적 도면이다. 드릴 다운하여 총량 기준의 분석 결과가 나옴을 알 수 있다. 드릴 다운은 IPC의 최하위까지 가능하며, 데이터가 있는 한 최하위 레벨까지 드릴 다운 된다. 이는 다른 특허 분류 기호에 대해서도 마찬가지일 것이다.19 is an exemplary diagram of the analysis results generated when drilling down to H01L 21/00 and other subpublic classifications. It can be seen that the result of the analysis based on the total amount comes out by drilling down. Drilling down to the bottom of the IPC is possible, drilling down to the lowest level as long as there is data. This is also true for other patent classification codes.

도 31은 본 발명의 특허 분류 기호를 활용한 기술 영역 분석의 일례로서,미국 출원 전체 문건 기준 IPC H04B 및 이들의 하위 분류에 대한 드릴다운을 포함한 출원 총량 분석에 대한 일실시예적 도면이다. 이때, 상기 각 셀의 분석 데이터들은 하위 특허 분류 기호에 대응하는 데이터가 모두 롤업되어 포함되어 있다.FIG. 31 is an exemplary illustration of an application total amount analysis, including a drilldown on IPC H04B and their subclasses based on the US application entire document, as an example of a technology domain analysis using the patent classification symbol of the present invention. At this time, the analysis data of each cell includes all the data corresponding to the sub-patent classification symbol rolled up.

경쟁 분석용 다차원 분석용 연산 수행 결과 테이블Multidimensional analysis operation result table for competition analysis

경쟁 분석을 수행하기 위하여 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 경쟁 분석용 다차원 분석용 연산 수행 결과 테이블 데이터를 생성해 놓는다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 어떤 과정을 거쳐서 경쟁 분석용 다차원 분석용 연산 수행 결과 테이블 데이터를 생성하는지에 대해서 설명한다.In order to perform the competition analysis, the multidimensional analysis calculation execution result table generation module 402 generates calculation result table data for multidimensional analysis for competition analysis. A description will be given of how the multidimensional analysis calculation execution result table generation module 402 generates the multidimensional analysis calculation execution result table data for competition analysis.

출원인 관점의 경쟁Competition from Applicant Perspective

출원인 관점의 경쟁이란, 그 출원인A가 속한 국가(제1국) 또는 적어도 하나 이상의 다른 국가(제2국)에서의 다른 출원인들B 간의 충돌로 정의될 수 있다. 한편, 출원인 간의 충돌이란 1) 다출원 기술 분야의 공통, 2) 고집중 분야의 공통, 3) 고활동 분야의 공통 등이 있을 수 있으며, 다출원/고집중/고활동 기술 분야에서 기술 분야는 각 레벨별 IPC 등과 같은 각 레벨별 적어도 하나 이상의 특허 기술 분류 기호로 정의되는 분야인 것이 바람직하다. 예를 들어 제 1국에서 출원인 A가 IPC subclass 기준으로 H01L에 다수 출원하고 있을 때, 제1국에서 IPC subclass 기준으로 H01L에 1) 다수 출원하고 있는 자(점유율이 높은 자), 2) 집중율이 높은 출원인, 3) 활동율이 높은 출원인은 경쟁자가 될 수 있다.(제 2국에서 상기 1), 2), 3) 출원인들은 직접적 또는 잠재적 경쟁자가 될 수 있다.) 한편, 출원인 A가 H01L에 집중율 또는 활동율이 높을 때, 제1국 또는 제2국의 상기 1), 2), 3) 출원인들은 경쟁자가 될 수 있을 것이다.Competition from the applicant's point of view can be defined as a conflict between the applicant A's country (first country) or another applicant B at least one other country (second country). On the other hand, the conflicts between applicants can be classified into 1) common in multiple application technology fields, 2) common in high concentration fields, 3) common in high activity fields, and in the field of application / It is preferable that the field is defined by at least one patent technical classification symbol for each level such as IPC for each level. For example, in the first country, when applicant A is filing a number of applications in H01L on the IPC subclass basis, 1) applicants (who have a high share) to H01L on the basis of IPC subclass in the first country, 2) 3) Applicants may be direct or potential competitors in the second country.) Meanwhile, Applicant A is a member of H01L When the concentration rate or the activity rate is high, the above 1), 2), 3) applicants of the first or second country may become competitors.

출원인 관점의 경쟁 정보 입수 방법How to obtain competitive information from the applicant's point of view

출원인 관점의 경쟁 정보 입수 방법을 설명한다. 상기에서 출원인 A의 각 국가의 각 분석 지표별(총량, 점유율, 집중율, 활동율 및 기타 등등)의 특허 분류 기호의 레벨별의 랭킹 정보의 생성 방법에 대해서 설명한 바 있다. 예를 들어, 삼성전자주식회사의 대한민국 특허 출원 문건 기준 IPC 1 dot 서브클래스(C5 레벨)에서 H04B 7/02이 다출원 서브클래스 IPC로 선택되었을 때 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 다음과 같은 정보를 생성한다.Describe how to obtain competitive information from the applicant's point of view. A method of generating ranking information for each level of patent classification symbols of each analysis index (total amount, occupancy rate, concentration rate, activity rate, and the like) of each country of applicant A has been described above. For example, when H04B 7/02 is selected as a multi-filed subclass IPC in IPC 1 dot subclass (C5 level) based on Korean patent application document of Samsung Electronics Co., Ltd., the multidimensional analysis calculation execution result table generation module 402 Generates the following information.

입수 받은 IPC H04B 7/02에 대하여 제1국에서 출원 문건 기준/등록 문건 기준으로 1) 다출원/다등록 출원인, 2) 상기 H04B 7/02에 집중율이 높은 출원인, 3) 활동율이 높은 출원인, 또는 4) 기타 특허 분석 지표별 계산값이 높은 출원인을 추출할 수 있다. 상기 추출의 방법은 기본적으로 SQL 쿼리문으로 처리할 수 있다. 이때, 총량, 점유율, 집중율, 활동율 등의 분석 지표를 기준으로, 특허 분류 기호의 레벨별로 다차원 분석용 연산 수행 결과 테이블 데이터를 생성해 놓은 경우, 그 다차원 분석용 연산 수행 결과 테이블 데이터에 간략한 SQL 쿼리문으로 접근하여 목표하는 경쟁 관련성이 높은 적어도 하나 이상의 출원인 정보 및 상기 출원인의 기설정된 기간 단위의 출원/등록량 등의 정보를 입수할 수 있다. 만약, 상기 다차원 분석용 연산 수행 결과 테이블 데이터를 생성해 놓지 않고 있는 경우에는 다음과 같은 순서를 취하는 상대적으로 길고 복잡한 SQL 문을 사용하여 목표하는 정보를 입수한다. 2) Applicants who have a high concentration rate in H04B 7/02, 3) Applicants who have a high activity rate in IPC H04B 7/02 Applicant, or 4) applicants with high calculated values for other patent analysis indicators can be extracted. The above extraction method can be basically processed by an SQL query statement. In this case, when the operation table data for the multidimensional analysis is generated for each level of the patent classification symbol based on the analysis indexes such as the total amount, the occupancy rate, the concentration rate, and the activity rate, It is possible to obtain at least one applicant information having a high degree of competition relevant to the target and information such as the application / registration amount of the applicant's predetermined period of time by accessing the SQL query statement. If the multidimensional analysis operation result table data is not generated, the target information is obtained by using a relatively long and complicated SQL statement taking the following sequence.

상기 SQL 문의 로직은 1) 입수 받은 특정 레벨의 특허 분류 기호(예를 들면 IPC H04B 7/02) 및 상기 특허 분류 기호의 하위 특허 분류 기호들(상기 하위 특허 분류 기호들은 특허 분류 기호 마스트 DB(203)에 조회하여 입수한다. 이에 대해서는 전술한 바 있다.)을 포함하고 있는 모든 문건을 국가 단위/국가 통합 단위의 특허 문건 마스트 DB(202)로부터 추출하는 단계(이 경우, 동일 문건이 2회 이상 나올 경우 중복을 제거해야 한다.), 2) 상기 추출된 문건의 서지 사항 정보에서 출원인, 출원일/등록일 등의 날짜 정보를 획득하는 단계, 3) 다출원인/다등록출원인 기준으로 소팅하여 다출원인/다등록출원인에 대한 랭킹 정보를 획득하는 단계, 및/또는 4) 출원일/등록일 정보로부터 기설정된 기간 단위의 출원/등록량을 카운팅 하는 단계를 처리하는 명령으로 이루어진다. 물론, 상기 1) ~4) 단계 중 어느 둘 이상은 한꺼번에 처리될 수 있음은 물론이다. 상기 SQL 로직은 총량 기준으로 설명해 놓았지만, 점유율, 집중율 및 활동율 등의 기타 특허 분석 지표에 관해서도 대동소이한 방법으로 처리할 수 있다. The logic of the SQL statement may include 1) obtaining a specific level of patent classification code (for example, IPC H04B 7/02) and sub-patent classification codes of the patent classification code (the sub-patent classification codes are the patent classification code mast DB 203 (Which is described above), from the patent document master database 202 of the national unit / country integrated unit (in this case, the same document is repeated twice or more 2) obtaining date information such as applicant, filing date / registration date, etc. from the bibliographic information of the extracted document; 3) sorting on the basis of all applicants / And / or 4) a step of counting the application / registration amount in a predetermined period unit from the filing date / registration date information . Of course, it is needless to say that any two or more of the above 1) to 4) steps can be processed at once. Although the above SQL logic is described on the basis of the total amount, other patent analysis indexes such as the occupancy rate, concentration rate and activity rate can be processed in the same manner.

예를 들어, 집중율이 높은 출원인을 찾기 위해서는 상기 2) 단계 후에 출원인 목록을 작성한 다음도 각 출원인의 H04B 7/02에 관한 출원수/등록수를 각 출원인의 모든 출원수/등록수 정보(특허 문건 마스트 DB(202)에서 SQL로 입수할 수 있음)로 나누어 집중율을 계산한 다음, 상기 계산된 집중율로 상기 3) 단계의 소팅을 처리할 수 있을 것이다. 만약, 하기와 같이 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기 특허 분류 기호의 레벨별로 다차원 분석용 연산 수행 결과 테이블 데이터를 생성해 놓은 경우에는 간단한 SQL로 처리할 수 있을 것이다.(이러한 간단한 SQL은 당업자에게 극히 용이한 일을 것이다.) For example, in order to find an applicant having a high concentration rate, after the list of applicants is prepared after the step 2), the number of applications / registration number per applicant H04B 7/02 is updated with the number of all applications / (Which can be obtained as SQL in the DB 202), and then the sorting of the step 3) may be processed with the calculated concentration ratio. If the multidimensional analysis operation result table generation module 402 generates the multidimensional analysis operation result table data by the level of the patent classification symbol as described below, it can be processed with simple SQL. Such simple SQL would be extremely easy for a person skilled in the art.)

상기 1) 단계에서 주어진 특허 분류 기호의 하위 특허 분류 기호를 입수하는 것이 확장자(*, ? 등)으로서 용이하게 획득할 수 없는 경우에는 상기 1) 단계의 데이터 처리에서 많은 부하가 걸리는 문제가 있다. 그러므로, 이러한 경우 하기 표 55와 같은 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 하위 특허 분류 기호에 관련된 문건의 카운트 정보를 기초로 롤업 연산을 통해서 생성해 놓은 데이터가 더욱 유용하게 된다.If obtaining the sub-patent classification symbol of the given patent classification symbol in step 1) can not be easily obtained as an extension (*,?, Etc.), there is a problem that much processing is required in the data processing of step 1). Therefore, in this case, the data generated through the roll-up operation based on the count information of the document related to the sub-patent classification symbol is more useful for the multidimensional analysis operation result table generation module 402 shown in Table 55 below.

[표 56][Table 56]

GC GC C5C5 C6C6 C7C7 C..C .. 0101 0202 0303 0404 0505 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1212 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/06H04B 7/06 00 00 33 1717 1212 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 33 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 H04B 7/08H04B 7/08 33 1One 55 44 33 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 2424 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 33 22 22 2121 2424 H04B 7/02H04B 7/02 H04B 7/04H04B 7/04 66 33 1010 4242 3939 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/10H04B 7/10 00 00 00 00 00 H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 H04B 7/12H04B 7/12 00 1One 00 00 1One H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 99 33 2020 1818 2424 H04B 7/02H04B 7/02 1515 77 3030 6060 6464 H04B 7/02H04B 7/02 5050 4040 3030 3030 3030

상기 표 56와 같은 다차원 분석용 연산 수행 결과 테이블과 같은 데이터가 있을 때는 상기 IPC 1 dot 레벨(C5 칼럼)에서의 H04B 7/02를 가지는 레코드를 추출한 다음, 그 레코드들의 개수를 출원인 필드인 AppName으로 group by하여, 랭킹을 매기면 된다. 이와 같이 하면 A의 H04B 7/02 기준으로 할 때 경쟁 출원인으로 B 등을 추출할 수 있게 되고, B의 연도별/기간별 출원/등록수는 상기 다차원 분석용 연산 수행 결과 테이블에서 생성한 상기 출원인 B의 문건 중에서 H04B 7/02 및 상기 H04B 7/02의 하위 특허 분류 기호를 포함하는 문건을 처리한 값을 읽어서 사용자들에게 제공할 수 있게 된다.When there is data such as the multidimensional analysis operation result table as shown in Table 56, a record having H04B 7/02 at the IPC 1 dot level (C5 column) is extracted, and the number of records is extracted as Appident field AppName group by, and rank it. In this way, it is possible to extract B and the like as a competitive applicant when the standard of H04B 7/02 of A is extracted, and the application / registration number per year / period of B is obtained by the applicant B It is possible to read out the processed value of the documents including the sub-patent classification codes of H04B 7/02 and H04B 7/02 among the documents and to provide them to the users.

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 적어도 하나 이상의 경쟁 분석용 다차원 분석용 연산 수행 결과 테이블 데이터를 생성하는 단계는 다음과 같다. 먼저 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402) 은 기설정된 경쟁의 종류별로 상기 표 56와 같은 다차원 분석용 연산 수행 결과 테이블 데이터를 생성한다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기 표65와 같은 형태로 점유율, 집중율, 활동율 또는 기타의 분석 지표로 상기 표65와 같은 테이블/데이터를 생성할 때, 상기 AppName, 특허 분류 기호의 레벨 등은 동등하고, 상기 연도별/기간별 수치값의 내용이 각각 점유율, 집중율, 활동율 또는 기타 분석 지표의 값이 되면 된다.The step of generating the multi-dimensional analysis operation result table generation module 402 generates at least one multi-dimensional analysis operation result table data for competition analysis is as follows. The multidimensional analysis operation result table generation module 402 generates multidimensional analysis operation result table data as shown in Table 56 for each type of competition set. When the multi-dimensional analysis operation result table generation module 402 generates table / data as shown in Table 65 with the occupancy rate, concentration rate, activity rate or other analysis index in the form of Table 65, the AppName, And the level of the patent classification symbol are equivalent, and the content of the numerical value per year / period may be the value of the occupancy rate, the concentration rate, the activity rate, or other analysis index.

상기에서는 입수 받은 IPC H04B 7/02에 대하여 제1국에서 출원 문건 기준/등록 문건 기준으로 1) 다출원/다등록 출원인, 2) 상기 H04B 7/02에 집중율이 높은 출원인, 3) 활동율이 높은 출원인, 또는 4) 기타 특허 분석 지표별 계산값이 높은 출원인을 추출하는 방법에 대하여 설명하였으나, 상기 입수 받은 IPC H04B 7/02에 대하여 제2국을 기준으로 동일한 방법으로 상기 1) ~4)를 추출할 수 있을 것이다.(제1국 특허 문건 데이터를 기준으로 제1국에서 처리했던 바와 동일한 방식으로 제2국 특허 문건 데이터를 기준으로 처리하면 된다. 물론, 각 국가마다의 고유의 특허 분류 기호가 있을 경우, 그 기호들도 IPC 처럼 다단계 계층 구조를 가지며, 그 계층 구조는 dot 등의 수에 의해서 구분될 수 있으므로, 그 기호들에 대해서 IPC와 동등하게 처리하면 된다.)1) Applicants who have filed multiple applications / multi-registrations in IPC H04B 7/02 based on application documents / registration documents in the first country; 2) Applicants with high concentration rate in H04B 7/02; 3) The IPC H04B 7/02 has been described in the same way on the basis of the second country on the basis of the obtained IPC H04B 7/02, (Based on the patent document data of the first country, the patent document data of the second country may be processed in the same manner as that processed by the first country). Of course, If there is a classifier, the symbols have a multi-level hierarchy like IPC, and the hierarchy can be distinguished by the number of dots and so on, so treat them as equivalent to IPC.)

도 33는 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 대한민국 기준 삼성전자주식회사의 총량 기준으로 한 대표적 경쟁사 분석에 관한 일실시예적 도면이다. 상기 출원인별 대표적 경쟁사에 대한 분석은 출원인의 다출원 특허 분류 기호 중 IPC 그룹 레벨의 특허 분류 기호를 대상으로 하여, 그 특허 분류 기호들의 랭킹 및 출원량을 구하고, 상기 특허 분류 기호에 다출원하고 있는 출원인의 출원량 정보를 기반으로 기설정된 계산식에 의하여 점수를 매기고, 점수가 높은 출원인을 랭킹과 함께 제공하는 것이다. 상기 대표적 경쟁사 분석은 대표적 경쟁사들의 연도별 출원수/등록수 정보가 함께 나오는 것이 바람직할 것이다. 물론, 상기 출원수/등록수에 클릭을 하게 되면, 상기 출원수/등록수에 내장되어 있는 쿼리식으로 상기 출원수/등록수에 대응되는 문건을 입수하여 간이 분석 모듈(407)로 전송하고, 상기 간이 분석 모듈(407)은 이들 문건에 대하여 간이 분석한 정보를 제공한다. 본 발명의 모든 셀에 나온 수치는 쿼리값에 기반하고 있고, 각 쿼리식으로부터 해당 수치가 나오게 되는 근거 문건을 입수할 수 있게 되어 있다. 이하 모든 셀에 대해 적용된다. 만약 비율값일 경우에는 그 비율을 만들어 내는 수식에 포함된 요소별 수치 각각에 대하여 쿼리가 대응되어 있으므로, 그 쿼리에 기반하여 근거 문건을 입수할 수 있게 된다.33 is an exemplary diagram of a representative competitor analysis based on the total amount of the Korean standard Samsung Electronics Co., Ltd. among all the applicants in the DB held by the patent information system 1 of the present invention. The analysis of the representative competitor by the applicant is performed by obtaining the ranking and the application amount of the patent classification symbols of the IPC group level among the patent applications of the applicant, Based on the applicant's amount of application information, by a predetermined formula, and to provide applicants with high scores along with the rankings. It is preferable that the representative competitor analysis includes the number of applications / registration number of the representative competitors along with the year. Of course, when the application number / registration number is clicked, a document corresponding to the number of applications / registration number is obtained in the query formula embedded in the number of applications / registration number, and the document is transmitted to the simple analysis module 407, The module 407 provides simplified analysis of these documents. The numerical values in all the cells of the present invention are based on the query value, and a document for obtaining the numerical value from each query expression can be obtained. This applies to all cells below. If the value is a ratio value, the query is matched for each elemental value included in the formula for generating the ratio, so that the reference document can be obtained based on the query.

도 33에서 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 대한민국 기준 삼성전자주식회사의 총량 기준 다출원 특허 기술 분류 기호별 경쟁 출원인에 관한 분석의 일실시예를 볼 수 있다. 특정 기술 분류 기호 레벨별, 기술분류 기호별 경쟁출원인을 입수하는 것은 1) 상기 특정 기술 분류 기호 레벨별로 기준 특허 분류 기호들의 랭킹 정보(상기 도 33에서는 1위 H01L, 2위 H04N 등)를 입수하고, 2) 상기 특정 기술 분류 기호 레벨별 기준 특허 분류 기호별(예를 들면 도 33에서 H01L)로 특정 국가(대한민국, 미국, 일본, 유럽 등에서 사용자가 선택하거나 시스템(1)이 자동으로 선택함)에서 출원량이 많거나(출원량 기준), 점유율이 높거나(점유율 기준), 집중율이 높거나(집중율 기준), 활동율이 높거나(활동율 기준) 기타 특허 지표가 높은 출원인을 랭킹과 함께 추출하고, 3) 상기 출원인의 출원수/등록수를 상기 추출한 출원인 정보와 함께 제공해 줄 수 있게 된다. 이때 드릴 다운을 하게 되면, 출원인의 문건이 1개라도 있는 드릴 다운된 특허 분류 기호의 하부 특허 분류 기호에 대하여 상기 1) ~3)을 실시하여 경쟁 출원인에 대한 정보를 제공해 준다. 상기 드릴다운은 최하위 특허 분류 기호까지 드릴다운될 수 있다. 한편, 상기 경쟁사에 대한 것은 IPC뿐만 아니라, USPC, FT 등에 대해서도 동일한 방법으로 수행할 수 있을 것이지만, 국가 선택에는 제한이 있을 것이다.(USPC는 미국 특허 분류 기호이므로, 대한민국과 일본의 특허 문건으로 경쟁 출원인을 입수할 수 없기 때문이다.) 상기 경쟁 출원인은 출원인의 출원 총량 기준으로도 생성할 수 있지만, 출원인의 등록 총량 기준으로도 생성할 수 있을 것이다. FIG. 33 shows an embodiment of the analysis of the competitive applicants according to the patent application technology classification codes of the total amount based on the Korean standard Samsung Electronics Co., Ltd. among all the applicants in the DB possessed by the patent information system (1) of the present invention. To obtain a competitive applicant for each specific technical classification code level and technical classification code, 1) obtain ranking information (reference number H01L, second place H04N, etc. in the above-mentioned FIG. 33) of reference patent classification symbols by the specific technical classification symbol level , 2) a specific country (selected by the user or automatically selected by the system (1) in Korea, the United States, Japan, Europe, etc.) by the reference patent classification symbol (for example, H01L in FIG. 33) , Applicants with high percentage of applications (based on application amount), high share (based on market share), high concentration (based on concentration ratio), high activity rate (based on activity rate) And 3) the application number / registration number of the applicant together with the extracted applicant information. At this time, if the drill-down is performed, the above-mentioned 1) to 3) are performed on the sub-patent classification code of the drilled-down patent classification symbol with one applicant's document to provide information on the competing applicant. The drill-down can be drilled down to the lowest patent classification symbol. On the other hand, the competitor may be carried out in the same manner not only for the IPC but also for the USPC and the FT. However, the selection of the country will be limited. (Since USPC is a US patent classification code, The applicant can not be obtained.) The competitive applicant can be created based on the total amount of the applicant's application, but it can also be created on the basis of the total amount of the applicant's registration.

한편, 기준 특허 분류 기호를 제1국에서의 특정 출원인의 특허 문건을 기준으로 생성해 놓고, 상기 기준 특허 분류 기호별로 다출원, 고집중, 고점유, 고활동 경쟁 출원인을 상기 제1국이 아닌 제2국에서 추출할 수 있을 수 있다. 상기 제1국 및 제2국의 선택은 사용자에 의해서 선택될 수도 있지만, 시스템(1)에 의해서 default값으로 선택될 수도 있다.On the other hand, if the reference patent classification symbol is generated on the basis of the patent document of the specific applicant in the first country, and the applicant who is highly competitive, highly occupied, highly motivated, And may be extracted from the second country. The selection of the first station and the second station may be selected by the user, but may be selected by the system 1 as the default value.

한편, 진입 경쟁 출원인에 대한 분석도 가능하며, 상기 진입 경쟁사에 대한 분석 정보의 생성은 기설정된 최근 기간(예를 들어 최근 7년 이내)의 문건에 대해서만 처리할 수 있을 것이다. 즉, 1) 상기 최근 기간에 출원된/등록된 제1국에서의 출원인의 문건을 기준으로 상기 특허 분류 기호 레벨별 특허 분류 기호의 기준에 관한 정보를 생성하고, 이 특허 분류 기호의 기준별 i) 최근 기간 또는 ii)전체 기간 기준으로 한 다출원, 고집중, 고점유, 고활동 경쟁 출원인을 상기 제1국 또는 제2국에서 추출할 수 있을 수 있을 것이다. 2) 전체 기간에 출원된/등록된 제1국에서의 출원인의 문건을 기준으로 상기 특허 분류 기호 레벨별 특허 분류 기호의 기준에 관한 정보를 생성하고, 이 특허 분류 기호의 기준별 i) 최근 기간 기준으로 한 다출원, 고집중, 고점유, 고활동 경쟁 출원인을 상기 제1국 또는 제2국에서 추출할 수 있을 수 있을 것이다.On the other hand, it is possible to analyze the entry competitor applicant, and the generation of the analysis information for the entry competitor will be able to be processed only for documents of a predetermined recent period (for example, within the last 7 years). That is, 1) information on the criteria of the patent classification symbol per the patent classification symbol level is generated based on the applicant's document in the first country registered / registered in the latest period, and i ) Recent period, or ii) on the basis of the entire period, can be extracted from the first country or the second country. 2) Generate information on the criteria of the patent classification symbol per the patent classification level based on the documents of the applicant in the first country registered / registered in the whole period, Based on the above, it is possible that an applicant who is highly competitive, highly occupied or highly competitive can be extracted from the first country or the second country.

발명자 관점의 경쟁Competition from an inventor's perspective

상기에서는 출원인 관점의 경쟁 정보를 획득하기 위하여 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 어떻게 데이터를 생성하는가에 대해 설명하였다. 발명자 관점의 경쟁도 출원인 관점의 경쟁을 처리하는 방법(출원인 기준의 문건 집합 단위로, 특허 분류 기호 레벨별로 총량, 점유율, 집중율, 활동율 등과 같은 분석 지표별로 테이블/데이터를 생성해 놓고, 이 데이터에 대해서 출원인 기준으로 접근함)과 동등한 방법(발명자 단위로 문건 집합을 생성하고, 그 발명자별의 문건 집합 단위로 동등한 방법으로 테이블/데이터를 생성한 다음, 이 데이터에 대해 발명자를 기준으로 접근함)으로 생성할 수 있을 것이다. 하기 표 57은 이러한 데이터의 한 예를 보여준다.In the above description, how the multidimensional analysis calculation execution result table generation module 402 generates data in order to obtain competition information from the applicant's perspective has been described. The inventor's view is to compile the table / data for each analysis indicator such as the total amount, the occupancy rate, the concentration rate, the activity rate, etc. by the method of processing the competition from the viewpoint of the applicant Data is accessed on the basis of the applicant) (a set of documents is created for each inventor, and the table / data is generated in an equivalent manner by the set of documents for each inventor, and then the data is accessed by the inventor ). Table 57 below shows an example of such data.

발명자 분석용 다차원 분석용 연산 수행 결과 테이블Inventor analysis multi-dimensional analysis operation result table

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 발명자를 기준으로 생성하는 데이터는 2가지 계열이 있을 수 있다. 첫째는 하나의 출원인의 하부 에 발명자를 배치시키고, 그 출원인의 발명자 단위의 문건 집합을 처리하는 방식으로 데이터를 생성하는 방법이며(즉, 출원인 등이 통상적으로 큰 기업이나 조직이며, 그 조직 내에 그 발명자가 있다고 가정하고, 그 발명자를 출원인의 하부에 연동시키는 방법이다. 출원인A AND 발명자1, 출원인A AND 발명자 2 ... 와 같이 상기 조건을 만족시키는 문건 집합별로 데이터를 생성할 수 있을 것이다.), 둘째는 발명자만을 독립적으로(출원인 자리에 발명자가 들어가는 형식)으로 발명자 단위의 문건 집합을 처리하는 방식으로 데이터를 생성한다. 전자에 대한 예시는 자명할 것이므로 후자에 대하여 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성한 데이터 형식의 일 실시예를 하기 표 57에서 보여 준다.The data generated by the multi-dimensional analysis operation result table generation module 402 based on the inventor may be two sequences. The first is a method of generating data by placing an inventor in the lower part of one applicant and processing a set of documents of the applicant's inventor (that is, the applicant is usually a large corporation or organization, It is possible to generate data for each set of documents satisfying the above conditions, such as applicant A AND inventor 1, applicant A AND inventor 2, and the like. ), And second, data is generated in such a manner that only the inventor is treated independently (in the form in which the inventor enters the applicant's place) and the set of documents of the inventor is processed. One example of the data format generated by the multi-dimensional analysis operation result table generation module 402 for the latter is shown in Table 57 below because the former will be obvious.

[표 57][Table 57]

IAppNameIAppName InventorInventor C3C3 C4C4 C5C5 C6C6 0000 0101 0202 0303 0404 0505 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/64H04N 5/64 H04N 5/655H04N 5/655 00 00 00 1One 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/64H04N 5/64 H04N 5/655H04N 5/655 00 00 00 1One 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/64H04N 5/64 H04N 5/655H04N 5/655 00 00 00 1One 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/64H04N 5/64 1One 22 00 00 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/64H04N 5/64 1One 22 00 00 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/64H04N 5/64 1One 22 00 00 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/64H04N 5/64 1One 22 00 1One 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/72H04N 5/72 00 1One 00 00 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/72H04N 5/72 00 1One 00 00 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/72H04N 5/72 00 1One 00 00 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/72H04N 5/72 00 1One 00 00 00 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/74H04N 5/74 00 1One 00 1One 1One 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/74H04N 5/74 00 1One 00 1One 1One 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/74H04N 5/74 00 1One 00 1One 1One 00 AA aa H04NH04N H04N 5/00H04N 5/00 H04N 5/74H04N 5/74 00 1One 00 1One 1One 00 AA aa H04NH04N H04N 5/00H04N 5/00 1One 44 00 22 1One 00 AA aa H04NH04N 1One 44 00 22 1One 00 AA bb BB zz

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기와 같은 총량 정보가 있을 때, 특정 출원인의 개별 발명자 단위별로 점유율, 집중율, 활동율 등과 같은 데이터를 생성하는 방법은 상기 출원인 단위에서 동등한 정보를 생성하는 방법을 설명하는 곳에서 설명한 바와 같다. 물론, 상기 표 57과 같은 데이터가 모든 출원인에 대한 모든 발명자 정보를 갖고 있을 경우, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 특정 출원인의 하위가 아닌 개별 발명자 단위에서 상기 총량 정보를 기준으로 하여, 점유율, 집중율, 활동율 등과 같은 특허 분석 지표의 값에 대한 데이터를 생성할 수 있을 것이다.When the multidimensional analysis operation result table generation module 402 has the total amount information as described above, a method of generating data such as occupancy rate, concentration rate, activity rate, and the like for each individual inventor unit of a specific applicant is equal to As described in the section describing how to generate information. Of course, when the data as shown in Table 57 has all the inventor's information for all applicants, the multidimensional analysis calculation execution result table generation module 402 may calculate the total amount information in units of individual inventors , Data on the values of the patent analysis index such as the occupancy rate, the concentration rate, and the activity rate can be generated.

도 35는 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 대한민국 특허 출원 총량 기준 삼성전자주식회사의 연도별 다출원 발명자별 분석 결과에 관한 일실시예적 도면이다. 상기 특허 분류 기호 레벨별 영역별 발명자 분석은 1) 특정 출원인의 상기 특허 분류 기호 레벨별 특허 분류 기호 기준 랭킹을 생성하는 단계, 2) 상기 특허 분류 기호 기준 랭킹별로 i) 연도별로의 최다출원 발명자를 정보를 계산하는 단계, ii) 연도 통합별로 다출원 발명자의 랭킹(예를 들면, 1위부터 10위 등) 정보를 계산하는 단계 중에서 어느 하나를 실생하는 단계, 3) 상기 실행되어 추출한 발명자 분석 정보를 사용자에게 제공하는 단계로 우리어 진다. 한편, 상기 특허 분류 기호 레벨별 영역별 발명자 분석은 국가 단위로 출원인을 특정하지 않고, 출원 문건 또는 등록 문건을 기준으로 실행될 수도 있다. 이때는 1) 전체 대상 문건을 기준으로 특허 분류 기호 레벨별 특허 분류 기호 기준 랭킹을 생성하는 단계, 2) 상기 특허 분류 기호 기준 랭킹별로 i) 연도별로의 최다출원 발명자를 정보를 계산하는 단계, ii) 연도 통합별로 다출원 발명자의 랭킹(예를 들면, 1위부터 10위 등) 정보를 계산하는 단계 중에서 어느 하나를 실생하는 단계, 3) 상기 실행되어 추출한 발명자 분석 정보를 사용자에게 제공하는 단계로 우리어 진다.FIG. 35 is an exemplary diagram of an analysis result of multi-filing inventors of Samsung Electronics Co., Ltd. based on the total amount of Korean patent applications among all the applicants in the DB held by the patent information system 1 of the present invention. The inventor's analysis of the patent classification code level by area includes: 1) generating a patent classification symbol reference rank for each patent classification symbol level of a specific applicant; 2) calculating, by the patent classification symbol standard ranking, i) (Ii) calculating the rankings of the inventors (e.g., the first to tenth places) of the inventors according to the year integration, (3) a step of realizing any one of the steps of (3) To the user. On the other hand, the analysis of the inventor by category according to the level of the patent classification symbol may be performed on the basis of the application document or the registration document without specifying the applicant on a country basis. In this case, 1) generating a patent classification symbol reference ranking for each patent classification symbol level on the basis of the entire object document, 2) calculating information on the maximum number of applicants for each year by the patent classification symbol reference ranking, ii) (1) to (10), and (3) a step of calculating the inventor's analysis information for each year, and (3) It breaks down.

도 36은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 대한민국 특허 출원 총량 기준 삼성전자주식회사의 다출원 다출원 발명자별 분석 결과에 관한 일실시예적 도면이다. 발명자 개인별 분석은 국가 단위로 특정 출원인에 관련된 모든 발명자(출원인을 회사로 한 발명자 등)에 대하여 상기 발명자 개인별 분석 정보의 생성은 1) 특정 국가 단위로 상기 출원인에 관련된 모든 출원 문건 또는 등록 문건을 입수하고, 2) 상기 입수된 문건에서 발명자를 추출하고, 3) 발명자별로 문건의 개수를 기설정된 정책에 따라 카운팅하여, 4) 발명자별 랭킹을 생성하고, 5) 상기 랭킹에 따라 최다 발명자별, 연도별/특정 기간별 출원수/등록수 정보를 생성하여 사용자에게 제공한다. 한편, 특정 출원인에 관계하지 않는 경우에는 국가 단위의 모든 특허 문건을 입수하여 상기 2) ~5)를 실행한다. 상기 문건들은 최근 7년 등과 같이 기설정된 기간 동안의 문건만을 대상으로 할 수도 있을 것이다. 이는 본질적으로 문건 집합의 생성의 문제이며, 문건 집합의 생성은 본 명세서의 곳곳에서 언급한 문건 집합의 생성 방법 중에서 선택되는 어느 하나이어도 무방하다.36 is an example of an analysis result of multi-application multi-applicant inventors of Samsung Electronics Co., Ltd. based on the total amount of Korean patent applications among all applicants in the DB possessed by the patent information system (1) of the present invention. The inventor's individual analysis is performed by the inventor's individual analysis information for all inventors (inventors who have filed a patent application) related to a specific applicant at a country level. The inventor's individual analysis information is obtained by 1) obtaining all application documents or registration documents related to the applicant 2) extracting the inventor from the obtained document, 3) counting the number of documents per inventor according to a predetermined policy, 4) generating a ranking for each inventor, and 5) And generates the number of applications / registration number information for each star / specific period and provides it to the user. On the other hand, if it does not relate to a specific applicant, obtain all patent documents of the country unit and execute the above 2) ~ 5). Such documents may be subject to documents for a predetermined period, such as the last seven years. This is essentially a matter of creating a document set, and the creation of the document set may be any one of the methods of generating the document set referred to elsewhere in this specification.

인용 분석용 다차원 분석용 연산 수행 결과 테이블Citation analysis multi-dimensional analysis operation result table

인용 분석을 수행하기 위하여 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 인용 분석용 다차원 분석용 연산 수행 결과 테이블 데이터를 생성해 놓는다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 어떤 과정을 거쳐서 인용 분석용 다차원 분석용 연산 수행 결과 테이블 데이터를 생성하는지에 대해서 미국 특허 출원 제 09/802,847 호(출원인 Samsung Electronics Co., LTD(KR)., 발명의 명칭 User request processing method and apparatus using upstream channel in interactive multimedia contents service, 출원일 2001.3.12)을 예시로 들면서 설명한다. In order to perform citation analysis, the multi-dimensional analysis operation result table generation module 402 generates multi-dimensional analysis operation result table data for citation analysis. As to how the multidimensional analysis operation execution result table generation module 402 generates the multidimensional analysis operation execution result table data for citation analysis through a certain process, U.S. Patent Application No. 09 / 802,847 (filed by Samsung Electronics Co., LTD (KR), entitled " User request processing method and apparatus using upstream channel in interactive multimedia contents service, filed March 12, 2001).

상기 미국 특허 출원 제 09/802,847호의 서지 사항은 표 42에 나타난 바와 같다.The bibliography of the above-mentioned U.S. Patent Application No. 09 / 802,847 is as shown in Table 42.

[표 42][Table 42]

출원번호Application number 출원일Filing date 등록번호Registration Number 등록일Registration date 인용정보(US특허)Citation Information (US Patent) IPCIPC USPCUSPC 09/802,84709 / 802,847 March 12, 2001March 12, 2001 7,302,4647,302,464 November 27, 2007November 27, 2007 5680322
5805804
6044397
6130898
6317131
6611262
6631403
6654761
6654931
6697869
2002/00266425680322
5805804
6044397
6130898
6317131
6611262
6631403
6654761
6654931
6697869
2002/0026642 G06F 15/16

H04N 7/16G06F 15/16

H04N 7/16 709/203

709/217

725/135709/203

709/217

725/135

이때, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 인용과 관련하여 09/802,847의 서지 사항 정보로부터 다음과 같은 인용 parent-child 데이터를 입수한다. child는 자기 자신의 문건 번호이며, patent는 자기가 인용하고 있는 타 문건 번호를 말한다.(이때, 자신의 문건 번호는 출원번호일 수도 있으며, 등록번호 일 수도 있다. 즉, 자신을 특정화시킬 수 있는 번호이면 좋으나, parent에 공개번호(publication number)나 출원번호가 있을 수 있으므로, 모든 문건이 공통적으로 가지고 있는 번호인 출원번호로 통일하는 것이 바람직하다. 하지만, 등록번호로 통일하는 것도 무방하며, 정보처리의 복잡성이 문제되지 않는다면, 문건의 특정에 문제가 없다면 여러 번호를 혼용하는 것도 무방하다. 본 명세서에서는 설명의 편의상 출원번호로 통일한다.) 하기 표 43에 있는 데이터를 입수 데이터 기준 유닛 인용 parent-child 데이터라 한다.At this time, the multi-dimensional analysis operation result table generation module 402 obtains the following cited parent-child data from the bibliography information of 09 / 802,847 regarding citation. A child is his / her own document number, and a patent is a reference number that he / she is citing (in this case, his / her document number may be an application number or a registration number, that is, However, since the parent may have a publication number or an application number, it is preferable to uniformize the application number, which is a common number of all the documents. However, it is also possible to unify the application number with the registration number, If the complexity of the processing is not a concern, it is also possible to mix several numbers if there is no problem in the specification of the document.) In the present specification, the application numbers are unified for ease of explanation. -child data.

[표 43][Table 43]

child-국가child-country child(출원번호)child (application number) parent(입수데이터 그대로)parent parent-국가parent-country USUS 09/802,84709 / 802,847 56803225680322 USUS USUS 09/802,84709 / 802,847 58058045805804 USUS USUS 09/802,84709 / 802,847 60443976044397 USUS USUS 09/802,84709 / 802,847 61308986130898 USUS USUS 09/802,84709 / 802,847 63171316317131 USUS USUS 09/802,84709 / 802,847 66112626611262 USUS USUS 09/802,84709 / 802,847 66314036631403 USUS USUS 09/802,84709 / 802,847 66547616654761 USUS USUS 09/802,84709 / 802,847 66549316654931 USUS USUS 09/802,84709 / 802,847 66978696697869 USUS USUS 09/802,84709 / 802,847 2002/00266422002/0026642 USUS

상기 표 43와 같이 상기 입수된 인용 parent-child 데이터로부터 하기 표 44과 같이 출원번호로 통일된 인용 parent-child 데이터를 생성한다. parent의 등록번호 및 공개번호(출원번호라도 무방함, 모든 번호 정보에는 document type이 있음. A등록, A1공개 등)로부터 각각의 출원번호를 입수한다.(US에서는 출원번호 또는 출원일련번호라고 칭하나, 이는 명칭일 뿐 그 명칭이 가리키는 데이터의 속성은 동일하다. 이상과 이하에서 출원번호라고 통일해서 설명한다.) 하기 표에 있는 데이터를 통일화된 번호 기준 유닛 인용 parent-child 데이터라 한다. 한편, child와 parent칼럼에 있는 문건의 국가 칼럼들은 그 해당 문건이 관련된(출원되거나 등록된) 출처 국가를 말한다. 이러한 국가는 미국뿐만 아니라 여러 나라 국가가 모두 해당될 수 있다. 이하에서는 설명의 편의상 국가 칼럼은 생략하고 설명하나, 국가 칼럼이 있다고 이해하여야 한다. 한편, 상기 출원번호의 표기 형태는 "두자리수 + / + 6자릿수"나 "연도+6자리 수"등 임의의 표기일 수 있다.As shown in Table 43, the cited parent-child data unified with the application number is generated from the obtained parent-child data as shown in Table 44 below. Each application number is obtained from the parent's registration number and public number (the application number is acceptable, all number information has a document type, A registration, A1 disclosure, etc.) (US patent application number or application serial number The data shown in the following table is referred to as a unified number reference unit reference parent-child data. On the other hand, the national columns of the documents in the child and parent columns refer to the country of origin from which the document concerned is related (filed or registered). Such a country can be applied not only to the United States but also to various countries. In the following, the national column is omitted for convenience of explanation, but it should be understood that there is a national column. On the other hand, the notation of the application number may be any notation such as "two digits + / 6 digits" or "year + 6 digits ".

[표 44][Table 44]

child(출원번호)child (application number) parent(출원번호)parent (application number) 09/802,84709 / 802,847 08/451,47008 / 451,470 09/802,84709 / 802,847 08/816,20708 / 816,207 09/802,84709 / 802,847 09/055,92909 / 055,929 09/802,84709 / 802,847 08/969,96508 / 969,965 09/802,84709 / 802,847 09/113,74809 / 113,748 09/802,84709 / 802,847 09/152,00309 / 152,003 09/802,84709 / 802,847 09/309,89509 / 309,895 09/802,84709 / 802,847 09/124,47409 / 124,474 09/802,84709 / 802,847 09/236,46209 / 236,462 09/802,84709 / 802,847 09/138,78209 / 138,782 09/802,84709 / 802,847 09/73639309/736393

상기 표 44과 같은 데이터 처리는 주어진 문건 1개를 기준으로 생성하였으나, 상기와 같은 데이터 처리는 주어진 적어도 하나 이상의 문건 집합 단위로 그 문건 집합에 속한 개별 문건 단위로 처리할 수 있을 것이며, 그러한 결과는 표 45에 정리되어 있다. 이때, 개별 분건 집합 단위에서 parent(출원번호)의 중복을 허용할 수 있다.(즉, 1개의 문건 집합에 속하는 문건 1, 문건 2가 모두 인용 정보에 문건 3을 포함하고 있으면, 문건 3의 2회 중복을 허용할 수 있다는 것이다.) 즉, 문건 집합에 속하는 모든 개별 문건에 대한 통일화된 번호 기준 유닛 인용 parent-child 데이터를 생성할 수 있다. 이때, 가장 대표적인 문건 집합이 US 공개 출원 문건 집합 및 US 등록 특허 문건 집합이 될 수 있다. 사용자가 생성한(특정 검색식을 입력한 다음 검색 결과로 출력되는) 문건 집합 및 본 발명을 실시하는 전산 시스템(1)이 자동적으로 그 대상이 될 수 있음은 자명하다 할 것이다.Although the data processing as shown in Table 44 is generated based on a given document, the data processing as described above may be processed in units of individual documents belonging to the given document set in at least one document set unit, Table 45 summarizes these. At this time, it is possible to allow duplication of parent (application number) in the unit of individual denominations (ie, if documents 1 and 2 belonging to one document set contain document 3 in the reference information, It is possible to create a parent-child data citing a uniform number base unit for every individual document belonging to the set of documents. At this time, the most representative document set can be a set of US open application documents and a set of US registered patent documents. It will be obvious that a set of documents created by the user (outputting a search result after inputting a specific search expression) and the computerized system 1 implementing the present invention can be automatically targeted.

[표 45][Table 45]

아울러, 상기 통일화된 번호 기준 유닛 인용 parent-child 데이터에 포함된 모든 문건에 대해 특허 문건 마스트 DB(202)를 조회하면 하기 표 46와 같이 각 문건별로 서지사항을 추가한 데이터를 생성할 수 있을 것이다. 서지 사항의 추가는 서지 사항을 구성하는 모든 요소 중에서 선택된 어느 하나 이상일 수 있다. 하기 표 46는 각 출원번호에 1개씩만 대응되는 서지사항 중 출원일과 등록일이 추가된 데이터를 보여준다.In addition, if the patent document master DB 202 is searched for all the documents included in the parent-child data citing the unified number reference unit, the data with the bibliography added to each document can be generated as shown in Table 46 below . The addition of a bibliography may be any one or more of the elements constituting the bibliography. Table 46 below shows data added with the application date and the registration date among the bibliographies corresponding to only one application number.

[표 46][Table 46]

child 등록일 child registration date child출원일child filing date child출원번호child application number parent 출원번호parent application number parent출원일parent Filing Date parent 등록일parent Registration Date November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 08/451,47008 / 451,470 May 26, 1995May 26, 1995 October 21, 1997October 21, 1997 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 08/816,20708 / 816,207 March 12, 1997March 12, 1997 September 8, 1998September 8, 1998 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/055,92909 / 055,929 April 7, 1998April 7, 1998 March 28, 2000March 28, 2000 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 08/969,96508 / 969,965 November 25, 1997November 25, 1997 October 10, 2000October 10, 2000 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/113,74809 / 113,748 July 10, 1998July 10, 1998 November 13, 2001November 13, 2001 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/152,00309 / 152,003 September 11, 1998September 11, 1998 August 26, 2003August 26, 2003 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/309,89509 / 309,895 May 11, 1999May 11, 1999 October 7, 2003October 7, 2003 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/124,47409 / 124,474 July 29, 1998July 29, 1998 November 25, 2003November 25, 2003 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/236,46209 / 236,462 January 25, 1999January 25, 1999 November 25, 2003November 25, 2003 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/138,78209 / 138,782 August 24, 1998August 24, 1998 February 24, 2004February 24, 2004 November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/73639309/736393 December 15, 2000December 15, 2000

상기 표 46와 같은 데이터에 각 출원번호별로의 카운트 정보(청구항 수, 도면수, 패밀리 수, 발명자 수, 출원인 수 등 각종 카운팅 정보가 추가적으로 각 카운트 필드별로 추가될 수도 있을 것이다.) 각 출원번호별로 1개씩만 대응되는 사항에 대해서는 1개의 테이블로 생성하는 것이 바람직할 것이다.Count information (number of claims, number of drawings, number of families, number of inventors, number of applicants, and various counting information may be additionally added for each count field in the data as shown in the above Table 46) It is preferable to generate one table for items corresponding to only one item.

이어, 각 출원번호에 2개 이상의 데이터가 대응될 수 있는 필드값으로 상기와 같은 데이터를 생성할 수도 있다. 이들은 출원인, 발명자, 대리인, 특허 분류 기호 등이 될 수 있다. 특허 분류 기호에 대해서는 뒤에서 상술한다. 각 출원번호에 2개 이상의 데이터가 대응될 경우, 1개의 테이블에 모든 필요로 하는 필드값을 다 넣을 수도 있지만, 이렇게 되면 테이블의 크기가 크지는 문제점을 감수해야 한다. 예를 들어 하기 표 47에 있는 09/309,895호는 출원인이 AT'T Corp. (New York, NY)와 Sun Micro Systems (Palo Alto, CA)인데, 이 경우에는 이 출원번호와 관련되 는 아래 표 47과 같은 라인에 출원인 필드가 추가될 때, 하기 표 48과 같이 처리되어야 한다.Then, the above-mentioned data may be generated by using a field value in which two or more pieces of data can be associated with each application number. These may be applicants, inventors, agents, patent classification codes, and the like. The patent classification codes will be described later. If two or more data correspond to each application number, you can put all the required field values into one table, but you have to face the problem of large table size. For example, in U.S. Patent Application Serial No. 09 / 309,895, (New York, NY) and Sun Micro Systems (Palo Alto, Calif.), Where the applicant field is added to the line shown in Table 47 below with respect to this application number, .

[표 47][Table 47]

child 등록일 child registration date child출원일child filing date child출원번호child application number parent 출원번호parent application number parent출원일parent Filing Date parent 등록일parent Registration Date November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/309,89509 / 309,895 May 11, 1999May 11, 1999 October 7, 2003October 7, 2003

[표 48][Table 48]

child 등록일 child registration date child출원일child filing date child출원번호child application number parent 출원번호parent application number parent출원일parent Filing Date parent 등록일parent Registration Date 출원인Applicant November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/309,89509 / 309,895 May 11, 1999May 11, 1999 October 7, 2003October 7, 2003 AT'T CorpAT'T Corp November 27, 2007November 27, 2007 March 12, 2001March 12, 2001 09/802,84709 / 802,847 09/309,89509 / 309,895 May 11, 1999May 11, 1999 October 7, 2003October 7, 2003 Sun Micro SystemsSun Micro Systems

이처럼, 출원인이 2명이 있다면 행의 개수가 2배 늘어나게 된다. 한편, 상기 출원에 대하여 발명자가 6명이 있다면, 출원인과 발명자를 1개의 테이블에 보여줄 경우에는 12개의 행이 필요하게 된다.(2*6 = 12) 이때, (출원인, 발명자)쌍과 관련된 데이터를 제외한 모든 데이터에서 중복이 일어나게 되는 문제가 있다.(물론, 중복이 있을 경우, 분석 결과(예를 들면 연도별 출원인, 연도별 특정 출원인의 발명자 등)을 가져올 때는 테이블 조인이 없기 때문에 효율적이나, 테이블의 크기가 크지는 문제가 있다.)Thus, if there are two applicants, the number of rows will be doubled. On the other hand, if the applicant and the inventor are present in one table, 12 rows are required if the inventor has six applicants. (2 * 6 = 12) At this time, (Of course, there is a problem that duplication occurs in all the data except for the table.) (Of course, when there is duplication, when the analysis result (for example, the applicant for each year and the inventor of the specific applicant for each year) There is a problem in that the size of the recording medium is large.

참고로, 시간/날짜(date)를 기준으로 한 정보 처리는 롤업을 고려할 때는 다음가 같이 처리되는 것이 바람직하다. 예를 들어 March 12, 2001가 있는 경우, 2001년 3월 12일, 2001년 3월, 2001년 1분기, 2001년과 같이 세분화하여 처리하는 것이 롤업 연산 등에서 바람직하다. 하지만, 본 명세서에서는 표기(한정된 폭의 화 면에 수많은 각종 필드를 모두 표현하는) 문제로 이를 생략하나, 이러한 것은 당업자에게는 자명할 것이다.For reference, it is desirable that the information processing based on time / date is processed as follows when considering roll-up. For example, in the case of March 12, 2001, it is desirable to perform the roll-up operation, such as March 12, 2001, March 2001, 2001, and 2001. However, in the present specification, this is omitted because of the problem of representing (representing a large number of various fields on a limited width screen), which will be obvious to those skilled in the art.

한편, 특허 분류 기호와 관련하여, 하위 분류 기호 포함의 관점에서 상기 데이터의 처리를 설명한다. 먼저, 하기 표 49과 같이 child에 해당하는 09/802,847에 대한 데이터처리를 설명하며, 이는 parent에 속하는 문건에 대해서도 동일하게 적용된다. 09/802,847에는 IPC로 G06F 15/16과, H04N 7/16　이 있으며, USPC로 709/203 ; 709/217; 709/231; 725/135이 있다.On the other hand, with respect to the patent classification symbol, the processing of the data will be described from the viewpoint of inclusion of the sub-classification symbol. First, as shown in the following Table 49, data processing for 09 / 802,847 corresponding to child is explained, and the same applies to documents belonging to the parent. 09 / 802,847 has IPCs G06F 15/16 and H04N 7/16, USPC 709/203; 709/217; 709/231; 725/135.

[표 49][Table 49]

출원월2001년 3월Application March 2001 출원년2001Application year 2001 n dot subgroup(C(n+4))n dot subgroup (C (n + 4)) 1 dot subgroup(C5)1 dot subgroup (C5) IPC group(C4)IPC group (C4) IPC subclass(C3)IPC subclass (C3) IPC Class(C2)IPC Class (C2) IPC Section(C1)IPC Section (C1) child출원번호child application number 1One 1One G06F 15/16G06F 15/16 G06F 15/00G06F 15/00 G06FG06F G06G06 GG 09/802,84709 / 802,847 1One 1One H04N7/16H04N7 / 16 H04N 7/00H04N 7/00 H04NH04N H04H04 HH 09/802,84709 / 802,847

상기 표 49에서 C3는 IPC subclass 레벨을 말하며, 다른 것들도 이와 같은 체계임은 상기에서 설명하였다. 2001년 필드의 1은 2001년에 1건이 있다는 말이며, 2001년 3월에도 1이 있는 것은 월단위에서도 그 월에 1건이 있다는 것이며, 상기와 같은 것은 롤업을 고려한 데이터 구조에서는 통상적인 표기 방법이다. 상기의 G06F 15/16 및 H04N 7/16 모두는 1dot subgroup에 속하는 것이었지만, 만약 09/802,847 문건에, H04N 7/169이 포함되어 있다면, 상기 표 49은 다음 표 50와 같이 되었을 것이다.(본 발명 사상의 설명을 위해서 임의로 도입한 내용임)In Table 49, C3 denotes the IPC subclass level, and the others are the same as above. One field in 2001 refers to one in 2001, and there is one in March 2001, which means that there is one per month in month, and the above is a common notation in data structure considering roll-up . All of the above G06F 15/16 and H04N 7/16 belong to the 1dot subgroup, but if H04N 7/169 is included in the document 09 / 802,847, Table 49 above would be as shown in Table 50 below. Which was arbitrarily introduced for the purpose of explaining the inventive idea)

[표 50][Table 50]

출원월2001년 3월Application March 2001 출원년2001Application year 2001 3 dot(C7)3 dot (C7) 2 dot(C6)2 dot (C6) 1 dot subgroup(C5)1 dot subgroup (C5) IPC group(C4)IPC group (C4) IPC subclass(C3)IPC subclass (C3) IPC Class(C2)IPC Class (C2) IPC Section(C1)IPC Section (C1) child출원번호child application number 1One 1One 09/802,84709 / 802,847 1One 1One H04N7/169H04N7 / 169 H04N7/167H04N7 / 167 H04N7/16H04N7 / 16 H04N 7/00H04N 7/00 H04NH04N H04H04 HH 09/802,84709 / 802,847

USPC를 하위 분류 기호 포함의 관점에서 상기 데이터를 처리하면 하기 표 51과 같을 수 있다.The above data may be processed as shown in Table 51 from the viewpoint of inclusion of the USPC in the sub-classification code.

[표 51][Table 51]

출원월2001년 3월Application March 2001 출원년2001Application year 2001 USPC n dot level (C(n+4))USPC n dot level (C (n + 4)) USPC 1 dot level (C5)USPC 1 dot level (C5) USPC no dot level (C4)USPC no dot level (C4) USPC class(C3) USPC class (C3) Meta class (C2) Meta class (C2) Meta super class (C1)Meta superclass (C1) child출원번호child application number 1One 1One 709/203709/203 709/201709/201 709709 09/802,84709 / 802,847 1One 1One 709/217709/217 709709 09/802,84709 / 802,847 1One 1One 709/231709/231 709/230709/230 709709 09/802,84709 / 802,847 1One 1One 725/135725/135 725725 09/802,84709 / 802,847

USPC 709/203(타이틀 정보 : Client/server)는 직상위인 709/201(타이틀 정 보 : DISTRIBUTED DATA PROCESSING)에 있고, 이는 Class 709에 속한다. 다른 것들도 마찬가지이다. 상기의 USPC 등은 모두는 1dot 또는 no dot 레벨에 속하는 것이었지만, 만약 09/802,847 문건에, 725/45가 포함되어 있다면, 상기 표 51은 다음 표 52과 같이 되었을 것이다.(본 발명 사상의 설명을 위해서 임의로 도입한 내용임)USPC 709/203 (title information: Client / server) is in the top 709/201 (title information: DISTRIBUTED DATA PROCESSING), which belongs to Class 709. Other things are the same. All of the above USPCs and the like belong to the 1 dot or no dot level, but if the document No. 09 / 802,847 contains 725/45, Table 51 will be as shown in the following Table 52. Which is arbitrarily introduced for the purpose of the present invention)

[표 52][Table 52]

출원월2001년 3월Application March 2001 출원년2001Application year 2001 4 dot(C8)4 dot (C8) 3 dot(C7)3 dot (C7) 2 dot(C6)2 dot (C6) USPC 1 dot level (C5)USPC 1 dot level (C5) USPC no dot level (C4)USPC no dot level (C4) USPC class(C3) USPC class (C3) Meta class
(C2)Meta class
(C2) Meta super class(C1)Meta superclass (C1) child출원번호child application number 1One 1One 1One 1One 725/45725/45 725/44725/44 725/39725/39 725/38725/38 725/37725/37 725725 09/802,84709 / 802,847

IPC가 H04L 12/56, H04L 12/28, USPC가 370/395, 370/235가 할당되어 있다.(상기 IPC는 대한민국 특허청에서 부여한 것과 다를 수 있으나, 각 국가 단위로 데이터를 처리하므로, 미국 데이터의 처리는 미국 특허청에서 부여된 IPC로 처리하는 것이 바람직하다.) IPC is assigned H04L 12/56, H04L 12/28, USPC 370/395, and 370/235. (The above IPC may be different from that granted by the Korean Intellectual Property Office, Is preferably treated with the IPC granted by the USPTO.)

이때, 특허 분류 기호마다 1개의 라인(row)가 만들어지므로, IPC와 USPC는 1개의 테이블에 생성할 수도 있으나, 그렇지 않는 것이 바람직하다. 이는 특히, 복수개의 출원인, 복수개의 발명자 등과 같이 복수개가 있는 경우에는 더욱 더 그러하지만, 본질적으로 1개의 child에 여러 개의 parent가 있기 때문이다. 즉, 1개의 child에 k개의 parent가 있고, 1개 child에 IPC가 m개, USPC가 n개가 있고, 각각의 k개의 parent 문건마다에 각각 IPC와 USPC가 있을 것이므로, 이를 1개의 테이블에 모두 모아 놓는다면, 엄청나게 많은 라인이 필요할 것이며, 이때, 다수의 데이터가 중복되게 될 것이다. 이는 특히 문건 집합의 크기가 클 경우(예를 들어 미국 등록 특허 전체 집합 등)일 경우, 더욱 더 그러하다. 그러므로, IPC와 USPC를 분리하는 것이 바람직하다. 특허 분류 기호와 관련해서, 테이블 유형은 1) 인용 child-parent 데이터에 child쪽 특허 분류 기호에 대해서만 처리해 놓은 유형, 2) 인용 child-parent 데이터에 parent쪽 특허 분류 기호에 대해서만 처리해 놓은 유형 및 3) 인용 child-parent 데이터에 child쪽과 parent쪽 모두에 대해 특허 분류 기호에 대해서만 처리해 놓은 유형이 있을 수 있다. 3)의 경우에는 라인(row)의 수가 증가하는 문제점이 있으나, 테이블 조인의 가능성은 줄어드는 장점이 있다.At this time, since one line is created for each patent classification symbol, IPC and USPC can be created in one table, but it is preferable not to create one. This is especially true if there are a plurality of applicants, such as a plurality of inventors, etc., but there are essentially several parents in one child. That is, there are k parents in one child, m IPCs in one child, n USPCs, and there will be IPCs and USPCs in each k each parent document. If you do, you will need a huge number of lines, at which time a lot of data will be duplicated. This is even more so, especially if the size of the document set is large (eg, a full set of US patents). Therefore, it is desirable to separate IPC and USPC. In relation to the patent classification, the table type is: 1) the type that is processed only for the child patent data in the child-parent data, 2) the type processed for the parent patent data only in the child-parent data, and 3) There may be a type of child-parent data that has been processed only for the patent classification symbol for both the child and parent sides. 3), there is a problem that the number of rows increases, but the possibility of table joining is reduced.

즉, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 주어진/기설정된 문건 집합에 속하는 개별 문건별로 유닛 인용 parent-child 데이터를 생성하고, 상기 개개의 유닛 인용 parent-child 데이터에 대하여, child 및/또는 parent에 속하는 개별 문건의 서지사항 중 적어도 하나 이상을 입수하여 그 입수된 서지사항을 필드 내용으로 하는 다차원 분석용 연산 수행 결과 테이블 데이터를 생성해 놓을 수 있다.That is, the multi-dimensional analysis operation result table generation module 402 generates unit citation parent-child data for individual documents belonging to a given / set document set, and for each unit citation parent-child data, And / or a bibliography of individual documents belonging to the parent, and generates the multidimensional analysis operation result table data in which the obtained bibliography is the field contents.

상기 주어진/기설정된 문건 집합의 예로 1) 특정 국가의 출원 문건 또는 등록 문건 전체 집합, 2) 특정 IPC/USPC 중 어느 하나 이상으로 된 문건 집합, 3) 특정 출원인별 문건 집합, 4) 특정한 검색식 등으로 생성될 수 있는 검색 기반 생성 문건 집합, 5) 국가 통합 모든 출원 문건 또는 모든 등록 문건 전체 집합 등을 들 수 있다. Examples of the set of given / preset documents are 1) a complete set of application documents or registration documents in a particular country, 2) a set of documents in one or more of the specific IPC / USPCs, 3) a set of documents by specific applicant, 4) And 5) a complete set of all national application documents or all registration documents.

이때 상기 1) 하나의 국가 단위에서의 문건 전체 집합에 대하여 하기 표62와 같은 데이터가 생성되어 있으면 다음과 같은 효용이 있을 수 있다. 상기 데이터를 이루는 각 출원번호/등록번호에는 서지 사항이 대등되며, 그 서지 사항 중 어느 하나 이상이 상기 설명과 같이 결합되어 생성될 수 있음은 물론이다 하겠다.At this time, if the data as shown in Table 62 below are generated for the entire document set in the above 1) country unit, the following utility may be obtained. It will be appreciated that bibliographic data may be stored in each application number / registration number constituting the data, and any one or more of the bibliographic data may be combined and generated as described above.

[표 53][Table 53]

상기와 같은 데이터를 생성하는 방법은 모든 출원/등록 문건을 child 칼럼에 넣고, 각각의 문건마다 그 문건 인용하고 있는 parent문건 번호를 parent에 넣는 방법이다. parent값이 없는 경우에는 null값을 넣으며, 이 경우에는 child 칼럼에만 데이터가 있고, 그 child 문건에 대응되는 parent 문건이 없게 된다. 이 경우, child 칼럼에는 모든 문건 번호가 다 들어가 있게 된다. The method of generating the above data is to put all application / registration documents in the child column, and insert the parent document number which is citing the document for each document into the parent. If there is no parent value, a null value is inserted. In this case, there is only data in the child column, and there is no parent document corresponding to the child. In this case, the child column will contain all the document numbers.

설명의 편의를 위하여 가장 간단한 모델을 도입하자. 문건1, 문건 2, 문건 3, 문건 4가 모두 1개씩의 인용 정보를 포함하고 있고, 인용의 순서가 문건1 -> 문건2 -> 문건3 -> 문건 4라고 가정하고(문건 2는 문건 1을 인용하고, 문건 3은 문건 2를 인용하고 있고, 문건 4는 문건 3을 인용하고 있음을 의미함). 이때, 문건 2를 기준으로 하면(문건 2가 child 칼럼에 있음), 문건 1이 문건 2의 parent 칼럼에 존재하게 되며, 문건 3을 기준으로 하면(문건 3이 child 칼럼에 있음), 문건 2가 parent 칼럼에 존재하게 되며, 문건 4를 기준으로 하면 문건 3이 parent 칼럼에 있게 된다. 즉, 문건 1~4가 모두 child 칼럼에 있으며, parent 칼럼에는 적어도 문건 1~3이 있게 된다. 인용의 깊이(문건 4를 기준으로 할 때, 문건 1은 인용 깊이 3, 문건 2는 인용 깊이 2, 문건 3은 인용 깊이 1이 된다.)를 고려할 때, 문건 3과 관련된 전방 인용(backward citation, 문건 3에 포함된 인용 정보로 생성되는 문건 집합)에서 인용 깊이 2까지는 문건 2 및 문건 1이 되며, 후방 인용(forward citation) 깊이 1에는 문건 4가 있게 된다. 이 경우, 문건 1~4가 모두 child에 있기 때문에 어느 문건을 기준으로도 인용 깊이 1의 인용 문건을 parent 칼럼에서 파악할 수 있고, 파악된 인용 문건 번호를 다시 child에서 찾고, 그 찾은 child의 parent를 찾으면, 인용 깊이 2의 인용 문건을 찾을 수 있게 된다. 이러한 모델을 표 54로 설명하면 다음과 같다.For simplicity, let's introduce the simplest model. (1), (2), (3), and (4) contain one citation information, and the order of citation is 1 -> document 2 -> document 3 -> document 4 , Document 3 refers to document 2, and document 4 refers to document 3). If document 2 is the base (document 2 is in the child column), document 1 is in the parent column of document 2, document 3 is the base (document 3 is in the child column), document 2 parent "column, and based on document 4, document 3 is in the parent column. That is, documents 1 through 4 are all in the child column, and at least the documents 1 through 3 are in the parent column. The depth of the quotation (when document 4 is used, document 1 has a depth of 3, document 2 has a depth of 2, document 3 has a depth of 1), backward citation, The document set generated from the citation information included in document 3) will be document 2 and document 1 from depth 2, and document 4 will be at depth 1 for forward citation. In this case, since documents 1 to 4 are all in child, it is possible to grasp the citation document with depth 1 in the parent column based on any document, find the cited document number again in the child, find the parent of the found child If you do, you will be able to find a citation document with a citation depth of two. These models are described in Table 54 as follows.

[표 54][Table 54]

child(출원번호)child (application number) parent(출원번호)parent (application number) 22 1One 33 22 44 33

상기에서는 문건 1개로 설명하였지만, 이러한 것은 문건 집합에 대해서도 마찬가지임은 자명하다 할 것이다. 이 역시 하기 표 55와 같이 모델로 설명한다. 문건집합3을 기준으로 설명한다.(하기 표 55에서 각 셀(라인 내의 내용)은 실제 다차 원 분석용 연산 수행 결과 테이블의 셀 데이터가 아니라 개념상의 여러 라인의 셀 데이터가 하나의 집합을 이룬다는 의미로 해석되어야 하며, 본 발명 사상의 설명의 편의를 위한 것이다.)In the above, one document has been explained, but it will be obvious that this also applies to the document set. This is also described by a model as shown in Table 55 below. (The contents in the line in the following Table 55 are not cell data of the operation result table of the actual multi-dimensional circle analysis but the cell data of several lines of the concept are one set) And should be interpreted in the sense of the present invention for convenience of explanation.)

[표 55][Table 55]

child(출원번호)child (application number) parent(출원번호)parent (application number) 문건집합2 Document set 2 문건집합 1(문건집합 2의 모든 parent로 생성되는 집합)Document set 1 (set generated by all parent of document set 2) 문건집합 3(특정 속성3을 공유함)Document set 3 (share specific attribute 3) 문건집합 2(문건집합 3의 모든 parent로 생성되는 집합)Document set 2 (set generated by all parent of document set 3) 문건집합 4(문건집합 3의 모든 child로 생성되는 집합)Document set 4 (set of all children of document set 3) 문건집합 3(문건집합 4의 모든 parent로 생성되는 집합)Document set 3 (set generated by all parent of document set 4)

상기 표 55에서 문건집합 1~4는 모두 child 칼럼의 전체집합의 부분집합이 되게 되며, 문건집합 1~3은 모두 parent 칼럼의 전체집합의 부분집합이 된다. 부분집합이 된다는 말은 그 집합에 속하는 모든 문건이 전체집합에 소속되기 때문에, 전체집합에서 확인가능하며 추출 가능하다는 것이다. 문건 집합이 가지는 속성은 임의의 속성일 수 있으나, 1)출원인 명의, 2) 발명자 명의, 3) 각 단계별 IPC 또는 USPC, 4) 국가, 5) 대리인, 6) 기간 범위(출원일/등록일 범위), 7) 문건의 상태(출원중, 등록, 거절 등) 또는 8) 이들 중 적어도 어느 하나 이상의 조합이 일치하는 것 등이 될 수 있다. In Table 55 above, document sets 1 through 4 are all a subset of the entire set of child columns, and document sets 1 through 3 are all a subset of the entire set of parent columns. Subsetting means that all documents belonging to the set belong to the entire set, so they can be identified and extracted from the entire set. The attributes of the document set may be arbitrary attributes, but may be any of the following: 1) the name of the applicant; 2) the name of the inventor; 3) IPC or USPC at each stage; 4) 7) the status of the document (filing, registration, rejection, etc.), or 8) a combination of at least one of these may coincide.

예를 들면, 출원인 Samsung Electronics의 IPC H01L에 속하며, 미국에서 등록된 모든 등록 문건을 문건 집합을 문건 집합 3이라 가정하면, 상기 문건 집합3에 관련된 전방 인용 깊이 1의 모든 인용 문건들로 이루어진 문건 집합2를 추출할 수 있을 것이다. 한편, 상기 문건 집합3에 관련된 후방 인용 깊이 1의 모든 피인용 문 건들로 이루어진 문건집합 4 및, 상기 문건 집합3에 관련된 전방 인용 깊이 2의 문건집합1을 입수할 수 있게 된다. 이때, 상기 문건 집합 1, 문건 집합 2 및 문건 집합 4를 분석할 수 있게 된다. 상기 분석의 대상으로는 1)전방/후방 인용 총량, 2) 전방/후방 다인용 출원인, 3) 전방/후방 다인용 발명자, 4) 전방/후방 다인용 IPC/USPC(각 특허 분류 기호 체계 상의 각 단계별, 드릴 다운은 이미 설명한 바와 같다.), 5)전방/후방 다인용 개별 문건을 찾을 수 있을 것이다. 한편, 상기 1)의 경우, 총량을 연도별/기설정된 기간별로 나누어 표시할 수 있을 것이며, 각종 숫자를 기초로 하여 증감율, 증감속도를 알 수도 있을 것이다. 상기 2)의 경우 다인용 출원인별로 전방/후방 인용량을 총량 및/또는 연도별/기설정된 기간별로 나누어 표시할 수 있을 것이며, 이는 3)~5)에서도 마찬가지일 것이다. 상기 문건 집합 3을 기준으로 분석 대상으로 상리 2) 중 후방 인용 다출원인을 선택하면, 상기 Samsung Electronics의 IPC H01L에 속는 미국 등록 문건을 인용하고 있는 후출원 출원인에 관한 정보를 알 수 있을 것이다. 이러한 후출원 출원인 집합에는 Samsung Electronics가 포함되어 있을 수 있으며, 이는 자기 인용(self citation)이 되며, 상기 자기 인용의 수치값을 다른 수치값으로 나누거나 기설정된 처리하면 자기 인용에 대한 여러가지 분석도 가능할 것이다. 또한, 후출원 출원인을 랭킹별로 보게 되면, 어느 출원인이 상기 Samsung Electronics의 IPC H01L에 속하는 문건을 많이 인용하고 있는지를 알 수 있게 된다. 물론, 상기 분석 결과(후방 인용수)에는 관련된 문건을 입수할 수 있는 링크가 걸려 있으며, 사용자가 그 링크를 클릭하면 관련된 문건(후방 인용 문건)을 특허 문건 마스트 DB(202)에서 입수하여 사용자에게 제 공해 줄 수 있을 것이다.For example, suppose that the document set belonging to the applicant Samsung Electronics, IPC H01L, and all registered documents registered in the United States are set to document set 3, a set of documents consisting of all quoted documents of the depth 1 2 will be extracted. On the other hand, it is possible to obtain the document set 4 consisting of all the citation motions of the backward reference depth 1 related to the document set 3 and the document set 1 of the front reference depth 2 related to the document set 3. At this time, the document set 1, the document set 2, and the document set 4 can be analyzed. 2) front / rear patent applicants; 3) front / rear patent applicants; 4) front / rear patent IPC / USPCs (each of the patent classification schemes); Step-by-step, drill-down has already been explained), and 5) you will find individual documents for front / rear multiple. On the other hand, in case of 1) above, the total amount may be divided by year / predetermined period, and the increase / decrease rate may be known based on various numbers. In case of 2) above, the amount of front / rear quotation for each applicant for multiple applicants will be divided by the total amount and / or year / period, which will be the same in 3) to 5). If the multiple applicants for the backward quotation of 2) are selected based on the document set 3 as the analysis target, information on the applicant who has filed the US registered document belonging to the Samsung Electronics IPC H01L will be known. This set of later applicants may include Samsung Electronics, which is a self-citation, and can be used to divide the numerical value of the self-citations into different numerical values, or to perform various analyzes of self-citations by pre- will be. In addition, if the applicant of the latter application is viewed by ranking, it is possible to know which applicant is quoting a lot of documents belonging to the Samsung Electronics IPC H01L. If the user clicks the link, a related document (backward reference document) is obtained from the patent document master DB 202, and the user inputs the related document (backward reference document) You will be able to provide it.

한편, 하나의 문건 집합이 주어졌을 때(예를 들어 문건 집합3) 상기 문건 집합3의 parent인 문건 집합 2를 찾고, 상기 문건 집합 2에서 기설정된 조건(예를 들면, 최근 10년 내에 출원된 문건 등의 기간 조건 적용)을 만족하는 문건 집합 2'''를 생성하고, 상기 문건 집합2'''를 분석 대상 문건 집합으로 하여 기설정된 상기 적어도 하나 이상의 분석을 수행하고, 그 분석 결과를 생성할 수 있음은 당연하다 할 것이다.On the other hand, when one document set is given (e. G. Document set 3), the document set 2 which is the parent of the document set 3 is searched for, and in the document set 2, , And a document set 2 '' 'that satisfies a periodic condition such as a document is applied to the document set 2' '', and performs the at least one predetermined analysis by using the document set 2 '' 'as an analysis target document set, It is natural to be able to do.

인용 분석 대상 문건 집합의 4종류 구성Four types of document set for citation analysis

4가지 종류4 types

한편, 인용 분석과 관련할 때, 4가지 종류의 분석 대상 문건 집합이 가능할 것이다. 하나의 기준 문건 집합이 주어질 때, 인용과 관련하여 다음 4가지 인용 분석 대상 문건 집합이 확정된다.On the other hand, when citation analysis is concerned, there will be four kinds of documents to be analyzed. When a set of reference documents is given, the following four set of citation analysis documents are determined with reference to the citation.

도 38는 기준 문건 집합에 대하여 인용 분석과 관련하여 4가지 종류의 분석의 대상이 되는 문건 집합이 생길 수 있음을 보여 주고 있다. 도 38에는 "자사가 인용한 타사 특허(자->타)", "자사를 인용한 타사 특허(타->자)", 타사가 인용한 자사특허(타->자)", "타사를 인용한 자사특허(자->타)"가 나와 있는데, 이들 각각은 하기 첫째 내지 넷째에 순차적으로 대응된다.FIG. 38 shows that there may be a set of documents to be subjected to the four kinds of analysis regarding the citation analysis for the reference document set. FIG. 38 shows an example in which "the third party patent (the third party) quoted by the company", "the third party patent (the third party) quoting the company", the third party's patent (the third party) quoted by the third party " Quot; patent (quot; - > ta) ", respectively, each of which is sequentially corresponded to the first to fourth.

첫째, 상기 기준 문건 집합에 포함된 개별 기준 문건이 인용하고 있는 전체 전방(backward) 인용 문건 집합이다. 상기 전체 전방 인용 문건 집합은 기준 문건 각각에 포함되어 있는 인용 정보(reference cited)에 대응되는 모든 문건의 합집합 으로 확정될 수 있을 것이다.(이때, 상기 합집합은 중복을 제거한 합집합일 수도 있지만, 중복을 허용하고 있는 합집합일 수도 있다. 예를 들면, #1번 문건과 #2번 문건이 a문건을 인용 정보로 포함하고 있을 때, a는 카운팅의 입장에서는 2번 인용된 중요 문건일 수 있으므로, 중복을 허용하는 것이 바람직하다.(다차원 분석용 연산 수행 결과 테이블에서 parent에 동일 레코드가 중복되어 존재한다. 즉, parent a에 관계된 레코드가 #1, #2에 기인한 2개 존재함). 한편, 단순히 a의 존재(출원인, 날짜, 기타 서지 사항)의 입장에서는 중복을 제거하는(합집합 연산을 수행하여) 것이 바람직할 것이다. 본 발명의 분석에서는 카운팅이 중심이 되므로, 중복을 허용하는 것이 더 타당하다.)First, it is a set of backward cited documents cited by the individual reference documents contained in the reference document set. The total set of forward quoted documents may be determined as a union of all documents corresponding to the reference cited included in each of the reference documents. (In this case, the union may be a union obtained by removing duplicates, For example, if a document # 1 and a document # 2 contain a document a as citation information, a may be an important document quoted twice in the counting position, so duplicate (There are duplicate records in the parent in the multi-dimensional analysis operation result table, that is, there are two records related to the parent a due to # 1 and # 2). On the other hand, It would be desirable to simply remove redundancy (by performing a union operation) in the presence of a (applicant, date, and other bibliography). In the analysis of the present invention, , It is more reasonable to allow duplication.)

둘째, 상기 기준 문건 집합에 포함된 개별 기준 문건을 인용하고 있는 전체 후방(forward) 인용 문건 집합이다. 상기 전체 후방 인용 문건 집합을 구하는 방법은 1) 개별 기준 문건을 인용하고 있는 문건 번호를 검색하고, 이를 합하는 방법 또는 2) 상기 전체 특허 문건에 대하여 생성한 child-parent 테이블에서 기준 문건 번호를 parent 칼럼에서 찾고, parent 칼럼에 있는 기준 문건 번호에 대응되는 child 문건 번호(2개 이상 있을 수 있음)를 child 칼럼에서 찾아서 이들을 합하는 방법이 있다. 2)의 방법이 더욱 바람직하다.Second, it is a set of forward quotation documents citing individual reference documents included in the reference document set. The method for obtaining the total document set of backward quotation is as follows: 1) a document number citing the individual reference document is searched and summed, or 2) a reference document number is generated in the child-parent table generated for the entire patent document, , Find the child document number (which may be more than one) corresponding to the reference document number in the parent column in the child column, and sum them together. 2) is more preferred.

셋째, 전체 후방 인용 관련 기준 문건 집합이다. 이 문건 집합은 기준 문건 집합을 구성하는 모든 기준 문건 중에서 타 문건에 의해 인용을 당한 적이 있는 모든 기준 문건으로 구성되는 문건 집합이다. 이 문건 집합은 상기 둘째 문건 집합의 parent로 구성되는 문건 집합이다.Third, it is a set of reference documents related to the entire backward citation. This set of documents is a set of documents consisting of all the reference documents that constitute the reference document set and all the reference documents that have been quoted by other documents. This document set is a document set consisting of the parent of the second document set.

넷째, 전체 전방 인용 관련 기준 문건 집합이다. 이 문건 집합은 기준 문건 집합을 구성하는 모든 기준 문건 중에서 인용 정보를 포함하고 있는 기준 문건으로 구성되는 문건 집합이다. 이 문건 집합은 상기 첫째 문건 집합의 child로 구성되는 문건 집합이다.Fourth, it is a set of reference documents related to the entire forward citation. This set of documents is a set of documents consisting of reference documents containing cited information among all the reference documents constituting the reference document set. This document set is a document set consisting of the children of the first document set.

상기 도 39의 최상단 탭에는 1) "특허 문건 전체 집합에 대한 인용분석", 2) "기술영역별 인용 정밀분석", 3) "발명자별 인용 분석" 및 4) "다인용 개별 문건별 인용 분석"이 있는데, 이들은 모두 기준 문건 집합의 확정과 관련있다. 상기 1)"특허 문건 전체 집합에 대한 인용분석"은 출원인 또는 본 발명의 문건 집합의 확정과 관계된 모든 문건 확정 방법 중에서 선택되는 어느 하나에 의해 확정된 기준 문건 집합으로서의 특허 문건 전체 집합을 의미한다. 상기 2) "기술영역별 인용 정밀분석"은 상기 특허 문건 전체 집합을 특허 분류 기호(IPC, USPC 등)의 레벨별 특허 분류 기호 기준 랭킹별로 정리된 특허 분류 기호별로 세분화된 문건 집합을 상기 기준 문건 집합으로 한다는 것을 의미한다. 상기, 3) "발명자별 인용 분석"은 상기 특허 문건 전체 집합에서 발명자를 추출한 다음, 랭킹을 계산하고, 랭킹이 높은 발명자별로 세분화된 문건 집합을 상기 기준 문건 집합으로 한다는 것을 의미한다. 상기 4) "다인용 개별 문건별 인용 분석"은 상기 특허 문건 전체 집합에 포함된 모든 문건에 대하여 인용/피인용 개수를 조사한 다음, 인용/피인용 랭킹이 높은 문건 을 추출하고, 개개의 문건을 기준 문건 집합으로 한다는 것을 의미한다.The uppermost tab in FIG. 39 includes 1) "citation analysis for a total set of patent documents" 2) "precise analysis of citation by technology area" 3) "citation analysis by inventor" 4) "citation analysis for individual documents "All of which are related to the establishment of a set of reference documents. The above 1) "citation analysis of the entire set of patent documents" means a complete set of patent documents as a set of reference documents determined by any of the document selection methods related to the applicant or the determination of the document set of the present invention. The above 2) "Precise Analysis of Citation by Technology Area" is a process for classifying the entire set of patent documents into a set of documents classified by patent classification symbol sorted by the patent classification symbol reference rank by level of the patent classification symbol (IPC, USPC, It means that it is a set. The above-mentioned 3) "citation analysis by inventor" means that the inventor is extracted from the entire set of the patent documents, and then the ranking is calculated, and the set of documents classified by the inventor having the highest ranking is used as the reference document set. The above-mentioned 4) "Analysis of citation by individual document for multiple documents" is to examine the number of documents cited / cited for all documents included in the entire set of documents, extract documents with high citation / pile ranking, It means that it is a set of reference documents.

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 기준 문건 집합이 확정되면, 상기 기준 문건 집합에 대응하여 상기 인용 분석 대상 문건 집합으로서의 1) 전체 전방 인용 문건 집합, 2) 전체 후방 인용 문건 집합, 3) 전체 전방 인용 관련 기준 문건 집합, 및/또는 4) 전체 후방 인용 관련 기준 문건 집합을 확정하고, 상기 1)~4) 중 어느 하나 이상의 문건 집합에 대하여 본 명세서에서 기술하고 있는 다차원 분석용 연산 수행 결과 테이블을 생성하고, 이를 저장한다.When the reference document set is determined, the multidimensional analysis operation result table generation module 402 generates a document set of 1) an entire forward reference document as the document set to be analyzed for the reference corresponding to the reference document set, 2) And / or 4) a set of related reference documents for the entire backward reference, and a multidimensional analysis described herein for any one or more of the document sets 1) - 4) And stores the result table.

인용 분석의 결과Results of citation analysis

본 발명의 상기 분석 모듈은 상기 확정된 1) 전체 전방 인용 문건 집합, 2) 전체 후방 인용 문건 집합, 3) 전체 전방 인용 관련 기준 문건 집합, 및/또는 4) 전체 후방 인용 관련 기준 문건 집합별을 대상으로 한 상기 분석 주제별로의 다차원 분석용 연산 수행 결과 테이블에 대하여 기설정된 분석식으로 분석 결과를 입수한다. 한편, 상기 분석 모듈은 상기 다차원 분석용 연산 수행 결과 테이블이 아닌 상기 1)~4) 중 어느 하나 이상의 문건 집합에 대한 에서 본 발명의 명세서에서 기술하고 있는 각종 분석을 수행할 수도 있지만, 이 경우에는 다차원 연산 결과를 얻을 수 없거나, 이러한 다차원 연산 결과를 얻기 위해 전산 자원을 많이 소모해야 하는 문제점이 있어 크게 바람직하지는 않다. The analytical module of the present invention may include a set of documents identified by 1) an entire forward citation, 2) a set of full backward references, 3) a set of reference documents related to an entire forward citation, and / or 4) The analysis result is obtained by a predetermined analysis formula on the operation result table for the multi-dimensional analysis by the subject of the analysis. Meanwhile, the analysis module may perform various analyzes described in the specification of the present invention for any one or more of the document sets 1) to 4) other than the multidimensional analysis operation result table, but in this case, There is a problem that the result of the multidimensional operation can not be obtained or the computer resources are consumed in order to obtain the result of the multidimensional operation.

상기 분석 모듈은 상기 1) ~4)의 문건 집합별로 생성되는 상기 다차원 분석 용 연산 수행 결과 테이블에 접근하여 기설정된 분석식으로 목적하는 인용 분석 결과를 추출해 낸다. 상기 목적하는 인용 분석 결과에는 다음과 같은 것들 있을 수 있다.The analysis module accesses the multidimensional analysis operation result table generated for each document set of 1) to 4), and extracts a desired citation analysis result in a predetermined analysis formula. The results of the analysis of the desired citation may include the following.

첫째, 출원인을 중심으로 한 것으로 i) 출원인 랭킹, ii) 출원인별의 적어도 하나 이상의 특허 분류 기호 체계에서의 적어도 하나 이상의 레벨별 특허 분류 기호 단위의 랭킹, iii) 다발명자 랭킹, iv) 문건 출현 빈도 기준 랭킹 및/또는 이들 각각에 대하여 시간 차원 또는 다른 차원으로 롤업 연산한 수치값 데이터 및/또는 이들 각각에 대응되는 문건 번호에 관한 정보를 추출할 수 있다. 즉, 상기 1) ~4)의 인용 분석 대상 문건 집합에 대하여 서지 사항에 포함된 각종 필드별로의 랭킹값 및/또는 이들의 시간 또는 다른 차원으로 롤업 연산한 값 및/또는 각종 값에 해당하는 구체적인 개별 문건 자체를 입수하기 위한 문건 번호에 대한 정보를 생성할 수 있을 것이다.I) ranking of patent categorizers by at least one level in at least one or more patent categorization systems by applicant, iii) multiple inventor ranking, iv) document appearance frequency, The reference rankings and / or the numerical value data rolled up into a time dimension or another dimension for each of them, and / or information on the document number corresponding to each of them. That is, for the citation analysis target document set of 1) to 4) described above, a ranking value for each field included in the bibliographic data and / or a value obtained by roll- It will be able to generate information on the document number to obtain the individual document itself.

둘째, 특허 분류 기호를 중심으로 한 것으로 i) 적어도 하나 이상의 특허 분류 기호 체계에서의 적어도 하나 이상의 레벨별 특허 분류 기호 단위의 랭킹, ii) 레벨별 특허 분류 기호에 속하는 출원인, iii) 레벨별 특허 분류 기호에 속하는 발명자 들의 랭킹, iv) 레벨별 특허 분류 기호에 속하는 문의 문건 출현 빈도 기준 랭킹 및/또는 이들 각각에 대하여 시간 차원 또는 다른 차원으로 롤업 연산한 수치값 데이터 및/또는 이들 각각에 대응되는 문건 번호에 관한 정보를 추출할 수 있다.(Ii) the applicant belonging to the patent classification code per level, (iii) the patent classification by level, (iii) the patent classification by level, Iv) numerical value data of the inquiry document belonging to the level-specific patent classification symbol and / or numerical value data which is rolled up into a time dimension or another dimension for each of them, and / or a document corresponding to each of them Information about the number can be extracted.

즉, 상기 1) ~4)의 인용 분석 대상 문건 집합에 대하여 서지 사항에 포함된 각종 필드별로의 랭킹값 및/또는 이들의 시간 또는 다른 차원으로 롤업 연산한 값 및/또는 각종 값에 해당하는 구체적인 개별 문건 자체를 입수하기 위한 문건 번호에 대한 정보를 생성할 수 있을 것이다.That is, for the citation analysis target document set of 1) to 4) described above, a ranking value for each field included in the bibliographic data and / or a value obtained by roll- It will be able to generate information on the document number to obtain the individual document itself.

셋째, 출원인 속성(출원인의 종류(기업, 대학, 개인 등), 기업일 경우 재무 속성, 기업 평가 지표 등)을 기준으로 한 것으로, i) 출원인의 종류, ii) 매출액 등과 같은 기업 규모 평가 지표별, iii) 연평균 이익율 등과 같은 기업 재무 평가 요소별 랭킹 및/또는 이들 각각에 대하여 시간 차원 또는 다른 차원으로 롤업 연산한 수치값 데이터 및/또는 이들 각각에 대응되는 문건 번호에 관한 정보를 추출할 수 있다. 즉, 상기 1) ~4)의 인용 분석 대상 문건 집합에 대하여 서지 사항에 포함된 각종 필드별로의 랭킹값 및/또는 이들의 시간 또는 다른 차원으로 롤업 연산한 값 및/또는 각종 값에 해당하는 구체적인 개별 문건 자체를 입수하기 위한 문건 번호에 대한 정보를 생성할 수 있을 것이다.Third, based on the attributes of the applicant (applicant's type (enterprise, university, individual, etc.), financial property, corporate evaluation index, etc.), i) type of applicant, ii) , iii) rankings of corporate financial evaluation elements such as an annual average profit rate, and / or numerical value data rolled up into a time dimension or another dimension for each of these, and / or information about a document number corresponding to each of them . That is, for the citation analysis target document set of 1) to 4) described above, a ranking value for each field included in the bibliographic data and / or a value obtained by roll- It will be able to generate information on the document number to obtain the individual document itself.

넷째, 상기 첫째 내지 셋째에서 수치값 데이터가 있을 때 이들의 증감율, 증감 속도 등의 변화값에 대한 데이터들이다.Fourth, when there are numerical value data in the above first to third data, there are data on the change values such as the rate of increase / decrease, the rate of increase / decrease.

상기 인용 분석을 위하여 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성하는 (출원인 인용 기준) 인용 분석용 다차원 분석용 연산 수행 결과 테이블은 주로 출원인이 자신의 문건에서 인용한 특허 문건을 대상으로 생성되었다. 한편, 상기 심사관 인용 문건에 대해서도 동일한 다차원 분석용 연산 수행 결과 테이블을 생성해 놓을 수 있음은 자명할 것이다. 즉, 심사관이 심사 과정에서 특정 출원 문건(child 문건이 된다)에 대하여 심사 과정에서 인용한 문건을 상기 출원 문건의 parent 문건으로 보게 되면, 상기 본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)의 모든 작용과 그 결과물로서의 (심사관 인용 기준) 인용 분석용 다차원 분석용 연산 수행 결과 테이블 및 상기 결과물을 이용하는 방법은 상기에서 출원인 인용 문건 기준으로 한 것과 완전히 동등하게 된다. 한편, 통합적인 인용 분석을 위하여, 하나의 출원 문건을 기준으로 출원인 인용 문건(parent)과 심사관 인용 문건(parent 2)을 합집합 연산하여 통합 인용 문건집합(parent-combined)을 생성할 수 있게 되고, 상기 통합된 인용 문건 집합을 대상으로 상기에서 출원인 인용 문건을 기준으로 한 것과 동일한 처리를 수행하면, 동등한 결과물을 얻게 되고, 상기 동등한 결과물을 이용하는 방법도 동등하게 된다. 본 문단에서 설명한 것은 출원인 인용 문건 기준으로 설명한 바를 참고하면, 당업자에게 자명하므로, 별도로 장황하게 중복하여 설명하지는 않는다.The multidimensional analysis operation result table generated by the multidimensional analysis operation execution result table generation module 402 (cited applicant reference) for citation analysis is mainly used for a patent document cited from the applicant's own document . On the other hand, it will be obvious that the same multidimensional analysis calculation execution result table can be generated for the above-mentioned examiner's citation document. That is, if the examiner refers to a document cited in the examination process for a specific application document (a child document) during the examination process as a parent document of the application document, the multidimensional analysis operation execution result table generation module 402 ) And the result of the operation (table of examiner quotation) citation analysis multi-dimensional analysis operation result table and the method of using the result are completely equivalent to those based on the applicant's citation document. Meanwhile, for an integrated citation analysis, it is possible to generate a parent-combined document by performing a union calculation of the applicant's citation document and the examiner's citation document (parent 2) based on a single application document, If the same process as that based on the applicant's citation document is performed on the integrated citation document set in the above, an equivalent result is obtained, and a method of using the equivalent result is also equivalent. The description in this paragraph is self-explanatory to those skilled in the art with reference to the description based on the document of the applicant's citation, so it is not separately redundantly described.

이어 도면을 참조하면서 더욱 상세하게 설명한다.This will be described in more detail with reference to the drawings.

도 38은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 인용 총량에 대한 연도별 분석 결과에 대한 일실시예적 도면이다. 도 38에서는 총량탭이 선택되어 있음을 보여준다. 본 명세서에 첨부된 도면 중 분석 화면에 대한 구현예적 실시예에서 각종 탭들이 나오는데, 상기 탭의 글씨가 굻게 되어 있는 것은 선택된 탭이라는 것을 의미한다. 상기 탭의 종류는 기본적으로 1) 출원 문건 기준이냐 등록 문건 기준이냐와 같은 문건 집합의 확정 시의 문건 집합의 속성에 관한 기준, 2) 출원/등 록 총량 기준, 점유율 기준, 집중율 기준, 활동율 기준 또는 기타 분석 지표별 기준, 3) 대한민국, 미국, 일본, 유럽 등과 같은 국가 확정과 관계된 기준, 총량, 출원인, 발명자, 기술, 개별 문건 등과 같은 분석의 목적이나 주체에 관한 기준, 5) 각종 수치값, 수치값의 증감율, 수치값의 증감 속도에 대한 기준, 6) IPC, USPC, FT, FI, ECLA 등과 같은 특허 분류 기호의 선택과 각 선택된 특허 분류 기호의 레벨에 관한 기준 등이 있을 수 있다.38 shows a set of backward quoted documents of the present invention when all the US patent applications of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system 1 of the present invention are set as reference documents, , Which is an example of the results of yearly analysis of the total amount of citations. 38 shows that the total amount tab is selected. In the illustrated embodiment of the present invention, various tabs are displayed in the exemplary embodiment of the present invention. The types of the tabs are basically 1) criteria for the property of the document set at the time of establishing the document set such as the standard of the application document or the document of the registration document, 2) the standard of the application / registration total amount, 3) standards related to the determination of countries such as the Republic of Korea, the United States of America, Japan, and Europe, and the criteria for the purpose and subject of analysis, such as applicants, inventors, 6) criteria for the selection of patent classification symbols such as IPC, USPC, FT, FI, ECLA, etc. and the level of each selected patent classification symbol. have.

도 39는 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 출원인에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.39 is a diagram showing a set of backward quoted documents according to the present invention when all the US patent applications of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system 1 of the present invention are set as reference documents, , Which is an exemplary embodiment of the analysis result of the applicant of the present invention by year.

도 40은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 특허 분류 기호(IPC 메인 그룹 레벨)에 대한 연도별 분석 결과에 대한 일실시예적 도면이다. 이때, 상기 IPC들은 드릴 다운될 수 있음은 물론이며, 이때에도 드릴 다운과 관계된 하위 특허 분류 기호에 대한 인용 분석 결과가 추출되어 사용자에게 제공된다. 도 41은 그러한 것을 보여 준다.FIG. 40 shows a set of backward quoted documents of the present invention when all the US patent applications of Samsung Electronics Co., Ltd. among all the applicants in the database held by the patent information system 1 of the present invention are set as reference documents, (IPC main group level) according to an embodiment of the present invention. At this time, the IPCs can be drilled down. At this time, the citation analysis result of the sub-patent classification related to the drill-down is extracted and provided to the user. Figure 41 shows that.

도 41은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 특허 분류 기호(IPC 메인 그룹 레벨)에 드릴 다운했을 때를 반영한 연도별 분석 결과에 대한 일실시예적 도면이다.41 is a diagram showing a set of backward quoted documents according to the present invention when all the US patent applications of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system 1 of the present invention are set as reference documents, (IPC main group level), which is a multi-patent classification code (IPC main group level).

도 42는 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 발명자에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 42 shows a set of backward quoted documents of the present invention when all the US patent applications of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system 1 of the present invention are set as a reference document set, , Which is an example of an analysis result of the inventor by year for each year.

도 43은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 최다 인용 출원인에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 43 is a diagram showing a set of documents of the present invention, which is a set of reference documents of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system (1) , Which is an exemplary embodiment of the analysis results of the most cited applicants by year.

도 44는 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 최다 인용 발명자에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 44 is a diagram showing a set of documents of the present invention which are forward references of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system 1 of the present invention is set as a reference document set, , Which is an example of the results of yearly analysis of the most cited inventors.

도 45는 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, IPC 메인 그룹 별 최다 인용 기술에 대한 분석 중에서 IPC에 드릴 다운 했을 시의 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 45 is a diagram showing a set of documents of the present invention, which is a set of reference documents of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. is set as a reference document set among all applicants in the DB held by the patent information system 1 of the present invention, , Which is an exemplary embodiment of the analysis results of years when drilling down to the IPC among the analysis of the most cited technologies per IPC main group.

도 46은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, USPC 서브 클래스(no dot, 클래스 직하위) 최다 인용 기술에 대한 분석 중에서 드릴 다운 했을 시의 연도별 분석 결과에 대한 일실시예적 도면이다.46 shows a set of forward quotations of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system 1 of the present invention is set as a reference document set, (No dot, subclass subclass). The results of the analysis are summarized as follows.

도 47은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 전체 후방 인용 관련 기준 문건 집합을 분석 대상 문건 집합으로 한, 총량 기준 분석 결과 및 이 분석 결과에 대한 본 발명의 리포팅 모듈(406) 중 차트 생성 모듈(406-2)이 생성한 차트의 일 실시예적 도면이다. 본 발명의 리포팅 모듈(406)은 표를 생성하는 표 생성 모듈(406-1), 차트를 생성하는 차트 생성 모듈(406-2), 그래프를 생성하는 그래프 생성 모듈(406-3), 리포트를 생성하는 리포트 생성 모듈(406-4) 중 어느 하나 이상으로 구성된다.47 shows a set of reference documents related to the entire rear citation of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system 1 of the present invention is set as a reference document set, And a chart generated by the chart generation module 406-2 of the reporting module 406 of the present invention with respect to the analysis result. The reporting module 406 of the present invention includes a table generating module 406-1 for generating a table, a chart generating module 406-2 for generating a chart, a graph generating module 406-3 for generating a graph, And a report generation module 406-4 for generating a report.

도 48은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 전체 후방 인용 관련 기준 문건 집합을 분석 대상 문건 집합으로 한, 인용을 많이 받은 발명자의 연도별 인용 총량에 관한 분석 결과 및, 분석 결과에 보이는 특정한 숫자를 클릭했을 때, 그 숫자에 해당하는 문건 집합에 대하여 간이한 분석 결과(문건 목록, 최다 출원인의 연도별 출원수/등록수, 최다 발명자의 연도별 출원수/등록수, 최대 기술 분야별(IPC, USPC, FT) 연도별 출원수/등록수(드릴다운 포함))를 제 공해 주는 간이 분석 모듈(407)이 생성한 문건 목록에 대한 일실시예적 도면이다.48 shows a set of reference documents related to the entire rear citation of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. among all the applicants in the DB held by the patent information system 1 of the present invention is set as a reference document set, , The results of the analysis on the total number of citations by the inventors who received a lot of citations and the results of the simple analysis on the set of documents corresponding to the number when the specific number shown in the analysis result is clicked A simple analysis module 407 for providing the number of applications / registration by year, the number of applications / registration by year of the most inventors, and the number of applications / registration numbers (including drill-down) by the maximum technology field (IPC, USPC, FT) FIG. 8 is an exemplary diagram of a list of generated documents. FIG.

도 49는 본 발명의 간이 분석 모듈(407)에서도 최대 기술 분야별(IPC, USPC, FT)로 드릴 다운 기능을 제공함으로 보여주는 일실시예적 도면이다.49 is an exemplary diagram showing a drill-down function according to the maximum technology field (IPC, USPC, FT) in the simplified analysis module 407 of the present invention.

도 50은 본 발명의 특허 정보 시스템(1)이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 출원 문건 중 다출원 IPC 서브클래스 단위의 문건을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 인용 총량에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 50 is a diagram showing the relationship between the backward and forward directions of the present invention when all of the US application documents of Samsung Electronics Co., Ltd. among all the applicants in the DB held by the patent information system 1 of the present invention are documents of multi- ) Is an example of an analysis result of the total amount of citations, which is a set of documents to be analyzed, by year.

특허 기술 분류별 기술 분석용 다차원 분석용 연산 수행 결과 테이블Patent analysis result for multi-dimensional analysis

이어서, 특허 기술 분류 활용 기술 분석에 대해서 설명한다. 특허 기술 분류 활용 기술 분석이란, 본 발명의 문건 집합 중에서 특허 분류 기호를 통해서 확정되는 문건 집합에 대한 분석을 의미한다. 특허 분류 기호별로 생성되는 특허 기술 분류 활용 기술 분석용 문건 집합의 종류는 1) 특정 특허 분류 기호의 종류(IPC, USPC, FT, ECLA 등)에서 선택되는 적어도 하나 이상의 특허 분류 기호를 포함하는 문건으로 생성되는 문건 집합(2 이상의 특허 분류 기호가 개입되는 경우 이들간의 (OR 연산, AND 연산, NOT 연산 등 각종 연산을 포함한다.), 2) 2 종류 이상의 특허 분류 기호를 포함하여 생성되는 문건 집합(IPC AND USPC 등)이 있다. 이들 모두는 1) 개별 국가 단위, 2) 국가 통합 단위일 수 있으며, 국가 단위로 중복은 제거(합집합 연산의 결과)되어야 하며, 3) 메인 특허 분류 기호만을 대상으로 할 것인지, 모든 특허 분류 기호를 포함하여 처리할 것인지는 기설정된 정책 또는 사용자의 선택에 따라 다를 것이며, 이들은 모두 문건 집합의 확정과 관련되어 있다.Next, we will explain the technical analysis of the utilization of patent technology classification. Patent Technology Classification Utilization Analysis refers to the analysis of a set of documents determined through patent classification symbols among the document sets of the present invention. Patent Classification Techniques Generated by Patent Classification Classes The types of document classifications used for analysis include 1) documents containing at least one patent classification code selected from the types of specific patent classification codes (IPC, USPC, FT, ECLA, etc.) (Including various operations such as OR operation, AND operation and NOT operation) when two or more patent classification codes are intervened, (2) a set of documents generated including two or more kinds of patent classification symbols IPC AND USPC). All of these can be 1) individual country units, 2) national integrated units, and national redundancy should be eliminated (as a result of union), 3) whether to target only the main patent classification code, Whether or not to include it will depend on the pre-established policy or user's choice, all of which are related to the confirmation of the document set.

특허 기술 분류 활용 기술 분석(이하, 기술 분석이라 한다.)에서 인출할 수 있는 정보는 상기 특허 기술 분류 활용 기술 분석용 문건 집합(이하, 기술 분석용 문건 집합)을 대상으로, 상기 문건 집합에 대한 1) 점유율, 2) 집중율, 3) 활동율 및 4) 기타 특허 분석 지표에 대한 정보이다. 이때, 상기 1) ~4)를 위한 데이터 처리에서 주어진 특허 분류 기호의 하위 특허 분류 기호에 대응되는 문건들을 분석 대상 문건 집합에 포함함은 당연할 것이다.The information that can be extracted from the patent technology classification utilization technology analysis (hereinafter, referred to as the technology analysis) is a patent document classification analysis technique document analysis set (hereinafter referred to as a technology document collection set) 1) market share, 2) concentration rate, 3) activity rate, and 4) information on other patent analysis indicators. At this time, it is natural that the documents corresponding to the sub-patent classification symbol of the given patent classification symbol in the data processing for the above 1) to 4) are included in the analysis target document set.

특허 기술 분류 활용 기술 분석 중 먼저 총량 분석부터 설명한다. 총량 분석은 특허 기술 분류 기호의 레벨별 기간별/연도별 출원수/등록수 등과 같은 양적인 관점에서의 분석 결과를 제공하기 위한 것이며, 특정 특허 분류 기호의 하위 특허 분류 기호가 포함된 문건까지 롤업 연산되어 그 다차원 연산된 결과가 저장된다.(특정 특허 분류가 포함되어 있는 특허 문건의 수 및 그 하위 특허 분류가 포함된 특허 문건의 수가 총합되어(중복이 제거됨은 당연할 것이다.) 계산된 수치이다.) 하기 표 58는 USPC에 대한 출원 총량의 연도별 분포에 관한 다차원 분석용 연산 수행 결과 테이블의 일 실시예적 구현 형태이다. 상기 USPC 002048000은 2/48, 002049100은 2/49.1과 같다. 이는 USPC의 표기법의 문제이다.(앞의 세자리는 class number, 뒤의 6자리를 1000으로 나누었을 때 나온 숫자를 "/" 좌우에 배치하면 통상 공보에 표기되는 USPC가 된다.)In the analysis of patent technology classification, firstly, the analysis of the total amount will be described. The total amount analysis is intended to provide a quantitative analysis result such as the number of applications / registration numbers per period / period by level of the patent technical classification code, and the documents including the sub-patent classification code of the specific patent classification code are rolled up, (The number of patent documents containing a specific patent classification and the number of patent documents containing the sub-patent classification are summed up (it is natural that redundancy is eliminated). Table 58 below is an exemplary implementation of an operation result table for multidimensional analysis on the yearly distribution of the total number of applications for the USPC. USPC 002048000 is 2/48, and 002049100 is 2 / 49.1. This is a matter of the notation of the USPC. (The first three digits are the class number, and the 6 digits after the second digit are divided by 1000.

[표 58][Table 58]

ano dotano dot 1 dot1 dot 2dot2dot 3 dot3 dot 4 dot4 dot 0101 0202 0303 0404 0505 0606 002048000002048000 002049100002049100 002049200002049200 44 1One 66 33 1One 88 002048000002048000 002049100002049100 002049200002049200 44 1One 66 33 1One 88 002048000002048000 002049100002049100 002049200002049200 44 1One 66 33 1One 88 002048000002048000 002049100002049100 002049200002049200 44 1One 66 33 1One 88 002048000002048000 002049100002049100 002049300002049300 00 1One 1One 00 1One 00 002048000002048000 002049100002049100 002049300002049300 00 1One 1One 00 1One 00 002048000002048000 002049100002049100 002049300002049300 00 1One 1One 00 1One 00 002048000002048000 002049100002049100 002049300002049300 00 1One 1One 00 1One 00 002048000002048000 002049100002049100 002049400002049400 002049500002049500 00 1One 33 1One 1One 22 002048000002048000 002049100002049100 002049400002049400 002049500002049500 00 1One 33 1One 1One 22 002048000002048000 002049100002049100 002049400002049400 002049500002049500 00 1One 33 1One 1One 22 002048000002048000 002049100002049100 002049400002049400 1One 22 66 00 22 44 002048000002048000 002049100002049100 002049400002049400 1One 22 66 00 22 44

상기에서 입수 받은 특정 USPC에서 출원수/등록수를 알고 싶다면, 그 특정 USPC 레벨(class to n dot) 칼럼에서 연도별/기간별로 다차원 연산된 숫자 또는 합계를 획득하면 된다.If you want to know the number of applications / registrations in a specific USPC obtained above, you can obtain the numbers or sums that have been calculated by year / period in the specific USPC level (class to n dot) column.

한편, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 출원인별 다단계 특허 분류 기호별 분석을 수행하기 위하여 하기 표 59과 같은 다차원 분석용 연산 수행 결과 테이블 데이터를 생성해 놓을 수 있다.The multidimensional analysis calculation execution result table generation module 402 may generate the multidimensional analysis calculation result table data as shown in Table 59 below in order to perform analysis according to the applicant's multi-stage patent classification symbol.

[표 59][Table 59]

IAppNameIAppName C3C3 C4C4 C5C5 C6C6 C7C7 C8C8 0505 AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/10C12N 15/10 　　 22 AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/11C12N 15/11 C12N 15/12C12N 15/12 　 1One AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/11C12N 15/11 C12N 15/31C12N 15/31 C12N 15/33C12N 15/33 1One AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/11C12N 15/11 　　 22 AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/10C12N 15/10 　　 22 AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/11C12N 15/11 C12N 15/12C12N 15/12 　 1One AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/11C12N 15/11 C12N 15/31C12N 15/31 　 1One AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/11C12N 15/11 　　 22 AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/10C12N 15/10 　　 22 AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 C12N 15/11C12N 15/11 　　 44 AA C12NC12N C12N 15/09C12N 15/09 C12N 15/00C12N 15/00 　　　 66 AA C12NC12N 　 C12N 15/00C12N 15/00 　　　 66 AA C12NC12N 　　　　　 66 BB

상기 표 59과 같은 데이터가 있을 때, C12N 15/00를 기준으로 다출원인을 추출하는 경우, C12N 15/00(IPC 서브그룹은 C4 레벨이라 약칭한다.)은 IPC C4 레벨을 기준으로 자기 자신 및 그 하위 특허 분류 기호에 관한 문건 카운트 정보까지 롤업 연산해 놓은 것은 GID 15이므로, 상기 테이블에서, 조건식에 "GID = 15 and IPC = C12N 15/00 and IPC 레벨 = C4"라고 명령하면, 이 조건에 맞는 출원인별 문건 수(GID 15행에 나온 문건 수)가 나온다. In the case of the data shown in the above Table 59, when extracting multiple applicants based on C12N 15/00, C12N 15/00 (the IPC subgroup is abbreviated as C4 level) The GID 15 that has rolled up the document count information about the subpopulation classification symbol is given as GID 15, so that when the conditional expression "GID = 15 and IPC = C12N 15/00 and IPC level = C4" The number of documents per eligible applicant (number of documents in GID 15) is shown.

상기와 같은 총량 카운트 정보가 있을 경우, 점유율, 집중율, 활동율 등과 같은 분석 지표별로 상기 다차원 분석용 연산 수행 결과 테이블과 같은 데이터를 생성해 놓을 수 있음은 전술한 바와 같은나, 일 실시예로 하기와 같은 다차원 분석용 연산 수행 결과 테이블 데이터를 제시한다. 한편, 점유율 등 기타 특허 지표에 대해서도 동등한 연산을 수행할 수 있음은 당업자에게 자명할 것이다.When there is the total amount count information as described above, data such as the operation result table for the multidimensional analysis may be generated for each analysis index such as the occupancy rate, the concentration rate, the activity rate, etc. As described above, The following table data of the operation result for the multidimensional analysis is presented. It will be apparent to those skilled in the art that equivalent operations can be performed on other patent indices such as market share.

융합 분석용 다차원 분석용 연산 수행 결과 테이블Multidimensional analysis operation result table for fusion analysis

이하, 융합 분석을 위한 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성하는 다차원 분석용 연산 수행 결과 테이블 데이터에 대해서 설명한다.Hereinafter, multidimensional analysis operation result table data generated by the multidimensional analysis operation result table generation module 402 for fusion analysis will be described.

본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 본 발명의 융합 분석을 위하여 하기와 같은 다차원 분석용 연산 수행 결과 테이블 데이터를 생성한다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 입력되거나 설정되거나 확정되는 임의의 특허 문건 집합을 입수 받은 다음, 상기 문건 집합에 포함되어 있는 1종류의 특허 분류 기호가 복수개가 결합되어 있을 때(IPC만 있을 때는 IPC가 2개 이상, IPC와 USPC가 있을 때는 IPC가 2개 이상이면 처리하고, 하나만 처리하지 않으며, USPC도 2개 이상일때만 USPC 다차원 분석용 연산 수행 결과 테이블로 처리한다.), 상기 그 종류의 특허 분류 기호에 대하여 하기 표 72와 같은 데이터를 생성한다.The multi-dimensional analysis operation result table generation module 402 of the present invention generates operation result table data for multidimensional analysis as follows for the fusion analysis of the present invention. The multidimensional analysis operation result table generation module 402 receives an arbitrary set of patent documents inputted, set or determined, and then, when a plurality of patent classification symbols included in the document set are combined (If there is only IPC, there are two or more IPCs. If there are IPCs and USPCs, the process is performed if there are two or more IPCs. , And generates data as shown in the following Table 72 for the above-mentioned patent classification symbol.

[표 72][Table 72]

문건 번호Article number AppNameAppName Main
C1Main
C1 Main CnMain Cn Sub 1
C1Sub 1
C1 Sub 1
CmSub 1
Cm Sub i
C1Sub i
C1 Sub i
CkSub i
Ck date date 기타 Etc 1One AA 22 BB

하나의 특허 문건에는 IPC, USPC, FI, FT, ECLA와 같은 특허 분류 기호의 종류 중 적어도 하나 이상이 포함되며, 상기 특허 분류 기호가 적어도 하나 이상 정보가 포함되어 있다. 상기 표 72와 같은 데이터는 문건 번호를 기준으로 필수적인 Main 특허 분류 기호와 선택적인 Sub 특허 분류 기호가 1개 이상 있을 수 있음을 보여 주고 있다.(상기에서 Main 특허 분류 기호의 종류와 Sub 특허 분류 기호의 종류는 같은 종류이다.) 이때, 상기 문건에 포함된 특허 분류 기호가 문건에 포함되어 있는 자기 자신 특허 분류 기호 및 자신의 모든 상위 특허 분류 기호를 나열하고 있음을 알 수 있다. 자기 자신의 주(main) 특허 분류 기호가 IPC 2 dot 서브클래스에 해당하는 것이라고 가정한다면, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 특허 분류 기호 마스트 DB(203)를 참조하여 주 특허 분류 기호가 C6 레벨임을 확정하고, 상기 주 특허 분류 기호를 C6 레벨에 입력하고, 좌측의 C5 레벨에는 1 dot 레벨의 상기 주 특허 분류 기호의 직상위 특허 분류 기호를 입력하며, C4 레벨에는 메인그룹 레벨의 특허 분류 기호를 입력하고, 이러한 과정을 최상위 레벨까지 일련으로 입력한다. 상기 특허 문건에 포함된 적어도 하나 이상의 서브 IPC에 대해서도 상기 메인 IPC에서와 마찬가지로 처리한다. 이때, 출원인 등과 같은 다른 서지 사항 데이터들은 상기 문건 번호를 통해서 입수할 수 있는 정보이 므로, 이러한 것들은 선택적인 요소이다.One patent document includes at least one of the types of patent classification symbols such as IPC, USPC, FI, FT and ECLA, and the patent classification symbol includes at least one information. The data as shown in the above Table 72 shows that there can be one or more essential Main classification codes and optional Sub classification codes based on the document number. The patent classification code included in the document lists the own patent classification code included in the document and all the upper patent classification codes of the patent classification code included in the document. Assuming that its own main patent classification symbol corresponds to the IPC 2 dot subclass, the multidimensional analysis operation execution result table generation module 402 refers to the patent classification symbol mast DB 203, The classification symbol is the C6 level, the main patent classification symbol is inputted to the C6 level, the upper right patent classification symbol of the main patent classification symbol of 1 dot level is inputted to the left C5 level, Enter the patent classification code for the level, and enter the process up to the top level in a series. At least one sub IPC included in the patent document is processed in the same manner as in the main IPC. At this time, since other bibliographic data such as the applicant are information available through the document number, these are optional elements.

대한민국 특허출원 제10-2005-0111868호에는 2006년 01월 자로 H04B 7/26 및 H04B 7/15가 부여되어 있다. 이를 통해서 표 60을 통하여 예시적으로 설명한다.(표기의 편의상 메인그룹 이하에서의 특허 분류 기호에서 서브클래스 부분은 생략한다.Korean Patent Application No. 10-2005-0111868 discloses H04B 7/26 and H04B 7/15 as January 2006. (For the sake of convenience, the subclass portion in the patent classification symbol in the main group and below is omitted.

[표 60][Table 60]

문건 번호Article number AppNameAppName Main
C1Main
C1 C2C2 C3C3 C4C4 C5C5 C6C6 Sub
C1Sub
C1 C2C2 C3C3 C4C4 C5C5 C6C6 기타Etc 10-2005-011186810-2005-0111868 삼성전자Samsung HH H04H04 H04BH04B 7/007/00 7/247/24 7/267/26 HH H04H04 H04BH04B 7/007/00 7/147/14 7/157/15

만약 Sub IPC가 2개 이상이면, 하기 표 61와 같이 상기 H04B 7/15관련된 정보의 우측에 병렬적으로 동일한 방식으로 처리한다. 대한민국 특허출원 제10-2006-0012606호에는 2006년 01월 자로 부여된 H04B 7/04, H04B 7/155, H04Q 7/30 특허 분류 기호가 있는데, 이를 통해서 예시적으로 설명한다. (표기의 편의상 AppName 등은 생략하며, H04Q 7/30에 관한 것은 아래 줄에 표기한다.)If there are two or more Sub IPCs, the same processing is performed in parallel on the right side of the H04B 7/15 related information as shown in Table 61 below. Korean Patent Application No. 10-2006-0012606 discloses a patent classification code H04B 7/04, H04B 7/155, and H04Q 7/30 granted on Jan. 2006, which will be described by way of example. (AppName etc. are omitted for convenience, and H04Q 7/30 is indicated on the line below.)

[표 61][Table 61]

문건 번호Article number Main C1Main C1 C2C2 C3C3 C4C4 C5C5 C6C6 Sub 1
C1Sub 1
C1 C2C2 C3C3 C4C4 C5C5 C6C6 C7C7 10-2006-001260610-2006-0012606 HH H04H04 H04BH04B 7/007/00 7/027/02 7/047/04 HH H04H04 H04BH04B 7/007/00 7/147/14 7/157/15 7/1557/155 Sub2 C1Sub2 C1 C2C2 C3C3 C4C4 C5C5 C6C6 C7C7 HH H04H04 H04QH04Q 7/007/00 7/207/20 7/307/30 ....

하기 표 62는 대한민국 특허 출원 제 10-2005-0042032호는 H04B 7/02 및 H04B 7/14를 특허 분류 기호 정보로 포함하고 있다. Korean Patent Application No. 10-2005-0042032 includes H04B 7/02 and H04B 7/14 as patent classification information.

[표 62][Table 62]

문건 번호Article number AppNameAppName Main
C1Main
C1 C2C2 C3C3 C4C4 C5C5 C6C6 Sub
C1Sub
C1 C2C2 C3C3 C4C4 C5C5 C6C6 기타Etc 10-2005-004203210-2005-0042032 에스케이텔레콤SK Telecom HH H04H04 H04BH04B 7/007/00 7/027/02 HH H04H04 H04BH04B 7/007/00 7/147/14

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)을 통하여 상기 다차원 분석용 연산 수행 결과 테이블 데이터와 같은 것을 USPC, FT 등에 대해서도 마찬가지로 생성할 수 있음은 당업자에게 자명할 것이다.It will be apparent to those skilled in the art that through the multidimensional analysis operation execution result table generation module 402, the same operations as the multidimensional analysis operation result table data can be similarly generated for USPC, FT, and the like.

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 레벨별 복수 특허 분류 기호의 일련 정보로부터 하기 표 63에 포함된 것과 같은 복수 특허 분류 기호쌍 정보(pair information)을 생성한다. 상기 복수 특허 분류 기호쌍의 종류는 상기 동종 특허 분류 기호 관계성 전처리 모듈의 설명에서 상술한 바 있다. 하기 표 63에서 출원일 등과 같은 1개만이 나올 수 있는 서지 사항은 옆으로 얼마든지 신설하여 포함시켜 놓을 수 있다.(예를 들면 등록일 등) 하지만, 공동 출원인이나 공동 발명자 등과 같을 경우에는 별도의 행을 신설해야 하는 것은 전술한 바와 같다. 즉, 출원인이나 발명자만 다른 동일한 레코드가 생기며, 이와 같은 정보를 기반으로 출원인별 융합 분석 및 발명자별 융합 분석이 가능해 진다.The multidimensional analysis calculation execution result table generation module 402 generates a plurality of patent classification symbol pair information as shown in Table 63 from the sequence information of the plurality of patent classification symbols for each level. The types of the plurality of patent classification symbols are described in the description of the same type of patent classification related preprocessing module. In Table 63, only one bibliography, such as the filing date, may be newly added to the side (for example, the date of registration). However, in the case of a joint applicant or co-inventor, What should be newly established is as described above. In other words, the same record is generated only by the applicant or the inventor. Based on such information, it is possible to analyze the applicant's convergence and the inventor's convergence.

본 모델을 설명하기 위하여 특허 문건이 대한민국 특허출원 제10-2005-0111868호, 대한민국 특허출원 제10-2006-0012606호, 및 대한민국 특허 출원 제 10-2005-0042032호 3개만이 있다고 가정하자.(물론, 입수 되는 모든 문건에 대해서 반복적으로 처리하면 모든 문건 집합을 기준으로 한 모든 특허 분류 기호 조합 (Ai, Bj)에 대한 하기와 같은 데이터의 생성이 가능할 것이다.) 이때, 특허 분류 기호 조합 (Ai, Bj)에 다음과 같은 테이블의 생성이 가능할 것이다. 특허 분류 기호 조합은 대한민국 특허출원 제10-2005-0111868호에서 4개 쌍, 대한민국 특허출원 제10-2006-0012606호에서는 (메인 IPC , 서브 IPC) 쌍에서 22쌍(6+16), (서브 IPC, 서브 IPC) 쌍에서 20쌍 및 대한민국 특허 출원 제 10-2005-0042032호에서는 1쌍이 나오며, 3 특허 출원에서 47개의 쌍이 나온다. 이들 쌍을 모두 표현하는 것도 당연히 가능하나, 설명의 편의상 일정 부분을 생략한 하기 표 63과 같은 테이블로 전체 47개에 대한 설명에 갈음하는 방식으로 본 발명 사상을 설명한다.To explain this model, suppose that there are only three patent documents: Korean Patent Application No. 10-2005-0111868, Korean Patent Application No. 10-2006-0012606, and Korean Patent Application No. 10-2005-0042032. Of course, it is possible to generate the following data for all patent classification combinations (Ai, Bj) based on all document sets by repeatedly processing all the documents obtained. , Bj) will be able to generate the following table. The combination of the patent classification symbols is 22 pairs (6 + 16) in the pair (4 pairs in Korean patent application No. 10-2005-0111868) and 22 pairs (6 + 16) in the Korean patent application No. 10-2006-0012606 IPC, and Sub IPC) pairs, and one pair in Korea Patent Application No. 10-2005-0042032, and 47 pairs in three patent applications. Although it is naturally possible to express both of these pairs, the concept of the present invention will be described in a manner of replacing the description with respect to all 47 by the table as shown in Table 63 below in which a certain portion is omitted for convenience of explanation.

[표76][Table 76]

특허 분류 기호 조합 (Ai, Bj)Patent classification symbol combination (Ai, Bj) 문건 번호Article number 출원인Applicant 출원일Filing date 가중치 정책 1 적용Apply weighting policy 1 가중치 정책 2 적용Apply weighting policy 2 (H04B 7/15 ,H04B 7/26)(H04B 7/15, H04B 7/26) 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One (H04B 7/15,H04B 7/24)(H04B 7/15, H04B 7/24) 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One (H04B 7/14,H04B 7/26)(H04B 7/14, H04B 7/26) 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One (H04B 7/14,H04B 7/24)(H04B 7/14, H04B 7/24) 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One (H04B 7/155,H04B 7/04)(H04B 7/155, H04B 7/04) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.12.04December 4, 2006 1/31/3 0.75/20.75 / 2 (H04B 7/155,H04B 7/02)(H04B 7/155, H04B 7/02) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 (H04B 7/15,H04B 7/04)(H04B 7/15, H04B 7/04) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 (H04B 7/15,H04B 7/02)(H04B 7/15, H04B 7/02) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 (H04B 7/14,H04B 7/04)(H04B 7/14, H04B 7/04) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 (H04B 7/14,H04B 7/02)(H04B 7/14, H04B 7/02) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 (H04B 7/14,H04B 7/02)(H04B 7/14, H04B 7/02) 10-2005-004203210-2005-0042032 에스케이텔레콤SK Telecom 2006.12.04December 4, 2006 1One 1One (H04Q 7/00, H04B 7/00)(H04Q 7/00, H04B 7/00) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.25/10.25 / 1 (H04Q 7/00, H04B)(H04Q 7/00, H04B) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.25/10.25 / 1 (H04Q,H04B)(H04Q, H04B) 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.25/10.25 / 1

상기 특허 분류 기호의 조합(Ai, Bj) 방식으로 데이터를 생성할 경우, 소팅이나 인덱스 처리 및/또는 롤업 연산에 장애가 있을 수 있다. 그러므로, 개념적으로 특허 분류 기호의 조합을 하기 표 64과 같이 처리하는 것이 더욱 바람직하며, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 복수 특허 분류 기호쌍 정보를 다음과 같이 생성할 수도 있을 것이다.When data is generated in the combination of the above-described patent classification symbols (Ai, Bj), there may be obstacles in sorting, indexing, and / or roll-up operations. Therefore, it is more preferable to conceptually process combinations of patent classification symbols as shown in Table 64 below, and the multi-dimensional analysis operation result table generation module 402 may generate the plural patent classification symbol pair information as follows There will be.

[표 64][Table 64]

From Main IPCFrom Main IPC From Sub IPC From Sub IPC 문건 번호Article number 출원인Applicant 출원일Filing date 가중치 정책 1 적용Apply weighting policy 1 가중치 정책 2 적용Apply weighting policy 2 H04B 7/15 H04B 7/15 H04B 7/26H04B 7/26 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One H04B 7/15H04B 7/15 H04B 7/24H04B 7/24 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One H04B 7/14H04B 7/14 H04B 7/26H04B 7/26 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One H04B 7/14H04B 7/14 H04B 7/24H04B 7/24 10-2005-011186810-2005-0111868 삼성전자Samsung 2006.12.04December 4, 2006 1One 1One H04B 7/155H04B 7/155 H04B 7/04H04B 7/04 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.12.04December 4, 2006 1/31/3 0.75/20.75 / 2 H04B 7/155H04B 7/155 H04B 7/02H04B 7/02 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 H04B 7/15H04B 7/15 H04B 7/04H04B 7/04 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 H04B 7/15H04B 7/15 H04B 7/02H04B 7/02 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 H04B 7/14H04B 7/14 H04B 7/04H04B 7/04 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 H04B 7/14H04B 7/14 H04B 7/02H04B 7/02 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.75/20.75 / 2 H04B 7/14H04B 7/14 H04B 7/02H04B 7/02 10-2005-004203210-2005-0042032 에스케이텔레콤SK Telecom 2006.12.04December 4, 2006 1One 1One H04Q 7/00H04Q 7/00 H04B 7/00H04B 7/00 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.25/10.25 / 1 H04Q 7/00H04Q 7/00 H04BH04B 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.25/10.25 / 1 H04QH04Q H04BH04B 10-2006-001260610-2006-0012606 삼성전자Samsung 2006.02.092006.02.09 1/31/3 0.25/10.25 / 1

통계 처리와 분석의 목적 상 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 하기 표 65과 같이 상기 복수 특허 분류 기호쌍 정보를 생성하는 것이 더욱 바람직할 것이다.(기재의 편의상 개개의 문건 번호는 표시하지 않았으나, 위의 표 64에 있는 문건 번호의 순서와 완전 동일하다. 그리고, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성하는 하기 표 65과 같은 다차원 분석용 연산 수행 결과 테이블 데이터에서 문건 번호의 서지 사항(출원인, 발명자, 출원일, 등록일 등의 기타 서지 사항에 관한 정보도 함께 포함되는 것이 바람직할 것이다.) (하기에서 M/S는 특허 분류 기호가 메인 IPC에서 기인한 것이면 M, 서브 IPC에서 기인한 것이면 S라고 표기한다는 것을 의미하는 필드이다. 이는 융합의 기준이 가급적 Main IPC를 축으로 파악하는 것이 바람직하기 때문이다.)For the purpose of statistical processing and analysis, the multidimensional analysis operation execution result table generation module 402 preferably generates the plural patent classification symbol pair information as shown in Table 65 below. (For convenience of description, But it is the same as the sequence of the document numbers in Table 64. The multidimensional analysis operation result table generation module 402 generates the multidimensional analysis operation result table In the data, it is desirable that the bibliographic number of the document number (including the applicant, the inventor, the filing date, the date of registration, and other bibliographic information such as the registration date) M, and a sub-IPC, it is a field that means that it is represented by S. This indicates that the convergence criterion is preferably the axis of Main IPC It is because that preferred.)

[표 65][Table 65]

C3C3 C4C4 C5C5 C6C6 M/SM / S 문건번호Article number M/SM / S C1C1 C2C2 C3C3 C4C4 H04BH04B 7/007/00 7/147/14 7/157/15 SS H04B 7/15 H04B 7/15 H04B 7/26H04B 7/26 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 7/157/15 SS H04B 7/15H04B 7/15 H04B 7/24H04B 7/24 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 SS H04B 7/14H04B 7/14 H04B 7/26H04B 7/26 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 SS H04B 7/14H04B 7/14 H04B 7/24H04B 7/24 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 7/157/15 7/1557/155 SS H04B 7/155H04B 7/155 H04B 7/04H04B 7/04 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 7/157/15 7/1557/155 SS H04B 7/155H04B 7/155 H04B 7/02H04B 7/02 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 7/157/15 SS H04B 7/15H04B 7/15 H04B 7/04H04B 7/04 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 7/157/15 SS H04B 7/15H04B 7/15 H04B 7/02H04B 7/02 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 SS H04B 7/14H04B 7/14 H04B 7/04H04B 7/04 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 SS H04B 7/14H04B 7/14 H04B 7/02H04B 7/02 MM HH H04H04 H04BH04B 7/007/00 H04BH04B 7/007/00 7/147/14 SS H04B 7/14H04B 7/14 H04B 7/02H04B 7/02 MM HH H04H04 H04BH04B 7/007/00 H04Q H04Q 7/007/00 SS H04Q 7/00H04Q 7/00 H04B 7/00H04B 7/00 MM HH H04H04 H04BH04B 7/007/00 H04QH04Q 7/007/00 SS H04Q 7/00H04Q 7/00 H04BH04B MM HH H04H04 H04BH04B H04QH04Q SS H04QH04Q H04BH04B MM HH H04H04 H04BH04B

2개 이상의 특허 분류 기호가 포함되어 있는 모든 특허 문건을 대상으로 한 상기와 같은 융합 정보 포함 다차원 분석용 연산 수행 결과 테이블 데이터가 있는 경우, 하기와 같은 정보를 추출/계산해 낼 수 있다.When there are table data of the operation result table for multidimensional analysis including the above-mentioned fusion information targeting all patent documents including two or more patent classification codes, the following information can be extracted / calculated.

첫째, 특허 분류 기호가 입수되었을 때의 주체 정보 미활용 융합 분석이다. 하나의 특허 분류 기호가 입수되거나 주어졌을 때, 상기 특허 분류 기호와 잘 융합하는(쌍으로 만들어 지는 빈도가 높은) 특허 분류 기호의 랭킹을 찾을 수 있을 것이다. 상기 표에서 예를 들면, H04B 7/04와 잘 융합하는 특허 분류 기호로 C7(IPC 3 dot 레벨) 레벨에서는 H04B 7/155, C6 레벨에서는 H04B 7/15 등과 같이 각 IPC 레벨에서의 융합의 빈도가 높은 IPC들을 입수할 수 있고, 이들에 대해서 통계/분석 정보를 생성하면(group by, count, 및 rank 명령으로 처리하면 된다), 융합 빈도가 높은 IPC들의 순위 정보를 알 수 있게 된다.(예를 들면, "H04B 7/04와 가장 잘 융합하는 C6 레벨의 IPC는 H04B 7/15이며, 차순위로 잘 융합하는 C6 레벨의 IPC는 H04Q 7/30이다"와 같은 융합 관련 랭킹 정보를 생성할 수 있게 된다. 물론, 상기 융합이 발생한 구체적인 문건은 상기 문건 번호를 통해서 구체적으로 확인 가능할 수도 있지만, 융합이 발생한 IPC 쌍을 검색 엔진 또는 DBMS(201)에 질의함으로써, 해당되는 문건들을 입수할 수도 있게 된다. 상기 입수된 특허 분류 기호와 잘 융합하는 특허 분류 기호들을 IPC 각 레벨별(IPC 서브클래스, 메인그룹, 1 dot, 2 dot, ...로 연도별/기간별 통계/분석한 정보를 생성할 수 있을 것이다. 이는 상기 테이블에서 입수된 특허 분류 기호(예를 들면, H04B 7/04)를 한쪽(가급적 메인 IPC)에서 찾고, 상기 찾은 특허 분류 기호가 포함된 모든 레코드(행)에서 C1~Cn 중에서 레벨을 확정(칼럼이 결정됨)하고, 상기 확정된 레벨에 존재하는 모든 특허 분류 기호에 대하여 통계/분석(group by, count 및 rank 명령으로 가능하다) 아울러, 상기 다차원 분석용 연산 수행 결과 테이블 데이터에 기간/날짜 정보가 포함되어 있는 경우, 상기 입수된 특허 분류 기호와 잘 융합하는 특허 분류 기호들을 상기 기간별(예를 들면 연도별)로의 통계/분석 데이터를 생성할 수 있다. First, it is an analysis of non-use fusion of subject information when a patent classification code is obtained. When one patent classification symbol is obtained or given, one can find a ranking of the patent classification that fits well with the patent classification symbol (high frequency of being paired). In the above table, for example, the frequency of fusion at each IPC level, such as H04B 7/155 at C7 (IPC 3 dot level) level and H04B 7/15 at C6 level, is a patent classification code that fuses well with H04B 7/04 (By group, count, and rank commands), it is possible to obtain ranking information of IPCs with high convergence frequency For example, "the C6 level IPC that fits best with H04B 7/04 is H04B 7/15, and the C6 level IPC that fits well with the next level is H04Q 7/30" Of course, the specific document in which the fusion occurred can be specifically confirmed through the document number, but it is also possible to obtain corresponding documents by querying the search engine or the DBMS 201 for the IPC pair in which fusion has occurred The obtained patent classification Patent classification codes that are well-compatible with the IPC sub-class can be generated by analyzing IPC sub-class, main group, 1 dot, 2 dot, ... according to year / period, (Preferably H04B 7/04) from one side (preferably the main IPC) and the level is determined among C1 to Cn in all the records (rows) including the found patent classification symbol (By group, count, and rank commands) for all the patent classification symbols existing in the determined level, and period / date information is included in the operation result table data for the multidimensional analysis , It is possible to generate statistical / analytical data for the period (for example, by year) for the patent classification symbols that fuse well with the obtained patent classification symbol.

상기 융합 분석은 하나의 특허 분류 기호에 대해서 설명하였으나, 2 이상의 복수개의 특허 분류 기호가 입수되었을 때의 융합 분석도 상기 입수된 2 이상의 복수개의 특허 분류 기호 개개에 대하여 융합의 빈도가 높은 특허 분류 기호를 획득하고, 이를 합하면 될 것이다. 한편, 정책에 따라서, 하나의 문건에 상기 입수된 복수개의 특허 문건에서 중복 해당하는 경우가 발생하는데, 이때 중복을 허용하면서 통계/분석 정보를 생성할 수도 있고, 중복을 제거하면서 통계/분석 정보를 생성할 수도 있다. 후자의 경우에는 상기 입수된 복수개의 특허 분류 기호를 질의한 다음, 질의 결과로 나오는 레코드 각각에 포함된 문건 번호에 distinct 명령 등과 같은 중복 제거 명령을 수행한 다음, 중복이 제거된 레코드만을 기준으로 각 IPC 레 벨별로 잘 융합되는 특허 분류 기호에 대한 통계/분석 정보를 생성할 수 있게 된다. 이러한 중복의 문제가 발생하게 되는 근본적인 이유는 융합 정보를 처리할 때, 하위 특허 분류 기호에서 융합이 발생했다면, 그 각각의 상위 특허 분류 기호간에서는 당연히 융합이 발생했다고 보기 때문이다. 한편, 복수개의 특허 분류 기호는 동일한 IPC 레벨일 필요는 없으며, 전체 IPC 분류 체계에서 선택되는 어느 2 이상의 특허 분류 기호에 대해서 상기 복수개의 특허 분류 기호가 입수되었을 때의 발명 사상을 그대로 적용할 수 있음은 당연하다 할 것이다.Although the fusion analysis has been described with respect to one patent classification symbol, the fusion analysis when a plurality of patent classification symbols of two or more are obtained is also applicable to a plurality of patent classification symbols obtained from the two or more patent classification symbols , And sum it up. On the other hand, according to the policy, when a plurality of patent documents obtained in one document are overlapped, a statistical / analysis information may be generated while permitting duplication, and statistical / analysis information may be generated while eliminating redundancy. . In the latter case, after querying the plurality of patent classification codes obtained, the duplicate removal instruction such as a distinct instruction is performed on the document number included in each of the records resulting from the query, and then, It is possible to generate statistical / analytical information about the patent classification that is well converged by the IPC level. The reason for such a duplication problem is that when convergence information is processed, if a fusion occurs in a sub-patent classification symbol, fusion is naturally occurred between the respective upper classifications. On the other hand, the plurality of patent classification symbols do not have to be the same IPC level, and the inventive ideology when the plurality of patent classification symbols are obtained can be applied to any two or more patent classification symbols selected from the entire IPC classification system Of course.

둘째, 특허 분류 기호가 입수되었을 때의 주체 정보 활용 융합 분석이다. 상기 입수된 특허 분류 기호(1개 또는 복수개)에 가장 잘 융합하는 특허 분류 기호 집합과 관련된 융합 발생 문건 집합이 있을 때, 상기 융합 발생 문건 집합의 문건 번호에 관련된 서지 사항 정보를 활용하여, 각 서지 사항의 필드별 통계/분석 정보를 생성할 수 있다. 예를 들어, H04B 7/00과 융합이 일어난 특허 분류 기호를 포함하고 있는 대한민국 특허청에 출원된 삼성전자주식회사의 특허 문건 집합을 분석 대상 문건 집합으로 하여, 상기 문건 집합에 있는 IPC들을 단계별(IPC 서브클래스, 메인그룹, 1 dot, 2 dot...로 분할하고(이때, 하위 특허 분류 기호 포함의 사상은 당연히 적용된다.) 각 분할된 각 단계별로 어떤 IPC들이 연도별/기간별로 융합이 발생한 문건의 개수에 관한 정보를 제공해 줄 수 있게 된다. 이와 같은 정보는 삼성전자주식회사의 각 발명자별로도 제공해 줄 수 있게 된다.Second, analysis of the use of subject information when the patent classification code is obtained. When there is a set of convergence occurrence documents related to the set of patent classification codes that best fuse with the obtained patent classification code (one or a plurality of), the bibliographic information related to the document number of the convergence occurrence document set is used, Field-specific statistical / analytical information of the field. For example, a set of patent documents of Samsung Electronics Co., Ltd., filed with the Korean Intellectual Property Office, which includes a patent classification code that has been merged with H04B 7/00, is set as a document to be analyzed, and the IPCs in the document set are classified Class, main group, 1 dot, 2 dot ... (at this time, the idea of including the subpopulation classification is of course applicable). Documents where certain IPCs are merged by year / The information can be provided for each inventor of Samsung Electronics Co., Ltd.

셋째, 주어진 문건 집합 내부에서의 융합 분석이다. 상기 주어진 문건 집합은 1) 출원인, 2) 발명자, 3) 특허 분류 기호, 4) 국가, 5) 날짜 등 서지 사항을 구성하는 각종 필드 중 적어도 어느 하나 이상의 조합에 의해서 생성될 수 있다. 문건 집합이 입수되면, 상기 문건 집합을 상기 융합 정보 포함 다차원 분석용 연산 수행 결과 테이블에 질의하면, 입수된 문건 집합 중에서 융합이 발생한 문건 집합을 특정할 수 있게 된다. (문건 번호 및 그 문건 번호가 포함된 레코드가 특정되게 된다.) 이때, 상기 융합이 발생한 문건 집합을 대상으로 생성할 수 있는 정보는 다음과 같다. 1) 융합이 발생한 문건 집합에서 최빈 각 IPC 레벨별 IPC를 추출하고, 상기 추출된 최빈 IPC별로, 융합이 발생한 각 레벨별 IPC에 대한 통계/분석 정보를 생성할 수 있을 것이다.(상기 통계/분석 정보는 카운팅 정보이며, 상기 카운팅 정보값이 있으면, 증감율, 증감속도 등도 구할 수 있게 된다.) 2) 상기 융합이 발생한 문건 집합을 분석 대상 문건 집합으로 한 본 발명 사상의 전반에 걸진 통계/분석 데이터를 생성할 수 있게 된다.(최다 발명자, 최다 출원인, 최다 융합 특허 분류 기호 등을 기준으로 한 이들의 랭킹 정보 등)Third, it is a fusion analysis within a given set of documents. The set of documents given above may be generated by a combination of at least one of the various fields constituting the bibliography such as 1) applicant, 2) inventor, 3) patent classification symbol, 4) country, 5) date. When a set of documents is obtained, the set of documents can be identified from the obtained document set by querying the operation result table for multidimensional analysis including the fusion information. (A record including the document number and the document number is specified). Here, the information that can be generated for the document set in which the fusion occurs is as follows. 1) It is possible to extract IPCs for each IPC level from the document set in which fusion occurs, and generate statistical / analysis information for IPCs for each level in which the convergence occurred, by the extracted optimal IPCs. The information is counting information. If the counting information value is present, the rate of change, rate of increase, and the like can also be obtained.) 2) Statistics / analysis data on the first half of the present invention, (The ranking information of the most inventors, the most applicants, the most convergent patent classification symbols, etc.)

넷째, 융합 대상 발견을 위한 통계/분석이다. 이는 주어진 문건 집합 내부에서 기준을 발견하고, 상기 기준에 대하여, 제1국 또는 제2국의 모든 또는 기설정된 문건 집합(예를 들면 출원일 기준 최근 7년간 출원된 문건 집합)을 대상으로 생성된 융합 정보 포함 다차원 분석용 연산 수행 결과 테이블 데이터에 질의하여, 각 기준별로의 융합 통계/분석 정보를 입수하는 통계/분석을 말한다. 상기 기중의 예를 들면, 특정 출원인 또는 출원인별 총량/점유율/집중율/활동율 또는 기타 분석 지표별로의 특허 분류 기호 레벨별 최빈값이 될 수 있을 것이다. 더욱 구체적으로 예를 들면, 삼성전자주식회사의 한국에서의 메인그룹 기준 고집중율 IPC들을 추출 할 수 있으며, 이들 추출된 고집중율 기준 IPC들이 융합 정보 포함 다차원 분석용 연산 수행 결과 테이블 데이터에 질의하는 기준 IPC가 될 것이다. 이때, 상기 질의하는 대상이 되는 상기 융합 정보 포함 다차원 분석용 연산 수행 결과 테이블 데이터는 제1국(상기 기준이 생성된 국가)이 될 수도 있지만, 제2국(상기 기준의 생성과는 무관한 국가)의 특허 문건 집합에서 생성된 것이다. 상기의 방법을 활용하면, 대한민국의 삼성전자의 특허 문건에서 고집중율 IPC 메인그룹별 IPC를 추출하고, 상기 추출된 IPC를 통해서 일본 또는 미국의 특허 문건 집합으로 생성된 융합 정보 포함 다차원 분석용 연산 수행 결과 테이블 데이터에 질의하여, 일본 또는 미국에서 상기 추출된 고집중율 메인그룹별 IPC들과 잘 융합하는(융합의 빈도가 높은) 각 단계별 특허 분류 기호 정보를 입수할 수 있게 된다.Fourth, statistics / analysis for fusion target discovery. This means that a criterion is found within a given set of documents and that a convergence of all or a predetermined set of documents of the first or second country (for example, a set of documents filed during the last seven years on the filing date) This is statistical / analytical analysis that obtains convergence statistics / analysis information for each criterion by querying the operation result table data including information. For example, the mode of the patent classification symbol level by total applicant or applicant total amount / market share / concentration ratio / activity rate or other analysis index. More specifically, it is possible to extract, for example, main group standard consensus IPCs of Samsung Electronics Co., Ltd. in Korea, and to extract reference IPCs . At this time, the operation result table data for multidimensional analysis including the fusion information, which is the object to be queried, may be the first station (the country in which the criterion is generated), but the second station ) Was created from a set of patent documents. Using the above method, the IPC for each main group is extracted from Samsung Electronics' patent documents of Korea, and the multi-dimensional analysis including the convergence information generated by the patent document set of Japan or USA is performed through the extracted IPC It is possible to obtain the patent classification symbol information for each stage that is well-integrated with the IPCs of the concentrated main group, which is extracted in Japan or the United States, where the fusion frequency is high, by querying the result table data.

그리고, 본 발명에서는 다단계 특허 분류 기호 체계 상 직하위 특허 분류 기호가 매개된 롤업 연산 처리를 통한 다차원 분석용 연산 수행 결과 테이블 데이터 생성에 대해서 설명하였다. 그리고, 롤업이 전제되거나, 설령 전제되지 않은 경우라도 실시간으로 주어진 특허 분류 기호의 직하위 특허 분류 기호를 대상으로 한 정보를 생성하여 특정 특허 분류 기호의 직하위 특허 분류 기호에 관한 정보를 드릴 다운 정보를 사용자에게 제공해 줄 수 있었다. 이러한 본 발명의 발명 사상은 융합 분석에도 그대로 적용된다. 하기는 융합 분석에서의 드릴 다운에 대해 더욱 상세한 설명을 제공하고 있다.In the present invention, generation of operation result table data for multidimensional analysis by roll-up processing mediated by a sub-patent classification symbol on a multi-stage patent classification system is described. Also, even if the roll-up is presupposed or even if it is not presupposed, information about the sub-classifications of a given patent classification symbol is generated in real time, and information about the sub- To the user. The inventive idea of the present invention is applied to fusion analysis as it is. The following provides a more detailed description of drill down in the fusion analysis.

하기 표79에서의 기준 IPC는 입수되거나 주어진 IPC이며, 대상 IPC는 제1국 또는 제2국 전체 융합 발생 문건 집합 또는 기설정되거나 입수된 문건 집합 중에서 추출된 융합 발생 문건 집합 또는 기설정되거나 입수된 융합 발생 문건 집합에서 입수하는 최빈/빈도 높은 IPC이다. 이때, 기준 및 대상 IPC의 레벨 선택이 가능함은 물론이다 하겠다. (물론, 상기 기준 IPC는 제1국 또는 제2국 전체 융합 발생 문건 집합 또는 기설정되거나 입수된 문건 집합 중에서 추출된 융합 발생 문건 집합 또는 기설정되거나 입수된 융합 발생 문건 집합에서 입수하는 최빈/빈도 높은 IPC일 수 있음은 당연할 것이다.) 이때, 기준 IPC와 대상 IPC 각각에 대하여 사용자로부터 독립적인 IPC 레벨 선택이 있을 경우, 선택 정보를 입수하여 선택된 레벨별로 계산된 결과를 추출하여 사용자에게 제공한다.The reference IPC in Table 79 below is the IPC obtained or given, and the target IPC is the set of convergent documents extracted from either the first or second total convergence document set or the previously established or obtained document set, It is the most frequent / frequent IPC obtained from the convergence document set. At this time, it is of course possible to select the level of the reference and target IPCs. (Of course, the reference IPC is a set of convergence documents extracted from either a first or second country full convergence document set or a pre-established or acquired document set, or a set of pre- If there is an independent IPC level selection from the user for each of the reference IPC and the target IPC, the selection information is obtained, and the calculated result for each selected level is extracted and provided to the user .

[표 66][Table 66]

순위ranking 기준 IPC
대상 IPCStandard IPC
Target IPC IPC 설명IPC Description 0000 0101 최근lately 합계Sum 1One B60C 3/00 (+)
H01B 1/00 (+)B60C 3/00 (+)
H01B 1/00 (+) 1111 5252 44 22 H03M 3/00 (+)
B29C 31/00 (+)H03M 3/00 (+)
B29C 31/00 (+) 22 55 99

드릴 다운된 융합 통계/분석 정보를 입수하기 위하여 상기 표 66에서 기준 IPC인 B60C 3/00을 누른 경우, 본 발명의 시스템(1)은 하기 표 67에서와 같은 방식으로 드릴 다운된 데이터를 생성하여 사용자에게 제공해 줄 수 있을 것이다. (드릴 다운은 IPC 체계 하에서 자신의 하위에서만을 입수하며, 각 입수된 IPC 모두가 나타나는 것이 아니라 융합이 발생한 IPC만 나타난다.)When the reference IPC B60C 3/00 in Table 66 is pressed to obtain drilled-down convergence statistical / analysis information, the system 1 of the present invention generates drilled-down data in the manner shown in Table 67 below You will be able to provide it to users. (Drilldown is only available to subordinates under the IPC scheme, and not all of the IPCs that are available appear, but only IPCs that have converged.)

[표 67][Table 67]

순위ranking 기준 IPC
대상 IPCStandard IPC
Target IPC IPC 설명IPC Description 0000 0101 최근lately 합계Sum 1One B60C 3/00 (+)
H01B 1/00(+)B60C 3/00 (+)
H01B 1/00 (+) 횡단면에 특징이 있는 타이어

도전재료를 특징으로 하는 도체 또는 도전물체Tires characterized by cross-section

A conductor or a conductive object characterized by a conductive material 1111 5252 44 6767 B60C 3/04(+)B60C 3/04 (+) 단면의 상대촌법에 특징이 있는 것A feature of the cross section method of section 55 3030 33 4040 B60C 3/06(+)B60C 3/06 (+) 비대칭의 것Asymmetric 44 1010 1One 1212 B60C 3/08(+)B60C 3/08 (+) 보관 또는 불사용상태에서 접을 수 있는 것Foldable in storage or unused condition 1One 22 00 22 22 H03M 3/00 (+)
B29C 31/00 (+)H03M 3/00 (+)
B29C 31/00 (+) 22 55 99 1616

(비고) 52 = from 하위(30+10+2) + 자신(10)(Remarks) 52 = from child (30 + 10 + 2) + self (10)

상기 표 67과 같은 결과 데이터에서 본 시스템의 사용자가 어디서든 H01B 1/00를 누른 경우에는 본 시스템(1)은 다음 표 68과 같은 형식의 드릴 다운된 데이터를 생성하여 사용자에게 제공해 줄 수 있다.If the user of the present system presses H01B 1/00 anywhere in the result data as shown in Table 67, the system 1 can generate drilled-down data of the format shown in the following Table 68 and provide it to the user.

[표 68][Table 68]

순위ranking 기준 IPC
대상 IPCStandard IPC
Target IPC IPC 설명IPC Description 0000 0101 최근lately 합계Sum 1One B60C 3/00 (+)
H01B 1/00(+)B60C 3/00 (+)
H01B 1/00 (+)
횡단면에 특징이 있는 타이어Tires characterized by cross-section 1111 5252 44 H01B 1/02(+)H01B 1/02 (+) 주로 금속 또는 합금으로 된 것Mainly of metals or alloys 33 3535 22 H01B 1/04(+)H01B 1/04 (+) 주로 카본 실리콘혼화물, 카본 또는 실리콘으로서 되는 것Mainly made of carbon silicon, carbon or silicon 22 1010 00 H01B 1/06(+)H01B 1/06 (+) 주로 다른 비금속물질로 되는 것Mainly made of other non-metallic materials 22 00 00 22 H03M 3/00 (+)
B29C 31/00 (+)H03M 3/00 (+)
B29C 31/00 (+) 22 55 99

한편, 본 시스템(1)은 드릴 다운과 관계하여 다음과 같은 결과 데이터를 사 용자에게 제공해 줄 수 있을 것이다. 하기 표 69에서의 기준 IPC는 입수되거나 주어진 IPC이며, 1위에서 n위까지의 랭킹의 대상이 되는 대상 IPC는 제1국 또는 제2국 전체 융합 발생 문건 집합 또는 기설정되거나 입수된 문건 집합 중에서 추출된 융합 발생 문건 집합 또는 기설정되거나 입수된 융합 발생 문건 집합에서 입수하는 최빈/빈도 높은 IPC이다. 이때, 기준 및 대상 IPC의 레벨 선택이 가능함은 물론이다 하겠다.(물론, 상기 기준 IPC는 제1국 또는 제2국 전체 융합 발생 문건 집합 또는 기설정되거나 입수된 문건 집합 중에서 추출된 융합 발생 문건 집합 또는 기설정되거나 입수된 융합 발생 문건 집합에서 입수하는 최빈/빈도 높은 IPC일 수 있음은 당연할 것이다.)On the other hand, the system 1 can provide the following result data to the user in relation to the drill-down. The reference IPC in Table 69 below is the IPC obtained or given, and the target IPC, which is ranked from the first to the nth ranked, is extracted from the aggregation document set of the first or second whole convergence document, The most frequent / frequent IPCs that are obtained from a set of established convergence documents or from a set of pre-established or acquired convergence documents. (Of course, the reference IPC is a set of the convergence document collection of the first or second convergence document collection or the document collection set of the pre-established or acquired document set, Or it may be the most frequent / frequent IPC available from a set of pre-established or acquired convergence documents.)

[표 69][Table 69]

순위ranking 기준 IPCStandard IPC IPC 설명IPC Description 1위1st 2위2nd place 3위3rd place 1One B60C 3/00 (+)B60C 3/00 (+) 횡단면에 특징이 있는 타이어Tires characterized by cross-section H01B 1/00(67) (*)H01B 1/00 (67) (*) G03B 5/00(15) (*)G03B 5/00 (15) (*) C07B 31/00(10) (*)C07B 31/00 (10) (*) 22 H03M 3/00 (+)H03M 3/00 (+)

이때, 상기 표 69와 같은 결과 데이터에서 H01B 1/00 (*)를 눌렀을 때는 다음 표 70과 같은 기준 IPC와 대상 IPC의 융합 분석 결과가 나오는 것이 바람직할 것이다.At this time, when H01B 1/00 (*) is pressed in the result data as shown in Table 69, it is preferable that the result of the fusion analysis of the reference IPC and the target IPC as shown in the following Table 70 is displayed.

[표 70][Table 70]

순위ranking 기준 IPC
대상 IPCStandard IPC
Target IPC IPC 설명IPC Description 0000 0101 최근lately 합계Sum 1One B60C 3/00 (+)
H01B 1/00 (+)B60C 3/00 (+)
H01B 1/00 (+) 횡단면에 특징이 있는 타이어
Tires characterized by cross-section
횡단면에 특징이 있는 타이어 (more)

도전재료를 특징으로 하는 도체 또는 도전물체

Tires characterized by cross-section (more)

A conductor or a conductive object characterized by a conductive material

1111 5252 44 6767 H01B 1/02(+)H01B 1/02 (+) . 주로 금속 또는 합금으로 된 것. Mainly of metals or alloys 33 3535 22 4040 H01B 1/04(+)H01B 1/04 (+) 주로 카본 실리콘혼화물, 카본 또는 실리콘으로서 되는 것Mainly made of carbon silicon, carbon or silicon 22 1010 00 1212 H01B 1/06(+)H01B 1/06 (+) . 주로 다른 비금속물질로 되는 것. Mainly made of other non-metallic materials 22 00 00 22

사용자들에게 보이는 모든 IPC에는 IPC 설명이 포함되는 것이 바람직할 것이다. 사용자들 중에서 각 IPC를 모두 기억하기를 기대하기를 어렵기 때문에, 각 IPC의 타이틀 정보가 1차적으로 상기 IPC 설명 정보로 나오는 것이 바람직할 것이다. 이때, 각 IPC의 타이틀 정보는 간략하고, 특징만을 설명하는 것이 대부분이므로, 2차적으로 IPC에 대한 설명에 클릭했을 때 POP UP 등으로 그 IPC의 상위 IPC들을 모두 또는 일정 단계까지(예를 들면 클래스 레벨까지) 보여 주는 것이 바람직할 것이다.It would be desirable for all IPCs visible to users to include an IPC description. It is difficult to expect to store all of the IPCs among the users, so that it is preferable that the title information of each IPC firstly appears in the IPC description information. At this time, since title information of each IPC is brief and explains only the characteristics, when the description of the IPC is clicked in the second place, the upper IPCs of the IPC are all up to a predetermined level (for example, Level).

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 IPC를 중심으로 융합 분석을 위한 다차원 분석용 연산 수행 결과 테이블 데이터를 어떻게 생성하는 지에 대해 설명하였으나, 본 방법은 USPC, FT, FI, ECLA 등의 다른 특허 분류 기호에 대해서도 동등한 방법으로 적용될 수 있음은 당업자에게 자명하다 할 것이다.The method for generating multidimensional analysis operation result table data for the fusion analysis based on the IPC has been described in the USC, FT, FI, ECLA, etc. It will be apparent to those skilled in the art that other patent classification schemes of the invention may be applied in an equivalent manner.

한편, 상기 융합 분석에서 융합 발생 특허 분류 기호 조합은 2개의 특허 분류 기호의 조합(Ai, Bj)을 중심으로 설명했으나, 본 발명 사상은 3개 이상의 융합 발생 특허 분류 기호의 조합 (Ai, Bj, Ck...)에 대해서도 본 발명 사상은 그대로 적용됨은 당업자에게 자명할 것이다. 예를 상기 2개의 특허 분류 기호의 조합을 중심으로 융합분석을 할 경우에는 동종 특허 분류 기호가 2개 이상 포함되어 있는 특허 문건을 융합 발생 특허 문건 집합으로 하였으나, 3개의 특허 분류 기호의 조합을 중심으로 융합 분석을 하는 경우에는 동종 특허 분류 기호가 3개 이상 포함되어 있는 특허 문건을 융합 발생 특허 문건 집합으로 하여야 한다. 그리고 2개의 특허 분류 기호의 조합일 경우, 하나의 문건에 n개의 특허 분류 기호가 있을 경우, nC2개의 (Ai, Bj) 조합이 나왔으나, 3개의 특허 분류 기호의 조합일 경우, nC3 개의 (Ai, Bj, Ck)가 나오며, (Ai, Bj, Ck)를 (Ai, Bj)처럼, 하나의 단위로 하여 처리할 수 있을 것이다. 이 경우에도 Ai, Bj, Ck 중에서 어느 하나는 메인 IPC에서 기인된 것이 포함될 수도 있으며, 모두 서브 IPC에서 기인한 것일 수도 있다.In the fusion analysis, the combination of the patent classification codes generated by fusion is described based on the combination of two patent classification codes (Ai, Bj). However, in the present invention, combinations of three or more fusion patent classification codes (Ai, Bj, Ck ...) will be obvious to those skilled in the art. For example, in the case of fusion analysis based on the combination of the two patent classification symbols, a patent document including two or more similar patent classification symbols is defined as a set of fusion patent documents. However, , A patent document containing three or more patents of the same kind should be used as a set of fusion patent documents. In the case of a combination of two patent classification codes, there are nC2 (Ai, Bj) combinations when there are n patent classification codes in one document. However, if there are three patent classification codes, , Bj, Ck), and (Ai, Bj, Ck) can be processed as one unit like (Ai, Bj). In this case also, any one of Ai, Bj, and Ck may be originated from the main IPC, or all may originate from the sub IPC.

대표 어구 분석용 다차원 분석용 연산 수행 결과 테이블Analysis result table for multidimensional analysis for representative phrase analysis

본 발명의 대표 어구 추출 전처리 모듈을 통한 대표 어구 생성 방법은 전술한 바 있다. 상기 대표 어구 추출 전처리 모듈은 하기 표 71와 같은 데이터를 생성한다는 것은 전술한 바가 있다.A representative word generation method using the representative word extraction preprocessing module of the present invention has been described above. The representative word extraction preprocessing module generates data as shown in Table 71 below.

[표 71][Table 71]

본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 대표 어구 분석을 위한 다차원 분석용 연산 수행 결과 테이블 데이터를 생성하기 위하여, 상기 표84 와 같은 데이터를 입수하고, 상기 입수한 문건 번호의 서지 사항에 포함된 각종 정보(출원인, 특허 분류 기호, 발명자, 날짜 정보 등)를 특허 문건 마스트 DB(202)에서 추출하여(문건 번호를 키값으로 하여 조인 연산을 수행하여 처리할 수도 있다.) 하기 표 72와 같은 형태의 다차원 분석용 연산 수행 결과 테이블 데이터를 생성한다. 먼저 특허 분류 기호를 추출하여 하기와 같은 데이터를 생성할 수 있을 것이다. 편의상 문건 번호 #1에는 H04B 7/15라는 IPC가 1개 포함되어 있고, 문건 번호 #2에는 H04B 7/26 및 H04B 7/02라는 IPC가 포함되어 있다고 가정하자.The multidimensional analysis calculation execution result table generation module 402 of the present invention acquires data as shown in Table 84 to generate multidimensional analysis calculation result table data for analysis of representative phrases, (Patent applicant, patent classification code, inventor, date information, etc.) included in the bibliography of the patent document master DB 202 (the patent document master DB 202 may perform processing by performing a join operation with the document number as a key value). The multidimensional analysis operation result table data of the form shown in Table 72 below is generated. First, the patent classification code can be extracted to generate the following data. For convenience, suppose document # 1 contains one IPC, H04B 7/15, and document number # 2, contains IPCs H04B 7/26 and H04B 7/02.

[표 72][Table 72]

어구 IDPhrase ID 어구Fish 필드field 출현회수Number of times of appearance 문건 번호Article number C1C1 C2C2 C3C3 C4C4 C5C5 C6C6 1One abcabc DD 33 #1#One HH H04H04 H04BH04B 7/007/00 7/147/14 7/157/15 1One abcabc CC 22 #1#One HH H04H04 H04BH04B 7/007/00 7/147/14 7/157/15 1One abcabc DD 22 #2#2 HH H04H04 H04BH04B 7/007/00 7/247/24 7/267/26 1One abcabc CC 1One #2#2 HH H04H04 H04BH04B 7/007/00 7/247/24 7/267/26 1One abcabc DD 22 #2#2 HH H04H04 H04BH04B 7/007/00 7/027/02 1One abcabc CC 1One #2#2 HH H04H04 H04BH04B 7/007/00 7/027/02 22 bcdbcd DD 1One #1#One HH H04H04 H04BH04B 7/007/00 7/147/14 7/157/15 22 bcdbcd CC 1One #1#One HH H04H04 H04BH04B 7/007/00 7/147/14 7/157/15 33 acac DD 1One #1#One HH H04H04 H04BH04B 7/007/00 7/147/14 7/157/15

문건 번호 #2에는 H04B 7/26 및 H04B 7/02라는 IPC가 대응되어 있으므로, 각 레코드마다 2개의 IPC 정보를 처리하기 위한 레코드가 생겼다. 상기는 IPC에 대해서 설명해 놓았지만, 본 발명 사상이 USPC, FT 등에도 동일한 방식으로 적용됨은 전술한 바와 같고, 당업자에게 자명할 것이다.Since document number # 2 corresponds to IPC H04B 7/26 and H04B 7/02, a record for processing two pieces of IPC information is generated for each record. Although the IPC has been described above, it will be apparent to those skilled in the art that the present invention is applied to USPC, FT, etc. in the same manner as described above.

먼저 특허 분류 기호를 추출하여 하기 표 73과 같은 데이터를 생성할 수 있을 것이다. 편의상 문건 번호 #1에는 출원인 A가 포함되어 있고, 문건 번호 #2에는 출원인 B와 C가 포함되어 있다고 가정하자. 또한, 상기 문건 번호 #1은 2005년 5월에 출원되었다고 하고, #2는 2006년 1월에 출원되었다고 하자. 이 경우 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 하기 표 73와 같은 형태의 다차원 분석용 연산 수행 결과 테이블 데이터를 생성할 수 있을 것이다. (하기 표에서 05/1Q는 2005년 1분기를 의미한다.)First, the patent classification symbol can be extracted and the data as shown in Table 73 below can be generated. For convenience, suppose that applicant A is included in document number # 1 and applicant B and C are included in document number # 2. It is also assumed that the document No. 1 is filed in May 2005, and the filing filed in January 2006. In this case, the multidimensional analysis operation result table generation module 402 of the present invention can generate multidimensional analysis operation result table data of the form shown in Table 73 below. (05 / 1Q in the following table refers to the first quarter of 2005)

[표 73][Table 73]

어구 IDPhrase ID 어구Fish 필드field 출현회수Number of times of appearance 문건 번호Article number 0505 05/Q105 / Q1 5/2Q5 / 2Q 5/3Q5 / 3Q 5/4Q5 / 4Q 0606 06/1Q06 / 1Q AppNameAppName 1One abcabc DD 33 #1#One 1One 1One AA 1One abcabc CC 22 #1#One 1One 1One AA 1One abcabc DD 22 #2#2 1One 1One BB 1One abcabc CC 1One #2#2 1One 1One BB 1One abcabc DD 22 #2#2 1One 1One CC 1One abcabc CC 1One #2#2 1One 1One CC 22 bcdbcd DD 1One #1#One 1One 1One AA 22 bcdbcd CC 1One #1#One 1One 1One AA 33 acac DD 1One #1#One 1One 1One AA

문건 번호 #2에는 출원인 B, C가 포함되어 있으므로, 각 레코드마다 2개의 출원인 정보를 처리하기 위한 레코드가 생겼다.Since the applicant B and C are included in the document number # 2, there is a record for processing two pieces of applicant information for each record.

본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 상기 대표어구와 관련한 상기 2개의 다차원 분석용 연산 수행 결과 테이블 데이터를 1개의 통합된 다차원 분석용 연산 수행 결과 테이블 데이터로 생성해 놓을 수도 있을 것이다. 한편, 번역을 적용한다면, 상기 어구들에 대해 언어 정보에 대응되는 각 언어별 번역어를 어구로 한 다차원 분석용 연산 수행 결과 테이블 데이터를 생성할 수 있을 것이다. 상기 AppName 필드는 출원인명을 의미하며, 상기 필드 옆에 발명자 필드를 생성할 수 있고, 개별 발명자마다 하나의 레코드를 생성할 수 있을 것이다. 만약 출원인이 2이고, 발명자가 3인이면, 출원인 정보와 발명자 정보가 없던 1개의 레코드가 출원인과 발명자를 1개씩 포함하는 6(2*3 = 6)개의 레코드로 확장된다.The multidimensional analysis operation result table generation module 402 of the present invention generates the two multidimensional analysis operation result table data related to the representative word as one integrated multidimensional analysis operation result table data It might be. On the other hand, if a translation is applied, an operation result table data for multidimensional analysis using the translations of each language corresponding to the language information for the phrases can be generated. The AppName field indicates an applicant's name, an inventor field can be generated next to the field, and one record can be generated for each individual inventor. If the applicant is 2 and the inventor is 3, then one record without applicant and inventor information is extended to 6 (2 * 3 = 6) records containing one applicant and one inventor.

상기 표 72 및 표 73과 같은 데이터가 있는 경우, 대표어구와 관련하여 하기와 같은 정보를 생성할 수 있다.If there is data such as those in Tables 72 and 73, the following information can be generated with respect to the representative word.

첫째, 1) 출원인별 또는 발명자별 또는 출원인의 발명자, 2) 특정 특허 분류 기호의 특정 레벨(예를 들면 IPC 메인그룹 등), 3) 특정 기간(년, 분기, 월 등), 4) 특정 국가별로 어구 집합을 제공할 수 있을 것이다. 예를 들면, 대한민국 특허 출원 문건 기준 IPC 메인그룹 H04B 1/00, 2005년 1분기, 삼성전자주식회사가 출원한 문건에(문건의 특허 청구 범위 또는 초록 또는 특정 필드에) 등장하는 어구들의 목록을 추출할 수 있을 것이다. 이는 상기 표 72 및/또는 표 73와 같은 다차원 분석용 연산 수행 결과 테이블 데이터가 있을 때 간단한 SQL 문으로 처리 가능함은 당업자에게 자명할 것이다.3) a specific period (year, quarter, month, etc.); 4) a specific country or territory of a specific country (eg, It would be possible to provide a set of phrases. For example, a list of phrases appearing in a document filed by Samsung Electronics Co., Ltd. (in the claims or abstracts or in a specific field of a document) is extracted from IPC Main Group H04B 1/00, 2005, You can do it. It will be apparent to those skilled in the art that when there are table data of operation result data for multidimensional analysis as shown in Table 72 and / or Table 73, it can be processed by a simple SQL statement.

둘째, 1) 출원인별 또는 발명자별 또는 출원인의 발명자, 2) 특정 특허 분류 기호의 특정 레벨(예를 들면 IPC 메인그룹 등), 3) 특정 기간(년, 분기, 월 등), 4) 특정 국가별로 대표 어구 집합을 추출하여 제공할 수 있을 것이다. 대표 어구 추출은 대표 어구 추출 정책에 따라 달라진다. 상기 대표 어구 추출 정책의 예로 다음과 같은 것들이 있을 수 있다.2) a specific level of a specific patent classification (eg IPC main group); 3) a specific period (year, quarter, month, etc.); 4) It is possible to extract and provide a representative phrase set. The representative phrase extraction depends on the representative phrase extraction policy. Examples of the representative phrase extraction policy may include the following.

1) 추출된 특정 어구에 대하여 "출현 빈도 확률 = 특정 문건 집합에서의 출현 빈도/자신의 상위 문건 집합에서 출현 빈도*100"를 처리한 값이 기설정된 제1 협범위값에 포함되어 있는 경우이다. 1) For a specific phrase extracted, "Occurrence frequency probability = Occurrence frequency in a set of specific documents / Occurrence frequency in the set of its own document * 100" is included in the preset first narrow range value .

2) 특정 어구에 대하여 그 특정 어구의 상기 출현 빈도 확률을 처리한 값이 기설정된 제1 협범위값에 포함되어 있어 있지 않고, 기설정된 제2 광범위값에 포함되어 있으면서, 상기 출현 빈도 확률이 기설정된 랭킹 이내에 포함되는 경우이다. 이는 특정 문건 집합에서 상기 1)의 대표 어구 추출 정책을 사용한 결과 대표 어구의 수가 현격히 적을 경우에 잘 적용될 수 있을 것이다.2) a value obtained by processing the appearance frequency probability of the specific phrase for a specific phrase is not included in the predetermined first narrow range value but is included in the predetermined second wide value, And is included within the set rankings. This can be applied to cases where the number of representative phrases is remarkably small as a result of using the representative phrase extraction policy of the above 1) in a specific document set.

3) 상기 추출된 특정 어구에 대하여, 특정 문건 집합에서의 출현 빈도의 증감율 기설정된 제1 협범위값에 포함되어 있는 경우이다.3) For the extracted specific phrase, the increase / decrease rate of the occurrence frequency in the specific document set is included in the first narrow range value set in advance.

상기 출현 빈도 확률을 구할 때, 자신의 상위 문건 집합이 언급되는데, 상기 상위 문건 집합의 종류는 1) 자신의 문건 집합을 특정짓는 요소가 적어도 하나 이상의 차원(dimension)을 포함할 경우 그 차원의 직상위 또는 모든 상위의 문건 집합이 되는 유형의 상위 문건 집합(한꺼번에 2가지 이상의 차원의 직상위 또는 모든 상위가 가능함은 물론이다.), 2) 순수한 키워드만으로 문건 집합이 특정될 때는 그 문건 집합의 전체 집합(국가 단위의 문건 집합일 경우, 그 국가의 모든 문건 집합)이 되는 상위 문건 집합이 있다. 상기 1) 유형의 상위 문건 집합에 대해 상세히 설명한다. 예를 들어, IPC H01L 1/00으로 특징되는 문건 집합의 경우에는 i) H01L 또는 이들의 상위 특허 분류 기호(H01, H)로 특정되는 문건 집합이 상기 상위 문건 집합이 될 수 있다. 한편, 대한민국 삼성전자주식회사의 2000년도 1분기 IPC H01L 1/00로 특정되는 문건 집합의 상위 문건 집합에는 i) 상위 특허 분류 기호, ii) 2000년도 또는 연도 통합, iii) 삼성전자주식회사의 모든 국가(미국, 일본, EU 등, 물론, 이때, 상기 어구의 그 국가 해당 언어로의 번역이 전제될 수도 있을 것이다.), iv) 대한민국의 모든 출원 또는 v) 상기 i) 내지 iv) 중 어느 하나 이상의 조합으로 생성되는 문건 집합이 상기 상위 문건 집합이 될 수 있을 것이다. 문건 집합이 특정한 키워드와 상기 차원의 조합으로 생성되는 경우(예를 들면, "RFID tag"라는 키워드와 대한민국 삼성전자주식회사의 2000년도와 같은 국가, 출원인, 연도 차원이 결합됨)에도 상기 키워드는 유지한 채 상기 i) 내지 v)를 적용하면, 상위 문건 집합의 생성이 가능함은 물론이다 하겠다. 한편, 상기 i) ~v)의 상위 문건 집합별로 상기 제1 협범위값 및/또는 상기 제2 광범위값들이 정해져 있는 것이 바람직하다.When finding the probability of occurrence, the upper set of documents is referred to. The types of the upper set of documents are as follows. 1) When an element forming the document set includes at least one or more dimensions, (Of course, it is possible to have an upper level or all upper level of two or more levels at a time), 2) when a document set is specified only by a pure keyword, the entire document set There is a set of superscripts that is a set (in the case of a set of national documents, a set of all documents of that country). The upper document set of the above 1) type will be described in detail. For example, in the case of a set of documents characterized by IPC H01L 1/00, a set of documents identified by i) H01L or their top patent classification codes (H01, H) may be the set of documents above. The top set of documents specified by IPC H01L 1/00 of Samsung Electronics Korea Co., Ltd. in the first quarter of 2000 includes i) the top patent classification symbol, ii) the year 2000 or year integration, iii) all countries of Samsung Electronics Co., Translation of the above phrase into the corresponding language may be presumed), iv) all applications from the Republic of Korea, or v) any combination of any of i) to iv) A set of documents generated by the user can be the above document set. When the document set is generated by a combination of a specific keyword and the dimension (for example, the keyword "RFID tag" is combined with the country, applicant, year dimension such as year 2000 of Samsung Electronics Co., Ltd. of Korea) Of course, if i) to v) above are applied, it is of course possible to generate a super document set. On the other hand, it is preferable that the first narrow range value and / or the second wide range values are determined for each set of upper documents of i) to v).

한편, 문건 집합을 특정짓는 요소에 적어도 하나 이상의 차원을 포함하여 생성된 문건 집합에서 생성되는 대표 어구 추출하는 경우, 그 차원을 축으로 드릴 다운이 가능할 것이다. 드릴 다운 될 때는 그 차원 요소별로 기설정된 조건을 만족하는 대표 어구를 추출하여 그 결과를 사용자들에게 제공해 줄 수 있다. 하기 표 74 및 표 75은 대표 어구 관련된 드릴 다운의 일 실시예를 보여주고 있다. 상기 예는 00연도에 대해 대표 어구를 표시해 주고 있다. 이때, 01연도를 클릭하면 표 74과 같이 01연도에 해당하는 대표 어구를 추출하여 보여 준다.On the other hand, when extracting a representative phrase generated from a document set generated by including at least one dimension in a specific element of a document set, it will be possible to drill down on the dimension as an axis. When drilling down, it is possible to extract a representative phrase that satisfies predetermined conditions for each dimension element and provide the result to the users. Tables 74 and 75 below illustrate one embodiment of a drill down involving representative phrases. The above example shows representative phrases for 00 years. At this time, if you click 01 year, the representative phrase corresponding to 01 year is extracted and shown as Table 74.

[표 74][Table 74]

순위ranking 기준 IPC
대상 IPCStandard IPC
Target IPC IPC 설명IPC Description 0000 0101 0202 0303 0404 0505 0606 최근lately 합계Sum 1One B60C 3/00 (+)B60C 3/00 (+) 횡단면에 특징이 있는 타이어Tires characterized by cross-section 미끄럼, 마찰력 향상, 타이어 공명음 주파수, 소음, 정전기 방출
Slip, Improved Friction, Tire Resonance Sound Frequency, Noise, Static Emission
22 H03M 3/00 (+)H03M 3/00 (+) 변조, 시그마 델타, 펄스폭 변조, 적문기 리셋Modulation, sigma delta, pulse width modulation, red-eye reset

상기 대표 어구는 출현 빈도 확률에 따라 칼라, 폰트 사이즈 등을 달리하는 것이 바람직하다. 한편, 상기 대표 어구에 클릭했을 때는 상기 대표 어구를 포함하고 있는 문건을 추출해 줄 수 있을 것이다. 문건 추출은 1) 상기 대표 어구 추출을 위한 상기 다차원 분석용 연산 수행 결과 테이블 데이터에 문건 번호가 있다면 그 문건 번호를 기초로 하여 특허 문건 마스트 DB(202)에 질의하여 해당 문건을 추출하여 제공해 주는 방법, 2) 상기에서 나온 모든 차원과 대표 어구를 조합하여 검색 엔진에 질의하는 방법이 있을 수 있다. 후자의 예를 들면, 만약 대표 어구 "소음"를 선택한 경우(이 대표 어구가 특허 청구 범위(Claim, C)에서 나온 것일 경우, 입력하는 검색식은 "IPC = B60C 3/00 AND Claim = "소음", Date = (20000101~20001231", 국가 = 대한민국"처럼 될 것이다. 만약, 이 대표 어구를 가지는 미국 문건을 보기를 원할 경우에는 "국가 = 미국"으로 치환하여 검색 엔진에 질의하면 될 것이며, 이때, 키워드에 해당하는 "소음"이 미국 문건의 기본 언어인 영어가 아닐 경우(즉, 언어의 종류를 판단하여) 본 시스템(1)에 포함된 번역 시스 템을 이용하여 "소음"를 영어로 번역하여 "noise"로 상기 "소음" 자리에 입력하여 검색함은 물론이다 하겠다.It is preferable that the representative word is different in color, font size and the like according to the appearance frequency probability. On the other hand, when the representative phrase is clicked on, the document containing the representative phrase can be extracted. The document extraction is performed by: 1) extracting the corresponding document by querying the patent document master DB 202 based on the document number if the document number is present in the multidimensional analysis operation result table data for extracting the representative word, , And 2) there is a method of querying a search engine by combining all dimensions and representative phrases from the above. In the latter example, if the representative phrase "noise" is selected (if this representative phrase is from the claim (C), the search formula entered is "IPC = B60C 3/00 AND Claim = , Date = (20000101 ~ 20001231 ", Country = Republic of Korea".) If you want to see US documents with this representative phrase, you can query the search engine by replacing "Country = US" If the "noise" corresponding to the keyword is not English, which is the basic language of US documents (ie, judging the type of language), "noise" is translated into English using the translation system included in this system (1) it is of course possible to input and search for the "noise"

드릴 다운된 대표 어구를 추출하기 위하여 표 74에서 기준 IPC인 B60C 3/00을 누른 경우((+)표시를 눌러도 됨, 이는 인터페이스의 문제임), 본 발명의 시스템(1)은 하기 표 75과 같은 방식으로 드릴 다운된 데이터를 생성하여 사용자에게 제공해 줄 수 있을 것이다. (드릴 다운은 IPC 체계 하에서 자신의 하위에서만을 입수하며, 각 입수된 IPC 모두가 나타나는 것이 아님은 물론일 것이다. 예를 들어 B60C 3/02에 문건이 없거나, 대표 어구가 없다면 이 IPC에 대한 대표 어구는 제시되지 않는다. 이때, 상기 B60C 3/00에 해당하는 대표 어구는 보여 줄 수도 있고, 감출수도 있을 것이다. The system 1 of the present invention may be configured as shown in Table 75 as below, in order to extract the drilled-down representative phrases, if the reference IPC B60C 3/00 is pressed in Table 74 The data can be generated and provided to the user. (Drill down is only available to subordinates under the IPC scheme, not to all of the IPCs that are available.) For example, if there is no document in B60C 3/02, or there is no representative phrase, the representative for this IPC At this time, the representative phrases corresponding to B 60 C 3/00 may be shown or hidden.

[표 75][Table 75]

순위ranking 기준 IPC
대상 IPCStandard IPC
Target IPC IPC 설명IPC Description 0000 0101 최근lately 합계Sum 1One B60C 3/00 (+)B60C 3/00 (+) 횡단면에 특징이 있는 타이어Tires characterized by cross-section 미끄럼, 마찰력 향상, 타이어 공명음 주파수, 소음, 정전기 방출Slip, Improved Friction, Tire Resonance Sound Frequency, Noise, Static Emission B60C 3/04(+)B60C 3/04 (+) 단면의 상대촌법에 특징이 있는 것A feature of the cross section method of section 조정 안정성, 래디얼 타이어, 압축 공기Adjustment stability, radial tires, compressed air B60C 3/06(+)B60C 3/06 (+) 비대칭의 것Asymmetric 내마모성, 코너링, 비대칭 자가 측벽, 비대칭 래디얼 타이어Abrasion resistance, cornering, asymmetric self-sidewall, asymmetric radial tire B60C 3/08(+)B60C 3/08 (+) 보관 또는 불사용상태에서 접을 수 있는 것Foldable in storage or unused condition 림 이탈, 프페어 타이어 캐리어, 보관 장치Rim release, fair tire carrier, storage device 22 H03M 3/00 (+)H03M 3/00 (+) 차동변조Differential modulation

상기와 같은 예시는 대표 어구를 제시하는 일 실시예일 뿐, 본 발명 사상이 적용되는 분석 결과 어느 곳에서나 상기 대표 어구를 추출하여 제공해 줄 수 있음은 물론이다 할 것이다. 이는 본 발명 사상이 적용되는 분석 결과에는 문건 집합이 대응되어 있고, 본 발명의 대표 어구 상기 문건 집합(상기 문건 집합을 구성하는 특허 문건들)을 입수함으로써 이를 재료로 하여 생성되기 때문이다.It will be appreciated that the above-described example is merely an example of presenting representative phrases, and the representative phrases can be extracted and provided at any place of the analysis result to which the present invention is applied. This is because the analysis result to which the idea of the present invention is applied corresponds to the document set, and the representative phrase of the present invention is generated by obtaining the document set (the patent documents constituting the document set).

본 발명의 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 선택되는 적어도 하나 이상의 차원별로 다차원 연산된 계산을 수행하여 다차원(n-차원) 큐브를 생성해 놓는다. 본 발명의 다차원 큐브 생성의 특징 중의 하나는 특허 분류 기호가 차원에 포함되어 큐브가 생성될 때, 주어진 상기 특허 분류 기호에 대한 롤업 연산 값이 생성될 때, 상기 특허 분류 기호 및 상기 특허 분류 기호의 상위 특허 분류 기호을 함께 고려하여 롤업 연산값을 생성하는 것이다. 이렇게 롤업 연산값이 생성되면, 임의의 특허 분류 기호에 대한 롤업 연산값을 입수할 때, 그 특허 분류 기호의 하위 특허 분류 기호들에 대한 모든 값들이 다차원 연산된 수치값들이 반영되어 생성된 값을 입수하게 된다. 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 주어진 특허 문건에 포함된 특허 분류 기호가 있을 때, 상기 특허 분류 기호 뿐만 아니라, 상기 특허 분류 기호의 상위 특허 분류 기호에도 상기 특허 분류 기호에 대한 처리값을 반영한다. 예를 들면, 문건 번호 #1에 H04B 7/06이라는 IPC가 부여되어 있을 경우, 이 문건으로 큐브 다차원 분석용 연산 수행 결과 테이블을 생성할 때, H04B 7/06에도 카운팅 값 1을 부여하고, 상기 H04B 7/06은 직상위인 H04B 7/04, H04B 7/02에도 카운팅값을 부여해야 한다. 물론, H04B 7/00이상의 상위 레벨에도 카운팅값이 부여됨은 상식적일 것이다. 이를 위해 상기 표들에서와 마찬가지로, 모든 주어진 특허 분류 기호(예를 들면 IPC)에 대하여 자 신의 상위 특허 분류 기호들을 자신이 속한 줄(row)에 일렬로 배치해 놓고, 그 각 상위 특허 분류 기호에 대응되는 셀에 주어진 특허 분류 기호에서 기인한 값을 반영하여 셀값을 계산한다. 이러한 본 발명만의 독특한 롤업 연산 방식은 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402) 및 본 명세서의 곳곳에서 반복적으로 설명해 놓았다.The multidimensional analysis operation result table generation module 402 of the present invention generates a multidimensional (n-dimensional) cube by performing a multidimensional operation calculation for each selected at least one dimension. One of the features of the multidimensional cube generation of the present invention is that when the cube is generated by including the patent classification symbol in the dimension, when the roll-up operation value for the given patent classification symbol is generated, the patent classification symbol and the patent classification symbol And generate the rollup operation value by considering the upper patent classification symbol. When the roll-up operation value is generated, when the roll-up operation value for an arbitrary patent classification symbol is obtained, all values of the sub-patent classification symbols of the patent classification symbol are reflected by the multidimensional numerical values, . The multidimensional analysis operation result table generation module 402 of the present invention generates the multidimensional analysis operation result table generation module 402 in accordance with the patent classification notation Reflects the processing value for the symbol. For example, when an IPC named H04B 7/06 is assigned to document number # 1, a count value of 1 is assigned to H04B 7/06 when generating an operation result table for cubic multidimensional analysis using this document. H04B 7/06 should also be given a count value in the upper hierarchy H04B 7/04, H04B 7/02. Of course, it is common sense that counting values are also given to higher levels of H04B 7/00 and higher. For this purpose, as in the above tables, for each given patent classification code (for example, IPC), the upper-level patent classification codes of their own are arranged in a row on the basis of their own patent classification codes, The cell value is calculated by reflecting the value derived from the given patent classification code. The unique roll-up operation method according to the present invention is repeatedly described in the operation result table generation module 402 for multi-dimensional analysis and throughout the description.

자기 집합 기준 자기 집합 분석과 자기 기준 타분석 유형Self-set criterion Self-set analysis and self-criterion Other type of analysis

본 발명의 분석 모듈은 다양한 분석 주제별 분석 모듈의 집합으로 구성된다. 상기 분석 주제별 분석 모듈은 분석 주제와 관련된 분석 결과를 생성하기 위하여, 상기 분석 주제에 대응되는 분석 주제별 분석 쿼리식을 포함하고 있다. 상기 분석 주제별 분석 쿼리식으로 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성한 다차원 분석용 연산 수행 결과 테이블에 접근하여 목적하는 주제별 분석 결과를 입수해 낸다.The analysis module of the present invention is composed of a set of various analysis subject-specific analysis modules. The analysis subject-specific analysis module includes an analysis subject-specific query expression corresponding to the analysis subject, in order to generate an analysis result related to the subject. Dimensional analytical operation result table generated by the multidimensional analytical operation result table generation module 402 in the analytic subject-specific analytic query expression, and obtains the subject analysis results.

본 발명의 분석 모듈이 생성하는 분석 결과값과 최초 확정되는 분석 대상 문건 집합과는 다음 2가지 관계가 있다. The analysis result value generated by the analysis module of the present invention and the initially determined document to be analyzed have the following two relationships.

첫째는 "자기 집합 기준 자기 집합 분석 유형"으로 상기 분석 결과값이 상기 분석 대상 문건 집합에서 나오는 경우이다. 예를 들면 특정한 출원인의 최빈 기술 분야별(IPC) 각 단계별 연도별 출원수, 점유율, 집중율, 활동율 데이터나, 각 출원인의 개별 발명자별 기술 분야별(IPC) 각 단계별 연도별 출원수, 점유율, 집중율, 활동율 데이터나, 출원인별 또는 개별 발명자별 (기술 분야별) 피인용수 등에 대한 데이터가 이의 예가 된다. 이때, 상기 특정한 출원인의 최빈 기술 분야를 추출해 내는데, 상기 특정한 출원인 자기 자신과 관계된 문건 집합으로 최빈 기술 분야(IPC)를 추출해 내며, 상기 추출된 최빈 기술 분야(IPC)가 기준이 되어 그 기준에 따라 연도별 출원수나 점유율 등의 분석 결과가 만들어 진다는 것이다. 그러므로, 자기의 문건 집합을 대상으로 기준으로 생성해 내고, 그 기준별 최종 분석 결과도 자신의 연도별 출원수이므로, 자기 자신에 대한 분석이 되게 된다. 상기 "자기 집합 기준 자기 집합 분석" 유형은 출력되는 최종 분석 결과가 최초 입력된 문건 집합(자기 문건 집합)에서 생성될 수 있는 특징이 있다.(물론, 점유율 등의 비율값 등을 구할 때는 기준되는 전체값이 필요함은 당연하다.) The first is the case where the result of the analysis is derived from the document set to be analyzed as the "self-set reference self-set analysis type ". For example, the number of applications, market share, concentration rate and activity rate data for each stage of IPC of a specific applicant, and the number of applications, market share, and concentration of each applicant by IPC Data on activity rate data, data on the number of applicants, or the number of persons cited by individual inventors (technical fields), and the like are examples thereof. At this time, the most specific technical field of the specific applicant is extracted. The IPC is extracted from a set of documents related to the specific applicant himself / herself, and the extracted IPC is used as a standard, The analysis results such as the number of applications and market share by year are created. Therefore, it is created based on the set of documents of one's own, and the final analysis result of each criterion is the number of applications per year, so it becomes an analysis of oneself. The above-mentioned "self-set criterion self-set analysis" type is characterized in that the output result of the final analysis can be generated from the first set of document sets (self-document set). (Of course, All values are required.)

두번째는 "자기 집합 기준 타 집합 분석" 유형으로, 상기 분석 결과값이 상기 분석 대상 문건 집합에서 나오지 않는 경우이다. 예를 들면, 분석 결과로 출원인 A의 문건을 중 다출원 기술 분야별로 다수 인용하는 인용자에 대한 데이터를 얻으려는 경우, 최초로 입력하는 데이터는 출원인 A의 문건 집합에 관한 것이지만, 최종적으로 출력하는 데이터는 A가 아닌 타인들의 문건 집합에 관한 데이터이다. 또한, 자신의 최빈 메인그룹 단계의 IPC별로, 그 IPC에 다출원하고 있는 출원인이나, 그 IPC에 집중율이 높은 출원인을 랭킹을 매겨서 추출하는 경우도 "자기 집합 기준 타 집합 분석"기준은 자기 자신의 문건 집합에서 메인그룹별 최빈 IPC를 추출한다는 면에서는 자기 집합 기준이며, 최종 결과가 타인들의 문건 집합에 대한 것이라는 점에서 타 집합 분석이 된다.The second case is a type of "self-set criterion other-set analysis", in which the above-described analysis result value does not appear in the analysis target set. For example, if you want to obtain data about the citations of applicant A by multiple application technology fields, the first data you input is related to the document set of applicant A, but the final output data Is data about the document set of the non-A person. Also, in the case where the applicant who has been filed in the IPC or the applicant who has a high concentration rate in the IPC is extracted by the IPC of his or her most recent main group stage, the criteria of "self- The IPC of the main group is extracted from the document set of the main group and the final result is the set of documents of the others.

이러한 두번째 경우에는 2단계 이상의 계산이 필요하게 된다. 상기 예에서는 우선적으로 출원인 A의 문건 집합을 다출원 기술 분야별로 출력하는 분석과, 상기 출력된 각 기술 분야별 문건 집합을 대상으로 이 문건 집합에 속하는 문건을 인용하는 타인들의 문건 집합을 구하며, 이 타문건 집합을 분석 대상 문건 집합으로 하여 다출원인들을 추출하여 최종적인 분석 결과로 제공한다.In this second case, two or more calculations are required. In the above example, first, a set of documents of applicant A is outputted for each of the technical fields of application, and a set of documents of the other persons quoting documents belonging to the set of documents for the output set of documents for each technical field is obtained, A set of documents is extracted as a set of documents to be analyzed, and the applicants are extracted as final analysis results.

상기 자기 집합 기준 자기 집합 분석에 대해서는 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 다차원 분석용 연산 수행 결과 테이블에서 기설정된 최종 결과까지 동일한 다차원 분석용 연산 수행 결과 테이블에 생성해 놓을 수 있다. 예를 들어, 출원인별 최빈 IPC별 연도별 출원수나 점유율과 같은 분석 결과를 생성하기 위해서는 1개의 다차원 분석용 연산 수행 결과 테이블로도 충분한 경우가 많다.The multidimensional analysis operation result table generation module 402 may generate the same multidimensional analysis operation result table from the multidimensional analysis operation result table to the predetermined final result . For example, one multidimensional analytical operation result table is often sufficient to generate analytical results such as number of applicants or market share per applicant, by IPC per year.

한편, 상기 자기 집합 기준 타 집합 분석 유형의 경우에는 1) 최종 결과까지 미리 생성해 놓는 경우, 2) 최종 결과까지를 각 단계별로 나누고, 각 단계별에서 생성되는 단계별 결과를 다음 단계의 입력값으로 하는 방법이 있을 수 있다. 후자의 경우, 1) 최종 결과까지 단계별로 구분될 수 있을 것, 2) 최초 및 각 단계별 결과를 입력값으로 하여 다음 단계의 출력값을 생성할 수 있도록 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기 입력값에 대한 출력값을 계산해 놓은 다차원 분석용 연산 수행 결과 테이블 데이터나 큐브 데이터가 있을 것, 3) 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기 최초 및 각 단계별 결과를 입력값으로 받을 수 있을 것과 같은 조건이 성립해야 한다.On the other hand, in the case of the self-set reference other-set analysis type, 1) when the final result is generated up to the final result, 2) the result up to the final result is divided into each step, There can be a way. In the latter case, the multi-dimensional analysis operation execution result table generation module 402 (hereinafter, referred to as " multi-dimensional analysis operation result table generation module " ) Has multivalued analysis operation result table data or cube data for calculating an output value for the input value, and 3) the multidimensional analysis operation result table generation module 402 inputs the first and each step result The same conditions must be met as you can get by value.

상기 출원인 A의 문건을 중 다출원 기술 분야별로 다수 인용하는 인용자(출원인, assignee)에 대한 데이터를 얻으려는 경우의 예를 들어서 설명하면 다음과 같다. 상기 결과는 다음과 같은 단계를 거쳐 입수될 수 있다. 1) 출원인 A가 확정되는 단계(출원인 A는 모든 출원인 전체 집합 중의 한 원소이므로, A는 확정된다), 2) 하나의 국가(예를 들면 미국)에서의 A의 문건 집합이 확정되는 단계(예를 들어 A의 전체 출원 문건 집합이라 하자), 3) A의 모든 문건 집합에는 IPC나 USPC 등이 적어도 하나 이상 부여되어 있으므로, 선택된 특허 분류 기호의 종류와 레벨(예를 들어 IPC, subclass레벨)이 주어진다면 그 주어진 값을 기준으로, 주어지지 않는다면 모든 특허 분류 기호의 종류 및 모든 레벨에서), 각 종류 및 레벨에 속하는 문건 부분 집합(예를 들면, IPC H04N, H01B 등등)이 확정되는 단계, 4) 기술 분야별로 문건 부분 집합을 카운팅하여 다출원 기술 분야의 랭킹을 추출하는 단계, 5) 각 문건 부분 집합별로(예를 들면, A의 모든 문건 중 IPC H04N에 대하여) 그 문건 집합에 속하는 개별 문건을 추출(예를 들면 문건 #1,...#n)하는 단계, 6) 각 개별 문건별로 이 문건 번호를 인용하고 있는 타문건 번호들을 추출하는 단계, 7) 추출된 타문건 번호의 출원인을 추출하는 단계, 8) 추출된 타문건 번호의 출원인을 상기 부분 문건 집합별로 합하는 단계(출원인의 빈도값은 함께 저장된다), 9) 상기 합해진 출원인을 빈도 순으로 상기 기술 분야별(특허 기술 분류의 종류 및 레벨)별로 저장하는 단계(이때, 각 출원인별 문건의 개수(및/또는 타문건 번호 정보)도 함께 저장된다)를 거치면 출원인 A의 문건 중 다출원 기술 분야(특허 기술 분류 및 레벨)에 따른 다수 인용하는 출원인을 확정해 놓은 수 있다. 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 모든 출원인 중에서 선택되는 어느 하나 이상의 출원인을 대상으로 하여 상기 1) ~9)까지의 단계를 수행한 다음 그 결과값을 데이터로 저장해 놓을 수 있다. 상기 1~2) 및/또는 3)는 최초 자기 입력에 대응되는 분석 대상 문건 집합의 확정과 관련되어 있고, 4)~5)는 최초 자기 입력 기준에 대한 자기 자신의 분석에 대응되며, 6)은 분석 결과를 얻기 위한 타문건 집합의 생성과 관련되며, 7) ~9)는 최종 타출력에 대응되는 최종 분석 결과의 생성과 관련된다. 이 경우, 상기 분석 모듈은 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성해 놓은 최종 분석 결과를 입수되는 조건에 맞게 불러와서 보여 주면 된다. 상기 조건은 상기 사용자 또는 기타로부터 입력되는 IPC 등의 특허 기술 분류의 종류 및 레벨, 국가 등이 될 것이다.Hereinafter, an example will be described in which the applicant A obtains data on an applicant (applicant, assignee) citing a plurality of documents by application field. The above results can be obtained through the following steps. 1) the stage where Applicant A is confirmed (since Applicant A is one of the entire set of all applicants, A is confirmed); 2) the stage in which the document set of A in one country (eg, the United States) (Eg, IPC, subclass level), because all the document sets of A have at least one IPC or USPC, and so on. (For example, IPC H04N, H01B, etc.) belonging to each class and level are established, and if not given, given the given value, (B) extracting the rankings of the multi-application technology fields by counting the subset of documents for each technical field; (5) for each subset of documents (eg for IPC H04N of all documents of A) (6) extracting other document numbers citing the document number for each individual document, (7) extracting the document number of the extracted other document number (8) collecting the applicants of the extracted other document numbers by the partial document set (the applicant's frequency values are stored together), 9) sorting the combined applicants by the technical field (And / or other document number information) for each applicant is also stored), the applicant A's documents are classified according to the multi-filed technical field (patent technical classification and level) A number of applicants may be identified. The multidimensional analysis operation result table generation module 402 may perform the steps 1) to 9) with respect to any one or more applicants selected from among all applicants, and then store the resultant values as data . 4) to 5) correspond to the self-analysis of the first self-entry criterion, and 6) to the first self- 7) to 9) relate to the generation of final analysis results corresponding to the final output. In this case, the analysis module may fetch the final analysis result generated by the multidimensional analysis calculation execution result table generation module 402 according to the obtained conditions. The condition may be the type and level of the patent technical classification such as the IPC inputted from the user or the like, the country, and the like.

상기 1) ~9)의 단계가 상기 다단계형 분석을 거칠때는 다음과 같은 그룹화된 다단계가 가능할 것이다. 이 경우 상기 분석 모듈은 다음과 같은 방식으로 작동 될 수 있을 것이다.When the steps 1) to 9) are performed through the multistage analysis, the following grouped multistage method will be possible. In this case, the analysis module may be operated in the following manner.

첫째, 상기 분석 모듈이 출원인 A의 다출원 기술 분야별 문건 집합을 확정하는 단계이다. 이 때 상기 분석 모듈은 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 모든 출원인에 대하여 1)~4)의 과정을 수행하여 다출원 기술 분야의 랭킹 및 그 기술 분야별로의 문건 부분 집합의 수를 확정해 놓은 데이터로부터 출원인 A에 관한 데이터를 입수할 수 있다.First, the analysis module establishes a document set for each application field of applicant A. At this time, in the analysis module, the multidimensional analysis calculation execution result table generation module 402 performs the processes of 1) to 4) for all applicants to determine ranking of the multi-application technology field and a subset of documents Data on applicant A can be obtained from the number-confirmed data.

둘째, 상기 분석 모듈의 문건 집합 입수 모듈이 상기 5)의 문건 번호를 입수하는 단계이다. 상기 문건 부분 집합의 수가 있을 때, 상기 모든 수는 SQL 쿼리에 대응될 수 있으므로, 상기 SQL 쿼리로 상기 5)의 문건 번호를 확정할 수 있다. 상기 문건 번호는 사전에 다차원 분석용 연산 수행 결과 테이블 또는 큐브 데이터에 저장되어 있을 수도 있지만, 저장하지 않는 것이 합리적일 것이다.(SQL로 언제든지 불러 낼 수 있기 때문이다). 상기 문건 집합 입수 모듈은 기설정된 조건에 따른 문건 집합을 입수하여 합집합 연산을 수행하는 모듈을 말한다. 이 경우 SQL이 문건 집합을 입수하기 위한 조건에 해당된다.Second, the document collection acquisition module of the analysis module obtains the document number of 5). When there is a number of the document subset, all the numbers can correspond to the SQL query, so that the document number of the 5) can be determined by the SQL query. The document number may be stored in advance in the multidimensional analysis operation execution result table or the cube data, but it would be reasonable not to store it (because it can be called up by SQL at any time). The document collection module receives a document set according to predetermined conditions and performs a union operation. In this case, SQL is the condition for obtaining the document set.

셋째, 상기 분석 모듈의 상기 문건 집합 입수 모듈이 상기 6)단계를 수행한다. 상기 문건 집합 입수 모듈이 상기 5) 단계의 문건 번호를 서지 사항 DB에 질의하거나, 검색 엔진에 질의함으로써 6) 단계의 타문건 집합이 입수된다. 물론, 상기 서지 사항 DB에는 각 문건은 자신이 인용하고 있는 타문건 번호에 관한 정보(reference 정보)를 포함하고 있거나, 상기 검색 인덱스(401-2)에 특정 문건 번호를 질의했을 때, 질의된 특정 문건 번호를 상기 인용/레퍼런스(reference) 정보로 포함하고 있는 문건 번호를 출력해 줄 수 있어야 한다.(즉, 인덱싱을 수행할 때 a문건에 인용 정보로 #1, #2 문건 번호가 있을 때, 검색 인덱서(401-3)가 인용 필드의 검색을 위해서 a <- #1, a <- #2와 같이 역파일 정보를 생성하여 검색 인덱스(401-2)로 처리해 놓은 경우, 검색 엔진의 인용 검색 필드에 #1이라는 문건 번호를 입력하면, 자신을 인용하고 있는 문건 a를 포함한 여러 문건을 출력해 주게 된다.)Third, the document collecting module of the analysis module performs step 6). The document collection module receives the document number of step 5) or queries the search engine to obtain the document collection of step 6). Of course, in the bibliographic database, each document includes information (reference information) about the other document number that the user himself / herself is referring to, or when a specific document number is inquired to the search index 401-2, (That is, when indexing is performed, a document has # 1 and # 2 document numbers as citation information), the document number should be able to be output as the reference number. When the search indexer 401-3 generates the inverse file information such as a <- # 1 and a <- # 2 for the search of the quoted field and processes it as the search index 401-2, If you enter the document number # 1 in the field, it prints a number of documents including the document a that quotes itself.)

넷째, 상기 6)단계를 통하여 분석의 대상이 되는 타문건 집합이 생성되고, 이렇게 생성된 타문건 집합에 대한 7) 이후의 각종 분석은 본 발명의 직접 분석 모듈이 수행한다. 상기 분석 모듈은 상기 분석 모듈에 포함되어 있는 직접 분석 모듈을 호출하여 상기 7)~9)를 수행한다. 상기 직접 분석 모듈은 주어진 임의의 문건 집합에 대하여 문건 집합이 주어질 때, 기설정된 분석 프로세서를 수행하는 분석 엔진을 말한다.(사전에 기설정될 수 없는 문건 집합, 기설정되기에는 문건 집합의 경우의 수가 너무 많은 경우에 그 문건 집합의 분석 결과를 사전에 생성해 놓는 것은 불가능하거나 불합리하다. 그러므로, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 4) 단계까지만 생성해 놓는 것이 합리적이다.) 직접 분석 모듈이 수행하는 역할은 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 수행하는 역할과 대등하다. 즉, 상기 5)의 단계를 통하여 Fourth, a set of other documents to be analyzed is generated through the step 6), and various analyzes after the generated document set 7) are performed by the direct analysis module of the present invention. The analysis module calls the direct analysis module included in the analysis module to perform steps 7) to 9). The direct analysis module refers to an analysis engine that performs a predetermined analysis process when given a set of documents for a given set of documents (ie, a set of documents that can not be pre-set, It is impossible or unreasonable to preliminarily generate the analysis result of the document set in the case where the number is too large. Therefore, it is reasonable that the multidimensional analysis operation execution result table generation module 402 generates only the step 4). ) The role performed by the direct analysis module is equivalent to the role performed by the operation result table generation module 402 for the multidimensional analysis. That is, through the step 5)

상기 직접 분석 모듈 및 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 분석 대상 문건 집합이 입력되는 최초 단계만이 다를 뿐이며, 일단 문건 집합이 입수된 후 진행되는 분석 결과 데이터 생성 과정은 동일하다.The direct analysis module and the operation result table generation module 402 for multidimensional analysis are different from each other only in the initial stage in which the document set to be analyzed is input, and the analysis result data generation process performed after the document set is once acquired is the same .

자기 집합 기준 타 집합 분석의 유형Types of self-set criteria Other sets of analysis

상기 자기 집합 기준 타 집합 분석에 대해 더욱 상세하게 설명한다. 상기 자기 집합 기준 타 집합 분석의 첫번째 단계는 자기 집합을 대상으로 기준 추출하는 것이다. 상기 기준에는 자기 문건 집합을 기준으로 하여 1) 출원이나 등록이 많은 레벨별 특허 분류 기호, 2) 집중율/점유율/활동율이 레벨별 특허 분류 기호, 3) 국내/해외 패밀리 사이즈가 큰 레벨별 특허 분류 기호, 4) 피인용이 많은 레벨별 특허 분류 기호, 5) 융합이 다수 발생하는 레벨별 특허 분류 기호, 6) 청구항의 평균 개수 등과 같은 각종 카운팅 값이 높은 레벨별 특허 분류 기호 등이 있을 수 있다. 상기 1) ~6)는 기설정된 기간(예를 들어 출원일 기준 최근 7개년 간 등)별로 추출할 수도 있을 것이며, 상기 1) ~6)는 상기 출원수/등록수, 점유율/집중율/활동율, 패밀리사이즈, 피인용, 융합의 계량화된 수치의 변동값(증감율, 변화속도 등)이 기설정된 기준을 넘는 경우를 조건으로 추출할 수도 있을 것이며, 상기 1) ~6)을 국가 단위로도 추출할 수 있을 것이다.The above-mentioned self-set reference other-set analysis will be described in more detail. The first step of the above-mentioned self-set reference other-set analysis is to extract a reference to a self-set. Based on the self-document set, the above criteria include 1) a patent classification symbol for each level with many applications or registrations, 2) a patent classification symbol for each concentration level / occupancy rate / activity rate, and 3) A patent classification code for each level, 4) a patent classification code for each level, 5) a patent classification code for each level in which a lot of convergence occurs, and 6) an average number of claims, etc. . The above 1) to 6) may be extracted for each predetermined period (for example, the last 7 years based on the filing date), and 1) to 6) (1) to (6) above may also be extracted on a nation-by-country basis, provided that the variation value (increase / decrease rate, change rate, etc.) of the family size, It will be possible.

도 17 등에 포함된 랭킹 1위 H01L, 2위 H04N 등은 자기 집합을 대상으로 하여 추출된 기준의 일례를 보여 준다. 상기 자기 집합은 본 명세서에서 설명하는 문건 집합의 확정에 따라 확정된 문건 집합을 의미한다.The ranking first place H01L, the second place H04N, etc. shown in Fig. 17 and the like show an example of the criterion extracted for the self-assembly. The self-set refers to a set of documents determined according to the determination of the document set described in this specification.

둘째 단계는, 상기 자기 집합을 대상으로 추출된 기준에 대하여, 분석 결과를 생성하는 단계이다. 상기 분석 결과를 생성하기 위하여 상기 분석 주제별 분석 모듈은 기설정된 분석 질의식으로 기설정된 다차원 분석용 연산 수행 결과 테이블에 질의한다. 이때, 상기 다차원 분석용 연산 수행 결과 테이블은 상기 자기 집합이 아닌 집합 또는 자기 집합이 포함될 수도 있는 타 전체 집합을 대상으로 생성된 것이다. 수 있다. 예를 들어, 자기 집합이 아닌 집합의 경우에는 1) 자기 집합이 제 1국 기준의 집합일 경우, 제1국이 아닌 제 2국의 문건 집합이 대표적이다. 한편, 상기 자기 집합이 포함될 수도 있는 타 전체 집합에는 인용 분석에서 활용되는 i) 전체 전방 인용 문건 집합, ii) 전체 후방 인용 문건 집합을 대상으로 한 1) 전체 전방 인용 문건 집합, 2) 전체 후방 인용 문건 집합이 대표적이다. The second step is a step of generating an analysis result for a criterion extracted for the self-set. In order to generate the analysis result, the analysis subject-specific analysis module queries the operation result table for multidimensional analysis predetermined by a predetermined analysis quality consciousness. At this time, the operation result table for the multidimensional analysis is generated for a set other than the self-set or a whole other set that may include a self-set. . For example, in the case of a set that is not a self-set, 1) When the self-set is a set of the first country, the document set of the second country is representative. On the other hand, the other whole sets, which may include the above-mentioned self-aggregation, include (i) a set of entire frontal citation documents used in citation analysis, (ii) a set of frontal citation documents, A collection of documents is representative.

한편, 자기 집합 기준 타 집합 분석의 대표적인 예로 경쟁 분석이 있다. 출원인 단위 경쟁 분석을 예로 들면서 경쟁 분석을 설명한다. 제 1국의 특정 출원인의 전체 또는 일부의 문건 집합을 대상으로 하여, 상기 문건 집합을 자기 집합으로 하여 적어도 하나 이상의 특허 분류 기호 레벨 단위(예를 들면 IPC 서브클래스)에 서 상기 출원인의 다출원/다등록 기준 IPC, 고집중율, 고점유율, 고활동율 또는 기타 특허 지표별로 랭킹이 높은 IPC를 추출할 수 있을 것이다. 상기 추출한 IPC에 대하여 제1국 또는 제2국에서 상기 다출원/다등록 출원인, 집중율이 높은 출원인 점유율이 높은 출원인, 고활동율이 높은 출원인 또는 기타 특허 지표가 높은 출원인의 랭킹을 추출할 수 있을 것이다. 한편, 출원인 단위가 아니더라도 출원인의 발명자 단위, 발명자 단위, 대리인 단위 등의 각종 단위별로 본 문단에서 출원인 단위에서 수행한 정보 처리를 동일하게 하여, 그 단위별로의 경쟁 분석을 수행할 수 있다. 물론, 상기의 타 집합 분석 결과를 생성하기 위하여 본 발명의 분석 모듈은 IPC 각 레벨별로 출원/등록 총량, 점유율, 집중율, 활동율이 다차원 연산되어 포함되어 있는 다차원 분석용 연산 수행 결과 테이블에 기설정된 분석 질의식으로 접근하여 목적하는 분석 결과를 추출해 낸다.On the other hand, there is competition analysis as a representative example of the other set analysis of the self-collection standard. Describe competition analysis by taking applicant unit competition analysis as an example. A plurality of patent applications of the applicant in at least one or more patent classification level units (for example, an IPC subclass) with the document set as a self-assembly, targeting a set of documents of all or a part of specific applicants of the first country, It is possible to extract IPCs with high ranking according to multi-registration standard IPC, high concentration ratio, high occupancy rate, high activity rate or other patent indicators. It is possible to extract the rankings of the above-mentioned multi-application / multi-registration applicant, the applicant having a high concentration rate, the applicant having a high occupancy rate, the applicant having a high activity rate or the applicant having a high patent index in the first country or the second country with respect to the extracted IPC There will be. On the other hand, even if it is not the applicant unit, it is possible to carry out the competition analysis for each unit of the applicant's unit, the inventor's unit, and the representative unit by the same processing of the information carried out by the applicant unit in this paragraph. Of course, in order to generate the above-mentioned other set analysis result, the analysis module of the present invention includes a plurality of IPC calculation levels for each level of the IPC, including a total of application / registration amount, occupancy rate, concentration rate, The analysis results are retrieved by approaching the set analytical quality consciousness.

상기 분석 모듈은 분석 주제별 분석 쿼리식으로 해당 다차원 분석용 연산 수행 결과 테이블에 질의하여 분석 결과를 입수하여 출력한다. 이때, 상기 분석 결과의 출력은 각 언어별로 출력되는 것이 바람직하다. The analysis module queries the operation result table for the corresponding multidimensional analysis in the analytical subject-based analysis query expression, obtains the analysis result, and outputs the analysis result. At this time, the output of the analysis result is preferably output for each language.

한편, 수치값이 주어졌을 때, 그 수치값의 변화값(증감율, 속도, 가속도) 및 통계값(평균과 표준편차 등)도 유용한 정보가 될 수 있다. 이때, 1) 그 변화값과 통계값까지 다차원 분석용 연산 수행 결과 테이블 에 생성해 놓는 방법도 적용될 수 있으며, 2) 상기 분석 모듈 단에서 입수된 수치값으로 즉석에서 계산하는 방법이 있을 수 있다. 이때, 그 수치값이 단순한 출원/등록량 등과 같은 카운팅값인 경우 2)의 방법이 용이할 수 있으며, 점유율, 집중율 및 활동율과 같은 비율값 인 경우 1)의 방법이 더 용이할 수 있을 것이다. 예를 들어, A 기업의 H04N 7/02 기술 영역에서 2000년도부터 2005년까지의 출원 총계나 출원수 증감율 등은 그 수치값이 카운팅 정보이므로, 연도별 출원 수만으로 용이하게 계산될 수 있다. 한편, 2000년도부터 2005년도까지의 A 기업의 H04N 7/02 영역에서의 연도별 점유율, 집중율, 및 활동율 수치값으로, 평균값(비율값의 총계는 의미가 없음)을 계산하는 경우, 비율값의 합산이 의미가 없으므로, 각 분석 지표에 대응되는 공식을 연도별이 아니라, 2000년부터 2005년까지의 연도 통합적으로 적용해야 한다. 그러므로, 비율값의 계산은 상기 2)의 방법과 같이 사전에 다차원 분석용 연산 수행 결과 테이블의 생성 시 함께 생성되는 것이 바람직할 것이다.On the other hand, when a numerical value is given, the change values (increase / decrease rate, acceleration, acceleration) and statistical values (average and standard deviation, etc.) of the numerical value can also be useful information. In this case, 1) a method of generating the change value and the statistic value in the operation result table for multidimensional analysis may be applied, and 2) there may be a method of instantly calculating the numerical value obtained from the analysis module. At this time, if the numerical value is a counting value such as a simple application / registration amount, the method 2) may be easy, and if the ratio value such as the occupancy rate, the concentration rate and the activity rate, will be. For example, the total number of applications, the number of applications, and the rate of increase / decrease in the number of applications from 2000 to 2005 in the technology field of Company A H04N 7/02 can be easily calculated by the number of applications per year since the numerical values are counting information. On the other hand, when the average value (the sum of the ratio values is meaningless) is calculated by the numerical value of occupancy rate, concentration rate, and activity rate in year H04N 7/02 of company A from 2000 to 2005, Since the sum of the values does not make sense, the formula corresponding to each analytical index should be applied integrally throughout the year from 2000 to 2005, not by year. Therefore, it is preferable that the ratio value is generated at the time of generation of the operation result table for the multidimensional analysis in advance as in the method 2).

본 발명의 분석 모듈에는 특허 문건 마스트 DB(202), 특허 분류 기호 마스트 DB(203), 주체 마스트 DB(204) 등과 같이 DB 자체를 대상으로 하는 마스트 DB 분석 모듈이 있다. 상기 마스트 DB 분석 모듈은 주제별로 마련되며, 상기 마스트 DB 분석 모듈이 수행하는 것은 분석 주제마다 어떤 마스트 DB의 어떤 테이블을 조인하여 나온 값을 어떻게 처리하는 지에 대한 정보를 담과 있는 SQL 쿼리문 또는 상기 SQL 쿼리문이 입수한 데이터를 재처리하는 애플리케인션을 포함하고 있다. 상기 마스트 DB 분석 모듈은 서브 쿼리(sub query)를 다수 사용하여야 하는데, 상기 서브 쿼리는 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성한 것과 동등한 다차원 분석용 연산 수행 결과 테이블의 일부(1개의 레코드 또는 복수개의 레코드) 또는 전부의 내용을 생성하는 역할을 담당한다.The analysis module of the present invention includes a mast DB analysis module that targets the DB itself such as a patent document mast DB 202, a patent classification symbol mast DB 203, and a subject mast DB 204. The mast DB analysis module is provided for each topic. The mast DB analysis module performs an SQL query statement containing information on how to process a value obtained by joining a table of a mast DB for each analysis subject, It contains an application that reprocesses data obtained by SQL query statements. The mast DB analysis module should use a plurality of subqueries. The subquery is a part of the multidimensional analysis operation result table equivalent to the one generated by the multidimensional analysis operation result table generation module 402 One record or a plurality of records) or the entire content.

사용자가 검색어 등을 입력하여 생성되는 사용자 직접 생성 문건 집합을 분 석 대상 문건 집합으로 하는 경우를 예로 들면서 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402) 및 상기 서브 쿼리에 대해서 더욱 상세하게 설명한다. 사용자 직접 생성 문건 집합은 사용자가 생성하기 전에는 문건 집합이 사전에 확정되지 않으므로, 사전에 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)으로 다차원 분석용 연산 수행 결과 테이블을 사전에 생성해 놓을 수 없는 문제가 있다. 그러므로, 사용자 직접 생성 문건 집합을 대상으로 분석을 수행하는 경우에는, 분석 대상 문건 집합 확정 후 다음 2가지 방법 중 어느 한 가지 방법 이상으로 사용한다. 첫째, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 상기 사용자 직접 생성 문건 집합을 대상으로 다차원 분석용 연산 수행 결과 테이블을 생성해 놓고, 상기 다차원 분석용 연산 수행 결과 테이블을 대상으로 각종 분석 주제별 분석 모듈이 해당 분석 질의식으로 상기 테이블에 접근하여, 분석 결과를 생성하는 방법이다. 둘째는 상기 마스트 DB 분석 모듈이 상기 사용자 직접 생성 문건 집합에 포함된 문건에 대하여 적어도 하나 이상의 서브 쿼리를 사용하여 다차원 분석용 연산 수행 결과 테이블에 포함된 데이터와 대등한 데이터(테이블 또는 뷰)를 생성해 놓고, 상기 데이터에 상기 분석 주제별 마스트 DB 분석 모듈이 해당 분석 질의식(메인 쿼리)으로 상기 생성해 놓은 데이터에 접근하여 분석 결과를 생성하는 방법이다. 상기 다차원 분석용 연산 수행 결과 테이블에 포함된 데이터와 대등한 데이터란 말은 롤업 연산 또는 큐브 연산과 같이 적어도 하나 이상의 차원으로 차 원 연산 처리한 결과와 대등한 데이터를 말한다.The multidimensional analysis operation execution result table generation module 402 and the sub-query will be described in more detail by taking as an example a case where a set of user-created documents generated by a user inputting a search word or the like is set as an analysis target document set . Since the document set is not determined in advance before the user creates the document set, the multidimensional analysis operation result table generation module 402 can not generate the multidimensional analysis operation result table in advance there is a problem. Therefore, when analyzing a set of user-created documents, it is necessary to use one or more of the following two methods after confirming the document set to be analyzed. First, the multidimensional analysis calculation execution result table generation module 402 generates a multidimensional analysis calculation execution result table on the user-generated document collection set, and performs various analysis The subject-specific analysis module accesses the table with the corresponding analysis quality consciousness, and generates the analysis result. Second, the mast DB analysis module generates data (table or view) equivalent to the data included in the operation result table for multidimensional analysis using at least one or more sub-queries for documents included in the user-created document set And a mast DB analysis module according to the analysis subject matter accesses the data generated by the analysis query consciousness (main query) to generate the analysis result. The data equivalent to the data included in the operation result table for multidimensional analysis refers to data equivalent to a result of performing a difference operation on at least one dimension such as a roll-up operation or a cube operation.

분석의 방향 계열Direction of analysis series

상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 생성하는 다차원 분석용 연산 수행 결과 테이블은 다음과 같은 계열이 있을 수 있다. 분석이란 본질적으로 특정한 데이터에서 목적하는 바를 입수하는 것이므로, 분석의 종류는 1)분석 대상 문건 집합의 속성과 2) 분석 주제에 의해서 결정나게 되며, 부가적으로 3) 분석 결과를 출력해 주는 형태에 따라 결정나게 된다. 그러므로, 상기 1), 2) 및/또는 3)에 따라 분석 종류별로 이름붙이기(naming)이 되게 된다. The multi-dimensional analysis operation result table generated by the multi-dimensional analysis operation result table generation module 402 may have the following sequence. Analysis is basically to obtain the desired data from specific data, so the type of analysis is determined by 1) the attribute of the document set to be analyzed, 2) the subject of analysis, and 3) the form of outputting the analysis result It is decided according to. Therefore, naming is performed according to the types of analysis according to the above 1), 2) and / or 3).

이하, 각종 분석 종류를 예시적으로 설명하면서, 상기 1), 2), 3)과 관련 지어 본다.Hereinafter, the various kinds of analysis will be exemplarily described and related to the above 1), 2) and 3).

첫째, 분석 대상 문건 집합의 속성은 본 명세서에서 다수 언급한 적이 있는 문건 집합이 확정되었을 때의 그 확정의 기준에 따라 결정된다. 출원인, 발명자, 특정 레벨의 특허 분류 기호, 전방 인용 문건 집합 등등은 모두 분석 대상 문건 집합의 속성에 따른 것이다. 이들 분석 대상 문건 집합은 본 발명의 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)이 입수하는 문건 집합의 단위의 한 예가 된다.First, the attributes of the document set to be analyzed are determined according to the criteria of the determination when a set of documents mentioned in this specification has been identified. The applicant, the inventor, the patent classification code of a certain level, the set of forward quotation documents, etc. all depend on the attribute of the document set to be analyzed. These sets of analysis target documents are an example of a unit of a document set obtained by the multidimensional analysis calculation execution result table generation module 402 of the present invention.

둘째, 분석 주제에 따른 분류로, 상기 분석 주제에 따라, 상기 다차원 분석용 연산 수행 결과 테이블 생성 모듈(402)은 어떠한 다차원 분석용 연산 수행 결과 테이블을 생성할 것인지와 상기 다차원 분석용 연산 수행 결과 테이블에 어떠한 분 석 질의식으로 질의할 것인지를 결정하게 된다.According to the analysis subject, the multidimensional analysis calculation execution result table generation module 402 determines which multidimensional analysis calculation execution result table is to be generated and the multidimensional analysis calculation execution result table 402, Which will determine what kind of analytical ques- tion will be asked.

셋째, 분석 결과를 출력해 주는 형태에 따른 분류로 이에는 다음과 같은 2가지가 있다. 1) 단일 분석과 2) 비교 분석이 있다. 단일 분석에는 한 종류의 분석 결과를 1개의 단일한 기준으로 제시해 주는 것이며, 비교 분석은 한 종류(동일한 종류)의 분석 결과를 2개 이상의 기준으로 제시해 주는 것이다. 예를 들어 2 이상의 출원인별/기업별(예를 들면 삼성전자와 LG전자)로 동일한 분석 결과(예를 들면 연도별 IPC 레벨별 특정 국가에서의 출원수)를 함께 제시해 주는 출원인별/기업별 비교 분석의 가장 단순한 예가 된다. 이때, 상기 예에서 연도별 IPC 레벨별 특정 국가에서의 출원수는 상기 2개 이상의 출원인별/기업별로 동일한 종류의 분석 결과가 됨을 알 수 있다. 비교 분석 결과는 비교 대상의 선택에 따라 사용자에게 최종적으로 출력해 주는 결과가 달라지므로, 사용자로부터 비교 대상의 선택을 입수 받고, 각 선택 대상별로 상기 다차원 분석용 연산 수행 결과 테이블에 기설정된 분석 주제별 분석 질의식으로 질의한 다음, 각 선택 대상별로의 분석 결과를 결합하면 된다. 상기 예에서는 삼성전자 및 LG전자에 대해서 각각 동등한 분석 결과를 생성하고, 이를 비교하는 방식으로 결합하면, 비교 분석 결과값이 생성되게 된다.Third, there are two kinds of classification according to the type that outputs the analysis result. 1) single analysis and 2) comparative analysis. In a single analysis, one kind of analysis result is presented as one single criterion, and comparative analysis is a kind of (same kind) analysis result is presented as two or more criteria. For example, a comparison of applicants / companies by two or more applicants / companies (eg, Samsung Electronics and LG Electronics) showing the same analysis result (for example, the number of applications in certain countries by IPC level by year) It is the simplest example of analysis. In this case, it can be seen that the number of applications in a specific country by IPC level by year is the same type of analysis result for each of the two or more applicants / companies. Since the result of the final output to the user is changed according to the selection of the comparison object, the selection result of the comparison object is obtained from the user, and the analysis subject matter analysis You can query by quality consciousness, and then combine the analysis results for each selected item. In the above example, if the same analysis results are generated for Samsung Electronics and LG Electronics respectively and are combined in a manner of comparing them, a result of the comparative analysis is generated.

감시 모듈(403)The monitoring module (403)

특허 정보의 감시란, 업데이트 되는 특허 정보를 대상으로한 감시(watching, monitoring, alerting) 활동을 말한다.Monitoring of patent information refers to monitoring, monitoring, and alerting activities on updated patent information.

감시 서비스를 수행하기 위해서는 1) 감시를 위한 검색식을 등록받는 단계; 기설정된 시간 단위로 상기 검색식을 검색 엔진 또는 DBMS(201)에 질의하고(질의할 때는 신규 문건을 찾는 것이 목적이므로, 시간 범위(예를 들면 공개 일자 기준으로 검색 질의하는 일자 포함 기준으로 과거 6일을 시간 범위로 한정함)을 검색식에 한정하는 것이 일반적이다.), 신규 특허 문건이 있는 경우 이를 감시 서비스 받은 자에게 전송하는 단계를 거칠 수 있다. 감시 서비스의 특별한 형태는 도 ***에 있는 기술, 기업, 제품 등의 분류 체계를 트리 구조로 구성하고, 트리를 구성하는 각 노드(node)에 감식 검색식을 결합하는 방법을 제시할 수 있다.To perform the monitoring service, 1) registering a search formula for monitoring; (For example, searching for a new document on the basis of a date including the date of inquiry on the basis of a public date), the query is searched in the search engine or the DBMS 201 The work is limited to the time range) is limited to the search formula), and if there is a new patent document, it may be transmitted to the surveillance service recipient. A special form of surveillance service can be constructed by constructing a tree structure of the classification system of technology, enterprise, product, etc. shown in Fig. ***, and combining a keyword search expression with each node constituting the tree .

본 발명의 감시 모듈(403)에 입력되는 감시 검색식은 출원인 명의 단위 특허 정보 서비스 시스템 및/또는 발명자 명의 단위 특허 정보 서비스 시스템(1)에 사전에 세팅이 되어 있는 것이 특징이다. 즉, 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명자 명의 단위 특허 정보 서비스 시스템을 생성할 때, 또는 생성 이후라도 사용자의 개입 없이, 상기 출원인 명의 단위에서 필요한 감시 검색식 또는 상기 발명자 명의 단위에서 필요한 감시 검색식이 자동으로 추출되어 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명자 명의 단위 특허 정보 서비스 시스템의 감시 서비스에 세팅이 되게 된다. 상기 세팅되는 감시 검색식의 종류 및 상기 세팅되는 감시 검색식을 생성하는 방법은 다음과 같은 것이 있을 수 있다. 먼저 문건 집합 단위에서 사전에 세팅되는 감시 검색식의 종류 및 생성 방법을 설명한다. 상기 감시 검색식에 포함될 정보는 상기 본 발명의 분석 모듈에서 생성되는 분석 결과인 경우가 다수 있다. 상기와 같은 예로는 상기 수치값, 상기 비 율값 등의 최빈값, 최대값 등을 들 수가 있다. 구체적인 예는 하나씩 설명한다.The monitoring search formula input to the monitoring module 403 of the present invention is pre-set in the unit patent information service system of the applicant and / or the unit patent information service system 1 of the inventor. That is, when the unit patent information service system of the applicant's name or the unit patent information service system of the inventor is created, or after the generation thereof, the necessary surveillance search expression or the unit of the inventor's name in units of the applicant's name The search formula is automatically extracted and set in the monitoring service of the unit patent information service system of the applicant's name or the unit patent information service system of the inventor's name. The type of the monitoring search formula to be set and the method of generating the monitor search formula may be as follows. First, the type and generation method of the monitoring search formula set in advance in the document set unit will be described. The information to be included in the monitoring search formula may be the analysis result generated by the analysis module of the present invention. Examples of such values include a mode value such as the numerical value and the rate value, a maximum value, and the like. Specific examples are described one by one.

출원인 명의 단위 또는 발명자 명의 단위 감시Unit monitoring of applicant's name or inventor's name

첫째, 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 발명자 명의 단위 특허 정보 서비스 시스템의 관리 대상이 되는 문건 집합에서 적어도 하나 이상의 최빈 특허 분류 기호를 빈도 순으로 랭킹을 매겨 추출하고, 상기 최빈 특허 분류 기호를 기설정된 국가 또는 특허 정보 마스트 DB가 입수하는 문건의 발생국(WIPO를 포함한다) 중에서 선택되는 적어도 어느 하나 이상의 국가별로 감시할 검색식으로 등록해 놓을 수 있다. First, at least one or more of the most preferred patent classification symbols are ranked in order of frequency from the document set to be managed by the unit patent information service system of the applicant's name or the unit patent information service system of the inventor's name, And may be registered as a search formula to be monitored by at least one country selected from the set country or the country where the document acquired by the patent information mast DB (including WIPO).

출원인 명의로 특정되는 특허 문건 집합에서 적어도 하나 이상의 최빈 특허 분류 기호에 대해서 1) 국내외에서 가장 많은 특허를 출원하거나 등록하는 자, 2) 국내외에서 일정 개수 이상을 출원하면서도 상기 최빈 특허 분류 기호에 집중율이 높은 자, 3) 국내외에서 기설정된 최근 기간 내에 특허 출원 속도나 특허 등록 속도가 기설정된 기준 이상이 되는 출원인을 추출하고, 상기 출원인을 감시 검색식으로 할 수 있다.1) at least one patent application filed or registered at home and abroad, 2) a patent application filed on or more than a certain number of domestic patents, 3) The applicant whose patent application rate or patent registration rate is higher than the predetermined standard within the predetermined period set in domestic and foreign countries can be extracted and the applicant can be made into the monitoring search formula.

적어도 하나 이상의 최빈 특허 분류 기호에 대해서 상기 특허 분류 기호 및 상기 특허 분류 기호의 하위 특허 분류 기호에 대응되는 기설정된 최근 기간 내의 신규 문건 수의 증가 속도가 높은 특허 분류 기호를 감시 검색식으로 할 수 있다.A patent classification code having a high rate of increase in the number of new documents within a predetermined recent period corresponding to the patent classification code and the subpublic classification code of the patent classification code for at least one or more of the most preferred patent classification codes can be made a monitoring search formula .

적어도 하나 이상의 최빈 특허 분류 기호에 가장 많은 특허를 출원하거나 등록하는 적어도 하나 이상의 국내외의 발명자를 추출해 내고, 상기 발명자를 감시 검색식으로 할 수 있다.It is possible to extract at least one or more domestic and foreign inventors who apply for or register the largest number of patents in at least one or more of the most preferable patent classification codes,

상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 발명자 명의 단위 특허 정보 서비스 시스템의 관리 대상이 되는 문건 집합에서 인용 정보(인용(backward citation), 피인용(forward citation)를 추출해 내고, 국내외적으로 최빈 인용자 및/또는 최빈 피인용자를 추출해 내고, 상기 최빈 인용자 또는 피인용자를 출원인으로 하여 감시 검색식으로 할 수 있다.(Backward citation, forward citation) are extracted from the document set to be managed by the unit patent information service system of the applicant's name or the unit patent information service system of the inventor's name, And / or the most probable applicant can be extracted, and the most probable applicant or applicant can be used as the applicant for the monitoring search formula.

상기 문건 집합의 어느 하나 이상의 개별 특허 문건을 인용하는 자를 감시하기 위하여(상기 특허 문건을 인용하는 자를 찾아 내기 위해서), 상기 문건 집합을 이루는 전체 특허 문건의 출원 번호 또는 등록 번호를 OR 연산자로 묶어 감시 검색식으로 할 수 있다.In order to monitor a person who cites one or more individual patent documents of the document set (in order to find a person citing the patent document), the application number or registration number of the entire patent document constituting the document set is grouped into an OR operator It can be done by a search expression.

상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명자 명의 단위 특허 정보 서비스 시스템이 관리하는 전체 문건 집합 단위의 감시 검색식은 각 감시 주제별로 폴더화 되어 감시 서비스 하위에 존재하면서 사용자에게 노출되는 것이 바람직하다. 예를 들어 설명하면, 상기 첫번째 방법의 감시 검색식의 감시 주제는 "최빈 특허 분류 기호를 통한 특허 감시"라는 이름의 폴더가 만들어 지는 것이 바람직하며, 상기 폴더 내에는 각 국가별로 하위 폴더 내지는 하위 계층 구조가 만들어 지는 것이 바람직할 것이다. 상기 전체 문건 집합 단위의 감시 검색식으로 감시한 결과는 상기 폴더 또는 계층 구조에서 확인되는 것이 바람직할 것이다. 물론, 상기 사전에 세팅되는 검색식을 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명자 명의 단위 특허 정보 서비스 시스템의 관리자 또는 발명자가 수정할 수 있음은 물론이다 하겠다.It is preferable that the monitoring search formula of the whole document set unit managed by the unit patent information service system of the applicant's name or the unit patent information service system of the inventor is folded for each monitoring subject and exposed to the user while being present under the monitoring service. For example, it is preferable that a folder named "Patent surveillance through the best patent classification code" is created in the monitoring subject of the surveillance search formula of the first method. In the folder, It would be desirable for the structure to be made. It is preferable that the result monitored by the monitoring search formula of the entire document set unit is confirmed in the folder or the hierarchical structure. Of course, the search formula to be set in advance can be modified by the manager or the inventor of the unit patent information service system of the applicant or the unit patent information service system of the inventor.

특허 인텔리전스 모듈(60)Patent intelligence module (60)

이하, 본 발명의 특허 인텔리전스 모듈(60)에 대하여 설명한다. 본 발명의 특허 인텔리전스 모듈(60)은 고차원적인 고급 특허 분석을 수행하고, 상기 고급 특허 분석 수행 결과를 저장해 놓거나, 사용자에게 제공하는 기능을 수행한다.Hereinafter, the patent intelligence module 60 of the present invention will be described. The patent intelligence module 60 of the present invention performs a high-level advanced patent analysis, stores the result of the advanced patent analysis, or provides the user with the result of the advanced patent analysis.

상기 특허 인텔리전스 모듈(60)에는 인용 분석 모듈(601-1)과 비교 분석 모듈(601-2)이 포함된 분석 모듈(601), 핵심 분야의 발견과 관련된 핵심 분야 발견 모듈(602), 관련성 있는 집단 정보를 생성해 내는 유관성 발견 모듈(603), red ocean의 정도 등과 같은 문건 집합 단위의 집합적 속성을 찾아내는 문건 집합 단위 집합적 속성 발견 모듈(604), 신규 주체의 발견이나 선발 주체의 발견 등을 수행하는 포트폴리오 분석 모듈(605), 융합 분석 모듈(606), 유망 분야를 찾아내는 유망 분야 분석 모듈(607), 공백 기술을 찾아내는 공북 기술 분석 모듈(608), 특허 전략을 분석하는 특허 전략 분석 모듈(609) 중 어느 하나 이상을 포함하고 있다.The patent intelligence module 60 includes an analysis module 601 including a citation analysis module 601-1 and a comparison analysis module 601-2, a core field discovery module 602 related to discovery of a core field, A relatedness finding module 603 for generating collective information, a document collective attribute discovery module 604 for finding collective attributes of document aggregation units such as the degree of red ocean, a discovery module 604 for discovering a new subject, A fusion analysis module 606, a prospective field analysis module 607 for finding a promising field, a technology analysis module 606 for finding a blank technology, and a patent strategy analyzing a patent strategy. And a module 609. [0060]

상기 핵심 분야 발견 모듈(602)에는 핵심 출원인을 발견하는 핵심 출원인 발견 모듈(602-1), 핵심 발명자 발견 모듈(602-2), 핵심 대리인 발견 모듈(602-3), 핵심 특허 분류 기호 발견 모듈(602-4) 등이 있다. The core field discovery module 602 includes a core applicant discovery module 602-1 for discovering a core applicant, a core inventor discovery module 602-2, a core agent discovery module 602-3, (602-4).

상기 유관성 발견 모듈(603)에는 유관 출원인 발견 모듈(603-1), 유관 발명자 발견 모듈(603-2), 유관 대리인 발견 모듈(603-3), 유관 특허 분류 기호 발견 모듈이 있다. The relatedness finding module 603 includes a related applicant discovery module 603-1, a related inventor discovery module 603-2, a related agent discovery module 603-3, and a related patent classification symbol discovery module.

상기 문건 집합 단위 집합적 속성 발견 모듈(604)에는 red ocean 속성 발견 모듈(604-1)이 있다. The document set collective attribute discovery module 604 includes a red ocean attribute discovery module 604-1.

상기 포트폴리오 분석 모듈(605)에는 신규 주체 발견 모듈(605-1) 및 선발 주체 발견 모듈(605-2)이 있으며, 상기 신규 주체 발견 모듈(605-1)에는 신규 진입 출원인 발견 모듈(605-1-1), 신규 진입 발명자 발견 모듈(605-1-2), 신규 진입 대리인 발견 모듈(605-1-3), 상기 선발 주체 발견 모듈(605-2)에는 선발 출원인 발견 모듈(605-2-1), 선발 발명자 발견 모듈(605-2-2), 선발 대리인 발견 모듈(605-2-3)이 있다. The new subject discovery module 605-1 includes a new entry applicant discovery module 605-1 and a new subject discovery module 605-1. -1, the new entry inventor detection module 605-1-2, the new entry agent detection module 605-1-3, and the selection subject discovery module 605-2 are provided with a selection applicant discovery module 605-2- 1, the inventor finding module 605-2-2, and the selecting agent discovery module 605-2-3.

상기 융합 분석 모듈(606)에는 주체별 기술 융합 패턴 분석 모듈(606-1), 기술 분야별 기술 융합 패턴 분석 모듈(606-2), 국가별 기술 융합 패턴 분석 모듈(606-3), 시기별 기술 융합 패턴 분석 모듈(606-4)이 있다. 본 발명의 상기 복수 특허 분류 기호 관계성 전처리 모듈(3900) 및 융합 분석용 다차원 분석용 연산 수행 결과 테이블(205 -7)의 생성 모듈과 이를 이용한 분석 모듈에서 상기 융합 분석 모듈(606)에 대하여 설명하였다. 상기에서 기술 또는 기술 분야를 IPC나 USPC, FT 등과 같은 특허 분류 기호로 한정하는 경우, 특허 분류 기호별, 출원인/발명자별 특허 분류 기호의 관점에서, 국가별 특허 분류 기호의 관점에서 이에 대해서는 상기에서 충분히 설명하였고, 상기 각각에 대하여 시기에 대한 한정을 할 수 있음을 설명하였다.The fusion analysis module 606 includes a subject fusion fusion pattern analysis module 606-1, a technology fusion fusion pattern analysis module 606-2, a country fusion fusion pattern analysis module 606-3, And a fusion pattern analysis module 606-4. The generation module of the multi-patent classification symbol preliminary processing module 3900 and the multidimensional analysis operation execution result table 205 -7 for fusion analysis of the present invention and the fusion module 606 in the analysis module using the module Respectively. In the case where the technical or technical field is limited to patent classification symbols such as IPC, USPC, FT and the like, from the viewpoint of the patent classification symbol and the patent classification symbol per applicant / inventor, And explained that it is possible to limit the timing for each of the above.

상기 유망 분야 분석 모듈(607)에는 활용 유망 분야 분석 모듈(607-1), 융합 유망 분야 분석 모듈(607-2)이 있다. 상기 특허 전략 분석 모듈(609)에는 특허 전략 평가 지표 분석 모듈(609-1), 특허 활용도 분석 모듈(609-2), 인수 특허 분석 모듈(609-3) 등이 있다.The prospective field analysis module 607 includes a prospective field analysis module 607-1 and a prospective fusion field analysis module 607-2. The patent strategy analysis module 609 includes a patent strategy evaluation index analysis module 609-1, a patent utilization analysis module 609-2, and an acquired patent analysis module 609-3.

출원인 명의 단위 특허 정보 서비스 시스템Unit patent information service system of applicant's name

이어, 본 발명의 특허 정보 시스템의 일 응용예로서 계층적으로 통합된 특허 정보 서비스 시스템 및 개별 단위 특허 정보 서비스 시스템 에 대하여 설명한다.As an application example of the patent information system of the present invention, a hierarchically integrated patent information service system and an individual unit patent information service system will be described.

본 발명의 출원인 명의 단위 특허 정보 서비스 시스템이란, 개별적인 출원인 명의 단위로 된 특허 정보 시스템을 말하며, 상기 출원인 명의 단위 특허 정보 서비스 시스템이 포함하는 특허 문건 정보에는 상기 출원인 명의가 출원인 및/또는 특허권자 필드에 포함되어 있어, 출원인 명의가 특허 문건 집합을 특정하며, 이렇게 특정된 특허 문건 집합을 대상으로 상기에서 설명한 본 발명 사상의 각종 분석, 통계 처리, 정보 추출 등의 가공 데이터가 사전 또는 실시간으로 생성되어 사용자 컴퓨터에 제공되는 것을 특징으로 한다. 본 발명의 발명자 명의 단위 특허 정보 서비스 시스템이란, 개별적인 발명자 명의 단위로 된 특허 정보 시스템을 말하며, 상기 발명자 명의 단위 특허 정보 서비스 시스템은 특정한 발명자와 관련된 특허 문건 집합을 대상으로 본 발명 사상의 각종 분석, 통계 처리, 청부 추출 등의 가공 데이터가 사전 또는 실시간으로 생성되어 사용자 컴퓨터에 제공되는 것을 특징으로 한다.The unit patent information service system of the applicant of the present invention is a patent information system in units of individual applicants, and the patent document information included in the unit patent information service system of the applicant includes the name of the applicant in the applicant and / Processing data such as various analyzes, statistical processing, information extraction, and the like of the present invention described above are generated in advance or in real time on the specified set of patent documents, And is provided to a computer. The inventor's unit patent information service system refers to a patent information system in units of individual inventor's name. The inventor's unit patent information service system includes a patent document set related to a specific inventor, Statistical processing, contract extraction, and the like are generated in advance or in real time and provided to the user's computer.

하나의 출원인 명의 단위 특허 정보 서비스 시스템에는 적어도 하나 이상의 발명자 명의 단위 특허 정보 서비스 시스템이 대응될 수 있음은 물론이다 하겠다. 출원인 명의 단위 특허 정보 서비스 시스템의 하부에는 상기 적어도 하나 이상의 발명자 명의 단위 특허 정보 서비스 시스템이 구성되어 있을 수 있으며 이때, 상기 출원인 명의 단위 특허 정보 서비스 시스템의 관리자가 특정한 발명자의 상기 발명자 명의 단위 특허 정보 서비스 시스템으로의 접근 권한을 관리할 수 있을 것이다. It goes without saying that at least one unit patent information service system of the inventor's name can correspond to the unit patent information service system of one applicant. The unit patent information service system of the inventor may be configured under the unit patent information service system of the applicant. At this time, the manager of the unit patent information service system of the applicant, You will be able to manage access to the system.

본 발명의 개별 단위 출원인 명의 단위 특허 정보 서비스 시스템에 대해서는 도 52에 그 상세한 구성이 나와 있다. 상기 개별 단위의 출원인 명의 단위 특허 정보 서비스 시스템에는 상기 출원인 명의 단위 특허 정보 서비스 시스템을 생성하는 출원인 명의 단위 특허 정보 서비스 시스템 생성 엔진, 발명자 단위의 특허 정보 시스템을 생성하는 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진, 상기 출원인 명의 단위 특허 정보 서비스 시스템 및/또는 상기 발명자 명의 단위 특허 정보 서비스 시스템을 관리하는 관리 모듈, 출원인 명의 단위로 관리되는 출원인 명의 단위 특허 정보 데이터, 발명자 단위로 관리되는 발명자 명의 단위 특허 정보 데이터가 있다.The detailed configuration of the unit patent information service system of the individual unit applicant of the present invention is shown in Fig. In the unit patent information service system of the individual unit, the unit patent information service system generation engine of the applicant's name, which generates the unit patent information service system of the applicant, the unit patent information service system name of the inventor who creates the patent information system of each inventor An engine, a management module for managing the unit patent information service system of the applicant's name and / or the unit patent information service system of the inventor, unit patent information data of applicants managed in units of applicants, unit patent information of the inventor name There is data.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진(1100)에는 출원 명의인이 결정될 때, 상기 출원 명의인을 출원인/특허권자로 하는 적어도 하나 이상의 국가에의 문건을 입수하는 출원 명의인 단위 문건 입수 모듈(1110), 입수된 출원 명의인 단위의 문건에 대한 정보를 데이터(테이블, DB or view에 들어있는 것을 포함한다)로 생성하는 출원 명의인 단위 문건 집합 DB 생성 모듈(1130) 및 출원 명의인 단위로 특허 정보의 통계, 분석 및 감시, 리포팅 등의 본 발명의 인텔리전스 정보를 생성하는 출원인 명의 단위 특허 정보 인텔리전스 생성 모듈(1150)이 포 함되어 있다.In the unit patent information service system generation engine 1100 of the patent owner, when the applicant is determined, the unit document acquisition module 1110 which obtains the document to at least one country having the applicant / patentee as the applicant, A unit document set DB generation module 1130, which is an application name that generates information on the document of the unit of the applicant's name in the data (including the table or DB or view), and the statistical analysis, A unit patent information intelligence generation module 1150 of the applicant who generates the intelligence information of the present invention such as monitoring, reporting and the like.

본 발명의 상기 발명자 단위 특허 정보 서비스 시스템 생성 엔진(1300)은 상기 출원인 명의 단위 특허 정보 서비스 시스템 생성엔진과 마찬가지로 작동하며, 그 하부에 발명자 단위 문건 입수 모듈(1310), 발명자 단위 문건 집합 DB 생성 모듈(1330) 및 발명자 명의 단위 특허 정보 인텔리전스 생성 모듈(1350)을 포함할 수 있다.The inventor's unit patent information service system generation engine 1300 of the present invention operates in the same manner as the unit patent information service system generation engine of the applicant and includes an inventor's unit document acquisition module 1310, (1330) and an inventor's unit patent information intelligence generation module (1350).

본 발명의 관리 모듈(1500)에는 출원인 명의 단위 특허 정보 서비스 시스템을 관리하는 출원 명의인 단위 관리 모듈(1510)과 개별 발명자 단위의 발명자 명의 단위 특허 정보 서비스 시스템을 관리하는 발명자 명의 단위 관리 모듈 및 관리 정보를 포함하고 있는 관리 정보 DB(1550)가 있다. 상기 출원인 명의 단위 관리 모듈에는 출원인 명의 단위의 문건 집합 목록을 관리하는 출원인 명의 단위 문건 집합 관리 모듈(1511), 출원인 명의 단위의 특허 정보 인텔리전스를 관리하는 출원인 명의 단위 특허 정보 인텔리전스 관리 모듈(1513) 및 출원인 명의 단위에 속한 개별 발명자의 권한을 관리하는 발명자 권한 관리 모듈(1515)가 있으며, 발명자 등을 적어도 하나 이상의 단계를 두어 다단계 계층으로 관리하는 다단 계층 생성 관리 모듈(1517)이 있다. 한편, 상기 발명자 단위 관리 모듈(1530)에는 발명자 명의 단위 문건 집합 관리 모듈(1531), 발명자 명의 단위 특허 정보 인텔리전스 관리 모듈(1533)In the management module 1500 of the present invention, the unit management module 1510 of the name of the applicant who manages the unit patent information service system of the applicant, the unit management module of the name of the inventor managing the unit patent information system of the inventor And a management information DB 1550 including the management information DB 1550. The unit management module of the applicant's name includes a unit document set management module 1511 of the applicant who manages the document set list of the applicant's name unit, a unit patent information intelligence management module 1513 of the applicant who manages the patent information intelligence unit of the applicant's name, There is an inventor's rights management module 1515 for managing the rights of the individual inventors belonging to the unit of the applicant's name, and a multi-tier hierarchical generation management module 1517 for managing the inventors and the like as a multi-tier hierarchy with at least one step. Meanwhile, the inventor's unit management module 1530 includes a unit document set management module 1531, an inventor's unit patent information intelligence management module 1533,

한편, 상기 출원인 명의 단위 특허 정보 데이터(1700)에는 출원인 명의 단위 특허 문건 집합 DB(1710) 및 출원인 명의 단위 특허 정보 인텔리전스 DB(1730)가 있으며, 상기 발명자 단위 특허 문건 마스트 DB(1900)에는 발명자 단위 특허 문건 집합 DB(1910) 및 발명자 단위 특허 정보 인텔리전스 DB(1930)가 있다.On the other hand, the unit patent information data 1700 of the applicant includes unit patent document set DB 1710 of applicant's name and unit patent information intelligence DB 1730 of applicant's name. In the inventor's unit patent document mast DB 1900, A patent document set DB 1910 and an inventor unit patent information intelligence DB 1930.

한편, 개별적인 출원인 명의 단위 특허 정보 서비스 시스템은 적어도 하나 이상의 다른 출원인 명의 단위 특허 정보 서비스 시스템을 가질 수 있으며, 상기 다른 출원인 명의 단위 특허 정보 서비스 시스템을 가지는 방법은 상기 다른 출원인 명의 단위 특허 정보 서비스 시스템으로의 링크 정보를 가지고 있거나, 다른 출원인 명의 단위 특허 정보 서비스 시스템이 자신의 관리 권한 하에 있는 방법이 있을 수 있다. 이처럼 상기 다른 출원인 명의 단위 특허 정보 서비스 시스템을 가지는 것은 발명자 단위 특허 정보 서비스 시스템의 주체인 개별 발명자도 가능한 것이다.Meanwhile, a unit patent information service system of an individual applicant may have a unit patent information service system of at least one other applicant, and a method of having the unit patent information service system of another applicant may be a unit patent information service system of another applicant , Or there may be a method in which the unit patent information service system of another applicant is under his / her management authority. Having the unit patent information service system of another applicant as described above is also possible for an individual inventor who is the subject of the inventor's unit patent information service system.

본 발명의 개별 단위 특허 정보 서비스 시스템 생성 엔진은 상기 출원인 명의 단위 특허 정보 서비스 시스템 생성 엔진 및 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진 중 적어도 어느 하나 이상을 포함한다. 상기 출원인 명의 단위 특허 정보 서비스 시스템 생성 엔진에는 출원인 명의 단위 문건 집합 입수 모듈, 출원인 명의 단위 문건 집합 관리 모듈을 포함하며, 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진에는 발명자 명의 추출 모듈, 발명자 명의 단위 문건 집합 입수 모듈 및 발명자 명의 단위 문건 집합 관리 모듈을 포함한다. 본 발명의 출원인 명의 단위 특허 정보 서비스 시스템 생성 엔진과 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 출원인 명의를 입수 받았을 때, 적어도 하나 이상의 국가별로, 상기 국가별 특허 DB에서 상기 출원인 명의로 출원되거나 등록된 모든 특허 문건을 입수하고, 상기 입수된 특허 문건에서 개별 발명자 명의를 자동적으로 추출하고, 상기 개별 발명자 명의마다의 특허 문건을 자동적으로 입수하여, 출원인 명의 단위의 특허 문건 집합과 상기 출원인 명의 하의 개별 발명자 단위의 특허 문건 집합을 생성하고, 상기 출원인 명의 단위 문건 집합과 상기 발명자 단위 문건 집합에 대한 관리를The individual unit patent information service system generation engine of the present invention includes at least one of the unit patent information service system generation engine of the applicant's name and the unit patent information service system generation engine of the inventor's name. The unit patent information service system generation engine of the applicant includes a unit document set acquisition module of the applicant's name and a unit document set management module of the applicant's name. The unit patent information service system generation engine of the inventor's name includes an extraction module of the name of the inventor, An aggregate acquisition module and an inventor's unit document aggregation management module. The unit patent information service system generation engine of the applicant of the present invention and the unit patent information service system generation engine of the inventor have been registered in the patent database for each country by at least one country when the name of the applicant is obtained, It is possible to obtain all patent documents, automatically extract the names of individual inventors from the obtained patent documents, automatically acquire the patent documents for each individual inventor name, and obtain a set of patent documents in units of the applicant's name and individual inventor's units under the applicant's name A set of patent documents of the applicant, and management of the set of patent documents of the inventor

본 발명의 출원인 명의 단위 특허 정보 서비스 시스템은 상기 출원인 명의와 관련된 특허 문건 집합을 포함하고 있다. 상기 특허 문건 집합은 상기 출원인 명의로 출원되거나 등록된 출원인 명의 단위 문건 집합과 상기 출원인 명의 단위의 문건 집합에 포함된 개별 발명자별의 발명자 명의 단위의 문건 집합을 포함한다. 상기 출원인 명의 단위 문건 집합과 상기 발명자 명의 단위 문건 집합은 적어도 하나 이상의 국가별 특허 DB를 포함하는 전체 특허 DB로부터 입수할 수도 있지만, 개별 국가별 특허 DB로부터 입수되는 것이 바람직할 것이다. 이를 위해서 각 국가별로 상기 출원인 식별 코드를 통일시켜 놓았을 경우(출원인 대표명화를 제1국 기준으로뿐만 아니라, 적어도 하나 이상의 제 2국 기준으로도 대표명화를 시키고, 상기 제 1국의 출원인과 제 2국의 출원인이 동일한 주체일 경우에는 동일한 출원인 식별 코드를 할당할 수 있다. 출원인은 제1국 기준 또는 제2국 기준 중 적어도 어느 하나 이상으로 대표명화가 되어 있는 것이 바람직하다.The unit patent information service system of the applicant of the present invention includes a set of patent documents related to the applicant's name. The patent document set includes a document set of the applicant's name registered or registered in the applicant's name and a document set of the inventor's name per individual inventor included in the document set of the applicant's name. The set of unit documents of the applicant and the set of unit documents of the inventor name may be obtained from the entire patent DB including at least one country patent DB, but preferably from the patent database of each country. In order to achieve this, if the applicant's identification code is unified for each country (the applicant's representative name is not only based on the first country but also on the basis of at least one second country, If the applicant of the second country is the same subject, the same applicant's identification code may be assigned. The applicant is preferably represented by at least one of the first country standard or the second country standard.

상기와 같은 과정을 통하여, 특정한 출원인 명의 단위에 속하는 특허 문건 부분 집합을 추출할 수 있게 되고, 상기 특허 문건 부분 집합을 관리 대상 문건으 로 하는 출원인 명의 단위 특허 정보 서비스 시스템이 생성될 수 있게 된다.Through the above process, a partial patent document subset belonging to a unit of a specific applicant can be extracted, and a unit patent information service system in the name of the applicant having a subset of the patent document as a management target document can be created.

도 76은 출원인 명의 단위 특허 정보 서비스 시스템 관리자가 자신이 관리하고 있는 문건 집합에 대해 상기 CRUD를 수행하는 과정에 대한 일실시예적 과정을 도시하고 있다.76 shows an exemplary process of the CRUD executed by the unit patent information service system administrator of the applicant on a document set managed by the unit patent information service system administrator.

상기 관리 모듈은 출원인 명의 단위 특허 정보 서비스 시스템에 관한 관리 권한을 가진 관리자의 접속을 허용하고(S2620), 관리자에 의한 관리 대상 특허 문건 집합에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하며(S2630), 관리자에 의한 발명자 명의 단위 특허 정보 서비스 시스템에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하고(S2640), 관리자에 의한 제2의 출원인 명의 단위 특허 정보 서비스 시스템에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하며(S2650), 관리자에 의한 출원인 명의 단위 특허 정보 서비스 시스템에서 제공하는 서비스에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하고(S2660), 입수된 변동 사항을 저장한다(S2670).The management module permits the administrator having the management authority on the unit patent information service system of the applicant's name (S2620). When there is a change in the management target patent document set by the administrator, the change module obtains the change in the set (S2630 If there is a change in the unit patent information service system of the inventor name by the manager (S2640), the change information is obtained in step S2640. If there is a change in the unit patent information service system of the second applicant by the manager (S2650). When there is a change in the service provided by the unit patent information service system of the applicant by the manager (S2660), the change information is obtained (S2660) and the obtained change information is stored (S2670) .

도 77은 발명자 명의 단위 특허 정보 서비스 시스템 관리자가 자신이 관리하고 있는 문건 집합에 대해 상기 CRUD를 수행하는 과정에 대한 일실시예적 과정을 도시하고 있다.FIG. 77 shows an exemplary process of the CRUD performed by the unit patent information service system administrator of the inventor on a document set managed by the unit patent information service system administrator.

상기 관리 모듈은 발명자 명의 단위 특허 정보 서비스 시스템에 관한 관리 권한을 가진 관리자의 접속을 허용하고(S2720), 관리자에 의한 관리 대상 특허 문건 집합에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하고 (S2730), 관리자에 의한 발명자 명의 단위 특허 정보 서비스 시스템에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하며(S2740), 관리자에 의한 제2의 출원인 명의 단위 특허 정보 서비스 시스템에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하고(S2750), 관리자에 의한 출원인 명의 단위 특허 정보 서비스 시스템에서 제공하는 서비스에 대한 변동 사항이 있을 경우 그 변동 사항을 입수하며(S2760), 입수된 변동 사항을 저장한다(S2770).The management module permits the administrator having the management authority of the unit patent information service system in the name of the inventor (S2720). When there is a change in the management target patent document set by the manager, the management module obtains the change in the management target patent document set (S2730 If there is a change in the unit patent information service system of the inventor's name by the manager (S2740), if there is a change in the unit patent information service system of the second applicant by the manager, (S2750). If there is a change in the service provided by the unit patent information service system of the applicant by the manager (S2760), the change is obtained (S2760) and the obtained change is stored (S2770) .

발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진이 작동하는 방법은 상기 출원인 명의 단위 특허 정보 서비스 시스템 생성 엔진과 동등하며, 다만, 대상 특허 문건 집합이 발명자 단위로 특정되는 것만 다르다.The method by which the unit patent information service system generation engine of the inventor operates is the same as that of the unit patent information service system generation engine of the applicant, except that the target patent document set is specified by the inventor.

발명자별 문건 추출Extracting documents by inventor

발명자 명의 추출 모듈이 생성한 발명자 전체 목록이 있을 경우, 상기 발명자와 상기 발명자가 발명자로 등재되어 있는 특허 문건 부분 집합을 추출한다.If there is a whole list of inventors created by the extracting module of the inventor's name, the inventor and the inventor extract a subset of patent documents listed as inventors.

도 66은 전체 출원인 명의 특허 정보 데이터에서 1개의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.FIG. 66 is a flowchart of a process of creating a unit patent information service system of a single applicant in the patent information data of all applicants and generating a unit patent information service system of the inventor name in each inventor included in a set of patent documents constituting the unit patent information service system of the applicant And a method of performing the above method.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 정보를 입수하고(S1620), 입수된 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색 엔진 또는 DB질의를 통하여 검색하며(S1630), 출원인 명의로 출원되거나 등록된 문건이 있는 가를 판단한(S1640) 다음 있는 경우, 검색된 특허 문건을 관리 대상 문건 집합으로 하는 출원인 명의 단위 특허 정보 서비스 시스템을 생성한 다(S1650). 이어, 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 상기 검색된 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키며(S1660), 상기 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S1670). 만약 문건이 없다면 출원인 명의 단위 특허 정보 서비스 시스템을 생성하지 않는다.In step S1620, the unit patent information service system generation engine of the registered name searches the patent document registered or registered in the name of the acquired applicant through a search engine or a DB query (S1630) (S1640), it is determined whether there is a document to be registered or registered (S1640). Then, the unit patent information service system generation engine of the inventor extracts the inventors from the searched set of patent documents, and corresponds to the patent documents for each inventor (S1660). Then, the patent documents corresponding to the extracted inventors are set as a management target document set A unit patent information service system of the inventor name is created (S1670). If there is no document, the patent information service system of the applicant's name is not created.

도 67은 출원인 명의 목록을 입수하고, 각 목록에 대하여 전체 출원인 명의 단위 특허 정보 데이터에서 목록에 있는 출원인 개수만큼 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고, 상기 각 생성된 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.67 shows a list of applicant names, generates a unit patent information service system of applicants by the number of applicants in the list from the unit patent information data of all applicants for each list, And shows an embodiment of a method of generating a unit patent information service system in the name of the inventor by the inventor included in the set of patent documents constituting the system.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 목록 정보를 입수하고(S1720), 목록에 있는 개별 출원 명의인 정보를 입수하며(S1730), 입수된 개별 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하고(S1740), 문건이 있는 가를 판단하고(S1750) 있는 경우, 검색된 특허 문건을 관리 대상 문건 집합으로 하는 개별 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S1760), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키고(S1770), 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 발명자 명의 단위 특허 정보 서비스 시스템을 생성하며(S1780), 상기와 같은 과정을 상기 입수받은 출원 명의인 목록에 처리하지 않은 출원 명의인이 있는가를 판단 한(S1790), 다음 없을 때까지 상기 출원인 명의 단위 특허 정보 서비스 시스템 생성과 발명자 명의 단위 특허 정보 서비스 시스템 생성을 반복한다.The unit patent information service system generation engine of the patent application obtains the registered information of the applicant's name (S1720), acquires the information of the name of the individual application in the list (S1730), obtains the registered patent document (S1740). If it is judged that there is a document (S1750), a unit patent information service system of an individual applicant's name, which uses the searched patent document as a management target document set, is created (S1760) The information service system generation engine extracts the inventor from the set of patent documents, and corresponds to the patent document for each inventor (S1770). Then, the unit patent information service system of the inventor name, which sets the patent document corresponding to each extracted inventor as the management target document set, is created (S1780), the above process is not performed on the list of the acquired application name (S1790), the unit patent information service system of the applicant's name and the unit patent information service system of the inventor's name are repeated until there is no next applicant.

도 68은 입수된 문건 집합에서 출원인 명의 목록을 입수하고, 각 목록에 대하여 전체 출원인 명의 단위 특허 정보 데이터에서 목록에 있는 출원인 개수만큼 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고, 상기 각 생성된 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.68 is a view for obtaining a list of applicants from the acquired document set, generating a unit patent information service system of the applicant's name as the number of applicants in the list from the unit patent information data of the entire applicant for each list, An inventor's unit patent information service system included in a set of patent documents constituting a unit patent information service system is generated according to an embodiment of the present invention.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원인 명의 단위 특허 정보 서비스 시스템 생성을 위한 문건 집합을 입수하고(S1820), 상기 문건 집합에서 출원 명의인을 추출하고 출원 명의인 목록 정보를 생성한다(S1830)The unit patent information service system generation engine obtains a document set for creating a unit patent information service system in the applicant's name (S1820), extracts the applicant's name from the document set, and generates the applicant's name list information (S1830)

이어, 상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 목록 정보를 입수하고 목록에 있는 개별 출원 명의인 정보를 입수하며(S1840), 입수된 개별 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하고(S1750), 문건이 있는 가를 판단한(S1860) 다음 있는 경우, 검색된 특허 문건을 관리 대상 문건 집합으로 하는 개별 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S1870), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키고(S1880), 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 발명자 명의 단위 특허 정보 서비스 시스템을 생성하며(S1890), 상기와 같은 과 정을 상기 입수받은 출원 명의인 목록에 처리하지 않은 출원 명의인 있는지를 판단하고(S1891) 없을 때까지 상기 출원인 명의 단위 특허 정보 서비스 시스템 생성과 발명자 명의 단위 특허 정보 서비스 시스템 생성을 반복한다.In step S1840, the unit-patent information service system generation engine obtains the registered information of the applicant's name and obtains the information of the individual applicant's name in the list (S1840). The patented information service system generation engine searches the registered or registered patent document (S1850). If it is determined that there is a document (S1860), the unit patent information service system of the individual applicant's name is searched (S1870) and the unit patent information service The system creation engine extracts the inventor from the set of patent documents, responds to the patent document for each inventor (S1880), and creates the unit patent information service system of the inventor's name in which the patent document corresponding to each extracted inventor is a management target document set S1890), the above-described process is applied to the list of the acquired application name Determining whether the holder did not filed and until there is (S1891) and repeats the above patent information services unit of the applicant who created the system and inventors who patent information service system units produced.

도 69은 출원인 명의 단위 특허 정보 서비스 시스템의 생성을 주문 받은 경우, 전체 출원인 명의 단위 특허 정보 데이터에서 1개의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.69 is a flowchart showing a process of generating a unit patent information service system of a single applicant from the unit patent information data of all the applicants when the generation of the unit patent information service system of the applicant is ordered, And shows an embodiment of a method of generating a unit patent information service system in the name of the inventor by the inventor included in the set.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 정보를 외부의 제3자로부터 입수하고(S1920), 입수된 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색 엔진 또는 DB질의를 통하여 검색하며(S1930), 출원인 명의로 출원되거나 등록된 문건이 있는가를 판단하고(S1940) 있는 경우, 검색된 특허 문건을 관리 대상 문건 집합으로 하는 출원인 명의 단위 특허 정보 서비스 시스템을 생성한다(S1950). 이어, 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 상기 검색된 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키며(S1960), 상기 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S1970). 만약 문건이 없다면 출원인 명의 단위 특허 정보 서비스 시스템을 생성하지 않는다.In step S1920, the unit patent information service system generation engine of the patent application obtains the application owner information from the external third party (S1920), searches the patent document registered or registered in the name of the acquired applicant through the search engine or the DB query (S1930). If it is determined that there is a document registered or registered in the name of the applicant (S1940), the unit patent information service system in the name of the applicant is searched (S1950). Then, the unit patent information service system generation engine of the inventor extracts the inventors from the searched set of patent documents and corresponds to the patent documents for each inventor (S1960). Then, the patent documents corresponding to the extracted inventors are set as a management target document set A unit patent information service system of the inventor name is created (S1970). If there is no document, the patent information service system of the applicant's name is not created.

상기 도 66 내지 도 69의 방법은 개별 국가를 특정하지 않거나, 개별 국가를 특정하는 경우나 모두 적용될 수 있을 것이다. 만약 개별 국가를 특정하지 않는 경우라면 출원인 대표명화가 전제되어야 할 것임은 분명하다 할 것이다.(특히 언어가 다른 국가일 경우에는 더욱 그러할 것이다.)The method of FIGS. 66 to 69 may be applied to both cases where the individual country is not specified or the individual country is specified. If it does not specify an individual country, it will be clear that the applicant's representative name should be presupposed (especially if the language is different).

도 70은 국가 단위로 출원인 명의 단위 특허 정보 서비스 시스템을 생성하는 방법에 관한 것이다. 본 국가 단위로 출원인 명의 단위 특허 정보 서비스 시스템을 생성하는 것은 국가 단위의 출원인 명의 단위 특허 정보 데이터를 대상으로 국가 단위의 출원인 명의 단위 특허 정보 서비스 시스템 및 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 것을 본질로 한다. 따라서, 상기 도 66 내지 도 69의 방법은 모두 적용될 수 있다.FIG. 70 relates to a method for generating a unit patent information service system in the name of applicant in a country unit. In order to create a unit patent information service system of the applicant's name in this country unit, it is necessary to create unit patent information service system of unit applicant's name and unit patent information service system of the inventor's name, . Therefore, the methods of FIGS. 66 to 69 can be applied to all of them.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 정보를 입수하고(S2020), 국가 목록을 입수하며(S2030), 입수된 국가 목록에서 개별 국가 단위로 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하고(S2040), 문건이 있는 지 판단하고(S2050) 없는 경우에는 국가 단위에서 완료하며, 있는 경우에는 국가 단위에서 검색된 특허 문건을 관리 대상 문건 집합으로 하는 국가 단위의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S2060), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 국가 단위에서 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키며(S2070), 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 국가 단위의 발명자 명의 단위 특허 정보 서비스 시스템을 생성하며(S2080) 국가 목록에 처리하지 않은 국가가 있는가를 판단하여(S2090) 없는 경우 완료하고, 있는 경우에는 입수된 국가 목록에서 개별 국가 단위로 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하는(S2040) 단계를 다시 수행한다.The unit patent information service system generation engine of the patent application obtains the information of the applicant's name (S2020), obtains the country list (S2030), and registers the patent document registered or registered in the name of the applicant (S2050). If it is determined that there is a document (S2050), the unit patent information service is completed in the country unit. If there is a patent document found in the country unit, (S2060). Then, the unit patent information service system generation engine of the inventor extracts the inventors from the set of patent documents at national level, and copes with patent documents for each inventor (S2070). Then, The inventor's name unit patent information service (S2080), it is judged whether there is a country that has not been processed in the country list (S2090). In the case where there is no such country, the patented document registered or registered in the name of the applicant The step of searching (S2040) is performed again.

도 71은 패밀리 정보를 활용하여 출원인 명의 단위 특허 정보 서비스 시스템 및 발명자 명의 단위 특허 정보 서비스 시스템을 국가 단위로 생성하는 과정의 일 실시예를 보여주고 있다.FIG. 71 shows an example of a process of generating a unit patent information service system of an applicant's name and a unit patent information service system of an inventor in a country unit by utilizing family information.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 정보를 입수하고(S2120), 제1국에서 입수된 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하며(S2130), 문건이 있는가를 판단한(S240) 다음, 없으면 제 1국에서 완료하고 있는 경우 제1국에서 검색된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고(S2150), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 제1국에서 검색된 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키며(S2160), 제1국에서 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2170) 이어, 상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 제 1국에서 검색된 특허 문건의 특허 출원 번호를 기준으로 하는 제2국에서의 패밀리 특허 번호를 입수하며(S2180), 제2국에서의 패밀리 특허 번호로 입수될 수 있는 특허 문건을 관리 대상 문건 집합으로 하는 제 2국에서의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고(S2190), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 제2국에서 입수되는 특허 문건 집합에서 발명자를 추출하 고, 발명자별로 특허 문건을 대응시키며(S2191), 제 2국에서 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 제2국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2193),In step S2120, the unit-patent information service system generation engine of the patent application obtains the information of the applicant's name (S2120), searches for the patent document registered or registered in the name of the applicant acquired in the first country (S2130) S240). If not, the unit patent information service system of the first country's standard applicant name is created (S2150), and the unit patent information system The patent information service system generation engine extracts the inventors from the set of patent documents retrieved from the first station, and maps the patent documents to each inventor (S2160). The patent document corresponding to each inventor extracted from the first country is set as a management target document set (S2170). Then, the unit patent information system The information service system generation engine obtains the family patent number in the second country based on the patent application number of the patent document searched in the first country (S2180), and obtains the patent number that can be obtained as the family patent number in the second country (S2190). Then, the unit patent information service system generation engine of the inventor's name sets the patent document set obtained from the second country to the inventor's patent information service system (S2191), and creates a unit patent information service system in the name of a second-party standard inventor in which the patent document corresponding to each inventor extracted from the second country is a management target document set S2193),

도 72는 우선권 주장 번호를 활용하여 출원인 명의 단위 특허 정보 서비스 시스템 및 발명자 명의 단위 특허 정보 서비스 시스템을 국가 단위로 생성하는 과정의 일 실시예를 보여주고 있다.72 shows an embodiment of a process of generating a unit patent information service system of an applicant's name and a unit patent information service system of an inventor in a country unit by using a priority claim number.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 정보를 입수하고(S2220), 제1국에서 입수된 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하는 단계(S2230), 문건이 있는지를 판단하며(S2240) 없는 경우 제 1국에서 완료하며, 있는 경우 제1국에서 검색된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S2250), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 제1국에서 검색된 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키고(S2260), 제1국에서 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2270).In step S2220, the unit patent information service system generation engine of the patent application obtains the information of the applicant's name (S2220), searching for a patent document registered or registered in the name of the applicant acquired in the first country (S2230) (Step S2240). If there is no document, the first patent information service system of the first country, which is completed in the first country and, if there is a patent document searched in the first country, is set as the management target document set (S2250) The unit patent information service system generation engine of the inventor's name extracts the inventors from the set of patent documents retrieved from the first station, and corresponds to the patent documents for each inventor (S2260). The patent documents corresponding to the inventors extracted from the first station are managed A unit patent information service system of the first inventor name of the first country is created (S2270).

이어, 상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 제 1국에서 검색된 특허 문건의 특허 출원 번호 목록을 대상으로 하여 그 번호들을 제2국에서의 우선권 주장 번호에서 검색하고(S2280), 제2국에서의 검색되는 특허 문건을 관리 대상 문건 집합으로 하는 제 2국에서의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S2290), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 제2국에서 입수되는 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키고(S2291), 제 2국에서 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 제2국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2293).Then, the unit patent information service system generation engine of the patent application searches the list of patent application numbers of the patent documents retrieved from the first station from the priority claim number in the second station (S2280) (S2290). Then, the unit patent information service system generation engine of the inventor's name is obtained from the second country, which is obtained from the second country (S2291). In the case of a unit patent information service system in the name of a second-party standard inventor, which sets the patent document corresponding to each inventor extracted from the second country as a management target document set, extracts the inventor from the set of patent documents, (S2293).

도 73은 패밀리 정보를 활용하여 출원인 명의 단위 특허 정보 서비스 시스템 및 발명자 명의 단위 특허 정보 서비스 시스템을 국가 단위로 생성하는 과정의 일 실시예를 보여주고 있다.FIG. 73 shows an example of a process of generating a unit patent information service system of an applicant's name and a unit patent information service system of an inventor in a country unit by utilizing family information.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 정보를 입수하고(S2320), 제1국에서 입수된 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하며(S2330), 문건이 있는가를 판단하고(S2340), 있는 경우 제1국에서 검색된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고(S2350), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 제1국에서 검색된 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키고(S2360), 제1국에서 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2370),In step S2320, the unit patent information service system generation engine of the patent application obtains the registered patent information (S2320), and searches for a patent document registered or registered in the name of the applicant acquired in the first country (S2330) (S2340). If there is the unit patent information service system, the unit patent information service system of the first country reference applicant is created (S2350). The unit patent information service system generation engine of the inventor's name The inventor is extracted from the set of patent documents retrieved from the first country, the patent document is corresponded for each inventor (S2360), and the patent document corresponding to each inventor extracted from the first country is referred to as the first inventor The unit patent information service system is generated (S2370)

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 제 1국에서 검색된 특허 문건의 특허 출원 번호를 기준으로 하는 제2국에서의 패밀리 특허 번호를 입수하고(S2380), 제2국에서의 패밀리 특허 번호로 입수될 수 있는 특허 문건을 관리 대상 문건 집합으로 하는 제 2국에서의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S2390), 제2국에서 입수되는 특허 문건을 패밀리 특허 번호를 매개로 하여 제 1국에서 검색된 특허 출원 번호를 기준으로 하는 제1국 기준 발명자 명의 단위 특허 정보 서비스 시스템에 통합하며(S2391), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 통합된 제2국에서 입수되는 특허 문건을 관리 대상 문건 집합으로 하는 제2국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2393).The unit patent information service system generation engine of the patent application obtains the family patent number in the second country based on the patent application number of the patent document searched in the first country (S2380), and obtains the family patent number (S2390) a patent information service system in the second country in which the patent document that can be obtained is set as a management target document (S2390), and the patent document obtained from the second country is referred to as the first patent information service system (S2391), and then the unit patent information service system generation engine of the inventor's name is integrated with the patent document obtained from the integrated second country (S2393), which is a unit patent information service system in the name of the second inventor.

도 74는 패밀리 정보 및 우선권 주장 정보를 활용하여 출원인 명의 단위 특허 정보 서비스 시스템 및 발명자 명의 단위 특허 정보 서비스 시스템을 국가 단위로 생성하는 과정의 일 실시예를 보여주고 있다.74 shows an example of a process of generating a unit patent information service system of an applicant's name and a unit patent information service system of an inventor in a country unit by using family information and priority claim information.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 출원 명의인 정보를 입수하고(S2420), 제1국에서 입수된 출원 명의인의 명의로 출원되거나 등록된 특허 문건을 검색하며(S2403), 문건이 있는가를 판단한(S2440) 다음, 있는 경우 제1국에서 검색된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S2450), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 제1국에서 검색된 특허 문건 집합에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키고(S2460), 제1국에서 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 제1국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2470).The unit patent information service system generation engine obtains the information of the applicant's name (S2420), searches for a patent document registered or registered in the name of the applicant acquired in the first country (S2403), and determines whether there is a document Next, if there is a unit patent information service system, the unit patent information service system of the first country reference applicant is created (S2450), and the unit patent information service system generation engine of the inventor's name The inventor is extracted from the set of patent documents retrieved from the first country, the patent document is corresponded to each inventor (S2460), and the name of the first inventor of the first country, which makes the patent document corresponding to each inventor extracted from the first country, The unit patent information service system is created (S2470).

이어, 상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 제 1국 에서 검색된 특허 문건의 특허 출원 번호 목록을 대상으로 하여 그 번호들을 제2국에서의 우선권 주장 번호에서 검색하고(S2480), 제2국에서의 검색되는 특허 문건을 관리 대상 문건 집합으로 하는 제 2국에서의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하며(S2490), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 제2국에서 입수되는 특허 문건을 우선권 주장 번호를 매개로 하여 제 1국에서 검색된 특허 출원 번호를 기준으로 하는 제1국 기준 발명자 명의 단위 특허 정보 서비스 시스템에 통합하고(S2491), 통합된 제2국에서 입수되는 특허 문건을 관리 대상 문건 집합으로 하는 제2국 기준 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2493).Then, the unit patent information service system generation engine of the patent application searches the list of patent application numbers of patent documents searched in the first station from the priority claim number in the second station (S2480) The unit patent information service system generation engine of the inventor's name is acquired from the second country (step S2490) The patent document is integrated into the unit patent information service system of the first inventor name based on the patent application number retrieved from the first country via the priority claim number (S2491), the patent document obtained from the integrated second country (Step S2493), which is a unit patent information service system of the second inventor name of the second country, which is a management target document set.

도 75는 신규 문건이 추가 될 때 출원인 명의 단위 특허 정보 서비스 시스템 생성 엔진 및 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진이 어떠한 과정을 수행하는 가에 대한 일 실시예적 과정을 보여주고 있다.FIG. 75 shows an exemplary process of the applicant's unit patent information service system generation engine and inventor's unit patent information service system generation engine when a new document is added.

상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 신규 특허 문건을 입수하고(S2620), 신규 특허 문건에서 출원인 정보를 추출하며(S2630), 출원인 정보로 된 기존 출원인 명의 단위 특허 정보 서비스 시스템이 있는가를 판단한다(S2640).The unit patent information service system generation engine of the patent application obtains a new patent document (S2620), extracts the applicant information from the new patent document (S2630), and judges whether there is a unit patent information service system in the name of the existing applicant with the applicant information (S2640).

판단 결과 있는 경우에는 기존 출원인 명의 단위 특허 정보 서비스 시스템에 신규 특허 문건 정보를 추가하며(S2650), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 발명자 정보를 추출하고(S2660), 기존 출원인 명의 단위 특허 정보 서비스 시스템 하부에 발명자 정보로 된 기존 발명자 명의 단위 특허 정보 서비스 시스템이 있는가를 판단한(S2680) 다음, 있는 경우 기존 발명자 명의 단위 특허 정보 서비스 시스템에 신규 특허 문건 정보를 추가하며(S2690), 없는 경우 발명자 정보를 기준으로 발명자 명의 단위 특허 정보 서비스 시스템을 생성하고, 신규 특허 문건 정보를 관리 대상 문건 집합으로 하는 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2699).If the judgment result is YES, the new patent information information is added to the unit patent information service system of the existing applicant (S2650). Then, the unit patent information service system generation engine of the inventor's name extracts the inventor information (S2660) If it is determined that there is an existing inventor's unit patent information service system in the lower part of the patent information service system (S2680), new patent document information is added to the unit patent information service system of the existing inventor (S2690) The unit patent information service system of the inventor's name is created based on the inventor's information, and the unit patent information service system of the inventor's name is created (S2699) with the new patent document information as the management target document set.

한편, 출원인 정보로 된 기존 출원인 명의 특허 정보 서비스 시스템이 없는 경우에는 상기 출원 명의인 단위 특허 정보 서비스 시스템 생성 엔진은 신규 특허 문건을 관리 대상 문건 집합으로 하는 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고(S2693), 이어 상기 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진은 신규 특허 문건에서 발명자를 추출하고, 발명자별로 특허 문건을 대응시키며(S2695), 추출된 발명자별로 대응된 특허 문건을 관리 대상 문건 집합으로 하는 발명자 명의 단위 특허 정보 서비스 시스템을 생성한다(S2697).On the other hand, if there is no patent information service system of an existing applicant with applicant information, the unit patent information service system generation engine of the applicant's name generates a unit patent information service system of applicant's name with a new patent document as a management target document set (S2693 Then, the unit patent information service system generation engine of the inventor extracts the inventor from the new patent document and makes the patent document correspond to each inventor (S2695). Then, the inventor who sets the patent document corresponding to each extracted inventor as the management target document set Unit patent information service system is created (S2697).

출원인 명의 단위 또는 발명자 명의 단위 분석Unit analysis of applicant's name or inventor's name

본 발명의 분석 모듈에 세팅되는 적어도 하나 이상의 분석 알고리즘은 출원인 명의 단위 특허 정보 서비스 시스템 및/또는 발명자 명의 단위 특허 정보 서비스 시스템에 사전에 세팅이 되어 있는 것이 특징이다. 즉, 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명자 명의 단위 특허 정보 서비스 시스템을 생성할 때, 또는 생성 이후라도 사용자의 개입 없이, 상기 출원인 명의 단위에서 필요한 분석 알고리즘 또는 상기 발명자 명의 단위에서 필요한 분석 알고리즘이 자동으로 세팅되어 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명 자 명의 단위 특허 정보 서비스 시스템의 분석 서비스를 구성하게 된다. 상기 분석 알고리즘의 실행은 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명자 명의 단위 특허 정보 서비스 시스템을 생성할 때에 실행되어 그 분석 결과를 상기 출원인 명의 단위 특허 정보 서비스 시스템 또는 상기 발명자 명의 단위 특허 정보 서비스 시스템이 사전에 갖고 있는 것이 바람직하다. 하지만, 상기 분석 알고리즘의 실행 속도가 빠르거나, 실시간 또는 사용자에게 기다림을 주지 않을 정도로 빠른 경우, 상기 분석 알고리즘은 상기 분석 주제에 대하여 외부로부터의 접근(관리자 또는 사용자의 클릭 등)이 있을 때 실행될 수도 있을 것이다. 그러므로, 적어도 2 이상의 분석 알고리즘으로 분석 모듈이 구성될 때, 상기 분석 주제 및 그에 따르는 분석 알고리즘은 사전에 세팅되며 그 실행 시점은 다를 수 있을 것이다.The at least one analysis algorithm set in the analysis module of the present invention is pre-set in the unit patent information service system of the applicant and / or the unit patent information service system of the inventor. That is, when the unit patent information service system of the applicant's name or the unit patent information service system of the inventor is created or after the creation, the analysis algorithm required in units of the applicant's name or the analysis algorithm Is automatically set to constitute the analysis service of the unit patent information service system of the applicant's name or the unit patent information service system of the inventor's name. The execution of the analysis algorithm is executed when generating the unit patent information service system of the applicant's name or the unit patent information service system of the inventor's name and outputs the analysis result to the unit patent information service system of the applicant's name or the unit patent information service system It is preferable to have this in advance. However, if the execution speed of the analysis algorithm is fast, or is fast enough to not give real time or waiting for the user, the analysis algorithm may be executed when there is external access (such as clicking of an administrator or a user) There will be. Therefore, when the analysis module is configured with at least two analysis algorithms, the analysis subject and the analysis algorithm thereof are set in advance, and the execution time thereof may be different.

관리 모듈Management module

이하, 상기 관리 모듈에 포함된 다단 계층 관리 모듈에 대해서 설명한다. 먼저 다단 계층 관리의 문제가 발생하는 상황에 대해 설명한다. 하나의 출원인이 상당히 많은 수의 발명자가 소속되어 있을 경우, 상기 발명자가 적어도 하나 이상의 계층적 조직에 속할 수 있다. 예를 들어 설명하면, 한 사람의 발명자는 a1팀<a부서<A1계열<A회사<alpha그룹에 소속될 수 있을 것이다. 그러므로, 관리적인 측면에서는 a1팀은 상기 발명자 단위의 특허 정보 서비스 시스템 및 a1팀과 관련된 특허 문건을 그 관리 대상으로 하는 것이 당연할 것이며, a부서는 a1 내지 an까지의 각 부서에 관련된 발명자 명의 단위 특허 정보 서비스 시스템 및/또는 특허 문건을 관리의 대상으로 하는 것이, A1계열은 a부서부터 z부서에 관련된 발명자 명의 단위 특허 정보 서비스 시스템 및/또는 특허 문건을 까지를, A회사는 A1계열부터 AM계열까지의 관련된 발명자 명의 단위 특허 정보 서비스 시스템 및/또는 특허 문건을, alpha그룹은 A회사부터 Z회사까지의 출원인 명의 단위 특허 정보 서비스 시스템, 발명자 명의 단위 특허 정보 서비스 시스템 및/또는 특허 문건 등을 관리하여 할 것이다. 하지만, 공개되는 문건에서 입수할 수 있는 정보는 발명자 명의와 회사 명의뿐인 경우가 대부분이므로(즉, 특허 문건에는 출원인 정보 이외에 그 발명자가 소속된 집단에 관한 정보는 없는 것이 통상이다), 각 단계의 관리자 및/또는 최고 단계의 관리자가 특정 특허 문건이 어디에 소속되어야 되는지에 대한 정리를 해 줄 필요가 있게 된다. 상기 특허 정보 서비스 시스템은 관리될 특허 문건을 통해서 생성되기 때문에, 상기 특허 문건이 어느 발명자, 어느 팀, 어느 부서, 어느 계열, 어느 회사에 대응되는지에 대한 소속 정보가 있을 경우, 상기 소속 정보를 통해서 특허 정보 서비스 시스템을 생성할 수 있게 된다. 상기 소속 정보를 정하기 위해서 특허 문건과 발명자, 회사에 관한 정보는 특허 문건 자체에서 입수할 수 있다. 그러므로, 상기 발명자가 어느 팀, 어느 부서, 어느 계열, 어느 회사에 속하는지를 알거나, 상기 문건 자체가 어느 팀, 어느 부서, 어느 계열, 어느 회사에 속하는지를 아는 등, 상기 소속 정보를 대응시키는 임의의 수단에 의해, 상기 특허 문건이 소속되는 집단이 결정될 수 있다.Hereinafter, the multi-layer management module included in the management module will be described. First, a description will be given of a situation in which a problem of multi-layer management occurs. If an applicant has a considerably large number of inventors, the inventor may belong to at least one hierarchical organization. For example, one inventor may belong to the a1 team <a department <A1 series <A company <alpha group. Therefore, in the management aspect, it is natural that the a1 team manages the patent documents related to the patent information service system and the a1 team of the inventor unit, and the a department is a unit of the inventor's name related to each department from a1 to an The patent information service system and / or the patent document are subject to management, the A1 series covers the patent information service system and / or the patent document of the inventor's name related to the department a to the z department, The alpha group is a unit patent information service system in the name of the applicant from A company to Z company, a unit patent information service system in the name of the inventor, and / or a patent document, etc. Will manage it. However, since most of the information available in the published document is only the inventor's name and company name (that is, the patent document usually contains no information on the group to which the inventor belongs in addition to the applicant's information) The manager and / or the highest level manager need to be able to organize where the specific patent document should belong. Since the patent information service system is generated through a patent document to be managed, when there is information on belonging to which inventor, which team, which department, which series, which company the patent document corresponds to, The patent information service system can be created. In order to determine the affiliation information, information about the patent document, the inventor, and the company can be obtained from the patent document itself. Therefore, it is possible to know whether the inventor belongs to which team, which department, which series, which company, or which team, which department, which series, which company the document itself belongs to, , The group to which the patent document belongs can be determined.

개별 단위 특허 정보 시스템 관리 모듈Individual unit patent information system management module

2 이상의 출원인 명의 단위 특허 정보 시스템이 생성되는 경우, 상기 특허 정보 서비스 시스템들이 관리될 필요가 있으며, 상기 관리는 개단 특허 정보 서비 스 시스템 관리 모듈이 수행한다. 상기 개단 특허 정보 서비스 시스템 관리 모듈은 상기 출원인 명의 단위 특허 정보 서비스 시스템에 대한 인증, 권한 관리, 권한 이양, 삭제, 위치나 소속의 변경 등의 관리를 수행한다.When two or more applicants' unit patent information systems are created, the patent information service systems need to be managed, and the management is performed by the open patent information service system management module. The open patent information service system management module manages authentication, authority management, transfer of authority, deletion, change of location and affiliation of the unit patent information service system of the applicant.

상기와 같이 다수의 출원인 명의 단위 특허 정보 서비스 시스템들이 생성되었을 경우, 전체 시스템적 관점에서 통합하여 관리할 필요가 있으며, 상기 통합 관리를 수행하는 것은 본 발명의 통합 관리 모듈이 수행한다. 도 65는 통합 관리 모듈의 구성의 일 실시예를 보여 주고 있는데, 상기 통합 관리 모듈에는 개별 단위 특허 정보 시스템 다단 그룹화 모듈(6100)과 개별 단위 특허 정보 시스템 관리 모듈(6300)이 있어, 개별 단위 특허 정보 서비스 시스템을 다단 그룹화하며, 상기 다단 그룹화된 특허 정보 서비스 시스템을 관리할 수 있게 된다.When the unit patent information service systems of a plurality of applicants are created as described above, it is necessary to integrally manage the patent information service systems from the viewpoint of the entire system, and the integrated management module of the present invention performs the integrated management. FIG. 65 shows an embodiment of the configuration of the integrated management module. The integrated management module includes an individual unit patent information system multi-stage grouping module 6100 and an individual unit patent information system management module 6300, The information service system can be multi-tiered, and the multi-tier grouped patent information service system can be managed.

출원인 단위 특허 정보 서비스 시스템의 구현예Implementation of Applicant's Patent Information Service System

도 124은 나스닥 등록 기업 중의 하나인 3COM(6번)을 선택했을 때에 나타나는 3COM의 특허 정보 시스템의 내부 중 특허 포트폴리오 중 특허 리스트 중 국가 선택에서 미국 특허 탭을 선택했을 때 나오는 일실시예적 구현예이다. 상기 3COM에 대한 입력이 있을 경우, 본 발명의 특허 정보 서비스 시스템은 3COM에 대한 미국 문건을 검색하거나, 특허 문건 마스트 DB에 조회하여 3COM에 대한 문건 목록을 생성하여 사용자에게 제시할 수 있다.124 is an exemplary embodiment of the 3COM patent information system, which is selected when 3COM (No. 6) is selected as one of Nasdaq-registered companies, when the US patent tab is selected in the patent list of the patent portfolio among the patent portfolios . When there is an input to the 3COM, the patent information service system of the present invention can retrieve US documents for 3COM or query the patent document master DB to generate a document list for 3COM and present it to the user.

도 125는 도 124에서 발명자 리스트를 선택한 다음 국가 탭에서 미국을 선택했을 때 3COM의 발명자 목록이 나오는 화면이다. 본 발명의 특허 정보 서비스 시스템은 상기 특허 문건 마스트 DB에 조회하거나, 상기 발명자 분석용 다차원 분석용 연산 수행 결과 테이블에 질의하여 3COM의 모든 발명자 목록을 추출할 수 있을 것이다. 한편, 3COM의 발명자를 ABC 순으로 나열할 수도 있지만, 다출원/다등록 또는 다인용 등의 다른 순서로 나열하는 것도 당업자에게는 자명할 것이다.125 is a screen in which a list of inventors of 3COM is displayed when the inventor list is selected in FIG. 124 and the United States is selected in the countries tab. The patent information service system of the present invention can retrieve all the inventor's lists of 3COM by querying the patent document master DB or querying the operation result table of the inventor's analysis multi-dimensional analysis operation. On the other hand, the inventors of 3COM may be listed in ABC order, but it will be obvious to those skilled in the art to arrange them in different order such as multi-application / multi-registration or multi-user.

도 126는 도 125에서 발명자 리스트에서 Aldous Stepha..(9번)을 선택했을 때 이 발명자와 관계된 특허 문건 목록이 나오는 화면이다. 본 발명의 상기 특허 정보 서비스 시스템은 발명자에 대한 선택 정보를 입수받은 다음, 그 입수 받은 발명자 정보로 상기 특허 문건 마스트 DB에 조회함으로써, 그 발명자에 해당되는 문건 목록을 생성하여 사용자에게 제시할 수 있을 것이다. FIG. 126 is a screen listing a patent document related to the inventor when Aldous Stepha .. (No. 9) is selected from the inventor list in FIG. 125. FIG. The patent information service system of the present invention can obtain selection information for the inventor and then search the patent document master DB with the acquired inventor information to create a list of documents corresponding to the inventor and present it to the user will be.

도 127은 3COM에 대한 도 124에서 통계 분석탭을 누르고 기술영역 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 출원총량 기준 IPC 서브클래스 레벨에서의 각 IPC별 연도별 출원수에 대한 일실시예적 구현예이다. 이 화면에는 드릴 다운과 특정 셀값을 눌렀을 때 나오는 간이 분석 화면이 팝업으로 나와 있다.Figure 127 shows the number of applications for each IPC per year at the IPC subclass level based on the total amount of applications when the United States is clicked in the systematic analysis menu (SA), the United States by pressing the statistical analysis tab in Figure 124 for 3COM, Is an example embodiment. This screen pops up a simple analysis screen that appears when you drill down and press a specific cell value.

도 128은 3COM에 대한 도 124에서 통계 분석탭을 누르고 경쟁사 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 출원총량 기준 USPC 서브클래스 레벨에서의 각 USPC별 연도별 출원수에 기준한 경쟁 출원인에 대한 정보가 나오는 일실시예적 구현예이다. FIG. 128 shows the results of the competition based on the number of applications per USPC per year at the USPC subclass level based on the total amount of applications when the user clicks on the statistical analysis tab in FIG. 124 for 3COM and the US systematic analysis menu This is an exemplary embodiment in which information on the applicant comes out.

도 129는 3COM에 대한 도 124에서 통계 분석탭을 누르고 미국인용 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 3COM의 출원 문건 전체 집합을 인용 기준 문건 집합으로 했을 경우, 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 출원인에 대한 연도별 분석 결과에 대한 일실시예적 도면이다. 129 shows a case where the statistical analysis tab of FIG. 124 for 3COM is clicked and the entire set of application documents of the 3COM filed when the US is clicked on the systematic analysis menu (SA) and the country tab is set as a citation reference document set, (backward) citation document set as an analysis target document set.

도 130은 3COM에 대한 도 124에서 통계 분석탭을 누르고 발명자 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 출원총량 기준 IPC 서브클래스 레벨에서의 각 IPC별 연도별 다출원 발명자에 대한 일실시예적 구현예이다. 130 is a graphical representation of the results of the inventor's systematic analysis menu (SA) for the 3COM, the statistical analysis tab of FIG. 124, the number of applicants for each year by IPC at the IPC subclass level based on the total amount of applications, Is an example embodiment.

도 131은 하위 특허 분류에 관한 정보를 자동으로 포함시켜 특허 문건 집합에 대한 가공 특허 정보를 생성하는 일실시예적 방법에 관한 도면이다.FIG. 131 is a diagram of an exemplary method of automatically generating processed patent information for a set of patent documents by automatically including information on a sub-patent classification.

도 132는 특허 문건 집합이 선택되었을 때, 다차원 분석을 포함하는 가공 특허 정보를 생성하는 일실시예적 방법에 관한 도면이다.132 is a diagram of an exemplary method for generating processed patent information, including multidimensional analysis, when a set of patent documents is selected.

도 133은 특허 문건 집합이 선택되었을 때 융합 분석을 수행하는 일실시예적 방법에 관한 도면이다. 융합 분석 수행 결과의 일 예시는 도 142내지 143에 있다. 융합 분석의 수행 결과는 네트워크 다이어그램으로 표현될 수도 있다. 즉, 출원인, 발명자, 특허 분류 기호, 대표 어구나 기술 키워드, 인용 특허 문건 번호(하나의 특허 문건에 포함된 n개의 인용 특허(reference)들을 말함) 등이 네트워크 다이어그램 상에서 노드를 구성하고, 각 노드와 노드 사이에는 공통(co-occurrence)이 발생한 경우의 숫자값을 표시할 수도 있다. 노드의 크기는 해당 문건의 숫자 또는 기타 지표를 반영할 수 있을 것이다. 물론, 숫자에 클릭할 경우, 도 48에서와 같이 해당 문건 집합의 리스트를 보여 줄 수 있음은 물론이다 할 것이다. 네트워크 다이어그램을 생성하기 위해서는 특정되는 특허 문건 집합에서 추출한 노드 간에 association 결과 데이터가 생성되어 있어야 한다.133 is a diagram of an exemplary method for performing a fusion analysis when a set of patent documents is selected. One example of the result of performing the fusion analysis is shown in FIG. 142 to FIG. The result of performing the fusion analysis may be represented by a network diagram. That is, the applicant, the inventor, the patent classification code, the representative keyword, the technical keyword, the patent document number (referring to n patent references included in one patent document) constitute nodes on the network diagram, And a numerical value in the case where a co-occurrence occurs between the node and the node. The size of the node may reflect the number of the document or other indicator. Needless to say, if a user clicks on a number, the list of the document set can be displayed as shown in FIG. In order to generate the network diagram, the association result data must be generated between the nodes extracted from the specified patent document set.

도 134는 수치에 대한 선택이 있는 경우, 그 수치에 대응되는 특허 문건 정보를 제공해 주는 일실시예적 방법에 관한 도면이다. 이에 대한 구체적인 예시는 도 48에서 확인할 수 있다. 도 48에서는 숫자를 눌렀을 때 팝업 창이 열려 특허 리스트를 제공해 주는 것을 보여주고 있다.134 is a diagram of an exemplary method for providing patent document information corresponding to a numerical value, when there is a selection for the numerical value; A concrete example of this can be seen in FIG. In Figure 48, a pop-up window opens when a number is pressed, showing that the patent list is provided.

도 135는 출원인별, 특허 분류 기호별, 발명자별, 대리인별로 다차원 분석을 제공하는 일실시예적 방법에 관한 도면이다. 본 발명의 도 17 내지 도 50에는 각 분석 종류별로 다양한 메뉴가 존재함을 예시적으로 보여 주고 있다.135 is a diagram of an exemplary method for providing multi-dimensional analysis by applicant, patent classification, inventor, and agent. 17 to 50 of the present invention illustratively show that there are various menus for each analysis type.

한편, 특허 독립의 원칙 상 제1국에 출원되거나 등록된 특허라도 제2국에 출원되거나 등록되지 않은 경우, 제2국에서는 보호 받지 못한다. 그러므로, 제1국 특허 검색 결과에서 나온 특허 리스트를 구성하는 개별 특허 문건에 대하여 적어도 하나 이상의 제2국에서의 상기 개별 특허 문건에 관련된 패밀리 정보가 있는지를 조사할 수 있을 것이다. 조사 결과 없는 경우라면, 상기 개별 특허 문건이 국제 출원 중인지를 조사할 수 있을 것이며, 국제 출원 중인 경우 최선일 또는 제1국에서의 기준일에서 기설정된 기간(30개월 내지 31개월 등의 국내 단계 진입 기간)이 경과했는지와 제1국 출원일로부터 1년이 경과했는지 등을 조회한다. 조회 결과 경과된 경우 제2국에 진입할 수 없으므로, 상기 제1국 특허의 검색 결과의 기설정된 주변에 제2국에서의 활용성에 대한 정보를 포함시킬 수 있다. 물론, 제2국에 진입했더라도 현 재 상태 정보가 취하, 무효, 거절 확정, 소멸 등이 된 경우에도 활용성에 대한 정보를 제공해 줄 수 있을 것이다. 한편, 제 2국에 진입되었거나, PCT가 출원 중이며, 기설정된 기간을 도과하지 않은 경우나, 제1국 출원일로부터 1년이 경과하지 않은 경우에는 위험/경고/주의 표시를 활용성 정보로 표시해 줄 수 있을 것이다. 이는 특히, 상기 사용자의 신상 정보에서 제2국에 속한 사용자인 경우를 상기 시스템(10)이 알고 있는 경우에 더욱 유용할 것이며, 나아가 상기 사용자가 자신이 관심을 가지는 제2국들을 지정할 때 더욱 더 유용할 것이다. 제2국은 당연힝 1개 국가 이상일 수 있다. 도 136은 패밀리 정보를 이용하여 제1국에서 검색된 특허의 제2국에서의 활용 가능성을 판단하는 일실시예적 방법에 관한 도면이다. 도 137은 도136에서의 방법을 JP를 제1국으로 하고, #1번 특허 문건에 대하여 KR, US, EU를 제1국으로 하여 제2국에서의 활용 가능성을 판단하는 일실시예적 방법에 대한 흐름도이다. 도 138은 도136 내지 도 137에서 제시되는 방법을 실시했을 경우의 검색 결과 리스트 화면의 일실시예적 도면이다. 검색 결과 리스트의 각 항목에 각 국가별로의 국내 단계에서의 활용성 정보가 포함되어 있음을 알 수 있다. On the other hand, even if a patent is filed or registered in the first country under the principle of patent independence, it is not protected in the second country unless it is filed or registered in the second country. Therefore, it is possible to check whether there is family information related to the individual patent documents in at least one or more second countries with respect to the individual patent documents constituting the patent list from the first country patent search result. If there is no investigation result, it will be possible to check whether the individual patent document is internationally filed, and if it is an international application, And whether or not one year has elapsed since the filing of the first international application. When the inquiry result has elapsed, it is not possible to enter the second station, so that the information about the availability in the second station can be included in the predetermined vicinity of the search result of the first station patent. Of course, even if you enter a second country, information on availability can be provided even if the current status information is withdrawn, invalidated, rejected, or disappears. On the other hand, if the applicant enters the second country, the PCT is pending, does not exceed the predetermined period, or has not passed one year since the filing date of the first country, the risk / warning / It will be possible. This will be particularly useful when the system 10 knows that the user is a user belonging to the second station in the user's personal information and furthermore when the user designates the second stations he is interested in, It will be useful. The second country may be, of course, more than one country. 136 is a diagram of an exemplary method for determining the availability of patents retrieved from a first country in a second country using family information. FIG. 137 is a flowchart illustrating an example method for determining whether the method of FIG. 136 is JP as a first station and that the use of KR, US, and EU as a first station for a # 1 patent document in a second station FIG. 138 is an exemplary diagram of a search result list screen when the method shown in Figs. 136 to 137 is carried out. It can be seen that each item in the search result list includes information on the utilization level at the national level for each country.

감시 서비스 내지는 사용자 생성 디렉토리 서비스의 일 구현예로 도 139 내지 도 141의 서비스를 들 수 있다. 사용자는 자신이 관심있는 기술 체계/제품 체계/경쟁사 등을 트리 구조로 구성하고, 트리를 구성하는 각 노드에 검색식이나 기타 수신자 e-mail 등의 관련 정보를 수록해 놓을 수 있을 것이다. 이때 상기 시스템(10)은 기설정된 주기 또는 신규 특허 문건이 입수되었을 때, 상기 검색식을 조회하고, 특허 문건이 발견된 경우에는 상기 특허 문건을 e-mail 등으로 발송해 줄 수 있을 것 이다. 이때, 기계 번역 시스템을 활용하여 수신자가 원하는 언어로 발송할 수 있을 것이다. 한편, 상기 사용자 생성 디렉토리에 검색식이나 문건 집합이 할당된다면, 상기 디렉토리를 구성하는 각 노드에는 특허 문건 집합이 특정될 수 있으며, 문건 집합을 대상으로 하는 본 발명의 각종 분석도 상기 노드 단위로 수행될 수 있을 것이다. 물론, 2개 이상의 노드를 선정하고, 상기 노드 간의 집합 연산을 수행하여 생성되는 새로운 특허 문건 집합에 대하여 상기 각종 분석을 수행할 수 있을 것이다.One example of the monitoring service or the user-created directory service is the services shown in FIG. 139 to FIG. The user can construct a tree structure of the technical system / product structure / competitor that he / she is interested in, and can store related information such as a search expression or other recipient e-mail in each node constituting the tree. At this time, the system 10 may inquire the search formula when a predetermined period or a new patent document is obtained, and if the patent document is found, the system 10 may send the patent document by e-mail or the like. At this time, the recipient can send out the desired language using the machine translation system. Meanwhile, if a retrieval expression or a document set is allocated to the user-created directory, a set of patent documents can be specified for each node constituting the directory, and various analyzes of the present invention for a document set can also be performed for each node . Of course, two or more nodes may be selected, and various analyzes may be performed on a new set of patent documents generated by performing an aggregation operation between the nodes.

도 139는 사용자별로 다단계 계층 구조로 기술 트리를 구성하고, 기술 트리에 검색식을 포함한 각종 정보가 포함되어 있는 본 발명의 감시 서비스/디렉토리 서비스 방법의 일 실시예적 구현예이다.FIG. 139 is an exemplary embodiment of a surveillance service / directory service method of the present invention in which a description tree is configured in a multi-level hierarchical structure for each user and various information including a search formula is included in a description tree.

도 144 내지 도 145는 본 발명의 특허 정보 시스템을 구현하는데 활용되는 인터페이스의 일 예들이다. 상기 인터페이스에는 옵션의 선택과 관련된 인터페이스가 많다. 상기 인터페이스에는 1) 시간 종류 선택 인터페이스, 2) 시간 구간 선택 인터페이스, 3) 국가 선택 인터페이스 4) 특허 분류 기호 선택 인터페이스 5) 인용 특화 인터페이스, 6) 패밀리 정보 포함 선택 인터페이스, 7) 검색 후 선택 인터페이스 중 어느 하나 이상을 포함한다. 144 to 145 are examples of interfaces used to implement the patent information system of the present invention. The interface has many interfaces associated with the selection of options. The interface includes: 1) a time type selection interface, 2) a time interval selection interface, 3) a country selection interface, 4) a patent classification symbol selection interface, 5) a citation specialized interface, 6) And any one or more of them.

상기 시간 종류 인터페이스에는 출원일, 공개일 및 등록일 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 시간 구간 선택 인터페이스는 시작 구간과 종료 구간 각각에 대한 년/월/주/일 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 국가 선택 인터페이스는 적어도 2 이상의 국가 에 대한 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 특허 분류 기호 선택 인터페이스는 특허 분류 기호의 종류 선택 인터페이스, 특허 분류 기호 레벨 선택 인터페이스, 메인 특허 분류 기호 및 서브 특허 분류 기호 선택 인터페이스 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 인용 특화 인터페이스에는 전방 인용과 후방 인용 중 어느 하나 이상의 선택에 관한 인용 방향 인터페이스, 인용 깊이의 선택에 관한 인용 깊이 인터페이스, 직접 인용과 간접 인용 중 어느 하나 이상의 인용의 선택에 관한 인터페이스 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함하며, 상기 패밀리 정보 포함 선택 인터페이스에는 패밀리 정보 포함 여부 및 국가 범위의 선택에 관한 인터페이스 중 어느 하나 이상의 선택을 제공하는 인터페이스를 포함한다.Wherein the time type interface comprises an interface providing a selection of at least one of a filing date, a publication date, and a registration date, wherein the time period selection interface is configured to select one or more of year / month / week / Wherein the country selection interface includes an interface for providing at least one selection for at least two countries, wherein the patent classification symbol selection interface includes a classification selection type interface for a patent classification symbol, An interface providing a selection of one or more of a selection interface, a main patent classification symbol, and a subpublic classifier selection interface, wherein the citation specialized interface includes a citation direction interface for selection of at least one of a front citation and a rear citation, An interface for providing a selection of at least one of a citation depth interface for selection of depth, an interface for selection of one or more citations between direct citations and indirect citations, wherein the family information containing selection interface includes: And an interface for providing a selection of one or more of the interfaces relating to the selection of the range.

상기 검색 후 선택 인터페이스는 상기 처리 대상 문건 집합을 검색 대상으로 하여 기설정된 필드 검색을 제시하고, 상기 필드에 사용자 컴퓨터로부터 검색어를 입수받고, 상기 검색어에 대응되는 검색 결과를 제공하고, 상기 검색 결과 중에서 적어도 하나 이상의 선택 정보를 입수 받아, 상기 선택 정보와 관련되는 특허 문건 집합만을 처리 대상 문건 집합으로 하도록 처리하는 인터페이스를 포함한다. 예를 들면, 특정된 특허 문건 집합에서 특정한 출원인들만을 대상으로 할 경우라면, 출원인 검색 필드에 출원인을 입력하면, 상기 특정된 특허 문건 집합에 포함된 출원인 목록이 나오며, 사용자가 이들 중 어느 하나 이상을 선택했을 경우, 그 선택 정보를 입수 받아, 그 선택된 문건 집합만을 대상으로 하거나, 그 선택된 문건 집합을 제외하는 등과 같은 기능을 수행하여 특허 문건 집합의 재특정(respecification)이 완료되도록 한다.Wherein the post-search selection interface presents a pre-set field search with the target document set as a search target, receives a search word from the user's computer in the field, provides a search result corresponding to the search word, And an interface that receives at least one selection information and processes only a set of patent documents related to the selection information as a set of object documents to be processed. For example, if only a specific applicant is targeted in a specified set of patent documents, if the applicant is entered in the applicant search field, a list of applicants included in the specified set of patent documents comes out, , The selection information is received, and only the selected document set is selected, or the selected document set is excluded, so that the respecification of the set of patent documents is completed.

본 발명은 특허 정보 산업 전반에 활용할 수 있다. 구체적으로는 검색, 분석, 컨설팅, 감시 , 경쟁사에 대한 분석, 기술 융합 패턴 조사 등의 각종 특허 정보 유관 산업에 포괄적으로 활용될 수 있다.The present invention can be applied to the entire patent information industry. Specifically, it can be applied to various patent information related industries such as search, analysis, consulting, surveillance, analysis of competitors, and technology fusion pattern research.

도 1은 본 발명의 특허 정보 시스템 구성도에 관한 일실시예적 도면이다.BRIEF DESCRIPTION OF DRAWINGS FIG. 1 is a diagram showing an embodiment of a patent information system according to the present invention. FIG.

도 2는 본 발명의 DB 부에 관한 일실시예적 도면이다.2 is a diagram showing an embodiment of the DB unit of the present invention.

도 3은 본 발명의 특허 문건 마스트 DB에 관한 일실시예적 도면이다.Fig. 3 is an embodiment of the patent document mast DB of the present invention.

도 4는 본 발명의 특허 분류 기호 마스트 DB에 관한 일실시예적 도면이다.Fig. 4 is a diagram showing an embodiment of the patent classification code mast DB of the present invention.

도 5는 본 발명의 주체 마스트 DB에 관한 일실시예적 도면이다.5 is a diagram showing an embodiment of the subject mast DB of the present invention.

도 6은 본 발명의 다차원 분석용 연산 수행 결과 테이블 DB에 관한 일실시예적 도면이다.FIG. 6 is a diagram illustrating an operation result table DB for multidimensional analysis of the present invention.

도 7은 본 발명의 지원 DB부 및 2차 가공 DB부에 관한 일실시예적 도면이다.FIG. 7 is an exemplary diagram of a support DB part and a secondary processing DB part of the present invention. FIG.

도 8은 본 발명의 전처리 모듈부에 관한 일실시예적 도면이다.8 is a diagram illustrating an embodiment of the pretreatment module unit of the present invention.

도 9는 본 발명의 마스트 DB 생성 모듈에 관한 일실시예적 도면이다.9 is a diagram showing an embodiment of a mast DB generating module according to the present invention.

도 10은 본 발명의 인용 정보 전처리 모듈에 관한 일실시예적 도면이다.10 is a diagram illustrating an embodiment of a citation information preprocessing module of the present invention.

도 11은 본 발명의 특허 정보 처리 기본 모듈에 관한 일실시예적 도면이다.11 is a diagram showing an embodiment of a patent information processing basic module of the present invention.

도 12는 본 발명의 지원 모듈에 관한 일실시예적 도면이다.12 is an exemplary diagram of a support module of the present invention.

도 13은 본 발명의 특허 인텔리전스 모듈에 관한 일실시예적 도면이다.13 is an exemplary diagram of a patent intelligence module of the present invention.

도 14는 본 발명의 delphion는 본 발명의 하위분류포함검색는 본 발명의 분석이 되지 않는 근거에 관한 일실시예적 도면이다.Fig. 14 is an exemplary diagram of the delphion of the present invention relating to the reason why the sub-classification-containing search of the present invention is not analyzed according to the present invention.

도 15는 본 발명의 디렉토리 생성 모듈의 일 구현예에 관한 일실시예적 도면이다.15 is an exemplary diagram of an embodiment of the directory creation module of the present invention.

도 16은 본 발명의 특허 분류 기호 검색 모듈이 작동되는 일 구현예에 관한 일실시예적 도면이다.16 is an exemplary diagram of an embodiment in which the patent classification code search module of the present invention is operated.

도 17은 본 발명의 대한민국에 출원한 삼성전자주식회사의 IPC 서브클래스 레벨의 다출원 IPC별 연도별 출원수 데이터에 대한 일실시예적 분석 결과에 대한 도면이다.FIG. 17 is a diagram showing an embodiment analysis result of the number of applications per year for each multi-application IPC at the IPC subclass level of Samsung Electronics Co., Ltd. filed in the Republic of Korea of the present invention.

도 18은 H01L에 드릴다운했을 때 생성되는 분석 결과에 대한 일실시예적 도면이다.18 is an exemplary diagram of the analysis result generated when the user drills down to H01L.

도 19는 H01L 21/00 및 기타의 하위 특허 분류 기호에 드릴다운했을 때 생성되는 분석 결과에 대한 일실시예적 도면이다.19 is an exemplary diagram of the analysis results generated when drilling down to H01L 21/00 and other subpublic classifications.

도 20은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 IPC 메인그룹 레벨의 다출원 IPC별 연도별 출원수 데이터에 대한 일실시예적 분석 결과에 대한 도면이다.FIG. 20 is a diagram showing an exemplary analysis result of application number data per year of IPC main group level of Samsung Electronics Co., Ltd. among all applicants in the DB possessed by the patent information system of the present invention.

도 21은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 IPC 1 dot 서브 그룹 레벨의 다출원 IPC별 연도별 출원수 데이터에 대한 일실시예적 분석 결과에 대한 도면이다.FIG. 21 is a diagram showing an exemplary analysis result of the number of applications per year by IPC at the IPC 1-dot subgroup level of Samsung Electronics Co., Ltd. among all applicants in the DB possessed by the patent information system of the present invention.

도 22는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 미국에 출원한 삼성전자주식회사의 출원 문건 기준 IPC 서브클래스 레벨의 다출원 IPC별 연도별 출원수 데이터에 대한 일실시예적 분석 결과에 대한 도면이다.FIG. 22 is a graph showing the result of analysis of one embodiment of the number of applications per year by IPC subclass level based on application documents of Samsung Electronics Co., Ltd. filed in the United States among all applicants in the DB owned by the patent information system of the present invention FIG.

도 23은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 미국에 등록한 삼성전자주식회사의 등록 문건 기준 IPC 메인 그룹 레벨의 다출원 IPC별 연도별 출원수 데이터에 대한 일실시예적 분석 결과에 대한 도면이다.FIG. 23 is a diagram showing an embodiment analysis result of the number of applications per year for each multi-application IPC at the IPC main group level based on the registration document of Samsung Electronics Co., Ltd., registered in the US among all applicants in the DB owned by the patent information system of the present invention to be.

도 24는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 미국에 출원한 General Motors의 USPC no dot(sub class) 레벨의 다출원 USPC별 연도별 출원수 데이터에 대한 일실시예적 분석 결과에 대한 도면이다.FIG. 24 is a graph showing the result of analysis of one embodiment of the number of applications per USPC according to the USPC no dot (subclass) level of General Motors filed in the US among all the applicants in the DB owned by the patent information system of the present invention FIG.

도 25는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 미국에 출원한 General Motors의 USPC 1 dot 레벨의 다출원 IPC별 연도별 출원수 데이터에 대한 일실시예적 분석 결과에 대한 도면이다.FIG. 25 is a diagram showing results of analysis of one embodiment of the application number data per year of multi-application IPC at USPC 1 dot level of General Motors, which was filed in the United States among all applicants in the DB possessed by the patent information system of the present invention.

도 26은 본 발명의 대한민국 출원 문건 기준 IPC H04B에 대한 출원 총량 분석 및 드릴 다운의 예에 대한 일실시예적 도면이다.26 is an exemplary diagram of an example of a total amount analysis and drill down for IPC H04B based on Korean application document of the present invention.

도 27은 본 발명의 미국 출원 문건 기준 IPC H04B에 대한 출원 총량 분석 및 드릴 다운의 예에 대한 일실시예적 도면이다.Figure 27 is an exemplary illustration of an example of a total volume analysis and drill down for IPC H04B based on the US application of the present invention.

도 28은 본 발명의 대한민국 출원 문건 기준 IPC H04B에 대한 다출원 기업별 총량 분석에 관한 일실시예적 도면이다.FIG. 28 is a diagram showing an embodiment of the total amount analysis for each applicant company of IPC H04B based on the Korean application document of the present invention.

도 29는 본 발명의 대한민국 출원 문건 기준 IPC H04B에 대한 점유율 기준 다출원인에 대한 일실시예적 도면이다.FIG. 29 is an exemplary view of applicants based on the occupancy rate of IPC H04B based on the Korean application document of the present invention.

도 30은 본 발명의 대한민국 출원 문건 기준 IPC H04B에 대한 활동율 기준 다출원인에 대한 일실시예적 도면이다.Figure 30 is an embodiment of an applicant of the present invention based on the Korean application document based on IPC H04B.

도 31은 본 발명의 미국 출원 문건 기준 IPC H04B 및 이들의 하위 분류에 대한 드릴다운을 포함한 출원 총량 분석에 대한 일실시예적 도면이다.Figure 31 is an exemplary diagram of a total volume analysis including a drill down to IPC H04B and their subclasses based on the US application of the present invention.

도 32는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 대한민국 기준 삼성전자주식회사의 총량 기준으로 한 대표적 경쟁사 분석에 관한 일실 시예적 도면이다.FIG. 32 is a diagram illustrating a representative competitor analysis based on the total amount of the Korean standard Samsung Electronics Co., Ltd. among all the applicants in the DB possessed by the patent information system of the present invention.

도 33은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 대한민국 기준 삼성전자주식회사의 총량 기준 다출원 특허 기술 분류 기호별 경쟁 출원인에 관한 분석의 일실시예적 도면이다.FIG. 33 is an exemplary diagram of an analysis on a competitive applicant according to the patent application technology classification number based on the total amount of all the applicants in the database owned by the patent information system of the present invention, Samsung Electronics Co., Ltd., Korea.

도 34는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 미국 특허 기준 삼성전자주식회사의 IPC 메인 그룹 기준 출원 총량 기준 경쟁 출원인에 분석의 일실시예적 도면이다.FIG. 34 is an exemplary diagram of an analysis of all the applicants in the DB held by the patent information system of the present invention to the competition applicant based on the IPC main group standard application total amount of US Patent Standard Samsung Electronics Co., Ltd. FIG.

도 35는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 대한민국 특허 출원 총량 기준 삼성전자주식회사의 연도별 다출원 발명자별 분석 결과에 관한 일실시예적 도면이다.FIG. 35 is an exemplary diagram of an analysis result of multi-filing inventors of SAMSUNG ELECTRONICS CO., LTD. Based on the total amount of Korean patent applications among all the applicants in the DB held by the patent information system of the present invention.

도 36은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 대한민국 특허 출원 총량 기준 삼성전자주식회사의 다출원 다출원 발명자별 분석 결과에 관한 일실시예적 도면이다.36 is an example of an analysis result of multi-application multi-applicant inventors of Samsung Electronics Co., Ltd. based on the total amount of Korean patent applications among all applicants in the DB possessed by the patent information system of the present invention.

도 37은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원 총량을 기준으로 할 때 USPC 서브클래스(no dot, 클래스 직하위) 기준 다출원 경쟁 출원인에 대한 랭킹 분석 결과에 대한 일실시예적 도면이다.FIG. 37 is a USPC subclass (based on the total amount of all US patent applications of Samsung Electronics Co., Ltd.) among all applicants in the DB held by the patent information system of the present invention. Ranking analysis Fig.

도 38은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 인용 총량에 대한 연 도별 분석 결과에 대한 일실시예적 도면이다.FIG. 38 is a graph showing the relationship between the backward citation document set of the present invention and the document set of the analysis when all the US patent applications of Samsung Electronics Co., Ltd. among all the applicants in the DB held by the patent information system of the present invention are set as reference documents. An example of the result of analysis of the total amount of citations by year.

도 39는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 출원인에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 39 is a graph showing the relationship between the backward citation document set of the present invention and the document set for analysis when all the US patent applications of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system of the present invention are set as the reference document set. FIG. 4 is an exemplary diagram of an analysis result of an applicant of the present invention by year; FIG.

도 40은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 특허 분류 기호(IPC 메인 그룹 레벨)에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 40 is a graph showing the relationship between the backward quoted document set of the present invention and the document set for analysis when all the US patent applications of Samsung Electronics Co., Ltd., among all the applicants in the database held by the patent information system of the present invention, An IPC main group level patent classification code (IPC main group level).

도 41은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 특허 분류 기호(IPC 메인 그룹 레벨)에 드릴 다운했을 때를 반영한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 41 is a graph showing the relationship between the backward quoted document set of the present invention and the document set for analysis when all the US patent applications of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system of the present invention are set as the reference document set. And an analysis result for each year reflecting a case where a patent classification code (IPC main group level) is drilled down.

도 42는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 출원을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 발명자에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 42 is a graph showing the relationship between the backward quoted document set of the present invention and the document set of the analysis when all the US patent applications of Samsung Electronics Co., Ltd. among all the applicants in the DB held by the patent information system of the present invention are set as the reference document set. An embodiment of the result of yearly analysis of the inventor of the present invention.

도 43은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후 방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 최다 인용 출원인에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 43 is a diagram showing a set of document specifications for analysis of the present invention when all US patent registration of US patent corporation among all applicants in the DB owned by the patent information system of the present invention is set as a reference document set. , And an example of the yearly analysis result for the most cited applicant.

도 44는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 최다 인용 발명자에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 44 is a view showing a set of document specifications for analysis of the invention, which is a set of forward quotations of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. is set as a reference document set among all applicants in the DB possessed by the patent information system of the present invention, An embodiment of the result of yearly analysis of the most cited inventors.

도 45는 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, IPC 메인 그룹 별 최다 인용 기술에 대한 분석 중에서 IPC에 드릴 다운 했을 시의 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 45 is a diagram showing a set of document specifications for analysis of the invention, which is a set of forward quotations of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. is set as a reference document set among all applicants in the DB possessed by the patent information system of the present invention, It is an exemplary diagram of the yearly analysis result when drilling down to the IPC among the analysis of the most cited technologies per IPC main group.

도 46은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 후방(forward) 인용 문건 집합을 분석 대상 문건 집합으로 한, USPC 서브 클래스(no dot, 클래스 직하위) 최다 인용 기술에 대한 분석 중에서 드릴 다운 했을 시의 연도별 분석 결과에 대한 일실시예적 도면이다.FIG. 46 is a diagram showing a set of reference documents of the present invention when all the US patent registration of Samsung Electronics Co., Ltd. is set as a reference document set among all applicants in the DB held by the patent information system of the present invention, It is an exemplary diagram of the yearly analysis result when the USPC subclass (no dot, subclass subclass) most quoted technology analysis is drill down.

도 47은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 전체 후방 인용 관련 기준 문건 집합을 분석 대상 문건 집합으로 한, 총량 기준 분석 결과 및 이 분석 결과에 대한 본 발명의 리포팅 모듈 중 차트 생성 모듈이 생성한 차트의 일 실시예적 도면이다.47 is a diagram showing a set of reference documents related to the entire rear citation of the present invention when all US patent registration of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system of the present invention is set as a reference document set, And a chart generated by the chart generating module among the reporting modules of the present invention with respect to the analysis result.

도 48은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 특허 등록을 기준 문건 집합으로 할 때의 본 발명의 전체 후방 인용 관련 기준 문건 집합을 분석 대상 문건 집합으로 한, 인용을 많이 받은 발명자의 연도별 인용 총량에 관한 분석 결과 및, 분석 결과에 보이는 특정한 숫자를 클릭했을 때, 그 숫자에 해당하는 문건 집합에 대하여 간이한 분석 결과(문건 목록, 최다 출원인의 연도별 출원수/등록수, 최다 발명자의 연도별 출원수/등록수, 최대 기술 분야별(IPC, USPC, FT) 연도별 출원수/등록수(드릴다운 포함))를 제공해 주는 간이 분석 모듈이 생성한 문건 목록에 대한 일실시예적 도면이다.FIG. 48 is a diagram showing a set of reference documents related to the entire rear citation according to the present invention when all US patent registration of Samsung Electronics Co., Ltd. among all applicants in the DB held by the patent information system of the present invention is set as a reference document set, The results of the analysis of the cited total amount of inventors who received a lot of citations and the results of simple analysis (document list, application by year of the largest applicant, Number of applications / number of registrations, number of applications / number of registrations by year, maximum number of applications by IPC, USPC, FT by year (including drill-down) Fig.

도 49는 본 발명의 간이 분석 모듈에서도 최대 기술 분야별(IPC, USPC, FT)로 드릴 다운 기능을 제공함으로 보여주는 일실시예적 도면이다.FIG. 49 is an exemplary diagram showing a drill down function according to the maximum technology field (IPC, USPC, FT) in the simplified analysis module of the present invention.

도 50은 본 발명의 특허 정보 시스템이 보유한 DB 내의 모든 출원인 중 삼성전자주식회사의 모든 미국 출원 문건 중 다출원 IPC 서브클래스 단위의 문건을 기준 문건 집합으로 할 때의 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 인용 총량에 대한 연도별 분석 결과에 대한 일실시예적 도면이다.50 shows the backward citation document of the present invention when all of the US application documents of Samsung Electronics Co., Ltd. among all the applicants in the DB held by the patent information system of the present invention are documents of multi-application IPC subclass unit as a reference document set FIG. 5 is an exemplary diagram of the yearly analysis result on the total amount of citations, which is a set of documents to be analyzed.

도 51은 본 발명의 특허 정보 시스템의 일 서브 시스템인 계층적 특허 정보 서비스 시스템에 관한 일실시예적 도면이다.51 is a diagram showing an embodiment of a hierarchical patent information service system which is one sub-system of the patent information system of the present invention.

도 52는 본 발명의 개별 단위 특허 정보 시스템 생성 엔진에 관한 일실시예적 도면이다.52 is an exemplary diagram of an individual unit patent information system generation engine of the present invention.

도 53은 본 발명의 특허 정보 전처리 모듈의 내부 구성에 관한 일실시예적 도면이다.53 is a diagram showing an internal configuration of a patent information preprocessing module of the present invention.

도 54는 본 발명의 가중치 전처리 모듈에 관한 일실시예적 도면이다.54 is a diagram illustrating an embodiment of the weight pre-processing module of the present invention.

도 55는 본 발명의 인용 정보 전처리 모듈에 관한 일실시예적 도면이다.55 is a diagram showing an embodiment of the citation information preprocessing module of the present invention.

도 56은 본 발명의 특허 분류 기호 전처리 모듈(301-3-1 또는 3500)에 관한 일실시예적 도면이다.56 is an exemplary diagram of the patent classification symbol preprocessing module 301-3-1 or 3500 of the present invention.

도 57은 본 발명의 출원인 대표명화 전처리 모듈에 관한 일실시예적 도면이다.57 is a diagram showing an embodiment of a representative pre-processing module of the present invention.

도 58은 본 발명의 대표 어구 추출 전처리 모듈에 관한 일실시예적 도면이다.FIG. 58 is a diagram showing an embodiment of the representative word extraction pre-processing module of the present invention. FIG.

도 59는 본 발명의 패밀리 정보 전처리 모듈에 관한 일실시예적 도면이다.59 is a diagram showing an embodiment of the family information preprocessing module of the present invention.

도 60은 본 발명의 복수 특허 분류 기호 관계성 전처리 모듈에 관한 일실시예적 도면이다.FIG. 60 is a diagram illustrating an embodiment of a plurality of patent classification symbol related preprocessing modules of the present invention. FIG.

도 61은 본 발명의 특허 분류 기호별 통계 전처리 모듈에 관한 일실시예적 도면이다.61 is a diagram showing an embodiment of a statistical preprocessing module according to patent classification symbols of the present invention.

도 62는 본 발명의 특허 정보 인텔리전스 모듈에 관한 일실시예적 도면이다.62 is an exemplary diagram of a patent information intelligence module of the present invention.

도 63은 본 발명의 분석 모듈에 관한 일실시예적 도면이다.63 is a diagram showing an embodiment of the analysis module of the present invention.

도 64는 본 발명의 특허 정보 시스템 배치 생성 엔진에 관한 일실시예적 도면이다.FIG. 64 is an exemplary diagram of a patent information system batch generation engine of the present invention. FIG.

도 65는 본 발명의 통합 관리 모듈의 구성에 관한 일실시예적 도면이다.65 is a diagram showing an example of the configuration of the integrated management module of the present invention.

도 66은 전체 특허 정보 데이터베이스(2300)에서 1개의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.FIG. 66 is a diagram showing an example of a patent information service system according to the present invention. In FIG. 66, a unit patent information service system of a single applicant is created in the entire patent information database 2300, FIG. 4 shows one embodiment of the method for generating the data.

도 67은 출원인 명의 목록을 입수하고, 각 목록에 대하여 전체 특허 정보 데이터베이스(2300)에서 목록에 있는 출원인 개수만큼 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고, 상기 각 생성된 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.67 shows a list of the names of applicants, generates a unit patent information service system of the applicant by the number of applicants listed in the entire patent information database 2300 for each list, And shows an embodiment of a method of generating a unit patent information service system in the name of the inventor by the inventor included in the set of patent documents constituting the system.

도 68은 입수된 문건 집합에서 출원인 명의 목록을 입수하고, 각 목록에 대하여 전체 특허 정보 데이터베이스(2300)에서 목록에 있는 출원인 개수만큼 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고, 상기 각 생성된 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.68 shows a list of the applicants in the obtained document set, generates a unit patent information service system of the applicant's name as the number of applicants in the list in the entire patent information database 2300 for each list, An inventor's unit patent information service system included in a set of patent documents constituting a unit patent information service system is generated according to an embodiment of the present invention.

도 69은 출원인 명의 단위 특허 정보 서비스 시스템의 생성을 주문 받은 경우, 전체 특허 정보 데이터베이스(2300)에서 1개의 출원인 명의 단위 특허 정보 서비스 시스템을 생성하고 상기 출원인 명의 단위 특허 정보 서비스 시스템을 구성하는 특허 문건 집합에 포함된 발명자별 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 방법의 일 실시예를 보여 주고 있다.FIG. 69 shows an example of a patent information service system in which a unit patent information service system of a single applicant is created in the entire patent information database 2300 when the generation of the unit patent information service system by the applicant is ordered, And shows an embodiment of a method of generating a unit patent information service system in the name of the inventor by the inventor included in the set.

도 70은 국가 단위로 출원인 명의 단위 특허 정보 서비스 시스템을 생성하는 방법에 관한 것이다. 본 국가 단위로 출원인 명의 단위 특허 정보 서비스 시스템을 생성하는 것은 국가 단위의 특허 정보 데이터베이스(2300)를 대상으로 국가 단위의 출원인 명의 단위 특허 정보 서비스 시스템 및 발명자 명의 단위 특허 정보 서비스 시스템을 생성하는 것을 본질로 한다. 따라서, 상기 도 66 내지 도 69의 방법은 모두 적용될 수 있다.FIG. 70 relates to a method for generating a unit patent information service system in the name of applicant in a country unit. In order to create a unit patent information service system of the applicant in the unit of country, it is necessary to create a unit patent information service system of a unit name of a applicant and a unit patent information service system of a unit name of an inventor in a national patent information database 2300 . Therefore, the methods of FIGS. 66 to 69 can be applied to all of them.

도 75는 신규 문건이 추가 될 때 출원인 명의 단위 특허 정보 서비스 시스템 생성 엔진 및 발명자 명의 단위 특허 정보 서비스 시스템 생성 엔진이 어떠한 과정 을 수행하는 가에 대한 일 실시예적 과정을 보여주고 있다.FIG. 75 shows an exemplary process of the applicant's unit patent information service system generation engine and inventor's unit patent information service system generation engine when a new document is added.

도 76은 본 발명의 에 관한 일실시예적 도면이다.Fig. 76 is a diagram of an embodiment relating to the present invention. Fig.

도 78은 본 발명의 에 관한 일실시예적 도면이다.78 is a diagram of an embodiment relating to the present invention.

도 79는 본 발명의 비용 지출 관점 가중치 전처리 모듈(3311)이 가중치를 처리하는 방법에 관한 일실시예적 도면이다.79 is an exemplary diagram of a method in which the cost expenditure perspective weight pre-processing module 3311 of the present invention processes weights.

도 80은 본 발명의 인용 관점 가중치 전처리 모듈(3313)이 인용 관점에서 가중치를 전처리하는 방식에 관한 일실시예적 도면이다.80 is an exemplary diagram of a way in which the citation-based weight pre-processing module 3313 of the present invention prepares a weight in terms of a citation.

도 81은 본 발명의 분쟁 관점 가중치 전처리 모듈(3315)이 가중치를 처리하는 방식에 관한 일실시예적 도면이다.81 is an exemplary diagram of a manner in which the conflict-of-interest weight pre-processing module 3315 of the present invention processes weights.

도 82는 본 발명의 집중 관점 가중치 전처리 모듈(3317)이 가중치를 처리하는 방식에 관한 일실시예적 도면이다.82 is an exemplary diagram of a manner in which the concentrated-point weighting preprocessing module 3317 of the present invention processes weights.

도 83은 본 발명의 발명자 단위에서 가중치를 처리할 수도 있으며, 본 가중치를 처리하는 방식에 관한 일실시예적 도면이다.FIG. 83 is an exemplary diagram of a method for processing weights in the unit of an inventor of the present invention and processing the weights.

도 84는 본 발명의 출원인 단위 가중치 전처리 모듈(3331)이 가중치를 처리하는 방식에 관한 일실시예적 도면이다.FIG. 84 is an exemplary diagram of a manner in which the unit weight pre-processing module 3331, which is an applicant of the present invention, processes weights.

도 85는 본 발명의 발명자 단위 가중치 전처리 모듈(3333)이 가중치를 처리하는 방식에 관한 일실시예적 도면이다.85 is an exemplary diagram of a manner in which the inventor's unit weight pre-processing module 3333 of the present invention processes weights.

도 86은 본 발명의 대리인 단위 가중치 전처리 모듈(3335)이 가중치를 처리하는 방식에 관한 일실시예적 도면이다.86 is an exemplary diagram of a manner in which the proxy unit weight preprocessing module 3335 of the present invention processes weights.

도 87은 본 발명의 패밀리 정보 전처리 엔진(3810)이 패밀리 정보를 처리하는 방법에 관한 일실시예적 도면이다.87 is an exemplary diagram of a method by which the family information preprocessing engine 3810 of the present invention processes family information.

도 88은 본 발명의 인용 번호를 대표화하는 인용 정보 전처리 방법에 관한 일실시예적 도면이다.88 is an exemplary diagram of a citation information preprocessing method for representing the citation number of the present invention.

도 89는 본 발명의 특정 문건을 인용하고 있는 후출원 문건에 대한 정보인 후방 인용 문건 정보의 획득한 후 상기 후방 인용 문건 정보를 상기 특정 문건의 문건 정보에 포함하는 방법에 관한 일실시예적 도면이다.FIG. 89 is an exemplary diagram of a method for incorporating the backward quotation document information into the document information of the specific document after acquiring the backward quotation document information, which is information on the later-filed document citing the specific document of the present invention .

도 90은 본 발명의 인용 정보 전처리 방법에 관한 일실시예적 도면이다.90 is a diagram illustrating an embodiment of a citation information preprocessing method of the present invention.

도 91은 본 발명의 변형 특허 분류 기호를 처리하는 특허 분류 기호 전처리 엔진의 처리 방법에 관한 일실시예적 도면이다.91 is an exemplary diagram of a processing method of a patent classification symbol preprocessing engine for processing a modified patent classification code of the present invention.

도 92는 본 발명의 계층적 변형 특허 분류 기호 데이터베이스 생성 모듈이 상기 변형 특허 분류 기호 데이터베이스를 생성하는 방법에 관한 일실시예적 도면이다.FIG. 92 is an exemplary diagram of a hierarchical modified patent classification symbol database generation module of the present invention for generating the modified patent classification symbol database. FIG.

도 93은 USPC 특허 분류 기호로 트리 구조를 만드는 일 실시예적 방법에 관한 도면이다.93 is a diagram of an exemplary method of making a tree structure with a USPC patent classification symbol.

도 94는 도 93의 USPC 특허 분류 기호로 트리 구조에 대응되는 변형 특허 분류 기호를 할당하는 방법에 관한 일 실시예적 방법에 관한 도면이다.94 is a diagram of an exemplary method of assigning a modified patent classification symbol corresponding to a tree structure to the USPC patent classification symbol of FIG. 93; FIG.

도 95는 도 93의 USPC 특허 분류 기호로 만든 트리 구조와 동일한 구조를 가 지는 변형 특허 분류 기호로 구성되는 트리 구조에 관한 일 실시예적 도면이다.FIG. 95 is an exemplary diagram of a tree structure composed of modified patent classification symbols having the same structure as the tree structure formed by the USPC patent classification symbol of FIG. 93; FIG.

도 96은 Index to U.S. Patent Classification (a.k.a., Classification Index File) 파일)로 도 93와 같은 특허 분류 기호의 트리 구조를 만들 수 있음을 보여 주는 실시예적 도면이다.FIG. 96 is an exemplary diagram showing that a tree structure of a patent classification symbol as shown in FIG. 93 can be created with Index to US Patent Classification (aka, Classification Index File) file).

도 97은 본 발명의 특허 분류 기호 전처리 방법에 관한 일실시예적 도면이다.97 is a diagram showing an embodiment of a pretreatment method of a patent classification mark of the present invention.

도 98은 본 발명의 대표명화 전처리 모듈이 출원인 대표명화를 수행하는 방법에 관한 일실시예적 도면이다.FIG. 98 is an exemplary diagram of a method for performing the representative renormalization of the representative description pre-processing module of the present invention.

도 99는 본 발명의 우선권 주장 번호를 활용하는 대표명화 방법에 관한 일실시예적 도면이다.FIG. 99 is a diagram illustrating an exemplary embodiment of the present invention using a priority assertion number; FIG.

도 100은 본 발명의 특허 분류 기호별로 통계값을 전처리하는 방법에 관한 일실시예적 도면이다.100 is a diagram illustrating an embodiment of a method for preprocessing statistical values for each patent classification symbol of the present invention.

도 101은 본 발명의 주어진 특허 분류 기호에 대하여 하위 특허 분류 기호를 자동으로 포함시켜 통계값, 파라미터 또는 계산값을 생성하는 방법에 관한 일실시예적 도면이다.101 is an exemplary diagram of a method for automatically generating a statistical value, parameter or calculated value by automatically including a subpublic classifier for a given patent classifier of the present invention.

도 102는 본 발명의 특정한 문건 집합의 문건 부분 집합에 대하여 인용 또는 피인용 정보를 하위 특허 분류 기호를 포함하는 관점에서 특허 분류 기호별 통계값, 파라미터값, 또는 계산값을 생성하는 방법에 관한 일실시예적 도면이다.102 shows a method for generating statistical values, parameter values, or calculated values according to patent classification symbols from the viewpoint of including a sub-patent classification symbol for citation or reference information on a document subset of a specific document set of the present invention Fig.

도 103은 본 발명의 복수 특허 분류 기호 관계성 전처리 방법에 관한 일실시예적 도면이다.FIG. 103 is a diagram showing an embodiment of a method for preprocessing a plurality of patent classification symbols according to the present invention.

도 104는 본 발명의 비교 관점의 복수 특허 분류 기호 관계성 전처리 방법에 관한 일실시예적 도면이다.FIG. 104 is a diagram showing an embodiment of a plurality of patent classification symbol relationship preprocessing methods in a comparative view of the present invention. FIG.

도 105는 본 발명의 대표 어구 정보 전처리 방법 에 관한 일실시예적 도면이다.105 is an exemplary diagram of a representative word information preprocessing method of the present invention.

도 106은 본 발명의 대표 어구 정보 전처리 방법 에 관한 일실시예적 도면이다.106 is an exemplary diagram of a representative word information preprocessing method of the present invention.

도 107은 본 발명의 대표 어구 정보 전처리 방법 에 관한 일실시예적 도면이다.107 is an exemplary diagram of a representative word information preprocessing method of the present invention.

도 108은 본 발명의 분석 지표 계산 방법에 관한 일실시예적 도면이다.FIG. 108 is a diagram illustrating an embodiment of an analysis index calculation method according to the present invention. FIG.

도 109는 본 발명의 분석 대상 특허 문건 집합 입수 방법에 관한 일실시예적 도면이다.FIG. 109 is an exemplary diagram of a method for obtaining a set of patent documents analyzed according to the present invention. FIG.

도 110은 본 발명의 동향 분석 방법에 관한 일실시예적 도면이다.110 is a diagram illustrating an embodiment of a trend analysis method of the present invention.

도 111은 본 발명의 간이 분석 방법에 관한 일실시예적 도면이다.FIG. 111 is a diagram illustrating an embodiment of the simplified analysis method of the present invention. FIG.

도 112는 본 발명의 인용 분석 방법에 관한 일실시예적 도면이다.112 is an exemplary diagram of a citation analysis method of the present invention.

도 113은 본 발명의 피인용 분석 방법에 관한 일실시예적 도면이다.113 is a diagram showing an embodiment of the analysis method for cited article of the present invention.

도 114는 본 발명의 심사 인용 분석 방법에 관한 일실시예적 도면이다.FIG. 114 is a diagram showing an embodiment of the method for analyzing the citation of the present invention. FIG.

도 115는 본 발명의 복수 특허 분류 기호 분석 방법 에 관한 일실시예적 도면이다.115 is a diagram showing an embodiment of a method for analyzing a plurality of patent classification symbols of the present invention.

도 116은 본 발명의 복수 특허 분류 기호 분석 방법 에 관한 일실시예적 도면이다.116 is an exemplary diagram of a method for analyzing a plurality of patent classification symbols of the present invention.

도 117은 본 발명의 복수 특허 분류 기호 분석 방법 에 관한 일실시예적 도면이다.117 is a diagram illustrating an embodiment of a method for analyzing a plurality of patent classification symbols of the present invention.

도 118은 본 발명의 개별 단위 특허 정보 시스템 다단 그룹화 모듈 작동 방법에 관한 일실시예적 도면이다.118 is an exemplary diagram of a method of operating the individual unit patent information system multi-stage grouping module of the present invention.

도 119는 본 발명의 특허 정보 시스템 배치 생성 방법에 관한 일실시예적 도면이다.119 is an exemplary diagram of a method for generating a patent information system batch of the present invention.

도 120은 본 발명의 출원인 단위 특허 정보 시스템이 구현된 화면의 일 실시예로 대한민국의 최다 출원인 Top 500에 대한 출원인 목록 화면이 나와 있는 일구현예적 실시예이다. 국가 탭에서는 대한민국이 선택되고, 그 하부 탭의 Top 500, 거래소 등록 기업, 코스닥 등록기업, 다출원 기업, 전체 기업 등에서 Top 500탭이 선택된 화면이다.FIG. 120 is an exemplary embodiment of an applicant list screen for the top 500 applicant of the Republic of Korea as an embodiment of a screen on which the unit patent information system of the present invention is implemented. In the country tab, Korea is selected and the Top 500 tab is selected on the top 500 tab of the subtab, the registered company of the exchange, the registered company of the KOSDAQ, all of the applicants, and the whole company.

도 121은 본 발명의 출원인 단위 특허 정보 시스템이 구현된 화면의 일 실시예로 미국의 나스닥 등록 기업에 대한 출원인 목록 화면이 나와 있는 일구현예적 실시예이다. 국가 탭에서는 미국이 선택되고, 그 하부 탭의 Top 500, NYSE(뉴욕 증권 거래소), NASDAQ, AMEX, 전체 기업 등에서 NASDAQ 선택된 화면이다. NASDAQ 등록 기업들은 다수 있으므로, ABC 순의 탭과 전체 NASDAQ기업 탭이 있다.FIG. 121 is an exemplary embodiment of an applicant list screen for an Nasdaq-registered company in the United States, which is an embodiment of a screen on which the unit patent information system of the present invention is implemented. In the Countries tab, the United States is selected, and NASDAQ is selected in the Top 500, NYSE (New York Stock Exchange), NASDAQ, AMEX, There are a number of NASDAQ registered companies, so there are tabs in ABC order and the entire NASDAQ company tab.

도 122은 본 발명의 출원인 단위 특허 정보 시스템이 구현된 화면의 일 실시예로 일본의 JASDAQ 등록 기업에 대한 출원인 목록 화면이 나와 있는 일구현예적 실시예이다. 국가 탭에서는 일본이 선택되고, 그 하부 탭의 Top 500, JP1(동경증권거래소 1부 등록 기업), JP2(동경증권거래소 2부 등록 기업), JPM(MOTHERS 등록기업), JASDAQ 등록기업, 전체 기업 등에서 JASDAQ 선택된 화면이다.FIG. 122 is an exemplary embodiment of an applicant list screen for a JASDAQ registered company in Japan as an embodiment of a screen on which the unit patent information system of the present invention is implemented. In the country tab, Japan is selected, and the top 500, JP1 (1 registered company of Tokyo Stock Exchange), JP2 (Tokyo Stock Exchange 2 registered company), JPM (MOTHERS registered company), JASDAQ registered company, JASDAQ in the screen is selected.

도 123은 본 발명의 출원인 단위 특허 정보 시스템이 구현된 화면의 일 실시예로 유럽의 런던증권거래소 등록 기업에 대한 출원인 목록 화면이 나와 있는 일구현예적 실시예이다. 국가 탭에서는 유럽이 선택되고, 그 하부 탭의 Top 500, UK1(런던증시), AIM(런던증시), OVERSEAS LISTED(런던증시 상장 외국 기업), EURONEXT, 전체 기업 등에서 UK1이 선택된 화면이다. 프랑크푸르트 상장 기업 리스트는 미도시되어 있다.FIG. 123 is an exemplary embodiment of a list screen of applicants to a London Stock Exchange registered company in Europe as an embodiment of a screen on which the unit patent information system of the present invention is implemented. Europe is selected in the countries tab and UK1 is selected on the subtabs Top 500, UK1 (London stock exchange), AIM (London stock exchange), OVERSEAS LISTED (London foreign exchange listed companies), EURONEXT, The list of listed companies in Frankfurt is not shown.

도 124은 도 121에서 나스닥 등록 기업 중의 하나인 3COM(6번)을 선택했을 때에 나타나는 3COM의 특허 정보 시스템의 내부 중 특허 포트폴리오 중 특허 리스트 중 국가 선택에서 미국 특허 탭을 선택했을 때 나오는 일실시예적 구현예이다.FIG. 124 shows an example of an embodiment when a US patent tab is selected from the list of patents among the patent portfolios among 3COM's patent information systems that appear when 3COM (No. 6), which is one of NASDAQ registered companies in FIG. 121, This is an implementation example.

도 125는 도 124에서 발명자 리스트를 선택한 다음 국가 탭에서 미국을 선택했을 때 3COM의 발명자 목록이 나오는 화면이다.125 is a screen in which a list of inventors of 3COM is displayed when the inventor list is selected in FIG. 124 and the United States is selected in the countries tab.

도 126는 도 125에서 발명자 리스트에서 Aldous Stepha..(9번)을 선택했을 때 이 발명자와 관계된 특허 문건 목록이 나오는 화면이다.FIG. 126 is a screen listing a patent document related to the inventor when Aldous Stepha .. (No. 9) is selected from the inventor list in FIG. 125. FIG.

도 127은 3COM에 대한 도 124에서 통계 분석탭을 누르고 기술영역 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 출원총량 기준 IPC 서브클래스 레벨에서의 각 IPC별 연도별 출원수에 대한 일실시예적 구현예이다. 이 화면에는 드릴 다운과 특정 셀값을 눌렀을 때 나오는 간이 분석 화면이 팝업으로 나와 있다. 이는 삼성전자주식회사에 대한 구현예가 있는 도 17과 구조적으로 동등하다.Figure 127 shows the number of applications for each IPC per year at the IPC subclass level based on the total amount of applications when the United States is clicked in the systematic analysis menu (SA), the United States by pressing the statistical analysis tab in Figure 124 for 3COM, Is an example embodiment. This screen pops up a simple analysis screen that appears when you drill down and press a specific cell value. This is structurally equivalent to Figure 17, which is an implementation example for Samsung Electronics Co.,

도 128은 3COM에 대한 도 124에서 통계 분석탭을 누르고 경쟁사 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 출원총량 기준 USPC 서브클래스 레벨에서의 각 USPC별 연도별 출원수에 기준한 경쟁 출원인에 대한 정보가 나오는 일실시예적 구현예이다. 이 화면은 삼성전자주식회사에 대한 구현예가 있는 도 38과 구조적으로 동등하다.FIG. 128 shows the results of the competition based on the number of applications per USPC per year at the USPC subclass level based on the total amount of applications when the user clicks on the statistical analysis tab in FIG. 124 for 3COM and the US systematic analysis menu This is an exemplary embodiment in which information on the applicant comes out. This screen is structurally equivalent to Fig. 38 with an implementation example for Samsung Electronics Co., Ltd. Fig.

도 129는 3COM에 대한 도 124에서 통계 분석탭을 누르고 미국인용 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 3COM의 출원 문건 전체 집합을 인용 기준 문건 집합으로 했을 경우, 본 발명의 전방(backward) 인용 문건 집합을 분석 대상 문건 집합으로 한, 다인용 출원인에 대한 연도별 분석 결과에 대한 일실시예적 도면이다. 이 화면은 삼성전자주식회사에 구현예가 있는 도 39와 구조적으로 동등하다.129 shows a case where the statistical analysis tab of FIG. 124 for 3COM is clicked and the entire set of application documents of the 3COM filed when the US is clicked on the systematic analysis menu (SA) and the country tab is set as a citation reference document set, (backward) citation document set as an analysis target document set. This screen is structurally equivalent to Fig. 39, which is an implementation example of Samsung Electronics Co., Ltd. Fig.

도 130은 3COM에 대한 도 124에서 통계 분석탭을 누르고 발명자 SA(systematic analysis) 메뉴, 국가 탭에서 미국을 눌렀을 때 나오는 출원총량 기준 IPC 서브클래스 레벨에서의 각 IPC별 연도별 다출원 발명자에 대한 일실시예적 구현예이다. 이는 삼성전자주식회사에 대한 구현예가 있는 도35와 구조적으로 동등하다.130 is a graphical representation of the results of the inventor's systematic analysis menu (SA) for the 3COM, the statistical analysis tab of FIG. 124, the number of applicants for each year by IPC at the IPC subclass level based on the total amount of applications, Is an example embodiment. This is structurally equivalent to Fig. 35, which is an implementation example for Samsung Electronics Co., Ltd. Fig.

도 133은 특허 문건 집합이 선택되었을 때 융합 분석을 수행하는 일실시예적 방법에 관한 도면이다.133 is a diagram of an exemplary method for performing a fusion analysis when a set of patent documents is selected.

도 134는 수치에 대한 선택이 있는 경우, 그 수치에 대응되는 특허 문건 정보를 제공해 주는 일실시예적 방법에 관한 도면이다.134 is a diagram of an exemplary method for providing patent document information corresponding to a numerical value, when there is a selection for the numerical value;

도 135는 출원인별, 특허 분류 기호별, 발명자별, 대리인별로 다차원 분석을 제공하는 일실시예적 방법에 관한 도면이다.135 is a diagram of an exemplary method for providing multi-dimensional analysis by applicant, patent classification, inventor, and agent.

도 136은 패밀리 정보를 이용하여 제1국에서 검색된 특허의 제2국에서의 활용 가능성을 판단하는 일실시예적 방법에 관한 도면이다.136 is a diagram of an exemplary method for determining the availability of patents retrieved from a first country in a second country using family information.

도 137은 도136에서의 방법을 JP를 제1국으로 하고, #1번 특허 문건에 대하여 KR, US, EU를 제1국으로 하여 제2국에서의 활용 가능성을 판단하는 일실시예적 방법에 대한 흐름도이다.FIG. 137 is a flowchart illustrating an example method for determining whether the method of FIG. 136 is JP as a first station and that the use of KR, US, and EU as a first station for a # 1 patent document in a second station FIG.

도 138은 도136 내지 도 137에서 제시되는 방법을 실시했을 경우의 검색 결과 리스트 화면의 일실시예적 도면이다. 검색 결과 리스트의 각 항목에 각 국가별로의 국내 단계에서의 활용성 정보가 포함되어 있음을 알 수 있다.138 is an exemplary diagram of a search result list screen when the method shown in Figs. 136 to 137 is carried out. It can be seen that each item in the search result list includes information on the utilization level at the national level for each country.

도 140은 도 139의 방법이 제품에 적용된 일 실시예적 구현예이다.140 is an exemplary embodiment in which the method of FIG. 139 is applied to a product.

도 141은 도 139의 방법이 경쟁사 등의 회사 체계에 적용된 일 실시예적 구현예이다.141 is an exemplary embodiment in which the method of FIG. 139 is applied to a company system of a competitor or the like.

도 142는 발명자에 대하여 융합 분석을 수행하고, 이를 네트워크 다이어그램으로 구현한 일 실시예적 구현예이다.142 is an exemplary embodiment in which a fusion analysis is performed on the inventor and the network diagram is implemented.

도 143은 기술 키워드 및 대표 어구에 대한 융합 분석을 수행하고 이를 네트워크 다이어그램으로 구현한 일 실시예적 구현예이다.FIG. 143 is an exemplary embodiment in which a fusion analysis is performed on a technical keyword and a representative word, and is implemented as a network diagram.

도 144 내지 도 145는 본 발명의 특허 정보 시스템을 구현하는데 활용되는 인터페이스의 일 예들이다.144 to 145 are examples of interfaces used to implement the patent information system of the present invention.

Claims

A system for providing patent convergence analysis information to a user's computer,

Obtaining selection information on a set of patent documents;

Specifying a set of patent documents to be processed based on the selection information;

And generating processed patent information by performing at least one predetermined process on the specified set of patent documents to be processed,

The method for receiving the selection information includes: 1) a method of receiving input from a user computer through a search formula; and 2) information constituting a bibliography included in the predetermined patent document presented on the user's computer from the user's computer A method for receiving selection information on a specific set of patent documents selected from among a set of at least one patent document previously generated by the user and 4) a method for receiving selection information for at least one A method of receiving a condition is used, and more than one method is used,

Wherein the processed patent information is at least one or more multidimensional analysis information generated through at least one predetermined analysis process on the specified set of patent documents to be processed,

The individual dimensions constituting the multidimensional are selected from among patent documents contained in 1) time, 2) patent classification symbol, 3) applicant, 4) inventor, 5) agent, 6) country, 7) Any one or more,

The predetermined analysis process is an analysis process for a fusion pattern,

Wherein the multidimensional analysis information includes at least one predetermined counting value for the at least one individual dimension, a calculation value by at least one predetermined calculation formula, a query by at least one quality consciousness, A result value, and a processing result value for at least one or more analysis processing algorithms,

Wherein the system for providing patent information visualizes the result of the fusion analysis processing into a network diagram including at least two or more nodes,

Wherein the visualization process further includes at least one of information about the node and information about a line connecting the node and the node.

The method according to claim 1,

The analysis process is a fusion analysis process between patent classification symbols,

The convergence analysis process between the patent classification symbols may be performed by analyzing patent documents including two or more patent classification symbols on the same patent classification system among the patent documents included in the set of patent documents to be processed, Performing association analysis, frequent pattern analysis, correlation analysis, and clustering between two or more patent categories,

In the fusion analysis processing between the patent classification symbols,

When two or more patent classification symbols are included in one patent document in the same patent classification system, if the two or more patent classification symbols are not the highest patent classification symbol in the patent classification system, , The fusion analysis processing is performed for each of the two or more patent classification symbols including the upper patent classification symbol up to a predetermined level of the two or more patent classification symbols Information fusion method.

The method according to claim 1,

The convergence analysis processing may be performed for individual patent documents constituting the set of patent documents to be processed by 1) fusion analysis processing between at least two technical keywords generated based on text information extracted from at least one field of one patent document , 2) the analysis of the convergence between the co-inventors on the patent document with the co-inventor, 3) the analysis of the convergence between the co-applicants on the patent document with the co-applicant, and 4) the citation included in the citation information of one patent document And a fusion analysis process between patent documents,

The fusion analysis processing may include at least one of association analysis, frequent pattern analysis, correlation analysis, and clustering between the two or more technical keywords, the co-inventor, the co-applicant, and the joint patent documents included in the patent documents The patent information fusion analysis method comprising the steps of:

delete

The method according to claim 1,

The information about the line connecting the node and the node further includes numerical information,

Receiving the numerical value or selection information for the line from the user computer;

Extracting, from the patent information database including the set of patent documents to be processed, the found patent document information corresponding to the numerical information; And

And transmitting the extracted patent document information to the user computer.

The system

Document mast DB;

Patent classification code mast DB;

A patent analysis intelligence module that performs at least one patent analysis process on the specified set of analyzed patent documents;

A sub-patent classification code preprocessing module unit,

The sub-patent classification code preprocessing module may, when the first patent classification code is given, if the first patent classification code corresponds to a predetermined level on the patent classification code system to which the first patent classification code belongs, The information is processed so as to include the patent information corresponding to the sub-patent classification code of the classification symbol,

And a patent analysis intelligence module for performing at least one patent analysis process on the specified set of analyzed patent documents,

When the patent information analysis intelligence module generates the analysis information, the analysis information includes (1) a method of processing all the patent classification including patent classification symbols included in the processing target document set, and (2) main patent classification). In case of processing including all of the above patent classification, a) a method of treating all patent classes with equal weight, and b) a method of processing main patent classification (main patent classification) patent classification and sub-patent classification are applied with different weights to generate analytical information,

A method of receiving the set of patent documents to be analyzed is selected from among 1) a method of receiving input from a user computer as a search formula, 2) information constituting a bibliography included in the predetermined patent document presented from the user computer to the user computer A method for receiving selection information on at least one set of patent documents selected from at least one set of patent documents previously generated by the user and 4) At least one set of at least one set of conditions is provided,

The patent information generated by the patent analysis intelligence module is at least one or more multidimensional analysis information generated through at least one predetermined analysis process on the specified set of patent documents to be processed,

The predetermined analysis process is a fusion pattern analysis process,

The patent analysis intelligence module includes a function of visualizing the fusion analysis processing result as a network diagram including at least two nodes,

The method according to claim 6,

In the fusion analysis processing between the patent classification symbols,

When two or more patent classification symbols are included in one patent document in the same patent classification system, if the two or more patent classification symbols are not the highest patent classification symbol in the patent classification system, , The fusion analysis processing is performed for each of the two or more patent classification symbols including the upper patent classification symbol up to a predetermined level of the two or more patent classification symbols. Fusion analysis delivery system.

The method according to claim 6,

The fusion analysis processing may include at least one of association analysis, frequent pattern analysis, correlation analysis, and clustering between the two or more technical keywords, the co-inventor, the co-applicant, and the joint patent documents included in the patent documents The patent information fusion analysis providing system.