KR20110104816A - Method on producing information on fusion information of technology using patent data - Google Patents

Method on producing information on fusion information of technology using patent data Download PDF

Info

Publication number
KR20110104816A
KR20110104816A KR1020100023932A KR20100023932A KR20110104816A KR 20110104816 A KR20110104816 A KR 20110104816A KR 1020100023932 A KR1020100023932 A KR 1020100023932A KR 20100023932 A KR20100023932 A KR 20100023932A KR 20110104816 A KR20110104816 A KR 20110104816A
Authority
KR
South Korea
Prior art keywords
classification
fusion
combination
unit
information
Prior art date
Application number
KR1020100023932A
Other languages
Korean (ko)
Inventor
강민수
Original Assignee
(주)광개토연구소
강민수
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by (주)광개토연구소, 강민수 filed Critical (주)광개토연구소
Priority to KR1020100023932A priority Critical patent/KR20110104816A/en
Publication of KR20110104816A publication Critical patent/KR20110104816A/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • G06Q50/184Intellectual property management

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Technology Law (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention relates to a method and a system for generating technology fusion pattern information using cited patent information, and more particularly, to a method and system for generating fusion information using citation related patent information.
Utilizing the present invention, it is possible to effectively discover and generate fusion information between patent classifications, fusion information between technology keywords, and fusion information between technology / part / element technology / product.

Description

Method on Producing Information on Fusion Information of Technology Using Patent Data}

The present invention relates to a method for generating technology fusion pattern information using citation patent information, and more particularly, to a method for generating fusion information using citation related patent information.

With shorter technology replacement cycles and greater market volatility, the return on investment (ROI) of R & D continues to deteriorate. In response to this situation, voices are increasingly stressing the importance of open innovation and convergence technologies.

Existing convergences mainly consisted of 1) declarative convergence such as IT-BT convergence, 2) exemplary convergence at higher levels such as five sense experience display devices and services, and 3) individual convergence such as biochips. Despite the nominal emphasis on the importance of convergence under these circumstances, no specific guidance or direction is available.

Patent information is a collection of technical collective intelligence, a valuable human asset that has the characteristics that most of the world's technical knowledge is first disclosed as a patent. By exploring this patent information effectively, you will find patterns in technology convergence.

In 2007 and 2009, Kwanggato Research Institute, the applicant of this patent, disclosed a patent information fusion analysis technology using patent classification information and keywords. Prior patents include Korean Patent Application Nos. 10-2007-0061273, 10-2007-0002555, 10-2007-0129437, 10-2007-0129270, 10-2007-0129387, 10-2007-00129367, 10-2008-0126562, 10-2009-0082997 and the like. This prior patent describes in detail a method of finding a fusion pattern through association analysis of patent classification information and keywords. These patents focus on exploring fusion in terms of the existence of associations.

However, seeing fusion as "something new in the area of interest" requires a different approach to discovering fusion. In conventional association analysis, there is a problem in that it is difficult to distinguish the difference between frequently related and recently related. Accordingly, there has been a demand for technology development that can reflect new perspectives of convergence while applying correlation analysis.

In particular, there is an urgent need for the development of a method and system for finding a fusion pattern utilizing citation-related patent information that is important in patent information.

The problem to be solved by the present invention is to disclose a method for generating fusion information by using the patent information related to the citation.

Another problem to be solved by the present invention is to disclose a system for generating fusion information by utilizing the patent information related to the citation.

In order to achieve the technical problem to be achieved by the present invention, in the patent information system, (A) the first divided patent by dividing (A) at least one or more patent sets generated or obtained by applying at least one division processing criteria Generating a set and at least one second divided patent set; (B) a first citation splitting comprising one or more cited patent documents of patent documents included in the first split patent set and the second split patent set for each of the first split patent set and the second split patent set; Generating a patent set and at least one second cited split patent set; (C) In the patent documents included in the first cited divided patent set and the second cited divided patent set, a predetermined unit fusion element corresponding to the patent document is set to the first cited divided patent set and the second cited divided patent. Generating for each set; And (D) selecting a unit fusion element that meets a predetermined fusion processing criterion among the unit fusion elements generated for each of the first and second cited divided patent sets. The element is generated by including any one or more of a patent classification, a keyword extracted from the patent document, a keyword processed the extracted keyword, step (C) is the first cited divided patent set and the first A patent document including two or more patent classifications based on at least one patent classification among patent documents included in the two-quotation split patent set, or the first and second citation split patent sets and the second citation. The patent document includes two or more keywords extracted or processed from patent documents included in the divided patent set. It proposes a method for generating fusion information, characterized in that.

The unit fusion element is any one or more selected from a patent classification combination consisting of two or more patent classifications, a key keyword combination consisting of two or more keywords, and an index combination corresponding to a patent classification, and the patent classification combination and the index. Preferably, the combination is generated based on two or more patent classifications included in the patent document.

The fusion processing criterion is that the unit fusion element is present only in one predetermined patent set, or 2) the frequency of the unit fusion element for each of the divided patent sets is equal to or greater than a predetermined difference, or 3) the divided patent. The frequency of the unit fusion element for each set is equal to or greater than a preset increase rate, or 4) the frequency of the unit fusion element for each of the divided patent sets is greater than or equal to a preset increase acceleration, or 5) the unit fusion element for each of the divided patent sets It is preferable that the frequency of is to establish a predetermined function relationship or a predetermined condition relationship.

When the unit fusion element is an index combination, the method for generating the index combination may include: 1) an index using an index corresponding to the original patent classification only by the original patent classification explicitly indicated in the patent document included in each partition set. And a method of generating a combination, and 2) referring to the patent classification system to which the original patent classification belongs, with respect to the explicitly listed original patent classification, and including an index corresponding to a preset upper patent classification of the original patent classification. It is preferable that any one method selected from among the methods for generating the index combination is used.

The divisional processing criteria are generated by any one or any combination of two or more of an issuer country, a time range, an applicant or an owner or assignee, an inventor, a patent classification, a patent subject classification, and a predetermined classification attribute included in the patent document. The time that is the basis of the time range is any one of a priority date, an application date, a publication date, and a registration date of the patent document, or any combination of two or more thereof, and the patent classification is any one of IPC, USPC, FT, FI, and ECLA. One or any combination of two or more, wherein the patent subject classification is any one or more of the subject classifications generated using any one or more of the index of the IPC, the index of the USPC, or the IPC, USPC, FT, FI, and ECLA. The predetermined classification attribute constitutes a classification attribute for the applicant, the owner, or the assignee, and the patent classification. That the not less than one of a classified-patent classification properties, classification attribute for the inventors, or the subjects classified-patent classification properties are preferred.

When the unit fusion element is a patent classification combination, the method of generating the patent classification combination may include 1) generating the patent classification combination only by the original patent classification of the patent documents included in each of the divided sets; and 2) the original patent classification. Regarding the patent classification system to which the original patent classification belongs, with respect to any one of the methods selected from the method of generating the patent classification combination by including a predetermined upper patent classification of the original patent classification is used. desirable.

A first method for generating nCr (r is less than or equal to n and greater than 1) patent classification combinations when the original patent classification is n (n is an integer), the original patent classification is n (n is an integer) M, m is an integer, m is a predetermined upper patent classification for each of the original patent classifications, with reference to the patent classification system information to which the n original patent classifications belong to each of the n original patent classifications. generates a top patent classification greater than or equal to n, and wherein (n + m) Cr (r is less than or equal to (n + m) and greater than 1 with the n original patent classifications and the m top patent classifications ) A second method of generating patent classification combinations, and a third method of generating the patent classification combinations by applying a weight of a predetermined criterion to the patent classification combinations generated in the first method or the second method. To practice one or more inventions It is good.

And displaying the selected unit fusion element in a preset format. The preset format may be any one of a table format, a chart format, a graph, and a network diagram format.

When the unit fusion element is displayed, at least one of a patent classification code, title information corresponding to the patent classification code, and an index corresponding to the patent classification code is displayed, or a keyword is displayed.

The at least one patent set generated or obtained is a cited patent set consisting of a first-generated patent set, at least one cited patent document of a patent document included in the first-generated patent set, and a patent document included in the first-generated patent set. A patent made up of a cited patent set consisting of one or more of the cited patent documents of the patent document included in any one or more patent sets of the family patent sets in at least one country of the patent documents included in the first-generated patent set. It is preferable that it is a collection.

In order to achieve the technical problem to be achieved by the present invention, in the patent information system, at least one or more patent sets generated or received by dividing by applying at least one or more division processing criteria, the first divided patent set and at least one or more second divisions An aggregation dividing unit for dividing into patent sets; A unit fusion element generation unit configured to generate a predetermined unit fusion element for each of at least one divided patent set divided by the set divider and at least one divided patent set selected from at least one cited divided patent set generated for each of the divided patent sets ; And a fusion pattern processor configured to select a unit fusion element that meets a predetermined fusion processing criterion among the unit fusion elements, wherein the cited split patent set is a citation of at least one step of a patent document included in the first split patent set. At least one of the first cited split patent set composed of patent documents and the second cited split patent set composed of one or more cited patent documents of a patent document included in the second divided patent set, wherein the unit fusion element is And a patent classification combination, a keyword combination extracted from the patent document, and an index combination. The unit fusion element may be generated by a first divided patent set, a second divided patent set, and the first combination. At least one kind of patent documents included in the single-cited split patent set and the second-cited split patent set A patent document including two or more patent classifications based on a patent classification, or the first divided patent set, the second divided patent set, the first cited split patent set, and the second cited split. Is applied to a patent document including two or more keywords extracted or processed from a patent document included in the patent set, the first divided patent set, the second divided patent set, the first cited divided patent set and Includes two or more patent classifications based on at least one or more types of patent classification among the patent documents included in the second cited divided patent set, wherein the two or more patent classifications are applied to patent documents corresponding to different indices The present invention provides a patent information system for generating fusion information.

The aggregation dividing unit may further include a dividing process reference acquiring unit which receives a dividing process criterion from a user computer using the patent information system, wherein the dividing process reference acquiring unit inputs an issuer input interface and a time range included in the patent document. And one or more of an interface, an applicant, an owner or assignee input interface, an inventor input interface, a patent classification input interface, a patent subject classification input interface, and a predetermined classification attribute input interface. It is preferable to divide the generated or received patent set by applying the division processing criteria obtained through the standard acquisition unit.

And a patent classification fusion preprocessor for generating the patent classification combination based on two or more patent classifications included in the patent document, wherein the method for generating the patent classification combination by the patent classification fusion preprocessor comprises: Generates nCr (r is less than or equal to n and greater than 1) patent classification combinations when n (n is an integer) number of patent classifications included in the patent document, depending on the type of at least one patent classification included in In the first method, when there are n (n is an integer) patent classifications included in the patent document, the patent classification system information of the n patent classifications is referred to for the n patent classifications. M (m is an integer, m is greater than or equal to n) patent classifications up to a preset upper patent classification, and the n patent classifications and the m patent classifications are (n + m) Cr (r (n + m) beam Generating the patent classification combination by applying a weight of a predetermined criterion to the patent classification combination generated in the first method or the second method Is performed by any one of the third methods of carrying out the invention of any one or more of the methods, and when the first method or the second method is implemented, the generated patent classification combination is a patent classification system to which the patent classification belongs. It is preferable to use any one or more of a method of extracting a predetermined upper patent classification of the patent classification with reference to the information, and generating based on the extracted upper patent classification.

It is preferable to further include a cited patent set acquisition unit for obtaining the forward-cited patent documents cited by the patent included in the divided patent set for each of the divided patent set.

And a fusion processing reference policy DB including information on the fusion processing criteria. The fusion processing criteria further includes a frequency of the patent classification combination for each patent classification combination or a frequency of the core keyword combination for each key keyword combination. Is greater than or equal to a predetermined difference for each of the divided patent sets, is greater than or equal to a preset increase speed, is greater than or equal to a predetermined increase acceleration, or establishes a predetermined function relationship or a predetermined condition relationship. Preferably, at least one of the fusion processing reference values for each of the fusion processing criteria is provided by the system or obtained from a user computer using the system.

And a fusion pattern visualization unit configured to display the selected unit fusion elements in a preset format, wherein the preset format is any one of a tabular format, a chart format, a graph, or a network diagram format. The unit supports a function of displaying one or more of a patent classification code, title information corresponding to the patent classification code, and an index corresponding to the patent classification code or displaying the keyword when the unit fusion element is displayed. It is preferable to.

Utilizing the present invention, it is possible to effectively generate fusion information between patent classifications, fusion information between technology keywords, and fusion information between technology / parts / element technology / products.

According to the present invention, a fusion pattern can be easily discovered through various information processing such as a set operation using unit fusion elements for a divided patent set that is divided by various division criteria for a patent set that wants to know fusion information. Can be.

According to the present invention, information processing such as a set operation using a unit fusion element is performed on a divided patent set divided by various division criteria for a patent set that wants to know the fusion information and a divided cited patent set by each divided set. The fusion pattern can be easily found.

By utilizing the present invention, fusion information can be generated in advance for various objects such as each patent classification, each applicant, and each inventor, thereby providing users with fusion information quickly and conveniently.

1 is a diagram illustrating an exemplary network configuration of a patent information system of the present invention.
2 is a diagram illustrating an exemplary embodiment of a patent information system of the present invention.
3 is an exemplary diagram of a data processing unit of the present invention.
4 is an exemplary diagram of an analysis information generation unit of the present invention.
5 is an exemplary diagram of an evaluation information generating unit of the present invention.
6 is an exemplary diagram of a feature analysis information generation unit of the present invention.
7 is an exemplary diagram of a specialized object discovery unit of the present invention.
8 is an exemplary view of a similar patent set generating unit of the present invention.
9 is an exemplary diagram of an initial signal detector of the present invention.
FIG. 10 is an exemplary diagram of a niche description region discovery unit of the present invention. FIG.
11 is an exemplary diagram of an open innovation target discovery unit according to the present invention.
12 is an exemplary view of a future technology discovery unit of the present invention.
FIG. 13 is a view illustrating an additional service providing unit of the present invention. FIG.
14 is an exemplary diagram of a fusion information processing unit of the present invention.
15 is an exemplary diagram for a simple two-division convergence information model of the present invention.
16 is a diagram for one embodiment of a composite partition fusion information model of the present invention.
17 is an exemplary diagram of a simple two-division forward citation fusion information model of the present invention.
18 is a diagram for one embodiment of a composite split forward citation fusion information model of the present invention.
19 is a diagram illustrating an exemplary configuration of a key keyword DB of the present invention.
20 is a diagram for one embodiment of a method of generating fusion information in a simple two-division fusion information model of the present invention.
21 is a diagram for one embodiment of a method of generating fusion information in a simple two-division forward citation fusion information model of the present invention.
FIG. 22 is an exemplary diagram for another method of generating fusion information in the simple two-division forward citation fusion information model of the present invention. FIG.
FIG. 23 is an exemplary diagram of another method for generating fusion information in the simple two-division forward citation fusion information model of the present invention. FIG.
24 is an exemplary capture screen of information provided by WIPO for IPC and catchword.
25 is an example capture screen for an index for a USPC provided by the USPTO.

Hereinafter, it demonstrates in detail, referring drawings.

14 is a diagram illustrating an exemplary configuration of the fusion information processing unit 900 of the present invention. The fusion information processing unit 900 may include at least a predetermined fusion processing population obtained by the fusion processing population acquisition unit 910 for obtaining a patent set for fusion information processing and the fusion processing population collection acquisition unit 910. A fusion processing target set generation unit 920 for generating a fusion processing target set by dividing into two or more patent sets, a unit fusion element generation unit 970 for generating a unit fusion element from patent information for fusion processing, and the A fusion process reference information unit 930 which provides a reference, a fusion pattern processor 940 for processing a fusion pattern according to a fusion process criterion defined or specified by the fusion process, and a fusion pattern analyzer for analyzing a fusion pattern ( 950), any one or more of the fusion pattern information batch generation unit 960 for continuously generating fusion pattern information two or more times according to a predetermined fusion processing criterion. There may be and.

In the present invention, any one or more of a keyword, a patent classification, and an index may be used as the unit fusion element generated by the unit fusion element generator 970. The keyword may be a keyword series that meets a predetermined criterion such as a technology keyword, a product keyword, and the like. In the present invention, the keyword may include one word element, one phrase element, and one clause element. It is used to include a keyword pair including a keyword pair including at least two keyword pairs selected from a narrow meaning keyword or the narrow meaning keyword. (In the present invention, unless otherwise stated, the term keyword is used. The keyword pair includes a co-occurrence keyword pair (or simply referred to as an air pair). Patent classifications include IPC, USPC, FT, FI, ECLA. An index, also called a catchword, refers to a system in which at least one patent classification is associated with a word, phrase, or clause. Among the indexes, there is a catchword that processes the IPC, and there is an index to USPC issued by the US Patent and Trademark Office. The index also has a hierarchical structure like a patent classification. In the index, keywords corresponding to product names / part names / element descriptions are often embedded. The index makes the classification of patents easy to find and is illustrated in FIGS. 24 and 25. FIG. 24 is a catchword for IPCs provided by WIPO, and information about them is ipcr _ catchwordindex _20100101 at http://www.wipo.int/ipc/itos4ipc/ITSupport_and_download_area/20100101/MasterFiles/ as of March 2010 . Available as a zip file. In FIG. 24, it can be seen that G06C 1/00 corresponds to ABACUSES. In this case, ABACUSES is called an index corresponding to G06C 1/00. Accordingly, G06C 1/00 can be reverse mapped to Abacuses. Meanwhile, as can be seen from the index ABARADING in FIG. 24, it can be seen that the catchword system has at least one hierarchical structure. ABARADING shows that it is a two-tier hierarchy.

FIG. 25 is a USPTO index for USPC, and information about them can be found at http://www.uspto.gov/web/patents/classification/uspcindex/indextouspc.htm as of March 2010. It can be seen that the index abrading is classified into three levels of hierarchy in the contents of FIG. 25. It can be seen that the index Abrading corresponds to USPC 451/38, and there is a patent classification of a class other than Class 451 in the lower index of Abrading.

The keyword may be generated by processing the text included in the patent specification, which is in charge of the core keyword generator 210 of the data processor 200 of the present invention. The key keyword generator 210 extracts a keyword from a phrase or sentence corresponding to each field of the patent specification. The extraction of the air pair is extracted through the combination between the terms in the proximity distance (the distance satisfying the distance standard between the registered terms in one sentence). The field may be any one or more of various fields constituting the patent specification, such as the name of the invention, the claims, the summary, the detailed description of the invention, industrial applicability, effects, and the prior art (background art). The key keyword generation unit 210 generates a key keyword set for the n keywords extracted from the field. When generating the key keyword, the key keyword generator 210 performs synonym processing, thesaurus processing, etc. to select the key keywords by grouping the terms having substantially the same, equivalent, or equivalent meaning as a representative term. It is preferable. Meanwhile, when processing with the representative term, it may be desirable to perform synonyms and thesaurus processing for two or more languages in one patent document by using a dictionary or a machine translator. On the other hand, it is more preferable that the representative term or the extracted key keywords are translated into at least one language through the dictionary or machine translator. Extracting a keyword (commonly referred to in this technical field as a term) or an air pair through natural language processing belongs to techniques known in the field of natural language processing technology. For the extracted n keywords (including air pairs, it is natural), a core keyword set (a core air pair set is selectively included to represent the patent specification by applying a predetermined key keyword selection algorithm) Will be selected). The most frequently used algorithms use term frequency (TF) and inverse document frequency (IDF). In the field of natural language processing technology, various functional expressions using TF and IDF as variables are disclosed, and it is natural that other complex equations such as weights for each field can be applied in a policy manner. In this case, the core keyword set composed of only the narrow meaning keywords and the core keyword set composed only of the air pairs are generated separately or the keywords having the narrow meaning and the air pair are equally processed by the algorithm. Core Keyword and Core Air Pair The core keyword selection algorithm may generate a set of key keyword pairs for n keyword pairs when the key keyword selection algorithm processes the air pairs as keywords. On the other hand, when the core keyword generator 210 extracts a new term consisting of two or more words / vocabulary / words, it is often difficult to determine whether the new term is a term having technical meaning. have. In this case, there may be a method of determining whether a new term has a technical meaning by using an external search engine such as google.com. The core keyword generation unit 210 performs at least one or more predetermined processing such as quote processing (queries processing method of google.com processing an exact match) and the like, and then extracts the new term. When the search result is transmitted to an external search service system, the search result is received from the external search service system, and the search result is satisfied, a predetermined criterion is satisfied, and the extracted new term is processed as a normal term. The analysis of the search results measures the number of search results (the number of hits, which teaches how many search results match the query), and examples of the preset criteria are 1,000 or more in English and other languages. It could be more than 100 and so on. For example, as of March 2010, querying google.com for "patent informatics" and "patent informatics services", respectively, yields 67,300 and 279 results, in which case "patent informatics" is treated as a term. patent informatics services "may not be treated as a new term. On the other hand, you can query a system that provides a description of a term, such as wikipedia.org, rather than a search engine such as google.com, and treat the term as a new term if it exists.

Through the above-described key keyword selection algorithm, it will be obvious that one or more key keyword sets corresponding to one patent document may be selectively included. That is, the key keyword set KS (Pi) = {K1 (Pi), K2 (Pi), ..., Ki (Pi), Kj (Pi) ,,,, Kn (Pi) corresponding to the i th patent document Pi } You get In the above description, i, j, n are integers, and Kn (Pi) refers to the nth key keyword selected from the i th patent document Pi. A set of key keywords can be mapped to a single patent document, because: 1) specific fields (for example, claims or summaries), 2) different weights for each field, 3) selection of two or more key keywords There may be various reasons, such as the use of an algorithm, 4) reference range of IDF calculation, 5) term extraction method, etc.) Core keyword set KS (Pi) = {K1 (Pi), K2 corresponding to the i th patent document Pi. (Pi), ..., Ki (Pi), Kj (Pi) ,,,, Kn (Pi)} may be stored in the core keyword DB 130 based on a Pi or a key value corresponding to Pi. Can be. In the present invention, the Pi is described as a key value specifying the patent document for convenience of explanation. For example, the Pi may be an application number, publication number, registration number or any patent document identification number to which a country or patent kind or other element is combined. It will be appreciated that the Pi may be associated with various surges or processed surges or other processed information corresponding to the patent document. Typically, relational DBs correlate various information with the key value.

In this case, the unit fusion element generator 970 performs a combination process according to a preset combination process criterion for the core keyword set. Combination processing generates nCr (n is an integer, r is an integer, r is 2 or more and less than or equal to n) key keyword combinations (Keyword Combination, KC, (Ki, Kj)) for n key keywords. Say that. For example, if there are 10 key keywords and r is 2, a set of frequency maintaining key keyword combinations consisting of 10 C2 = 45 two key keyword combinations is generated. 2-4 are preferable, and, as for said r, 2-3 are more preferable. In the fusion process of the present invention, the key keyword combination is processed as a processing unit. Maintaining frequency consisting of nCr key keyword combinations corresponding to i th patent document PiKCS (Pi) = {KC1 (Pi), KC2 (Pi), ..., KCi (Pi), KCj (Pi ) ,,,, KCn (Pi), ..., KCnCr (Pi)}. Naturally, the constituent elements and the number vary according to the keyword maintaining key keyword combination set (KCS) r. Of course, a different frequency keeping key keyword combination set is generated according to r.

Meanwhile, the unit fusion element generator 970 may generate a patent classification combination set (CCS) such as the frequency maintenance key keyword combination set for patent classification. The patent classification combination set may not be generated for all patent documents, and may be generated only when two or more patent classifications correspond to at least one or more types of patent classifications in one patent document. For example, if there is more than two IPCs in one patent document or two or more USPCs even if only one IPC is used, two or more IPCs and two or more USPCs, respectively, may be classified into a classification classification combination (CC) (Ci , Cj)) may be generated. Included in the patent document, the original patent classification is called the original patent classification.

When there are n original patent classifications for each type of patent classification in the i-th patent document Pi, the original classification set corresponding to the Pi is OCS (Pi) = {C1 (Pi), C2 (Pi) , ..., Ci (Pi), Cj (Pi) ,,,, Cn (Pi)}. I, j, n are integers, and Cn (Pi) refers to the nth patent classification selected from the i th patent document Pi. Similar to the generation of the frequency maintaining key keyword combination set, the original patent classification combination set OCCS (Pi) = {OCC1 (Pi), OCC2 (Pi), ..., OCCi (Pi), OCCj (Pi), OCCn (Pi), OCCnCr (Pi)}. Of course, the original patent classification combination set may be generated for each type of patent classification. In this case, Japanese patent documents may be generated for each IPC, FT, and FI. In this way, the generation of the patent classification combination set for the original patent classification can be handled equivalently to the generation of the frequency maintenance key keyword combination set. The original patent classification combination set becomes a negotiated patent classification combination set.

On the other hand, since the patent classification has a hierarchical structure, in addition to the combination of the revealed patent classification, an implicit patent classification combination set can be generated. For a method of generating a combination set of the above-mentioned nested patent classifications, the applicants / applicants' patent applications 10-2007-0061273, 10-2007-0002555, 10-2007-0129437, 10-2007-0129270, 10-2007-0129387 , 10-2007-00129367, 10-2008-0126562, 10-2009-0082997 and the like. The following are the contents of the published patent. The nested patent classification combination set is called an ICCS (Implicit Classification Combination Set).

Korean Patent Application No. 10-2005-0111868 is issued January 04, 2006 H04B 7/26 and H04B 7/15. This will be described by way of example. The parents of H04B 7/26 become H04B 7/24, H04B 7/00, H04B, H04, H in turn. The parents of H04B 7/15 are, in turn, H04B 7/14 and H04B 7/00. In this case, the lowest common patent classification code is H04B 7/00. Therefore, the table is shown in the following table. The cell associated with H04B 7/00 has no cell value as described above. It is necessary to generate a nested patent classification combination set up to just before the common parent of each patent classification, and limit the depth for generating the nested patent classification combination set such as a subgroup. If there is no limitation, taking IPC as an example, there is a problem that too many patent classification combination sets are generated from excessive upper patent classifications when sections are different.

H04B 7/26 H04B 7/24 H04B 7/00 H04B 7/15 One One H04B 7/14 One One H04B 7/00

Table 1 shows the original patent classification combinations (H04B 7/15, H04B 7/26), (H04B 7/14, H04B 7/26), (H04B 715 /, H04B 7/24), and (H04B 7/14). , H04B 7/24). In Table 1 above, the common parent is H04B 7/00, and depth limitations were handled in subgroups.

On the other hand, when there are three or more patent classifications, the nested patent classification combinations shown in Table 1 may be generated for each patent classification combination, and the nested patent classification combinations considering the main patent classification and the sub patent classification are considered. May be possible. In this regard, the patents of H04B 7/04, H04B 7/155, and H04Q 7/30, which are assigned to the Korean Patent Application No. 10-2006-0012606 in the patents of the applicant / inventor as of January 2006, will be described as an example. I'm laying.

On the other hand, processing patent classification combination sets can be introduced. Since the original patent classification shown in the patent document has a different depth (depth, the number of dots included in the title information) for each patent classification, the depth can be kept constant for each original patent classification with reference to the patent classification system. You can introduce a patent classification that has been processed to make it work. An example of the processing to be maintained at a constant depth is, in the case of IPC, a process of adjusting the original patent classification to the subgroup or 1 dot subgroup level with reference to the patent classification system. The set of patent classification combinations generated by processed patent classification is called Processed Classification Combination Set (PCCS).

The patent classification combination set of the present invention is a patent classification combination set including at least one of the original patent classification combination set, the nested patent classification combination set, and the processed patent classification combination set. The following describes the patent classification combination set. In the fusion information processing of the present invention, the user can of course select the processing range of the patent classification combination set.

On the other hand, when there is a patent classification, the unit fusion element generation unit 970 may generate an index combination set (ICS, Index Combination Set) such as the frequency-maintenance key keyword combination set even for an index corresponding to the patent classification. . The index combination set is not generated for all patent documents, and two or more patent classifications correspond to at least one or more types of patent classifications in one patent document, and the two or more patent classifications correspond to different indices. It can only be created when it is used (except in the case of the use of a higher patent classification described below). For example, one patent document has two or more IPCs, and each IPC corresponds to a different index, or Is at least two USPCs, and if the USPCs correspond to different indices, index combinations (Index Combination, IC, (Ii, Ij)) are applied to two or more IPCs and two or more USPCs, respectively. Can be generated. The original patent classification is included in the patent document and is called the original patent classification, and the index corresponding to the original patent classification is called cause index.

When there are n original patent classifications for each type of patent classification in the i-th patent document Pi, there is an original classification set corresponding to the Pi, and a source index set corresponding to the original patent classification set. Index Set, OIS) gives OIS (Pi) = {I1 (Pi), I2 (Pi), ..., Ii (Pi), Ij (Pi) ,,, In (Pi)}. In the above description, i, j, n are integers, and In (Pi) refers to the nth index selected from the i th patent document Pi. Similar to the generation of the frequency-maintaining core keyword combination set, the original index combination set (OICS) corresponding to the i-th patent document Pi (OICS) OICS (Pi) = {OIC1 (Pi), OIC2 (Pi), .. , OICi (Pi), OICj (Pi) ,,,, OICn (Pi) ,,,, OICnCr (Pi)}. The cause index combination set may be generated according to the type of patent classification to which the index is associated. In this case, US patent documents may be generated for each IPC and USPC. In this way, the generation of the cause dex combination set for the original patent classification can be handled equivalently to the generation of the frequency maintenance key keyword combination set. The cause index combination set becomes a narrow index combination set.

Meanwhile, a nested index combination corresponding to the nested patent classification combination may be possible. When generating the nested index combination, 1) a method of constructing a nested patent classification combination as shown in Table 1 and removing a combination corresponding to the same index combination among the patent classification combinations, and 2) nested patent classification as shown in Table 1 above. When extracting the upper patent classifications of the original patent classification to be arranged on the horizontal axis and the vertical axis to generate the combination, a method of generating a table in abbreviated form by removing the upper patent classification corresponding to the same index may be used.

On the other hand, a processing index combination set can be introduced. Since the original patent classification shown in the patent document has a different depth (depth, the number of dots included in the title information) for each patent classification, the depth can be kept constant for each original patent classification with reference to the patent classification system. The processing may be performed so that an index corresponding to the processed patent classification may be introduced. The set of index combinations created using processed patent classifications is called a Processed Index Combination Set (PICS).

The index combination set of the present invention is an index combination set including at least one of the cause index combination set, the nested index combination set, and the processing index combination set. The following describes the index combination set. In the fusion information processing of the present invention, the user can of course select the processing range of the index combination set.

The unit fusion element generation unit 970 is a hybrid combination by combining two or more patent classifications revealed, upper patent classifications up to a predetermined hierarchy of the two or more patent classifications revealed, and / or the key keywords and / or indexes. Create a hybrid combination set. That is, n key keywords extracted from a patent document, m patent classifications (separately processed according to types of patent classifications, or preset upper patent classifications for each m patent classifications) included in bibliographic matters of a patent document, and an index A hybrid combination set can be generated as an element of the combination set generation.

The frequency maintaining key keyword combination set KCS (Pi) = {KC1 (Pi), KC2 (Pi), ..., KIi (Pi), KIj (Pi) ,,,, KCn (Pi) for the patent document Pi , ..., KCnCr (Pi)} is 1) stored in correspondence with the Pi, the stored KCS (Pi) is used, 2) when Pi is called, in real time from the key keyword KS (Pi) The method produced and used may be used. The 1) pre-generated storage method and the 2) real-time generation method for the frequency maintenance key keyword combination set will be the same for the patent classification and the index. That is, the patent classification combination set CCS (Pi) = {CC1 (Pi), CC2 (Pi), ..., CCi (Pi), CCj (Pi) ,,, CCn (Pi) for the patent document Pi. , ..., CCnCr (Pi)} is stored corresponding to the Pi, the stored CCS (Pi) may be used, and when Pi is called, generated and used in real time from the patent classification set CS (Pi) Could be And the index combination set ICS (Pi) = {IC1 (Pi), IC2 (Pi), ..., ICi (Pi), ICj (Pi) ,,, ICn (Pi), ..., ICnCr (Pi)} is stored corresponding to the Pi, and the stored ICS (Pi) may be used, and may be generated and used in real time from the index set IS (Pi) when Pi is called. will be.

Next, a method of fusion processing for each fusion processing population will be described in more detail. The fusion processing population refers to a set of targets for which a user of the patent information system 1000 wants to know fusion information. The convergence processing population may include 1) input of a search term to a search engine or 2) input of a query utilizing a field constituting the patent DB 112, 3) input of a patent information extraction condition corresponding to the search term or the query, and the like. Can be generated. The patent information extraction condition may be any one of the issuing country, time range, applicant, owner or assignee, inventor, patent classification, (if there is a patent subject classification and a predetermined classification attribute, etc.) included in the patent document. Or a combination of any two or more of the above, wherein the time that is the basis of the time range is any one or any combination of two or more of the priority date, application date, publication date, and registration date of the patent document, and the patent classification Is any one or combination of two or more of IPC, USPC, FT, FI, and ECLA, and the patent subject classification is one or more of the catchword of the IPC, the catchword of the USPC, or the IPC, USPC, FT, FI, and ECLA. Is one or more of the subject classifications generated using, and the predetermined classification attribute is the applicant or the owner or the assignee On may be limited to the classification of the property, classified-patent classification properties constituting the patent classification, the classification properties of the inventor, or any one or more of the patent subject by Category classification properties.

The generation of the fusion process population is in charge of the fusion process population acquisition unit 910 of the present invention. The convergence processing collection acquisition unit 910 obtains a specified input through a user's input or a predetermined setting through a predetermined setting value or a batch setting for at least one predetermined object (for example, all IPCs). Extract a patent set. At this time, the patent set to be extracted primarily may be sufficient as the key value of the patent document. This is because processing information such as bibliographic information, frequency maintenance key keyword combination set, patent classification combination set, etc. of the patent document corresponding to the key value can be easily obtained as the key value of the patent document. Obtaining a patent set including the IPC from the patent DB 112, etc.) will be a common daily routine for those skilled in the art of performing a patent search or analysis.

Subsequently, the set dividing unit 921 of the fusion processing target set generation unit 920 of the present invention divides the set into at least two with respect to the obtained fusion processing population. The aggregation dividing unit 921 provides at least one division processing criterion to the user through the division processing reference providing unit 921-1. The division processing reference acquisition unit 921-2 of the present invention obtains selection information on the division processing criteria or input information on the division processing criteria from the user computer 5000. The split processing criteria may be at least one combination, selection or input of any criteria such as 1) time, 2) conditions (registered, published, split or not, ...), 3) applicant, 4) ... etc. have. For example, it can be divided into three years and three years before the current filing date. 15 is an exemplary diagram of the two-split model. 16 is an exemplary diagram for a complex partition model. When the fusion process population is referred to as T, when the T is divided into two, two divided patent sets are generated. One of the divided patent sets is A, and the other is B. For example, A is a set of patents within three years of the filing date, and B is a set of patents three years before the filing date. The line between A and B (DC) is the dividing dividing line, which is a conceptual line for the visualization of the dividing conditions. If n + 1 splitting is performed as shown in FIG. 16, n + 1 split patent sets such as A, B1, ..., Bn are generated. In this case, n division lines exist, and each division line corresponds to a division condition. In this case, the n partitioning conditions may be the same series (for example, when the file is divided by one year based on the filing date, the attribute of the splitting condition is the same series as the time attribute) but may be another series or another attribute. For example, A and B1 may be time attributes, and B1 and Bn may independently specify partitioning conditions on an applicant basis. The division condition is stored in the division processing reference DB 921-3. The partitioning condition may be set by a user, but at least one criterion may be designated as a default value in the system. On the other hand, the division will naturally include subdividing A again into several.

The cited patent set obtaining unit 922 of the present invention may obtain any one or more of a front cited patent and a back cited patent of a patent document included in the patent set on a patent set basis. For example, with respect to the A, the forward cited patents may be obtained for each individual patent document which is an element of A, and the union operation may be performed to generate the forward cited patent set PA of A. Similarly, you can create a parent set of B (PB) or parent set of Bn (PBn). On the other hand, it is possible to generate a child set of A (backward patent set CA) consisting of back patent documents citing individual patent documents that are elements of A. The patent documents constituting the PA are listed in the bibliography of the patent documents constituting A, so it is relatively easy to create a set. However, in order to generate a CA, it is necessary to find a child patent document having individual patent documents that are elements of the parent field A. Therefore, it is relatively less easy. However, the generation of PA or CA in DB information processing is extremely easy for those skilled in the art. The cited patent set obtaining unit 922 of the present invention may obtain the forward cited set in the divided patent set unit. On the other hand, when generating the cited patent set, various criteria (e.g., citation depth, potential citations included) or various restrictions (applicant limitation, period limitation, country restriction, patent classification restriction such as restriction of patent or key keyword limitation) Etc.) may be used to generate various cited patent sets. That is, for A or Bi, PA or PBi may be composed of only part of all cited patents of patent documents constituting A or all patent documents constituting Bi.

On the other hand, in the present invention, the designated set acquisition unit 923 obtains designated information on the patent set designated by the user from the user computer 5000, and serves to obtain the designated patent set. As an example of the designated patent set, a patent set managed by a user may be an example. With respect to the patent set obtained by the designated set obtaining unit 923, it is possible to obtain a set division and a forward cited patent set for each divided set.

Next, an embodiment of the fusion information processing of the present invention will be described in more detail. For convenience of description, A and B sets are divided and generated by dividing the T into two parts for the fusion processing population T, PA as the forward cited patent set for A, and PB as the forward cited patent set for B. Let's say there is. Although there is no patent document belonging to A and B in common (may be in a very special case, but not in most cases), there may be a patent document belonging to PA and PB in common. Let's call PAB a patent document common to PA and PB. In other words, a patent document belonging to PAB naturally belongs to PA or PB.

The A, B, PA, PB, PAB is a set of patents, and may include at least one or more patent documents. (Of course, it may be an empty set, in which case it is obvious that there are no patent documents.) Let Pi be the Pith patent document it belongs to. In this case, patent documents belonging to B, PA, PB, and PAB will be Pi (B), Pi (PA), Pi (PB), and Pi (PAB), respectively.

Maintain Frequency Corresponding to Pi Patent Document Set of Key Keyword Combinations KCS (Pi) = {KC1 (Pi), KC2 (Pi), ..., KCi (Pi), KCj (Pi) ,,,, KCn (Pi), ..., KCnCr (Pi)}, Patent Classification Combination Set CCS (Pi) = {CC1 (Pi), CC2 (Pi), ..., CCi (Pi), CCj (Pi) ,,,, CCn (Pi) ), ,,,, CCnCr (Pi)}, index combination set ICS (Pi) = {IC1 (Pi), IC2 (Pi), ..., ICi (Pi), ICj (Pi) ,,,, ICn ( Pi), ,,,, ICnCr (Pi)}, a combination set as described above can be generated also for a patent document set having at least one patent document as an element.

The unit fusion element generating unit of the present invention has two methods for generating a frequency maintenance core keyword combination set, patent classification combination set, and index combination set in the patent document set unit. The first is a method of constructing a frequency-ignoring combination set, which performs a simple union operation on the key keyword combination set, the patent classification combination set, and the index combination set. For example, for all patent documents Pi and Pj constituting a patent document set, the frequency maintenance of the Pi key keyword combination set KCS (Pi) = {KC1 (Pi), KC2 (Pi), ..., KCi (Pi), KCj (Pi) ,,,, KCn (Pi), ..., KCnCr (Pi)} and maintaining the frequency of Pj Key keyword combination set KCS (Pi) = {KC1 (Pi), KC2 (Pi) , ..., KCi (Pi), KCj (Pi) ,,,,, KCn (Pi), ..., KCnCr (Pi)}. Since it is a union operation of a set unit, duplicate key keyword combinations are treated as one, even if they are duplicated several times. In this case, even if the same key keyword combinations (Ki, Kj) appear six times in ten patent documents, they are treated like one.

The second method is to construct a frequency maintenance combination set, which is a union operation for the frequency maintenance core keyword combination set, the patent classification combination set, and the index combination set for each Pi, but internally stores the frequency for each combination. For example, when a particular KCi is n times from the KC (keyword combinations) of the patent documents constituting a specific patent document set, the number of recovery n is mapped to a unique number (management ID) that specifies the KCi. . In this case, if the same key keyword combination (Ki, Kj) appears six times in ten patent documents, frequency information is maintained as (Ki, Kj): 6.

Let's say that all patent documents belonging to A are P (A) (which can be understood as abbreviations for Patent Documents of Set A) and the A's i-th patent document is Pi (A). In this case, the frequency maintaining key keyword combination set corresponding to Pi (A) KCS (Pi (A)) = {KC1 (Pi (A)), KC2 (Pi (A)), ..., KCi (Pi (A )), KCj (Pi (A)) ,,,, KCn (Pi (A)), ..., KCnCr (Pi (A))}, and patent classification combination set CCS (Pi (A)). ) = {(CC1 (Pi (A)), CC2 (Pi (A)), ..., CCi (Pi (A)), CCj (Pi (A)) ,,,, CCn (Pi (A)) , ,,,, CCnCr (Pi (A))}, and the index combination set ICS (Pi (A)) = {(IC1 (Pi (A)), IC2 (Pi (A)), .. , ICi (Pi (A)), ICj (Pi (A)) ,,,, ICn (Pi (A)) ,,,,, ICnCr (Pi (A))}. The same shall apply to Pi (B), Pi (PA), Pi (PB), and Pi (PAB), as shown in parentheses refer to a specific set of patent documents.

The fusion pattern processor 940 of the present invention processes the fusion pattern according to the fusion process reference information of the fusion process reference information unit 930. The convergence processing reference information may be a standard designated by the patent information system 1000 by default, but may be obtained from the user. The fusion processing reference information unit 930 of the present invention provides the fusion processing criteria to the user computer 5000 through the fusion processing reference providing unit 931, and is selected or input by the user through the fusion processing criteria obtaining unit 932. One fusion treatment criterion is obtained. Subsequently, the fusion processing criteria for each user is generated through the fusion processing criteria generation unit 933, and the generated fusion processing criteria are stored in the fusion processing criteria policy DB 934.

The information processing of the fusion pattern processing unit 940 will be described, taking as an example one of the fusion processing criteria designated by the patent information system 1000 as a default.

The default fusion processing criteria may include the following series, and a user may select various fusion processing criteria described below, and the patent information system 1000 may fuse the selected fusion processing criteria by applying the selected fusion processing criteria. Process the information. The following description will be made based on the application of the above-described frequency ignore combination set configuration method, which will be described later.

The first is the set of differences between the sets of divisions. The fusion pattern processing unit 940 obtains the frequency maintaining key keyword combination set KCS (Pi (A)) and KCS (Pi (B)) for A and B for T, and obtains KCS (Pi (A))- Perform a difference set operation called KCS (Pi (B)). The key keyword combination belonging to "KCS (Pi (A))-KCS (Pi (B))" is a combination that exists only in KCS (Pi (A)), not in KCS (Pi (B)). The key keyword combination belonging to "KCS (Pi (A))-KCS (Pi (B))" is related only to the A set and becomes a newly imported key keyword combination to the A set. On the other hand, the fusion pattern processing unit 940 obtains the patent classification combination set CCS (Pi (A)) and CCS (Pi (B)) for A and B for the T, and CCS (Pi (A))- A difference set operation called CCS (Pi (B)) is performed. The patent classification combination belonging to "CCS (Pi (A))-CCS (Pi (B))" is a combination which exists only in CCS (Pi (A)), not in CCS (Pi (B)). A patent classification combination belonging to "CCS (Pi (A))-CCS (Pi (B))" is related only to the A set and becomes a newly imported patent classification combination to the A set. A combination set (key keyword, patent classification, index, or hybrid) generated as a result of a set operation such as a newly imported patent classification combination into the A set may be analyzed by the fusion pattern analyzer 950 of the present invention. (Hereafter, it is the same)

In the above, the process of the fusion pattern processing unit 940 is described for a patent set divided into two for T. The difference set operation may be similarly applied to the patent set divided into n + 1. When T is divided into A, B1, B2, ,, and Bn, the following set operation may be exemplarily possible.

1) KCS (Pi (A))-KCS (Pi (B1)), ,, KCS (Pi (Bi))-KCS (Pi (B (i + 1))), KCS (Pi (Bi))- A set operation between sets of key keyword combination sets that maintain the frequency between sets of adjacent patents divided by adjacent or divided processing criteria, such as KCS (Pi (Bj)),,

2) CCS (Pi (A))-CCS (Pi (B1)),,, CCS (Pi (Bi))-CCS (Pi (B (i + 1))), CCS (Pi (Bi))- A set operation between sets of patent classification combinations between sets of adjacent patents divided by CCS (Pi (Bj)),, or divided by division processing criteria;

3) Maintaining the frequency of the patent set which unionized two or more divided patent sets obtained by the user's selection The key keyword combination set and / or the patent classification combination set may be a unit of operation for the difference set operation.

The second is the family of difference sets between forward cited patent sets. The fusion pattern processing unit 940 obtains a frequency and key keyword combination set KCS (Pi (PA)) and KCS (Pi (PB)) for maintaining the frequency of PA and PB for A and B, and KCS (Pi (PA)). Perform a difference set operation called KCS (Pi (PB)). The key keyword combination belonging to "KCS (Pi (PA))-KCS (Pi (PB))" is a combination which exists only in KCS (Pi (PA)), not in KCS (Pi (PB)). The key keyword combination belonging to "KCS (Pi (PA))-KCS (Pi (PB))" is related to the PA set only and becomes the newly imported key keyword combination. Meanwhile, the fusion pattern processing unit 940 obtains the patent classification combination set CCS (Pi (PA)) and CCS (Pi (PB)) of PA and PB for A and B, thereby obtaining CCS (Pi (PA)). Perform a difference set operation called CCS (Pi (PB)). The patent classification combination belonging to "CCS (Pi (PA))-CCS (Pi (PB))" is a combination which exists only in CCS (Pi (PA)), not in CCS (Pi (PB)). Patent classification combinations belonging to "CCS (Pi (PA))-CCS (Pi (PB))" are related to the PA set only and are newly imported patent classification combinations in the PA set.

In the above, the process of the fusion pattern processing unit 940 has been described with respect to a forward cited patent set for a patent set divided into two with respect to T. The difference set operation can be similarly applied to a patent set divided into n + 1. have. If T is divided into A, B1, B2, ,,, Bn, the forward cited patent set for each of them would be PA, PB1, PB2, ,,, PBn, and the following set operations would be exemplarily possible: will be. Meanwhile, the forward cited patent set of T becomes PT.

1) KCS (Pi (PA))-KCS (Pi (PB1)), ,, KCS (Pi (PBi))-KCS (Pi (PB (i + 1))), KCS (Pi (PBi))- Maintain frequency between the divided patent sets which are adjacent to each other or divided by division processing criteria such as KCS (Pi (PBj)),, etc.

2) CCS (Pi (PA))-CCS (Pi (PB1)),,, CCS (Pi (PBi))-CCS (Pi (PB (i + 1))), CCS (Pi (PBi))- A set operation between a set of patent classification combinations between a set of divided patents adjacent to each other or divided by division processing criteria such as CCS (Pi (PBj)),,

3) Maintaining the frequency of the patent set which unionized two or more divided patent sets obtained by the user's selection The key keyword combination set and / or the patent classification combination set may be a unit of operation for the difference set operation.

The third is a set of difference operations that utilizes both a split set (in some cases a population) and a forward cited patent set. The fusion pattern processing unit 940 is KCS (Pi (A)), KCS (Pi (T)), KCS (Pi (PA)), KCS (Pi (PT)), KCS (Pi (PAB)) and KCS ( Pi (PB)), KCS (Pi (A))-KCS (Pi (PA)), KCS (Pi (A))-KCS (Pi (PT)), KCS (Pi (A))-KCS (Pi (B))-KCS (Pi (PT)), KCS (Pi (PA))-KCS (Pi (A)), KCS (Pi (PA))-KCS (Pi (T)), KCS (Pi (A))-KCS (Pi (PT)), KCS (Pi (PAB))-KCS (Pi (A)) performs one or more difference set operations. On the other hand, the fusion pattern processing unit 940 is the fusion pattern processing unit 940 CCS (Pi (A)), CCS (Pi (T)), CCS (Pi (PA)), CCS (Pi (PT)), Obtain CCS (Pi (PAB)) and CCS (Pi (PB)), CCS (Pi (A))-CCS (Pi (PA)), CCS (Pi (A))-CCS (Pi (PT)) , CCS (Pi (A))-CCS (Pi (B))-CCS (Pi (PT)), CCS (Pi (PA))-CCS (Pi (A)), CCS (Pi (PA))-CCS Difference operations of one or more of (Pi (T)), CCS (Pi (A))-CCS (Pi (PT)), CCS (Pi (PAB))-CCS (Pi (A)) are performed.

In the above, the process of the fusion pattern processing unit 940 has been described with respect to a forward cited patent set for a patent set divided into two with respect to T. The difference set operation can be similarly applied to a patent set divided into n + 1. have. If T is divided into A, B1, B2, ,,, Bn, then the forward cited patent set for each of them will be A, PA1, PA2, ,,, PAn, and the following set operations may be exemplarily possible: will be.

1) KCS (Pi (A))-KCS (Pi (PA1)), ,, KCS (Pi (PAi))-KCS (Pi (PA (i + 1))), KCS (Pi (PAi))- Maintaining the frequency between adjacent patent sets divided by adjacent or divided processing criteria such as KCS (Pi (PAj)),, etc.

2) CCS (Pi (A))-CCS (Pi (PA1)),,, CCS (Pi (PAi))-CCS (Pi (PA (i + 1))), CCS (Pi (PAi))- A set operation between a set of patent classification combinations between a set of divided patents adjacent to each other or divided by division processing criteria such as CCS (Pi (PAj)),,

3) KCS (Pi (PA))-KCS (Pi (A1)), ,, KCS (Pi (Ai))-KCS (Pi (A (i + 1))), KCS (Pi (Ai))- Maintain frequency between the divided patent sets adjacent to each other or divided by the division processing criteria such as KCS (Pi (Aj)),, etc.

4) CCS (Pi (PA))-CCS (Pi (A1)),,, CCS (Pi (Ai))-CCS (Pi (A (i + 1))), CCS (Pi (Ai))- A set operation between a set of patent classification combinations between a set of divided patents adjacent to each other or divided by division processing criteria such as CCS (Pi (Aj)),,

5) Maintaining the frequency of the patent set which unionized two or more divided patent sets obtained by the user's selection The key keyword combination set and / or the patent classification combination set may be a unit of operation for the difference set operation.

The core keyword combination, the patent classification combination, and the index combination satisfying that the difference calculation operation generated by the fusion pattern processing unit of the present invention is newly imported into the patent document set of interest with respect to the patent document set to be compared which is one example of the predetermined fusion processing criteria May be the result of screening treatment.

Next, the information processing method of the fusion pattern processing unit 940 for the frequency maintaining combination set configuration method will be described.

First, the frequency maintenance difference operation between two divided sets will be described first. The invention idea of frequency maintenance difference operation is described by taking key keyword combination difference operation as an example. In this method, the frequency information n of the specific key keyword combination (Ki, Kj) in a specific patent document set (for example, A) is maintained as (Ki, Kj): n. In this case, when the (Ki, Kj) also exists in the B set, when the difference operation is specified as AB, the result of the frequency maintenance key keyword difference operation for the (Ki, Kj) is (Ki, Kj). ): (nm). In the case of the frequency ignore key keyword difference operation, (Ki, Kj) is a key keyword combination of both A and B, and thus, (Ki, Kj) is not included in the result of the difference operation. On the other hand, in the case of the frequency maintenance key keyword operation, it is determined whether (Ki, Kj) is included in the key keyword difference operation result according to the relationship between n and m. The relationship between n and m is managed by the fusion processing reference information unit 930 of the present invention.

Examples of the relationship between n and m include 1) when nm is greater than or equal to a predetermined numerical value, 2) when nm is greater than or equal to a predetermined numerical value, and m is less than or equal to a predetermined numerical value. 4) the function f (n, m) satisfies a predetermined condition, and n or m also meets a predetermined condition. have. Referring to the specific numerical value as an example, 1) has an advantage that is well applied when the difference between n and m is simply large, such as 100 and 3. However, in the above 1), when comparing the case where n and m are 100 and 30 and the case of 50 and 1, the difference value is larger in the former even though the latter meets the fusion treatment criteria. Many of both and B do not have a newly imported attribute.) Therefore, in this case, the method of 2) is valid. An example of the method of 2) may be a difference in frequency of 10 or more, m cannot exceed 2, and the like. On the other hand, in the case where the size of A (the number of patent documents belonging to A) is small and the value of n has a relatively small value, such as when the size of B is large, n is applied to n, m, such as square treatment. By processing the function, the frequency maintenance difference operation can be performed.

Although T has been described based on two divisions of A and B, the idea of the present invention can be similarly applied to the case where T is divided into A, B1, B2, ,,, Bn. It will be apparent to those skilled in the art based on what is described in the difference set operation at the time of construction.

On the other hand, when the set is divided into n + 1 rather than divided into two, if the frequency of the (Ki, Kj) f1, ... f (n + 1) for each divided patent document set, the f1 ~ Calculate increase / decrease, increase / decrease / change of f (n + 1). When the increase / decrease / change of the (Ki, Kj) satisfies a predetermined condition, the (Ki, Kj) may be included in the frequency maintenance key keyword difference calculation result, and the predetermined condition is the fusion. The processing reference information unit 930 manages this.

Meanwhile, the present invention can be applied to a frequency maintenance difference operation between forward cited patent sets for a key keyword combination, and a frequency maintenance difference operation using both a split set (in some cases, a collective) and a forward cited patent set. It will be apparent to those skilled in the art that the present invention can be applied to. In addition, although the frequency maintenance difference calculation method has been described with respect to key keyword combinations, it will be apparent to those skilled in the art that the present invention can be similarly applied to patent classification combinations and index combinations. In other words, the invention idea of the frequency maintenance difference set operation method described above is equally applicable to 1) n + 1 division set processing, 2) processing for forward citation patent set, and the like for patent classification combinations and index combinations. May be applied.

The result of the difference operation generated by the fusion pattern processor of the present invention is 1) a key keyword combination difference set, 2) a patent classification combination difference set, and 3) an index combination set difference set. KCSS (Keyword Combination Subtraction Set), CCSS (Classification Combination Subtraction Set), ICSS (Index Combination Subtraction Set). The core keyword combination set belonging to the above-mentioned "KCS (Pi (PA))-KCS (Pi (PB))" is referred to as KCSS (PA-PB) (KCSS of Patent Document Set A and Patent Document Set B). . The KCSS, CCSS, and ICSS may include at least one selected from a case in which a frequency disregarding method is applied and a case in which a frequency maintenance method is applied.

Subsequently, the fusion pattern analyzer 950 of the present invention analyzes various sets of calculation results including various sets of difference calculation results of the present invention generated by the fusion pattern processor 940. The fusion pattern analyzer 950 generates predetermined analysis information about the difference set operation result set.

The fusion pattern analyzer 950 may perform a predetermined analysis on all combination subset sets (CSSs) constituting the KCSS, CCSS, and ICSS. An example of the preset analysis will be as follows. Discusses key keywords, patent classifications, and / or indexes.

The fusion pattern analyzer 950 may generate analysis information such as 1) to 6) when there is a specific KCSS, and this method may be applied to CCSS and ICSS as well.

1) In the network analysis of the KCSS, the fusion pattern analysis unit 950 analyzes the network relationship between individual key keywords constituting a key keyword combination that is an element of the KCSS. The network relationship is applied to any network analysis known to those skilled in the network analysis technology field for the key keyword combinations (Ki, Kj) that make up all KCSS. The network analysis may find hub key keywords in the key keyword network. The hub key keyword refers to a key keyword Ki having the largest number of network branches, a predetermined number, or more than a percentile.

2) In the visualization of network analysis, the fusion pattern analyzer 950 may be visualized by drawing a connecting line between Ki and Kj of the key keyword combinations (Ki, Kj) of KCSS. The line may be drawn only for key keywords having more than a predetermined branch, which is obvious among those skilled in the network analysis visualization technology.

3) If there is a hub key keyword, as an example of the network analysis visualization, 1) a key keyword network diagram associated with the Ki in the KCSS for the hub keyword Ki, and 2) the difference Based on the comparison target set (for example, B in the case of AB), the key keyword network diagram associated with the Ki may be displayed in contrast. In this case, there is an advantage in that the change / change / difference of the key keyword well associated with the key keyword Ki for each partitioned set can be shown.

4) For the key keyword combination (Ki, Kj) constituting the KCSS, (Ki, Kj) by querying a patent document including the Ki and the Kj in one patent document Pi in common; The patent document set having (Ki, Kj) as key keyword combinations can be mapped to. The correspondence may provide a list of patent documents, a bibliography, a full text, etc. of a patent document set corresponding to a search / query result when the user clicks on the (Ki, Kj). On the other hand, when the association of (Ki, Kj) is displayed on a network diagram (e.g., when the association is indicated by a line, etc.), the (Ki, Kj) is keyed around the display of the association. You can display the number of patent documents that you include as keyword combinations. In this case, when the user clicks on the numerical value, a list of related patent documents, bibliographic details, and the like may be provided.

On the other hand, when the frequency maintaining combination set configuration method is used, the fusion pattern analysis unit 950 is the extraction of the most frequent key keyword combination, the fusion pattern analysis unit 950 is the (Ki, Kj) of the elements of the KCSS The most frequent frequency can be extracted in order of frequency. In the case of disregarding the frequency, the frequency of the (Ki, Kj) constituting the KCSS is 1. On the other hand, the above 1) to 4) of the frequency ignoring method can be applied to the frequency maintaining method as it is, of course.

On the other hand, the fusion pattern analysis unit 950 6) performs the time series analysis for each (Ki, Kj), the applicant analysis, the inventor analysis, the patent classification analysis for each of the least key keyword combinations. The fusion pattern analysis unit 950 obtains a set of patent documents related to the least common key keyword combination (for example, the method of querying / searching a set of patent documents simultaneously including the Ki and the Kj as key keywords). And analysis information based on at least one selected from various bibliographic matters (applicant, inventor, time, patent classification, etc.) of the patent document set.

The fusion pattern analysis unit 950 treats the patent classification combinations Ci and Cj, which are elements of the CCSS, as the key keyword combinations Ki and Kj, and sets the same as the patent classification combinations 1) to 6). You will be able to perform the analysis. Subsequently, the information processing of the fusion pattern analyzer 950 will be described based on a specific part of the CCSS that is different from the KCSS.

1) By extracting the most frequent patent classification combination, the fusion pattern analysis unit 950 extracts the most frequent frequency order among the elements (Ci, Cj) of the CCSS. At this time, the (Ci, Cj) may extract the CC for each depth of the patent classification (for example, 1 dot subgroup). When a CC is extracted for each 1 dot subgroup, if a specific Ci is a 3 dot subgroup, the Ci is referred to a patent classification system, and the 1 dot sub is a parent (= Grandparent) of a parent of the 3 dot subgroup. The group can be obtained, transform all CCs into 1 dot subgroups, and then extract the most frequent patent classification combinations. The depth of the patent classification may be obtained by selection from the user, or may be set by default by the patent information system 1000. When the depth of the patent classification described in this paragraph is set, if the specific patent classification is a lower patent classification less than or equal to the depth of the set patent classification, the lower patent classification is converted into an upper patent classification of the preset depth, and the converted higher patent The method of statistical processing or information processing by classification is commonly applied in the present invention.

2) As an analysis of the most frequent patent classification combinations, the fusion pattern analysis unit 950 performs time series analysis for each of the least frequent patent classification combinations, an applicant analysis for the least frequent patent classification combinations, an inventor analysis, and a patent classification analysis. The fusion pattern analysis unit 950 obtains a set of patent documents related to the most frequent patent classification combination, and based on any one or more selected from various bibliographic matters (applicant, inventor, time, patent classification, etc.) of the set of patent documents. Generate statistical information. In this case, when the depth of patent classification is set, the fusion pattern analyzer 950 performs the processing described in the above paragraph.

3) In the network analysis of the CCSS, the fusion pattern analysis unit 950 analyzes the network relationship between the individual patent classifications constituting the patent classification combination that is an element of the CCSS. The network relationship is applied to any network analysis in the network analysis technology field for patent classification combinations (Ci, Cj) of all CCSSs related to SS. In this case, when the depth of patent classification is set, the fusion pattern analyzer 950 performs the processing described above. In other words, if it is set as a 1 dot subgroup, the lower patent classification of less than 2 dot subgroup among all Ci and Cj is converted into the upper patent classification of the 1 dot subgroup by referring to the patent classification system, and converted into the converted higher patent classification. Perform network analysis for (Ci, Cj).

4) In the visualization of network analysis, the fusion pattern analysis unit 950 may be visualized by drawing a connection line between Ci and Cj of patent classification combinations (Ci, Cj) of all CCSSs related to SS. The visualization or network analysis will find the Ci with the most branches. In this case, when the depth of patent classification is set, the fusion pattern analyzer 950 performs the processing described above. In other words, if it is set as a 1 dot subgroup, the lower patent classification of less than 2 dot subgroup among all Ci and Cj is converted into the upper patent classification of the 1 dot subgroup by referring to the patent classification system, and converted into the converted higher patent classification. Perform visualization of network analysis for (Ci, Cj).

The visualization of the fusion pattern is performed in the fusion pattern visualization unit 951 of the present invention. The fusion pattern analyzer 950 may further include a fusion pattern UI unit 952 (user interface unit). The fusion pattern UI unit 952 may further include the following functions.

First, it is a limiting function for the partition set. The above limitations include: 1) inputting a search word to the search engine, 2) input of a query utilizing a field constituting the patent DB 112, 3) input or selection of a patent information extraction condition corresponding to the search word, or the query, and the like. Can be generated. The patent information extraction condition may be any one of the issuing country, time range, applicant, owner or assignee, inventor, patent classification, (if there is a patent subject classification and a predetermined classification attribute, etc.) included in the patent document. Or a combination of any two or more of the above, wherein the time that is the basis of the time range is any one or any combination of two or more of the priority date, application date, publication date, and registration date of the patent document, and the patent classification Is any one or combination of two or more of IPC, USPC, FT, FI, and ECLA, and the patent subject classification is one or more of the catchword of the IPC, the catchword of the USPC, or the IPC, USPC, FT, FI, and ECLA. Is one or more of the subject classifications generated using, and the predetermined classification attribute is the applicant or the owner or the assignee For the classification properties can be, or any one or more of the patent subject by Category classification properties of the classified-patent classification properties, the inventors constituting the category attribute, the patent classification. For example, when there is a distinction between an applicant, a company, a university, a research institute, an individual, etc., the applicant may be limited to only applicants having university attributes, and applicants having a specific number of applied / registered patents for a specific period of time. Applicant) may be limited only. In this case, the fusion pattern analysis unit 950 performs a predetermined fusion information processing for the limited set of partition set. On the other hand, since (Ki, Kj), (Ci, Cj), (Ii, Ij) and the like can be associated with a set of patent documents (described above by a search / query method), the limitation on the divided set is limited. The same restriction as that performed may be performed for the KCSS, CCSS, and ICSS. When the limitation is carried out, 1) only (Ki, Kj), (Ci, Cj), (Ii, Ij) which are relevant only to the patent set related to the limitation may be left out or eliminated (especially in the case of frequency ignoring), 2) analysis information is generated by subtracting the frequency values of (Ki, Kj), (Ci, Cj), (Ii, Ij) related to the patent set related to the limitation (especially in the case of the frequency maintenance method), 3 ) (Ki, Kj), (Ci, Cj), and (Ii, Ij) mandrel analysis information related to the patent set related to the limitation may be generated (especially in the case of a frequency maintenance method).

The second is quantity information processing. Frequency information of Ki may be displayed on each of the key keyword Ki nodes, or a frequency of (Ki, Kj) may be displayed on a line connecting Ki and Kj. When the user clicks on the frequency, information (patent document list, bibliographic information, professional information, etc.) about a patent document set having Ki and Kj as key keywords is provided to the user computer 5000, or the user When clicking on the frequency of (Ki, Kj), it is possible to provide the user computer 5000 with information about a patent document set including both Ki and Kj as key keywords. The processing in this paragraph for the keyword applies equally to the processing for patent classification. On the other hand, according to the user's selection, when providing information on the patent document set for Ci or (Ci, Cj), the patent document set related to the upper patent classification of a predetermined depth in the patent classification system of Ci, Cj You can optionally provide

Third is the labeling display processing function. The labeling display process refers to processing by displaying different marks (for example, colors, patterns, lines, etc.) for each labeling object such as the applicant, inventor, time display, and the like.

Fourth, the filtering comparison processing function. The filtering comparison process refers to a process of comparing the patent documents of Samsung Electronics and LG Electronics among the divided sets or the (Ki, Kj), (Ci, Cj), and (Ii, Ij). When the filtering comparison process is combined with the labeling display process, blue for (Ki, Kj) related to Samsung Electronics, red for (Ki, Kj) related to LG Electronics, and purple for common (Ki, Kj) You will be able to compare and display. The same applies to patent classification.

Fifth, the conversion display function of the patent classification display into the index display. The patent classification corresponds to the index in a 1: 1 or 1: n correspondence. Therefore, where the patent classification is displayed, at least one index corresponding to the patent classification may be displayed according to the index display selection of the user.

Sixth, it is a conversion display function of the patent classification display to the title information display. The patent classification is in a code system, and what a particular code means is unknown unless the title information of the code is stored. Therefore, each time a patent classification appears visually, title information for that patent classification will need to be provided. The way in which the title information appears is 1) when a user request or interaction (for example, placing a pointer of a pointing device such as a mouse in a patent classification), or 2) providing a patent classification and title information together. And 3) title information is given priority. On the other hand, when there is a specific patent classification Ci, the title information is provided only when 1) only the title information of Ci is provided. Etc.) and / or title information up to the upper classification, 3) in addition to the Ci title information, a predetermined upper classification of Ci (for example, a superior patent classification) and / or a title of the patent classification. Information) and Ci's predetermined lower patent classification (for example, the subordinate patent classification and / or up to the titles of the subordinate patent classification).

Next, the fusion pattern report generator 953 of the present invention will be described. The fusion pattern report generator 953 may be configured according to a user's selection or default selection value of the patent information system 1000 among information generated by the fusion pattern visualization unit 951 or the fusion pattern UI unit 952. Create a fusion pattern report in the set report format. Of course, the reporting tool can also utilize various reporting tools such as crystal report and oz report. Since the generation of the report is an easy technique for those skilled in the art, detailed description thereof will be omitted. The fusion pattern report is based on the user's selection or the command of the patent information system 1000 under the control of the fusion pattern information batch generation unit 960, and arranged according to a predetermined patent classification, a predetermined applicant, and a predetermined inventor. It can also be created as a task.

Next, the fusion pattern information arrangement generation unit 960 of the present invention will be described. Batch generation of fusion pattern information refers to performing preset fusion information processing as a batch job. In order to perform a batch job, 1) selection of a separate population subject to batch work, 2) conditions for processing convergence information (set of division criteria, fusion processing criteria, fusion patterns (core keyword fusion, patent classification fusion, Hybrid fusion), and fusion pattern analysis (including visualization). An example of the divided population may include a patent document set composed of patent documents corresponding to all patent classifications below a predetermined depth (for example, a subclass) on the patent classification system. Naturally, the patent classification includes any one or more of IPC, USPC, FT, FI, and ECLA. In this case, when constructing a patent document set, the idea of automatically including lower patent classification codes may be applied. For example, when a patent document of a specific patent classification is extracted from the patent DB 112, it refers to extracting a patent document including all sub-patent classifications on the patent classification system including the patent classification. This is a problem of notation of a patent classification, and in the case of IPC, there is a problem in that a patent corresponding to a 2 dot subgroup below the 1 dot subgroup cannot be extracted by a method of searching for an extension or the like below the 1 dot subgroup.

Next, the information processing method of the patent information system 1000 of the present invention will be described in more detail with reference to FIGS. 20 to 23 of the present invention. 20 is an information processing method of the patent information system 1000 associated with the simple two-split model of FIG. 15 or the complex split model of FIG. In the patent information system 1000, the patent information system 1000 (A) divides at least one or more patent sets generated or received by applying at least one division processing criterion to divide the first divided patent set and the at least one agent. A two-part patent set is generated (SF11). The created or received patent set is the fusion processing set described above. The generation is generated by the patent information system 1000 according to a preset setting rule through the fusion processing target set generation unit 920 of the present invention, or by a user's search expression or a query input or a patent DB 112. It may be generated by input of patent information extraction conditions. Meanwhile, a method of receiving an external file upload or a patent document set managed by the user from the user of the patent information system 1000, or obtaining the patent information extraction condition through the designated set acquisition unit of the present invention. Patent sets can be obtained by clicking on objects (links, etc.) in which the set extraction conditions are embedded.

Subsequently, the patent information system 1000 may include (B) converting a predetermined unit fusion element corresponding to the patent document from the patent documents included in the first divided patent set and the second divided patent set to the first divided patent set. And generate each of the second divided patent sets (SF12). The unit fusion element is preferably composed of one or more selected from patent classification combinations, index combinations and key keyword combinations. Naturally, the generated unit fusion elements may be a set, and the set may be a patent classification combination set, an index combination set, a key keyword combination set, and the like.

Subsequently, the patent information system 1000 selects a unit fusion element that satisfies a predetermined fusion processing criterion among the unit fusion elements generated for each of the first divided patent set and the second divided patent set (SF13). do. An example of a preset fusion processing criterion may be a difference operation between unit fusion elements corresponding to the first divided patent set and unit fusion elements corresponding to the second divided patent set. Another example of the preset fusion processing criterion may be a difference operation between unit fusion elements corresponding to the i th divided patent set and unit fusion elements corresponding to the i + 1 th divided patent set.

FIG. 21 is an information processing method of the patent information system 1000 related to the forward citation model or the compound citation splitting model of FIGS. 17 to 18. The patent information system 1000 divides the generated or received at least one patent set by applying at least one division processing criterion to generate a first divided patent set and at least one second divided patent set (SF21). )do.

Subsequently, the patent information system 1000 includes (E) at least one step of patent documents included in the first and second divided patent sets for each of the first and second divided patent sets. A first cited split patent set and at least one second cited split patent set composed of cited patent documents are generated (SF22). When generating the cited split patent set, two things may be considered.

The first is the question of the depth of citation or the citation stage. The citation depth (= citation step) means that the c cited patent documents have a citation depth of 1 when the self patent includes information on n parent patents. For every patent document ni, the grand parent patents included in the ni patent document have a citation depth of 2. The larger the citation depth, the larger the size of the relevant citation patent set, but the more likely it is to include a number of weakly related patents. The citation depth is processed by the fusion processing reference information unit 930 and is determined by a user's selection or a default value of the patent information system 1000. The default value is preferably 1 to 2, more preferably 1.

Second is whether to include latent cited patents. A latent citation is a patent that is not cited in a self-patent document, but which consists of patent documents whose filing date is earlier than the self-patent document among the patent documents cited by a post-patent document than a self-patent document citing a patent document cited by the self-patent document. Say a set of documents. The latent cited patent includes 1) obtaining a cited patent document of a self-patent document, 2) obtaining a cited patent document based on the self-patent document based on the cited patent document, and 3) applying for the self-patent document based on the cited patent document Acquisition of cited patent documents, and 4) removal of patent documents included in 1) of patent documents obtained in 3). At this time, when obtaining the potential cited patent of the divided patent set, it is required to go through the process of 1) to 4) for each individual patent document in the divided patent set. That is, 1) to 4) is processed in units of individual patent documents, and 1) to 4) should not be processed in units of sets. The inclusion of the latent cited patent is processed by the fusion processing reference information unit 930 and is determined by a user's selection or a default value of the patent information system 1000. The default value can be non-inclusive.

Subsequently, the patent information system 1000 includes (F) first citation of a predetermined unit fusion element corresponding to the patent document in patent documents included in the first and second cited divided patent sets. Generated based on the divided patent set and the second cited divided patent set (SF23), and (G) a predetermined fusion processing criterion among the unit fusion elements generated by the first cited divided patent set and the second cited divided patent set Selecting unit fusion elements are screened (SF24). The information processing for each of the divided sets is used as it is for the information processing for each of the cited divided patent sets.

An example of the preset fusion processing criteria may be a difference operation between unit fusion elements corresponding to the first citation split patent set and unit fusion elements corresponding to the second citation split patent set. Another example of the preset fusion processing criteria may be a difference operation between unit fusion elements corresponding to the i th cited split patent set and unit fusion elements corresponding to the i + 1 th cited split patent set.

Next, an information processing method of the patent information system 1000 will be described with reference to FIG. 22. The patent information system 1000 divides (H) the generated or obtained at least one patent set by applying at least one division processing criterion to generate a first divided patent set and at least one second divided patent set (SF31). (I) the first divided patent set and the second divided patent set, wherein the first divided patent set comprising at least one cited patent document of the patent document included in the first divided patent set and the second divided patent set Generate a single cited split patent set and at least one second cited split patent set (SF32), and (J) correspond to the patent document in patent documents included in the first split patent set and the first cited split patent set A predetermined unit fusion element is generated for each of the first divided patent set and the first cited divided patent set (SF33).

Subsequently, the patent information system 1000 selects the unit fusion element (K34) that satisfies a predetermined fusion processing criterion among the unit fusion elements (SF34). An example of a preset fusion processing criterion may be a difference operation between unit fusion elements corresponding to the first cited split patent set and unit fusion elements corresponding to the first split patent set. Another example of a preset fusion processing criterion may be a difference operation between unit fusion elements corresponding to the first divided patent set and unit fusion elements corresponding to the first cited divided patent set.

On the other hand, the information processing of step (J) and step (K) has been processed for the first divided patent set and the first cited divided patent set, but the i-divided patent set and the i-th cited divided patent Of course, it can be processed to the set, the i-divided patent set, the j-th cited split patent set may be processed as a matter of course.

Next, an information processing method of the patent information system 1000 will be described with reference to FIG. 23.

The patent information system 1000 divides the at least one patent set generated or obtained by applying at least one split processing criterion to generate a first split patent set and at least one second split patent set (SF). )and,

(M) a first citation splitting comprising one or more cited patent documents of patent documents included in the first split patent set and the second split patent set for each of the first split patent set and the second split patent set; Generate (SF) a patent set and at least one second cited split patent set, and (N) the first split patent set, the second split patent set, the first cited split patent set, and the second cited split patent set In the patent document included in the set, a predetermined unit fusion element corresponding to the patent document is generated for each of the first divided patent set, the second divided patent set, the first cited divided patent set, and the second cited divided patent set. And (O) selecting unit fusion elements that meet predetermined fusion processing criteria among the unit fusion elements. Examples of the preset convergence processing criteria may include unit fusion elements corresponding to the first divided patent set, unit fusion elements corresponding to the second divided patent set, and unit fusion elements corresponding to the first cited divided patent set. And a set operation of a preset order of two or more selected from unit fusion elements corresponding to the second cited divided patent set. The unit fusion elements corresponding to the set form a set, and the set may be a unit fusion element set, and the operation of the set unit may be processed.

The unit fusion elements that satisfy the predetermined fusion processing criteria of the patent information system 1000 may be selected as follows.

The first is to accept the unit fusion element set as a result of the difference operation.

A second unit fusion element set composed of unit fusion elements corresponding to the first divided patent set, and a second unit fusion element set composed of unit fusion elements corresponding to the at least one second divided patent set And a first cited unit fusion element set composed of unit fusion elements corresponding to the first cited divided patent set, and a second cited unit composed of unit fusion elements corresponding to the at least one second cited divided patent set. For the unit fusion elements constituting the set of fusion elements, only unit fusion elements for which a predetermined function value whose frequency is an independent variable such as frequency and increase rate of the frequency are more than a predetermined difference are selected.

The present invention can be used in various industries related to technical information, such as patent information industry, technology analysis and consulting industry, R & D support business, information analysis industry.

100: data part 1000: patent information system
110: patent data section 111: patent specification file section
112: Patent DB 113: Patent Classification DB
114: other patent data DB 114-1: Catchword DB
114-2: IndextoUSPC DB 120: non-patent data part
121: non-patent file unit 122: non-patent DB
123: company information DB 124: other non-patent data DB
130: key keyword DB 131: technology keyword DB
131-1: Technical keyword air pair DB 132: Product keyword DB
132-1: Product Keyword Air Pair DB 133: Syntax Keyword DB
133-1: Keyword Syntax Air Pair DB 133-2: Syntax Keyword Air Pair DB
133-3: Syntax by pattern DB 140: Patent classification metadata DB
150 applicant DB 160: rule data portion
170: language data unit 180: other DB
190: purpose-specific data unit 200: data processing unit
210: key keyword generator 211: keyword extractor
212: technical keyword generator 213: product keyword generator
214: syntax keyword generator 220: classification metadata generator
223: Core keyword stratification unit 224: Patent automatic classification unit
230: purpose-specific data generation unit 231: statistical analysis data generation unit
232: citation analysis data generation unit 233: discovery analysis data generation unit
300: search processing unit 3000: linkage system
400: analysis information generation unit 400: analysis information generation unit
410: analysis data acquisition unit 410: special analysis information generation unit
420: specialized target discovery unit 421: analysis index DB
421: Similar patent set discovery unit 422: Nich technology area discovery unit
423: Initial signal detection unit 424: Open innovation target detection unit
425: future technology discovery unit 430: analysis item selection unit
440: option processing unit 441: data limited option processing unit
442: display option processing unit 443: other option processing unit
450: analysis execution unit 451: analysis command syntax DB
460: analysis result reporting unit 461: table generation unit
462: chart generator 463: diagram generator
464: Report generation unit 470: Document set call unit
490: infinite analysis provider 500: evaluation information generation unit
5000: user computer 510: patent evaluation information generation unit
511: Applicant evaluation information generation unit 512: Inventor evaluation information generation unit
513: agent evaluation information generation unit 514: document set evaluation information generation unit
520: key keyword evaluation information generation unit 600: special analysis information generation unit
610: comparison analysis unit 611: comparison item selection unit
612: display comparison result selection unit 620: what if analysis unit
621: set operation unit 622: simulation unit
630: analysis of the inclusion / consultation 630: analysis of the inclusion / consultation
640: association analysis unit 641: association target selection unit
642: homogeneous association analysis unit 643: heterogeneous association analysis unit
644: association result display selection unit 650: risk / opportunity analysis unit
651: risk / opportunity pattern selection unit 652: risk / opportunity factor management unit
660: 2X2 analyzer 661: indicator selection unit
700: specialized target discovery unit 710: similar patent set discovery unit
710: similar patent set generation unit 711: similar patent set generation data generation unit
711-1: Core keyword generation unit 711-2: Upper patent classification code extraction unit
711-3: Citation / cited patent document extraction unit 711-4: Weighting unit
711-4a: key keyword weighting unit 711-4b: patent classification code weighting unit
711-4c: Citation / cited depth weighting unit 712: Similar patent set generation unit
713: document set selection unit 714: similar patent set providing unit
715: Document similarity evaluation information generation unit 716: Invalid reference detection unit
720: initial signal detector 721: initial signal pattern
722: Initial signal pattern measurement unit 723: Complex data analysis unit
724: Initial signal determination unit 724-1: Initial signal pattern reference unit
725: initial signal evaluation information generation unit 726: initial signal discovery batch processing unit
726: initial signal discovery batch processing unit 730: niche technology region discovery unit
731: document set selection unit 732: network diagram comparison unit
733: n-dimensional matrix selection unit 734: prior patent display unit
735: niche technology area evaluation information generation unit 740: open innovation target discovery unit
741: source selector 742: target attribute designator
743: source-target analysis unit 744: evaluation information generation unit for open innovation
750: future technology discovery unit 751: leading target discovery unit
752: lead target designator 753: lead target analyzer
754: future technology applied product analysis unit 755: future technology evaluation information generation unit
760: patent strategy discovery unit 800: additional service provider
810: member management unit 811: member data unit
811a: Member document DB 811b: Member mapping information DB
811c: member billing DB 812: member directory generation unit
813: Member flag generator 814: Member mapping information generator
814a: member keyword mapping information generator 814b: member applicant mapping information generator
820: platform service provider 821: web service provider
821a: Web service account manager 821b: Web service interface manager
830: community service provider 840: e-commerce service provider
841: e-commerce target DB 842: e-commerce processing unit
843: open market generation unit 850: billing processing unit
860: translation processing unit 870: comment processing unit
871: comment receiving unit 872: comment storage unit
873: identity determination unit 874: comment delivery unit
900: converged information processing unit 9000: wired and wireless network
910: convergence processing collection acquisition unit 920: convergence processing target set generation unit
921: set divider 921-1: divide processing reference provider
921-2: Division processing criteria acquisition part 921-3: Division processing criteria DB
922: citation patent set acquisition part 922-1: forward citation patent set acquisition part
923: collection set information acquisition unit 930: convergence processing reference information unit
931: Fusion processing criteria providing unit 932: Fusion processing criteria acquisition unit
933: Fusion processing criteria generation unit 934: Fusion processing criteria policy DB
940: fusion pattern processing unit 941: keyword fusion pattern processing unit
942: Patent classification fusion pattern processing unit 943: Hybrid fusion pattern processing unit
944: comparison information generation unit 945: comparison information processing unit
950: Fusion pattern analyzer 951: Fusion pattern visualization unit
952: Fusion pattern UI unit 960: Fusion pattern information batch generation unit
961: fusion pattern information batch generation policy unit 962: fusion pattern information DB
970: unit fusion element generator 971: core keyword unit fusion element generator
972: Patent classification unit fusion element generation unit 973: Hybrid fusion element generation unit
974: index unit fusion element generation unit 992-2: backward citation patent set acquisition unit

Claims (10)

In a patent information system, the patent information system
(A) dividing the generated or received at least one patent set by applying at least one split processing criterion to generate a first split patent set and at least one second split patent set;
(B) a first citation splitting comprising one or more cited patent documents of patent documents included in the first split patent set and the second split patent set for each of the first split patent set and the second split patent set; Generating a patent set and at least one second cited split patent set;
(C) In the patent documents included in the first cited divided patent set and the second cited divided patent set, a predetermined unit fusion element corresponding to the patent document is set to the first cited divided patent set and the second cited divided patent. Generating for each set; And
(D) selecting a unit fusion element that satisfies a predetermined fusion processing criterion among the unit fusion elements generated for each of the first and second cited divided patent sets;
The unit fusion element is to be generated including any one or more of a patent classification, a keyword extracted from the patent document, a keyword processed the extracted keyword,
Step (C) is performed on a patent document including two or more patent classifications based on at least one or more types of patent classifications of patent documents included in the first and second cited divided patent sets. Or patent documents including two or more keywords extracted or processed from patent documents included in the first cited split patent set and the second cited split patent set. How to produce.
The method of claim 1,
The unit fusion element is at least one selected from a patent classification combination consisting of two or more patent classifications, a key keyword combination consisting of two or more keywords, and an index combination corresponding to a patent classification,
Wherein the patent classification combination and the index combination are generated based on two or more patent classifications included in the patent document.
The method of claim 2,
The fusion processing criterion is that the unit fusion element is present only in one predetermined patent set, or 2) the frequency of the unit fusion element for each of the divided patent sets is equal to or greater than a predetermined difference, or 3) the divided patent. The frequency of the unit fusion element for each set is equal to or greater than a preset increase rate, or 4) the frequency of the unit fusion element for each of the divided patent sets is greater than or equal to a preset increase acceleration, or 5) the unit fusion element for each of the divided patent sets A method of generating fusion information, characterized in that the frequency of establishes a predetermined function relationship or a predetermined condition relationship.
The method of claim 2,
When the unit fusion element is an index combination, the method for generating the index combination may include: 1) an index using an index corresponding to the original patent classification only by the original patent classification explicitly indicated in the patent document included in each partition set. And a method of generating a combination, and 2) referring to the patent classification system to which the original patent classification belongs, with respect to the explicitly listed original patent classification, and including an index corresponding to a preset upper patent classification of the original patent classification. The method of generating convergence information, characterized in that any one method selected from among the methods for generating the index combination is used.
The method of claim 1,
The divisional processing criteria are generated by any one or any combination of two or more of an issuer country, a time range, an applicant or an owner or assignee, an inventor, a patent classification, a patent subject classification, and a predetermined classification attribute included in the patent document.
The time that is the basis of the time range is to use any one or any combination of two or more of the priority date, application date, publication date and registration date of the patent document,
The patent classification is any one of IPC, USPC, FT, FI and ECLA, or any combination of two or more,
The patent subject classification is any one or more of the subject classification generated using any one or more of the index of the IPC, the index of the USPC or the IPC, USPC, FT, FI and ECLA,
The preset classification attribute is any one or more of a classification attribute for the applicant, the owner or the assignee, a classification attribute for each patent classification constituting the patent classification, a classification attribute for the inventors, or a classification attribute for the patent subject classification. Fusion information generation method, characterized in that.
The method of claim 2,
When the unit fusion element is a patent classification combination, the method of generating the patent classification combination may include 1) generating the patent classification combination only by the original patent classification of the patent documents included in each of the divided sets; and 2) the original patent classification. Regarding the patent classification system to which the original patent classification belongs, with respect to any one method selected from among the method for generating the patent classification combination by including a predetermined upper patent classification of the original patent classification is used. Fusion information generation method characterized in that.
The method of claim 6,
A first method for generating nCr (r is less than or equal to n and greater than 1) patent classification combinations when the original patent classification is n (n is an integer),
When there are n original patent classifications (n is an integer), with reference to patent classification system information to which the n original patent classifications belong to each of the n original patent classifications, the preset upper patent for each original patent classification M up to the classification (m is an integer, m is greater than or equal to n) generates the top patent classifications, and the n original patent classifications and the m top patent classifications are (n + m) Cr (r (( n + m) less than or equal to and greater than 1)
Inventing any one or more of the third method of generating the patent classification combination by applying a weight of a predetermined criterion to the patent classification combination generated in the first method or the second method. How to generate fusion information.
The method of claim 1,
And displaying the selected unit fusion element in a preset format.
The preset format is any one of a tabular format, a chart format, a graph or a network diagram format.
The method of claim 8,
When the unit fusion element is displayed, at least one of a patent classification code, title information corresponding to the patent classification code, and an index corresponding to the patent classification code is displayed, or a keyword is displayed. How to generate fusion information.
The method of claim 1,
The at least one patent set generated or obtained is a cited patent set consisting of a first-generated patent set, at least one cited patent document of a patent document included in the first-generated patent set, and a patent document included in the first-generated patent set. A patent made up of a cited patent set consisting of one or more of the cited patent documents of the patent document included in any one or more patent sets of the family patent sets in at least one country of the patent documents included in the first-generated patent set. Fusion information generation method characterized in that the set.
KR1020100023932A 2010-03-17 2010-03-17 Method on producing information on fusion information of technology using patent data KR20110104816A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020100023932A KR20110104816A (en) 2010-03-17 2010-03-17 Method on producing information on fusion information of technology using patent data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020100023932A KR20110104816A (en) 2010-03-17 2010-03-17 Method on producing information on fusion information of technology using patent data

Publications (1)

Publication Number Publication Date
KR20110104816A true KR20110104816A (en) 2011-09-23

Family

ID=44955425

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020100023932A KR20110104816A (en) 2010-03-17 2010-03-17 Method on producing information on fusion information of technology using patent data

Country Status (1)

Country Link
KR (1) KR20110104816A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014061864A1 (en) * 2012-10-15 2014-04-24 ㈜광개토연구소 Method and system for providing analysis information for patent having technological convergence

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014061864A1 (en) * 2012-10-15 2014-04-24 ㈜광개토연구소 Method and system for providing analysis information for patent having technological convergence

Similar Documents

Publication Publication Date Title
Nentwig et al. A survey of current link discovery frameworks
CN101595476B (en) System, method and computer program for client definition information architecture
JP2024027087A (en) Standard medical term management system and method based on general model
Zaki et al. BioCarian: search engine for exploratory searches in heterogeneous biological databases
Senger Assessment of the significance of patent-derived information for the early identification of compound–target interaction hypotheses
KR101610938B1 (en) Method and System on Producing Information on Unoccupied Fusion Candidate Information Using Patent Data
KR20120033934A (en) Method and system on processing information on patent litigation through multi-dimensional patent litigation expectation model
KR20110104816A (en) Method on producing information on fusion information of technology using patent data
KR20110104815A (en) Method and system on producing information on fusion information using patent data
KR20110104813A (en) Method and system on producing information on fusion information using patent data
Hidders et al. A formal model of dataflow repositories
LaFond et al. Furthering a comprehensive SETI bibliography
KR20110104814A (en) Method producing information on technology fusion information of citation information using divided patent set data
Herr et al. The NIH visual browser: An interactive visualization of biomedical research
Catalá-López et al. A cross-sectional analysis identified co-authorship networks and scientific collaboration on reporting guidelines for health research
Locatelli et al. Exploring BIM and NLP applications: A scientometric approach
KR101199827B1 (en) System, Media, Program and Method on Making Patent Litigation Prediction Model
KR20200028711A (en) System of discovering technology seller candidates based on patent information and providing reason why the seller candidates are likely to have good technology to buy
KR20200028713A (en) System of discovering technology buyer candidates based on patent information and providing reason why the buyer candidates are likely to buy
KR101255181B1 (en) System, Media and Method on Making Patent Litigation Prediction Model
KR101271115B1 (en) System, Media, Program and Method on Generating Patent Risk Hedging Prediction Information
Oliveira et al. Towards a Data Catalog for Data Analytics
Miñarro-Giménez et al. A semantic query interface for the OGO platform
AU2012244384B2 (en) System, method, and computer program for a consumer defined information architecture
Cruz et al. Agreement Maker: a Visual Tool for Aligning Heterogeneous Ontologies

Legal Events

Date Code Title Description
WITN Withdrawal due to no request for examination