WO2009075554A2 - Patent information providing method and system - Google Patents

Patent information providing method and system

Info

Publication number
WO2009075554A2
Authority
WO
WIPO (PCT)
Prior art keywords
information
patent classification
analysis
classification code
patent document
Prior art date
Application number
PCT/KR2008/007398
Other languages
English (en)
French (fr)
Other versions
WO2009075554A3 (en)
Inventor
Min Soo Kang
Original Assignee
Kwanggaeto Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kwanggaeto Co., Ltd.
Publication of WO2009075554A2
Publication of WO2009075554A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/18 Legal services
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2216/00 Indexing scheme relating to additional aspects of information retrieval not explicitly covered by G06F16/00 and subgroups
    • G06F2216/11 Patent retrieval

Definitions

  • The present invention relates to a method for providing patent information, and more particularly to a method and a system for providing patent information, which includes multi-dimensional patent analysis information.
  • Patent information has the characteristics of technological information, rights information, and management information, and is increasingly important in globalized international competition.
  • As technological information, patent information enables the understanding of technology development trends and of the technical ideas applied to individual patents.
  • As rights information, patent information enables the understanding of the scope of each patent and the degree of domestic or foreign patent acquisition.
  • As management information, patent information enables the understanding of the technology development trends of competing companies.
  • Patent information can be obtained through various methods, representative examples of which include 1) a method using a patent information system provided by the patent office of each nation, 2) a method using a patent information system developed by a non-governmental company, and 3) a method using a patent information system developed by an individual company for its own purposes.
  • Representative examples of method 1) include the patent information systems provided by the Korean Intellectual Property Office (KIPO; www.kipo.go.kr) and the Korean Intellectual Property Rights Information Service (KIPRIS; www.kipris.or.kr), and representative examples of method 2) include www.delphion.com of the United States, www.patolis.co.jp of Japan, and www.wips.co.kr and www.wisdomain.com of the Republic of Korea.
  • Non-governmental companies in each nation, such as Thomson Scientific, which operates www.delphion.com, maintain their own patent information databases and provide search results in various ways in response to a search formula entered through a patent search engine.
  • Most major technologically advanced companies worldwide have constructed and operate their own internal patent search/management systems. However, it is difficult for an external user to access the internal patent search/management systems of those companies.
  • The exclusive patent management systems use on-line patent information service companies, such as www.delphion.com, which are accessible from outside, and collect and manage patent information through the platforms provided by those companies. Further, it is a worldwide trend that the inventions of most patent applications filed in the name of a company are made by inventors belonging to the company, and that those inventors collect domestic or foreign patent information in relation to their own inventions. This is because more than 70% of new technology information worldwide is disclosed in the form of patent information, in spite of the existence of the patent publication term, and because patent information contains ideas that are more commercially applicable than those in other information, including thesis information. However, many of these inventors are not experienced in patent search. Moreover, even for inventors experienced in patent search, it is convenient to access external patent information service providing companies, such as www.delphion.com, and obtain patent information from them.
  • A first object of the present invention is to provide various methods of pre-processing patent document information, a method of generating data or information necessary for patent analysis based on the pre-processed patent information, a method for performing various patent analyses using the generated information, and a patent information system therefor.
  • A second object of the present invention is to provide a method for automatically constructing company-based, applicant-based, inventor-based, agent-based, and patent classification code-based patent information systems, and the systems thereof.
  • A third object of the present invention is to provide a method and a system for information processing, which can provide monitoring, search, directory, and analysis services that automatically include the lower patent classification codes of a to-be-processed patent classification code.
  • A fourth object of the present invention is to provide a method and a patent information system for providing the patent information obtained by achieving the first to third objects to multiple users on line.
  • a method of providing patent information by a system providing patent information to a computer of a user including the steps of: obtaining selection information for a patent document set; specifying a to-be-processed patent document set based on the selection information; and generating processed patent information by performing at least one preset processing for the specified to-be-processed patent document set, wherein at least one patent classification code is used in at least one of the step of specifying the to-be-processed patent document set and the step of generating the processed patent information by performing said at least one preset processing, use of the patent classification code corresponds to processing of information including patent information of a first patent classification code and lower patent classification codes of the first patent classification code when the first patent classification code is given in at least one of the step of obtaining the selection information and the step of generating the processed patent information and corresponds to a preset level in a patent classification code system to which the first patent classification code belongs, and the patent classification code belongs to at least one of
  • the preset level is a dot level patent classification code
  • the dot level patent classification code includes a dot or dots in title information thereof, which is a description corresponding to the dot level patent classification code in the patent classification code system.
  • the dot level patent classification code includes a dot or dots in title information thereof, which is a description corresponding to the dot level patent classification code from among patent classification codes of a subgroup or below when the dot level patent classification code belongs to the IPC system, includes a dot or dots in title information thereof, which is a description corresponding to the dot level patent classification code from among patent classification codes of a subclass or below when the dot level patent classification code belongs to the USPC system, includes a dot or dots in title information thereof, which is a description corresponding to the dot level patent classification code from among patent classification codes of a subgroup or below when the dot level patent classification code belongs to the FI system of Japan, includes a dot or dots in title information thereof, which is a description corresponding
  • the selection information is obtained by at least one of: 1) an obtainment method of receiving input of the selection information through a search formula from the computer of the user; 2) an obtainment method of receiving selection information, which is selected from bibliographical information included in the preset patent document presented in the computer of the user and is transmitted from the computer of the user; 3) an obtainment method of receiving selection information on a particular patent document set selected from at least one patent document set generated in advance by the user; and 4) an obtainment method of receiving at least one preset condition from the system.
  • in the step of specifying the to-be-processed patent document set, when one of the obtainment methods 1) to 4) has been executed, at least one first patent classification code is included in the selection information during execution of the methods 1) to 4), and the first patent classification code is a dot level patent classification code, a patent document set including all lower patent classification codes of the dot level first patent classification code is specified as the to-be-processed patent document set.
  • a process of specifying the patent document set including all lower patent classification codes of the dot level first patent classification code as the to-be-processed patent document set includes the steps of: obtaining all lower patent classification codes of the dot level first patent classification code; and specifying the to-be-processed patent document set by using a search formula or query formula including the first patent classification code and all the lower patent classification codes thereof.
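The two steps above (collecting the lower codes, then building a query over the first code and all of them) can be sketched as follows. This is a minimal illustration, not the system's implementation: the `CHILDREN` map, the invented `X99Z` codes, the `lower_codes` and `build_query` helpers, and the `IPC="..."` query syntax are all assumptions made for the example.

```python
from typing import Dict, List

# Hypothetical parent -> direct-children map for a tiny, invented slice of a
# classification tree. A real system would load this from its code master data.
CHILDREN: Dict[str, List[str]] = {
    "X99Z 1/00": ["X99Z 1/02", "X99Z 1/10"],
    "X99Z 1/02": ["X99Z 1/04", "X99Z 1/06"],
}


def lower_codes(code: str, children: Dict[str, List[str]]) -> List[str]:
    """Return every code below `code`, depth-first, each parent before its children."""
    found: List[str] = []
    stack = list(reversed(children.get(code, [])))
    while stack:
        current = stack.pop()
        found.append(current)
        stack.extend(reversed(children.get(current, [])))
    return found


def build_query(code: str, children: Dict[str, List[str]], field: str = "IPC") -> str:
    """Build an OR query over the given code and all of its lower codes.

    The `field="value"` syntax is an assumed, illustrative query format.
    """
    codes = [code] + lower_codes(code, children)
    return " OR ".join(f'{field}="{c}"' for c in codes)


if __name__ == "__main__":
    print(build_query("X99Z 1/00", CHILDREN))
```

Expanding the code once, before the search, keeps the search engine itself unaware of the hierarchy; the alternative of storing ancestor codes on every document trades index size for simpler queries.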
  • individual patents of the patent document set include at least one patent granted a weight according to at least one preset weight granting policy, and the processed patent information is generated while reflecting the weight granted to the individual patent.
  • the weight granting policy includes calculation of at least one of 1) preset counting information and 2) preset calculation value information in relation to the individual patent
  • a counting value for the preset counting information includes at least one of 1) a number of families in at least one of a first nation and a second nation, 2) a number of claims, and 3) a cited-by number
  • the preset calculation value information includes at least one of 1) at least one evaluation score for an inventor of the patent, 2) at least one evaluation score for an applicant of the patent, 3) at least one preset calculation value for a patent classification code included in the patent of the inventor, 4) at least one preset calculation value for a patent classification code included in the patent of the applicant, 5) at least one preset calculation value reflecting financial information of the applicant, and 6) at least one preset calculation value for the patent classification code itself included in the patent.
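As a rough sketch of how a weight granting policy based on counting information might feed into processed patent information, the example below weights each patent by its family count, claim count, and cited-by count, then totals the weights per applicant. The source names these counted quantities but gives no formula, so the linear form, the `COEFFS` values, the `Patent` record, and the function names are all hypothetical.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass
class Patent:
    number: str          # hypothetical document identifier
    applicant: str
    family_count: int    # number of family members in the nations of interest
    claim_count: int
    cited_by_count: int


# Assumed coefficients for illustration only; the source does not specify them.
COEFFS = {"family": 0.5, "claims": 0.25, "cited_by": 1.0}


def weight(p: Patent) -> float:
    """Linear weight: a base of 1.0 per patent plus the scaled counting values."""
    return (
        1.0
        + COEFFS["family"] * p.family_count
        + COEFFS["claims"] * p.claim_count
        + COEFFS["cited_by"] * p.cited_by_count
    )


def weighted_totals(patents: Iterable[Patent]) -> Dict[str, float]:
    """Sum patent weights per applicant instead of counting each patent as 1."""
    totals: Dict[str, float] = defaultdict(float)
    for p in patents:
        totals[p.applicant] += weight(p)
    return dict(totals)
```

Calculation value information, such as evaluation scores for an inventor or applicant, could enter the same way as additional terms.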
  • processed patent information corresponding to the patent classification code information is provided together when the processed patent information is provided to the computer of the user, and the processed patent information provides processed patent information corresponding to at least one direct lower patent classification code of the patent classification code in a patent classification code system including the patent classification code when there is a request for the processed patent information corresponding to at least one direct lower patent classification code of the patent classification code from the computer of the user.
  • a method of providing patent information by a system providing patent information to a computer of a user including the steps of: obtaining selection information for a patent document set; specifying a to-be-processed patent document set based on the selection information; and generating processed patent information by performing at least one preset processing for the specified to-be-processed patent document set, wherein the selection information is obtained by at least one of: 1) an obtainment method of receiving input of the selection information through a search formula from the computer of the user; 2) an obtainment method of receiving selection information, which is selected from bibliographical information included in the preset patent document presented in the computer of the user and is transmitted from the computer of the user; 3) an obtainment method of receiving selection information on a particular patent document set selected from at least one patent document set generated in advance by the user; and 4) an obtainment method of receiving at least one preset condition from the system, the processed patent information includes at least one piece of multi-dimensional analysis information generated through at least
  • the analysis information is generated by applying at least one of 1) an analysis method for analysis of all patent classification codes and 2) an analysis method for analysis of only main patent classification codes, and the analysis method for analysis of all patent classification codes generates analysis information by applying at least one of a) an analysis method of granting a uniform weight to all patent classification codes, and b) an analysis method of granting different weights to main patent classification codes and sub-patent classification codes.
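The difference between the analysis methods above can be illustrated with a short sketch. It assumes each document is a dictionary with one `main` code and a list of `subs`; that layout, the function name, and the example weights are assumptions, not details from the source.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Union

Doc = Dict[str, Union[str, List[str]]]


def code_scores(
    docs: Iterable[Doc],
    mode: str = "all",
    main_weight: float = 1.0,
    sub_weight: float = 1.0,
) -> Dict[str, float]:
    """Score each classification code across a document set.

    mode="main": only main codes contribute.
    mode="all":  main and sub codes contribute; equal weights give the
                 uniform method, unequal weights give the differential one.
    """
    if mode not in ("all", "main"):
        raise ValueError("mode must be 'all' or 'main'")
    scores: Dict[str, float] = defaultdict(float)
    for doc in docs:
        scores[doc["main"]] += main_weight
        if mode == "all":
            for code in doc["subs"]:
                scores[code] += sub_weight
    return dict(scores)
```

For example, `code_scores(docs, mode="all", main_weight=1.0, sub_weight=0.5)` would count a sub code as half an occurrence of a main code.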
  • the analysis is an analysis of convergence between patent classification codes, which includes at least one of association analysis, frequent occurrence pattern analysis, correlation analysis, and grouping between at least two patent classification codes included in patent documents including at least two patent classification codes of an identical patent classification system from among the patent documents included in the to-be-processed patent document set. It is also preferred that, in the analysis of convergence between patent classification codes, when said at least two patent classification codes of the identical patent classification system are included in a single patent document and when said at least two patent classification codes are not highest patent classification codes or patent classification codes of a preset level, the analysis of convergence between patent classification codes is performed for each of said at least two patent classification codes together with higher patent classification codes thereof up to a preset level of each of said at least two patent classification codes.
  • the analysis of convergence includes at least one of 1) a convergence analysis between at least two technical keywords generated based on text information extracted from at least one preset field of one patent document, 2) a convergence analysis between co-inventors of a patent document having co-inventors, 3) a convergence analysis between co-applicants of a patent document having co-applicants, and 4) a convergence analysis between citation patent documents included in citation information of a single patent document, for individual patent documents of the to-be-processed patent document set, and the convergence analysis includes at least one of association analysis, frequent occurrence pattern analysis, correlation analysis, and grouping between at least two keywords, co-inventors, co-applicants, and co-citation patent documents included in the patent documents.
  • the method further includes the step of visualizing a result of the convergence analysis through a network diagram including at least two nodes, wherein the visualized result includes at least one of information on the nodes and information on lines interconnecting the nodes.
  • the information on the lines interconnecting the nodes includes numerical information
  • the method further includes the steps of: obtaining selection information on the numerical information or the lines from the computer of the user; extracting root patent document information corresponding to the numerical information from a patent information database including the to-be-processed patent document set; and transmitting the extracted root patent document information to the computer of the user.
  • the method further includes the steps of: obtaining change information of the to-be-processed patent document set from the computer of the user; and specifying the to-be-processed patent document set based on the obtained change information, wherein the obtaining of the change information includes: providing an interface for the obtaining of the change information to the computer of the user; and obtaining selection information of the user for the interface, the interface includes at least one of 1) time type selection interface, 2) time interval selection interface, 3) nation selection interface, 4) patent classification code selection interface, 5) citation specification interface, 6) family information-including selection interface, and 7) after-search selection interface, the time type selection interface includes a selection interface providing selection of at least one of application date, publication date, and registration date, the time interval selection interface includes a selection interface providing selection of at least one of year/month/week/day of each of a starting period and an ending period, the nation selection interface includes a selection interface providing selection of at least one between at least two nations, the patent classification code selection interface includes a selection interface providing selection of at least one between at
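A minimal sketch of the convergence analysis and network data described above: classification codes in each document are rolled up to a preset level, co-occurring pairs become the lines between nodes, the number of supporting documents is the numerical information on each line, and the supporting document list serves as the root patent document information returned on selection. Rolling up by truncating an IPC-style symbol to its four-character subclass is a simplifying assumption, as are the function names and the sample document numbers.

```python
from collections import defaultdict
from itertools import combinations
from typing import Dict, List, Tuple

Edge = Tuple[str, str]


def subclass(code: str) -> str:
    """Roll an IPC-style symbol up to its four-character subclass,
    e.g. 'H04B 7/26' -> 'H04B'. (Assumed roll-up rule for this sketch.)"""
    return code.replace(" ", "")[:4]


def cooccurrence_edges(docs: Dict[str, List[str]]) -> Dict[Edge, List[str]]:
    """Map each unordered pair of subclasses to the documents containing both."""
    edges: Dict[Edge, List[str]] = defaultdict(list)
    for number, codes in docs.items():
        rolled = sorted({subclass(c) for c in codes})
        for pair in combinations(rolled, 2):
            edges[pair].append(number)
    return dict(edges)


def edge_weights(edges: Dict[Edge, List[str]]) -> Dict[Edge, int]:
    """Numerical information for each line: how many documents support it."""
    return {pair: len(numbers) for pair, numbers in edges.items()}


if __name__ == "__main__":
    # Illustrative sample data; document numbers are invented.
    sample = {
        "D1": ["H04B 7/26", "H04L 12/28"],
        "D2": ["H04B 1/38", "H04L 29/06", "G06F 17/30"],
        "D3": ["H04B 7/00", "H04B 7/26"],
    }
    edges = cooccurrence_edges(sample)
    print(edge_weights(edges))
    print(edges[("H04B", "H04L")])  # root documents behind one line
```

Keeping the document list on each edge, rather than only the count, is what makes the drill-down from a number on the diagram back to its root documents a simple lookup.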
  • the analysis is a set-based citation analysis that treats at least two patent documents as one analysis unit
  • the set-based citation analysis includes at least one of 1) backward citation patent document set analysis, 2) forward citation patent document set analysis, and 3) citation occurrence self patent document set analysis, the analysis of 1) to 3) creates at least one preset multi-dimensional analysis information from the patent document set, which is an object of the analysis of 1) to 3)
  • a backward citation patent document set specified for the backward citation patent document set analysis includes a document set including citation information of at least one citation depth included in the to-be-processed patent document set
  • a forward citation patent document set specified for the forward citation patent document set analysis includes a later-filed document set including citation information of at least one citation depth citing the to-be-processed patent document set
  • a citation occurrence self patent document set specified for the citation occurrence self patent document set analysis includes a patent document set including patent documents cited at least one time from among the patent documents of the to
  • specification of the backward citation patent document set includes specification of the patent document set and an indirect backward citation document set of the patent document set and specification of the forward citation patent document set includes specification of the patent document set and an indirect forward citation document set of the patent document set
  • the indirect backward citation document set includes patent documents, which have application dates prior to application dates of to-be-processed patent documents of the to-be-processed patent document set and commonly cite patent documents cited by the to-be-processed patent documents
  • the indirect forward citation document set includes patent documents, which have application dates later than application dates of to-be-processed patent documents of the to-be-processed patent document set and commonly cite patent documents cited by the to-be-processed patent documents.
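The direct backward, forward, and self citation sets described above can be sketched over a simple citation map. This is an illustration under assumptions: the `cites` dictionary (document to the set of documents it cites), the function names, and the reading of the self set as the to-be-processed documents that are cited at least once are all choices made for the example. The indirect sets are not covered.

```python
from typing import Dict, Iterable, Set

Citations = Dict[str, Set[str]]


def backward_set(targets: Iterable[str], cites: Citations, depth: int = 1) -> Set[str]:
    """Documents cited by the target set, followed up to `depth` citation levels."""
    frontier: Set[str] = set(targets)
    seen: Set[str] = set()
    for _ in range(depth):
        frontier = {ref for doc in frontier for ref in cites.get(doc, ())} - seen
        seen |= frontier
    return seen


def invert(cites: Citations) -> Citations:
    """Turn a 'cites' map into a 'cited by' map."""
    cited_by: Citations = {}
    for doc, refs in cites.items():
        for ref in refs:
            cited_by.setdefault(ref, set()).add(doc)
    return cited_by


def forward_set(targets: Iterable[str], cites: Citations, depth: int = 1) -> Set[str]:
    """Documents citing the target set, followed up to `depth` citation levels."""
    return backward_set(targets, invert(cites), depth)


def self_cited_set(targets: Iterable[str], cites: Citations) -> Set[str]:
    """Target documents that are cited at least once by any document."""
    cited = {ref for refs in cites.values() for ref in refs}
    return set(targets) & cited
```

Each returned set can then be handed to the same multi-dimensional analysis used for any other document set, which is what treating a set as one analysis unit amounts to.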
  • the multi-dimensional analysis information is transmitted in a form of a table including numerical information to the computer of the user, and the method further includes the steps of: receiving selection information on the numerical information from the computer of the user; extracting root patent document information corresponding to the numerical information from a patent information database including the to-be-processed patent document set; and transmitting the extracted root patent document information to the computer of the user.
  • At least one of the step of specifying the to-be-processed patent document set and the step of generating the processed patent information by performing at least one preset processing uses at least one patent classification code
  • use of the patent classification code includes information processing of a first patent classification code together with lower patent classification codes of the first patent classification code when the first patent classification code is given in at least one of the step of receiving the selection information and the step of generating the processed patent information and corresponds to a preset level in a patent classification code system to which the first patent classification code belongs, and the patent classification code system includes at least one of the IPC, USPC, FI and FT of Japan, and ECLA.
  • the obtainment method of receiving selection information includes the steps of: providing at least one of applicant information including at least one applicant, patent classification code information including at least one patent classification code, inventor information including at least one inventor, and agent information including at least one agent to the computer of the user; obtaining selection information on at least one of the applicant information, the patent classification code information, the inventor information, and the agent information from the computer of the user; and generating a preset multi-dimensional analysis result by processing a patent document set specified by the selection information as a to-be-processed patent document set, and providing the generated preset multi-dimensional analysis result to the computer of the user.
  • the system generates at least one medium multi-dimensional analysis result table for easily and rapidly creating a multi-dimensional analysis result according to the applicants, the patent classification codes, the inventors, or the agents, and the method further includes the steps of: providing at least one of at least one preset applicant-based multi-dimensional analysis list or menu, at least one preset patent classification code-based multi-dimensional analysis list or menu, at least one preset inventor-based multi-dimensional analysis list or menu, and at least one preset agent-based multi-dimensional analysis list or menu to the computer of the user; obtaining selection information on the multi-dimensional analysis list or menu from the computer of the user; and when an analysis list of the obtained selection information is obtainable from the medium multi-dimensional analysis result table, generating an analysis result of the obtained selection information by referring to the medium multi-dimensional analysis result table and providing the generated analysis result to the computer of the user.
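One way to read the medium multi-dimensional analysis result table is as a pre-aggregated count table from which common analysis requests are answered without rescanning the documents. The sketch below follows that reading; the key layout (applicant, year, code), the record fields, and the function names are assumptions for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

Key = Tuple[str, int, str]  # (applicant, application year, classification code)


def build_medium_table(docs: Iterable[dict]) -> "Counter[Key]":
    """Pre-aggregate document counts once, keyed by applicant, year, and code."""
    table: "Counter[Key]" = Counter()
    for d in docs:
        table[(d["applicant"], d["year"], d["code"])] += 1
    return table


def applications_by_year(table: "Counter[Key]", applicant: str) -> Dict[int, int]:
    """Answer an applicant-based request from the medium table alone."""
    result: Counter = Counter()
    for (app, year, _code), n in table.items():
        if app == applicant:
            result[year] += n
    return dict(sorted(result.items()))


def applications_by_code(table: "Counter[Key]", applicant: str) -> Dict[str, int]:
    """Same table, different dimension: counts per classification code."""
    result: Counter = Counter()
    for (app, _year, code), n in table.items():
        if app == applicant:
            result[code] += n
    return dict(sorted(result.items()))
```

Requests that the table cannot answer, for example ones needing a field that was not part of the key, would fall back to processing the document set directly.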
  • a method of providing patent information by a system providing patent information to a computer of a user including the steps of: obtaining patent document information on at least one patent document published in a first nation; when more than one year has passed after an application date of the published patent document in the first nation and a date on which the patent document information is obtained, referring to at least one of 1) PCT application information of the published patent document, 2) second nation family information claiming a priority to a second nation based on the published patent document of the first nation, and 3) current status information of a second nation patent document corresponding to the second nation family information; determining if the published patent document corresponds to a first case in which the published patent document has no PCT application information and no second nation family information, a second case in which the published patent document has PCT application information but has no second nation family information, and the application date of the published patent document in the first nation corresponds to a date after passage of a preset term from the date on which the patent document
  • The present invention has the following effects. First, through various pre-processing operations, it is possible to efficiently construct a patent information database from patent-related root data, such as patent document information or patent classification code data.
  • FIG. 1 illustrates a construction of a patent information system according to an embodiment of the present invention
  • FIG. 2 illustrates a DB unit according to an embodiment of the present invention
  • FIG. 3 illustrates a patent document master DB according to an embodiment of the present invention
  • FIG. 4 illustrates a patent classification code master DB according to an embodiment of the present invention
  • FIG. 5 illustrates a subject master DB according to an embodiment of the present invention
  • FIG. 6 illustrates a multi-dimensional analysis operation result table DB according to an embodiment of the present invention
  • FIG. 7 illustrates a support DB unit and a secondary processing DB unit according to an embodiment of the present invention
  • FIG. 8 illustrates a pre-processing module according to an embodiment of the present invention
  • FIG. 9 illustrates a master DB generation module according to an embodiment of the present invention
  • FIG. 10 illustrates a citation information pre-processing module according to an embodiment of the present invention
  • FIG. 11 illustrates a patent information processing basic module according to an embodiment of the present invention
  • FIG. 12 illustrates a support module according to an embodiment of the present invention
  • FIG. 14 illustrates a patent intelligence module according to an embodiment of the present invention
  • FIG. 15 illustrates a directory generating module according to an embodiment of the present invention
  • FIG. 16 illustrates an operation of a patent classification code search module according to an embodiment of the present invention
  • FIG. 16 illustrates an analysis result for data of the number of applications of each year for each numerously-filed IPC of an IPC sub-class level filed in the Republic of Korea by Samsung Electronics Inc.
  • FIG. 18 illustrates an analysis result generated by a drill-down to H01L according to an embodiment of the present invention
  • FIG. 19 illustrates an analysis result generated by a drill-down to H01L 21/00 and lower patent codes thereof according to an embodiment of the present invention
  • FIG. 20 illustrates an analysis result for numerously-filed IPC-based each-year applications-number data of the IPC main group level by Samsung Electronics Inc. from among all applicants within a DB kept by a patent information system according to an embodiment of the present invention
  • FIG. 21 illustrates an analysis result for numerously-filed IPC-based each-year applications-number data of the IPC 1 dot sub-group level by Samsung Electronics Inc. from among all applicants within a DB kept by a patent information system according to an embodiment of the present invention
  • FIG. 22 illustrates an analysis result for numerously-filed IPC-based each-year applications-number data of the IPC sub-class level based on application documents filed in the US by Samsung Electronics Inc. from among all applicants within a DB kept by a patent information system according to an embodiment of the present invention
  • FIG. 23 illustrates an analysis result for numerously-filed IPC-based each-year applications-number data of the IPC sub-class level based on registered documents registered in the US by Samsung Electronics Inc. from among all applicants within a DB kept by a patent information system according to an embodiment of the present invention
  • FIG. 24 illustrates an analysis result for numerously-filed USPC-based each-year applications-number data of the USPC no dot (sub-class) level filed in the US by General Motors from among all applicants within a DB kept by a patent information system according to an embodiment of the present invention
  • FIG. 25 illustrates an analysis result for numerously-filed USPC-based each-year applications-number data of the USPC 1 dot level filed in the US by General Motors from among all applicants within a DB kept by a patent information system according to an embodiment of the present invention
  • FIG. 26 illustrates analysis and drill-down of total applications in relation to IPC H04B based on application documents filed in the Republic of Korea according to an embodiment of the present invention
  • FIG. 27 illustrates analysis and drill-down of total applications in relation to IPC H04B based on application documents filed in the US according to an embodiment of the present invention
  • FIG. 28 illustrates analysis of total applications of each numerously-filed company in relation to IPC H04B based on application documents filed in the Republic of Korea according to an embodiment of the present invention
  • FIG. 29 illustrates analysis of occupancy-based numerously-filed applicants in relation to IPC H04B based on application documents filed in the Republic of Korea according to an embodiment of the present invention
  • FIG. 30 illustrates analysis of activity index-based numerously-filed applicants in relation to IPC H04B based on application documents filed in the Republic of Korea according to an embodiment of the present invention
  • FIG. 31 illustrates total application amount analysis including a drill-down of IPC H04B and lower classifications thereof based on US application documents according to an embodiment of the present invention
  • FIG. 32 illustrates a representative competing company analysis based on the total amount of the Samsung Electronics among all the applicants in Korea in the DB of the patent information system of the present invention according to the exemplary embodiment of the present invention
  • FIG. 33 illustrates a representative competing applicant analysis for each numerously-filed patent classification code based on the total amount of the Samsung Electronics among all the applicants in Korea in the DB of the patent information system of the present invention according to the exemplary embodiment of the present invention
  • FIG. 34 illustrates a representative competing applicant analysis based on an IPC main group of the total amount of the Samsung Electronics among all the applicants in US in the DB of the patent information system of the present invention according to the exemplary embodiment of the present invention
  • FIG. 35 illustrates a numerously-filed inventor-based analysis result according to the year based on the total amount of the Samsung Electronics among all the applicants in Korea in the DB of the patent information system 1 of the present invention according to the exemplary embodiment of the present invention
  • FIG. 36 illustrates a numerously-filed inventor-based analysis result for each year based on the total amount of the Samsung Electronics among all the applicants in Korea in the DB of the patent information system of the present invention according to the exemplary embodiment of the present invention
  • FIG. 38 illustrates a year-based analysis result of citation total amount for the to-be-analyzed document set according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set;
  • FIG. 39 illustrates a year-based analysis result for the multiple-citing applicant according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set;
  • FIG. 40 illustrates a year-based analysis result for the multi-citing patent classification code (IPC main group level) according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set;
  • FIG. 41 illustrates a year-based analysis result reflecting the drill down of the multi-citing patent classification code (IPC main group level) according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set;
  • FIG. 42 illustrates a year-based analysis result of the multi- citing inventor according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of the
  • FIG. 43 illustrates a year-based analysis result of the most citing applicant with respect to the to-be-analyzed document set according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set;
  • FIG. 44 illustrates a year-based analysis result for the most citing inventor according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set;
  • FIG. 45 illustrates a year-based analysis result when the IPC in the analysis for the most citing technology according to the IPC main group is drilled down according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set;
  • FIG. 46 illustrates a year-based analysis result when the IPC in the analysis for the most citing cited technology of the USPC sub class (no dot, class direct subordinate) is drilled down according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of the Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set;
  • FIG. 47 illustrates a total amount reference analysis result and a chart of the analysis result generated by a chart generating module among a reporting module of the present invention for the total amount reference analysis result with respect to the to-be-analyzed document set in which, when all the US patent registrations in the name of the Samsung Electronics among all the applicants in the DB of the patent information system are considered as the reference document set, the entire forward citation document set serves as the to-be-analyzed document set;
  • FIG. 48 illustrates the analysis result of a year-based citation total amount of the often-cited inventor and a document list generated by a simplified analysis module that provides the simplified analysis result (document list, the number of applications/registrations of the most- filing applicant according to the year, the year-based number of applications/registrations according to the most-filing inventor, the number of applications/registrations of the most-filing technical field (IPC, USPC, FT) according to the year (including drill-down) for the document set corresponding to the specific number when the specific number represented in the analysis result is clicked, with respect to the to-be- analyzed document set in which, when all the US patent registrations in the name of the Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set;
  • FIG. 49 illustrates that the simplified analysis module according to the present invention also provides a function of the drill-down according to the most-filing technology field (IPC, USPC, FT
  • FIG. 50 illustrates a year-based analysis result of the citation total amount with respect to the to-be-analyzed document set, in which, when the numerously-filed IPC sub class unit document in all the US patent applications in the name of the Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set ;
  • FIG. 51 illustrates a hierarchical patent information service system, which is a sub-system of a patent information system according to an embodiment of the present invention
  • FIG. 52 illustrates an individual unit patent information system generating engine according to an embodiment of the present invention
  • FIG. 53 illustrates an internal construction of a patent information pre-processing module according to an embodiment of the present invention
  • FIG. 54 illustrates a weight pre-processing module according to an embodiment of the present invention
  • FIG. 55 illustrates a citation information pre-processing module according to an embodiment of the present invention
  • FIG. 56 illustrates a patent classification code pre-processing module 301-3-1 or 3500 according to an embodiment of the present invention
  • FIG. 57 illustrates a representative applicant name pre-processing module according to an embodiment of the present invention
  • FIG. 58 illustrates a representing phrase extracting pre-processing module according to an embodiment of the present invention
  • FIG. 59 illustrates a family information pre-processing module according to an embodiment of the present invention
  • FIG. 60 illustrates a multiple patent classification code correlation pre-processing module according to an embodiment of the present invention
  • FIG. 61 illustrates a structure of the patent classification code-based statistic pre-processing module according to an embodiment of the present invention
  • FIG. 62 illustrates a patent information intelligence module according to an embodiment of the present invention
  • FIG. 63 illustrates an analysis module according to an embodiment of the present invention
  • FIG. 64 illustrates a patent information system arrangement generation engine according to an embodiment of the present invention
  • FIG. 65 shows a configuration of an integration management module according to an embodiment of the present invention
  • FIG. 66 shows a process of generating one applicant name-based patent information service system from entire applicant name patent information database 2300, and generating an inventor name-based patent information service system according to each inventor included in a patent document set included in the applicant name-based patent information service system, according to an embodiment of the present invention
  • FIG. 67 shows a process of obtaining an applicant name list, generating applicant name-based patent information service systems corresponding to the number of applicants according to each list from entire applicant name-based patent information database 2300, and generating an inventor name-based patent information service system according to each inventor included in the patent document set included in each of the generated applicant name-based patent information service systems, according to an embodiment of the present invention
  • FIG. 68 shows a process of obtaining an applicant name list from an obtained document set, generating applicant name-based patent information service systems corresponding to the number of applicants according to each list from entire applicant name-based patent information database 2300, and generating an inventor name-based patent information service system according to each inventor included in the patent document set included in each of the generated applicant name-based patent information service systems, according to an embodiment of the present invention;
  • FIG. 69 shows an embodiment of a method, which, upon receiving an order requesting the generation of an applicant name-based patent information service system, generates a single applicant name-based patent information service system from the entire applicant name-based patent information database 2300, and generates an inventor name-based patent information service system for each inventor included in a patent document set belonging to the applicant name-based patent information service system;
  • FIG. 70 is a flowchart showing a method for generating an applicant name-based patent information service system in the unit of a nation, wherein the nature of the generation of the applicant name-based patent information service system in the unit of a nation is to generate the applicant name-based patent information service system and the inventor name-based patent information service system in the unit of a nation from applicant name-based patent information database 2300 in the unit of a nation, and it is thus possible to apply all of the methods as shown in FIGs. 66 to 69;
  • FIG. 71 shows an embodiment of a process for generating the applicant name-based patent information service system and the inventor name-based patent information service system in the unit of a nation by using family information
  • FIG. 72 shows an embodiment of a process for generating an applicant name-based patent information service system and an inventor name-based patent information service system in the unit of a nation by using a priority claim number
  • FIG. 73 shows an embodiment of a process for generating an applicant name-based patent information service system and an inventor name-based patent information service system in the unit of a nation by using family information
  • FIG. 74 shows an embodiment of a process for generating an applicant name-based patent information service system and an inventor name-based patent information service system in the unit of a nation by using family information and priority claim information;
  • FIG. 75 shows an embodiment of a process performed by both the applicant name-based patent information service system generating engine and the inventor name-based patent information service system generating engine when new documents are added;
  • FIG. 76 shows a process that an applicant name-based patent information service system administrator performs the CRUD for its managed document set, according to an embodiment of the present invention
  • FIG. 77 shows a process that an inventor name-based patent information service system administrator performs the CRUD for its managed document set, according to an embodiment of the present invention
  • FIG. 78 illustrates a method of processing the counting by the counting pre-processing module according to an embodiment of the present invention
  • FIG. 79 illustrates a method of processing the weight by the expense expenditure view weight pre-processing module 3311 according to an embodiment of the present invention
  • FIG. 80 illustrates a method of pre-processing the weight by the citation view weight pre-processing module 3313, which closely resembles the method used by the expense expenditure view weight pre-processing module, according to an embodiment of the present invention
  • FIG. 81 illustrates a method of processing weight by the dispute view weight pre-processing module 3315 according to an embodiment of the present invention
  • FIG. 82 illustrates a method of processing weight by the concentration rate view weight pre-processing module 3317 according to an embodiment of the present invention
  • FIG. 83 illustrates a method of processing the weight by the concentration rate view weight pre-processing module according to an embodiment of the present invention
  • FIG. 84 illustrates a method of processing weight by the applicant-based weight pre-processing module 3331 according to an embodiment of the present invention
  • FIG. 85 illustrates a method of processing weight by the inventor-based weight pre-processing module 3333 according to an embodiment of the present invention
  • FIG. 86 illustrates a method of processing weight by the agent-based weight pre-processing module 3335 according to an embodiment of the present invention
  • FIG. 87 illustrates a method of processing the family information by the family information pre-processing engine of the family information pre-processing module 3810 according to an embodiment of the present invention
  • FIG. 88 illustrates a method of citation information pre-processing that represents the citation number according to an embodiment of the present invention
  • FIG. 89 illustrates a method of obtaining the forward citation document information, that is, the information on a forward application document citing the specific document, and then storing the forward citation document information in the document information of the specific document, according to an embodiment of the present invention
  • FIG. 90 illustrates a method for pre-processing citation information according to an embodiment of the present invention
  • FIG. 91 illustrates a processing method of a patent classification code pre-processing engine for processing a modified patent classification code according to an embodiment of the present invention
  • FIG. 92 illustrates a method of generating a database of the modified patent classification codes by a hierarchical modified patent classification code generating module according to an embodiment of the present invention
  • FIG. 93 illustrates a method of generating a USPC tree structure from USPC patent classification codes according to an embodiment of the present invention
  • FIG. 94 illustrates a method of allocating USPC patent classification codes to modified patent classification codes corresponding to the USPC tree structure of FIG. 93 according to an embodiment of the present invention
  • FIG. 95 illustrates a modified tree structure including the modified patent classification codes according to an embodiment of the present invention, which has the same structure as the USPC tree structure generated from the USPC patent classification codes of FIG. 93;
  • FIG. 96 illustrates that the USPC patent classification codes of FIG. 93 can be generated by using an Index to U.S. Patent Classification file (a.k.a. Classification Index file);
  • FIG. 97 illustrates a patent classification code pre-processing method according to an embodiment of the present invention
  • FIG. 98 illustrates a method of determining a representative applicant name by a representative applicant name pre-processing module according to an embodiment of the present invention
  • FIG. 99 illustrates a representative applicant name determining method using a priority number according to an embodiment of the present invention
  • FIG. 100 is a flowchart showing an example of pre-processing of statistic values according to each patent classification code by the patent classification code-based statistic pre-processing module;
  • FIG. 101 illustrates a method of generating a statistic value, parameter or calculation value, which automatically includes lower patent classification codes of a given patent classification code according to an embodiment of the present invention
  • FIG. 103 illustrates a method of generating a patent classification code-based statistic value, parameter, or calculation value, which reflects backward citation information or forward citation information including lower patent classification codes, for a subset of a particular document set, according to an embodiment of the present invention
  • FIG. 104 illustrates a method of pre-processing and then comparing multiple patent classification codes for at least two document sets by the homogeneous multi-patent classification code correlation pre-processing module according to an embodiment of the present invention
  • FIG. 105 illustrates a representing phrase pre-processing method according to an embodiment of the present invention
  • FIG. 106 illustrates a representing phrase pre-processing method according to another embodiment of the present invention.
  • FIG. 107 illustrates a representing phrase pre-processing method according to another embodiment of the present invention.
  • FIG. 108 illustrates an analysis index obtaining method according to an embodiment of the present invention
  • FIG. 109 illustrates a method of obtaining a to-be-analyzed patent document set according to an embodiment of the present invention
  • FIG. 110 illustrates a patent trend analysis according to an embodiment of the present invention
  • FIG. 111 illustrates a simplified analysis according to an embodiment of the present invention
  • FIG. 112 illustrates a citation analysis according to an embodiment of the present invention
  • FIG. 113 illustrates a cited-by analysis according to an embodiment of the present invention
  • FIG. 114 illustrates an examination citation analysis according to an embodiment of the present invention
  • FIG. 115 illustrates a multiple patent classification code analysis according to an embodiment of the present invention.
  • FIG. 116 illustrates a multiple patent classification code analysis according to another embodiment of the present invention.
  • FIG. 117 illustrates a multiple patent classification code analysis according to another embodiment of the present invention.
  • FIG. 118 illustrates an individual unit patent information system multi-stage grouping module operating method according to another embodiment of the present invention.
  • FIG. 119 illustrates a patent information system arrangement generating method according to another embodiment of the present invention.
  • FIG. 120 illustrates a screen implementing an applicant-based patent information system according to an embodiment of the present invention, which shows a list of the top 500 Korean applicants in order of the number of their applications, wherein the Republic of Korea has been selected in the nation tab, and the top 500 tab has been selected from among its lower tabs: top 500, stock market-listed company, KOSDAQ-listed company, numerously-filed company, and entire companies;
  • FIG. 121 illustrates a screen implementing an applicant-based patent information system according to another embodiment of the present invention, which shows a list of the top 500 US applicants in order of the number of their applications, wherein US has been selected in the nation tab, and the top 500 tab has been selected from among its lower tabs: top 500, NYSE-listed company, NASDAQ-listed company, numerously-filed company, and entire companies;
  • FIG. 122 illustrates a screen implementing an applicant-based patent information system according to another embodiment of the present invention, which shows a list of the top 500 Japanese applicants in order of the number of their applications, wherein Japan has been selected in the nation tab, and the top 500 tab has been selected from among its lower tabs: top 500, JP1-listed company, JP2-listed company, JPM (MOTHERS-listed company), JASDAQ-listed company, and entire companies;
  • FIG. 123 illustrates a screen implementing an applicant-based patent information system according to another embodiment of the present invention, which shows a list of the top 500 European applicants in order of the number of their applications, wherein Europe has been selected in the nation tab, and the top 500 tab has been selected from among its lower tabs: top 500, UK1 (London stock market), AIM (London stock market), OVERSEAS LISTED, EURONEXT, and entire companies;
  • FIG. 124 shows one implemented embodiment obtained when a U.S. patent tab is selected from a country selection of a patent list of a patent portfolio inside a patent information system of 3COM, which appears when 3COM (number 6), which is one of the companies registered on the NASDAQ, has been selected;
  • FIG. 125 shows a screen on which the inventor list of 3COM is displayed when U.S. has been selected on the country tab after clicking the inventor list menu in FIG. 124;
  • FIG. 126 shows a screen on which a patent document list relating to an inventor is displayed when Aldous Stepha.. (numeral 9) has been selected from the inventor list of FIG. 125;
  • FIG. 127 shows an implemented embodiment of the each-year application number of each IPC at the IPC sub-class level based on the total number of applications, which is shown when the statistic analysis tab has been selected, the technical area SA (systematic analysis) menu in FIG. 124 has been selected, and U.S. has been selected in the nation tab; a simplified analysis screen, which is shown when drill-down is performed and a specific cell value is selected, is shown as a pop-up;
  • FIG. 128 shows an embodiment showing information about a competitive applicant based on the each-year application number of each USPC at a USPC sub-class level based on the total number of applications, which is shown when the statistic analysis tab in FIG. 124 has been selected, the competitive company SA (systematic analysis) menu has been selected, and U.S. has been selected from the nation tab;
  • FIG. 129 is a view showing an embodiment of an each-year analysis result regarding a numerously-citing applicant when a backward citing document set of the present invention is regarded as a to-be-analyzed patent document set in a case where the entire application document set of 3COM, which is shown when, with respect to 3COM, the statistic analysis tab has been selected, the U.S. citation SA (systematic analysis) menu has been selected, and U.S. has been selected from the nation tab in FIG. 124, has been regarded as the citation reference document set;
  • FIG. 131 is a view showing an embodiment of a method for generating processed patent information about a patent document set by automatically including information about low rank patent classification
  • FIG. 132 is a view showing an embodiment of a method for generating processed patent information including multi-dimensional analysis when a patent document set has been selected
  • FIG. 133 is a view showing an embodiment of a method for performing convergence analysis when a patent document set has been selected
  • FIG. 134 is a view showing an embodiment of a method for providing patent document information corresponding to a value in a case where the value has been selected
  • FIG. 135 is a view showing an embodiment of a method for providing multi-dimensional analysis according to each applicant, patent classification code, inventor, and agent;
  • FIG. 136 is a view showing an embodiment of a method for determining if the patent searched in the first nation can be available in the second nation by using family information;
  • FIG. 137 is a flowchart showing an embodiment of a method for determining the availability possibility in the second nation, in which JP is designated as the first nation, and KR, US, and EU are designated as the second nations with respect to the patent document of numeral 1 in the method of FIG. 136;
  • FIG. 138 shows an embodiment of a search result list view obtained when the methods suggested in FIGs. 136 and 137 have been performed, wherein availability information in an interval phase according to each nation has been included in each item of the search result list;
  • FIG. 139 shows an embodiment of a monitoring service/directory service method of the present invention, in which a technology tree is configured by using a multi-stage hierarchical structure according to the user, and a large variety of information including a search formula is included in the technology tree;
  • FIG. 140 illustrates an application of the method of FIG. 139 to a product according to an embodiment of the present invention
  • FIG. 141 illustrates an application of the method of FIG. 139 to a company system of a competing company according to an embodiment of the present invention
  • FIG. 142 illustrates a network diagram obtained by implementing a convergence analysis for inventors according to an embodiment of the present invention
  • FIG. 143 illustrates a network diagram obtained by implementing a convergence analysis on technological keywords and representing phrase according to an embodiment of the present invention.
  • FIGs. 144 to 145 show examples of an interface used for implementing the patent information system of the present invention, in which the interface includes many interfaces relating to option selection.
  • PREMISE INFORMATION
  • The patent office of every nation issues gazettes disclosing patent applications or registered patents that satisfy predetermined conditions.
  • The disclosed patent information basically includes bibliographical information and information relating to a patent text.
  • The bibliographical information indispensably includes information on the nation issuing the patent document, the title of the invention, information on the applicant/patentee (each including an assignee, as used herein), information on the inventor, information on the patent classification code (at least one code selected from IPC, USPC, FI, FT, and ECLA codes, as used herein), information on various dates such as the application date, and information on various numbers such as the application number.
  • The bibliographical information selectively includes abstract information, information on a representative drawing when the patent includes at least one drawing, information on references (which includes at least one of reference information on a prior art disclosed by the applicant, a document searched or examined by the examiner, and information on the patent classification code), information on a priority, information on the examiner, etc.
  • The bibliographical information may further include information on the abstract, and a representative claim or claim 1.
  • The patent text contains information on the technical contents of the patent.
  • The text of a typical patent document includes information on the title of the invention, information on the configuration of the invention, information on the claims, and information on at least one of the effect of the invention, industrial applicability, the technical subject to be solved by the invention, the description of drawings, and the prior art. Further, the patent text necessarily or selectively includes information on drawings according to the type of the inventive technology or the selection of the inventor.
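  • The split between indispensable and selective bibliographical items described above can be pictured as a record type with required and optional fields. The following is only a minimal sketch: the class name, field names, and the sample values are hypothetical illustrations and are not taken from the patent information system itself.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Hypothetical field names for the items the text calls "indispensable".
REQUIRED_FIELDS = (
    "issuing_nation", "title", "applicants", "inventors",
    "classification_codes", "dates", "numbers",
)


@dataclass
class BibliographicRecord:
    # Indispensable items
    issuing_nation: str
    title: str
    applicants: List[str]                      # applicant/patentee, incl. assignee
    inventors: List[str]
    classification_codes: Dict[str, List[str]]  # system (IPC, USPC, ...) -> codes
    dates: Dict[str, str]                      # e.g. {"application": "..."}
    numbers: Dict[str, str]                    # e.g. {"application": "..."}
    # Selective items
    abstract: Optional[str] = None
    representative_drawing: Optional[str] = None
    references: List[str] = field(default_factory=list)
    priorities: List[str] = field(default_factory=list)
    examiner: Optional[str] = None

    def missing_required(self) -> List[str]:
        """Return the names of indispensable items that are empty."""
        return [name for name in REQUIRED_FIELDS if not getattr(self, name)]


# Hypothetical sample record; the dates are deliberately left empty.
record = BibliographicRecord(
    issuing_nation="KR",
    title="Example magnet",
    applicants=["Example Co."],
    inventors=["Example Inventor"],
    classification_codes={"IPC": ["H01F 1/01"]},
    dates={},
    numbers={"application": "EXAMPLE-0001"},
)
print(record.missing_required())  # ['dates']
```

  • Keeping the selective items optional lets a record be stored even when a gazette omits, for example, the examiner or the representative drawing, while the check above flags a record that lacks an indispensable item.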
  • Each patent document is assigned at least one patent classification code according to at least one patent classification code system.
  • Each document issued by the nation is such as Republic of Korea,
  • Each classification code corresponds one-to-one to a description (title) of the corresponding patent classification code.
  • One or more dots may be prefixed to the title information of a classification code below a predetermined depth.
  • The following example shows the hierarchy of the patent classification code system, the title information, and the dot(s) placed before the title information to indicate its relative position in the classification system.
  • Section: H electricity; Class: H01 basic electrical element
  • Sub-class: H01F magnet; main group: H01F 1/00 magnet or magnetic body characterized by its magnetic material; 1-dot sub-group: 1/01 . those made from inorganic materials
  • the patent classification codes corresponding to all the sub-groups have a structure of "sub-class" + "number/number", and at least one dot is attached to each of the titles corresponding to most of these patent classification codes (the dot or dots are attached to most of the titles except for the title of number/00, which is usually a code belonging to the main group).
  • the smaller the number of dots attached to a title the higher the concept of the title relative to its surrounding in the technical classification.
  • the larger the number of dots attached to a title the lower/the more detailed the concept of the title relative to its surrounding.
  • the IPC has a multi-stage hierarchical structure (tree structure).
  • the USPC, FT, and ECLA also have a multi-stage hierarchical structure.
  • the multi-stage hierarchical structure of each patent classification code can be processed into a database.
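The dot convention above can be turned into explicit parent links. The following is a minimal sketch, assuming the codes arrive in scheme order together with their dot counts; the function name `parents_from_dots` is hypothetical, and the dot counts for H01F 1/03, 1/032 and 1/04 are inferred from the ancestry described for H01F1/04 elsewhere in this description.

```python
from typing import Dict, List, Optional, Tuple

# (code, number of dots before the title), listed in classification-scheme order.
# Dot counts are assumptions inferred from the H01F1/04 ancestry in the text.
ENTRIES: List[Tuple[str, int]] = [
    ("H01F1/00", 0),   # main group: no dot
    ("H01F1/01", 1),
    ("H01F1/03", 2),
    ("H01F1/032", 3),
    ("H01F1/04", 4),
]


def parents_from_dots(entries: List[Tuple[str, int]]) -> Dict[str, Optional[str]]:
    """Map each code to its direct higher code using only the dot counts."""
    parent: Dict[str, Optional[str]] = {}
    open_ancestors: List[Tuple[int, str]] = []  # (dots, code) of the current path
    for code, dots in entries:
        # Close every entry that is at the same depth or deeper than this one.
        while open_ancestors and open_ancestors[-1][0] >= dots:
            open_ancestors.pop()
        parent[code] = open_ancestors[-1][1] if open_ancestors else None
        open_ancestors.append((dots, code))
    return parent


PARENT = parents_from_dots(ENTRIES)
```

A table of such (code, parent) pairs is one possible way to store the hierarchy in a database; the description itself does not prescribe this layout.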
  • the patent information system 1 according to the present invention is connected to a user computer 300 through a wire/wireless network 200.
  • the user computer 300 includes a computer used by a human, such as a PC, a notebook, a wire/wireless communication terminal, or a game console, and any computer connected to the patent information system 1, such as a third server other than the patent information system 1, a server of a system or an organ, or a third program module.
  • the patent information system 1 includes a DB unit 20 relating to various types of data such as patent data, a pre-processing module 30 for various pre-processing, a patent information processing basic module 40 for processing and analysis of patent information, a support module 50 (including a subscriber information processing module 501, a multi-national language processing module 502, and a translation module 503) for various supports such as subscriber management and policy management, a patent intelligence module 60 relating to high quality analysis of patent information (since the patent intelligence module usually generates high quality analysis information mainly through patent analysis, it is also named a patent analysis intelligence module or, briefly, an analysis module, all of which have the same meaning as used herein), and a hierarchically united patent information service system for generating a patent information service assembly in the unit of an applicant, inventor, or agent.
  • the patent information system 1 includes six elements.
  • the first element is various pre-processing modules, which perform various pre-processings of an obtained patent document according to purposes thereof.
  • the second element is the patent information processing basic module 40, which includes modules for performing: 1) search function, 2) analysis function, 3) monitoring function, 4) analysis function, 5) patent document set obtainment function, and 6) various multi-stage directory generating functions and a reporting function.
  • the third element is the patent analysis intelligence module 60, which generates various types of in-depth patent analysis information including 1) reference analysis and comparison analysis.
  • the fourth element is a patent information service system generating module, which generates a patent information system 1 according to the applicant, inventor, agent, and patent classification code.
  • the fifth element is the support module 50, which performs various supporting functions for application of the inventive idea in various environments of the network 200, including 1) subscriber information processing, 2) multi-national language processing, 3) translation processing, and 4) web service processing.
  • the sixth element is various databases, which include:
  • the DB unit 20 includes a DBMS 201 for managing the DBs, a patent document master DB 202 synthetically storing patent documents, a patent classification code master DB 203 storing information on the patent classification code, a subject master DB 204 storing information on the applicant, inventor, company, etc., an operation result table DB 205 storing operation results for multi-dimensional analysis, a support DB unit 206 storing information on subscribers and various policies or options, and a secondary processing DB unit 207 storing results of processing of various data.
  • the menu DB includes marking methods for the menus according to each nation or each language.
  • the patent information system 1 of the present invention reads locale information in a browser of an accessing user, extracts a menu of a nation/language corresponding to the extracted locale information from the menu DB, generates a screen including the menu made by the extracted language, and transmits the generated screen to the user.
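The locale-based menu selection can be sketched as follows. This is only an illustration under stated assumptions: the locale is taken from an HTTP `Accept-Language` value, the menu DB is a plain dictionary, and the names `MENU_DB` and `pick_language` are hypothetical and do not appear in the description.

```python
from typing import Dict, List, Tuple

# Hypothetical menu DB: menu labels keyed by language.
MENU_DB: Dict[str, Dict[str, str]] = {
    "en": {"search": "Search", "analysis": "Analysis"},
    "ko": {"search": "검색", "analysis": "분석"},
}


def pick_language(accept_language: str, default: str = "en") -> str:
    """Choose the best supported menu language from an Accept-Language value."""
    prefs: List[Tuple[float, str]] = []
    for part in accept_language.split(","):
        piece = part.strip()
        if not piece:
            continue
        tag, _, params = piece.partition(";")
        quality = 1.0  # a tag without an explicit q-value has weight 1
        params = params.strip()
        if params.startswith("q="):
            try:
                quality = float(params[2:])
            except ValueError:
                quality = 0.0
        prefs.append((quality, tag.strip().lower()))
    # sorted() is stable, so equally weighted tags keep the browser's order.
    for _, tag in sorted(prefs, key=lambda pref: -pref[0]):
        language = tag.split("-")[0]
        if language in MENU_DB:
            return language
    return default


def menu_for(accept_language: str) -> Dict[str, str]:
    """Return the menu labels for the language matching the user's locale."""
    return MENU_DB[pick_language(accept_language)]
```

For example, `menu_for("ko-KR,ko;q=0.9,en;q=0.8")` would return the Korean labels, and an unsupported locale falls back to the default language.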
  • the pre-processing modules 30 may include at least one of: 1) a root patent document processing module 301-1; 2) a counting pre-processing module 3100; 3) a patent classification code-based statistic pre-processing module 3200; 4) a weight pre-processing module 3300; 5) a citation information pre-processing module 3400; 6) a patent classification code pre-processing module 301-3-1 or 3500; 7) a representative applicant pre-processing module 301-4-1-1 or 3600; 8) a representing phrase extraction pre-processing module 3700; 9) a family information pre-processing module 3800; 10) a multiple patent classification code relationship pre-processing module 3900; 11) a rightful party change information pre-processing module 302; and 12) an administration processing information obtaining module 303.
  • the root patent document processing module 301-1 processes obtained patent document data (XML, SGML or another type) according to each nation (a patent document of a PCT application is published by the WIPO, which is treated as a nation although it is not one).
  • the root patent document processing module 301-1 performs predetermined processings of the obtained patent document data, and then works the obtained patent document data into modified patent document data of a format processible by the patent information system 1 of the present invention.
  • the predetermined processings may include: 1) error filtering 2) patterned error correction 3) manual error correction 4) conversion of SGML document data to XML document data; and 5) standard format conversion.
  • the root patent document processing module 301-1 by passing the obtained patent document data through at least one predetermined error check filter 301-1-1, determines if the obtained patent document data satisfies a format that can be treated by the patent information system 1 of the present invention.
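The error check filter step can be pictured as a chain of small checks. The sketch below is an assumption-laden illustration: the description does not specify the filters, so the two checks shown (well-formed XML, presence of a few indispensable bibliographical fields) and the tag names in `REQUIRED_TAGS` are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Callable, List, Optional

# Hypothetical tag names for a few indispensable bibliographical fields.
REQUIRED_TAGS = ("nation", "title", "applicant", "application_number")


def well_formed(raw: str) -> Optional[str]:
    """Return an error message if the data is not parseable XML, else None."""
    try:
        ET.fromstring(raw)
    except ET.ParseError as exc:
        return f"not well-formed XML: {exc}"
    return None


def has_required_fields(raw: str) -> Optional[str]:
    """Return an error message naming missing or empty required fields."""
    try:
        root = ET.fromstring(raw)
    except ET.ParseError:
        return None  # already reported by well_formed
    missing = [tag for tag in REQUIRED_TAGS
               if not (root.findtext(f".//{tag}") or "").strip()]
    return f"missing fields: {', '.join(missing)}" if missing else None


FILTERS: List[Callable[[str], Optional[str]]] = [well_formed, has_required_fields]


def check_document(raw: str) -> List[str]:
    """Pass the data through every filter and collect the error messages."""
    errors = []
    for error_filter in FILTERS:
        message = error_filter(raw)
        if message is not None:
            errors.append(message)
    return errors
```

A document with an empty error list would be treated as processible; the others would go on to patterned or manual error correction.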
  • Patent document master DB generation module 301-2: the patent document master DB generation module 301-2 of the present invention generates a patent document master DB 202 based on the patent document data generated by the root patent document processing module 301-1 and information processed by various pre-processing modules of the present invention.
  • Sub-modules included in the patent document master DB generation module 301-2 of the present invention include: 1) a bibliographical DB generation module 301-2-1 for generating a root Bibliographical DB 202-1-1, a processed Bibliographical DB 202-1-2, a translation Bibliographical DB 202-1-3, and a representative drawing DB 202-1-4; 2) a document DB generation module 301-2-2 for generating a patent document DB 202-2-1, a translated patent document DB 202-2-2, and a processed patent document DB 202-2-3; 3) a rightful party change DB generation module 301-2-3 for generating a US assignee change DB 202-3-1 and a right establishment change DB 202-3-2; 4) a family information DB generation module 301-2-4 for generating a nation-based patent status DB 202-4-1 and a nation-based family DB 202-4-2; 5) a citation information DB generation module 301-2-5 for generating an applicant-citation information DB 202-5-1 and an examiner-citation
  • the Bibliographical DB includes root Bibliographical information obtainable directly from the patent document data, processed Bibliographical information obtained by processing the patent document data itself or the entire patent document data set, translated Bibliographical information for those items of the root Bibliographical information and the processed Bibliographical information whose language requires translation, and other information.
  • the Bibliographical DB is generated by the Bibliographical DB generation module of the present invention.
  • the document DB includes patent document data itself, translated document data, a part or all of which has been translated into various languages, and processed patent document data converted from the patent document data or translated patent document data into various types of files in formats of pdf, doc, tiff, html, etc.
  • the patent document data, the translated patent document data, the processed patent document data may exist in the form of a patent document DB, a translated patent document DB, and a processed patent document DB, respectively.
  • the document DB is generated by the document DB generation module of the present invention.
  • the rightful party change information DB includes an assignee change information DB for the US patent documents, and may include information on names of patentees and patent applicants managed by the patent offices of various countries, and information on establishment registration and cancellation registration of various licensees.
  • the rightful party change information DB is generated by the rightful party change DB generation module of the present invention.
  • the family information DB includes nation-based family data that includes information relating to the family patent document for the respective nations related to an individual patent document, and nation-based patent status data that includes information on status (current status of a nation-based patent application, whether the right has been obtained, etc.) in the nations of the respective family patent documents.
  • the family information DB is generated by a family information DB generation module.
  • the citation information DB includes applicant citation information DB generated based on citation information cited by the applicant and examiner-citation information DB generated based on information on other patent documents cited for examining by the examiner for the specific patent application (mainly, the document cited for denying the novelty and inventive step of the patent application in the procedure of the examination).
  • the citation information DB is generated by the citation information DB generation module.
  • the administration processing information DB includes data generated based on administration processing information of the respective patent documents generated by the patent offices of the respective nations with respect to the patent documents.
  • the administration processing information DB is generated by the administration information DB generation module.
  • the Bibliographical DB generation module obtains processed information of the patent document data and/or entire patent document data processed by various pre-processing modules and generates processed Bibliographical data, in which DB including the processed Bibliographical data is referred to as the processed Bibliographical DB.
  • the processed Bibliographical particulars include 1) various counting information, 2) various calculations and evaluations information, etc.
  • the various calculation and evaluation information includes 1) a calculation value of various patent indexes, such as the total number of applications, the total number of registrations, occupancy rate, concentration rate, activity rate, etc.
  • Patent classification code master DB generation module 301-3: the pre-processing module of the present invention includes the patent classification code master DB generation module 301-3, and the DB generated by the patent classification code master DB generation module 301-3 includes:
  • root patent classification code DB 203-1 storing raw data of the various patent classification codes obtained from the patent offices of the respective nations or patent root data sources;
  • modified patent classification code DB 203-2 storing various patent classification codes modified to suit their use in the patent information system 1 of the present invention, which is generated by a modified patent classification code generation module 301-3-3;
  • total higher patent classification code set DB 203-3 collecting and storing all the higher patent classification codes of any patent classification code, which is generated by a total higher patent classification code set generation module 301-3-1-1;
  • sub-patent classification code set DB 203-4 storing information on direct sub or every sub-patent classification code of any patent classification code, which is generated by a patent classification code set generation module 301-3-1-2, in which the patent classification code set generation module 301-3-1-2 includes a direct sub-patent classification code set generation module 301-3-1-2-1 for generating only the direct sub-patent classification code and a total sub-patent classification code set generation module 301-3-1-2-2 for generating all the sub-patent classification code sets;
  • patent classification code tree table DB 203-5 storing a patent classification code system in a tree structure, which is generated by a patent classification code tree table generation module (not shown);
  • total higher patent classification code table DB 203-6 storing the total higher patent classification code of any patent classification code according to the level, which is generated by a total higher patent classification code set generation module 301-3-1-1.
  • the renewal is processed by a patent classification renewal module 301-3-5 of the present invention, and the preset module relating to the renewed patent classification code performs the preset processing.
  • the patent classification code having a multi-stage hierarchical structure has a correspondence correlation between the patent classification code and the title information of the patent classification code, a plurality of examples of which has been suggested in the specification of the present invention.
  • a search engine that is identical to the search module 401 in the present invention
  • an extension/wildcard in the DBMS 201 can be used for obtaining information on all the sub-patent classification codes of the given patent classification code in the hierarchical structure. For example, if the extension/wildcard (e.g. ?) is attached to H01F in a search word or query word, the information on all of the sub-patent classifications of H01F can be obtained.
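A wildcard query of this kind can be sketched with an in-memory SQLite database. Note the assumptions: SQL `LIKE` uses `%` rather than the `?` sign mentioned above, and the table name `ipc`, the column `code`, and the sample rows are hypothetical.

```python
import sqlite3
from typing import List

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE ipc (code TEXT PRIMARY KEY)")
conn.executemany(
    "INSERT INTO ipc (code) VALUES (?)",
    [("H01F1/00",), ("H01F1/01",), ("H01F1/03",), ("H01G4/00",), ("H04B7/00",)],
)


def codes_under(prefix: str) -> List[str]:
    """Return every stored code that starts with the given prefix."""
    rows = conn.execute(
        "SELECT code FROM ipc WHERE code LIKE ? ORDER BY code",
        (prefix + "%",),  # trailing wildcard: any continuation of the prefix
    )
    return [row[0] for row in rows]
```

Prefix matching works down to the sub-class or main-group prefix, where the code string itself carries the hierarchy. Below the main group the hierarchy is carried by the dots in the titles, not by the code string, so a prefix alone cannot find the sub-groups of a sub-group.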
  • a patent document corresponding to a sub-patent classification code also corresponds, by the hierarchical structure attribute, to every higher patent classification code. That is, if the patent classification code H01F1/04 is assigned to a specific patent document, the patent document corresponds not only to H01F1/032 but also to H01F1/03, H01F1/01, and H01F1/00.
  • the patent information relating to H01F1/03 includes not only patent information relating to H01F1/03, but also the patent information relating to every sub-patent classification code of H01F1/03 according to the hierarchical structure system of the patent classification code. That is, the patent information relating to H01F1/03 must include the patent information relating to not only H01F1/032 and H01F1/04, but also every other sub-patent classification code of H01F1/03 according to the patent classification code system.
  • FIG. 14 shows an example to which such an inventive idea of the present invention is not applied. As shown in FIG. 14, there are more documents corresponding to A61B 3/02 that is a subordinate classification of A61B 3/00, than those corresponding to A61B 3/00.
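The correspondence between a document and every higher code of its assigned code can be sketched as below. The parent links follow the H01F1/04 ancestry given above; the document identifiers and their assignments are invented for illustration, and the function names are hypothetical.

```python
from typing import Dict, List, Optional, Set

# Direct higher code of each code, per the H01F1/04 example in the text.
PARENT: Dict[str, Optional[str]] = {
    "H01F1/00": None,
    "H01F1/01": "H01F1/00",
    "H01F1/03": "H01F1/01",
    "H01F1/032": "H01F1/03",
    "H01F1/04": "H01F1/032",
}

# Hypothetical documents and the codes assigned to them.
DOC_CODES: Dict[str, List[str]] = {
    "doc-A": ["H01F1/04"],
    "doc-B": ["H01F1/03"],
    "doc-C": ["H01F1/00"],
}


def with_ancestors(code: str) -> List[str]:
    """Return the code itself followed by every higher code up to the main group."""
    chain = []
    node: Optional[str] = code
    while node is not None:
        chain.append(node)
        node = PARENT[node]
    return chain


def documents_for(code: str) -> Set[str]:
    """Documents assigned the code itself or any of its sub-codes."""
    return {
        doc
        for doc, assigned in DOC_CODES.items()
        if any(code in with_ancestors(c) for c in assigned)
    }
```

Under these sample assignments, a query on H01F1/03 returns both doc-A (assigned H01F1/04) and doc-B, so a higher code can never show fewer documents than one of its sub-codes.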
  • the patent classification code system (tree structure) is searched with respect to the given patent classification code by a scheme, such as depth first search, to obtain information on every sub-patent classification code.
  • the information obtaining procedure may be performed for every query/search.
  • every sub-patent classification code for every patent classification code or patent classification code including the dot(s) attached to title information is first obtained and stored, and then the information on every stored sub-patent classification code can be obtained upon query/search.
  • searching the nodes of a tree structure by a scheme such as depth first search, in order to find the direct sub-nodes and/or every sub-node of each node, is basic computer science, so a description thereof is omitted.
  • storing the information on the obtained direct sub-nodes and/or every sub-node in correspondence with the specific node is likewise a basic concept, so a description thereof is omitted.
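For reference, the depth first search over the classification tree can be sketched as follows, using the H04B7/15 sub-tree that this description enumerates. The dictionary layout and the function name `every_sub_code` are illustrative assumptions.

```python
from typing import Dict, List

# Direct sub-codes of each code in the H04B7/15 sub-tree.
CHILDREN: Dict[str, List[str]] = {
    "H04B7/15": ["H04B7/155", "H04B7/185", "H04B7/204"],
    "H04B7/155": ["H04B7/165", "H04B7/17"],
    "H04B7/185": ["H04B7/19", "H04B7/195"],
    "H04B7/204": ["H04B7/208", "H04B7/212", "H04B7/216"],
}


def every_sub_code(code: str) -> List[str]:
    """Return every sub-code of `code` in depth-first (pre-order) order."""
    found: List[str] = []
    # Push children reversed so the first child is popped (visited) first.
    stack = list(reversed(CHILDREN.get(code, [])))
    while stack:
        node = stack.pop()
        found.append(node)
        stack.extend(reversed(CHILDREN.get(node, [])))
    return found


# Storing the result per node gives the "obtain first, look up later" variant.
SUB_CODE_SETS = {code: every_sub_code(code) for code in CHILDREN}
```

Running the search at query time and precomputing `SUB_CODE_SETS` trade query latency against storage and the need to regenerate the table when the classification is renewed.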
  • the marking of the patent classification code is modified while the given patent classification code system (tree structure) is maintained, so as to generate the modified patent classification code, and the first and second methods can be performed for the modified patent classification code. Meanwhile, if a scheme for modifying the marking with reference to the patent classification code system is appropriately selected, the modified marking can support range search/range query.
  • Exemplary description based on H04B 7/00: the tree structure between the patent classification codes is described using, as an exemplary embodiment, the patent classification code corresponding to main group H04B 7/00 (entitled a wireless transmitting system) based on the IPC 7th edition and its sub-patent classification codes; however, such a method and description are not limited to this embodiment, but are identically or equally applied to the entire IPC. Further, such a method and description are also identically or equally applied to other patent classification code systems (e.g. USPC, FI, FT, or ECLA) in which the dot structure attached to the title indicates the higher and subordinate hierarchy.
  • the main group H04B 7/00, the sub-patent classification codes thereof, and the titles thereof have the following configuration based on the IPC 7th edition.
  • H04B 7/00 Wireless transmitting system, i.e. wireless transmitting system using electromagnetic fields
  • H04B 7/005 . control of transmission; equalization
  • H04B 7/01 . phase shift reduction
  • H04B 7/015 . echo effect reduction
  • H04B 7/02 . diversity system
  • H04B 7/185 ... space based or airborne stations
  • H04B 7/19 .... earth synchronization stations
  • H04B 7/195 .... asynchronization stations
  • H04B 7/216 .... code division or spread-spectrum multiple access
  • H04B 7/22 . scattering wave system
  • H04B 7/24 . those for communication between at least two stations
  • H04B 7/26 .. at least one station capable of moving
  • The tree hierarchical structure of the above codes based on the above dot(s) can be more easily represented as shown in Table 1 below. [Table 1]
  • a total higher patent classification code table generation module receives the patent classification code from the patent classification code master DB so as to generate the higher patent classification code table as shown in Table 2 of the present invention. It is preferred that the total higher patent classification code table generation module regenerates the higher patent classification code table whenever the information on the patent classification code is renewed.
  • the total higher patent classification code table generation module can generate the total higher patent classification code table having an identical formation to the above table, even for the modified patent classification code DB of the present invention.
  • IPC_ID represents an ID of the specific IPC
  • S represents a section level
  • C represents a class level
  • SC represents a sub-class
  • MG represents a main group
  • 1dot represents a level in which 1 dot is attached to the title information
  • 2dots represent a level in which 2 dots are attached to the title information
  • 3dots represent a level in which 3 dots are attached to the title information (so n dots represent a level in which n dots are attached to the title information), and self represents IPC itself.
  • the results are H, H04, H04B, H04B7/00, H04B7/02, H04B7/04, H04B7/06, and H04B7/06 (the last two being the 3dots level and the code itself), and when the results are organized according to the field structure, the data represented in IPC_ID 69970 can be generated.
  • every higher patent classification code of the given patent classification code is obtained by 1) searching for the given patent classification code in a self field and obtaining the patent classification code according to each level, while going up to the higher level along a row of the searched patent classification code.
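One row of the total higher patent classification code table can be assembled as in the sketch below, which reproduces the H04B7/06 row described above. The field names follow the S/C/SC/MG/1dot/2dots/3dots/self legend; the input structure `TREE`, the helper names, and the fixed string slicing of the IPC code are assumptions made for illustration.

```python
from typing import Dict, Optional, Tuple

# code -> (number of dots, direct higher code); a fragment of main group H04B 7/00.
TREE: Dict[str, Tuple[int, Optional[str]]] = {
    "H04B7/00": (0, None),
    "H04B7/02": (1, "H04B7/00"),
    "H04B7/04": (2, "H04B7/02"),
    "H04B7/06": (3, "H04B7/04"),
}


def level_name(dots: int) -> str:
    """Field name for a dot level: '1dot', '2dots', '3dots', ..."""
    return "1dot" if dots == 1 else f"{dots}dots"


def higher_code_row(code: str, max_dots: int = 3) -> Dict[str, Optional[str]]:
    """Build the table row holding every higher code of `code`, level by level."""
    # Section, class and sub-class are fixed-length prefixes of an IPC code.
    row: Dict[str, Optional[str]] = {
        "S": code[0], "C": code[:3], "SC": code[:4], "MG": None,
    }
    for dots in range(1, max_dots + 1):
        row[level_name(dots)] = None
    # Walk upward from the code, filling the field of each level passed.
    node: Optional[str] = code
    while node is not None:
        dots, parent = TREE[node]
        row["MG" if dots == 0 else level_name(dots)] = node
        node = parent
    row["self"] = code
    return row
```

Reading a row from left to right then yields every higher code of the entry in its `self` field, which is the lookup the preceding passage describes.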
  • the collected and databased total higher patent classification code of the given patent classification code is referred to as total higher patent classification code set DB of the present invention
  • every extracted and databased sub-patent classification code of the specific patent classification code by utilizing the total higher patent classification code table is referred to as sub-patent classification code set DB of the present invention.
  • the searched patent classification codes are H04B7/15, H04B7/155, H04B7/165, H04B7/17, H04B7/185, H04B7/19, H04B7/195, H04B7/204, H04B7/208, H04B7/212, and H04B7/216, which correspond to the specific patent classification code itself and the sub-patent classification codes of H04B7/15. If the specific patent classification code itself is excluded from the searched patent classification codes, the remaining codes are, of course, every sub-patent classification code of the specific patent classification code.
  • the information on the specific patent classification code itself and its every sub-patent classification code can be generated as sub-patent classification code set DB.
  • when every sub-patent classification code of the specific patent classification code is stored, it is possible to generate the total sub-patent classification code set DB. Further, it is possible to count the number of sub-patent classification codes, either including the specific patent classification code itself (here 11) or excluding it (here 10).
  • the counting is processed by a counting pre-processing module of the present invention.
  • the resultants are H04B7/155, H04B7/185, and H04B7/204.
  • when the direct sub-patent classification codes of the specific patent classification code are stored, the direct sub-patent classification code set DB can be generated.
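The two lookups above (every sub-code, and only the direct sub-codes) can both be answered from rows of the total higher patent classification code table. The sketch below keeps only the 2dots/3dots/4dots/self fields for the H04B7/15 sub-tree; the dot levels are inferred from the listing in this description, and the row layout and function names are hypothetical.

```python
from typing import Dict, List, Optional

LEVELS = ["2dots", "3dots", "4dots"]

Row = Dict[str, Optional[str]]


def _row(two: str, three: Optional[str], four: Optional[str]) -> Row:
    """Build one table row; `self` is the deepest filled level."""
    return {"2dots": two, "3dots": three, "4dots": four, "self": four or three or two}


ROWS: List[Row] = [
    _row("H04B7/15", None, None),
    _row("H04B7/15", "H04B7/155", None),
    _row("H04B7/15", "H04B7/155", "H04B7/165"),
    _row("H04B7/15", "H04B7/155", "H04B7/17"),
    _row("H04B7/15", "H04B7/185", None),
    _row("H04B7/15", "H04B7/185", "H04B7/19"),
    _row("H04B7/15", "H04B7/185", "H04B7/195"),
    _row("H04B7/15", "H04B7/204", None),
    _row("H04B7/15", "H04B7/204", "H04B7/208"),
    _row("H04B7/15", "H04B7/204", "H04B7/212"),
    _row("H04B7/15", "H04B7/204", "H04B7/216"),
]


def code_and_every_sub_code(code: str) -> List[str]:
    """Rows in which `code` appears at any level: the code itself plus all sub-codes."""
    return [str(r["self"]) for r in ROWS if any(r[level] == code for level in LEVELS)]


def direct_sub_codes(code: str) -> List[str]:
    """Rows whose own level is exactly one dot deeper than `code`."""
    level = next(lv for lv in LEVELS if any(r[lv] == code for r in ROWS))
    index = LEVELS.index(level)
    if index + 1 == len(LEVELS):
        return []  # no deeper level is stored in this fragment
    child_level = LEVELS[index + 1]
    return [str(r["self"]) for r in ROWS
            if r[level] == code and r[child_level] == r["self"]]
```

Counting the first result gives the 11 codes (10 without the code itself) mentioned above, and the second gives the three direct sub-codes.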
  • stage-based open means to open the IPC directory structure according to the stage (section, class, sub-class, main group, 1 dot sub-group, 2 dot sub-group, 3 dot sub-group, 4 dot sub-group, ... n dot sub-group), in which only the patent classification code included in the corresponding stage is represented.
  • the value (search result, statistic value, calculation value, analysis value, etc.) corresponding to the respective stages is preferably the value considering both the specific patent classification code and its every sub-patent classification code.
  • the represented value of H04B7/155 must include the values of H04B7/155 itself and of H04B7/165 and H04B7/17, which are subordinate to H04B7/155.
  • the value of only the specific patent classification code can be represented in the specific case (e.g. user's selection).
  • the afore-mentioned table structure (the table used for searching the sub-patent classification code of the specific patent classification code itself in the three steps) can be utilized for the search described later, obtaining necessary information with an SQL search command, statistic analysis processing, etc., and further if the value of the respective patent classification codes according to the preset rule is previously calculated for analysis convenience, etc., the table structure can be utilized for view, materialized view, (multi-dimensional analysis operation result table) cube, etc.
  • a roll-up operation (any one value includes the values of its subordinate subjects, so as to determine one's value) is utilized in such calculation, and the above table can be utilized for the roll-up operation.
  • the year value can be generated by summing up each value of a quarter of a year, and the value of a quarter of a year can be generated by summing up a value of the respective months included in the corresponding quarter of the year.
  • the value of the specific patent classification code e.g. the number of applications according to the year
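The roll-up operation over the classification hierarchy can be sketched as follows. The per-code counts in `OWN_COUNT` are invented sample values, and the names `PARENT`, `OWN_COUNT`, and `roll_up` are hypothetical; only the parent links come from the H04B7/15 example in this description.

```python
from typing import Dict, Optional

# Direct higher code of each code (fragment of the H04B7/15 sub-tree).
PARENT: Dict[str, Optional[str]] = {
    "H04B7/15": None,
    "H04B7/155": "H04B7/15",
    "H04B7/165": "H04B7/155",
    "H04B7/17": "H04B7/155",
}

# Hypothetical number of applications assigned directly to each code.
OWN_COUNT: Dict[str, int] = {
    "H04B7/15": 1,
    "H04B7/155": 4,
    "H04B7/165": 3,
    "H04B7/17": 2,
}


def roll_up(own: Dict[str, int], parent: Dict[str, Optional[str]]) -> Dict[str, int]:
    """Add each code's own value to itself and to every higher code."""
    total = {code: 0 for code in parent}
    for code, value in own.items():
        node: Optional[str] = code
        while node is not None:
            total[node] += value
            node = parent[node]
    return total


ROLLED = roll_up(OWN_COUNT, PARENT)
```

With these sample values, H04B7/155 rolls up to 4 + 3 + 2 = 9 and H04B7/15 to 10; the same pattern applies to time, where months roll up into quarters and quarters into years.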
  • the table having the above formation can be generated for every IPC patent classification code, and by including the table and the afore-mentioned language-based title information, root IPC patent classification code DB corresponding to IPC among the root patent classification code DB can be generated.
  • the table also can be generated with respect to USPC, FT, FI, ECLA, etc.
  • the directory generation module 405 of the present invention provides the direct sub-patent classification code according to the patent classification code and indicates the counting value of the patent classification code and direct sub-patent classification code next to the patent classification code and direct sub-patent classification code.
  • the counting value includes at least one selected from 1) the number of every sub-patent classification, 2) the number of direct sub-patent classification, 3) depth of classification, 4) the number of patent documents corresponding to every sub-patent classification, and 5) the number of patent documents corresponding to the direct sub-patent classification, and it is preferred that 1), 2), and 5) are indicated together.
  • the directory generation module (405) reads at least one of 1) to 5) from i) the counting values performed by the counting pre-processing module of the present invention, or from any one of ii) a total quantity analysis multi-dimensional analysis operation result table described later, iii) a patent classification code tree table, or iv) a total higher patent classification code table, so as to indicate 1) to 5).
  • when the patent classification code information is arranged in a directory form, a colored sign or an icon is indicated in the respective directories so as to improve accessibility for the user. That is, by providing different colors or preset icons for the cases where there are many direct sub-patent classifications, there are many sub-patent classifications, the depth is deep, or there are many corresponding patent documents, the user can more easily recognize the internal attributes (the number of sub-patent classifications, the depth of the classification, the number of patent documents, etc.) of the directory. In particular, if a specific patent classification code directory has many sub-patent classifications or a deep classification depth, this is evidence that many patent applications belong to the corresponding field and that various technologies have been filed, so that the specific patent classification code directory is more important than other directories.
  • FIG. 15 illustrates the directory generation module 405 according to an exemplary embodiment of the present invention.
  • as noted in FIG. 15, from the preset stage of the directory generated by the directory generation module 405, it is possible to check a box corresponding to at least one patent classification code, and the patent classification code selected with the checked box is searched for each nation or is subjected to patent analysis based on the patent classification code.
  • the directory generation module 405 of the present invention includes a patent classification code utilization directory generation module 405-1 for generating the directory of the patent classification code corresponding to the multi-stage patent classification code system according to the kinds of at least one patent classification code and a subject-based directory generation module 405-2 for generating a multistage directory according to various analysis subjects, or other confirmable subject which are introduced in the present invention.
  • the operating example of the subject-based directory generation module is shown in FIG. ***.
  • a user or an administrator of the system 10 can generate the directory having the multi-stage hierarchical structure according to the specific subject and make search formula, query formula, a keyword, a patent classification code, etc. to correspond to the directory.
  • the system 10 can search and monitor the patent classification code using the corresponding information and the various analysis of the present specification can be conducted for the searched or monitored result.
  • the sub-hierarchy patent classification code processing module searches the hierarchical patent classification code DB for the sub-hierarchy patent classification codes of the patent classification code obtained from the search formula inputted by the user.
  • at least one of three methods is needed to include, in the search result, the patent documents corresponding to the sub-hierarchy patent classification codes of the obtained patent classification code, and this is processed by the sub-hierarchy patent classification code processing module.
  • the sub-hierarchy patent classification code processing module searches for every sub-hierarchy patent classification code of the obtained patent classification code, at the time the code is obtained, using the hierarchical patent classification code DB.
  • because the patent classification codes in the hierarchical patent classification code DB have the tree structure, it is possible to search for the sub-hierarchy patent classification codes. If the specific patent classification code is included in the obtained search formula, the sub-hierarchy patent classification code processing module searches for all the sub-hierarchy patent classification codes of the specific patent classification code along the tree structure and transmits the result to a modified search formula generation module.
  • the modified search formula generation module generates new search formula that includes the received patent classification codes and queries the generated search formula to the search engine.
  • the sub-hierarchy patent classification code processing module searches the sub-hierarchy patent classification codes of every patent classification code, matches the searched sub-hierarchy patent classification codes to a matching table, and stores them in advance, and if the specific patent classification code is included in the obtained search formula, the sub-hierarchy patent classification code processing module directly transmits the sub-hierarchy patent classification codes matched to the specific patent classification code to the modified search formula generation module.
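Generation of the modified search formula from a pre-stored matching table can be sketched as below. The formula syntax (`IPC=<code>`, `AND`, `OR`) is an assumption, since the description does not define one, and `MATCHING_TABLE`, `CODE_PATTERN`, and `expand_formula` are hypothetical names.

```python
import re
from typing import Dict, List

# Pre-stored matching table: code -> every sub-hierarchy code (sample fragment).
MATCHING_TABLE: Dict[str, List[str]] = {
    "H04B7/155": ["H04B7/165", "H04B7/17"],
    "H04B7/185": ["H04B7/19", "H04B7/195"],
}

# Matches an IPC term such as "IPC=H04B7/155" in the assumed formula syntax.
CODE_PATTERN = re.compile(r"IPC=([A-H]\d{2}[A-Z]\d+/\d+)")


def expand_formula(formula: str) -> str:
    """Replace each IPC term by an OR group of the code and its sub-codes."""

    def replace(match: "re.Match[str]") -> str:
        code = match.group(1)
        codes = [code] + MATCHING_TABLE.get(code, [])
        if len(codes) == 1:
            return match.group(0)  # no stored sub-codes: leave the term as is
        return "(" + " OR ".join(f"IPC={c}" for c in codes) + ")"

    return CODE_PATTERN.sub(replace, formula)
```

For instance, `expand_formula("IPC=H04B7/155 AND relay")` yields a formula whose IPC term also covers H04B7/165 and H04B7/17, which would then be queried to the search engine.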
  • The range of the sub-hierarchy patent classification codes extracted by the sub-hierarchy patent classification code extracting module can be set in two ways. One is to extract only the direct sub-patent classification codes of the obtained patent classification code, and the other is to extract every sub-patent classification code of the obtained patent classification code. Even when only direct sub-patent classification codes are extracted at each step, repeating the extraction on each extracted direct sub-patent classification code eventually yields every sub-patent classification code of the obtained patent classification code.
  • For H04B 7/15, the direct sub-patent classification codes are H04B 7/155, H04B 7/185, and H04B 7/204.
  • Every sub-patent classification code of H04B 7/15 comprises H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, and H04B 7/216.
  • The direct sub-patent classification codes and every sub-patent classification code can be easily extracted, directly or recursively, from either the child -> parent correspondence or the parent -> child correspondence.
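  • The two extraction ranges described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `CHILDREN` table is a hypothetical excerpt of the hierarchical patent classification code DB, and the parent links below H04B 7/155, H04B 7/185, and H04B 7/204 are assumed for illustration. The function names are likewise hypothetical.

```python
# Parent -> direct children. Hypothetical excerpt of the hierarchical
# classification code DB; links below the first level are assumed.
CHILDREN = {
    "H04B 7/15": ["H04B 7/155", "H04B 7/185", "H04B 7/204"],
    "H04B 7/155": ["H04B 7/165", "H04B 7/17"],
    "H04B 7/185": ["H04B 7/19", "H04B 7/195"],
    "H04B 7/204": ["H04B 7/208", "H04B 7/212", "H04B 7/216"],
}


def direct_sub_codes(code):
    """Return only the direct sub-codes of a code."""
    return list(CHILDREN.get(code, []))


def all_sub_codes(code):
    """Return every sub-code by repeating the direct extraction recursively."""
    result = []
    for child in direct_sub_codes(code):
        result.append(child)
        result.extend(all_sub_codes(child))
    return result


def child_to_parent(children):
    """Derive the child -> parent correspondence from parent -> child."""
    return {c: p for p, kids in children.items() for c in kids}


# Matching table computed in advance, as in the pre-stored variant.
MATCHING_TABLE = {code: all_sub_codes(code) for code in CHILDREN}
```

  • With this sample table, `direct_sub_codes("H04B 7/15")` yields the three direct codes and `all_sub_codes("H04B 7/15")` yields the ten codes listed above, while `MATCHING_TABLE` trades storage for a lookup that needs no tree walk at query time.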
  • The subordinate hierarchy patent classification code extracting module combines and stores the extracted subordinate hierarchy patent classification codes. If the module stores only the direct sub-patent classification codes of the patent classification code H04B 7/15 included in the search formula, it combines and stores {H04B 7/15, H04B 7/155, H04B 7/185, H04B 7/204}. If the module stores every sub-patent classification code of H04B 7/15 together with the direct sub-patent classification codes, it combines and stores {H04B 7/15, H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, H04B 7/216}.
  • The modified search formula generation module merges the combined patent classification codes into the original search formula so as to generate the modified search formula.
  • the following search formula is preferable.
  • The sub-hierarchy patent classification code processing module searches the information relating to the sub-hierarchy patent classification codes of the patent classification code so that the modified search formula generation module can process the searched information.
  • The search engine receives the search formula including the patent classification code from the computer of the user. If the patent classification code included in the obtained search formula has dot(s) attached to its title information, the search engine obtains the sub-patent classification codes in the patent classification code system through the patent classification code pre-processing engine, ties the obtained patent classification codes with OR, and generates the modified search formula including the patent classification codes tied with
  • A typical search engine supports range searches. For example, in searching H04B 7/15, the search result is returned more quickly when the search engine is queried with "From H04B 7/15 to H04B 7/216" than when it is queried with H04B 7/15, H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, and H04B 7/216 tied with OR.
  • The more sub-hierarchy patent classification codes there are, the larger the difference in search speed.
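  • The difference between the two query forms can be sketched as follows. The query syntax here is a made-up, Lucene-like notation chosen only for illustration, and the field name `IPC` and both function names are assumptions; the source does not specify the syntax of any particular search engine.

```python
CODES = [
    "H04B 7/15", "H04B 7/155", "H04B 7/165", "H04B 7/17", "H04B 7/185",
    "H04B 7/19", "H04B 7/195", "H04B 7/204", "H04B 7/208", "H04B 7/212",
    "H04B 7/216",
]


def or_query(codes, field="IPC"):
    """Tie every code with OR: one query term per classification code."""
    return " OR ".join(f'{field}:"{code}"' for code in codes)


def range_query(first, last, field="IPC"):
    """Express the same set as a single inclusive range term."""
    return f'{field}:["{first}" TO "{last}"]'


or_form = or_query(CODES)
range_form = range_query(CODES[0], CODES[-1])
```

  • The OR form grows by one term for every additional sub-hierarchy code, whereas the range form stays a single term, which is consistent with the speed difference described above.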
  • The elements of the modified search formula are generated slightly differently depending on whether the search engine or DBMS supports irregular range symbols.
  • Irregular range symbol processing means handling a range whose two ends carry different symbols, such as a range that includes its lower bound but excludes its upper bound. If the search engine supports irregular range symbol processing, it generates, for the obtained patent classification code, the modified search formula element (patent classification codes ≥ the obtained patent classification code, patent classification codes < the next sibling of the obtained patent classification code).
  • Otherwise, the modified search formula element (the obtained patent classification code) or (patent classification codes > the obtained patent classification code, patent classification codes < the next sibling of the obtained patent classification code) is generated. If modified patent classification codes arranged in lexical order are used, they are of course processed by the same method.
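  • The two range elements can be compared with a small sketch over lexically sorted codes. It assumes that the codes sort lexically in hierarchy order and that H04B 7/22 is the code that follows the H04B 7/15 subtree; both are assumptions made for illustration, as are the function names.

```python
from bisect import bisect_left

# Lexically sorted sample codes. "H04B 7/22" stands in for the next
# sibling of "H04B 7/15"; this boundary is an assumption.
SORTED_CODES = sorted([
    "H04B 7/14", "H04B 7/145", "H04B 7/15", "H04B 7/155", "H04B 7/165",
    "H04B 7/17", "H04B 7/185", "H04B 7/19", "H04B 7/195", "H04B 7/204",
    "H04B 7/208", "H04B 7/212", "H04B 7/216", "H04B 7/22",
])


def mixed_bound_range(codes, code, next_sibling):
    """code <= c < next_sibling, for engines that accept mixed bounds."""
    return codes[bisect_left(codes, code):bisect_left(codes, next_sibling)]


def fallback_range(codes, code, next_sibling):
    """(c == code) OR (code < c < next_sibling), for engines that do not."""
    return [c for c in codes if c == code or code < c < next_sibling]


selected = mixed_bound_range(SORTED_CODES, "H04B 7/15", "H04B 7/22")
```

  • Both forms select the obtained code together with all of its sub-codes and nothing else, so the choice between them depends only on what the search engine or DBMS accepts.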
  • The core of the patent classification code pre-processing engine of the present invention is that, when a specific patent classification code is given, the engine automatically processes every sub-hierarchy (child) or direct sub-hierarchy patent classification code of the specific patent classification code as well, and reflects them in every search formula and query formula used for search, analysis, monitoring, etc.
  • The search result can be displayed in clusters according to the searcher's choice.
  • Clustering is a scheme of grouping the search results and presenting them group by group, and it is applied to the present invention as follows. If a clustering window representing the clustered structure is provided, or a clustering level can be selected, only the search results of the dot sub-group corresponding to the preset or preselected clustering level and of the sub-groups below it are presented. If the searcher selects a lower or higher clustering level (the clustering level corresponds to the dot level), the search results corresponding to the selected clustering level are presented.
  • If the searcher selects H04B 7/14, then H04B 7/145 and H04B 7/15, which are directly subordinate to H04B 7/14, are presented in the form of sub-directory folders, and the search result shows only the results for H04B 7/14 and its sub-patent classification codes. If the searcher then selects H04B 7/15, the codes H04B 7/155, H04B 7/185, and H04B 7/204, which are subordinate to H04B 7/15, are displayed, and the search result shows only the results for H04B 7/15 and its sub-patent classification codes.
  • For H04B 7/15, the direct sub-patent classification codes are H04B 7/155, H04B 7/185, and H04B 7/204, and the number of direct sub-patent classification codes is indicated accordingly.
  • Every sub-patent classification code of H04B 7/15 comprises H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, and H04B 7/216, and the number of all sub-patent classification codes is indicated accordingly. It is preferable to display both numbers, because the number of direct sub-patent classification codes is small while the number of sub-patent classification codes under them is much greater.
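  • The clustering behavior described above can be sketched as follows. This is only an illustration under assumptions: the `CHILDREN` table is a hypothetical two-level excerpt, the sample hits are invented, and `select_cluster` is a hypothetical name for the step that runs when the searcher selects a folder.

```python
# Hypothetical excerpt of the classification tree (parent -> children).
CHILDREN = {
    "H04B 7/14": ["H04B 7/145", "H04B 7/15"],
    "H04B 7/15": ["H04B 7/155", "H04B 7/185", "H04B 7/204"],
}

# Invented sample hits: (document id, classification code).
HITS = [
    ("D1", "H04B 7/145"),
    ("D2", "H04B 7/155"),
    ("D3", "H04B 7/204"),
    ("D4", "H04B 7/14"),
]


def subtree(code):
    """Return the code itself followed by every code below it."""
    codes = [code]
    for child in CHILDREN.get(code, []):
        codes.extend(subtree(child))
    return codes


def select_cluster(code, hits):
    """Return the folders to display and the hits inside the selected subtree."""
    allowed = set(subtree(code))
    folders = list(CHILDREN.get(code, []))
    return folders, [hit for hit in hits if hit[1] in allowed]


folders_15, hits_15 = select_cluster("H04B 7/15", HITS)
```

  • Selecting H04B 7/15 in this sample shows its three direct sub-codes as folders and narrows the hits to D2 and D3, while selecting H04B 7/14 keeps all four hits.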
  • A patent family refers to the set of documents, domestic or foreign, that are directly related to a specific document.
  • The domestic patent family within one nation includes 1) a divisional application, 2) a converted application or dual application, and 3) a patent application claiming domestic priority (the terms that define the patent family range differ slightly from nation to nation, such as the continuation-in-part application and the reissue patent in the US).
  • The foreign patent family includes 1) the international patent application (PCT), 2) the application claiming priority under the Paris Convention, 3) the national-phase or foreign application that results when 1) or 2) is entered or filed with the patent office of a specific nation, and, for the application of 3), (1) its divisional application, (2) its converted application or dual application, and (3) its application claiming domestic priority (the conditions that define the patent family range differ slightly from nation to nation, such as the continuation-in-part application and the reissue patent in the US).
  • The construction of the family information pre-processing module for pre-processing the patent family information of the present invention is illustrated in FIG. 9.
  • A family information pre-processing engine 3810 is required for processing the family information, and various DBs are required for obtaining the family information.
  • The necessary DBs include a Paris Convention priority information DB 3671, a PAJ DB 3673, a KPA DB 3675, an Inpadoc DB 3677, other family information DB 3830, etc.
  • The Inpadoc DB 3677 plays an important role.
  • The family information pre-processing module is closely related to processing and updating the patent document master DB, because 1) new patent documents are continuously published and 2) most patent information is obtained nation by nation, with new patent documents continuously published in each nation, so the family information can change at any time. That is, the family information must be renewed whenever such documents are obtained, and this renewal is performed by the family information pre-processing module of the present invention.
  • The family information can be found through the specific application numbers indicated in the original application information and priority information of an existing or new document.
  • The method by which the family information pre-processing engine of the family information pre-processing module processes the family information is illustrated in FIG. 87.
  • The family information pre-processing module obtains at least one piece of patent document information (S3720); identifies whether family information, such as a priority number, divisional application, converted application, or continuation-in-part application, is present in the patent document information (S3730); if it is present, correlates the patent document information with the unique document number of the family patent document and stores it in the patent DB or search index (S3740); and selectively counts and stores the number of family members per nation for the obtained patent document, or increments the number of family members if the record for the unique document number of the family patent document already includes that number (S3750).
  • The family information processed by the family information pre-processing module is stored in the family information DB of the present invention.
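  • Steps S3720 to S3750 can be sketched roughly as follows. The record layout (`doc_id`, `nation`, `application_number`, `family_refs`) and the sample numbers are hypothetical, and the sketch links only directly referenced documents; it does not compute a transitive family, which a real system may need.

```python
from collections import defaultdict

# Invented sample records; "family_refs" holds the application numbers named
# as priority, parent, or original application in each document.
DOCS = [
    {"doc_id": "KR1", "nation": "KR", "application_number": "KR-10-2007-0001", "family_refs": []},
    {"doc_id": "US1", "nation": "US", "application_number": "US-12/000,001", "family_refs": ["KR-10-2007-0001"]},
    {"doc_id": "JP1", "nation": "JP", "application_number": "JP-2008-000001", "family_refs": ["KR-10-2007-0001"]},
    {"doc_id": "KR2", "nation": "KR", "application_number": "KR-10-2008-0002", "family_refs": ["KR-10-2007-0001"]},
]


def link_families(documents):
    """Link each document to its direct family members and count them per nation."""
    by_app_no = {d["application_number"]: d["doc_id"] for d in documents}
    nation_of = {d["doc_id"]: d["nation"] for d in documents}
    family = defaultdict(set)
    for doc in documents:                        # S3720: obtain document
        for ref in doc.get("family_refs", []):   # S3730: family info present?
            other = by_app_no.get(ref)
            if other is None or other == doc["doc_id"]:
                continue
            family[doc["doc_id"]].add(other)     # S3740: correlate both ways
            family[other].add(doc["doc_id"])
    counts = {}
    for doc_id, members in family.items():       # S3750: count per nation
        per_nation = defaultdict(int)
        for member in members:
            per_nation[nation_of[member]] += 1
        counts[doc_id] = dict(per_nation)
    return dict(family), counts


family, counts = link_families(DOCS)
```

  • In this sample the original Korean application KR1 ends up with one family member each in the US, Japan, and Korea, and each later application is linked back to KR1.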
  • The citation information pre-processing module of the present invention includes at least one of a citation information obtaining module 3400-1 for obtaining citation-related information and a cited information renewing module 3400-2 for renewing cited information based on citing information.
  • The citation information obtaining module includes an applicant-cited information obtaining module 3400-1-1 for obtaining the citation-related information from the information cited by the applicant in the applicant's patent document, and an examiner-cited information obtaining module 3400-1-2 for obtaining the information cited by the examiner during examination.
  • The applicant-cited information obtaining module 3400-1-1 includes an applicant-cited patent document information obtaining module 3400-1-1-1 and an applicant-cited non-patent document information obtaining module 3400-1-1-2.
  • The applicant-cited patent document information obtaining module 3400-1-1-1 operates when the citing information is patent information. Further, the examiner-cited information obtaining module 3400-1-2 includes an examiner-cited patent document information obtaining module 3400-1-1-1 and an examiner-cited non-patent document information obtaining module 3400-1-2.
  • The citation information pre-processing module processes the citation information obtained by the citation information obtaining module in a preset manner.
  • The patent citation information includes 1) prior art citation information, that is, other patent documents cited by the applicant as prior art, 2) reference citation information, that is, patent documents cited by the examiner as references in examining a specific patent document, and 3) indirect citation information. Citations naturally occur both domestically and abroad. Generally, the citation of another patent as prior art is included in the prior art document information of patent documents of the Republic of Korea and Japan, and in the reference information of patent documents of the US. Such prior art citation information can be obtained from the bibliographical particulars or body contents of the patent document. Further, many nations publish the reference citation information, which can also be identified in the administrative information published by the patent office.
  • The present specification focuses mainly on the prior art citation information, but where the obtained reference citation information can be treated like the prior art citation information, the same description applies to it.
  • Indirect citation includes indirect backward citation and indirect forward citation.
  • For a to-be-processed patent document, the indirect backward citation document set includes the patent documents that have an application date earlier than that of the to-be-processed patent document and that cite a patent document cited by the to-be-processed patent document.
  • For a to-be-processed patent document, the indirect forward citation document set includes the patent documents that have an application date later than that of the to-be-processed patent document and that are cited in a patent document that cites the to-be-processed patent document.
  • Indirect citation results from the difference between the patent application date and the patent publication date.
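  • The two indirect citation sets defined above can be computed from a citation map and application dates, as in the following sketch. The document identifiers, dates, and function names are invented for illustration; `CITES` maps each document to the set of documents it cites.

```python
from datetime import date

# Invented sample data: application dates and direct (backward) citations.
APP_DATE = {
    "A": date(2000, 1, 1), "B": date(2001, 1, 1), "T": date(2002, 1, 1),
    "C": date(2003, 1, 1), "E": date(2004, 1, 1),
}
CITES = {"T": {"A"}, "B": {"A"}, "E": {"T", "C"}}


def indirect_backward(target, cites, app_date):
    """Earlier-filed documents citing something the target also cites."""
    cited_by_target = cites.get(target, set())
    return {
        doc for doc, refs in cites.items()
        if doc != target
        and app_date[doc] < app_date[target]
        and refs & cited_by_target
    }


def indirect_forward(target, cites, app_date):
    """Later-filed documents cited by a document that also cites the target."""
    citing_target = {doc for doc, refs in cites.items() if target in refs}
    return {
        doc
        for citer in citing_target
        for doc in cites.get(citer, set())
        if doc != target and app_date[doc] > app_date[target]
    }


backward_set = indirect_backward("T", CITES, APP_DATE)
forward_set = indirect_forward("T", CITES, APP_DATE)
```

  • In this sample, B was filed before T and cites A, which T also cites, so B is an indirect backward citation of T; C was filed after T and is cited together with T by E, so C is an indirect forward citation of T.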
  • The citation information pre-processing module of the present invention pre-processes the prior art citation information and the reference citation information. As shown in FIG. 5, it includes a prior art citation information pre-processing module for pre-processing the prior art citation information and a reference citation information pre-processing module for pre-processing the reference citation information.
  • The backward citation is included in the bibliographical particulars or body contents of the specific patent document, so the document information relating to the backward citation is stored in the search index of the specific document or in the specific document information.
  • The forward citation relating to the specific document is not stored in the specific document information, but in the search index or document information of the document that cites the specific document. Therefore, it is necessary to search for the forward citations relating to the specific document, correlate them with the specific document, and store them, which is done by the prior art citation information pre-processing module.
  • The prior art citation information pre-processing module queries the search index or DBMS with the application number and the unique number value of the specific document so as to obtain the forward citation information of the specific document, that is, the documents by which it is cited.
  • The prior art citation information pre-processing module can store the obtained forward citation information of the specific document in the search index in the step of indexing the specific document, and stores it as the forward citation information of the specific document in the patent document master DB.
  • The forward citation information must always be monitored when the patent information is renewed. Therefore, when a new patent document is published and the search index or the patent document master DB is renewed, the prior art citation information pre-processing module queries whether a newly added patent document citing the specific document is present so as to renew the forward citation information.
  • The citation information pre-processing module receives at least one citation number present in the specific document (S3820), obtains any citation number whose attribute is not the preset one (S3830), queries the search engine or DBMS with the obtained citation number so as to receive the citation number having the preset attribute (S3840), and represents the obtained citation number by the received citation number having the preset attribute (S3850). It is also necessary to obtain the forward citation document information, that is, the information on forward application documents citing the specific document, and to store it in the document information of the specific document; an exemplary method is illustrated in FIG. 89.
  • The citation information pre-processing module obtains specific document data (S3920), obtains at least one of the application number, registration number, and publication number included in the specific document data (S3940), queries the citation field of the search engine with the obtained number, or queries the DBMS with the obtained number limited to the citation field (S3950), receives the forward citation information as the query result (S3960), and stores the forward citation information in the bibliographical particulars of the document data (S3970).
  • The reference citation information pre-processing module pre-processes the reference citation information in the same manner as the prior art citation information pre-processing module pre-processes the prior art citation information.
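  • The forward citation lookup of steps S3920 to S3970 amounts to inverting the stored backward citations. The sketch below uses an in-memory dictionary as a stand-in for the citation field of a search index or DBMS; the record layout, the sample numbers, and the function names are all hypothetical.

```python
from collections import defaultdict

# Invented sample records; "citations" lists the numbers each document cites.
DOCS = [
    {"doc_id": "P1", "application_number": "KR-2000-0001",
     "publication_number": "KR-A-0001", "citations": []},
    {"doc_id": "P2", "application_number": "KR-2002-0002",
     "citations": ["KR-2000-0001"]},
    {"doc_id": "P3", "application_number": "US-2003-0003",
     "citations": ["KR-A-0001", "KR-2002-0002"]},
]


def build_citation_index(documents):
    """Citation field index: cited number -> ids of the documents citing it."""
    index = defaultdict(set)
    for doc in documents:
        for number in doc["citations"]:
            index[number].add(doc["doc_id"])
    return index


def forward_citations(doc, index):
    """Query the citation field with every known number of the document."""
    keys = ("application_number", "registration_number", "publication_number")
    found = set()
    for key in keys:                        # S3940: collect the numbers
        number = doc.get(key)
        if number:
            found |= index.get(number, set())   # S3950-S3960: query, receive
    return sorted(found)


def store_forward_citations(documents):
    """S3970: store the result with each document's bibliographic data."""
    index = build_citation_index(documents)
    for doc in documents:
        doc["forward_citations"] = forward_citations(doc, index)
    return documents


store_forward_citations(DOCS)
```

  • Querying with the application, registration, and publication numbers together matters because citing documents may refer to the same document by different numbers, as P2 and P3 do for P1 in this sample.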
  • The counting pre-processing module includes an individual document-based counting pre-processing module for processing the counting for each individual document, and a multiple document-based counting pre-processing module for processing the counting in units of at least two documents.
  • The individual document-based counting pre-processing module can calculate numerical information on at least one of 1) the number of applicants and/or patentees, 2) the number of inventors, 3) the number of claims at each step, such as the step of filing an application or registration, 4) the number of specification pages, 5) the number of drawings, 6) the number of kinds of patent classification code, 7) the number of patent classification codes of each kind, 8) the number of references, 9) the number of references cited by the examiner, 10) the number of patent classification codes searched by the examiner, 11) the number of priorities, 12) the number of nation-based family patents, 13) the number of total families, 14) the number of independent claims, 15) the number of dependent claims, 16) the number of patents for each nation among the references, 17) the total number of patent documents among the references, and 18) the number of non-patent documents among the references.
  • The individual document-based counting pre-processing module can calculate, from the individual patent document or from the administrative processing information of the patent office that publishes it, any one of 1) the term from the application date to the registration date, 2) the term from the priority date to the application date, 3) the term from the application date to the date of requesting an examination, and 4) the term from the date of the Office Action to the date of filing a response to the Office Action.
  • The individual document-based counting pre-processing module can obtain, from the individual patent document or administrative information, information on whether a specific system of the individual nation has been utilized or applied, including 1) whether the examination request system is utilized, 2) whether the divisional application is utilized, 3) whether the continuation-in-part application is utilized, 4) whether the reissue patent system is utilized, 5) whether the accelerated examination is requested, 6) whether the trial for appealing a decision is utilized, and 7) whether non-party litigation has occurred. If the system is utilized or applied, the counting value can be set to 1, and if it is not, the counting value can be set to 0.
  • The individual document-based counting pre-processing module searches the patent document master DB for the individual patent document (using the search engine or DBMS) so as to obtain and count the information relating to the individual patent document in at least one second nation, including 1) the number of nations of application, 2) the number of nations of registration, 3) whether a PCT application has been filed, 4) the number of families for each nation, and 5) the total number of families.
  • The individual document-based counting pre-processing module can recognize and count, through the search engine and DBMS, any one of 1) forward citations (the domestic and foreign patent documents citing the specific patent document), 2) backward citations (the previous patent documents cited by the specific patent document), and 3) indirect citations.
  • The individual document-based counting pre-processing module can obtain information on 1) whether a location has been requested, 2) whether information has been provided, and 3) whether a party trial has occurred for the individual patent document.
  • Information 1) to 3) can be obtained directly or by a search method from the Korean Patent Office or the Industrial Attribute Consumer in Korea.
  • The value counted by the individual document-based counting pre-processing module is stored in the processing bibliographical DB of the patent information master DB. Further, it is more preferable that the counting value be included in the index that is generated when the search engine indexes the patent document for storage.
  • The counting value can be included in the index by at least one of 1) operating the individual document-based counting pre-processing module when the search engine indexes the patent document so as to obtain the counting value, or 2) obtaining the counting value stored in the patent information master DB.
  • The counting pre-processing module counts, for each patent classification code, 1) the total number of sub-patent classification codes and 2) the number of direct sub-patent classification codes by referring to the patent classification code master DB, and stores the counting values in the patent classification code master DB.
  • The counting pre-processing module obtains at least one piece of patent document information (S2820) and calls at least one individual document-based counting pre-processing module for the obtained patent document (S2830); the individual document-based counting pre-processing module processes the counting of the patent document information (S2840) and stores the counting result in the patent DB or search index together with the unique document number of the patent document information (S2850).
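  • A small subset of the individual document-based counts can be sketched as follows. The record layout (`applicants`, `inventors`, `claims`, `ipc`, `is_divisional`) is hypothetical, and a plain dictionary keyed by the unique document number stands in for the patent DB or search index.

```python
# Invented sample document; a claim with depends_on of None is independent.
SAMPLE_DOC = {
    "doc_id": "KR-0000001",
    "applicants": ["Applicant A", "Applicant B"],
    "inventors": ["Inventor X"],
    "claims": [{"depends_on": None}, {"depends_on": 1}, {"depends_on": 1}],
    "ipc": ["H04B 7/15", "H04B 7/155", "H04B 7/15"],
    "is_divisional": True,
}


def count_document(doc):
    """S2840: compute the counting values of one document."""
    claims = doc.get("claims", [])
    independent = sum(1 for c in claims if c.get("depends_on") is None)
    return {
        "applicants": len(doc.get("applicants", [])),
        "inventors": len(doc.get("inventors", [])),
        "claims": len(claims),
        "independent_claims": independent,
        "dependent_claims": len(claims) - independent,
        "classification_codes": len(set(doc.get("ipc", []))),
        # Utilization of a system is recorded as 1 (utilized) or 0 (not).
        "divisional": 1 if doc.get("is_divisional") else 0,
    }


def count_documents(documents):
    """S2820-S2850: count every document and key the result by document number."""
    return {doc["doc_id"]: count_document(doc) for doc in documents}


COUNT_STORE = count_documents([SAMPLE_DOC])
```

  • Storing the counts once per document, rather than recomputing them per query, is what lets the later weight modules read them directly from the patent DB or search index.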
  • The multiple document-based counting pre-processing module obtains a score for the subject set by collecting, over multiple documents, the information counted by the individual document-based counting pre-processing module. That is, the multiple document-based counting pre-processing module generates the counting data for a set of at least two documents.
  • The present invention suggests a method of granting a weight to a patent document on the basis of information that the system can recognize as data.
  • The weight of the patent document is granted by a document-based weight pre-processing module.
  • The weight pre-processing module of the present invention includes the document-based weight pre-processing module 3310 and a subject-based weight pre-processing module 3330.
  • The document-based weight pre-processing module 3310 includes an expense expenditure view weight pre-processing module 3311, a citation view weight pre-processing module 3313, a dispute view weight pre-processing module 3315, and/or a concentration rate view weight pre-processing module 3317. Further, the subject-based weight pre-processing module 3330 includes an applicant-based weight pre-processing module 3331, an inventor-based weight pre-processing module 3333, and/or an agent-based weight pre-processing module 3335.
  • The weight granting elements can influence the weight of the patent document and can be obtained from the bibliographical particulars, specification contents, or administrative processing information of the individual document.
  • If the patent application is a joint application, it is the result of cooperation between two or more different subjects, so a joint application is statistically more important than a non-joint application; likewise, if there are multiple inventors, the invention was completed through the cooperative research of several inventors and is statistically more important.
  • A foreign application costs far more than a domestic application, so if an application has been filed in a foreign nation, or in many foreign nations, the applicant has spent substantial expense on the foreign patents, and it is reasonable to consider the patent relatively more important to the applicant.
  • For each weight granting element, its existence and/or numerical value is recognized (with respect to existence, the numerical value is 0 if the weight granting element does not exist and 1 if it exists; if the weight granting element is the number of claims, the numerical value is 1 or higher), and a weight score is granted for the weight granting element according to that existence and/or numerical value.
  • The expense expenditure view weight pre-processing module recognizes the existence and numerical value of the particular weight granting elements considered in the respective views 1) to 5) and grants the weight score according to the weight granting elements.
  • The expense expenditure view weight pre-processing module obtains at least one piece of patent document information (S2920), obtains a counting result value for at least one preset counting reference relating to expense expenditure from the obtained patent document information, using the counting pre-processing module or through the patent DB or search index (S2930), and obtains the weight for each counting reference from the patent information processing policy DB (S2940).
  • The preset counting references are taken from the patent information processing policy stored in the patent information processing policy DB.
  • The patent information processing policy DB stores the policy for processing the weight, such as (the number of claims × 0.1 + the number of independent claims × 0.3), for the respective subjects in the respective views.
  • The expense expenditure view weight pre-processing module generates an expense weight value from the counting reference-based weights and the counting result values for the counting references (S2940) and stores the generated weight value in the patent DB or search index together with the unique document number of the patent document information (S2950).
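  • The policy example above can be evaluated as a weighted sum of counting values. In this sketch the policy is a plain dictionary standing in for an entry of the patent information processing policy DB; the key names, the sample counts, and the function name are hypothetical.

```python
# Policy entry mirroring the example in the text:
# number of claims x 0.1 + number of independent claims x 0.3.
EXPENSE_POLICY = {"claims": 0.1, "independent_claims": 0.3}


def view_weight(counts, policy):
    """Multiply each counting result by its policy weight and add them up."""
    return sum(weight * counts.get(name, 0) for name, weight in policy.items())


# Invented counting results for one document.
SAMPLE_COUNTS = {"claims": 10, "independent_claims": 2}
expense_weight = view_weight(SAMPLE_COUNTS, EXPENSE_POLICY)
```

  • For a document with 10 claims and 2 independent claims this gives approximately 10 × 0.1 + 2 × 0.3 = 1.6. Keeping the weights in a policy table rather than in code lets the same function serve the citation, dispute, and concentration rate views with different entries.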
  • The module that processes the weight of an individual document in the citation view is referred to as the citation view weight pre-processing module.
  • The weight granting elements include 1) in the forward citation view, the total number of forward citations, the depth of forward citation, the number of first-level forward citations, the average term of forward citation, and the concentration rate of forward citation relative to the term, and 2) in the examiner citation view, the examiner citations.
  • The citation view weight pre-processing module obtains at least one piece of patent document information (S3020), obtains the counting result value for at least one preset counting reference relating to backward citation, forward citation, and/or examiner citation from the obtained patent document information, using the counting pre-processing module or through the patent DB or search index (S3030), refers to the patent information processing policy DB and obtains the weight for each counting reference (S3040), generates the citation view weight value from the counting reference-based weights and the counting result values (S3050), and stores the generated weight value in the patent DB or search index together with the unique document number of the patent document information (S3060).
  • The preset counting references relating to citation are taken from the patent information processing policy stored in the patent information processing policy DB.
  • In the citation view, the patent information processing policy DB stores the policy for processing the weight, such as "the number of forward citations × 0.3", for the respective subjects in the respective views.
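  • Some of the forward citation elements can be derived with a breadth-first walk over the forward citation links, as sketched below. The sketch assumes that "depth" means the length of the longest chain of successive forward citations found by that walk, which the source does not define; the sample data and the function name are likewise invented.

```python
from collections import deque

# Invented sample: document -> documents that cite it (forward citations).
CITED_BY = {"P": ["A", "B"], "A": ["C"], "C": ["D"]}


def forward_citation_metrics(doc_id, cited_by):
    """Count direct and indirect citing documents level by level."""
    level_of = {doc_id: 0}
    queue = deque([doc_id])
    while queue:
        current = queue.popleft()
        for citer in cited_by.get(current, []):
            if citer not in level_of:          # visit each document once
                level_of[citer] = level_of[current] + 1
                queue.append(citer)
    levels = [level for level in level_of.values() if level > 0]
    return {
        "total": len(levels),
        "first_level": sum(1 for level in levels if level == 1),
        "depth": max(levels, default=0),       # assumed meaning of "depth"
    }


metrics = forward_citation_metrics("P", CITED_BY)
# Policy example from the text: number of forward citations x 0.3.
citation_weight = metrics["total"] * 0.3
```

  • For the sample, P has two first-level forward citations, four in total, and a depth of three, so the example policy yields a citation view weight of about 1.2.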
  • Dispute view weight pre-processing module. The more disputes relate to a patent document, the more likely the patent is to be important.
  • The disputative elements recognized in the patent information include 1) invalidation trial, 2) location, 3) information provision, 4) passive or active right scope identification trial, etc.
  • The dispute view weight pre-processing module recognizes the existence and numerical value of each weight granting element and grants the weight score according to the weight granting element in the dispute view.
  • A method of processing weight by the dispute view weight pre-processing module is illustrated in FIG. 81 and closely resembles the methods of the expense expenditure view weight pre-processing module and the citation view weight pre-processing module.
  • The dispute view weight pre-processing module obtains at least one piece of patent document information (S3120), obtains the counting result value for at least one preset counting reference relating to disputes from the obtained patent document information, using the counting pre-processing module or through the patent DB or search index (S3130), refers to the patent information processing policy DB and obtains the weight for each counting reference (S3140), generates the dispute view weight value from the counting reference-based weights and the counting result values (S3150), and stores the generated weight value in the patent DB or search index together with the unique document number of the patent document information (S3160).
  • The essence of the method of processing the weight by the dispute view weight pre-processing module is to count the number of disputes, obtain the weight corresponding to the count from the patent information processing policy DB, and generate the dispute view weight value.
  • If multiple applicants file applications in the technical field to which an application belongs, so that 1) the number of applications is large, 2) the increase/decrease rate or speed of applications is high, 3) the number of applicants is large, or 4) the variation of the occupancy rate relative to other technical fields exceeds a preset range, the technical field is likely to be more important.
  • The technical field on which multiple applicants concentrate can be measured by analysis elements such as technology attractiveness, the application increase/decrease rate, the application increase/decrease speed, and the variation of the occupancy rate.
  • The technical field in this paragraph is identified from a hierarchical patent classification code included in the patent document, such as IPC, USPC, FI, FT, or ECLA, and is recognized at each level of the hierarchy.
  • Sub-classification codes are automatically included when the numerical value of each analysis element is calculated. That is, if H04B 7/26 is given as the IPC, the analysis element values can be calculated for H04B 7/26 itself and also in the H04B 7/00 view.
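  • The roll-up from H04B 7/26 to the H04B 7/00 view can be sketched as below. The function names are hypothetical. The sketch only derives the levels that can be read directly from the IPC symbol (section, class, subclass, main group); the hierarchy among subgroups depends on the classification scheme itself and would need the patent technology classification code DB.

```python
from collections import Counter
from typing import Iterable, List

def ipc_levels(code: str) -> List[str]:
    """Return the code itself plus its main group, subclass, class and section."""
    subclass, group = code.split()
    main_group = group.split("/")[0]
    levels = [code, f"{subclass} {main_group}/00", subclass, subclass[:3], subclass[:1]]
    # Drop duplicates (e.g. when the code already is a main group), keeping order.
    return list(dict.fromkeys(levels))

def count_by_level(codes: Iterable[str]) -> Counter:
    """Count documents at every hierarchy level, one code per document."""
    counter: Counter = Counter()
    for code in codes:
        counter.update(ipc_levels(code))
    return counter

per_level = count_by_level(["H04B 7/26", "H04B 7/00", "H04B 1/38"])
```

  • With these three codes, two documents fall in the H04B 7/00 view and all three fall under subclass H04B.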
  • the concentration rate view weight pre-processing module of the present invention recognizes the existence and the numerical value of the weight granting element and grants the weight score according to the respective weight granting elements in the concentration rate view.
  • The method of processing weight by the concentration rate view weight pre-processing module closely resembles those of the expense expenditure view weight pre-processing module and the citation view weight pre-processing module. That is, the concentration rate view weight pre-processing module uses the bibliographic information of each individual document to measure the value of a patent information analysis index expressing concentration, such as the concentration rate, activity rate, or occupancy rate of the applicant/patentee in the technical field, obtains the weight for the measured value from the patent information processing policy DB, and generates the concentration rate view weight value.
  • The method of processing weight by the concentration rate view weight pre-processing module is illustrated in FIG. 82.
  • the concentration rate view weight pre-processing module obtains at least one patent document information (S3220), obtains applicant information and patent technology classification information from the patent document DB (S3230), calculates, by means of the value of the obtained applicant information and patent technology classification information value, the value of applicant's concentration rate/activity rate/occupancy rate in the patent technology classification or at least one higher patent technology classification code obtained in the patent technology classification code system referring to the patent technology classification code DB on a basis of at least one reference (S3240), obtains the weight information on the reference-based concentration rate/activity rate/occupancy rate from the patent information processing policy DB (S3250), generates a concentration rate view weight value with the weight according to the reference and the concentration rate result value (S3260), and stores the generated weight value in the patent DB and search index together with the unique document number of the patent document information (S3270).
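  • Step S3240 can be sketched as below. This passage does not give the formulas, so the definitions used here are assumptions made only for illustration: the concentration rate is taken as the share of the applicant's own documents that fall in the classification code, and the occupancy rate as the applicant's share of all documents in that code.

```python
from typing import List, Tuple

def rates(docs: List[Tuple[str, str]], applicant: str, code: str) -> Tuple[float, float]:
    """Return (concentration, occupancy) for one applicant in one classification code.

    docs holds (applicant, classification code) pairs, one per document.
    Both definitions are assumptions, not taken from the source text.
    """
    by_applicant = sum(1 for a, _ in docs if a == applicant)
    by_code = sum(1 for _, c in docs if c == code)
    both = sum(1 for a, c in docs if a == applicant and c == code)
    concentration = both / by_applicant if by_applicant else 0.0
    occupancy = both / by_code if by_code else 0.0
    return concentration, occupancy

# Hypothetical data: applicant A has three documents, two of them in H04B.
docs = [("A", "H04B"), ("A", "H04B"), ("A", "G06F"), ("B", "H04B")]
concentration, occupancy = rates(docs, "A", "H04B")
```

  • The same function can be reused for the inventor-based variant of FIG. 83 by passing (inventor, classification code) pairs instead.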
  • the concentration rate view weight pre-processing module can process the weight based on the inventor, not based on the applicant, and a method of processing the weight by the concentration rate view weight pre-processing module is illustrated in FIG. 83.
  • the concentration rate view weight pre-processing module obtains at least one patent document information (S3320), obtains inventor information and patent technology classification information from the patent document information (S3330), calculates the concentration rate of the inventor for the patent technology classification or at least one higher patent technology classification code obtained in the patent technology classification code system with reference to the patent technology classification code DB with the value of obtained inventor information and patent technology classification information value on a basis of at least one reference (S3340), obtains the weight information on the reference-based concentration rate (S3350), generates the concentration rate view weight value with the weight according to the reference and the concentration rate result value (S3360), and stores the generated weight value in the patent DB and search index together with the unique document number of the patent document information (S3370).
  • the subject involved in the single patent document includes an applicant, an inventor, and an agent.
  • The subject-based weight pre-processing module pre-processes the weight of the patent document in the subject view. As shown in FIG. 4, the subject-based weight pre-processing module includes an applicant-based weight pre-processing module, an inventor-based weight pre-processing module, and an agent-based weight pre-processing module according to the subject.
  • Applicant-based weight pre-processing module
  • If the applicant of a document concentrates on the technical field to which the document belongs, the application is likely to be important.
  • The technical field on which the applicant of a specific document concentrates can be measured with analysis elements such as the concentration rate, occupancy rate, and AI in the technical field.
  • The applicant-based weight pre-processing module of the present invention recognizes the numerical value of each weight granting element and grants a weight score according to each weight granting element in the analysis element view.
  • A method of processing weight by the applicant-based weight pre-processing module is illustrated in FIG. 84.
  • the applicant-based weight pre-processing module obtains reference information for processing weight based on the applicant (S3420), obtains a result value of the document in the name of the applicant conforming with the reference calculated by at least one document-based weight pre-processing module (S3430), obtains weight information according to the document-based weight pre-processing module from the patent information processing policy DB (S3440), generates a reference-based applicant-based weight value obtained with the respective weight and respective result values of the document-based weight pre-processing module (S3450), and stores the generated weight value in the patent DB or search index together with the unique document number of the patent document information, together with the reference or independently (S3460).
  • If the inventor of a document concentrates on the technical field to which the document belongs, the application is likely to be important.
  • The technical field on which the inventor of a specific document concentrates can be measured, on the basis of the inventor, with analysis elements such as the concentration rate, occupancy rate, and AI in the technical field (in the formula, the inventor is substituted for the applicant in the applicant-reference formula).
  • the inventor-based weight pre-processing module of the present invention recognizes the numerical value of the weight granting element and grants the weight score according to the respective weight granting elements in the analysis element view.
  • the inventor-based weight pre-processing module obtains reference information for processing weight based on the inventor (S3520), obtains a result value of the document in the name of the inventor conforming with the reference calculated by at least one document-based weight pre-processing module (S3530), obtains weight information according to the document-based weight pre-processing module from the patent information processing policy DB (S3540), generates a reference-based inventor-based weight value obtained with the weight of the respective document-based weight pre-processing modules and the result values (S3550), and stores the generated weight value in the patent DB or search index together with the unique document number of the patent document information, together with or independent of the reference.
  • The weight information may include a policy such as "registration rate in comparison with total previous applications*0.1 + the number of foreign application families*0.5".
  • Agent-based weight pre-processing module
  • If the agent of a document concentrates on the technical field to which the document belongs, the specification written by that agent is likely to be more in-depth than one written by another agent.
  • The technical field on which the agent of a specific document concentrates can be measured, on the basis of the agent, with analysis elements such as the concentration rate, occupancy rate, and AI in the technical field (in the formula, the agent is substituted for the applicant in the applicant-reference formula).
  • the agent-based weight pre-processing module of the present invention recognizes the numerical value of the weight granting element and grants the weight score according to the respective weight granting elements in the analysis element view.
  • The weight based on the agent is less important than the weight based on the applicant or the inventor.
  • The agent-based weight pre-processing module obtains reference information for processing weight based on the agent (S3620), obtains a result value of the document in the name of the agent corresponding to the reference calculated by at least one document-based weight pre-processing module (S3630), obtains weight information according to the document-based weight pre-processing module from the patent information processing policy DB (S3640), generates a reference-based agent-based weight value obtained with the weight of the respective document-based weight pre-processing modules and the result values (S3650), and stores the generated weight value in the patent DB or search index together with the unique document number of the patent document information, together with or independent of the reference (S3660).
  • the weight information may include the policy, such as "registration rate in comparison with applications*0.3".
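  • A policy written in the form quoted above can be evaluated with a small parser. The sketch below assumes that a policy is a sum of "name*coefficient" terms separated by "+"; the storage format of the patent information processing policy DB is not specified in the text, and the function names are hypothetical.

```python
from typing import Dict, List, Tuple

def parse_policy(policy: str) -> List[Tuple[str, float]]:
    """Split 'name*coef + name*coef' into (name, coef) pairs."""
    terms = []
    for term in policy.split("+"):
        # Split on the last '*' so that the element name may contain spaces.
        name, coef = term.rsplit("*", 1)
        terms.append((name.strip(), float(coef)))
    return terms

def apply_policy(policy: str, values: Dict[str, float]) -> float:
    """Weighted sum of the measured values named in the policy."""
    return sum(coef * values[name] for name, coef in parse_policy(policy))

agent_policy = "registration rate in comparison with applications*0.3"
# Hypothetical measured value: half of the agent's applications were registered.
agent_weight = apply_policy(
    agent_policy, {"registration rate in comparison with applications": 0.5}
)
```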
  • The document-based weight pre-processing module and the subject-based weight pre-processing module of the present invention grant preset weights to the weight granting elements obtained from at least one weight pre-processing module and calculate the weight per document.
  • The weight granting modules for the respective views and for the respective subjects grant preset weights to the weight granting elements they obtain themselves and calculate the weight per document in their own view.
  • the respective weights can be variously granted according to the input of the administrator or user.
  • Subject master DB generation module 301-4
  • Kind of subject
  • the subject in the patent document generally includes the 1) applicant, 2) inventor, and 3) agent.
  • the applicant includes an organization, such as a company, and an individual.
  • the DB storing and managing the subject in the present invention is referred to as subject master DB 204, and the module of generating the subject master DB 204 is referred to as the subject master DB generation module 301-4.
  • the subject master DB 204 further includes represented applicant's name DB, represented inventor's name DB, and represented agent's name DB, and independently further includes company information DB, which will be hereinafter described.
  • the subject master DB generation module 301-4 of the present invention includes a name-representation pre-processing module 301-4-1.
  • the name-representation pre-processing module includes at least one of an applicant's name representation pre-processing module 301-4-1-1 for processing the representation of the applicant's name, an inventor's name representation pre-processing module 301-4-1-2 for processing the representation of the inventor's name, and an agent's name representation pre-processing module 301-4-1-3 for processing the representation of the agent's name.
  • the subject master DB generation module 301-4 of the present invention further includes a company information DB generation module.
  • the company information DB includes at least one of financial information of the company, company status information, accounts information, a main product, a name of a representative, a company registration number, a taxpayer identification number, a web-site, a telephone number, and a fax number.
  • the subject master DB 204 particularly further includes company information DB 204-1, represented applicant's name DB 204-2, represented inventor's name DB 204-3, represented agent's name DB 204-4, and/or organization information DB 204-5.
  • The representing phrase refers to a word, or a phrase of at least two words, capable of representing an entire specific document or a part of it.
  • The basic attribute of the representing phrase of the present invention is that it rarely appears in the preset representing phrase extraction entire document set but appears at or above a preset reference frequency in the given representing phrase extraction target document set.
  • a kind of preset representing phrase extraction entire document set includes 1) a patent document set generated by a method of specifying a patent document set with the specific patent classification code in the specific classification code system in the specific nation DB, 2) a patent document set generated by a method of specifying a patent document set with the specific applicant's name in the specific nation DB, 3) a patent document set generated by a method of specifying a patent document set with the specific inventor's name (i.e.
  • the representing phrase extraction target document set can be any subset of the representing phrase extraction entire document set (a subset including the documents in which at least one specific attribute of one document (e.g. the specific classification code, such as IPC, applicant, term, nation, agent or inventor, or at least one combination thereof) is common).
  • The representing phrase extraction target document set (the target set) is the counterpart of the representing phrase extraction entire document set (a superset including the target set). Any subset of the representing phrase extraction entire document set is sufficient, but the smaller the subset, the better.
  • Examples of the representing phrase extraction target document set include a single specific application, every application of a specific company, the applications of a specific inventor, the applications of a specific inventor in a specific IPC, every application of a specific company in a specific IPC, and every application of a specific year in a specific IPC.
  • a part of the representing phrase extraction target document includes the claims, the independent claims of the claims, the dependent claims of the claims, the effect of the invention or industrial applicability, and abstract.
  • the common part of the respective documents of the preset representing phrase extraction target document set can be the aforementioned claims of every document included in the representing phrase extraction target document set.
  • The representing phrase includes 1) a phrase constructed of nouns only, 2) a phrase constructed of a noun and a verb, and 3) a phrase that also includes an adjective or adverb. The length of the representing phrase is preferably 1 to 5 words, and more preferably 2 to 3 words, because a technical concept may be expressed with one word, but in many cases a technical concept, technical action, or technical effect is expressed with 2 to 3 words. Next, the step of extracting the representing phrase will be described in detail.
  • the representing phrase extraction pre-processing module extracts the representing phrase.
  • the representing phrase extraction pre-processing module of the present invention includes 1) a language-based morpheme analysis engine (analyzer), 2) a phrase generating engine, 3) a phrase-based counting engine, 4) various dictionaries DB including a thesaurus dictionary/patent technology term dictionary, etc., and 5) a representing phrase extracting engine.
  • the representing phrase extraction pre-processing module includes a representing phrase extracting pre-processing engine 3710 including an algorithm of extracting the representing phrase, representing phrase extraction policy DB 3730 including information on a policy serving as a reference for extracting the representing phrase, and representing phrase-frequency-application number correspondence DB 3750 including information on which location (field, etc.) of which document includes various counting or calculation information, such as a frequency value of the representing phrase, and the representing phrase when the representing phrase is extracted.
  • the representing phrase extraction pre-processing module further includes thesaurus DB 3770 for processing a synonym and representing phrase translation DB 3790 including translation information on the representing phrase using a translation system for processing at least two kinds of languages or a pre-translated dictionary.
  • the representing phrase extracting pre-processing engine of the present invention will be described in more detail.
  • the representing phrase extracting pre-processing engine extracts a terminology when a sentence or a paragraph of the specific field is inputted.
  • The phrase generating engine of the present invention generates a phrase array in which each phrase includes 1 to 5 words (preferably 2 to 3 words). For example, assume that the words output when one sentence of a patent document is input are a, b, c, d, e, f, g, in sequence. If the phrase generating unit of the phrase generating engine is 3 words, the phrase generating engine generates the word-based phrases abc, bcd, cde, def, and efg. The phrase generating engine generates phrases sentence by sentence for the given document or for a part of the document. As an example of generating phrases for a part of the document, phrases can be generated from the sentences included in the claims.
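  • The sequential three-word example can be reproduced with a sliding window over the words of one sentence. This is a minimal sketch; the function name and the `sep` parameter are hypothetical, and the morpheme analysis that produces the word list is assumed to have run already.

```python
from typing import List

def sequential_phrases(words: List[str], size: int, sep: str = " ") -> List[str]:
    """Return every run of `size` consecutive words, joined by `sep`."""
    return [sep.join(words[i:i + size]) for i in range(len(words) - size + 1)]

# The words a..g of the example sentence, joined without a separator
# so that the output matches the notation used in the text.
three_word = sequential_phrases(list("abcdefg"), 3, sep="")
```

  • A sentence shorter than the phrase unit simply yields no phrase, because the range of start positions is empty.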
  • Methods of generating the phrase array include 1) in units of one sentence, if n words are extracted from a single sentence, generating the nC2, nC3, nC4, and nC5 combinations of the n words (e.g. if nC2 is applied, the phrases generated are ab, ac, ad, ae, af, ag, bc, ... fg), 2) if there are m words in the unit of the document, applying method 1) in units of the document, and 3) taking 2 to 5 words in sequence (e.g. if two words are sequentially arrayed, the phrase array becomes ab, bc, cd, de, ef, fg, and if three words are sequentially arrayed, the phrase array becomes abc, bcd, cde, def, efg). Of course, arrays including both two-word and three-word phrases can be used.
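  • The combination-based method 1) can be sketched with the standard library, assuming the words of one sentence are already available as a list. The function name is hypothetical; `itertools.combinations` preserves the original word order inside each combination.

```python
from itertools import combinations
from typing import List, Sequence

def combination_phrases(words: List[str], sizes: Sequence[int] = (2, 3, 4, 5),
                        sep: str = " ") -> List[str]:
    """Return the nCk combinations of the words for every k in `sizes`."""
    return [sep.join(combo) for k in sizes for combo in combinations(words, k)]

# nC2 over the seven words a..g gives 21 two-word phrases.
pairs = combination_phrases(list("abcdefg"), sizes=(2,), sep="")
```

  • The number of combinations grows quickly with n, so applying this per document (method 2) is far more expensive than the sequential method 3).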
  • A phrase that has passed through the morpheme analyzer is named a term.
  • Terms can differ according to the setting values of the morpheme analyzer.
  • Examples of the setting values are that 1) only uninflected words are separated, 2) only inflected words are separated and only their citation forms are extracted, and 3) only citation forms are extracted for adjectives or adverbs.
  • the phrase-based counting engine essentially generates (phrase, document number) or (phrase, counting value) information according to the generated phrase, preferably (phrase, counting value, document number) or (phrase, document number, location field name (e.g.
  • the phrase-based counting engine stores the generated information in the representing phrase DB 207-2.
  • Table 5 represents an example of data included in the representing phrase DB 207-2.
  • Table 5 represents the example of data generated by means of the specific word according to the countries. [Table 5]
  • The data represented in Table 5 are generated from patent documents 10-2003-0012345 and 10-2003-0056789, and it can be seen that the phrase generating engine generates phrases in units of 3 words. It can further be seen that the phrase abc appears two times in the claims and five times in the detailed description of the invention, and that the phrase bcd appears three times in the claims and four times in the detailed description of the invention. In patent document 10-2003-0056789, the phrase abc appears one time in the claims and two times in the detailed description of the invention. In this way, the phrase-based counting engine of the present invention can generate the data of Table 5 for every repetitively or recursively obtained patent document.
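  • The (phrase, counting value, document number, location field name) records described above can be sketched with a counter keyed by phrase, document number, and field. The sample data follow the counts quoted in the text, under the assumption that the first set of counts belongs to document 10-2003-0012345; the field names and the function name are hypothetical.

```python
from collections import Counter
from typing import Dict, List

def count_phrases(docs: Dict[str, Dict[str, List[str]]]) -> Counter:
    """Count occurrences per (phrase, document number, field name)."""
    counts: Counter = Counter()
    for doc_no, fields in docs.items():
        for field, phrases in fields.items():
            for phrase in phrases:
                counts[(phrase, doc_no, field)] += 1
    return counts

# Phrases already generated per field; repetition stands for repeated occurrence.
docs = {
    "10-2003-0012345": {
        "claims": ["abc"] * 2 + ["bcd"] * 3,
        "description": ["abc"] * 5 + ["bcd"] * 4,
    },
    "10-2003-0056789": {
        "claims": ["abc"],
        "description": ["abc"] * 2,
    },
}
counts = count_phrases(docs)
```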
  • The phrase generating engine can also generate phrases in units of two words, and it is obvious that the phrase-based counting engine of the present invention generates the data of Table 11 for phrases generated in units of two words. Further, the phrase-based counting engine generates the data of Table 5 from phrases generated in units of one word or four to five words.
  • Assigning an ID to each phrase may be an issue.
  • Methods of generating an ID for each phrase are well known in typical DB technology, but several exemplary methods are suggested here.
  • In the first method, an ID is granted sequentially to the initially generated phrase; each subsequently generated phrase is searched for among the existing phrases, and a new ID is granted if it does not exist, while the existing ID is granted if it does.
  • This process is performed repetitively and recursively for every obtained phrase.
  • In the second method, a temporary ID is granted sequentially to every obtained phrase instead of a formal ID, the data of Table 11 is generated, and an identical formal ID is then granted to identical phrases while reading every phrase.
  • This process is performed repetitively and recursively for every phrase that has not yet been granted a formal ID.
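  • The first method amounts to a lookup table that hands out the next sequential ID only when a phrase has not been seen before. A minimal sketch, with a hypothetical function name and an in-memory dictionary standing in for the DB lookup:

```python
from typing import Dict, Iterable, List, Tuple

def assign_ids(phrases: Iterable[str]) -> List[Tuple[int, str]]:
    """Grant a new sequential ID to unseen phrases and reuse the ID otherwise."""
    ids: Dict[str, int] = {}
    result = []
    for phrase in phrases:
        if phrase not in ids:
            ids[phrase] = len(ids) + 1  # next unused ID, starting from 1
        result.append((ids[phrase], phrase))
    return result

tagged = assign_ids(["abc", "bcd", "abc", "cde"])
```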
  • The phrase-based counting engine of the present invention can generate a phrase absolute ID whenever a word is generated and processed, and grant the generated phrase absolute ID to that word.
  • the phrase-based counting engine of the present invention can generate a word-based accumulated counting value on a basis of the target document set. It is obvious that in the target document set, the accumulated counting value can be calculated based on one document as represented in the above table, and also based on the specific field (e.g. the claim) within one document.
  • The phrase-based counting engine of the present invention can generate a word-based accumulated counting value based on the entire document set and store it in the manner of Table 6. (In the above example, it is assumed that the entire document set consists of patent documents 10-2003-0012345 and 10-2003-0056789, and the numerical values are entered based on the above table; in practice, the inventive idea of the present invention is applied to a large entire document set.) [Table 6]
  • The phrase-based counting engine of the present invention can generate the respective phrase-based accumulated counting values as represented in Table 7 with reference to a specific field (e.g. the claims) in the document.
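  • The accumulated counting values over the entire document set, and over one field only, can be sketched as a roll-up of per-document, per-field counts. The rows below reuse the counts quoted for the two example documents, assuming the first set of counts belongs to document 10-2003-0012345; the row layout and the function name are hypothetical.

```python
from collections import Counter
from typing import Iterable, Optional, Tuple

Row = Tuple[str, str, str, int]  # (phrase, document number, field name, count)

def accumulate(rows: Iterable[Row], field: Optional[str] = None) -> Counter:
    """Sum counts per phrase over all rows, optionally restricted to one field."""
    totals: Counter = Counter()
    for phrase, _doc_no, row_field, n in rows:
        if field is None or row_field == field:
            totals[phrase] += n
    return totals

rows = [
    ("abc", "10-2003-0012345", "claims", 2),
    ("abc", "10-2003-0012345", "description", 5),
    ("bcd", "10-2003-0012345", "claims", 3),
    ("bcd", "10-2003-0012345", "description", 4),
    ("abc", "10-2003-0056789", "claims", 1),
    ("abc", "10-2003-0056789", "description", 2),
]
entire_set_totals = accumulate(rows)            # whole documents
claims_totals = accumulate(rows, field="claims")  # claims only
```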
  • Representing phrase extracting engine of the present invention
  • If (phrase, document number) data extracted from the representing phrase extraction entire document set are available, it is easy to generate from them the (phrase, counting value, document number) data as indicated in Table 8.
  • the (phrase, counting value, document number) data can be generated by increasing the counting value by 1 if an identical word is present.
  • the representing phrase extracting engine of the present invention can generate (phrase, counting value, document number) data on a basis of (phrase, document number) data and (phrase, counting value, document number, location field name in the document number) data on a basis of (phrase, document number, location field name in the document number) data.
  • Such a process can be performed for at least one representing phrase extraction entire document set that is the mother set of the representing phrase extraction target document set, as well as for every document included in at least one representing phrase extraction target document set.
  • The representing phrase extracting engine can generate (phrase, document number) data or (phrase, document number, location field name in the document number) data for the representing phrase extraction entire document set including 1) every document included in one nation DB, 2) every document corresponding to at least one given patent classification code, 3) every document of the specific applicant, 4) every document of the specific inventor, 5) every document generated by limiting the term with respect to each of 1) to 4), or 6) every document generated by combining 1) to 5) (including any set operation, such as union, difference, or intersection).
  • the representing phrase extracting engine can generate (phrase, document number) data or (phrase, document number, location field name in the document number) data for every document corresponding to any subset of the representing phrase extraction entire document set.
  • For at least one preset representing phrase extraction target document set of at least one representing phrase extraction entire document set, the representing phrase extracting engine calculates, for each phrase, 1) the frequency F in the preset representing phrase extraction target document set and 2) the total frequency T in the representing phrase extraction entire document set that is the mother set of the representing phrase extraction target document set. Of course, the frequency A of all phrases can be calculated in the representing phrase extraction entire document set, and the frequency AT of all phrases can be calculated in the representing phrase extraction target document set.
  • the representing phrase extraction engine extracts the representing phrase of the representing phrase extraction target document set given in the preset representing phrase extraction entire document set referring to the representing phrase extraction policy DB.
  • The representing phrase extraction policy is whether the probability value in (phrase, probability value of the phrase) falls within a preset reference range under a preset condition.
  • Examples of the probability value are the F/T, F/A, and F/AT values of a specific phrase.
  • The representing phrase extraction policy may also be whether 1) a variation value (increase/decrease rate, increase/decrease speed) of the F, T, A, and AT values or 2) a variation value (increase/decrease rate, increase/decrease speed) of the F/T, F/A, and F/AT values satisfies a preset reference range.
  • Under the preset condition of the representing phrase extraction policy, a different reference range can be assigned to the probability value in (phrase, probability value of the phrase) depending on the term, nation, applicant, inventor, at least one patent classification code, or document set sharing a preset attribute.
  • For example, different reference ranges can be applied to the probability values of phrases extracted from IPC section H, which includes many patent applications, and to those of phrases extracted from IPC section D, which includes few patent applications.
  • a different reference can be flexibly applied to 1) the variation value (increase/decrease rate, increase/decrease speed) of F, T, A, AT value and 2) the variation value (increase/decrease rate, increase/decrease speed) of F/T, F/A, F/AT value according to the attribute of the document set (representing phrase extraction entire document set, representing phrase extraction target document set) from which the phrase is extracted.
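  • The F/T check with attribute-dependent reference ranges can be sketched as follows. F is the frequency of a phrase in the target document set and T its total frequency in the entire document set, as defined above. The frequencies, the section-specific ranges, and the function names are hypothetical values chosen only for illustration.

```python
from typing import Dict, List, Tuple

def ratio_f_over_t(target_freq: Dict[str, int], entire_freq: Dict[str, int]) -> Dict[str, float]:
    """Compute F/T per phrase, skipping phrases with no total frequency."""
    return {p: f / entire_freq[p] for p, f in target_freq.items() if entire_freq.get(p)}

def candidates(ratios: Dict[str, float], low: float, high: float) -> List[str]:
    """Phrases whose F/T lies inside the preset reference range [low, high]."""
    return sorted(p for p, r in ratios.items() if low <= r <= high)

target_freq = {"abc": 3, "bcd": 3}   # F: frequency in the target document set
entire_freq = {"abc": 10, "bcd": 7}  # T: frequency in the entire document set
ratios = ratio_f_over_t(target_freq, entire_freq)

# Hypothetical reference ranges that differ by IPC section.
ranges: Dict[str, Tuple[float, float]] = {"H": (0.4, 1.0), "D": (0.2, 1.0)}
section_h = candidates(ratios, *ranges["H"])
section_d = candidates(ratios, *ranges["D"])
```

  • The same phrase set yields different candidates under the two ranges, which is the effect of attaching the reference range to an attribute of the document set.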
  • The representing phrase extracting engine of the present invention associates a probability value (frequency of the specific phrase / total number of phrases) with each phrase ID on the basis of the total phrase frequency, selects the phrases falling within a preset probability value range as candidate representing phrases, refers to a preset representing phrase selection rule, and selects the representing phrases for the target document set. Obviously, the frequency can be calculated for each specific field. In the example of Table 8, the phrase-based frequency can be counted per document number, and the representing phrases satisfying the preset representing phrase extraction policy can be extracted per document number. [Table 8]
  • The (phrase, document number) data is essentially equivalent to the (phrase absolute ID, phrase, document number) data.
  • The phrase ID corresponds to the respective absolute phrase ID, the document numbers containing the phrase correspond to each absolute phrase ID, and the bibliographic particulars correspond to the respective document numbers. Therefore, the document numbers and the phrase ID-based frequency per document number are obtained from the document numbers in the target document set.
  • In place of the document number, the IPC of a preset level, a specific applicant, or a specific inventor can be used, and the phrase ID-based frequency for the target document set can be counted in each case.
  • the counting of the phrase ID-based frequency according to the target document set can be implemented through a join command according to the table in the DBMS.
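  • Such a join can be sketched with an in-memory SQLite database. The table and column names (`phrase_doc`, `biblio`) and the sample rows are hypothetical; the text does not specify a schema or a particular DBMS.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE phrase_doc (phrase_id INTEGER, doc_no TEXT);
CREATE TABLE biblio (doc_no TEXT PRIMARY KEY, ipc TEXT, applicant TEXT);
""")
# One row per (phrase ID, document number) occurrence.
conn.executemany("INSERT INTO phrase_doc VALUES (?, ?)",
                 [(1, "d1"), (1, "d2"), (2, "d2"), (1, "d3")])
# Bibliographic particulars keyed by document number.
conn.executemany("INSERT INTO biblio VALUES (?, ?, ?)",
                 [("d1", "H04B", "A"), ("d2", "H04B", "B"), ("d3", "G06F", "A")])

# Phrase ID-based frequency with the IPC taking the place of the document number.
rows = conn.execute("""
SELECT b.ipc, p.phrase_id, COUNT(*) AS freq
FROM phrase_doc AS p
JOIN biblio AS b ON b.doc_no = p.doc_no
GROUP BY b.ipc, p.phrase_id
ORDER BY b.ipc, p.phrase_id
""").fetchall()
```

  • Grouping by `b.applicant` instead of `b.ipc` gives the applicant-based frequency with the same join.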
  • Suppose that the selection rules of the representing phrase extraction policy included in the representing phrase extraction policy DB are that 1) the frequency (T) with respect to the total number of phrases (e.g. 1,000,000,000) is less than 1/1000%, 2) the frequency in the target document number is equal to or over 1/1,000,000%, and 3) the ratio of the frequency T to the total frequency TA ranges from 1% to 10%. It is then examined whether each phrase can be a representing phrase of the respective document numbers.
  • The phrase abc cannot be a representing phrase for any of document numbers #1 to #4 (it violates rule 1)), the phrase bcd can be one only for document number #2 (it violates rule 2) for document numbers #1 and #4), and the phrase cde can be one only for document number #4 (it violates rule 3) for document number #1).
  • The representing phrases of document number #1 do not include the phrases abc, bcd, and cde
  • the representing phrase of document number #3 does not include the phrase of abc
  • the representing phrase of document number #4 includes the phrase of cde.
  • The representing phrase extraction pre-processing module of the present invention can generate the phrase-based counting data by using an indexer 401-3 of the search engine and the index, and can extract, with the aforementioned method and on the basis of the phrase-based counting data, the representing phrase from the content of each document, of the preset document set, and of a specific field or all the specific fields extracted from the preset document set.
  • The phrase bcd can be associated with document number #2 as its representing phrase.
  • the representing phrase can be stored in the unit of the representing phrase extraction target document set or in the unit of the individual document included in the representing phrase extraction target document set. That is, at least one representing phrase can be corresponded to one representing phrase extraction target document set unit or the individual document. At least one representing phrase corresponds in the unit of the individual field of the representing phrase extraction target document set or in the unit of the individual filed of the individual document included in the representing phrase extraction target document set.
  • the phrase of bed is corresponded to document number #2 to be stored. At this time, when document number #2 is called, the phrase of bed can be indicated.
  • the specific document number is assigned too many representing phrase. That is, there may be 100 representing phrases satisfying the above exemplary rule for the specific document, and in this case, 4) a rule of extracting 10 to 30 representing phrase according to a higher sequence of the frequency calculated for every representing phrase is applied so as to limit the number of the representing phrase. In the meantime, if there may be one or no representing phrase for the specific document, above rule 3) in the exemplary rules is relieved and applied to this case so as to maintain 10 to 30 numbers of the representing phrases. If the above procedure is repeatedly performed for every document or every documents set, the representing phrase within the preset number range (e.g. 10 to 30) can be corresponded to every document.
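  • The limiting step can be sketched as follows. This is an illustrative sketch only: the function name, the two predicates standing in for the strict and relaxed rule sets, and the choice to fall back whenever fewer than the lower bound remain are assumptions, not details taken from the specification.

```python
def select_representing(freqs, strict, relaxed, lo=10, hi=30):
    """Pick representing phrases for one document.

    freqs   -- dict mapping phrase -> frequency in the document
    strict  -- predicate standing in for rules 1) to 3)
    relaxed -- predicate with rule 3) relaxed (hypothetical)
    """
    ranked = sorted(freqs, key=lambda p: (-freqs[p], p))  # highest frequency first
    chosen = [p for p in ranked if strict(p)]
    if len(chosen) < lo:
        # Too few candidates: fall back to the relaxed rule set.
        chosen = [p for p in ranked if relaxed(p)]
    return chosen[:hi]  # rule 4): cap at the upper bound


# 100 candidates that all pass the strict rules are cut down to the top 30.
many = {f"p{i:03d}": i for i in range(100)}
top = select_representing(many, strict=lambda p: True, relaxed=lambda p: True)

# Only one phrase passes the strict rules, so the relaxed predicate is used instead.
few = {f"q{i:02d}": i for i in range(12)}
kept = select_representing(few, strict=lambda p: p == "q11", relaxed=lambda p: True)
```

  • Note that relaxing a rule cannot guarantee the lower bound: if fewer than 10 phrases pass even the relaxed rules, the sketch simply returns what is available.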
  • the preset number range e.g. 10 to 30
  • it is possible to obtain the representing phrase corresponding to the document number.
  • the representing phrase and the calculation value for the representing phrase (e.g. the frequency in the document of the document number, the frequency in the total documents, etc.)
  • the representing phrase and the calculation value for the representing phrase are treated as an example of the bibliographic particulars for the document number so as to be utilized for various analyses.
  • the frequency is calculated in units of the document, but the frequency can also be calculated in units of the content included in 1) the document set, 2) a field within the document (the claims, etc.), or 3) a field within every document included in the document set (e.g. claim 1 of the claims in every document corresponding to IPC H04B 7/02 filed in Korea from 2000 to 2005).
  • when the field is included within one document, it can be treated in the same way as one document (in terms of document number correspondence).
  • the reason for using the search engine is that a search engine generally counts occurrences of search terms very efficiently, so that the data indexed by the indexer 401-3 can be easily converted into data for the DB.
  • the indexer 401-3 stores the obtained term, a document number including the term, or a field name representing the term, case by case. For example, (term 1, #1) represents that term 1 is obtained from the document number #1, and (CL:term 1, #1) represents that the term 1 is obtained from claims (Claim, CL) of document number #1.
  • the index of the search engine stores a great number of entries, such as (term 1, #1) or (CL:term 1, #1), and if term 1 is inputted as the search word, or term 1 is inputted with the search field limited to the claims, document number #1 is outputted as the search result. Meanwhile, the search engine very rapidly calculates how many times term 1 occurs in the entire documents or in the claims (typically, the search engine first returns the total number of search results, and then shows the first part of them).
  • the phrase generating engine of the present invention receives the n terms that have passed through the morpheme analysis engine (analyzer), combines the received n terms in a preset manner, and transmits the combined terms to the indexer 401-3 of the search engine together with the document number (at this time, if the indexer has already obtained the document number, it is not necessary to transmit the document number; the document number is included and transmitted as the source of the transmitted combined terms).
  • a method of combining the terms includes any one of 1) combining, in units of one sentence, all the terms obtained from a single sentence in groups of 2 to 5 terms in every available way, 2) combining all the terms included in the entire document in groups of 2 to 5 terms in every available way (in this case, the number of combinations is very large, so processing takes much time, but the finest-grained word set can be generated), and 3) sequentially combining the terms in groups of 2 to 5 terms within a single sentence.
  • processing in units of a paragraph or of a specific field can be treated in the same way as processing the entire document.
  • the phrase generating engine of the present invention obtains the outputted words and generates a phrase array including 1 to 5 words (preferably, 2 to 3 words). For example, if one sentence included in a single patent document is inputted, the words outputted by the morpheme analysis are assumed to be a, b, c, d, e, f, and g in sequence. At this time, if the phrase generating unit of the phrase generating engine is 3 words, the phrase generating engine sequentially generates the phrases abc, bcd, cde, def, and efg, or generates phrases of two words, such as ab, ac, ad, ae, af, ag, bc, ..., and fg, as previously described.
  • the phrase generating engine transmits the phrases (multiple terms), such as abc, bcd, or ab, ac, to the indexer 401-3.
  • the indexer 401-3 stores the transmitted phrases (multiple terms) in the form of (abc, #1), (bcd, #1), (ab, #1), (ac, #1), etc.
  • the indexer 401-3 can also include the specific field name in which the phrase (multiple terms) appears, such as (CL : abc, #1), (CL : bcd, #1), (CL : ab, #1), (CL : ac, #1), so as to store them in the index.
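  • The two generation modes and the resulting index entries can be sketched as follows. This is a minimal illustration in which the words are joined without separators, as in the a to g example; the function names are hypothetical and the real engine's data structures are not specified in the text.

```python
from itertools import combinations


def sequential_phrases(words, n):
    """Consecutive n-word windows, e.g. abc, bcd, cde for n=3."""
    return ["".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def combined_phrases(words, n):
    """All n-word combinations from one sentence, keeping word order."""
    return ["".join(c) for c in combinations(words, n)]


words = list("abcdefg")                    # morpheme analysis output a, b, ..., g
trigrams = sequential_phrases(words, 3)    # abc, bcd, cde, def, efg
pairs = combined_phrases(words, 2)         # ab, ac, ..., fg

# Entries as handed to the indexer: (phrase, document number),
# optionally prefixed with a field name such as CL for the claims.
index_entries = [(p, "#1") for p in trigrams + pairs]
claim_entries = [(f"CL:{p}", "#1") for p in trigrams]
```

  • Seven words yield 5 sequential three-word phrases but 21 two-word combinations, which illustrates why the exhaustive combination modes grow quickly with sentence or document length.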
  • Table 9 below is a conceptual table illustrating an example of the index. [Table 9]
  • the phrase abc is present three times in the detailed description and two times in the claims of document number #1
  • the phrase bcd is present one time in the detailed description and one time in the claims of document number #1
  • the phrase of ac is present one time in the detailed description of document number #1
  • the phrase of abc is present two times in the detailed description and one time in the claim of document number #2.
  • the phrase-based counting engine of the present invention receives the data of the index and generates the data relating to the number of phrases (multiple terms) and the document number serving as the source of the phrase.
  • the generated data is identical or corresponding to the data generated by the aforementioned method, and the example for the index is shown in Table 10.
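  • The counting step can be illustrated with the occurrences listed above for document numbers #1 and #2. This is a sketch under assumptions: the field label DESC for the detailed description and the function name are hypothetical, and a real search engine index would not be held in a Python Counter.

```python
from collections import Counter

# (field, phrase, document number) -> occurrences, following the counts described above.
index = Counter({
    ("DESC", "abc", "#1"): 3, ("CL", "abc", "#1"): 2,
    ("DESC", "bcd", "#1"): 1, ("CL", "bcd", "#1"): 1,
    ("DESC", "ac", "#1"): 1,
    ("DESC", "abc", "#2"): 2, ("CL", "abc", "#2"): 1,
})


def count_by_document(index, field=None):
    """Aggregate to (phrase, document) counts, optionally limited to one field."""
    out = Counter()
    for (f, phrase, doc), n in index.items():
        if field is None or f == field:
            out[(phrase, doc)] += n
    return out


per_doc = count_by_document(index)
claims_only = count_by_document(index, field="CL")
total_abc = sum(n for (phrase, _), n in per_doc.items() if phrase == "abc")
```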
  • the representing phrase extraction preprocessing module of the present invention generates the phrase-based counting data by utilizing the indexer 401-3 of the search engine and the index and extracts the representing phrase from the contents according to the documents, the preset document sets, the specific field, or every specific field extracted from the preset document set.
  • the representing phrase can be stored in the unit of the representing phrase extraction target document set or in the unit of the individual document included in the representing phrase extraction target document set. That is, at least one representing phrase can correspond to one representing phrase extraction target document set or the individual document. At least one representing phrase can correspond to the unit of the individual field of the representing phrase extraction target document set or the unit of the individual field of the individual document included in the representing phrase extraction target document set.
  • the representing phrase extracting engine of the present invention also generates the document number information corresponding to the phrase. The purpose is that, if the document number corresponds to the phrase, it is easy to extract the document set in which the phrase serves as the representing phrase. If that document set can be easily extracted, it can be designated as the to-be-analyzed document set and utilized for various analyses.
  • for one document set, the various analyses supported by the analysis module and the patent intelligence module 60 of the present invention can be performed, including 1) analysis by various indexes, such as the number of applications, the number of registrations, the registration rate, the occupancy rate, and the concentration rate, by nation, year, or applicant, 2) distribution analysis over the multiple stages (depths) of the IPC or another patent classification code, by nation, year, or applicant, and 3) citing and cited analysis for detailed document sets (by applicant or patent classification code) within the document set. Further, if the document number does not correspond to the phrase and documents including a specific phrase are instead extracted through the search engine, it may be impossible to discriminate whether the specific phrase is a representing phrase of those documents.
  • the representing phrase information pre-processing method is well illustrated in FIGs. 105 to 107. Referring to FIG. 105, a method of generating the combination for extracting the representing phrase through the representing phrase extraction pre-processing module will be briefly described.
  • the representing phrase extraction pre-processing module obtains the document set including at least two patent documents (S5520), processes the content according to the entire individual patent documents or various fields extracted from the document set with the morpheme analyzer so as to extract at least two phrases (term or terminology) (S5530), generates all the combinations of the phrase with the preset method based on the preset reference (S5540), and stores the generated combination of the phrase in the DB or transmits it to the search indexer 401-3 together with the patent document information (S5550), so as to generate the combination of the phrase for extracting the representing phrase.
  • a method of generating the combination of phrases for extracting the representing phrase according to the field or the partial document set through the representing phrase extraction pre-processing module, and of extracting the combinations satisfying the preset condition, will be briefly described with reference to FIG. 106.
  • the representing phrase extraction pre-processing module obtains the document set including at least two patent documents (S5620), processes the content of the entire individual patent documents or of various fields extracted from the document set with the morpheme analyzer so as to extract at least two phrases (terms or terminology) (S5630), generates all the combinations of the phrases with the preset method based on the preset reference (S5640), stores the generated combinations in the DB or transmits them to the search indexer 401-3 together with the patent document information (S5650), obtains the bibliographic particulars of the respective documents included in the document set and generates a document subset based on at least one reference (S5660), and obtains the number of phrase combinations limited to the document subset and the number of phrase combinations included in the entire document set so as to extract the phrase combinations satisfying the preset reference for the entire document or for various fields (S5670).
  • a method of generating the combination for extracting the representing phrase through the representing phrase extraction pre- processing module and comparing the generated combination according to the respective document sets will be briefly described with reference to FIG. 107.
  • the representing phrase extraction pre-processing module obtains multiple document sets each including at least two patent documents (S5720), processes the contents of the entire individual patent documents or of various fields extracted from the document sets with the morpheme analyzer, for the respective document sets, so as to extract at least two phrases (terms) (S5730), generates the combinations of the extracted phrases (S5740), and stores the generated combinations in the DB or transmits them to the search indexer 401-3 together with the patent document information
  • the multi-patent classification code correlation pre-processing module of the present invention is intended to find and use the correlation hidden between the multiple patent classification codes, which have been granted to a single patent document.
  • the multi-patent classification code correlation pre-processing module includes a homogeneous multi-patent classification code correlation pre-processing module and a heterogeneous multi-patent classification code correlation pre-processing module.
  • the homogeneous multi-patent classification code correlation pre-processing module is intended to find the correlation hidden between homogeneous patent classification codes given to a single patent document based on one of the IPC, UPC, and FT (i.e.
  • FIG. 60 shows the configuration of the multi-patent classification code correlation pre-processing module, the configuration of the homogeneous multi-patent classification code correlation pre-processing module, and the configuration of the heterogeneous multi-patent classification code correlation pre-processing module.
  • the homogeneous multi-patent classification code correlation pre-processing module of the present invention will be described. Even when a single document is granted multiple patent classification codes of the same type or two or more types of patent classification codes, multiple patent classification codes on a selected single patent classification code system are processed by the homogeneous multi-patent classification code correlation pre-processing module.
  • the multiple patent classification codes are processed by the homogeneous multi-patent classification code correlation pre-processing engine of the present invention, and the result of the processing is stored in a homogeneous representing phrase multiple patent classification code correlation DB 207-3 of the present invention.
  • the uppermost one serves as a section. Further, let us assume that a patent classification code system name, such as IPC, exists above the section. Then, the nodes An and the nodes Bn have at least one common node, at the level of a section or of the patent classification code system named IPC (even when they are very far from each other, for example, when their sections are different from each other, they have at least one common node because they share the same patent classification code system named IPC). From among the common nodes, the node of the lowermost level in the patent classification code system is named the lowermost common patent classification code.
  • B(j+1) is equal to A(i+1), which itself is a parent node
  • it is unnecessary to discuss the correlations between all As and B(j+1) (= A(i+1)), because the latter inevitably exists as a parent node. That is to say, it is preferable to omit the lowermost common parent node from the correlation calculation table.
  • when the homogeneous multi-patent classification code correlation pre-processing module processes the multiple patent classification codes, it can generate data as shown in Table 11 by referring to the patent classification code system data (tree structure).
  • although A(i+1) and B(j+1) are unnecessary cells in an actual table (only the cells up to Ai and up to Bj are necessary in the correlation table), they are also indicated in the table for convenience of description (in order to show that they are omitted). That is, an actual table includes only the cells up to Ai and Bj.
  • intersection couples, for example, (A1, B1), ..., (Ai, Bj), etc.
  • Korean Patent Application No. 10-2005-0111868 dated January of 2006 is granted H04B 7/26 and H04B 7/15.
  • the following description is based on this example.
  • Parents of H04B 7/26 include, in turn, H04B 7/24, H04B 7/00, H04B, H04, and H.
  • Parents of H04B 7/15 include, in turn, H04B 7/14 and H04B 7/00.
  • the lowermost common patent classification code is H04B 7/00. Therefore, these codes can be arranged as shown in Table 12 below. As described above, cells relating to H04B 7/00 have no values.
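  • The lookup of the lowermost common patent classification code and the enumeration of the intersection couples can be sketched as follows. The ancestor chains are those of the example, extended upward to the section H; the function names are hypothetical, and the sketch assumes the two chains share at least one node, as is the case within a single classification system.

```python
# Ancestor chains from the example; each list runs from the code itself upward.
CHAIN_A = ["H04B 7/26", "H04B 7/24", "H04B 7/00", "H04B", "H04", "H"]
CHAIN_B = ["H04B 7/15", "H04B 7/14", "H04B 7/00", "H04B", "H04", "H"]


def lowermost_common(chain_a, chain_b):
    """First node of chain_a, walking upward, that also appears in chain_b."""
    in_b = set(chain_b)
    return next(node for node in chain_a if node in in_b)


def intersection_couples(chain_a, chain_b):
    """All (Ai, Bj) couples strictly below the lowermost common code."""
    common = lowermost_common(chain_a, chain_b)
    below_a = chain_a[:chain_a.index(common)]
    below_b = chain_b[:chain_b.index(common)]
    return [(a, b) for a in below_a for b in below_b]


common = lowermost_common(CHAIN_A, CHAIN_B)
couples = intersection_couples(CHAIN_A, CHAIN_B)
```

  • Excluding the common node itself from both lists mirrors the statement that cells relating to H04B 7/00 carry no values.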
  • a main patent classification code is first determined and taken, and the other patent classification code is then taken from (n-1) sub-patent classification codes. Then, (n-1) combinations and (n-1) tables are obtained, and all patent classification codes in each table are treated equally.
  • there are nC2 ways of taking a combination of two patent classification codes from the n patent classification codes, according to mathematical combination theory. For each of the nC2 patent classification code combinations, the treatment described for two patent classification codes is performed, so as to obtain nC2 tables, each of which has information on all intersection couples. At this time, an intersection couple relating to the main patent classification code may be assigned a weight relating to the main patent classification code.
  • an intersection couple relating to a sub-patent classification code may be assigned a weight relating to the sub-patent classification code.
  • Korean Patent Application No. 10-2006-0012606 has patent classification codes assigned on January of 2006, which include H04B 7/04, H04B 7/155, and H04Q 7/30.
  • Parent nodes of H04B 7/04 include H04B 7/02, H04B 7/00, H04B, H04, and H; parent nodes of H04B 7/155 include H04B 7/15, H04B 7/14, H04B 7/00, H04B, H04, and H
  • parent nodes of H04Q 7/30 include H04Q 7/20, H04Q 7/00, H04Q, H04, and H.
  • the lowermost common patent classification code for a combination of H04B 7/04 and H04B 7/155 is H04B 7/00
  • the lowermost common patent classification code for a combination of H04B 7/155 and H04Q 7/30 is H04
  • the lowermost common patent classification code for a combination of H04B 7/04 and H04Q 7/30 is H04.
  • Table 13 below is a correlation table of H04B 7/04 and H04B 7/155.
  • Table 15 is a correlation table of H04B 7/155 and H04Q
  • a larger weight may be given to a correlation table relating to the main patent classification code while a smaller weight is given to a correlation table relating to the sub-patent classification code.
  • a correlation table relating to the main patent classification code may be given 1 while a correlation table relating to the sub-patent classification code is given 0.5.
  • a uniform weight may be used for a patent document having multiple patent classification codes. For example, let us assume that, when a patent document has two patent classification codes, each patent classification code has a correlation couple value of 1. Then, in the case of a patent document having n patent classification codes, it is possible to uniformly allocate a value obtained by dividing the correlation couple value by nC2, which is the number of possible combinations, to the patent classification code couples. That is, it is possible to uniformly allocate a weight of 1/(nC2). Then, the correlation table of H04B 7/155 and H04Q 7/30 is expressed by Table 17 below.
  • different weights may be assigned to a couple relating to the main patent classification code and a couple relating to the sub-patent classification code. For example, when there are n patent classification codes, there are (n-1) combinations relating to the main patent classification code. Therefore, it is possible to allocate 75% of the weight to combinations relating to the main patent classification code while allocating a weight of 0.75*1/(n-1) to the relevant couples relating to them. When 25% of the weight is given to all combinations relating to the sub-patent classification codes, it is possible to allocate a weight of 0.25*1/((n-1)C2) to all couples relating to the sub-patent classification codes.
  • Table 19 is a correlation table of H04B 7/04 and H04Q 7/30.
  • Table 20 is a correlation table of H04B 7/155 and H04Q 7/30.
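  • The two weighting schemes can be checked numerically with a short sketch. The function names are hypothetical; the 75%/25% split is the example value given above, and the second scheme is only defined here for n of at least 3, since with two codes there is no couple formed solely of sub-patent classification codes.

```python
from math import comb


def uniform_weight(n):
    """Weight per code combination when every combination is treated equally."""
    return 1 / comb(n, 2)


def main_sub_weights(n, main_share=0.75):
    """Weights for combinations with and without the main classification code."""
    if n < 3:
        raise ValueError("needs at least one sub/sub combination (n >= 3)")
    main_w = main_share / (n - 1)               # (n-1) combinations include the main code
    sub_w = (1 - main_share) / comb(n - 1, 2)   # (n-1)C2 combinations of sub codes only
    return main_w, sub_w


# Three codes, as in Korean Patent Application No. 10-2006-0012606.
u = uniform_weight(3)
main_w, sub_w = main_sub_weights(3)
total = 2 * main_w + 1 * sub_w   # two main combinations plus one sub combination
```

  • With n = 3 the uniform weight is 1/3 per combination, while the split scheme gives 0.375 to each of the two combinations containing the main code and 0.25 to the single sub-only combination, so the weights of one document again sum to 1.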
  • Korean patent application No. 10-2005-0042032 has two patent classification codes including H04B 7/02 and H04B 7/14, which are in a next sibling relation (that is, H04B 7/02 and H04B 7/14 have the same parent node, i.e. H04B 7/00, which is the direct higher patent classification code).
  • for H04B 7/02 and H04B 7/14, there is only one correlation table, and the table has only one cell.
  • Table 21 below corresponds to the correlation table for the Korean patent application No. 10-2005-0042032.
  • H04B 7/02 and H04B 7/14 are strongly related to each other for the particular patent document.
  • the Korean Patent Application No. 10-2006-0012606 has three tables, which have 6 cells, 16 cells, and 20 cells, respectively.
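  • The table sizes quoted above can be reproduced from the classification tree. The sketch below is illustrative: the parent map contains only the nodes named in the examples, and the helper names are hypothetical.

```python
from itertools import combinations

# Parent links for the nodes named in the examples (child -> parent).
PARENT = {
    "H04B 7/04": "H04B 7/02", "H04B 7/02": "H04B 7/00",
    "H04B 7/155": "H04B 7/15", "H04B 7/15": "H04B 7/14", "H04B 7/14": "H04B 7/00",
    "H04B 7/00": "H04B", "H04B": "H04",
    "H04Q 7/30": "H04Q 7/20", "H04Q 7/20": "H04Q 7/00",
    "H04Q 7/00": "H04Q", "H04Q": "H04",
    "H04": "H",
}


def chain(code):
    """The code followed by all of its ancestors, up to the section."""
    out = [code]
    while out[-1] in PARENT:
        out.append(PARENT[out[-1]])
    return out


def table_size(code_a, code_b):
    """Cells in the correlation table: nodes below the lowermost common code, multiplied."""
    a, b = chain(code_a), chain(code_b)
    common = next(node for node in a if node in b)
    return a.index(common) * b.index(common)


def table_sizes(codes):
    """One table per combination of two codes (nC2 tables in total)."""
    return [table_size(x, y) for x, y in combinations(codes, 2)]


three_codes = table_sizes(["H04B 7/04", "H04B 7/155", "H04Q 7/30"])
siblings = table_sizes(["H04B 7/02", "H04B 7/14"])
```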
  • an important issue is how to appraise the contribution degree of the Korean patent application No. 10-2005-0042032 and the contribution degree of the Korean Patent Application No. 10-2006-0012606.
  • the weight of the contribution degree may be treated as follows.
  • the weight or contribution degree to the particular patent classification code combination (Ai, Bj) may have a value obtained by dividing 1 by the number of all cells of all tables of each patent document.
  • the weight or contribution degree relating to Korean patent application No. 10-2005-0042032 is 1 (which implies that there is one table and one cell)
  • the weight or contribution degree relating to Korean Patent Application No. 10-2006-0012606 is 1/(6+16+20), where 6+16+20 corresponds to the number of all cells in the three tables.
  • the weight indicating the contribution degree may have a value obtained by dividing 1 by the number of all cells within the tables relating to the particular patent classification code combination (Ai, Bj).
  • the weight or contribution degree relating to the document of Korean patent application No. 10-2005-0042032 has a value of 1 (which implies that there is one table and one cell), and the weight or contribution degree relating to Korean Patent Application No. 10-2006-0012606 is 1/6 (where 6 corresponds to the number of cells in the table relating to the combination).
  • the weight or contribution degree may have a value obtained by multiplying 1 by the number of all cells within the tables relating to the particular patent classification code combination (Ai, Bj).
  • the weight or contribution degree relating to the document of Korean patent application No. 10-2005-0042032 has a value of 1 (which implies that there is one table and one cell), and the weight or contribution degree relating to Korean Patent Application No. 10-2006-0012606 is 6 (which corresponds to the number of cells in the table relating to the combination).
  • Such a weight granting method is expected to be useful for calculation of convergence degree of heterogeneous technologies. That is, when a table includes a small number of cells, concerned technologies may correspond to strongly-related technologies bound together by the technology classification system itself. Therefore, the larger a table, the larger the distance between the two technologies in the technology classification system.
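  • The three contribution-degree policies can be compared side by side with a short sketch. The cell counts are those stated for the two example applications; the function names and the use of a table position to pick the table holding the couple are assumptions made for illustration.

```python
from fractions import Fraction

# Cells per correlation table for each document, as given in the text.
TABLE_CELLS = {
    "10-2005-0042032": [1],
    "10-2006-0012606": [6, 16, 20],
}


def weight_all_cells(doc):
    """1 divided by the number of cells in all tables of the document."""
    return Fraction(1, sum(TABLE_CELLS[doc]))


def weight_own_table(doc, table):
    """1 divided by the number of cells in the table holding the couple."""
    return Fraction(1, TABLE_CELLS[doc][table])


def weight_table_size(doc, table):
    """1 multiplied by the number of cells in the table holding the couple."""
    return TABLE_CELLS[doc][table]


# A couple such as (H04B 7/02, H04B 7/14) sits in the single table of the first
# document and in the 6-cell table (position 0) of the second document.
policy_1 = (weight_all_cells("10-2005-0042032"), weight_all_cells("10-2006-0012606"))
policy_2 = (weight_own_table("10-2005-0042032", 0), weight_own_table("10-2006-0012606", 0))
policy_3 = (weight_table_size("10-2005-0042032", 0), weight_table_size("10-2006-0012606", 0))
```

  • The first two policies favour documents whose codes lie close together in the classification tree, whereas the third rewards large tables, which is why it suits the convergence-degree calculation for heterogeneous technologies.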
  • every patent classification code combination contains information on the document based on which the combination is created. That is, the patent classification code combination (H04B 7/02, H04B 7/14) corresponds to a combination created based on Korean Patent Application No. 10-2006-0012606 and Korean patent application No. 10-2005-0042032, and the patent classification code combination (H04B 7/15, H04B 7/24) corresponds to a combination created based on Korean Patent Application No. 10-2005-0111868. Therefore, a patent classification code combination (Ai, Bj) corresponds to a specific document number, which may correspond to all information relating to the document, including all bibliographic information of the document having the specific document number.
  • Every table as described above has two relevant patent classification codes and corresponds to a patent document having those patent classification codes, and the patent document may correspond to the bibliographic information of the patent document. That is, all patent classification code combinations (Ai, Bj) existing in the correlation table of H04B 7/155 and H04Q 7/30 correspond to Korean Patent Application No. 10-2006-0012606 as a specific document number. Further, by way of an example, in the bibliographic information of the document corresponding to the specific document number, the application number of the document is 10-2006-0012606, the applicant thereof is Samsung Electronics, the inventors thereof are Do-in Choi and Seong-Tak Hwang, and the application date thereof is February 9, 2006.
  • the document may correspond to all information of the document, which includes Bibliographical information and a specific document number.
  • the specific document number may be an application number.
  • such information described above may be mapped to or correspond to the Bibliographical information of the US application.
  • the homogeneous multi-patent classification code correlation pre-processing module creates a predetermined number of tables for each pre-patent classification code combination by referring to the patent classification code system data (tree structure) of each patent classification code combination.
  • numeric values in each table may be values reflecting weights based on a predetermined standard.
  • the homogeneous multi-patent classification code correlation pre-processing module may create patent classification code combination (Ai, Bj)-based correlation result data as shown in Table 22 below by applying a predetermined weight to the patent classification code combinations (Ai, Bj) extracted from each table.
  • n denotes the number of patent classification codes
  • Table 22 contains correlation data created for the patent classification code combination (Ai, Bj) in relation to three patent documents including Korean Patent Application No. 10-2005-0111868, Korean Patent Application No. 10-2006-0012606, and Korean patent application No. 10-2005-0042032. It is obvious to one skilled in the art that it is possible to repeatedly or recursively apply the above method to all the obtained documents. It is also obvious to one skilled in the art that it is possible to create correlation data for each patent classification code combination (Ai, Bj) based on at least one of preset weights or contribution degrees.
  • the homogeneous multi-patent classification code correlation pre-processing module may, for a couple (Ai, Bj), store in a database at least one of 1) a document number containing the (Ai, Bj) value, 2) a weight (Wij) according to a particular situation(i)-based weight policy(j), 3) whether the couple is (M,S) or (S,S), and 4) other attribute information of (Ai, Bj).
  • the method of processing multiple patent classification codes by the homogeneous multi-patent classification code correlation pre-processing module is not limited to the IPC. That is, in the completely same way as in the IPC, multiple patent classification codes of US patent documents may be processed by means of USPC, multiple patent classification codes of Japanese patent documents may be processed by means of FT and FI, and multiple patent classification codes of European patent documents may be processed by means of ECLA.
  • the technology convergence pattern analysis module can process two types of information as follows.
  • the homogeneous multi-patent classification code correlation pre-processing module is expected to be capable of finding the patent classification code most frequently coupled with a particular patent classification code. This can be easily obtained by finding all the couples containing the given particular patent classification code and counting the number of occurrences of the couples.
  • by applying a preset weight or contribution degree weight, it is possible to find the patent classification code most frequently coupled with the particular patent classification code.
  • by using the patent classification code system (tree structure), it is possible to find the patent classification code most frequently coupled at a preset level.
  • the homogeneous multi-patent classification code correlation pre-processing module can find the patent classification code most frequently coupled with H04B 7/26, and can find the patent classification code most frequently coupled with H04B 7/26 at the level of the IPC subclass, IPC main group, or IPC subgroup.
  • the homogeneous multi-patent classification code correlation pre-processing module can generate various analysis results by using the relation between the patent classification code couple and Bibliographical information corresponding to the patent document number.
  • the analysis results may be obtained through analysis of a combination between the patent classification code couple and at least one selected from 1) nation, 2) period, 3) applicant, 4) inventor, and 5) agent. For example, it is possible to obtain the most frequently occurring patent classification code couple among Korean Patent Applications of years from 2000 to 2005 of Samsung Electronics Inc.
  • the homogeneous multi-patent classification code correlation pre-processing module can find a patent classification code, which is most frequently combined with the given patent classification code. Also, when a patent classification code is given, the homogeneous multi-patent classification code correlation pre-processing module can find a patent classification code, which is most frequently combined with the given patent classification code, from a document set based on at least one selected from 1) nation, 2) period, 3) applicant, 4) inventor, and 5) agent.
  • the homogeneous multi-patent classification code correlation pre-processing module can find the patent classification code most frequently combined with H04B 7/26 according to each stage in a patent classification code system, and can find the patent classification code most frequently combined with H04B 7/26 in Korean Patent Applications of years from 2000 to 2005 of Samsung Electronics, Co. Ltd.
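  • A minimal sketch of such a query is given here. All data in it are hypothetical: the couple records, the document numbers D1 to D5, and the applicant and year values are invented for illustration, and the level function only covers truncation to the IPC subclass.

```python
from collections import Counter

# Hypothetical couple records: (code A, code B, document number).
COUPLES = [
    ("H04B 7/26", "H04B 7/15", "D1"),
    ("H04B 7/26", "H04Q 7/30", "D2"),
    ("H04B 7/26", "H04B 7/15", "D3"),
    ("H04B 7/26", "H04Q 7/30", "D4"),
    ("H04B 7/26", "H04Q 7/30", "D5"),
]
# Hypothetical bibliographic information keyed by document number.
BIBLIO = {
    "D1": {"applicant": "A", "year": 2003},
    "D2": {"applicant": "B", "year": 2004},
    "D3": {"applicant": "A", "year": 2005},
    "D4": {"applicant": "B", "year": 2001},
    "D5": {"applicant": "A", "year": 2007},
}


def subclass(code):
    """Truncate an IPC code to subclass level, e.g. 'H04B 7/15' -> 'H04B'."""
    return code.split()[0]


def most_coupled(code, keep=lambda info: True, level=lambda c: c):
    """Most frequent partner of `code` among documents accepted by `keep`."""
    counts = Counter()
    for a, b, doc in COUPLES:
        if not keep(BIBLIO[doc]):
            continue
        if a == code:
            counts[level(b)] += 1
        elif b == code:
            counts[level(a)] += 1
    return counts.most_common(1)[0] if counts else None


overall = most_coupled("H04B 7/26")
by_subclass = most_coupled("H04B 7/26", level=subclass)
filtered = most_coupled(
    "H04B 7/26",
    keep=lambda info: info["applicant"] == "A" and 2000 <= info["year"] <= 2005,
)
```

  • Because each couple carries its document number, restricting by applicant and period changes the answer here from H04Q 7/30 to H04B 7/15, which is the kind of result the bibliographic linkage is meant to enable.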
  • the homogeneous multi-patent classification code correlation pre ⁇ processing module can cause a particular patent document set to correspond to the given patent classification code combination (Ai, Bj). Then, the homogeneous multi-patent classification code correlation pre-processing module can perform various analyses for the corresponding patent document set, which yield results of analyses for the patent classification code set (Ai, Bj).
  • the particular patent document set may be: 1) a patent document set specified by a particular classification code according to a particular patent classification code system in a database of a particular nation; 2) a patent document set specified by a particular applicant name in a database of a particular nation; 3) a patent document set specified by a particular inventor name included in a patent document of a particular applicant name (i.e. a particular applicant name and a particular inventor name) in a database of a particular nation; 4) a patent document set specified by a particular agent name in a database of a particular nation; 5) a patent document set specified by a particular applicant name and a particular classification code according to a particular patent classification code system in a database of a particular nation; 6) a patent document set specified by a particular applicant name, a particular inventor name, and a particular classification code according to a particular patent classification code system in a database of a particular nation; 7) a patent document set specified by a particular applicant name and a particular agent name in a database of a particular nation; 8) all patent document sets of a particular nation; 9) all patent document sets of at least two nations; 10) a patent document set restricted to a predetermined period in addition to the conditions of 1) to 9); or 11) a patent document set specified by a particular condition, such as whether the document has been registered.
  • the homogeneous multi-patent classification code correlation pre-processing module may extract documents having at least two patent classification codes from at least one document set specified by the conditions of 1) to 11) described above, and obtain the patent classification code combination (Ai, Bj) from the extracted documents.
  • For the obtained patent classification code combination (Ai, Bj), the homogeneous multi-patent classification code correlation pre-processing module generates correlation data as in the tables, and performs arrangement or analysis of the generated correlation data (e.g. analysis of frequency, the rarest value, the maximum value, the minimum value, the number for each year, and the number for each subject such as an applicant or an inventor).
  • Through the analysis described above, it is possible to obtain information on technology convergence in terms of the patent classification code combination (Ai, Bj), either for at least one document set specified by the conditions 1) to 11) or for the extracted document set.
  • the above-described repeated execution method of the homogeneous multi-patent classification code correlation pre-processing module is only one example, and one skilled in the art can try various calculation methods within the spirit of the present invention. Naturally, the results of the above process for all patent classification codes of all patent documents each having at least two patent classification codes (for example, tables having values for each patent classification code couple regardless of a granted weight) may be either stored in a database or provided through a view realized in real time or a materialized view.
  • the homogeneous multi-patent classification code correlation pre-processing module obtains a document set including at least one patent document (S5320), extracts multi-code documents having at least two patent classification codes from the obtained document set (S5330), generates multiple patent classification code combinations included in the extracted multi-code documents according to a predetermined standard for discriminated processing of the main patent classification code and the sub-patent classification code (S5340), obtains information on parent nodes of each patent classification code included in each of the patent classification code combinations in the patent classification code system (S5350), obtains a lowest common patent classification code for the parent nodes (S5360), generates parent node patent classification code couples including itself up to just before the lowest common patent classification code (S5370), stores each couple after granting the couple a uniform weight or a weight according to a predetermined standard (S5380), and generates a predetermined standard-based statistic value, parameter or calculation value for the stored couple.
  • the homogeneous multi-patent classification code correlation pre-processing module obtains at least two document sets each including at least one patent document (S5410), extracts a multi-code patent document having at least two patent classification codes from each document set (S5430), generates multiple patent classification code combinations included in the multi-code patent documents according to a predetermined standard for discriminated processing of the main patent classification code and the sub-patent classification code (S5440), obtains information on parent nodes of each patent classification code included in each of the patent classification code combinations in the patent classification code system (S5450), obtains a lowest common patent classification code for the parent nodes (S5460), generates parent node patent classification code couples including itself up to just before the lowest common patent classification code (S5470), stores each couple after granting the couple a uniform weight or a weight according to a predetermined standard (S5480), generates a predetermined standard-based statistic value, parameter or calculation value for the stored couple (S5490), and compares the statistic value, parameter or calculation value for each of the document sets (
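  • The step of generating parent node couples "up to just before the lowest common patent classification code" can be sketched as follows. This is a hedged illustration only: the root-to-leaf paths in `PATHS` are sample data, the function names are hypothetical, and pairing every node of one branch with every node of the other is one possible reading of the couple generation.

```python
from itertools import product

# Hypothetical root-to-leaf paths in a classification tree (illustrative only).
PATHS = {
    "H04B 7/26": ["H", "H04", "H04B", "H04B 7/00", "H04B 7/24", "H04B 7/26"],
    "H04L 12/28": ["H", "H04", "H04L", "H04L 12/00", "H04L 12/28"],
}


def ancestor_couples(code_a, code_b, paths=PATHS):
    """Pair the nodes of both branches, stopping just before the lowest common code."""
    path_a, path_b = paths[code_a], paths[code_b]
    common = 0
    while (common < min(len(path_a), len(path_b))
           and path_a[common] == path_b[common]):
        common += 1
    # path[common:] holds the code itself plus its parents below the
    # lowest common code; the shared prefix is excluded.
    return list(product(path_a[common:], path_b[common:]))


def weighted_couples(code_a, code_b, weight=1.0):
    """Store each couple with a uniform weight (a preset standard could replace it)."""
    return {couple: weight for couple in ancestor_couples(code_a, code_b)}


couples = ancestor_couples("H04B 7/26", "H04L 12/28")
```

  • In this sample the lowest common code is H04, so couples such as (H04B, H04L) are generated while (H04, H04) is not.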
  • FIGs. 115 to 117 show examples of a method for analysis of multiple patent classification codes.
  • FIG. 115 illustrates an example of a method for presenting analysis information on most-frequently converging patent classification codes based on an obtained patent classification code.
  • the homogeneous multi-patent classification code correlation pre-processing module obtains at least one patent classification code (S6520), obtains information on most-frequently converging patent classification codes based on the obtained patent classification code (S6530), and displays the most-frequently converging patent classification codes in at least one stage reflecting a patent classification code system or performs at least one analysis preset for a patent document set corresponding to the most-frequently converging patent classification codes (S6540).
  • FIG. 116 illustrates an example of a method for extracting most-frequent patent classification codes from an obtained document set, and presenting analysis information on most-frequently converging patent classification codes based on the extracted patent classification codes.
  • the homogeneous multi-patent classification code correlation pre-processing module obtains at least one document set (S6620), obtains patent classification codes for each document of the document set (S6630), ranks the obtained patent classification codes for each document based on a most-frequent value in the level of the document set (S6640), obtains information on most-frequently converging patent classification codes for each of the ranked patent classification codes (S6650), and displays the most-frequently converging patent classification codes in at least one stage reflecting a patent classification code system or performs at least one analysis preset for a patent document set corresponding to the most-frequently converging patent classification codes (S6660).
  • FIG. 117 illustrates an example of a method for extracting most-frequently converging patent classification codes from patent classification codes extracted from each document in an obtained document set, and presenting analysis information generated by synthesizing the extracted most-frequently converging patent classification codes.
  • the homogeneous multi-patent classification code correlation pre-processing module obtains at least one document set (S6720), obtains patent classification codes for each document of the document set (S6730), obtains information on most-frequently converging patent classification codes from the obtained patent classification codes for each document (S6740), synthesizes the obtained information on most-frequently converging patent classification codes for each document (S6750), and displays the most-frequently converging patent classification codes in at least one stage reflecting a patent classification code system or performs at least one analysis preset for a patent document set corresponding to the most-frequently converging patent classification codes (S6760).
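  • The flow of FIG. 117 (steps S6730 to S6750) might look like the sketch below, assuming a pre-processed correlation table is already available. The `CORRELATION` table, its frequencies, and the `synthesize` function are hypothetical, and summing the frequencies is only one possible way of "synthesizing" the per-document results.

```python
from collections import Counter

# Hypothetical pre-processed correlation table (illustrative values):
# code -> {co-occurring code: frequency}.
CORRELATION = {
    "H04B 7/26": {"H04L 12/28": 40, "H04Q 7/38": 25},
    "H04L 12/28": {"H04B 7/26": 40, "H04L 29/06": 30},
    "G06F 17/30": {"G06Q 50/00": 12},
}


def synthesize(doc_set, table=CORRELATION, per_code=1):
    """Sum, over all documents, the top converging codes of each document code."""
    total = Counter()
    for codes in doc_set:                      # S6730: codes of each document
        for code in codes:
            partners = Counter(table.get(code, {})).most_common(per_code)  # S6740
            for partner, frequency in partners:
                total[partner] += frequency    # S6750: synthesis by summation
    return total.most_common()


ranking = synthesize([["H04B 7/26", "H04L 12/28"], ["H04B 7/26"], ["G06F 17/30"]])
```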
  • the homogeneous multi-patent classification code correlation pre-processing module may also be applied to simple patent classification code couples (PC1, ..., PCi, PCj, ..., PCn) included in each document, without applying the inventive concept of including lower patent classification codes.
  • the homogeneous multi-patent classification code correlation preprocessing module described above processes homogeneous patent classification codes.
  • the heterogeneous multi-patent classification code correlation pre-processing module will be described.
  • the heterogeneous multi-patent classification code correlation pre-processing module has nearly the same operation principle as that of the homogeneous multi-patent classification code correlation pre-processing module.
  • Almost all patent documents issued in the United States include both IPC codes and USPC codes, and patent documents issued in Japan also include both IPC codes and FT codes. Let us assume that one patent document is assigned IPC1, ..., IPCn and USPC1, ..., USPCm.
  • the correlation combination between IPC1 and USPC1 corresponds to a couple of a main IPC code and a main USPC code.
  • the correlation combination between IPC1 and each of USPC2, ..., USPCm corresponds to a couple of a main IPC code and a sub USPC code.
  • the correlation combination between each of IPC2, ..., IPCn and USPC1 corresponds to a couple of a sub IPC code and a main USPC code.
  • the correlation combination between each of IPC2, ..., IPCn and each of USPC2, ..., USPCm corresponds to a couple of a sub IPC code and a sub USPC code.
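  • The four kinds of couples listed above can be enumerated as in the following sketch. It assumes, as an illustration, that the first code of each list is the main code; the function name and the sample codes are hypothetical.

```python
from itertools import product


def heterogeneous_couples(ipc_codes, uspc_codes):
    """Label every IPC x USPC couple with the main/sub role of each member.

    Assumption: the first code in each list is the main code.
    """
    couples = []
    for (i, ipc), (j, uspc) in product(enumerate(ipc_codes), enumerate(uspc_codes)):
        role = ("main" if i == 0 else "sub", "main" if j == 0 else "sub")
        couples.append((ipc, uspc, role))
    return couples


sample = heterogeneous_couples(
    ["H04B 7/26", "H04L 12/28"],          # IPC1 (main), IPC2 (sub)
    ["455/456.1", "370/328", "455/434"],  # USPC1 (main), USPC2, USPC3 (sub)
)
```

  • A document with n IPC codes and m USPC codes yields n × m couples: one main/main, (m − 1) main/sub, (n − 1) sub/main, and (n − 1)(m − 1) sub/sub.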
  • the heterogeneous multi-patent classification code correlation pre-processing module can perform the same operation as the homogeneous multi-patent classification code correlation pre-processing module.
  • Because the processed codes are heterogeneous patent classification codes, they have no lowest common patent classification code, and the higher patent classification codes to be included can instead be limited to preset levels. For example, a couple of a given IPC code and a USPC code may be limited to have higher patent classification codes up to the subclass of the IPC and the class of the USPC.
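  • A minimal sketch of capping the higher codes at preset levels is given below. It assumes IPC codes written like "H04B 7/26" and USPC codes written like "455/456.1", and it deliberately skips the dot-based levels between a subgroup and its main group; the helper names are hypothetical.

```python
from itertools import product


def ipc_levels(code):
    """'H04B 7/26' -> the code itself, its main group, and its subclass."""
    subclass, group = code.split()
    main_group = f"{subclass} {group.split('/')[0]}/00"
    # dict.fromkeys removes the duplicate when the code is already a main group.
    return list(dict.fromkeys([code, main_group, subclass]))


def uspc_levels(code):
    """'455/456.1' -> the code itself and its class (the part before the slash)."""
    return list(dict.fromkeys([code, code.split("/")[0]]))


def capped_couples(ipc_code, uspc_code):
    """Couples of higher codes, capped at IPC subclass and USPC class."""
    return list(product(ipc_levels(ipc_code), uspc_levels(uspc_code)))


capped = capped_couples("H04B 7/26", "455/456.1")
```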
  • the heterogeneous multi-patent classification code correlation pre-processing module may generate correlation tables as described above for a given patent classification code combination (Ai, Bj) (wherein Ai and Bj are patent classification codes of different systems).
  • the generated correlation tables may be granted a weight or contribution degree weight.
  • each heterogeneous patent classification code combination (Ai, Bj) (wherein Ai and Bj are patent classification codes of different systems) corresponds to a patent document number, so it is possible to perform counting and calculation reflecting bibliographical information obtained from the patent document number.
  • Patent classification code-based statistic pre-processing module Hierarchy of patent classification code
  • a patent classification code-based statistic pre-processing module of the present invention performs a pre-processing for finding a preset statistic value for patent classification codes in at least one patent classification code system from at least one nation-based patent DB.
  • FIG. 61 illustrates a structure of the patent classification code-based statistic pre-processing module.
  • the patent classification code-based statistic pre-processing module includes a patent classification code-based statistic pre-processing engine 3210 for pre-processing statistic values according to the patent classification code, and a pre-processed patent classification code-based statistic DB 3230 corresponding to a view, table, or information system storing the pre-processed patent classification code-based statistic values.
  • Patent classification codes dealt with by the patent classification code-based statistic pre-processing module include at least one of the IPC, USPC, FT, FI, and ECLA codes. It is obvious that nation-based patent DBs processed by the patent classification code-based statistic pre-processing module include patent DBs issued by the Republic of Korea, the United States of America, Japan, and Europe (EPO), as well as patent DBs issued by the patent offices of other nations. Each nation may commonly use the IPC, while having its own specific classification system (e.g. the US patent office has USPC or UPC, the Japanese patent office has FT or FI, and the European patent office has ECLA).
  • each patent classification code system has its own levels.
  • the IPC system has levels including section, subsection, class, subclass, group, and subgroup, and has sub-levels under the subgroup, which have a hierarchical structure expressed by the number of dots attached to the title information corresponding to a relevant patent classification code.
  • An example of the hierarchical structure has been described in detail above in relation to the patent classification code pre-processing module 301-3-1 or 3500.
  • levels of IPC patent classification codes according to the present invention include IPC section, class, subclass, main group, 1-dot subgroup, 2-dot subgroup, ..., and n-dot subgroup. From the code itself, it is possible to identify the hierarchical structure up to the main group. However, it is impossible to identify the hierarchical structure below the main group without the dot information included in the title. The number of dots is likewise used to identify the hierarchical structure from the subclass onward in the USPC system, and after the first seven characters (theme code + two English letters) of the full FT code in the FT system.
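  • How dot counts determine the hierarchy below the main group can be sketched as follows. The entries and their dot counts are illustrative sample data rather than an authoritative excerpt of the IPC, and `parents_from_dots` is a hypothetical helper that assumes the entries arrive in schedule order with dot counts that never skip a level.

```python
def parents_from_dots(entries):
    """Map each code to its parent, given (code, dot_count) in schedule order.

    A dot count of 0 denotes a main group; a code with d dots is a child of
    the most recent preceding code with d - 1 dots.
    """
    parents = {}
    stack = []  # stack[d] is the latest code seen at depth d
    for code, dots in entries:
        del stack[dots:]                       # drop deeper or equal levels
        parents[code] = stack[-1] if stack else None
        stack.append(code)
    return parents


# Illustrative dot counts (assumed for the example).
SCHEDULE = [
    ("H04B 7/00", 0),
    ("H04B 7/14", 1),
    ("H04B 7/15", 2),
    ("H04B 7/155", 3),
    ("H04B 7/24", 1),
    ("H04B 7/26", 2),
]
tree = parents_from_dots(SCHEDULE)
```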
  • FIG. 100 is a flowchart showing an example of pre-processing of statistic values according to each patent classification code by the patent classification code-based statistic pre-processing module.
  • the patent classification code-based statistic pre-processing module modifies and stores patent classification codes in a database by using a patent classification code tree structure, so that a particular patent classification code system of a first nation may automatically include classification codes of a given patent classification code (S5020), obtains a search formula or search query including a patent classification code (S5030), converts a given patent classification code included in the search formula or search query to a modified patent classification code (S5040), performs a search by using a search formula or search query including the modified patent classification code by a search engine or DBMS 201 (S5050), obtains patent document information as a result of the search and modifies the obtained patent document information to be measurable (S5060), and performs a function of counting rank information of the result of the search in the unit of preset time or synthesized time based on a preset
  • the patent classification code-based statistic preprocessing module can generate a statistic value, parameter or calculation value, which automatically includes lower patent classification codes of a given patent classification code, and an example of such a method is illustrated in FIG. 101.
  • the patent classification code-based statistic pre-processing module obtains a search formula including a patent classification code (S5120), obtains relevant patent document information, including patent information on lower patent classification codes of the patent classification code included in the search formula (S5130), and processes backward citation information and forward citation information included in the patent document, so as to generate a preset statistic value, parameter, or calculation value (S5140).
  • the patent classification code-based statistic pre-processing module can generate a patent classification code-based statistic value, parameter, or calculation value, which reflects backward citation information or forward citation information including lower patent classification codes, for a subset of a particular document set, and an example of such a method is illustrated in FIG. 102.
  • the patent classification code-based statistic pre-processing module obtains at least one document set (S5220), extracts information on the applicant, inventor, agent, and at least one patent classification code of every document included in the document set (S5230), generates a document subset from the document set based on at least one item of the extracted information (S5240), obtains information on prior-filed documents cited by all documents of the document subset (S5250) or obtains information on later-filed documents citing all the documents of the document subset (S5260), and generates a preset backward citation and forward citation-related statistic value, parameter or calculation value for the obtained prior-filed documents and/or later-filed documents (S5270).
  • When the patent classification code-based statistic pre-processing module processes the preset statistic value, the following options are taken into consideration and can be selected in combination with one another.
  • a statistic value without lower classification codes can be generated simply by obtaining patent documents including H04B 7/15 from among patent documents of each nation.
  • the same can be said about the USPC, FT, etc., and the same description thereof is omitted here.
  • For H04B 7/15, it is possible to first obtain, from the patent classification code hierarchical structure information, the information that H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, and H04B 7/216 exist under H04B 7/15, store the existence of H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208, H04B 7/212, and H04B 7/216 under H04B 7/15, and obtain a preset statistic value automatically including H04B 7/155, H04B 7/165, H04B 7/17, H04B 7/185, H04B 7/19, H04B 7/195, H04B 7/204, H04B 7/208,
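  • Counting with and without lower classification codes can be sketched as below, assuming a child-to-parent map is available from the hierarchical structure information. The nesting shown in `PARENTS` is a hypothetical subset chosen for the example, and the function names are not taken from the source.

```python
def descendants(code, parents):
    """Return all lower codes of `code`, given a child -> parent map."""
    children = {}
    for child, parent in parents.items():
        children.setdefault(parent, []).append(child)
    found, pending = [], [code]
    while pending:
        for child in children.get(pending.pop(), []):
            found.append(child)
            pending.append(child)
    return found


def count_documents(code, docs, parents=None):
    """Count documents carrying `code`; include lower codes when `parents` is given."""
    wanted = {code}
    if parents is not None:
        wanted.update(descendants(code, parents))
    return sum(1 for codes in docs if wanted & set(codes))


# Hypothetical nesting under H04B 7/15 (illustrative only).
PARENTS = {
    "H04B 7/15": "H04B 7/14",
    "H04B 7/155": "H04B 7/15",
    "H04B 7/185": "H04B 7/15",
    "H04B 7/19": "H04B 7/185",
}
SAMPLE_DOCS = [["H04B 7/15"], ["H04B 7/19"], ["H04B 7/14"],
               ["H04B 7/155", "H04B 7/185"]]

without_lower = count_documents("H04B 7/15", SAMPLE_DOCS)
with_lower = count_documents("H04B 7/15", SAMPLE_DOCS, PARENTS)
```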
  • Countermeasures for this case may include: 1) a method of employing only the main patent classification code (usually, the first appearing code is the main patent classification code) as an object of the statistic processing; 2) a processing method giving a uniform weight to the main patent classification code and sub-patent classification codes; and 3) a processing method giving different weights to the main patent classification code and sub-patent classification codes (for example, when there are (n+1) patent classification codes, the main patent classification code may be granted a weight of 50% while each of the sub-patent classification codes is granted a weight of 50% × 1/n).
  • Among methods 1) to 3), methods 1) and 3) are preferred, and method 3) is most preferred.
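  • The three weighting methods can be sketched as follows. Two points are assumptions not stated in the text: method 2) is read as equal shares summing to 100%, and under method 3) a document with only a main code gives that code the full weight. The function name is hypothetical, and the codes of a document are assumed to be distinct.

```python
from fractions import Fraction


def code_weights(codes, method=3):
    """Weights per code; the first code is taken to be the main code."""
    main, subs = codes[0], codes[1:]
    if method == 1:                     # main code only
        return {main: Fraction(1)}
    if method == 2:                     # uniform weight (assumed: equal shares)
        return {code: Fraction(1, len(codes)) for code in codes}
    if not subs:                        # assumption: a lone main code gets 100%
        return {main: Fraction(1)}
    weights = {main: Fraction(1, 2)}    # method 3: main 50%, subs share 50%
    weights.update({code: Fraction(1, 2) / len(subs) for code in subs})
    return weights


example = code_weights(["H04B 7/26", "H04L 12/28", "H04Q 7/38"])
```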
  • the third issue is a processing method when there are multiple applicants or multiple inventors.
  • Countermeasures for this case include: 1) a method of giving a weight of 100% to each applicant or inventor and 2) a method of equally dividing the weight by the number of applicants or inventors (when there are n inventors, each inventor is granted a weight of 100%*1/n). It is preferred that each applicant is granted a weight of 100% in view of independence of implementation, while each inventor is granted a weight of 1/n.
  • the same method as that for the applicant or inventor can be applied to the agent, and it is preferred that each agent is given a weight of 100%.
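  • The preferred weighting for parties (100% per applicant, 1/n per inventor, 100% per agent) can be sketched as a tally over documents. The record layout and the `tally_parties` function are assumptions made for illustration.

```python
from collections import defaultdict
from fractions import Fraction


def tally_parties(docs):
    """Accumulate weighted counts per applicant, inventor, and agent."""
    totals = {role: defaultdict(Fraction)
              for role in ("applicants", "inventors", "agents")}
    for doc in docs:
        for name in doc["applicants"]:
            totals["applicants"][name] += 1          # each applicant: 100%
        inventors = doc["inventors"]
        for name in inventors:
            totals["inventors"][name] += Fraction(1, len(inventors))  # 1/n each
        for name in doc["agents"]:
            totals["agents"][name] += 1              # each agent: 100%
    return totals


result = tally_parties([
    {"applicants": ["X", "Y"], "inventors": ["a", "b"], "agents": ["P"]},
    {"applicants": ["X"], "inventors": ["a"], "agents": ["P", "Q"]},
])
```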
  • the patent classification code-based statistic pre-processing module may count in advance either the number of immediately lower patent classification codes of the given patent classification code in the patent classification code system, or the number of all the lower patent classification codes of the given patent classification code.
  • the patent classification code-based statistic pre-processing module may calculate in advance the following information based on application and/or registration according to preset terms in a predetermined nation-based patent DB.
  • the above-mentioned calculation can be achieved because it is possible to determine a patent document set satisfying the conditions, such as the patent classification code, appointed nation, preset time, and application or registration. By counting for the determined patent document set, it is possible to calculate various parameters as follows.
  • the each-year's application number can be calculated by obtaining the number of all patent documents filed in a particular year.
  • the increase rate can be calculated by (current term number − past term number) / past term number × 100%.
  • the increase speed can be calculated by (current term number − past term number) / time interval.
  • the increase acceleration can be calculated by a change rate with respect to the time interval of the increase speed.
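  • The three growth parameters can be written as small functions. This is a sketch under stated assumptions: the acceleration is computed from three consecutive term counts with a constant time interval, and the function names are hypothetical. A past term number of zero makes the increase rate undefined and is not handled here.

```python
def increase_rate(current, past):
    """(current - past) / past * 100, in percent."""
    return (current - past) / past * 100.0


def increase_speed(current, past, interval):
    """(current - past) / time interval."""
    return (current - past) / interval


def increase_acceleration(counts, interval):
    """Change of the increase speed per time interval.

    `counts` holds three consecutive term numbers, oldest first.
    """
    older, past, current = counts
    newer_speed = increase_speed(current, past, interval)
    older_speed = increase_speed(past, older, interval)
    return (newer_speed - older_speed) / interval


# Example: 80, 100, and 150 applications in three consecutive years.
rate = increase_rate(150, 100)
speed = increase_speed(150, 100, 1)
acceleration = increase_acceleration((80, 100, 150), 1)
```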
  • the number of applicants of each year can be obtained (in the case of a joint application, the above-mentioned options can be applied).
  • the applicant number increase rate can be calculated, and an average applicant number and an average applicant number increase rate for each application can also be calculated.
  • an each-year's inventor number, inventor number increase rate, average inventor number, and average inventor number increase rate can be calculated by the same method as that of the each-year's applicant number.
  • Because the application claims can be divided into independent claims and dependent claims, it is possible to calculate the number of independent claims, the number of dependent claims, and their increase rates.
  • By calculating the number of patent classification codes of each application, it is possible to obtain an each-year's patent classification code number and patent classification code number increase rate and obtain an average patent classification code number and average patent classification code number increase rate for each application. Further, in the case of a patent DB of a nation using multiple patent classification codes, such as the US or Japan, those values can be calculated in advance for each patent classification code.
  • Because each application has family information of the application, it is possible to calculate an each-year's family number, an each-year's family number increase rate, and an average family nation number and an average family nation number increase rate of each application based on the family information.
  • Parameters calculable based on the registration include registration number, registration number increase rate, registration number increase speed, registration number increase acceleration, registered party number, registered party number increase rate, average registered party number, average registered party number increase rate, inventor number, inventor number increase rate, average inventor number, average inventor number increase rate, registered claim number, registered claim number increase rate, average registered claim number, average registered claim number increase rate, patent classification code number, patent classification code number increase rate, average patent classification code number, average patent classification code number increase rate, family number, family number increase rate, average family nation number, and average family nation number increase rate.
  • the parameters described above essentially include counted values, change rates and change values of the counted values, statistic quantities (average, standard deviation, etc.) of the counted values, and function values (e.g. registration rate) of at least two counted values.
  • the patent classification code-based statistic pre-processing module may previously calculate the following information according to each of preset terms based on the application and/or registration, for patent DB of each nation in relation to a given particular patent classification code. Such a calculation is possible because it is possible to determine a patent document set satisfying conditions including the patent classification code, appointed nation, preset time, and application or registration. By counting for the determined patent document set, it is possible to calculate various parameters as follows. For example, through counting for the determined patent document set, it is possible to extract each-year's or whole-year's maximum applicants, inventors, agents and give rankings to the extracted applicants, inventors, or agents.
  • the applications-number-based concentration degree can be calculated by (all applications of a particular subject having a particular patent classification code during a particular period/all applications of the particular subject during the particular period)*100%.
  • the activity index can be calculated by [(all applications of a particular subject having a particular patent classification code during a particular period/all applications of the particular subject during the particular period)/(all applications having the particular patent classification code during the particular period/all applications during the particular period)*100%].
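  • The concentration degree and the activity index follow directly from four counts, as in this sketch. The parameter names are hypothetical labels for the quantities named in the text, and all counts are assumed to refer to the same period.

```python
def concentration_degree(subject_code_apps, subject_apps):
    """Share of a subject's applications that carry the given code, in percent."""
    return subject_code_apps / subject_apps * 100.0


def activity_index(subject_code_apps, subject_apps, code_apps, all_apps):
    """Subject's share in the code relative to the overall share of the code, in percent."""
    subject_share = subject_code_apps / subject_apps
    overall_share = code_apps / all_apps
    return subject_share / overall_share * 100.0


# Example: a subject filed 200 applications, 50 of them with the code;
# overall, 1000 of 8000 applications carry the code.
concentration = concentration_degree(50, 200)
index = activity_index(50, 200, 1000, 8000)
```

  • An activity index above 100% indicates that the subject files in the given code more intensively than applicants do on average.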
  • the search engine or DBMS 201 can calculate values of functions, input values of which are numerical values that can be obtained either indirectly through pre-calculation or directly, and can give rankings to the applicants, inventors, or agents based on the calculated function values.
  • the function values according to the present invention as described above may include all patent analysis indexes defined as values obtainable by the search engine or DBMS 201 in a patent DB from among all conventionally definable patent analysis indexes that can be utilized as patent analysis indexes. Further, instead of calculating the rankings of applicants, inventors or agents based on the number of applications as described above, it is possible to calculate the rankings of applicants, inventors or agents for the patent index or function based on other parameters, such as applications-number increase rate and applications-number increase speed.
  • Those parameters may include the number of application claims, application claim number increase rate, each-application's average application claim number, each-application's average application claim number increase rate, patent classification code number, patent classification code number increase rate, each application's average patent classification code number, each application's average patent classification code number increase rate, family number, family number increase rate, each application's average family nation number, and each application's average family nation number increase rate.
  • the patent classification code-based statistic pre-processing module can generate citation-related information as follows based on the application and/or registration for each preset term from a nation-based patent DB (e.g. a patent DB constructed by patent documents issued by the US patent office) including citation information of a given particular patent classification code.
  • Such a calculation is possible because it is possible to determine a patent document set satisfying the conditions, such as the patent classification code, appointed nation, preset time, and application or registration.
  • Bibliographical information of a US patent document includes citation information, such as others' patents cited by the document itself. That is, when document B cites document A and document a, document A and document a are cited documents of document B, and document B is a citing document of document A and document a.
  • an applicant (including assignee), an inventor, an agent, and a patent classification code relating to document B are a citing applicant, a citing inventor, a citing agent, and a citing patent classification code, respectively, and an applicant (including assignee), an inventor, an agent, and a patent classification code relating to document A and document a are a cited applicant, a cited inventor, a cited agent, and a cited patent classification code, respectively.
  • Bibliographical information of document B contains numbers of document A and document a.
  • This process can be performed for all the determined or specified documents. That is, when all specified patent documents are processed one by one, it is possible to establish mapping relations between all patent documents through a computational method or other methods. When the documents are processed one by one, the document that is the object of each processing step naturally takes the standpoint of document B. At this time, when both document A and document a are included in a universal set named US patent DB, information in relation to document A and document a may include a record of mapping information between documents A and a and document B. When one of document A and document a is a document managed by a patent DB of another nation, mapping information between documents A and a and document B can also be recorded in information relating to the document of the patent DB of that nation.
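  • Establishing the mapping relations by processing the documents one by one can be sketched as inverting a citing-to-cited map. The dictionary layout and the `forward_citations` function are assumptions made for illustration.

```python
def forward_citations(backward):
    """Invert a {citing document: [cited documents]} map.

    Each document is processed in turn from the standpoint of "document B";
    every cited number it lists receives a record pointing back to it.
    """
    forward = {}
    for citing, cited_list in backward.items():
        for cited in cited_list:
            forward.setdefault(cited, []).append(citing)
    return forward


# Document B cites documents A and a; document C cites document A.
BACKWARD = {"B": ["A", "a"], "C": ["A"]}
FORWARD = forward_citations(BACKWARD)
```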
  • the patent classification code-based statistic pre-processing module can collect cited patent document numbers (numbers of documents A and documents a) included in patent documents of the patent document set, wherein the collected cited patent document number has a corresponding cited patent document, and the cited patent document naturally includes bibliographical information. Therefore, a cited patent document set cited by all the patent documents of the specified patent document set can also be specified at a particular calculation time point. Accordingly, the specified cited patent document set can be an object of analysis, counting or calculation. Here, repetitive counting is naturally permitted.
  • When one cited document is cited by multiple documents of the patent document set including the citing information, the cited document is required to have a weight or counting value corresponding to the number of its citations. That is, it is rational to consider that each cited document number has been multiplied by a weight corresponding to the number of its citations. For example, when document A has been cited five times and document a has been cited three times, it is preferred to give a weight of 5 to document A and a weight of 3 to document a in calculation of the following parameters, such as the number of all citing applications. Based on the patent document set including the citing information, the word "citing" can be attached to all calculation values for the cited patent document set. In the repetitive counting, it is possible to calculate a ranking of the cited document numbers that have been cited the largest number of times.
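  • The repetitive counting described above can be sketched with a counter over the cited document numbers. The sample citation data and the `citation_weights` function are hypothetical.

```python
from collections import Counter


def citation_weights(doc_set, backward):
    """Count how often each cited number occurs across the citing document set.

    Repeats are kept on purpose, so a document cited five times gets weight 5.
    """
    return Counter(cited
                   for citing in doc_set
                   for cited in backward.get(citing, []))


# Five citing documents: all cite A, and three of them also cite a.
BACKWARD = {
    "B1": ["A", "a"], "B2": ["A", "a"], "B3": ["A", "a"],
    "B4": ["A"], "B5": ["A"],
}
weights = citation_weights(["B1", "B2", "B3", "B4", "B5"], BACKWARD)
ranking = weights.most_common()
```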
  • the patent classification code-based statistic pre-processing module can calculate the following information for the cited patent document set (a document set including documents A and documents a) or a specified patent document set. Further, as quantitative standards, it is possible to obtain the number of citing applications, citing applications- number increase rate, and citing applications-number increase speed (these can be understood from the number of documents included in the cited patent document set), and it is possible to calculate the citing applications-number increase acceleration, citing applicant number, citing applicant number increase rate, citing average applicant number, citing average applicant number increase rate, citing inventor number, citing inventor number increase rate, citing average inventor number, citing average inventor number increase rate, citing application claim number, citing application claim number increase rate, citing average application claim number, citing average application claim number increase rate, citing patent classification code number, citing patent classification code number increase rate, citing average patent classification code number, and citing average patent classification code number increase rate.
  • the method of counting the citing applications-number, citing application number increase rate, citing application number increase speed, citing application number increase acceleration, citing applicant number, citing applicant number increase rate, citing average applicant number, citing average applicant number increase rate, citing inventor number, citing inventor number increase rate, citing average inventor number, citing average inventor number increase rate, citing application claim number, citing application claim number increase rate, citing average application claim number, citing average application claim number increase rate, citing patent classification code number, citing patent classification code number increase rate, citing average patent classification code number, and citing average patent classification code number increase rate is the same as the method of counting the parameters as described above.
  • the patent classification code-based statistic pre-processing module can calculate rankings of applicants, inventors, agents or patent classification codes according to each of the parameters from the cited patent document set. That is, the patent classification code-based statistic pre-processing module can calculate rankings of maximum cited applicants and rankings of maximum cited inventors based on the number of applications. In calculating the rankings of the patent classification codes, the patent classification code-based statistic pre-processing module can calculate rankings of maximum cited patent classification codes for each stage in each patent classification code system by using a lower patent classification code system of the present invention.
  • each patent document included in the cited patent document set includes at least one patent classification code and each of all higher patent classification codes of the included patent classification code (in the patent classification code system) also has been cited. Therefore, it is preferred that this point of view is reflected in calculation of the each-stage maximum cited patent classification code rankings.
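The idea that a citation of a document also counts for every higher patent classification code can be sketched as below. The `PARENT` chain follows the H01F example used elsewhere in this description; the weights and the choice to credit each ancestor only once per document are assumptions made for illustration.

```python
from collections import Counter

# Child -> parent links for a small excerpt of a classification hierarchy
# (chain taken from the H01F example in this description).
PARENT = {
    "H01F 1/032": "H01F 1/03",
    "H01F 1/03": "H01F 1/01",
    "H01F 1/01": "H01F 1/00",
    "H01F 1/00": "H01F",
}

def with_ancestors(code):
    """Return the code followed by all of its higher codes."""
    chain = [code]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain

def cited_code_counts(cited_docs):
    """cited_docs: list of (classification codes of the document, citation weight)."""
    counts = Counter()
    for codes, weight in cited_docs:
        credited = set()                  # credit each code once per document
        for code in codes:
            credited.update(with_ancestors(code))
        for code in credited:
            counts[code] += weight
    return counts

# Hypothetical cited documents with their citation weights.
counts = cited_code_counts([(["H01F 1/032"], 5), (["H01F 1/01"], 3)])
```

A ranking for one stage can then be obtained by keeping only the codes of that stage and sorting them by count.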
  • US patent documents included in the cited patent document set may include patent classification codes of both USPC and IPC.
  • the patent classification code-based statistic pre-processing module can acquire applicant information from a patent document set including the citing information (a document set including document B), can specify the cited patent document set from an applicant-based patent document set including the citing information (a patent document set relating to one of all the applicants is expected to include document B), and can calculate various parameters, such as the number of citing applications, from the specified cited patent document set. At this time, for each parameter also, the rankings of the applicants can be calculated.
  • the patent classification code-based statistic pre-processing module can generate the cited document set from the each-applicant-based document set in the same way and can calculate various parameters for the cited document set.
  • the patent classification code-based statistic pre-processing module can generate a citing document set, i.e. a document set citing each patent document included in the specified patent document set.
  • it is noted from the mapping relation that document B is cited by document C and document c.
  • the number of document B is input in the citing document number field of a search engine.
  • document C, etc. having citing information including document B is output as a result of the search.
  • a query such as select
  • appointing the number of document B as a citing number field in the DBMS 201
  • the mapping relation of B->C and B->c has been arranged as data, it is possible to easily specify a document set including document C citing document B.
  • the patent classification code-based statistic preprocessing module can calculate the following cited-by information (in view of the cited document B, information on the documents included in the citing document C is cited-by information) from a citing document set citing each patent document of the specified patent document set.
  • the patent classification code-based statistic pre-processing module can collect the numbers of the citing patent documents citing each patent document of the specified patent document set (the number of document C and the number of document c) by making an inquiry to the search engine or DBMS 201. It goes without saying that the number of each of the collected citing patent documents corresponds to bibliographical information. Therefore, a citing patent document set citing all patent documents of the specified patent document set is also specified at a particular calculation time point. Therefore, the specified citing patent document set can be an object of analysis, counting or calculation.
  • when the specified patent document set includes document B and document b and both document B and document b are cited by document C, it is natural that document C should be given a weight of 2. That is, document C belongs to both a result of search by document B and a result of search by document b.
  • all calculation values for the citing patent document set may be given a name "cited" or "cited-by."
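A minimal sketch of building the weighted citing document set from a cited-to-citing mapping is given below. The `cited_by` mapping is hypothetical example data arranged like the B->C and B->c relation described above; it is not the format used by the search engine or DBMS 201.

```python
from collections import Counter

# Hypothetical mapping relation: cited document -> documents citing it.
cited_by = {
    "B": ["C", "c"],
    "b": ["C"],
}

def citing_document_weights(specified_set, cited_by):
    """Collect the citing documents of every document in the specified set.

    A citing document found through several cited documents is counted once
    per hit, so its count acts as its weight.
    """
    weights = Counter()
    for doc in specified_set:
        weights.update(cited_by.get(doc, []))
    return weights

citing_weights = citing_document_weights(["B", "b"], cited_by)
```

Here document C is reached through both B and b and therefore carries a weight of 2, while document c carries a weight of 1.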
  • the patent classification code-based statistic pre-processing module can obtain the above-mentioned information for the citing patent document set (document set including document C and document c).
  • the patent classification code-based statistic pre-processing module can calculate rankings of applicants, inventors, agents or patent classification codes according to each parameter. That is, the patent classification code-based statistic pre-processing module can calculate rankings of maximum citing applicants and rankings of maximum citing inventors based on the number of applications.
  • for the patent classification code-based rankings, it is possible to calculate rankings of maximum cited patent classification codes for each stage of each patent classification code system by using the lower patent classification code system of the present invention. That is, it is natural to consider that each patent document included in the citing patent document set includes at least one patent classification code and each of all higher patent classification codes of the included patent classification code (in the patent classification code system) also relates to the citing. Therefore, it is preferred that this point of view is reflected in calculation of the each-stage maximum citing patent classification code rankings.
  • US patent documents included in the citing patent document set may include patent classification codes of both USPC and IPC.
  • the patent classification code-based statistic pre-processing module can analyze the specified patent document set (which includes document B and document b) itself. At this time also, it is preferred to grant a weight corresponding to the number of repetitions by the allowed repetitive counting. For example, when document B has been cited five times and document b has been cited three times, it is preferred to give a weight of 5 to document B and a weight of 3 to document b in calculation of the following parameters, such as the number of all cited applications. By reflecting the weights described above, the patent classification code-based statistic pre-processing module can calculate rankings of most-frequently cited patent documents in the specified patent document set.
  • the patent classification code-based statistic pre-processing module can count or calculate the number of cited applications (since document B is cited by document C, document B can use the name "cited") from the specified patent document set (which includes document B and document b).
  • the patent classification code-based statistic pre-processing module can acquire applicant information from the specified patent document set (document set including document B), can specify the citing patent document set from the applicant-based patent document set (document B is expected to be included in a patent document set of one applicant from among the most-frequent applicants), and can calculate various parameters, such as the number of cited applications, from the specified citing patent document set. At this time, for each parameter, the rankings of the applicants can be calculated.
  • the patent classification code-based statistic pre-processing module can generate the citing document set from the each-applicant-based document set in the same way and can calculate various parameters for the citing document set.
  • the method of calculating various parameters by the patent classification code-based statistic pre-processing module of the present invention is characterized in that calculation values are generated for each specified or determined patent document set. Therefore, from the standpoint of the patent classification code-based statistic pre-processing module, the existence of a specified patent document set itself is important, and the attributes and size of the patent document set do not matter (however, in order to perform calculation in relation to citing, the specified patent document set is required to include citing information).
  • the patent classification code-based statistic preprocessing module of the present invention can calculate at least one parameter among the parameters for a patent document set appointed by a user using the patent classification code-based statistic pre-processing module.
  • Patent information processing basic module 40 includes 1) a search engine module, 2) a multidimensional analysis operation result table generation module 402, 3) an analysis module, 4) a monitor module 403, 5) a patent document set obtaining module, 6) a directory creation module 405, 7) a reporting module 406, 8) simplified analysis module 407.
  • Heterogeneous multi-patent classification code correlation pre-processing module: The homogeneous multi-patent classification code correlation pre-processing module described above processes homogeneous patent classification codes.
  • the heterogeneous multi-patent classification code correlation pre-processing module will be described.
  • the heterogeneous multi-patent classification code correlation pre-processing module has nearly the same operation principle as that of the homogeneous multi-patent classification code correlation pre-processing module.
  • Almost all patent documents issued in the United States include both IPC codes and USPC codes, and patent documents issued in Japan also include both IPC codes and FT codes. Let us assume that one patent document is granted IPC1, ..., IPCn and USPC1, ..., USPCm.
  • the correlation combination between IPC1 and USPC1 corresponds to a couple of a main IPC code and a main USPC code.
  • the correlation combination between IPC1 and each of USPC2, ..., USPCm corresponds to a couple of a main IPC code and a sub USPC code.
  • the correlation combination between each of IPC2, ..., IPCn and USPC1 corresponds to a couple of a sub IPC code and a main USPC code.
  • the correlation combination between each of IPC2, ..., IPCn and each of USPC2, ..., USPCm corresponds to a couple of a sub IPC code and a sub USPC code.
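The four kinds of couples can be enumerated with a short sketch. It assumes, for illustration only, that the first code in each list is the main code and the remaining codes are sub codes; the function name `classify_couples` is hypothetical.

```python
from itertools import product

def classify_couples(ipc_codes, uspc_codes):
    """Pair every IPC code with every USPC code and label each couple.

    Assumption: index 0 of each list holds the main code.
    """
    couples = []
    for (i, ipc), (j, uspc) in product(enumerate(ipc_codes), enumerate(uspc_codes)):
        kind = ("main" if i == 0 else "sub", "main" if j == 0 else "sub")
        couples.append((ipc, uspc, kind))
    return couples

couples = classify_couples(["IPC1", "IPC2"], ["USPC1", "USPC2", "USPC3"])
```

For n IPC codes and m USPC codes this yields n x m couples: one main-main couple, m-1 main-sub couples, n-1 sub-main couples, and (n-1)(m-1) sub-sub couples.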
  • the heterogeneous multi-patent classification code correlation pre-processing module can perform the same operation as the homogeneous multi-patent classification code correlation pre-processing module.
  • since the processed codes are heterogeneous patent classification codes, they have no minimum common patent classification code, and the higher patent classification codes considered can be limited to preset levels. For example, a couple of a given IPC code and a USPC code may be limited to have higher patent classification codes up to the subclass of the IPC and the class of the USPC.
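Limiting a heterogeneous couple to the preset levels can be sketched as follows. The sketch assumes IPC codes written like "H04B 7/02" (the subclass is the first four characters) and USPC codes written as "class/subclass"; the USPC value "455/101" is only an illustrative input.

```python
def ipc_subclass(ipc_code):
    """Return the IPC subclass, e.g. 'H04B' from 'H04B 7/02'."""
    return ipc_code.split()[0][:4]

def uspc_class(uspc_code):
    """Return the USPC class, e.g. '455' from '455/101'."""
    return uspc_code.split("/")[0].strip()

def limit_couple(ipc_code, uspc_code):
    """Highest-level couple allowed for a heterogeneous (IPC, USPC) combination."""
    return (ipc_subclass(ipc_code), uspc_class(uspc_code))

limited = limit_couple("H04B 7/02", "455/101")
```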
  • the heterogeneous multi-patent classification code correlation preprocessing module may generate correlation tables as described above for a given patent classification code combination (Ai, Bj) (wherein Ai and Bj are patent classification codes of different systems).
  • the generated correlation tables may be granted a weight or contribution degree weight.
  • each heterogeneous patent classification code combination (Ai, Bj) (wherein Ai and Bj are patent classification codes of different systems) corresponds to a patent document number, and it is possible to perform counting and calculation reflecting bibliographical information obtained from the patent document number.
  • Search of a patent classification code refers to a search process in which, when a technical keyword is input as a search phrase, a technical classification code including the technical keyword as title information is output as a search result.
  • H electricity Class H01 basic electrical element
  • Sub-class H01F magnet main group: H01F 1/00 magnet or magnetic body characterized by its magnetic material
  • 6-dot sub-group H01F 1/053 ******those including rare-earth metal
  • the first characteristic of the patent classification code search according to the present invention is to provide a lower patent classification code found through the search, together with higher patent classification codes of the lower patent classification code found through the search.
  • “hard magnetic material” is input as a search phrase
  • the preset level is subclass in the case of IPC, class in the case of USPC, theme in the case of FT, and subclass in the case of ECLA or FI. Of course, it is okay to present a search result even above or below the preset level.
  • H01F magnet main group H01F 1/00 magnet or magnetic body characterized by its magnetic material
  • a keyword input as a search phrase is searched for in the patent classification code indexes including patent classification codes and title information attached to the patent classification codes, so as to find at least one patent classification code as a result of the search, which includes the input search phrase in the title information of the patent classification code (when "hard magnetic material" is input, H01F 1/032 is found).
  • the found patent classification code is searched for in the patent classification code master DB 203, so as to find higher patent classification codes of the patent classification code up to a preset stage (H01F 1/03, H01F 1/01, H01F 1/00, and H01F are found).
  • the found higher patent classification codes are searched for in the patent classification code master DB 203, so as to obtain title information of the found higher patent classification codes.
  • patent classification codes and their title information obtained in the first to third stages described above are output or displayed together with information indicating the hierarchical structure including dots.
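The search stages above can be sketched as follows. The `PARENT` and `TITLE` dictionaries stand in for the patent classification code index and master DB 203; they are hypothetical, the title of H01F 1/03 is a placeholder, and the title of H01F 1/032 is assumed from the search example. Dots are rendered as asterisks, with the subclass and main group shown without dots.

```python
# Hypothetical excerpt of a classification hierarchy (child -> parent).
PARENT = {
    "H01F 1/032": "H01F 1/03",
    "H01F 1/03": "H01F 1/01",
    "H01F 1/01": "H01F 1/00",
    "H01F 1/00": "H01F",
}
# Title information; the entry for H01F 1/03 is a placeholder.
TITLE = {
    "H01F": "magnet",
    "H01F 1/00": "magnet or magnetic body characterized by its magnetic material",
    "H01F 1/01": "those made from inorganic materials",
    "H01F 1/03": "(placeholder title)",
    "H01F 1/032": "hard magnetic material",
}

def search_codes(phrase):
    """Stage 1: find codes whose title contains the search phrase."""
    needle = phrase.lower()
    return [code for code, title in TITLE.items() if needle in title.lower()]

def chain_to_top(code):
    """Stage 2: collect higher codes, returned from the top level down."""
    chain = [code]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain[::-1]

def render(code):
    """Stages 3 and 4: attach titles and show the hierarchy with dots."""
    lines = []
    for depth, c in enumerate(chain_to_top(code)):
        dots = "*" * max(0, depth - 1)
        lines.append(f"{c} {dots}{TITLE[c]}")
    return lines

found = search_codes("hard magnetic material")
result_lines = render(found[0])
```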
  • title information of each patent classification code includes title information of higher patent classification codes up to the highest patent classification code in Table 23, it is possible to allow title information of each patent classification code to be incorporated with title information of higher patent classification codes up to only a preset level. It is preferred that the preset level allowing the incorporation is subclass in the case of IPC, class in the case of USPC, theme subclass in the case of FT, and subclass in the case of ECLA or FI. In a state in which the patent classification codes have been indexed line by line with the incorporated title information, input of "hard magnetic material" AND "rare-earth" as a search phrase can yield "H01F 1/053" as a result of the search.
  • H01F magnet main group H01F 1/00 magnet or magnetic body characterized by its magnetic material 1-dot sub-group: H01F 1/01 *those made from inorganic materials
  • FIG. 16 illustrates an example of an operation of the patent classification code search module 401 according to the present invention.
  • the search includes: a step for selection of a patent classification code; a step for selection of a search phrase in order to find the patent classification code; a step for input of at least one search phrase; and a step of performing the search using the input search formula, thereby generating a search result.
  • the patent classification code search module 401 can achieve the object of the present invention without using the incorporated title information.
  • the patent classification code search module 401 can achieve the same object by examining if they have a child-parent relation in the patent classification code system.
  • the patent document set obtaining module 404 includes an automatically-selected document set obtaining module 404-1 and a user-created document set obtaining module
  • the user-created document set obtaining module 404-2 includes a search-formula-using document obtaining module 404-2-1 and a selection-on- directory document obtaining module 404-2-2 that enables obtainment of a document set through selection on a directory capable of specifying a document set, such as an IPC directory.
  • the patent document set obtaining module 404 can obtain or select a patent document set generated by either itself or others, and the patent document set obtaining module 404 can generate a new patent document set through a set operation (including a union, a difference set, a NOT operation) between at least two patent document sets.
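The set operations between patent document sets map directly onto ordinary set algebra. The sketch below uses hypothetical document numbers, and treats the NOT operation as a complement relative to an assumed universe of all documents.

```python
# Hypothetical patent document sets identified by document numbers.
set_a = {"KR-1", "KR-2", "KR-3"}
set_b = {"KR-2", "KR-3", "KR-4"}
all_documents = {"KR-1", "KR-2", "KR-3", "KR-4", "KR-5"}

union_set = set_a | set_b          # documents in either set
difference_set = set_a - set_b     # documents in A but not in B
not_b = all_documents - set_b      # NOT B, relative to the assumed universe
```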
  • the search-formula-using document obtaining module 404-2-1 includes a search engine-using document set obtaining module 404-2-1-1 making an inquiry to the search engine, and a DBMS-using document set obtaining module 404-2-1-2 making an inquiry to the DBMS 201, which are divided according to an object to which the search formula is sent.
  • the automatically-selected document set obtaining module 404-1 automatically obtains the various determined patent document sets described above in relation to the determination of the patent document set according to the present invention, except for the patent document sets generated by the user.
  • Multi-dimensional analysis operation result table generating module 402: Hereinafter, the multi-dimensional analysis operation result table generating module 402 of the present invention will be described.
  • an SQL query is conventionally used to retrieve a required result.
  • even if a DB structure is well-designed, multiple tables have to be joined to deduce one result.
  • when a table size is large, it takes a very long time to process a selection query.
  • the extracted 1 dot subgroup IPC based on the data of all Korea patents of Samsung Electronics Inc., the number of each-year's applications is searched out and numerously-filed 1 dot subgroup IPCs are searched out.
  • the ratio of the number of applications of Samsung Electronics Inc. to the number of all applications is obtained, thereby obtaining an occupancy rate of Samsung Electronics Inc. in each numerously-filed 1 dot subgroup IPC. If this is made into one SQL query, the SQL query is too long, and requires a long process time.
  • the multi-dimensional operation includes a roll-up operation, a cube operation, or the like, and the result of such an operation for patent data is stored in a DB table.
  • the table may be called a cube, a materialized view, a multi-dimensional analysis operation result table, a view, or the like, but the various names indicate the common subject.
  • an SQL query is used to extract required data.
  • an MDX (multidimensional expression) query is conventionally used, but the MDX query is generally called SQL for convenience's sake in the present specification.
  • an SQL query is made for access to a multi-dimensional analysis roll-up operation result table
  • an MDX query is made for access to a multi-dimensional analysis cube operation result table.
  • the term SQL query is representatively used for convenience's sake in the description of the present specification.
  • an SQL query on a multi-dimensional analysis operation result table can be understood in two ways: 1) when the table results from a roll-up operation, an SQL query is made, and 2) when the table results from a cube operation, an MDX query is made.
  • multi-dimensional analysis indicates that the analysis is performed by one or more dimensions (of course, one-dimensional analysis is included).
  • an operation result of the steps from 1 to n-1 (or n) is stored when it is assumed that a user's required analysis result is logically generated by n information-processing steps.
  • an operation result only up to an intermediate step is stored in the multi-dimensional analysis operation result table, and a final analysis result may be generated by a DBMS or SQL query formula, because an SQL query or DBMS has an operation function.
  • the multi-dimensional analysis operation result table generating module 402 indicates an engine for performing a roll-up or cube operation of patent information, or another required operation or a data process, and generating the result as a table in order to rapidly output required information.
  • an analysis module of the present invention, which uses the table, can rapidly generate required data by making a simplified SQL query in a table generated by the multi-dimensional analysis operation result table generating module 402 when a roll-up, a drill-down, or a drill-through occurs.
  • the multi-dimensional analysis operation result table generating module 402 performs a roll-up operation and/or a cube operation in each of at least one analysis subject category, and stores the results in a table.
  • the patent document master DB 202 includes a bibliographical information master DB, etc.
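The benefit of a precomputed result table can be illustrated with an in-memory SQLite database. This is a sketch under stated assumptions: the table names, column names, applicant names and rows are all hypothetical, and a plain GROUP BY stands in for the roll-up operation performed by module 402.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE application (applicant TEXT, year INTEGER, ipc TEXT)")
rows = [
    ("Applicant X", 2005, "H04B 7/02"),
    ("Applicant X", 2005, "H04B 7/02"),
    ("Applicant X", 2006, "H04B 7/04"),
    ("Applicant Y", 2005, "H04B 7/02"),
    ("Applicant Y", 2006, "H04B 7/04"),
    ("Applicant Y", 2006, "H04B 7/04"),
]
conn.executemany("INSERT INTO application VALUES (?, ?, ?)", rows)

# Intermediate step computed once and stored as a result table.
conn.execute("""
    CREATE TABLE result_by_applicant_ipc_year AS
    SELECT applicant, ipc, year, COUNT(*) AS n_applications
    FROM application
    GROUP BY applicant, ipc, year
""")

def occupancy_rate(applicant, ipc):
    """Final step: a short query on the stored result table."""
    own = conn.execute(
        "SELECT COALESCE(SUM(n_applications), 0) FROM result_by_applicant_ipc_year "
        "WHERE applicant = ? AND ipc = ?", (applicant, ipc)).fetchone()[0]
    total = conn.execute(
        "SELECT COALESCE(SUM(n_applications), 0) FROM result_by_applicant_ipc_year "
        "WHERE ipc = ?", (ipc,)).fetchone()[0]
    return own / total if total else 0.0

rate = occupancy_rate("Applicant X", "H04B 7/02")
```

The final occupancy rate is derived by short queries on the stored counts rather than by one long query over the raw data.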
  • the patent classification code master DB 203 includes a patent classification code DB according to each type of a patent classification code system, etc.
  • the subject master DB 204 includes an applicant name DB, an inventor name DB, etc.
  • the applicant name DB includes a representative applicant name made for each nation.
  • the multi-dimensional analysis operation result table generating module 402 performs a multi-dimensional analysis operation and generates the result in a table.
  • the table may include multiple tables, and the multiple tables are configured as a DW (data warehouse).
  • the DB schema of a patent information table used by the multi-dimensional analysis operation result table generating module 402 is preferably configured as a star-schema.
  • An E-R diagram (Entity Relationship Diagram) of tables re-configured as a star-schema has a FACT table at the center, and a dimension table as a reference information table referring to the FACT table.
  • the FACT table preferably includes bibliographical information data on a patent document in each field (the data of the dimension table is excluded, but in the dimension table, at least one ID of each dimension, such as an IPC_ID indicating the specific IPC(s) included in a patent document, an application ID indicating a specific applicant(s), a date ID indicating a date, is included).
  • the dimension table includes tables of other objects that may be included in a dimension, such as a patent classification code table (IPC is necessary, and in the case of a FACT table of each nation, a patent classification code of the nation), a date table (for example, year), a subject table (for example, applicant/agent/inventor), a location table (for example, nation/location/company). Meanwhile, of course, in the dimension table, IDs corresponding to various IDs included in the FACT table are included.
  • an IPC dimension table includes the IPC_ID which corresponds to a specific one IPC.
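A star-schema layout of this kind can be sketched in SQLite as follows. All table names, column names and rows are hypothetical; the sketch only shows a central fact table whose IDs match the IDs held in the dimension tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_ipc (ipc_id INTEGER PRIMARY KEY, ipc_code TEXT);
    CREATE TABLE dim_applicant (applicant_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE dim_date (date_id INTEGER PRIMARY KEY, year INTEGER);
    -- Fact table at the center: one row per (document, IPC) with dimension IDs.
    CREATE TABLE fact_patent (
        application_number TEXT,
        ipc_id INTEGER REFERENCES dim_ipc(ipc_id),
        applicant_id INTEGER REFERENCES dim_applicant(applicant_id),
        date_id INTEGER REFERENCES dim_date(date_id)
    );
    INSERT INTO dim_ipc VALUES (1, 'H04B 7/02'), (2, 'H04B 7/04');
    INSERT INTO dim_applicant VALUES (1, 'Applicant X');
    INSERT INTO dim_date VALUES (1, 2005), (2, 2006);
    INSERT INTO fact_patent VALUES
        ('APP-1', 1, 1, 1), ('APP-2', 2, 1, 2), ('APP-3', 1, 1, 2);
""")

# Count applications per IPC code and year by joining facts to dimensions.
counts = conn.execute("""
    SELECT i.ipc_code, d.year, COUNT(*)
    FROM fact_patent f
    JOIN dim_ipc i ON f.ipc_id = i.ipc_id
    JOIN dim_date d ON f.date_id = d.date_id
    GROUP BY i.ipc_code, d.year
    ORDER BY i.ipc_code, d.year
""").fetchall()
```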
  • the tables re-configured as a star-schema may be basically configured according to each nation or unified nations. In unified nations or in other cases, some data related to multiple nations, such as family information data or INPADOC/DocDB data, is preferably managed by an additional table.
  • the multi-dimensional analysis operation result table generating module 402 generates a multi-dimensional analysis operation result table for various analysis objects of the present invention by using the data of a patent information table re-configured in a star-schema. Meanwhile, the multi-dimensional analysis operation result table generating module 402 may generate another multi-dimensional analysis operation result table by using the generated multi-dimensional analysis operation result table, so as to analyze another analysis subject or a complicated analysis subject.
  • the multi-dimensional analysis operation result table generating module 402 uses at least one or more of 1) a patent information table re-configured in a star-schema, 2) a patent document master DB 202, a patent classification code master DB 203, and a subject master DB 204, and 3) a multi-dimensional analysis operation result table.
  • the patent information table re-configured in a star-schema configuration generally represents 1) to 3), but 2) and 3) are not excluded.
  • the multi-dimensional analysis operation result table generating module 402 includes an analysis DW generation module 402-1 for generating the analysis DW, various kinds of modules generating an analysis-subject-based multi-dimensional analysis operation result, and other modules for performing a function required for generating a multi-dimensional analysis operation result of the present specification.
  • the various kinds of modules for generating an analysis-subject-based multi-dimensional analysis operation result include a total amount analysis multidimensional analysis operation result table generating module 402-2, a citation-analysis multi-dimensional analysis operation result table generating module 402-3, a competitive-analysis multi-dimensional analysis operation result table generating module 402-4, an inventor-analysis multi-dimensional analysis operation result table generating module 402-5, a patent technology classification-analysis multi-dimensional analysis operation result table generating module 402-6, a convergence-analysis multi-dimensional analysis operation result table generating module 402-7, a representing phrase-analysis multi-dimensional analysis operation result table generating module 402-8, etc.
  • the modules generate an analysis DW 205-1, a total amount analysis multi-dimensional analysis operation result table 205-2, a citation-analysis multi-dimensional analysis operation result table 205-3, a competitive-analysis multi-dimensional analysis operation result table 205-4, an inventor-analysis multi-dimensional analysis operation result table 205-5, a patent technology classification-analysis multi-dimensional analysis operation result table 205-6, a convergence-analysis multi-dimensional analysis operation result table 205-7, and a representing phrase-analysis multi-dimensional analysis operation result table 205-8, respectively.
  • the relation between the multi-dimensional analysis operation result table generating module 402 and the analysis module of the present invention will be described.
  • the analysis module includes at least one or more analysis formula (an SQL query) corresponding to an analysis subject, obtains a required analysis result of each analysis subject by the SQL query in the multi-dimensional analysis operation result table, processes the result provided from the system 1 of the present invention, and provides the processed result to a user of the present system.
  • an SQL query an analysis formula
  • the multi-dimensional analysis operation result table generating module 402 may generate at least one or more tables according to each analysis object within a wider range than a predetermined scale/stage in consideration of the performance of the analysis module, such as decrease in extraction/calculation/obtaining time of the required data, and reduction of computer resource consumption.
  • the multi-dimensional analysis operation result table generating module 402 although final analysis result display data to be provided to a user's screen can be generated, it is more preferable that the final analysis result display data is generated by using various kinds of commands provided from an SQL query itself after data only up to certain intermediate stages in each analysis subject is generated.
  • the latter case is preferable because lots of final analysis result displays result in huge inefficiency (waste of computer resource).
  • if the optimization of computer resources is given up (that is, the response speed is required to be maximized), the former case may be allowable.
  • the roll up computation means adding from a small unit to a large unit in one direction with respect to one dimension at one time. That is, one example of the roll up computation is an addition from a month unit value to a quarter year unit value and then from the quarter year unit value to a year unit value, and thus the roll up means approaching, acquiring and extracting the result from the adding up to a higher unit in one dimension.
  • 58 data in the year of 2006 are comprised of 6 cases in a first quarter, 15 cases in a second quarter, 24 cases in a third quarter, and 13 cases in a fourth quarter, and these 58 data in 2006 are again divided into 6 cases in a first quarter, 15 cases in a second quarter, 24 cases in a third quarter, and 13 cases in a fourth quarter (because they are added from the divided separate cases), and also cases in each quarter year can be divided into a month unit.
  • an operation of approaching, acquiring and extracting by subdividing from a bigger unit to a smaller unit in one dimension is called drill down.
  • the roll up and drill down are like two sides of a coin, and if there is multi-dimension computed data obtained by performing a multi-dimension computation, such as a roll up computation, on data from the smallest unit to the biggest unit, we can see the data by a drill down from the biggest higher rank unit to the lower rank unit.
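Roll up and drill down along the time dimension can be sketched with the 2006 figures given above. The quarterly totals (6, 15, 24, 13) come from the text; the monthly split is hypothetical and chosen only so that it adds up to those totals.

```python
from collections import defaultdict

# Hypothetical monthly counts for 2006 that sum to the quarterly figures 6, 15, 24, 13.
monthly = {(2006, m): n
           for m, n in enumerate([1, 2, 3, 4, 5, 6, 7, 8, 9, 3, 4, 6], start=1)}

def roll_up_to_quarter(monthly):
    """Add month values up to quarter values."""
    quarterly = defaultdict(int)
    for (year, month), n in monthly.items():
        quarterly[(year, (month - 1) // 3 + 1)] += n
    return dict(quarterly)

def roll_up_to_year(quarterly):
    """Add quarter values up to year values."""
    yearly = defaultdict(int)
    for (year, _quarter), n in quarterly.items():
        yearly[year] += n
    return dict(yearly)

def drill_down(quarterly, year):
    """Show the quarter values behind one year value."""
    return {q: n for (y, q), n in sorted(quarterly.items()) if y == year}

quarterly = roll_up_to_quarter(monthly)
yearly = roll_up_to_year(quarterly)
quarters_2006 = drill_down(quarterly, 2006)
```

Because the rolled-up values are stored at every level, a drill down only needs to look up the level below rather than recount the underlying documents.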
  • OLAP online analytic process
  • a conventional OLAP engine supports functions such as a drill down, a drill through, etc.
  • the roll up and drill down will be explained as an example of table 26.
  • the following table 33 represents the number of a patent application of a specific corporation "A" based on H04B 7/02.
  • Numbers inside parentheses, ( ), refer to the number of documents classified as a specific IPC from among applied patent documents of corporation "A".
  • numbers inside braces, { }, are multi-dimension computed numbers.
  • Title information is added for reference, and it is independent of the roll up. However, because it is difficult for users to understand what a classification code means only by a patent classification code when drilling down, providing the title information is proper, and furthermore, the title information is included to facilitate understanding the multi-stage structure of the patent classification code (the reason why lower patent classification codes are included).
  • 4 dot subgroup is not below H04B 7/02, but it is added for reference to show the multi-stage structure of the patent classification code. Except the below table, tables without the 4-dot subgroup and its title information will be provided and explained. [Table 26]
  • multi-dimensional computed documents of H04B 7/00 equal 2650, which indicates the number of documents including H04B 7/00 from among patent documents and a multi-dimensional computed number in another patent classification code below this patent classification code.
  • "" included in Table 26 means that the lower patent classification codes include codes other than those recited in Table 33. It can be seen that obtaining only the information recited as H04B 7/02 is improper when patent document information for H04B 7/02 is taken (in search or counting, or any other statistic and analysis calculations). The reason is that it is more proper that the patent information for H04B 7/02 includes patent information for all patent classification codes under H04B 7/02 in the hierarchical structure of a patent classification code system.
  • a counting value for H04B 7/04 includes a value recited as H04B 7/04, a value recited as H04B 7/06 and a value recited as H04B 7/08.
  • H04B 7/015 does not have lower patent classification code information, so an additional change may not be found.
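The inclusion of lower codes described above can be sketched as follows. This is a minimal illustration only: the `CHILDREN` map is a hypothetical fragment of the hierarchy taken from the codes named in the text, and the numbers in `DIRECT` are made up rather than taken from Table 26.

```python
# Hypothetical fragment of the IPC hierarchy: code -> immediately lower codes.
CHILDREN = {
    "H04B 7/02": ["H04B 7/04"],
    "H04B 7/04": ["H04B 7/06", "H04B 7/08"],
}

# Illustrative direct counts: documents recited with exactly this code.
DIRECT = {
    "H04B 7/015": 6,
    "H04B 7/02": 3,
    "H04B 7/04": 5,
    "H04B 7/06": 2,
    "H04B 7/08": 4,
}


def rolled_up(code):
    """Return the direct count of `code` plus the counts of all lower codes."""
    total = DIRECT.get(code, 0)
    for child in CHILDREN.get(code, []):
        total += rolled_up(child)
    return total
```

With these assumed numbers, the value for H04B 7/04 covers H04B 7/04, H04B 7/06 and H04B 7/08, while H04B 7/015, which has no lower codes in the map, keeps its direct count.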
  • H04B 7/02 data provided in the following Table 29 can be shown. [Table 29]
  • Table 27 through Table 37 show the rolled up application date/application year for a specific patent classification code of corporation "A". That is, during drill down each multi-dimension computed result can be divided into the period from the past to the year 2000, the years 2001, 2002, 2003, 2004 and 2005, and the period from the year containing the date one year and six months before the present date until the present date ("recently" is short for this period). To do this, the number of documents corresponding to each classification code in the patent document set of corporation "A" needs to be rolled up by year beforehand.
  • Table 31 shows its one example.
  • this system 1 can accept that user's action, extract data for lower dimensions (for example, a quarter) of the year of 2001 and provide it for the user. This is provided in Table 32. [Table 32]
  • time can be a dimension that serves as a direction axis in roll up or drill down, in various units such as date, week, month, quarter, year and the like.
  • a patent classification code on a patent classification code system can be a dimension that serves as a direction axis in roll up or drill down.
  • the most basic time dimension can be selected from one or more unit periods among date, week, month, quarter, year and a plurality of years, and this multi-stage time dimension can be applied to the many kinds of time attribute fields included in a patent document, such as application date, publication date, registration date, etc.
  • the patent classification code dimension can be established by reflecting multi-stage of a patent classification code system.
  • a bundle of at least one of the selected patent classification codes can be established as a separate dimension in one unit. For example, for a major theme of "RFID", multi-stage minor themes corresponding to the major theme are established, and then an individual dimension can be established by mapping a bundle of patent classification codes to each minor theme.
  • This individual dimension is particularly useful for a personalized multi-dimension analysis operation result table (a personalized cube), that is, a table or cube made by an individual user for the purpose of an analysis.
  • a region can be one dimension.
  • of the region units, the one most easily obtained from a patent document is the nation, and a dimension with a multi-stage hierarchy can be generated by using address information to divide one nation into regions.
  • a subject can be one dimension.
  • the subject includes applicants, inventors, agents, etc., and the attribute of an applicant (a corporation, a laboratory, a university, etc.), a quantitative attribute such as scale (large, medium, small, etc.), an inter-applicant attribute such as head office-subsidiary, and the like can also be a dimension.
  • a corporation's financial and monetary information, such as its stock price, sales, profit ratio, etc., can be a dimension.
  • a meta attribute that can arbitrarily correspond to a corporation such as a global corporation, a local corporation, etc. can be a dimension.
  • an attribute regarding conditions of a document such as application, registration, rejection, etc. can be a dimension.
  • a group each counting value belongs to can be a dimension.
  • One example of these groups is 1 through 5, 6 through 10, 11 through 15, and more than 15 claims.
  • the number of joint applicants, the number of joint inventors, the number of family (the number of domestic family, the number of overseas family, etc.) and the like may not be a dimension in itself, but when the above number is grouped, a group the above number belongs to can be a dimension.
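Grouping a count so that the group, rather than the raw number, becomes the dimension can be sketched as below. The group labels and the non-overlapping bounds are assumptions chosen for illustration, and `claim_group` and `group_counts` are hypothetical helper names.

```python
from collections import Counter


def claim_group(n_claims):
    """Map a claim count to the group it belongs to (bounds are illustrative)."""
    if n_claims < 1:
        raise ValueError("a claim count must be at least 1")
    if n_claims <= 5:
        return "1-5"
    if n_claims <= 10:
        return "6-10"
    if n_claims <= 15:
        return "11-15"
    return "16+"


def group_counts(claim_counts):
    """Count how many documents fall into each claim group."""
    return Counter(claim_group(n) for n in claim_counts)
```

The same pattern would apply to the number of joint applicants, joint inventors or family members, each with its own group bounds.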
  • a multi-dimension analysis operation result table generating module 402 according to the present invention generates a multi-dimension analysis operation result table by performing a multi-dimension computation on at least one selected dimension.
  • when the multi-dimension analysis operation result table generating module 402 according to the present invention generates the multi-dimension analysis operation result table, suppose that a patent classification code is included in a dimension.
  • a multi-dimension computation value for the given patent classification code is generated by considering the patent classification code and a higher patent classification code.
  • when a multi-dimension computation value for any patent classification code is generated and obtained in this way, the value obtained reflects the multi-dimension computed numerical values of the patent classification code and all its lower patent classification codes.
  • the multi-dimension analysis operation result table generating module 402 reflects a value resulting from performing a multi-dimension computation on a patent classification code not only on that patent classification code but also on its higher patent classification codes. For example, when document number #1 is assigned H04B 7/06 in IPC and multi-dimension computation data is generated from this document, H04B 7/06 is assigned the counting value of 1, and H04B 7/04 and H04B 7/02, the higher levels immediately above H04B 7/06, should also be assigned a counting value. Of course, the levels from H04B 7/00 upward are assigned a counting value as well.
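The propagation of one document's count to its higher codes can be sketched as follows. The `PARENT` map is a hypothetical stand-in for the classification hierarchy and stops at the main group for brevity. Counting a document only once per code, even when several of its codes share a higher code, is an assumption of this sketch and not a policy stated in the text.

```python
from collections import Counter

# Hypothetical parent links, following the example codes in the text.
PARENT = {
    "H04B 7/06": "H04B 7/04",
    "H04B 7/04": "H04B 7/02",
    "H04B 7/02": "H04B 7/00",
}


def code_and_ancestors(code):
    """Yield the code itself, then every higher code known to PARENT."""
    while code is not None:
        yield code
        code = PARENT.get(code)


def count_documents(doc_codes):
    """doc_codes maps a document id to the list of codes assigned to it."""
    counts = Counter()
    for codes in doc_codes.values():
        # The set keeps one document from being counted twice at a shared higher code.
        touched = {c for code in codes for c in code_and_ancestors(code)}
        counts.update(touched)
    return counts
```

For a single document `#1` assigned H04B 7/06, every code from H04B 7/06 up to H04B 7/00 receives a counting value of 1.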
  • a dimension mainly serves as a direction axis of the roll up and drill down. Representative dimensions are a patent classification code, such as IPC, and time. A nation, a region, an applicant, an inventor, status, citation, family information and the like can also be a dimension axis.
  • the multi-dimension analysis operation result table generating module 402 according to the present invention calculates an analysis index value beforehand using at least one of the selected dimensions as an object axis of roll up, wherein the analysis index value includes total amount, an occupying index, a concentration index, an activity index or the like.
  • a dimension can be subdivided into steps as entire nations - a separate nation - a region within a separate nation, such as entire nations - Korea - a special city of Seoul - GangNam district .... etc.
  • patent information includes a nation and/or address information of an applicant and/or an inventor.
  • a dimension can also be divided in the same way, for example, time - degree - period —
  • the roll up/drill down will be in the direction of applicant (entire nations) applicant (entire nations) inventor of each nation. Meanwhile, the roll up/drill down will be according to the attributes of an applicant in one nation.
  • an applicant can be divided into organization units such as a corporation, a university, a laboratory, etc., the corporation can be divided into a large enterprise, enterprise of middle standing, and small and medium enterprises, the university can be divided into a national/public university, a private university, a college, etc., and the laboratory can be subdivided into a public laboratory and a private laboratory, and therefore, roll up/drill down regarding these things will be performed.
  • mapping information for the applicant must exist. For example, if applicant "A" is a corporation, information that its scale is a small and medium enterprise needs to be mapped.
  • roll up/drill down can be performed. For example, if the financial statements of a corporation provide information on sales, profit and stock price, and on the rise and fall ratio of each of these, roll up/drill down can be performed in various categories such as corporation - sales scale - range of the sales scale, corporation - profit - rise and fall ratio of the profit, or corporation - stock price - range of the rise and fall ratio of the stock price. That is, the nature of roll up/drill down is that, if there is a lot of category information on one object, the processing results are arranged according to each category.
  • One patent document includes a nation, time, an applicant, inventors and patent classification code information, and the applicant information further includes region information of the applicant. Further, if an applicant is a corporation, financial statements, etc. of the corporation can be built as corporation information DB.
  • corporation information DB 1) if one selected category, such as patent classification code information, in an entire or any defined patent document set unit has a multi-stage hierarchical structure, many kinds of information corresponding to the patent classification codes selected in that category are processed (searched, counted, or subjected to any other statistical and analysis calculations) and the results can be output, and 2) roll up/drill down can be performed based on various intersections of two or more kinds of categories (it is obvious that when there are n kinds of categories, the possible number of intersections is nCr (that is, n Combination r, where r is at least 1 and at most n); not all combinations need be selected, and some combinations can have a special analytic meaning) in an entire
  • the first case is when using 2 categories. For example, when roll up/drill down is performed on category "A" (such as time) and category "B" (such as IPC), it is desirable to drill down in one direction for the benefit of expressing the results of information processing (search, counting or any other statistical and analysis calculations). (For example, the time axis is not drilled down into detail and only the IPC is spread, as shown above when the concept of roll up/drill down was explained.)
  • the roll up should be performed in both directions. That is, the roll up is performed based on the value of information for a cell comprising all combinations of a predefined unit of category "A" and a predefined unit of category "B", and the roll up information needs to be processed in each cell in the direction of category "A" and in the direction of category "B".
  • the immediate above table shows one example.
  • Each number in a cell represents the value of information of the cell.
  • a number in { } represents the number of patent registrations rolled up for each year from 2000 to 2005 at each patent classification code level
  • 17 in { } for H04B 7/02 represents the number of patent documents rolled up for H04B 7/02 and all its lower patent classification codes.
  • the above table is rolled up with respect to years (for example, it indicates how many documents exist in 2005, how many of them exist in H04B 7/02 in 2005, etc.), but preferably these values are indicated after roll up, and it is preferable that multi-dimension computation can also be performed with respect to the quarters and months of 2005.
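Rolling up a two-category table in both directions can be sketched as below. The function name `roll_up_both` and the cell layout (a dictionary keyed by a pair of category units) are assumptions made for this illustration, and the usage numbers are invented.

```python
from collections import defaultdict


def roll_up_both(cells):
    """cells maps (category_a_unit, category_b_unit) to a value.

    Returns the totals along category A, the totals along category B,
    and the grand total over all cells.
    """
    by_a = defaultdict(int)
    by_b = defaultdict(int)
    grand = 0
    for (a, b), value in cells.items():
        by_a[a] += value   # roll up across category B for each unit of A
        by_b[b] += value   # roll up across category A for each unit of B
        grand += value
    return dict(by_a), dict(by_b), grand


cells = {
    ("H04B 7/02", 2004): 3,
    ("H04B 7/02", 2005): 4,
    ("H04B 7/04", 2005): 2,
}
by_ipc, by_year, total = roll_up_both(cells)
```

Here `by_ipc` holds the per-code totals over all years and `by_year` holds the per-year totals over all codes, so either axis can be drilled down without recomputing the cells.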
  • the second case is when using three or more categories. Here too, it is preferable that drill down is spread in one direction, and spreading in two directions is allowed. However, spreading in three directions is best avoided because it is impossible or very difficult to express visually. Likewise, it is preferable that roll up has been computed for all categories. However, depending on the roll up computation amount and the frequency of use, roll up for a dimension axis that does not require it beforehand is better left unperformed and processed when use starts. That is, if roll up has not been performed, the necessary information can be rolled up and processed when use starts, but processing the roll up then takes a relatively long time.
  • the effect of drill down for each year occurs by obtaining year information from bibliographic data of all objective documents, dividing and calculating it according to each year, and outputting the result for each year.
  • if the information is divided, calculated and rolled up beforehand, the rolled up year information only needs to be read and displayed later when drilling down by year, which improves response speed.
  • a multi-dimension analysis operation result table that is generated by a multi-dimension analysis operation result table generating module 402 according to the present invention will be described in detail. Furthermore, the way the analysis module gains access to the multi-dimension analysis operation result table and what data the analysis module generates will be described.
  • the application date recited in a patent document has the form year-month-day. Therefore, if counting is performed in a daily unit and for each patent classification code, multi-dimension computation can be performed in a monthly unit, in a quarterly unit and in a yearly unit.
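A minimal sketch of deriving monthly, quarterly and yearly counts from daily counts follows. The input layout (a dictionary keyed by application date and classification code) and the function name `roll_up_time` are assumptions for illustration.

```python
from collections import Counter
from datetime import date


def roll_up_time(daily_counts):
    """daily_counts maps (application_date, ipc) to a document count.

    Returns three counters keyed by (year, month, ipc), (year, quarter, ipc)
    and (year, ipc).
    """
    monthly, quarterly, yearly = Counter(), Counter(), Counter()
    for (day, ipc), n in daily_counts.items():
        quarter = (day.month - 1) // 3 + 1  # months 1-3 -> 1, 4-6 -> 2, ...
        monthly[(day.year, day.month, ipc)] += n
        quarterly[(day.year, quarter, ipc)] += n
        yearly[(day.year, ipc)] += n
    return monthly, quarterly, yearly


daily = {
    (date(2005, 1, 3), "H04B 7/02"): 2,
    (date(2005, 2, 10), "H04B 7/02"): 1,
    (date(2005, 7, 1), "H04B 7/02"): 4,
}
monthly, quarterly, yearly = roll_up_time(daily)
```

Because each coarser unit is a sum of finer units, the daily counts only need to be produced once.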
  • Table 33 shows a part of an embodiment of a table after multi-dimension computation is performed.
  • Data provided in the following Table 33 refer to data generated by the multi-dimension analysis operation result table generating module 402, which generates the data by counting the number of patent documents beforehand per applicant, per year and per layer of the multi-stage IPC, based on the applicant, IPC and yearly date information in a patent information table rebuilt with the Star Schema.
  • Data such as Table 33 can be stored in any format such as table/view/materialized view or the like, and data included in a stored table can be extracted by using a proper query.
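Storing such data and extracting it with a query can be sketched with Python's built-in `sqlite3` module. The table name, column names and rows below are hypothetical, and because SQLite has no materialized views, an ordinary view stands in for that option.

```python
import sqlite3


def build_store(rows):
    """Create an in-memory result table plus a view of per-code totals."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE result_table "
        "(app_name TEXT, ipc TEXT, year INTEGER, n INTEGER)"
    )
    conn.executemany("INSERT INTO result_table VALUES (?, ?, ?, ?)", rows)
    conn.execute(
        "CREATE VIEW totals_by_ipc AS "
        "SELECT app_name, ipc, SUM(n) AS total "
        "FROM result_table GROUP BY app_name, ipc"
    )
    return conn


def total_for(conn, app_name, ipc):
    """Read one applicant/code total from the view; 0 if nothing is stored."""
    row = conn.execute(
        "SELECT total FROM totals_by_ipc WHERE app_name = ? AND ipc = ?",
        (app_name, ipc),
    ).fetchone()
    return row[0] if row else 0


# Illustrative rows only: (applicant, code, year, count).
conn = build_store([
    ("A", "H04B 7/02", 2004, 3),
    ("A", "H04B 7/02", 2005, 4),
    ("A", "H04B 7/04", 2005, 2),
])
```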
  • one example takes the multi-dimension computed number of applications of applicant "A" for each year and for multiple application IPCs at the level of the IPC 1-dot subgroup (hereinafter referred to as "C5 level"). It is also preferable to generate the number of applications per applicant and per year.
  • the multi-dimension analysis operation result table generating module 402 needs to perform multi-dimension computation for information corresponding to the lower patent classification codes of a patent classification code that cannot be reached with wild cards (*, ?, etc.) or the like, with reference to patent classification code DBs such as IPC and the like.
  • IPC patent classification code
  • Table 33 [Table 33]
  • AppName refers to the name of an applicant
  • a number after C refers to the node layer counted from the root node of IPC (C1 is a section, C2 is a class, C3 is a subclass, C4 is a main group, C5 is a 1-dot subgroup, C6 is a 2-dot subgroup, and C7 is a 3-dot subgroup.
  • C8 through C20 can be assigned. Typically, up to C15 will be enough), and 01, 02, etc. refer to years. Years before 2000 also exist, but their values are omitted. Assuming the present time is 2007, each cell holds a numerical value based on the application dates of documents published until now.
  • a value entered into each cell of the sum field is the sum of the values for each year
  • a gray period represents the sum of the values for each year in the unit period running from the year containing the date one year and six months before the present date (normally, the laid-open publication period after an application date) until the present date.
  • GID is used to show the result that multi-dimension computation has been performed up to which level.
  • H04B 7/02 of GID 7 shows the result that multiple-dimension computation up to itself (C5) has been performed
  • H04B 7/02 of GID 3 which is one step lower than that, shows the result that multiple-dimension computation up to C6 level (except itself) has been performed.
  • 24 is obtained from GID 1, and 24 refers to the number of patent documents including a patent classification code of H04B 7/02. The GID value can be written in any way (it is arbitrarily defined), but the present specification writes it as a series of the form 2^n - 1.
  • GID refers to the steps of roll up, wherein GID 0 indicates roll up to C8 level, GID 1 indicates roll up to C7 level, GID 3 indicates roll up to C6 level, GID 7 indicates roll up to C5 level, GID 15 indicates roll up to C4 level, GID 31 indicates roll up to C3 level, and GID 63 indicates roll up to C2 level.
  • GID 0 means roll up to C8 level and GID 1 means roll up to C7 level. (The above example does not have C8, and therefore the value of roll up to C7 equals that of roll up to C8.)
  • the GID indicates a code that means roll up to the corresponding level.
  • GID 15 indicates C4 level. This means that the line holds information rolled up to a main group in IPC, to 0 dot in USPC, to 00 level in FT, or to a main group in ECLA (roll up of the code itself and all its lower patent classification codes). Therefore, to see information at any level, specify the GID corresponding to that level, and the information multi-dimension computed up to that level can be seen.
  • GID is an abbreviated term meaning "roll up level code". Above and below, the roll up level code and GID are used with the same meaning.
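The listed pairs (GID 0 for C8, 1 for C7, 3 for C6, 7 for C5, 15 for C4, 31 for C3, 63 for C2) all fit the pattern 2^n - 1, where n is the number of levels above C8. The sketch below encodes that relationship. Treating C8 as the deepest level follows the example in the text, while the helper names and the extension of the pattern to C1 are assumptions.

```python
DEEPEST_LEVEL = 8  # assumption from the example: GID 0 is roll up to C8


def gid_for_level(level):
    """Roll up level code for roll up to level C<level>."""
    if not 1 <= level <= DEEPEST_LEVEL:
        raise ValueError("level must be between 1 and DEEPEST_LEVEL")
    return 2 ** (DEEPEST_LEVEL - level) - 1


def level_for_gid(gid):
    """Inverse mapping; rejects values that are not of the form 2**n - 1."""
    if gid < 0:
        raise ValueError("gid must be non-negative")
    n = (gid + 1).bit_length() - 1
    if 2 ** n - 1 != gid or n >= DEEPEST_LEVEL:
        raise ValueError("gid is not a valid roll up level code")
    return DEEPEST_LEVEL - n
```

For example, `gid_for_level(5)` gives 7 and `level_for_gid(15)` gives 4, matching the C5 and C4 entries above.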
  • 00 level means a level that consists of two alphabetic letters without any numeric digit such as AA, etc.
  • a "supercategory" indicates that USPC or FT has lots of items in a class and a theme code level, respectively, and therefore the items have been grouped.
  • each patent classification code in each line has its immediate higher patent classification code on the left.
  • the multi-dimension analysis operation result table generating module according to the present invention generates multi-dimension computed numerical value per each IPC symbol as data provided in Table 33. This is generated with respect to all patent classification codes included in each patent document group of all patent document groups of applicant "A" in each nation.
  • data provided in Table 33 can be generated regarding all application documents, or it can be generated regarding only all registered documents.
  • the multi-dimension analysis operation result table generating module 402 generates a multi-dimension analysis operation result table such as Table 33 for each document set having a predefined attribute.
  • the multi-dimension analysis operation result table generating module 402 performs the steps of 1) obtaining a key value (document unique information) characterizing a patent document, such as the number of a patent document included in a patent group having the predefined attribute, 2) extracting the key values characterizing the patent documents from the FACT table of a table rebuilt with the Star Schema, and 3) generating a multi-dimension analysis operation result table regarding at least one predefined analysis theme with respect to only the patent documents corresponding to the extracted key values.
  • After performing steps 1) and 2), the multi-dimension analysis operation result table generating module 402 generates the to-be-processed patent document set.
  • the FACT table includes FACT information for entire document sets.
  • the FACT information includes document unique information, and further includes many kinds of bibliographic data information.
  • the document unique information includes application number or document unique code or document series code.
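Steps 1) to 3) can be sketched as follows, with a list of dictionaries standing in for the FACT table. The field names, the document ids and the choice of counting as the analysis are all hypothetical.

```python
from collections import Counter

# Hypothetical FACT rows; "doc_id" plays the role of document unique information.
FACT = [
    {"doc_id": "doc-1", "applicant": "A", "ipc": "H04B 7/06", "year": 2005},
    {"doc_id": "doc-2", "applicant": "A", "ipc": "H04B 7/06", "year": 2005},
    {"doc_id": "doc-3", "applicant": "B", "ipc": "H04B 7/04", "year": 2005},
    {"doc_id": "doc-4", "applicant": "A", "ipc": "H04B 7/04", "year": 2004},
]


def keys_with_attribute(fact_rows, **attrs):
    """Step 1: key values of the documents sharing the predefined attribute(s)."""
    return {
        r["doc_id"]
        for r in fact_rows
        if all(r.get(k) == v for k, v in attrs.items())
    }


def extract_rows(fact_rows, keys):
    """Step 2: keep only the FACT rows whose key value was obtained."""
    return [r for r in fact_rows if r["doc_id"] in keys]


def result_table(rows, dims=("ipc", "year")):
    """Step 3: count documents per combination of the chosen dimensions."""
    return Counter(tuple(r[d] for d in dims) for r in rows)


keys = keys_with_attribute(FACT, applicant="A")
table = result_table(extract_rows(FACT, keys))
```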
  • the multi-dimension analysis operation result table generating module 402 generates at least one multi-dimension analysis operation result table regarding at least one predefined theme with respect to all documents in a predefined analysis theme, in each nation or in entire nations.
  • the document set can be any document set which shares at least one predefinable attribute.
  • Examples of such document sets include 1) a patent document set generated by specifying a patent document set with a specific patent classification code on a specific patent classification code system in a specific nation DB, 2) a patent document set generated by specifying a patent document set with the name of a specific applicant in a specific nation DB, 3) a patent document set generated by specifying a patent document set with the name of a specific inventor included as an inventor in a patent document of a specific applicant (that is, including the name of an applicant and the name of an inventor) in a specific nation DB, 4) a patent document set generated by specifying a patent document set with the name of a specific agent in a specific nation DB, 5) a patent document set generated by specifying a patent document set with the name of a specific applicant and a specific patent classification code on a specific patent classification code system in a specific nation DB, 6) a patent document
  • any such document set may be generated just with respect to family information regarding nations other than a first nation.
  • any such document set may be a document set consisting of only reissued patent documents.
  • the document set mentioned in this paragraph is exemplary. Therefore, for these exemplary document sets, the multi-dimension analysis operation result table generating module 402 desirably generates at least one multi-dimension analysis operation result table regarding at least one predefined theme with respect to the above documents according to a predefined analysis theme.
  • the multi-dimension analysis operation result table generating module 402 can generate a multi-dimension analysis operation result table such as Table 33 regarding any document set generated by a user.
  • the multi-dimension analysis operation result table generating module 402 performs the steps of 1) obtaining a key value (document unique information) characterizing a patent document, such as the number of a patent document included in any patent group generated by a user, 2) extracting the key values characterizing the patent documents from the FACT table of a table rebuilt with the Star Schema, and 3) generating a multi-dimension analysis operation result table regarding at least one predefined analysis theme with respect to only the patent documents corresponding to the extracted key values.
  • a processing method of the total amount data multi-dimension analysis operation result table generating module 402. To generate a multi-dimension analysis operation result table such as Table 33, the multi-dimension analysis operation result table generating module 402 performs the following steps on the predefined/obtained document set (for example, the set of all documents applied for or registered in a specific nation).
  • a key value (document unique information) characterizing a patent document, such as the number of a patent document included in a patent group having the predefined attribute, is obtained; only the key values characterizing the patent documents are extracted from the FACT table of a table rebuilt with the Star Schema; and then a command set for at least one predefined analysis theme-related multi-dimension analysis basic table is obtained regarding only the patent documents corresponding to the extracted key values.
  • This command set can be script.
  • an analysis theme-related multi-dimension computation execution command set with respect to the multi-dimension computation basic table can be obtained simultaneously.
  • an analysis theme-related command set to generate at least one predefined analysis theme-related multi-dimension operation result table is obtained without any other procedures. That is, in this step, the to-be-processed patent document set on which the multi-dimension analysis operation result table generating module 402 performs a predefined processing is determined, and a command set for each analysis theme to perform the predefined processing is determined. Secondly, the multi-dimension computation basic table is generated by executing the command set for each analysis theme with respect to each separate document of the to-be-processed patent document set.
  • the multi-dimension computation basic table includes information on at least one dimension, and each record needs to have basic data for each dimension to perform multi-dimension computation.
  • the basic data include information on existence/non-existence (1 or null) (for example, whether an application/registration has or has not occurred) and many kinds of counting information (one or more pieces of the information mentioned as counting information in this specification, for example, the number of claims, etc.).
  • the dimension includes one or more selected from the following dimensions, or one or more complex dimensions, that is, combinations of the following dimensions.
  • the dimensions include 1) a patent classification code dimension for the kinds of each patent classification code, 2) a date dimension, 3) a position dimension (a nation, a region, etc.), 4) a subject's attribute dimension (a dimension distinguishable in the scale, the attribute, etc. of more than one selected from an applicant, an inventor or an agent (for example, the applicant is divided as a corporation, a university, a laboratory, an individual and the others, the corporation is divided as a multinational corporation, a large enterprise, small and medium enterprises, etc.) can be obtained from subject master DB 204 with reference to the attribute of each applicant.).
  • a field must be provided in the multi-dimension computation basic table to input the bibliographic data needed for each dimension.
  • the patent classification code dimension includes a field for each level in order to accommodate all higher or lower patent classification codes on a patent classification code system, such as from a section to an n-dot subgroup with IPC, and (from a supercategory) from a class to an n-dot level with USPC, etc. Also, for the time dimension, it is better that a field exists in the smallest unit needed for analysis. In the case of patent analysis, an analysis down to a monthly field will be enough when a daily or weekly unit analysis is not needed. Also, if an applicant attribute dimension exists, a field corresponding to each of a corporation (a multinational corporation, a large enterprise, small and medium enterprises), a university, a laboratory, an individual and the others is needed.
  • a period and applicant attribute may be input like the following.
  • Data provided in the following Table 36 result from performing a partial multi-dimension computation by year or by applicant attribute, which can be processed at the time of multi-dimension computation or generated beforehand. [Table 36]
  • a method of generating such data is provided.
  • a patent classification code included in a document is queried against patent classification code master DB 203, all higher patent classification codes of the patent classification code are obtained, and all the higher patent classification codes obtained are input according to each level. For example, if IPC is H04B 7/06, H04B 7/06 is input to the 3-dot subgroup (C7) level, H04B 7/04 to the 2-dot subgroup (C6) level, H04B 7/02 to the 1-dot subgroup (C5) level, H04B 7/00 to the main group (C4) level, H04B to the subclass (C3) level, H04 to the class (C2) level, and H to the section (C1) level.
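Filling the per-level fields for one code can be sketched as below. The section, class, subclass and main group are read from the symbol itself, while the dot-level chain comes from a lookup. The `PARENT` map is a hypothetical stand-in for patent classification code master DB 203 and holds only the codes from the example.

```python
import re

# Hypothetical stand-in for master DB 203: immediately higher code of a subgroup.
PARENT = {
    "H04B 7/06": "H04B 7/04",
    "H04B 7/04": "H04B 7/02",
    "H04B 7/02": "H04B 7/00",
}


def level_fields(ipc):
    """Return the C1..Cn field values for an IPC symbol such as 'H04B 7/06'."""
    m = re.fullmatch(r"([A-H])(\d{2})([A-Z]) (\d+)/(\d+)", ipc)
    if m is None:
        raise ValueError(f"unrecognised IPC symbol: {ipc!r}")
    section, cls, sub, group, _ = m.groups()
    fields = {
        "C1": section,
        "C2": section + cls,
        "C3": section + cls + sub,
        "C4": f"{section}{cls}{sub} {group}/00",
    }
    # Walk up from the given code to the main group, collecting the dot levels.
    chain = []
    code = ipc
    while code != fields["C4"]:
        chain.append(code)
        code = PARENT[code]  # KeyError if the stand-in lacks the code
    # The highest collected code is the 1-dot subgroup (C5), and so on down.
    for depth, c in enumerate(reversed(chain), start=5):
        fields[f"C{depth}"] = c
    return fields
```

For H04B 7/06 this yields the seven values listed in the example, from H at C1 down to H04B 7/06 at C7.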
  • a value needed for a time dimension is input with respect to a necessary standard date (for example, an application date) among many kinds of date information such as an application date, a registration date, etc. If an application date is the standard date (for example, Jan. 3, 2005) and the time dimension is divided into a monthly dimension, a quarterly dimension and a yearly dimension, Jan. 1, 2005 is input. 3) other bibliographic data are input to the other dimensions. Then, 1 is input to a large corporation.
  • a field value such as the position of an applicant is filled by using address information, the number of family of a patent document is queried by inquiring of the patent document master DB 202, which category a queried value belongs to is determined, and 1 is recorded in that category.
  • a range of the number of claims is processed like the number of family.
  • if one patent document has two or more kinds of patent classification codes (for example, IPC and USPC),
  • a multi-dimension computation basic table can be generated for each kind, or the two or more kinds of patent classification codes can be processed within one table.
  • if a document includes multiple patent classification codes of the same kind, it is desirable to generate an individual record for each patent classification code.
  • if there are two or more joint applicants, it is desirable to generate a record for each applicant, each carrying the same information.
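Generating individual records can be sketched as follows. Producing one record per pair of classification code and applicant is an assumption about how the two rules combine, and the field names are hypothetical.

```python
from itertools import product


def expand_records(doc):
    """Return one basic-table record per (classification code, applicant) pair."""
    return [
        {"doc_id": doc["doc_id"], "ipc": ipc, "applicant": applicant, "exists": 1}
        for ipc, applicant in product(doc["ipcs"], doc["applicants"])
    ]


# Illustrative document with two codes of the same kind and two joint applicants.
doc = {
    "doc_id": "doc-1",
    "ipcs": ["H04B 7/06", "H04B 7/08"],
    "applicants": ["A", "B"],
}
records = expand_records(doc)
```

Because one document now yields several records, any later count of documents would need to avoid counting the same `doc_id` more than once per cell.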
  • Table 38 shows the result of performing roll up computation regarding up to a yearly dimension of time dimension, with respect to IPC dimension. (For convenient recitation, roll up computation regarding documents of the year of 2004 in a quarter year-year unit was indicated). [Table 38]
  • the method of performing multiple-dimension computation is divided into a method of performing roll up computation and a method of performing cube computation.
  • the roll up computation and the cube computation respectively, will be described with respect to a dimension such as IPC, a date and an applicant.
  • when an IPC dimension, a date dimension and an applicant dimension exist, the roll up computation proceeds in one direction selected from the 3 dimensions, but the cube computation proceeds in 3P3 directions (the number of permutations occurring when 3 of 3 are selected, here 6).
  • the roll up computation proceeds in one direction such as 1->2->3, but the cube computation proceeds in all six directions, including 1->3->2, 2->1->3, 2->3->1, 3->1->2 and 3->2->1 as well as 1->2->3, or in one or more directions selected from these.
  • a roll up computation result in an applicant unit occurs
  • a roll up computation result in only IPC is needed or if a roll up computation result in only IPC and date dimension like Table 38 is needed
  • a roll up computation is performed by designating a desired roll up direction only with respect to one or more selected dimensions.
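The difference between one roll up direction and the six cube directions can be sketched with `itertools.permutations`. Reading a direction such as 1->2->3 as a sequence of progressively coarser groupings (all three dimensions, then the first two, then the first, then the grand total) is an assumption borrowed from the usual meaning of roll up in SQL, and the function names are hypothetical.

```python
from itertools import permutations


def rollup_levels(order):
    """Grouping levels for one roll up direction, from full detail to grand total."""
    return [tuple(order[:k]) for k in range(len(order), -1, -1)]


def cube_directions(dims):
    """Every ordering of the dimensions; 3 dimensions give 3P3 = 6 directions."""
    return list(permutations(dims))


dims = ("ipc", "date", "applicant")
one_direction = rollup_levels(dims)
all_directions = cube_directions(dims)
```

Designating a desired direction, as described above, corresponds to calling `rollup_levels` on just the selected dimensions in the selected order.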
  • the multi-dimension analysis result table generating module 402 obtains application date information and patent classification code information from a given patent document, and extracts the higher patent classification codes of the obtained patent classification code with reference to patent classification code master DB 203, which includes data of a patent classification code system or a table rebuilt with the Star Schema. Data of the multi-dimension analysis operation result table including information such as the above table are completed using the extracted higher patent classification codes and the year information of the application date.
  • if one patent document includes two or more kinds of patent classification codes (for example, US documents have IPC and USPC), each kind of patent classification code is processed independently.
  • one patent document includes two or more patent classification codes of one kind
  • for 3), if a numerical value has a decimal point, rounding off the number is a proper way to write it.
  • which policy to take out of the above 1), 2) and 3) is optional.
  • another policy besides 1), 2) and 3) can also be selected.
  • if method 2) is taken, the number of lines of the multi-dimension analysis operation result table increases, since at least one new data value is generated for each patent classification code (that is, roll up computation for sub IPC is needed).
  • if method 3) is taken, the numerical value can include a decimal point.
  • the table is an index regarding a quantity such as the number of applications or the number of registrations of patent indices.
  • Data processing such as roll up/drill down including lower patent classification codes according to the present invention can be used for many kinds of indices such as an occupying index, a concentration index and an activity index.
  • the multi-dimension analysis operation result table is generated using an analysis theme-related multi-dimension computation basic table.
  • the analysis theme-related multi-dimension computation basic table is not strictly necessary: if a table rebuilt with a star schema structure exists, the multi-dimension analysis result table generating module 402 according to the present invention can generate the multi-dimension analysis operation result table from it. The basic table is nevertheless useful because it can reduce the calculation amount/information processing amount of the multi-dimension analysis result table generating module 402, and one analysis theme-related multi-dimension computation basic table can be used in generating multiple tables.
  • the multi-dimension analysis result table generating module 402 can perform the multi-dimension computation by obtaining necessary information from a table rebuilt with a star schema structure and processing it if the analysis theme-related multi-dimension computation basic table does not exist.
  • the multi-dimension analysis result table generating module 402 can read necessary data not from a table rebuilt with a star schema structure, but from the patent document master DB 202, the patent classification code DB 203 and/or the subject master DB 204, and generate the analysis theme-related multi-dimension computation basic table.
  • a program (script or the like) generating the analysis theme-related multi-dimension computation basic table can be relatively less complicated, and the reusability of the entire program or of each module forming that program can be relatively lower.
  • the analysis theme-related multi-dimension computation basic table stores information on IPC and all its higher IPC for each application number, wherein the multi-dimension analysis result table generating module 402 according to the present invention finds IPC included in the application number, queries of the patent classification code master DB 203 or all other higher patent classification code table, etc., obtains information on all higher IPCs of the found IPC and generates the analysis theme-related multi-dimension computation basic table based on this.
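A minimal sketch of how the higher IPC codes could be collected for each application number is given below. The dictionary `PARENT_OF` is a hypothetical stand-in for a lookup against the patent classification code master DB 203, the application number "APP-1" is a placeholder, and de-duplicating rows per application is an assumption rather than something the text prescribes.

```python
# Hypothetical excerpt of a classification hierarchy (child code -> parent code).
PARENT_OF = {
    "H04B 7/02": "H04B 7/00",  # 1 dot subgroup -> main group
    "H04B 7/00": "H04B",       # main group -> subclass
    "H04B": "H04",             # subclass -> class
    "H04": "H",                # class -> section
}


def higher_codes(code, parent_of=PARENT_OF):
    """Return all higher codes of `code`, nearest level first."""
    chain = []
    while code in parent_of:
        code = parent_of[code]
        chain.append(code)
    return chain


def build_basic_table(doc_codes, parent_of=PARENT_OF):
    """Build (application number, code) rows covering each code and all its higher codes."""
    rows = set()
    for app_no, codes in doc_codes.items():
        for code in codes:
            rows.add((app_no, code))
            for higher in higher_codes(code, parent_of):
                rows.add((app_no, higher))
    return sorted(rows)


for row in build_basic_table({"APP-1": ["H04B 7/02"]}):
    print(row)
```

Because every document carries rows for all of its higher codes, a count at any level (section, class, subclass, and so on) becomes a simple group-by on the code column.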
  • the multi-dimension analysis operation result table generating module 402 performs the following steps regarding the predefined/obtained document set (for example, all document sets applied for or registered in a specific nation).
  • a key value (document unique information) characterizing a patent document such as the application number, etc. of the patent document included in a document set having a predefined attribute is obtained and stored.
  • an analysis theme-related multi-dimension operation result table such as the following Table 38
  • at least one data of material data of Table 35 through Table 37 for each patent document is obtained and generated from the patent document master DB 202, the patent classification code master DB 203 and/or the subject master DB 204, the generated data is combined, and the combined data is stored in a memory.
  • the data stored in a memory can be the same data as the data stored in the multi-dimension computation basic table.
  • data stored in the memory includes information on at least one dimension, and the each record has to include each dimension basic data for performing multi-dimension computation.
  • the basic data includes information on existence/non-existence (1 or null) (for example, an application/a registration has occurred or has not occurred) and many kinds of counting information (more than one information mentioned as counting information in this specification, for example, the number of claims, etc.).
  • the dimension includes more than one selected from the following dimensions or more than one of a complex dimension, that is, a combination of the following dimensions.
  • the dimensions include 1) a patent classification code dimension for the kinds of each patent classification code, 2) a date dimension, 3) a position dimension (a nation, a region, etc.), 4) a subject's attribute dimension (a dimension distinguishable in the scale, the attribute, etc. of more than one selected from an applicant, an inventor or an agent (for example, the applicant is divided as a corporation, a university, a laboratory, an individual and the others, the corporation is divided as a multinational corporation, a large enterprise, small and medium enterprises, etc.) can be obtained from subject master DB 204 with reference to the attribute of each applicant.).
  • the multi-dimension operation result table generating module 402 performs predefined analysis theme-related multi-dimension computation on data stored in the memory by combining at least one information of Table 35 through Table 44 for each patent document and generates an analysis theme-related multi-dimension analysis operation.
  • IPC LEVEL RANKING INFORMATION FOR AN INDIVIDUAL APPLICANT When a multi-dimension analysis operation result table such as Table 38 is given, the analysis module can generate the following analysis result.
  • the number of applications/the number of registrations-based ranking information for each level of patent classification codes is generated with respect to the obtained applicant.
  • the ranking information is generated by comparing count values of the multi-dimension computed application or registration document with respect to each level of at least one patent classification codes of the obtained applicant.
  • a numerous-filed/numerous-registered ranking for each patent classification code in the 1 dot subgroup level of applicant "A" can be obtained by comparing a counting value of the number of multi-dimension computed documents in H04B 7/02 of the 1 dot subgroup level (C5 level) of applicant "A" and a counting value of the number of multi-dimension computed documents in another patent classification code of the 1 dot subgroup level of applicant "A" from among the multi-dimension analysis operation result table data.
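The comparison of counting values within one level can be sketched as a simple sort. The row layout `(applicant, level, code, count)`, the function name `rank_codes`, the counts, and every code other than H04B 7/02 are assumptions made for illustration; ties are broken alphabetically here, which the text does not specify.

```python
def rank_codes(rows, applicant, level):
    """Rank the codes of one applicant at one level by descending document count."""
    selected = [
        (code, count)
        for (app, lvl, code, count) in rows
        if app == applicant and lvl == level
    ]
    # Larger counts first; ties broken by code (an assumption).
    selected.sort(key=lambda item: (-item[1], item[0]))
    return [(rank, code, count) for rank, (code, count) in enumerate(selected, start=1)]


# Invented counts; the codes are treated as C5 (1 dot subgroup) entries for illustration.
rows = [
    ("A", "C5", "H04B 7/02", 12),
    ("A", "C5", "H04B 7/14", 30),
    ("A", "C5", "H04B 7/24", 7),
    ("B", "C5", "H04B 7/02", 99),
]
for entry in rank_codes(rows, "A", "C5"):
    print(entry)
```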
  • Table 39 provides a multi-dimension analysis operation result table which has counted the number of applications (or registrations) for each applicant, each IPC level and each year. [Table 39]
  • the multi-dimension analysis operation result table has been generated with respect to each applicant of all applicants in a specific nation. For example, when a patent classification code and an application date included in a specific document of applicant "C" are provided, a counting value of a specific cell (a patent classification code, an applicant (AppName)) corresponding to the above Table is created with reference to a patent classification code system and by using the patent classification code and the application date data of the patent document or a number can increase. If a multi-dimension operation result table for all applicants in a specific nation unit such as the above Table is generated, the analysis module can generate the following analysis result from the multi-dimension analysis operation result table.
  • a yearly counting value in this level can be generated.
  • the value of the "recent" field is a counting value based on patent documents dated from the year containing the date one year and six months (18 months) before a predefined basis date (for example, the inquiry date of the multi-dimension analysis operation result table) up to the inquiry date of the multi-dimension analysis operation result table. (... refers to the corresponding value.)
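One possible reading of the "recent" window is sketched below: go back 18 months from the basis date, take the calendar year that date falls in, and count documents from January 1 of that year up to the inquiry date. Starting the window on January 1 is an interpretation, and the function names are hypothetical.

```python
from datetime import date


def recent_window(basis, inquiry):
    """Return (start, end) of the 'recent' period under the January 1 interpretation."""
    # Month index of the basis date, shifted back by 18 months.
    month_index = basis.year * 12 + (basis.month - 1) - 18
    start_year = month_index // 12
    return date(start_year, 1, 1), inquiry


def recent_count(document_dates, basis, inquiry):
    """Count documents whose date falls inside the 'recent' period."""
    start, end = recent_window(basis, inquiry)
    return sum(1 for d in document_dates if start <= d <= end)


basis = inquiry = date(2008, 12, 12)
print(recent_window(basis, inquiry))
print(recent_count([date(2006, 12, 31), date(2007, 1, 1), date(2008, 6, 1)], basis, inquiry))
```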
  • Table 40 shows one example. [Table 40]
  • a counting value in each cell is a value resulting from performing a multi-dimension computation on all values corresponding to its lower patent classification codes. Drill down can be performed continuously until its lower patent classification code does not exist any longer, and values needed in cells can be generated by entering the above condition until an additional drill down may not be performed. Such method has been described regarding IPC, but it is obvious that this also can be applied to USPC, FT, FI and ECLA.
  • the method of generating the multi-dimension computed multi-dimension analysis operation result table, the cell value calculating method and the calculated value indicating method during drill down have been described with respect to applications (or the total amount of registrations), but it is well known in the art that the multi-dimension analysis operation result table can also be generated with respect to an occupying index, a concentration index and/or an activity index.
  • an occupying index is defined as "a value obtained by dividing the number of the documents of an object document set having specific attributes by the number of the documents of an entire document set having specific attributes"
  • data similar to the above Table 39 can be generated with respect to the occupying index. That is, if an occupying index with respect to a specific patent code of a specific corporation "A" is obtained, an object document set will be a document set including a specific patent classification code of applicant "A" document set, and an entire document set will be an entire document set including a specific patent classification code.
  • documents including the patent classification code and its lower patent classification code should be included.
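The occupying index of an applicant for one patent classification code can be sketched as a ratio of two document counts. The sketch assumes each document record already lists its codes together with all of their higher codes, so that documents classified under a lower code are included automatically; the record layout and the function name are hypothetical.

```python
def occupying_index(docs, applicant, code):
    """Documents of `applicant` under `code` divided by all documents under `code`."""
    entire_set = [d for d in docs if code in d["codes"]]
    if not entire_set:
        return 0.0  # no documents carry this code at all
    object_set = [d for d in entire_set if d["applicant"] == applicant]
    return len(object_set) / len(entire_set)


# Invented records; "codes" holds each code plus its higher codes.
docs = [
    {"applicant": "A", "codes": {"H04B 7/02", "H04B 7/00", "H04B"}},
    {"applicant": "A", "codes": {"H04B 7/00", "H04B"}},
    {"applicant": "B", "codes": {"H04B 7/02", "H04B 7/00", "H04B"}},
    {"applicant": "B", "codes": {"H04N"}},
]
print(occupying_index(docs, "A", "H04B"))
print(occupying_index(docs, "A", "H04B 7/02"))
```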
  • the multi-dimension analysis operation result table generating module 402 can store a computation result regarding a patent index which can be defined by other equations as well as an occupying index using the same method as the total amount analysis.
  • the technical innovation activity concentration degree analysis indexes include 1) revealed technological advantage, 2) revealed patent advantage, 3) concentration ratio n (CRn), 4) Herfindahl index (HHI), etc.
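The text names CRn and the Herfindahl index without giving formulas. The sketch below uses their commonly cited definitions (CRn as the sum of the n largest shares, HHI as the sum of squared shares), which is an assumption about what the specification intends; the applicant names and counts are invented.

```python
def shares(counts):
    """Convert document counts per applicant into shares that sum to 1."""
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


def concentration_ratio(counts, n):
    """CRn: combined share of the n largest holders."""
    ordered = sorted(shares(counts).values(), reverse=True)
    return sum(ordered[:n])


def herfindahl_index(counts):
    """HHI: sum of squared shares (1.0 means a single holder)."""
    return sum(s * s for s in shares(counts).values())


counts = {"A": 50, "B": 30, "C": 20}
print(concentration_ratio(counts, 2))
print(herfindahl_index(counts))
```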
  • a technical level analysis index includes 1) patent count weighted by citations, 2) cites per patent (CPP), 3) patent impact index (PII), current impact index (CII), technology strength (TS), technology cycle time (TCT), science linkage (SL), average claims per patent, etc.
  • a cooperation relation and knowledge flow analysis index includes 1) the number of patents with co-applicants and with co-inventors,
  • the document set of applicant A can be obtained (determined) from the first nation or the second nation, IPC, etc. can be extracted from the document set, the number of corresponding documents of IPC in each step can be counted in the extracted IPC according to each step (all from a section to n dot subgroup), the total number of numerous applications/numerous registrations, concentration, the ranking of an activity field can be calculated by using the counted value according to each IPC level (ranking information generation according to each patent classification code level of patent index reference applicant A has been described above).
  • the high ranking field can be extracted by the calculated ranking, and a patent classification code of at least one extracted IPC, etc.
  • the above described scheme becomes a scheme for extracting the ranking of the poorest patent classification code of each step in at least one corresponding patent classification code about the given subject (for example, respective to an inventor, agent, etc, ranking according to each patent classification code level can be generated based on a document set including own name of the inventor, agent, etc. by the same scheme).
  • FIG. 19 shows a view, in which the reference ranking is generated according to each level of the patent classification code system, such as IPC, etc., according to each nation, such as Korea, U.S, Japan, Europe, etc, based on application or registration and according to each application respective to the number of applications/registrations, an occupancy ratio, and a concentration ratio and/or activity ratio.
  • H01L is in the first rank
  • H04N is in the second rank at the IPC sub-class level based on Korean application documents.
  • the document set may have a specific common characteristic (for example, an applicant characteristic, such as applicant A (it may be a plurality of applicants, as used herein), period limitation, such as the 5 recent years, etc., inventor limitation, such as inventor C, and/or a characteristic obtained by combining respective characteristics (inventor C of applicant A, etc.)).
  • an applicant characteristic such as applicant A (it may be a plurality of applicants, as used herein)
  • period limitation such as the 5 recent years, etc.
  • inventor limitation such as inventor C
  • the multi-dimensional analysis operation result table generating module 402 of the present invention can generate multi-dimensional analysis operation result table data by performing at least one of steps 1) to 7).
  • FIG. 19 is a view showing an embodiment of an analysis result generated when drill-down is continuously performed with respect to H01L.
  • an analysis information value only about an object, to which drill-down has been performed, is generated.
  • with AJAX technology, it is possible to generate only the numerical value for the part where a change (drill-down) has occurred, without reloading the entire page, and to provide that numerical value to the user rapidly.
  • the user has to be interpreted as a user computer 300 in view of the relation between the user and the system of the present invention. That is, in view of the system, 1) "something is provided to the user" actually means "something is transmitted to the user's computer," 2) "the user inputs something" actually means "a value input by the user is received by the user's computer," and designation, specification, and selection by the user, etc. actually mean receiving information, which has been designated, pre-specified, and selected by the user, from the user's computer. Therefore, in the specification of the present invention, it is appropriate to interpret expressions relating to the user as described above. Expressions based on the user are used only to make the descriptions more convenient and more easily understandable.
  • FIG. 19 is a view showing an embodiment of an analysis result generated when drill-down is performed to H01L 21/00 and other lower patent classification codes. It can be understood that an analysis result based on the total amount is obtained through drill-down.
  • the drill-down can be performed down to the lowest level of IPC, and it is possible to perform drill-down down to the lowest level as long as data exists therein. Also, this is equally applied to other patent classification codes.
  • FIG. 31 is a view showing an example of a technical field analysis utilizing the patent classification code of the present invention, and is a view showing an embodiment of an application total amount analysis including drill-down respective to IPC H04B based on entire U.S application documents and lower classification thereof. At this time, all pieces of data corresponding to a lower patent classification code, which have been rolled up, are included in the analysis data of each cell. Citation analysis-multi-dimensional analysis operation result table
  • In order to perform a citation analysis, the multi-dimensional analysis operation result table generating module 402 generates citation analysis multi-dimensional analysis operation result table data.
  • a procedure where the multi-dimensional analysis operation result table generating module 402 generates citation analysis multi-dimensional analysis operation result table data will be described with reference to U.S. patent application No. 09/802,847 (the applicant: Samsung Electronics Co., Ltd. (KR); the title: user request processing method and apparatus using upstream channel in interactive multimedia contents service; and application date: 2001.3.12).
  • the multi-dimensional analysis operation result table generating module 402 obtains citation parent-child data as described below from the bibliographical information of patent application No. 09/802,847 in relation to citation.
  • the child refers to the document number of a corresponding document itself, and the parent refers to the document number of another document cited by the corresponding document (at this time, the document number may be an application number, and may also be a registration number).
  • because the document number is a number that allows the corresponding document to be specified, and because a publication number or an application number may appear in the parent data, it is desirable to unify the document numbers into application numbers, which are common numbers that all documents have. However, unifying document numbers into registration numbers also poses no problem.
  • parent-child data is generated based on a unified application number from the obtained citation parent-child data as shown in Table 44.
  • Each application number is obtained from the registration number or the publication number of the parent (there is no problem if the number is already an application number; every piece of number information carries a document type, such as A (registration), A1 (publication), etc.).
  • In the U.S., either "application number" or "application serial number" is used, but this is only a difference in title, and the characteristics of the data indicated by either title are the same. Here, an application number is used as the unified number.
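The unification of document numbers into application numbers can be sketched as a lookup applied to both columns of the citation parent-child data. The mapping `to_application_number` and the parent numbers ("REG-0001", "PUB-0002", "APP-0001", "APP-0002") are placeholders, not data from any actual patent record; only the child application number 09/802,847 comes from the text.

```python
def unify(pairs, to_application_number):
    """Rewrite (child, parent) pairs so that every number is an application number."""
    unified = []
    for child, parent in pairs:
        unified_child = to_application_number.get(child, child)
        # A missing parent (no citation) stays null.
        unified_parent = None if parent is None else to_application_number.get(parent, parent)
        unified.append((unified_child, unified_parent))
    return unified


# Placeholder lookup from registration/publication numbers to application numbers.
to_application_number = {
    "REG-0001": "APP-0001",
    "PUB-0002": "APP-0002",
}
pairs = [
    ("09/802,847", "REG-0001"),
    ("09/802,847", "PUB-0002"),
    ("09/802,847", "APP-0003"),  # already an application number, left unchanged
]
for row in unify(pairs, to_application_number):
    print(row)
```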
  • Data of the table below refers to a unified number reference-based citation parent-child data.
  • nation columns of documents in child and parent columns show source nations, to which a corresponding document relates (is applied or registered). Such source nations may be various countries as well as U.S. Although the nation columns are omitted hereinafter in order to achieve convenience in description, it has to be understood that nation columns exist. Meanwhile, the notation form of the application number may be an arbitrary notation, such as "double digits + / + six digits" or "year +6 digits, " etc. [Table 44]
  • A result obtained through such data processing is shown in Table 45.
  • it is possible to allow the repetition of a parent (application number) within each document set (i.e., when document 1 and document 2, which are included in one document set, both include document 3 in their citation information, duplicates of document 3 can be allowed). That is, it is possible to generate unified number reference-based citation parent-child data for all individual documents included in a document set.
  • the most representative document set may be a US open application document set and a US registered patent document set. It is obvious that a document set generated by the user (which is output as a next search result after inputting a specific search formula) and a document set automatically generated by a computation system 1 performing the present invention can be the most representative document set. [Table 45]
  • Table 46 shows data, to which an application date and a registration date among the Bibliographical items corresponding to respective application numbers are added one by one.
  • it is possible to add count information of each application number (various pieces of count information, such as the number of claims, the number of drawings, the number of families, the number of inventors, the number of applicants, etc.) to the data shown in Table 46 according to each count field. It is desirable to generate one table for the items that correspond to each application number one to one.
  • the above described data can be generated as a field value, in which at least two pieces of data can correspond to each application number.
  • These may be an applicant, inventor, agent, patent classification code, etc.
  • the patent classification code will be described later.
  • all necessary field values can be input into one table.
  • applicants of patent application No. 09/309,895 shown in Table 47 are AT&T Corp. (New York, NY) and Sun Microsystems (Palo Alto, CA).
  • an application field has to be added to the same row of Table 47 below relating to the application number.
  • data processing based on time/date is performed as described below in consideration of roll-up.
  • for March 12, 2001, it is preferable to perform data processing after subdividing the date into March 12, 2001, March 2001, and the first quarter of 2001, for roll-up calculation.
  • due to a notation problem (numerous fields of various kinds have to be expressed on a screen with limited width), the subdivided data are omitted.
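Splitting one date into the keys needed for roll-up along the date dimension can be sketched as follows. The key names and string formats are assumptions; the example date March 12, 2001 is the one used in the text.

```python
from datetime import date


def date_rollup_keys(d):
    """Return the day, month, quarter and year keys under which a document is counted."""
    quarter = (d.month - 1) // 3 + 1
    return {
        "day": d.isoformat(),
        "month": f"{d.year}-{d.month:02d}",
        "quarter": f"{d.year}-Q{quarter}",
        "year": str(d.year),
    }


print(date_rollup_keys(date(2001, 3, 12)))
```

Storing all four keys with each record lets a monthly, quarterly or yearly count be produced by grouping on the matching key, without re-parsing the original date.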
  • for the patent classification code, the data processing will be described in view of lower classification code inclusion.
  • C3 refers to an IPC subclass level, and it has been described above that others follow the same organization.
  • the numeral 1 recorded in the field of 2001 means that there is one document in 2001, and the numeral 1 recorded in the field of March, 2001 means that there is one document in a corresponding month based on each month.
  • the above mentioned notation method is a typical notation method used for data construction in consideration of roll-up. Although both G06F 15/16 and H04N 7/16 have been included in 1 dot subgroup, if H04N 7/169 is included in the document with the application number of 09/802,847, Table 49 above would be the same as Table 50 below (this is arbitrarily introduced for describing the spirit of the present invention). [Table 50]
  • H04L 12/56 and H04L 12/28 of IPC, and 370/395 and 370/235 of USPC, have been allocated (the IPC may differ from the IPC given by the Korean patent office, but data is processed nation by nation, so U.S. data is preferably processed with the IPC given by the U.S. patent office).
  • one row is generated by each patent classification code. Therefore, although IPC code and USPC code may be generated in one table, it is preferred that they are not generated in one table. Particularly, in a case where multiple codes, such as multiple applicants and multiple inventors, exist, it is further preferred that the codes are not generated in one table. Essentially, this is because one child corresponds to several parents.
  • one piece of child data may correspond to K pieces of parent data, to m IPC codes and to n USPC codes, and each of the K parent documents may in turn have its own IPC codes and USPC codes. Therefore, if they are gathered into one table, a significantly large number of rows is required, and many pieces of data are repeated. This repetition occurs more often when the document set is large (e.g., the entire set of U.S. registered patents). Therefore, it is preferable to separate IPC codes and USPC codes from each other.
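The row growth can be made concrete with a small calculation. It assumes that a single combined table stores one row per combination of parent, IPC code and USPC code of a child (a cross product), and that separate tables store one row per parent, per IPC code and per USPC code; both are simplifying assumptions that ignore the parents' own codes, and the function names are hypothetical.

```python
def combined_rows(k_parents, m_ipc, n_uspc):
    """Rows for one child when every combination is stored in a single table."""
    return k_parents * m_ipc * n_uspc


def separate_rows(k_parents, m_ipc, n_uspc):
    """Rows for one child when parents, IPC codes and USPC codes use separate tables."""
    return k_parents + m_ipc + n_uspc


# Example: one child document with 10 parents, 2 IPC codes and 2 USPC codes.
print(combined_rows(10, 2, 2), separate_rows(10, 2, 2))
```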
  • table types such as 1) a table type where only patent classification codes of the child side of citation child-parent data have been processed, 2) a table type where only patent classification codes of the parent side of citation child-parent data have been processed, 3) a table type where the patent classification codes of both the child and parent sides of citation child-parent data have been processed.
  • with table type 3, there may be a problem of an increase in the number of rows, but there is an advantage in that the need for table joins is reduced.
  • the multi-dimensional analysis operation result table generating module 402 generates unit citation parent-child data according to each document included in a given/preset document set and obtains at least one of Bibliographical items of each document included in the child and/or parent, respective to each piece of unit citation parent-child data, thereby generating multi-dimensional analysis operation result table data having the obtained Bibliographical item as field content.
  • as the given/preset document set, there may be 1) an entire set of application documents or registration documents of a specific nation, 2) a document set based on at least one of specific IPC/USPC codes, 3) a document set according to each specific applicant, 4) a search-based generation document set, which can be generated by a specific search formula, etc., 5) a set of all application documents or all registration documents integrated over all nations, etc.
  • a method for generating such data is to input all application/registration document numbers into a child column and, for each application/registration document, to input the numbers of the parent documents cited by that document into a parent column. When there is no parent value, a null value is input. In that case, data exists only in the child column, and no parent document exists for the corresponding child document. In this way, all document numbers are input into the child column.
  • each of document 1, document 2, document 3, and document 4 includes one piece of citation information, and a citation order is in the order of document 1 -> document 2 -> document 3 -> document 4 (it means that document 2 cites document 1, document 3 cites document 2, and document 4 cites document 3).
  • document 1 exists in the parent column of document 2
  • document 3 exists in the child column
  • document 2 exists in the parent column.
  • document 4 exists in the parent column. That is, all documents 1 to 4 exist in the child column, and at least, documents 1 to 3 exist in the parent column.
  • document 1 and document 2 exist in the range to citation depth 2 of backward citation (a document set generated by citation information included in document 3) relating to document 3, and document 4 exists with forward citation depth 1.
  • backward citation a document set generated by citation information included in document 3
  • document 4 exists with forward citation depth 1.
  • because documents 1 to 4 all exist in the child column, it is possible to find a citation document with citation depth 1 in the parent column for any document; when the found citation document number is then looked up in the child column and the parent column of the matching row is read, a citation document with citation depth 2 can be found.
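The repeated lookup between the child column and the parent column can be sketched as follows, using the four-document chain from the text (document 2 cites document 1, document 3 cites document 2, document 4 cites document 3). The table layout as a list of `(child, parent)` tuples and the function names are assumptions.

```python
# Child-parent table: every document appears as a child; a null parent means no citation.
TABLE = [
    ("doc1", None),
    ("doc2", "doc1"),
    ("doc3", "doc2"),
    ("doc4", "doc3"),
]


def step(table, docs, direction):
    """Move one citation level backward (to parents) or forward (to children)."""
    if direction == "backward":
        return {p for c, p in table if c in docs and p is not None}
    return {c for c, p in table if p in docs}


def citations_at_depth(table, start, depth, direction="backward"):
    """Documents reached from `start` after exactly `depth` citation steps."""
    frontier = {start}
    for _ in range(depth):
        frontier = step(table, frontier, direction)
    return frontier


print(citations_at_depth(TABLE, "doc3", 1))             # backward depth 1
print(citations_at_depth(TABLE, "doc3", 2))             # backward depth 2
print(citations_at_depth(TABLE, "doc3", 1, "forward"))  # forward depth 1
```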
  • This will be described below with Table 54. [Table 54]
  • a characteristic of a document set may be an arbitrary characteristic and also may be a characteristic of 1) an applicant name, 2) inventor name, 3) IPC or USPC of each step, 4) nation, 5) agent, 6) term range (an application date/registration date range), 7) document state (under an application procedure, registration, rejection, etc.), or 8) combination of at least one among the above-mentioned items as a common item.
  • For example, assume that document set 3 is the set of all registered documents that are included in IPC H01L, whose applicant is Samsung Electronics, and that have been registered in the U.S. Document set 2, consisting of all citation documents with backward citation depth 1 relating to document set 3, may then be extracted. Meanwhile, it is possible to obtain document set 4, consisting of all cited documents with forward citation depth 1 relating to document set 3, and document set 1 with backward citation depth 2 relating to document set 3. At this time, document set 1, document set 2, and document set 4 can be analyzed.
  • each backward/forward numerously-citing document can be found as an object respective to the analysis. Meanwhile, in the case of 1), it is possible to display the total amount after classifying it according to each-year's/preset period, and it is also possible to know an increase/decrease ratio and an increase/decrease rate based on the various kinds of numbers.
  • FIG. 38 illustrates 4 types of to-be-analyzed document set relating to the citation analysis with respect to the reference document set.
  • 4 types of to-be-analyzed document set include
  • Type 1 is an entire backward citation document set cited by an individual reference document included in the reference document set.
  • the entire backward citation document set can be determined as union of every document corresponding to the citing information (reference cited) included in the respective reference documents (at this time, the duplication may be removed, but the duplication may be allowed in the union).
  • document #1 and document #2 include document a as the citing information
  • document a is cited two times, so it may be considered an important document from a counting point of view, and thus it is preferred that the duplication is allowed (an identical record is duplicated on the parent side of the multi-dimensional analysis operation result table; two records relating to parent a are present, resulting from documents #1 and #2).
  • Type 2 is an entire forward citation document set citing the individual reference document included in the reference document set.
  • the method of obtaining the entire forward citation document set includes 1) a method of searching for the document numbers citing each individual reference document and combining the found document numbers, and 2) a method of searching for the reference document number in the parent column of the child-parent table generated for the entire patent document set, reading from the child column the one or more child document numbers of the rows whose parent column holds that reference document number, and combining the found numbers.
  • Method 2 is more preferable.
  • Type 3 is an entire forward citation-relating reference document set.
  • the entire forward citation-relating reference document set is a document set including every reference document that had been cited by another document from among every reference document included in the reference document set.
  • the entire forward citation-relating reference document set includes the parent document of the document set of type 2).
  • Type 4 is an entire backward citation-relating reference document set.
  • the entire backward citation-relating reference document set includes the reference document including citing information from every reference document included in the reference document set.
  • the entire backward citation-relating reference document set includes the child patent document of the document set of type 1).
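The four to-be-analyzed document sets can be sketched from a child-parent table and a reference document set. Duplicates are kept for types 1 and 2 by using counters, following the preference for allowing duplication; the table contents, document labels other than #1, #2 and a, and the function name are invented for illustration.

```python
from collections import Counter


def analysis_sets(table, reference):
    """Derive the four citation-analysis document sets for a reference document set."""
    # Type 1: documents cited by reference documents (backward), duplicates counted.
    type1 = Counter(p for c, p in table if c in reference and p is not None)
    # Type 2: documents citing reference documents (forward), duplicates counted.
    type2 = Counter(c for c, p in table if p in reference)
    # Type 3: reference documents that have been cited by some other document.
    type3 = {p for c, p in table if p in reference}
    # Type 4: reference documents that themselves contain citing information.
    type4 = {c for c, p in table if c in reference and p is not None}
    return type1, type2, type3, type4


table = [("#1", "a"), ("#2", "a"), ("#2", "b"), ("x", "#1"), ("#3", None)]
reference = {"#1", "#2", "#3"}
for result in analysis_sets(table, reference):
    print(result)
```

Here document a is counted twice in type 1 because both #1 and #2 cite it, which mirrors the counting argument made for allowing duplicates.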
  • a topmost tab includes 1) citation analysis for the patent document entire set, 2) technology area-based citation thorough analysis, 3) inventor-based citation analysis, 4) multi-citation individual document-based citation analysis, and those are all related to the determination of the reference document set.
  • in 1) citation analysis for the patent document entire set, the patent document entire set serves as a reference document set determined by any one selected from all the document determining methods relating to the determination of the document set of the present invention.
  • the multi-dimensional analysis operation result table generating module 402 if the reference document set is determined, determines 1) the entire backward citation document set, 2) the entire forward citation document set, 3) the entire backward citation-relating reference document set, and/or 4) the entire forward citation-relating reference document set as the to-be-citation-analyzed document set corresponding to the determined reference document set, generates the multi-dimensional analysis operation result table described in the present specification with respect to any one document set of 1) to 4), and stores the generated table.
  • the analysis module of the present invention obtains the analysis result of the analysis subject-based multi-dimensional analysis operation result table according to the determined 1) the entire backward citation document set, 2) the entire forward citation document set, 3) the entire backward citation-relating reference document set, and/or 4) the entire forward citation-relating reference document set with a preset analysis formula.
  • the analysis module can conduct various analysis described in the present invention for any one or more of document sets 1) to 4), not for the multi-dimensional analysis operation result table.
  • this case has a problem in that it is impossible to obtain the multi-dimensional operation result or it requires much computation resources for obtaining the multi-dimensional operation result, so as not to be much recommended.
  • the analysis module accesses the multi-dimensional analysis operation result table generated according to document sets 1) to 4) and extracts the object citation analysis result with the preset analysis formula.
  • the object citation analysis result includes the followings.
  • The analysis module extracts, centering on the applicant, information on i) the applicant ranking, ii) the ranking in the unit of at least one level-based patent classification code in at least one patent classification code system for each applicant, iii) the ranking of multi-patent inventors, and iv) the document frequency reference ranking, together with the numerical value data of the respective rankings calculated by the roll-up operation along the time dimension or another dimension, and/or the document numbers corresponding to the respective rankings. That is, the analysis module can generate information on the document numbers needed to obtain the specific individual documents corresponding to the ranking values according to the various fields included in the bibliographical particulars, the values calculated by the roll-up operation along the time or another dimension, and/or the various values with respect to to-be-citation-analyzed document sets 1) to 4).
  • The analysis module extracts, centering on the patent classification code, information on i) the ranking in the unit of at least one level-based patent classification code in at least one patent classification code system, ii) the applicants belonging to the level-based patent classification code, iii) the ranking of the inventors belonging to the level-based patent classification code, and iv) the query document frequency reference ranking belonging to the level-based patent classification code, together with the numerical value data of the respective rankings calculated by the roll-up operation along the time dimension or another dimension, and/or the document numbers corresponding to the respective rankings.
  • The analysis module can generate information on the document numbers needed to obtain the specific individual documents corresponding to the ranking values according to the various fields included in the bibliographical particulars, the values of the rankings calculated by the roll-up operation along the time or another dimension, and/or the various values with respect to to-be-citation-analyzed document sets 1) to 4).
  • The analysis module extracts, centering on the applicant attribute (the kind of applicant (company, university, individual, etc.) and, in the case of a company, the financial attribute, company evaluation index, etc.), information on i) the kind of applicant, ii) the company size evaluation index, such as sales, and iii) the ranking according to company financial evaluation elements, such as the yearly average profit ratio, and/or the numerical value data of i) to iii) calculated by the roll-up operation along the time dimension or another dimension, and/or the document numbers corresponding to i) to iii).
  • The analysis module can generate information on the document numbers needed to obtain the specific individual documents corresponding to the ranking values according to the various fields included in the bibliographical particulars, the values calculated by the roll-up operation along the time or another dimension, and/or the various values with respect to to-be-citation-analyzed document sets 1) to 4).
  • If numerical value data is present in the first to third information above, the analysis module extracts variation values of that data, such as its increase/decrease rate and increase/decrease speed.
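The specification does not give formulas for the increase/decrease rate or the increase/decrease speed. The sketch below assumes one plausible reading: the rate is the year-over-year relative change of a count, and the speed is the change in that rate between consecutive years. The function name and the sample counts are hypothetical.

```python
from typing import Dict, Optional, Tuple

def variation(counts_by_year: Dict[int, int]) -> Tuple[Dict[int, Optional[float]], Dict[int, float]]:
    years = sorted(counts_by_year)
    # Assumed definition: rate = (current - previous) / previous.
    rate: Dict[int, Optional[float]] = {}
    for prev, cur in zip(years, years[1:]):
        base = counts_by_year[prev]
        rate[cur] = None if base == 0 else (counts_by_year[cur] - base) / base
    # Assumed definition: speed = difference between consecutive rates.
    speed: Dict[int, float] = {}
    for prev, cur in zip(years[1:], years[2:]):
        if rate[prev] is not None and rate[cur] is not None:
            speed[cur] = rate[cur] - rate[prev]
    return rate, speed

counts = {2004: 100, 2005: 150, 2006: 180}  # illustrative citation counts per year
rate, speed = variation(counts)
print(rate, speed)
```

A year whose preceding count is zero gets no rate here, since a relative change is undefined in that case; a real system would need its own policy for such cells.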
  • The applicant citation-based, to-be-citation-analyzed multi-dimensional analysis operation result table generated by the multi-dimensional analysis operation result table generating module 402 is generated mainly for the patent documents cited in the applicant's own documents.
  • FIG. 38 illustrates a year-based analysis result of the citation total amount for the to-be-analyzed document set according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set.
  • FIG. 38 illustrates that the total amount tab is selected.
  • The accompanying drawings show various tabs in the analysis screen according to the exemplary embodiment; a tab whose label is shown in bold is the selected tab.
  • The kinds of tabs basically include 1) the reference for the attribute of the document set used when determining the document set, such as the application document reference or the registration document reference, 2) the reference of the application/registration total amount, occupancy rate, concentration rate, activity rate, or other analysis index, 3) the reference relating to nation selection, such as Korea, the US, Japan, and Europe, 4) the reference relating to an object or a subject of the analysis, such as the total amount, applicant, inventor, technology, and individual document, 5) the reference for various numerical values, the increase/decrease rate of the numerical value, and the increase/decrease speed of the numerical value, and 6) the reference for selection of the patent classification code, such as IPC, USPC, FT, FI, and ECLA, and of the level of the patent classification code.
  • FIG. 39 illustrates a year-based analysis result for the multi-citing applicant according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set.
  • FIG. 40 illustrates a year-based analysis result for the multi-citing patent classification code (IPC main group level) according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set.
  • The IPCs can be drilled down, in which case the citation analysis result for the sub-patent classification codes of the drilled-down code is extracted and provided to the user, as illustrated in FIG. 41.
  • FIG. 41 illustrates a year-based analysis result reflecting the drill-down of the multi-citing patent classification code (IPC main group level) according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set.
  • FIG. 42 illustrates a year-based analysis result of the multi-citing inventor according to an exemplary embodiment of the present invention in which, when all the US patent applications in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set.
  • FIG. 43 illustrates a year-based analysis result of the most citing applicant with respect to the to-be-analyzed document set according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set.
  • FIG. 44 illustrates a year-based analysis result for the most citing inventor according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set.
  • FIG. 45 illustrates a year-based analysis result when the IPC is drilled down in the analysis of the most citing technology by IPC main group according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set.
  • FIG. 46 illustrates a year-based analysis result when the patent classification code is drilled down in the analysis of the most citing technology by USPC subclass (no dot, directly subordinate to the class) according to an exemplary embodiment of the present invention in which, when all the US patent registrations in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set.
  • FIG. 47 illustrates a total amount reference analysis result, and a chart of that result generated by a chart generating module 406-2 of a reporting module 406 of the present invention, with respect to the to-be-analyzed document set in which, when all the US patent registrations in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the entire forward citation document set serves as the to-be-analyzed document set.
  • The reporting module 406 of the present invention includes at least one of a table generating module 406-1 for generating a table, a chart generating module 406-2 for generating a chart, a graph generating module 406-3 for generating a graph, and a report generating module 406-4 for generating a report.
  • FIG. 48 illustrates the analysis result of the year-based citation total amount of the often-cited inventors, together with a document list generated by a simplified analysis module 407, with respect to the to-be-analyzed document set in which, when all the US patent registrations in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the forward citation document set serves as the to-be-analyzed document set. When a specific number shown in the analysis result is clicked, the simplified analysis module 407 provides the simplified analysis result for the document set corresponding to that number (document list, the number of applications/registrations of the most-filing applicant according to the year, the year-based number of applications/registrations according to the most-filing inventor, and the number of applications/registrations of the most-filing technical field (IPC, USPC, FT) according to the year, including drill-down).
  • FIG. 49 illustrates that the simplified analysis module 407 provides a drill-down function by the most-filing technology field (IPC, USPC, FT).
  • FIG. 50 illustrates a year-based analysis result of the citation total amount with respect to the to-be-analyzed document set, in which, when the documents of the numerously-filed IPC subclasses among all the US patent applications in the name of Samsung Electronics among all the applicants in the DB of the patent information system 1 are considered as the reference document set, the backward citation document set serves as the to-be-analyzed document set.
  • the multi-dimensional analysis operation result table generating module 402 for conducting the competition analysis generates competition analysis multi-dimensional analysis operation result table data.
  • a method of generating the competition analysis multi-dimensional analysis operation result table data by the multi-dimensional analysis operation result table generating module 402 will be described.
  • The competition in view of the applicant can be defined as the conflict between applicant A and other applicants B belonging to applicant A's nation (a first nation) or to at least one other nation (a second nation).
  • The conflict between the applicants may occur when they are commonly involved in 1) a numerously-filed technical field, 2) a highly concentrated field, and 3) a highly active field, and it is preferred that the numerously-filed/highly concentrated/highly active technical field is defined with at least one level-based patent classification code, such as the IPC at a given level.
  • For example, suppose applicant A of a first nation files many applications in H01L on the basis of the IPC subclass.
  • The competitors include 1) an applicant having multiple applications in H01L (an applicant having a high occupancy rate), 2) an applicant having a high concentration rate in H01L, and 3) an applicant having a high activity rate in H01L, on the basis of the IPC subclass in the first nation (in a second nation, applicants 1), 2), and 3) can be direct or potential competitors).
  • Likewise, if applicant A is highly concentrated and active in H01L, applicants 1), 2), and 3) of the first nation or a second nation can be competitors.
  • the multi-dimensional analysis operation result table generating module 402 generates the following information.
  • the multi-dimensional analysis operation result table generating module 402 extracts 1) a numerously-filed/numerously-registered applicant, 2) an applicant highly concentrating on H04B 7/02, 3) a highly active applicant, or 4) an applicant having a high calculation value according to other patent analysis indexes with respect to the application document/registration document in a first nation.
  • The extraction can basically be performed with an SQL query.
  • The multi-dimensional analysis operation result table generating module 402 accesses the multi-dimensional analysis operation result table data with a simple SQL query so as to obtain information on at least one applicant highly related to the target competition and the amount of that applicant's applications/registrations in the unit of the preset time period. If the multi-dimensional analysis operation result table data has not been generated, the target information can be obtained by using a relatively long and complex SQL query having the following steps.
  • The logic of the SQL query includes commands processing the steps of 1) extracting, from the patent document master DB 202 in the unit of the nation/nation unification, every document including the obtained specific-level patent classification code (e.g. IPC H04B 7/02) or the sub-patent classification codes of that code (the sub-patent classification codes are searched for and obtained in the patent classification code master DB 203, as described previously; if an identical document is present two or more times, the duplication must be removed), 2) obtaining the information on the applicant and the date, such as the application date/registration date, from the bibliographical information of the extracted documents, 3) sorting the obtained information by numerously-filed/numerously-registered applicant and obtaining ranking information on the numerously-filed/numerously-registered applicants, and/or 4) counting the amount of applications/registrations in the unit of the preset term from the application date/registration date information. It is of course that any two or more steps 1) to 4) can be processed
  • The number of applications/registrations for H04B 7/02 of each applicant is divided by the total number of applications/registrations of that applicant (which can be obtained from the patent document master DB 202 with SQL) so as to calculate the concentration rate, and the sorting of step 3) can then be performed on the calculated concentration rate.
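Steps 1) to 4) and the concentration rate can be sketched with an in-memory SQLite database. The table names `doc` and `doc_ipc`, their columns, the sample rows, and the list of sub-codes are all assumptions made for this illustration; the specification does not disclose the schema of the patent document master DB 202, and the sub-codes are taken as already looked up from the patent classification code master DB 203.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE doc (doc_id TEXT PRIMARY KEY, applicant TEXT, app_year INTEGER);
CREATE TABLE doc_ipc (doc_id TEXT, ipc TEXT);
""")
conn.executemany("INSERT INTO doc VALUES (?, ?, ?)", [
    ("D1", "A", 2005), ("D2", "A", 2006), ("D3", "B", 2005),
    ("D4", "B", 2006), ("D5", "B", 2006), ("D6", "C", 2006),
])
conn.executemany("INSERT INTO doc_ipc VALUES (?, ?)", [
    ("D1", "H04B 7/02"), ("D1", "H04B 7/04"),  # one document matching twice
    ("D2", "H01L 21/00"), ("D3", "H04B 7/04"), ("D4", "H04B 7/02"),
    ("D5", "G06F 17/30"), ("D6", "H04B 7/06"),
])

# Target code plus its sub-codes (assumed to be already resolved).
target_codes = ["H04B 7/02", "H04B 7/04", "H04B 7/06"]
marks = ",".join("?" * len(target_codes))

# Steps 1) to 3): matching documents, de-duplicated, ranked per applicant.
ranking = conn.execute(f"""
    SELECT d.applicant,
           COUNT(DISTINCT d.doc_id) AS in_field,
           (SELECT COUNT(*) FROM doc t WHERE t.applicant = d.applicant) AS total
    FROM doc d JOIN doc_ipc i ON i.doc_id = d.doc_id
    WHERE i.ipc IN ({marks})
    GROUP BY d.applicant
    ORDER BY in_field DESC, d.applicant
""", target_codes).fetchall()

# Concentration rate: documents in the field divided by all documents of the applicant.
concentration = {app: in_field / total for app, in_field, total in ranking}

# Step 4): count per applicant and year.
by_year = conn.execute(f"""
    SELECT d.applicant, d.app_year, COUNT(DISTINCT d.doc_id)
    FROM doc d JOIN doc_ipc i ON i.doc_id = d.doc_id
    WHERE i.ipc IN ({marks})
    GROUP BY d.applicant, d.app_year
    ORDER BY d.applicant, d.app_year
""", target_codes).fetchall()
print(ranking, concentration, by_year)
```

`COUNT(DISTINCT ...)` handles the duplicate-removal requirement of step 1): document D1 carries two matching codes but is counted once.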
  • If the multi-dimensional analysis operation result table generating module 402 has generated the multi-dimensional analysis operation result table data according to the level of the patent classification code, the processing can be performed with a simple SQL query (such a query is straightforward for a person skilled in the art).
  • The records containing H04B 7/02 in the IPC 1-dot level column (C5) are extracted, and the record count is grouped by the applicant field AppName so as to produce the ranking.
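Against a pre-generated table, the same ranking reduces to a single filter plus group-by. In this sketch only the column names `C5` and `AppName` come from the text; the table name `op_result`, the other columns, and the rows are hypothetical.

```python
import sqlite3

db = sqlite3.connect(":memory:")
# Hypothetical layout: one record per document, C5 holding its IPC 1-dot level code.
db.execute("CREATE TABLE op_result (DocNo TEXT, AppName TEXT, AppYear INTEGER, C5 TEXT)")
db.executemany("INSERT INTO op_result VALUES (?, ?, ?, ?)", [
    ("K1", "Applicant A", 2005, "H04B 7/02"),
    ("K2", "Applicant B", 2005, "H04B 7/02"),
    ("K3", "Applicant B", 2006, "H04B 7/02"),
    ("K4", "Applicant B", 2006, "H01L 21/02"),
    ("K5", "Applicant C", 2006, "H04B 7/02"),
])

# Filter on the C5 column, group by AppName, order by count to obtain the ranking.
rows = db.execute("""
    SELECT AppName, COUNT(*) AS n
    FROM op_result
    WHERE C5 = ?
    GROUP BY AppName
    ORDER BY n DESC, AppName
""", ("H04B 7/02",)).fetchall()
print(rows)
```

No join or sub-code lookup is needed here because the level column was filled in when the table was generated, which is the saving the text attributes to the pre-computed table.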
  • Through this process, it is possible to extract competing applicant B, etc. based on H04B 7/02 of applicant A, and to provide the users with the number of applications/registrations of applicant B according to the year and term by reading, from the multi-dimensional analysis operation result table, the processed values of the documents of applicant B that include H04B 7/02 and its sub-patent classification codes.
  • The method of generating at least one multi-dimensional analysis operation result table data for competition analysis by the multi-dimensional analysis operation result table generating module 402 includes the following steps. First, the multi-dimensional analysis operation result table generating module 402 generates the multi-dimensional analysis operation result table data as shown in Table 56 according to the preset type of competition. When the multi-dimensional analysis operation result table generating module 402 generates the table/data as shown in Table 65 with the occupancy rate, concentration rate, activity rate, or other analysis index, the AppName, the level of the patent classification code, etc. are identical, and the numerical value for each year/term becomes the value of the occupancy rate, concentration rate, activity rate, or other analysis index, respectively.
  • The patent classification codes have a multi-stage hierarchical structure like that of the IPC, and the hierarchy can be distinguished by the number of dots, so the patent classification codes can be processed in the same way as the IPC.
  • FIG. 33 illustrates a representative competing company analysis based on the total amount of Samsung Electronics among all the applicants in Korea in the DB of the patent information system 1 of the present invention according to the exemplary embodiment of the present invention.
  • The analysis of the representative competing company by applicant targets the patent classification codes at the IPC group level among the applicant's numerously-filed patent classification codes: it obtains the ranking and the application amount of those patent classification codes, grades a score with the preset calculation formula based on the application-amount information of the applicants who file multiple applications under each patent classification code, and provides the high-scoring applicants together with their ranking. It is preferred that the representative competing company analysis also shows information on the number of applications/registrations of the representative competing companies according to the year.
  • The documents corresponding to a number of applications/registrations are obtained with the query formula underlying that number and transmitted to the simplified analysis module 407, and the simplified analysis module 407 provides the simply analyzed information on the documents.
  • Every numerical value in a cell of the present invention is based on a query, so the basis documents that produced the numerical value can be obtained from the respective query formula; this applies to every cell. If the numerical value is a ratio, a query corresponds to each of the numerical values of the indexes in the formula that generates the ratio, so the basis documents can be obtained from those queries.
  • FIG. 33 illustrates a competing applicant analysis according to the numerously-filed patent technology classification code based on the total amount of Samsung Electronics among all the applicants in Korea in the DB of the patent information system 1 of the present invention according to the exemplary embodiment of the present invention.
  • Obtaining the competing applicant according to the specific technology classification code and its level can 1) obtain ranking information (H01L of first ranking, H04N of second ranking in FIG.
  • If the code is drilled down, 1) to 3) above are performed for each sub-patent classification code of the drilled-down patent classification code that includes at least one document of the applicant, so as to provide information on the competing applicant. Drill-down is possible down to the lowermost patent classification code.
  • The analysis of the competing company can be conducted by the same method with respect to the USPC, FT, etc., as well as the IPC, but the nation selection may be limited (because the USPC is the US patent classification code, the competing applicant cannot be obtained from Korean or Japanese patent documents).
  • the competing applicant can be generated based on not only the application total amount of the applicant, but also the registration total amount of the applicant.
  • the reference patent classification code is generated based on the patent document of the specific applicant of a first nation and the numerously-filed, highly concentrating, highly occupying, and highly active competing applicant can be extracted in a second nation, not in a first nation, according to the reference patent classification code.
  • a first nation and a second nation can be selected by the user or by the default value of the system 1.
  • The analysis information for the entering competing company can be generated only with respect to the documents within a preset recent term (e.g. the most recent 7 years). That is, 1) based on the documents of the applicant filed/registered within the recent term in a first nation, the information on the reference of the patent classification code according to the patent classification code level is generated, and the competing applicants that are numerously filing, highly concentrating, highly occupying, and highly active according to the reference of the patent classification code, on the basis of i) the recent term or ii) the entire term, can be extracted in the first nation or a second nation.
  • The competition in view of the inventor can be processed by the same method as the competition in view of the applicant. For the applicant, the table/data is generated per applicant-based document set according to an analysis index, such as the total amount, occupancy rate, concentration rate, and activity rate, based on the patent classification code level, and the data is accessed on the basis of the applicant. For the inventor, the document set is generated based on the inventor, the table/data is generated from the inventor-based document set in the same manner, and the data is accessed on the basis of the inventor. An example is represented in Table 57 below.
  • In one approach, the data is generated by placing the inventor under a single applicant and processing the document set based on the inventors of that single applicant (i.e. assuming that the applicant is generally a large enterprise or organization and the inventor belongs to that enterprise or organization, the inventor is linked under the applicant).
  • The data can be generated according to the document sets satisfying conditions such as applicant A AND inventor 1, applicant A AND inventor 2, etc.
  • In the other approach, the data is generated by processing the document set based on the inventor alone, independently of the applicant (the inventor is entered in place of the applicant).
  • An example of the former is obvious, so an exemplary embodiment of the data scheme generated by the multi-dimensional analysis operation result table generating module 402 for the latter is represented in Table 57 below. [Table 57]
  • When the total amount information is given, the method by which the multi-dimensional analysis operation result table generating module 402 generates data such as the occupancy rate, concentration rate, and activity rate in the unit of the individual inventor of the specific applicant is identical to the method of generating the information based on the applicant.
  • the multi-dimensional analysis operation result table generating module 402 may generate the data of the value of the patent analysis index, such as the occupancy rate, concentration rate, activity rate, on a basis of the total amount information in the unit of the individual inventor, not the inventor under the specific applicant.
  • The analysis of the inventor according to the patent classification code level-based area includes the steps of 1) generating the ranking of the patent classification code reference according to the patent classification code level of the specific applicant, 2) executing, according to the ranking of the patent classification code reference, any one of i) calculating information on the most-filing inventor for each year and ii) calculating information on the ranking of the numerously-filing inventors (e.g. from 1st to 10th ranking) with the years unified, and 3) providing the inventor analysis information extracted through the execution to the user.
  • The analysis of the inventor according to the patent classification code level-based area need not limit the applicant in the unit of the nation, but can be conducted on the basis of the application documents or registration documents.
  • The analysis of the inventor on the basis of the application documents or registration documents includes the steps of 1) generating the ranking of the patent classification code reference according to the patent classification code level based on the entire subject document set, 2) executing, according to the ranking of the patent classification code reference, any one of i) calculating information on the most-filing inventor for each year and ii) calculating information on the ranking of the numerously-filing inventors (e.g. from 1st to 10th ranking) with the years unified, and 3) providing the inventor analysis information extracted through the execution to the user.
  • FIG. 36 illustrates a numerously-filing inventor-based analysis result according to the year based on the total amount of Samsung Electronics among all the applicants in Korea in the DB of the patent information system 1 of the present invention according to the exemplary embodiment of the present invention.
  • The generation of the information on the individual inventor-based analysis for every inventor relating to the specific applicant (the inventors belonging to the applicant company) in the unit of the nation includes the steps of 1) obtaining every application document or registration document relating to the applicant in the unit of the specific nation, 2) extracting the inventors from the obtained documents, 3) counting the number of documents for each inventor based on the preset policy, 4) generating the ranking of the inventors, and 5) generating, according to the ranking, the information on the most-filing inventors and their number of applications/registrations by year/specific term so as to provide the generated information to the user.
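Steps 1) to 5) of the inventor-based analysis can be sketched as follows. The record layout, the sample names, and the counting policy (every listed inventor receives one count per document) are assumptions; the specification only says that counting follows a preset policy.

```python
from collections import Counter, defaultdict

# Hypothetical document records.
docs = [
    {"id": "K1", "applicant": "Applicant A", "year": 2005, "inventors": ["Kim", "Lee"]},
    {"id": "K2", "applicant": "Applicant A", "year": 2006, "inventors": ["Kim"]},
    {"id": "K3", "applicant": "Applicant A", "year": 2006, "inventors": ["Park", "Kim"]},
    {"id": "K4", "applicant": "Applicant B", "year": 2006, "inventors": ["Choi"]},
]

def inventor_ranking(docs, applicant, top_n=10):
    # Step 1: documents of the given applicant.
    selected = [d for d in docs if d["applicant"] == applicant]
    totals = Counter()
    per_year = defaultdict(Counter)
    for d in selected:
        # Steps 2-3: extract inventors and count one document per listed inventor.
        for inv in set(d["inventors"]):
            totals[inv] += 1
            per_year[inv][d["year"]] += 1
    # Step 4: rank by document count, ties broken by name.
    ranked = sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]
    # Step 5: attach the year-based counts to each ranked inventor.
    return [(inv, n, dict(sorted(per_year[inv].items()))) for inv, n in ranked]

result = inventor_ranking(docs, "Applicant A")
print(result)
```

A different counting policy, such as crediting only the first-named inventor, would change only the inner loop.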
  • The documents can be limited to those belonging to a preset term, such as the most recent 7 years. This is essentially a matter of how the document set is generated, and the document set may be generated by any of the document set generating methods described in the present specification.
  • the patent technology classification utilization technology analysis means the analysis of the document set determined through the patent classification code among the document sets.
  • The kinds of patent technology classification utilization technology analysis document sets generated according to the patent classification code include 1) a document set generated with the documents including at least one patent classification code selected from one specific kind of patent classification code (IPC, USPC, FT, ECLA, etc.) (if two or more patent classification codes are involved, various operations between the patent classification codes, such as OR, AND, and NOT, are included) and 2) a document set including two or more kinds of patent classification codes (IPC AND USPC, etc.).
  • Both 1) and 2) document sets can be 1) in the unit of the individual nation or 2) in the unit of the nation union, and the duplication in the unit of the nation must be removed (a result of the union operation). Further, 3) whether to process the document set including only the main patent classification code or including every patent classification code is determined according to the preset policy or the user's selection, which are all related to the determination of the document set.
  • The information obtained from the patent technology classification utilization technology analysis is the information on 1) the occupancy rate, 2) the concentration rate, 3) the activity rate, and 4) other patent analysis indexes of the patent technology classification utilization technology analysis document set (hereinafter referred to as the technology analysis document set).
  • Among the patent technology classification utilization technology analyses, the total amount analysis is described first.
  • The total amount analysis provides the analysis result in terms of quantity, such as the number of term-based/year-based applications/registrations according to the patent technology classification code level. The documents including the sub-patent classification codes of a specific patent classification code are also counted by the roll-up operation, and the multi-dimensional operated result is stored (the result is the numerical value obtained by summing the number of patent documents including the specific patent classification code and the number of patent documents including its sub-patent classification codes, with duplication removed).
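The roll-up described above can be sketched by propagating each document to every ancestor of its classification codes and counting distinct documents per code and year. The `parent` map is a toy hierarchy used only for illustration, and the function names and sample documents are hypothetical.

```python
from collections import defaultdict

# Toy hierarchy: child code -> parent code (illustrative links only).
parent = {
    "H04B 7/02": "H04B 7/00",
    "H04B 7/04": "H04B 7/02",
    "H04B 7/06": "H04B 7/04",
}

def ancestors_and_self(code, parent):
    # Walk upward from the code to the top of the hierarchy.
    while code is not None:
        yield code
        code = parent.get(code)

def roll_up(docs, parent):
    """docs: iterable of (doc_id, year, codes). Returns {(code, year): count}."""
    members = defaultdict(set)  # (code, year) -> distinct document ids
    for doc_id, year, codes in docs:
        for code in codes:
            for c in ancestors_and_self(code, parent):
                members[(c, year)].add(doc_id)  # the set removes duplicates
    return {key: len(ids) for key, ids in members.items()}

docs = [
    ("D1", 2005, ["H04B 7/02", "H04B 7/04"]),
    ("D2", 2005, ["H04B 7/06"]),
    ("D3", 2006, ["H04B 7/04"]),
]
counts = roll_up(docs, parent)
print(counts[("H04B 7/02", 2005)], counts[("H04B 7/02", 2006)])
```

Document D1 carries both a code and one of its sub-codes, yet it contributes only once to each ancestor count, which corresponds to the duplicate removal noted in the text.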
  • Table 58 represents the multi-dimensional analysis operation result table for a year-based distribution of the total application amount of the USPC according to the exemplary embodiment.
  • USPC 002048000 is equal to 2/48 and USPC 002049100 is equal to 2/49.1; the difference is only one of notation. If the first three digits, which denote the class number, and the number obtained by dividing the last six digits by 1000 are placed at the left and right of "/", respectively, the result corresponds to the USPC as typically indicated in a publication document. [Table 58]
  • The number of applications/registrations in a given USPC can be identified by obtaining the multi-dimensionally operated count or total for each year/term from the column of the specific USPC level (class to n dot).
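The conversion between the stored nine-digit USPC notation and the notation printed on publications (three-digit class, six-digit subclass divided by 1000) can be illustrated with a short sketch. The function name `format_uspc` is hypothetical, and the sketch assumes purely numeric nine-digit codes; classes that contain letters are not handled.

```python
def format_uspc(stored: str) -> str:
    """Convert a stored 9-digit USPC code to the class/subclass notation."""
    if len(stored) != 9 or not stored.isdigit():
        raise ValueError(f"expected a 9-digit numeric USPC code, got {stored!r}")
    class_no = int(stored[:3])                  # first three digits: class number
    whole, frac = divmod(int(stored[3:]), 1000) # last six digits divided by 1000
    if frac:
        # Keep leading zeros of the fractional part, drop trailing zeros.
        return f"{class_no}/{whole}.{frac:03d}".rstrip("0")
    return f"{class_no}/{whole}"
```

Integer arithmetic is used instead of floating-point division so that subclasses such as 49.1 are rendered exactly.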
  • the multi-dimensional analysis operation result table generating module 402 can generate the multi-dimensional analysis operation result table data as shown in Table 59 for the applicant-based multi-stage patent classification code-based analysis. [Table 59]
  • the data such as the multi-dimensional analysis operation result table according to the analysis index, such as occupancy rate, concentration rate, and activity rate
  • the analysis index such as occupancy rate, concentration rate, and activity rate
  • The multi-dimensional analysis operation result table shown below is suggested for the exemplary embodiment.
  • It is obvious to one skilled in the art that the same operation can be performed with respect to other patent indexes, such as the occupancy rate.
  • the multi-dimensional analysis operation result table generating module 402 of the present invention generates the multi-dimensional analysis operation result table data below for the convergence analysis of the present invention.
  • The multi-dimensional analysis operation result table generating module 402 receives the inputted, established, or determined patent document set. It processes the documents in which multiple patent classification codes of one type are combined: where a document has only IPCs, it is processed if it has two or more IPCs; where it has both IPC and USPC, the IPCs are processed if there are two or more but not if there is only one; and where it has two or more USPCs, they are processed into the USPC multi-dimensional analysis operation result table.
  • At least one type of patent classification code such as IPC, USPC, FI, FT, ECLA
  • The patent classification code includes at least one piece of information. The data shown in Table 72 includes, for each document number, at least one essential main patent classification code and optional sub-patent classification codes (here, the type of the main patent classification code is identical to that of the sub-patent classification code). It can also be seen that each patent classification code included in the document is arrayed together with every higher-level patent classification code above it.
  • The multi-dimensional analysis operation result table generating module 402 determines, by referring to the patent classification code master DB 203, that the main patent classification code is at the C6 level and enters it in the C6 column. It then enters the directly higher patent classification code (1 dot level) in the C5 column to its left and the patent classification code of the main group level in the C4 column, and continues in this way up to the topmost level.
  • The same procedure performed for the main IPC is applied to each sub-IPC included in the patent document. Other bibliographic data, such as the applicant, can be obtained through the document number and are therefore optional elements.
  • Korean Patent Application No. 10-2005-0111868 is assigned H04B 7/26 and H04B 7/15 as of January 2006. This is described by way of example in Table 60 (for convenience of notation, the sub-class part of the patent classification codes below the main group is omitted). [Table 60]
  • Korean Patent Application No. 10-2006-0012606 includes the patent classification codes H04B 7/04, H04B 7/155, and H04Q 7/30 assigned as of January 2006, and is described by way of example on that basis (for convenience of notation, AppName, etc. are omitted and the rows relating to H04Q 7/30 are shown on a lower line). [Table 61]
  • Korean Patent Application No. 10-2005-0042032 includes the patent classification codes of H04B 7/02 and H04B 7/14. [Table 62]
  • the multi-dimensional analysis operation result table data can be generated with respect to USPC, FT, etc. by the multi-dimensional analysis operation result table generating module 402.
  • The multi-dimensional analysis operation result table generating module 402 generates multi-patent classification code pair information, as included in Table 63 below, from the series of information of the level-based multi-patent classification codes.
  • The types of multi-patent classification code pairs have been described in connection with the homogeneous patent classification code correlation pre-processing module.
  • In Table 63 below, bibliographic particulars that have only a single value, such as the application date and registration date, can be added as columns alongside. However, as described above, a separate row must be created for each applicant or inventor in the case of joint applicants or joint inventors.
  • Korean Patent Application No. 10-2005-0111868, Korean Patent Application No. 10-2006-0012606, and Korean Patent Application No. 10-2005-0042032 (of course, by repeatedly processing all the obtained documents, it is possible to generate the following data in relation to all the patent classification code combinations (Ai, Bj) based on all the documents).
  • Korean Patent Application No. 10-2005-0111868 includes 4 pairs of patent classification code combinations
  • Korean Patent Application No. 10-2006-0012606 includes 22 pairs (6+16) of patent classification code combinations (main IPC pair, sub-IPC pair) and 20 pairs of patent classification code combinations (sub-IPC pair, sub-IPC pair), and Korean Patent Application No. 10-2005-0042032 includes 1 pair of patent classification code combinations, so that the three patent applications include a total of 47 pairs of patent classification code combinations. It is of course possible to represent all the pairs, but for convenience of description, the present inventive idea is described with Table 63, in which some of the pairs are omitted and which can be taken to correspond to the total of 47 pairs. [Table 63]
  • The multi-dimensional analysis operation result table generating module 402 generates the multiple patent classification code pair information shown in Table 65 below. For convenience of description, the individual document numbers are not indicated, but their order is identical to that of the document numbers shown in Table 64 above. It is further preferred that the multi-dimensional analysis operation result table data shown in Table 65, generated by the module 402, also include the other bibliographic particulars (applicant, inventor, application date, registration date) of each document number. In the table below, M/S is a field marked M if the patent classification code is obtained from the main IPC and S if it is obtained from a sub-IPC; this is because the convergence is preferably judged with the main IPC as the reference. [Table 65]
  • The first is a convergence analysis that does not use subject information, performed when a patent classification code is obtained.
  • When a single patent classification code is obtained or given, it is possible to look up the ranking of the patent classification codes that converge well with it (that is, those with a high frequency of forming a pair with it).
  • The IPCs with a high convergence frequency can be obtained at each IPC level; for example, the patent classification code that converges well with H04B 7/04 is H04B 7/155 at the C7 (IPC 3 dot) level and H04B 7/15 at the C6 level. If statistic/analysis information is generated for the obtained IPCs (by processing with the group-by, count, and rank commands), the ranking of the IPCs with a high convergence frequency can be identified. For example, convergence ranking information can be generated such as "the IPC of C6 level that converges most with H04B 7/04 is H04B 7/15, and the IPC of C6 level that converges with H04B 7/04 at the next rank is H04Q 7/30".
  • The specific documents in which the convergence occurs can be identified through the document number, and the corresponding documents can also be retrieved by submitting the converging IPC pair as a query to the search engine or DBMS 291. Year-based/term-based statistic/analysis information on the patent classification codes that converge well with the obtained patent classification code can be obtained for each IPC level (IPC sub-class, main group, 1 dot, 2 dot, ...).
  • This can be implemented by searching for the obtained patent classification code (e.g. H04B 7/04) in one side of the table (preferably the main IPC side), determining the level (that is, the column) among C1 to Cn in every record (row) that includes the searched patent classification code, and generating the statistic/analysis data (through the group-by, count, and rank commands) for every patent classification code existing at the determined level. Further, if term/date information is included in the multi-dimensional analysis operation result table data, the statistic/analysis data of the patent classification codes that converge well with the obtained patent classification code can be generated per term (e.g. per year).
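The group-by, count, and rank step for the convergence analysis can be sketched in a few lines. This is only an illustration under stated assumptions: the `rows` below are hypothetical pair records invented for the example (document ID, year, code on the searched side, partner code already resolved to the C6 level), not data from the tables in this description, and `rank_partners` is a hypothetical helper name.

```python
from collections import defaultdict

# Hypothetical pair rows: (document id, year, searched-side code, partner code at C6 level).
rows = [
    ("D1", 2006, "H04B 7/04", "H04B 7/15"),
    ("D1", 2006, "H04B 7/04", "H04Q 7/30"),
    ("D2", 2006, "H04B 7/04", "H04B 7/15"),
    ("D3", 2007, "H04B 7/04", "H04B 7/15"),
    ("D3", 2007, "H04B 7/04", "H04Q 7/30"),
    ("D4", 2007, "H04B 7/26", "H04B 7/15"),
]


def rank_partners(rows, target, year=None):
    """Rank partner codes by the number of distinct documents pairing them with target."""
    docs_per_partner = defaultdict(set)
    for doc_id, row_year, code, partner in rows:
        if code != target:
            continue                                  # search the target on one side
        if year is not None and row_year != year:
            continue                                  # optional year filter
        docs_per_partner[partner].add(doc_id)         # group by partner code
    counts = {p: len(ids) for p, ids in docs_per_partner.items()}  # count
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    # Rank by position; ties are broken alphabetically rather than sharing a rank.
    return [(rank, code, n) for rank, (code, n) in enumerate(ordered, start=1)]


overall = rank_partners(rows, "H04B 7/04")
only_2007 = rank_partners(rows, "H04B 7/04", year=2007)
```

In a DBMS the same result would typically come from a grouped count query with an ordering clause; the in-memory version is shown only to make the three steps explicit.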

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Tourism & Hospitality (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Economics (AREA)
  • Technology Law (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Analysing Materials By The Use Of Radiation (AREA)
PCT/KR2008/007398 2007-12-12 2008-12-12 Patent information providing method and system WO2009075554A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2007-0129342 2007-12-12
KR20070129342 2007-12-12

Publications (2)

Publication Number Publication Date
WO2009075554A2 (en) 2009-06-18
WO2009075554A3 (en) 2009-09-24

Family

ID=40756001

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2008/007398 WO2009075554A2 (en) 2007-12-12 2008-12-12 Patent information providing method and system

Country Status (2)

Country Link
KR (1) KR101534155B1 (ko)
WO (1) WO2009075554A2 (ko)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020069819A1 (en) * 2018-10-01 2020-04-09 Eta Sa Manufacture Horlogère Suisse Method of communication for a database
CN111460784A (zh) * 2020-01-20 2020-07-28 索意互动(北京)信息技术有限公司 分析文本信息的方法、系统和计算机存储介质
WO2020232092A1 (en) * 2019-05-15 2020-11-19 RELX Inc. Systems and methods for generating a low-dimensional space representing similarities between patents

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011123131A1 (en) * 2010-04-02 2011-10-06 Cpa Global Patent Research Limited Intellectual property scoring platform
WO2011149178A2 (ko) * 2010-05-28 2011-12-01 (주)광개토연구소 니치 기술 영역 탐색, 분석 및 평가 지원 정보 처리 방법 및 그 시스템
US20130282599A1 (en) * 2010-11-02 2013-10-24 Kwanggaeto Co., Ltd. Method of generating patent evaluation model, method of evaluating patent, method of generating patent dispute prediction model, method of generating patent dispute prediction information, and method and system for generating patent risk hedging information
KR101189068B1 (ko) * 2010-11-15 2012-10-10 (주)광개토연구소 n차원 프레임을 활용하여 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템의 정보 처리 방법 및 그 시스템
KR101255181B1 (ko) * 2011-03-23 2013-04-16 강민수 특허 분쟁 예측 모델 생성 방법, 그 방법을 실시하는 시스템 및 그 방법이 기록된 기록 매체
KR101247252B1 (ko) * 2011-05-09 2013-03-25 한국생산기술연구원 융합 지수 서비스 시스템
US20150121185A1 (en) * 2013-10-28 2015-04-30 Reed Technology And Information Services, Inc. Portfolio management system
KR101672987B1 (ko) * 2014-03-31 2016-11-07 (주)윕스 지식재산권 평가 장치 및 방법
KR101668011B1 (ko) * 2015-11-04 2016-10-28 (주)윕스 맞춤형 검색영역설정 서비스 제공 방법 및 이를 위한 서버
KR101668007B1 (ko) * 2015-11-04 2016-10-20 (주)윕스 검색문헌 치환 서버 및 방법
KR102497151B1 (ko) * 2017-09-12 2023-02-07 특허법인 플러스 출원인 정보 채우기 시스템 및 방법
KR102062885B1 (ko) * 2018-03-04 2020-03-02 경기대학교 산학협력단 특허 심판 및 소송 데이타를 포함한 특허기술 거래 방법

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20030012516A (ko) * 2001-08-01 2003-02-12 (주) 위즈도메인 특허 데이터베이스로부터 특허간의 상호관계를 분석하여제공하는 방법
US20040024733A1 (en) * 2002-07-11 2004-02-05 Won Jeong Wook Method for constructing database of technique classification patent map
KR20060114569A (ko) * 2005-05-02 2006-11-07 (주) 위즈도메인 특허정보시스템의 작동방법
KR20070071908A (ko) * 2005-12-30 2007-07-04 (주)광개토연구소 특허정보 검색시스템

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101421221B1 (ko) * 2007-01-09 2014-07-30 (주)광개토연구소 출원인 명의 단위의 특허 정보 서비스 방법

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020069819A1 (en) * 2018-10-01 2020-04-09 Eta Sa Manufacture Horlogère Suisse Method of communication for a database
US11556594B2 (en) 2018-10-01 2023-01-17 Eta Sa Manufacture Horlogere Suisse Communication method for database
WO2020232092A1 (en) * 2019-05-15 2020-11-19 RELX Inc. Systems and methods for generating a low-dimensional space representing similarities between patents
US11200448B2 (en) 2019-05-15 2021-12-14 RELX Inc. Systems and methods for generating a low-dimensional space representing similarities between patents
CN111460784A (zh) * 2020-01-20 2020-07-28 索意互动(北京)信息技术有限公司 分析文本信息的方法、系统和计算机存储介质
CN111460784B (zh) * 2020-01-20 2023-09-26 索意互动(北京)信息技术有限公司 分析文本信息的方法、系统和计算机存储介质

Also Published As

Publication number Publication date
KR20090063156A (ko) 2009-06-17
KR101534155B1 (ko) 2015-07-07
WO2009075554A3 (en) 2009-09-24

Similar Documents

Publication Publication Date Title
WO2009075554A2 (en) Patent information providing method and system
US11709871B2 (en) Computer implemented method for quantifying the relevance of documents
Stryker et al. Validation of database search terms for content analysis: The case of cancer news coverage
Stvilia et al. A framework for information quality assessment
Coffano et al. CRIOS-Patstat database: sources, contents and access rules
KR101476350B1 (ko) 발명자 단위의 특허 정보 제공 방법
KR101534159B1 (ko) 소셜 네트워크 분석 결과의 생성을 위한 특허 정보 융합 분석 방법 및 그 시스템
US20030225761A1 (en) System for managing and searching links
Afzal et al. Expertise Recommender System for Scientific Community.
KR101067846B1 (ko) 특허 평가 시스템 및 그 방법, 이를 구현하기 위한 프로그램을 기록한 디지털 처리장치가 판독 가능한 기록매체
US7257568B2 (en) Process and system for matching products and markets
KR101401225B1 (ko) 문서 분석 시스템
Nwachukwu et al. Trends in work life balance research: a bibliometric analysis
Menin et al. From linked data querying to visual search: towards a visualization pipeline for LOD exploration
Sala et al. Midas for government: Integration of government spending data on Hadoop
KR101216116B1 (ko) 니치 평가 모델 생성 및 니치 기술 영역 평가 방법 및 그 시스템
KR101201218B1 (ko) 니치 기술 영역 발견을 지원하는 니치 기술 영역 발견을 지원하는 특허 정보 처리 방법
WO2008084979A1 (en) Patent searching method and patent search system
KR20090062270A (ko) 출원인 명의 단위의 자동 분석 지원 특허 정보 시스템
KR20090062288A (ko) 융합 분석용 다차원 연산 수행 결과 테이블을 포함하고있는 특허 정보 시스템
KR20090062251A (ko) 특허 분류 기호 전처리 모듈 포함 특허 정보 시스템
CN106682126B (zh) 基于总体数据质量的主题数据集过滤与排序方法及系统
KR20090062158A (ko) 특허 정보 시스템
KR20090062234A (ko) 문건 집합 통합 인용 분석 기능을 가진 특허 정보 시스템
Yan et al. Hyperlink analysis for government websites of Chinese provincial capitals

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08858815

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 24/08/2010)

122 Ep: pct application non-entry in european phase

Ref document number: 08858815

Country of ref document: EP

Kind code of ref document: A2