JP2018169854A

JP2018169854A - Patent requirement propriety prediction device and patent requirement propriety prediction program

Info

Publication number: JP2018169854A
Application number: JP2017067305A
Authority: JP
Inventors: 和之白井; Kazuyuki Shirai
Original assignee: Individual
Current assignee: Individual
Priority date: 2017-03-30
Filing date: 2017-03-30
Publication date: 2018-11-01
Anticipated expiration: 2037-03-30
Also published as: JP6308706B1

Abstract

PROBLEM TO BE SOLVED: To predict whether patent requirements are satisfied in a manner consistent with examination practice, and to effectively reduce the burden of preparing application documents.
SOLUTION: A patent requirement suitability prediction server includes: summary data storage means which stores summary data, that is, data which is extracted from draft document data constituting a draft document in which the invention to be predicted is described in one or more claims and which includes a detailed description of that invention, and which indicates terms capable of specifying the summary of the invention to be predicted; designated prior art data storage means which stores designated prior art data; and patent requirement suitability prediction processing means having an inventive step prediction processing unit, which generates invention-designated inventive step prediction data on whether the invention to be predicted satisfies the inventive step requirement in relation to the designated prior art invention, and a prediction result file generation unit, which generates a prediction result file indicating the patent requirement suitability of the invention to be predicted using the invention-designated inventive step prediction data.
SELECTED DRAWING: Figure 4

Description

The present invention relates to a patent requirement suitability prediction device and a patent requirement suitability prediction program that predict whether an invention described in a draft of an application document satisfies patent requirements such as novelty and inventive step.

Predictions have long been made in a wide range of settings, such as forecasting power demand and stock prices, purchases of goods, and future real estate prices, and many devices and methods have been proposed for that purpose. With regard to the work of filing a patent application for an invention and obtaining a right, a technique for predicting the patentability of the filed invention has also been proposed (see, for example, Patent Document 1). Patent Document 1 describes the following patentability prediction device. The device acquires, from a patent database, patent applications for which examination results have been notified (notified applications), detects the amount of information in the claims of each notified application and the number of similar prior applications, performs a regression analysis on the notified applications, and, in accordance with a registration prediction formula calculated from these, calculates a predicted patentability value for applications for which examination results have not yet been notified.

In addition to predicting patentability, devices and methods have also been proposed for evaluating the patentability and quality of inventions and the value of patent applications and patent rights (see, for example, Patent Documents 2, 3, 4, 5, and 6).

Patent Document 1: JP 2009-238074 A
Patent Document 2: JP 2015-207194 A
Patent Document 3: JP 2000-181966 A
Patent Document 4: JP 2000-132606 A
Patent Document 5: JP 2015-187883 A
Patent Document 6: JP 2007-108803 A

As described above, the prior art makes it possible to predict the patentability of a patent application and to evaluate patent rights.

However, in the prior art described above, for example the patentability prediction device of Patent Document 1, patentability is predicted according to a registration prediction formula calculated from information such as the amount of information in the claims of notified applications and the number of similar prior applications. This prediction rests on statistical correlations among the breadth of the claims, the density of the technical field, and patentability; it is not based on the Patent Act or the Examination Guidelines for Patent and Utility Model. The patentability prediction device of Patent Document 1 is therefore highly likely to produce prediction results that are not in line with patent practice.

Incidentally, when filing a patent application, it is desirable to conduct a prior art search for the invention to be filed, compare the invention with the prior art described in the documents found, and prepare the application documents so that the differences between the invention and the prior art are made clear and the patent requirements are satisfied.

However, patent requirements are examined by Japan Patent Office examiners in accordance with the Patent Act and the Examination Guidelines for Patent and Utility Model (Patent Act, Article 47). Preparing application documents that satisfy the patent requirements therefore requires an understanding of both. In particular, preparing application documents that satisfy the requirement of Article 29, Paragraph 2 of the Patent Act (inventive step) calls for a thorough understanding of the Patent Act and the Examination Guidelines as well as considerable experience and skill. Preparing application documents that satisfy the patent requirements thus takes a great deal of time and effort.

One might therefore expect that, because the prior art of Patent Document 1 predicts the patentability of an invention, using it would reduce the burden of preparing application documents.

However, with the prior art of Patent Document 1, the prediction results may not conform to examination practice, so using them may not effectively reduce the burden of preparing application documents.

It has therefore been desired that, for an invention to be filed, the suitability of the patent requirements be predicted in a manner consistent with examination practice, so that the burden of preparing application documents that satisfy the patent requirements is effectively reduced.

The present invention has been made to solve the above problem, and its object is to provide a patent requirement suitability prediction device and a patent requirement suitability prediction program that predict the suitability of patent requirements in a manner consistent with examination practice and can thereby effectively reduce the burden of preparing application documents.

To solve the above problem, the present invention is characterized by a patent requirement suitability prediction device comprising: summary data storage means for storing, as summary data, data extracted from draft document data constituting a draft document in which a prediction target invention, that is, the invention whose patent requirement suitability is to be predicted, is described in one or more claims and which includes a detailed description of the prediction target invention, the data being term data indicating terms that can specify the summary of the prediction target invention and including at least feature part data extracted from the characterizing portion of each claim and problem data extracted from the problem section of the detailed description; designated prior art data storage means for storing designated prior art data constituting a designated prior art document in which a designated prior art invention is described; and patent requirement suitability prediction processing means having an inventive step prediction processing unit that generates invention-designated inventive step prediction data on whether the prediction target invention satisfies the inventive step requirement in relation to the designated prior art invention, and a prediction result file generation unit that generates, using the invention-designated inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention. The inventive step prediction processing unit has a document classification unit that classifies document vectors. The document classification unit is built, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, so as to classify an input summary movement vector as either satisfying or not satisfying the inventive step requirement and to output a requirement suitability document vector corresponding to the classification result. The summary movement vector is a vector corresponding to the difference between a summary vector of the prediction target invention for each claim and a citation candidate vector corresponding to the designated prior art data.
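The vector arithmetic described above can be sketched in a few lines of Python. The fragment forms the difference between a claim's vector and a citation candidate vector and passes it to a stand-in linear classifier. The vector values, the weights, the bias, and every function name here are illustrative assumptions; this passage states only that the classifier is built by machine learning and does not specify its form.

```python
from typing import List

Vector = List[float]


def movement_vector(claim_vec: Vector, candidate_vec: Vector) -> Vector:
    """Element-wise difference: claim vector minus citation candidate vector."""
    if len(claim_vec) != len(candidate_vec):
        raise ValueError("vectors must have the same dimension")
    return [c - p for c, p in zip(claim_vec, candidate_vec)]


def classify_inventive_step(move_vec: Vector, weights: Vector, bias: float) -> bool:
    """Hypothetical linear stand-in for the trained document classifier.

    Returns True when the claim is predicted to satisfy the inventive step
    requirement, and False otherwise.
    """
    score = sum(w * x for w, x in zip(weights, move_vec)) + bias
    return score > 0.0


# Toy term-weight vectors (assumed values, not taken from the source).
claim_vec = [0.9, 0.1, 0.7]
prior_art_vec = [0.8, 0.1, 0.1]
move = movement_vector(claim_vec, prior_art_vec)
satisfies = classify_inventive_step(move, weights=[1.0, 1.0, 1.0], bias=-0.5)
```

In this toy setup a claim that sits far from the cited document yields a large difference vector and a positive score, while a claim identical to the cited document yields a zero vector and is classified as not satisfying the requirement.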

The present invention also provides a patent requirement suitability prediction device comprising: summary data storage means for storing, as summary data, data extracted from draft document data constituting a draft document in which a prediction target invention whose patent requirement suitability is to be predicted is described in one or more claims and which includes a detailed description of the prediction target invention, the data being term data indicating terms that can specify the summary of the prediction target invention and including at least feature part data extracted from the characterizing portion of each claim and problem data extracted from the problem section of the detailed description; designated prior art data storage means for storing designated prior art data constituting a designated prior art document in which a designated prior art invention is described; and patent requirement suitability prediction processing means having a novelty prediction processing unit that searches publication data, which is electronic data of published gazettes, using the summary data stored in the summary data storage means and generates, according to the search result, novelty prediction data indicating whether the prediction target invention satisfies the novelty requirement, an inventive step prediction processing unit that generates inventive step prediction data indicating whether the prediction target invention satisfies the inventive step requirement, and a prediction result file generation unit that generates, using the novelty prediction data and the inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention. The inventive step prediction processing unit has a main cited invention search unit that searches the prior art inventions specified by the publication data for the main cited invention closest to the prediction target invention, and a document classification unit that classifies document vectors. The main cited invention search unit performs a concept search over the publication data using, as main search document data, the feature part data and the problem data of each claim in the summary data stored in the summary data storage means; extracts, as similar documents, a plurality of documents in descending order of similarity, including the most similar document; and treats each similar document as a main cited document disclosing a main cited invention. The document classification unit is built, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, so as to classify an input summary movement vector into one of three classes, namely a class with an extremely high possibility of not satisfying the inventive step requirement, a class with a high possibility of not satisfying it, and a class satisfying it, and to output a requirement suitability document vector corresponding to the classification result. The summary movement vectors are a plurality of vectors corresponding to the differences between the summary vector of the prediction target invention for each claim and each of the citation candidate vectors corresponding to the similar documents. The patent requirement suitability prediction processing means further has a non-conformance rate calculation unit that calculates, from the plurality of requirement suitability document vectors output by the document classification unit, a non-conformance rate indicating the possibility that the prediction target invention does not satisfy the inventive step requirement, and a prediction processing control unit that, when designated prior art data is stored in the designated prior art data storage means, controls the inventive step prediction processing unit so that it generates, instead of the inventive step prediction data, invention-designated inventive step prediction data on whether the prediction target invention satisfies the inventive step requirement in relation to the designated prior art invention.
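One way to picture the non-conformance rate is as an aggregate over the per-document classification results. The Python sketch below assumes the rate is simply the fraction of results that fall into either of the two failing classes; this passage does not give the actual formula, so both the formula and the names `StepClass` and `non_conformance_rate` are assumptions made for illustration.

```python
from enum import Enum
from typing import Sequence


class StepClass(Enum):
    VERY_LIKELY_FAILS = 0  # extremely likely not to satisfy inventive step
    LIKELY_FAILS = 1       # likely not to satisfy inventive step
    SATISFIES = 2          # satisfies inventive step


def non_conformance_rate(results: Sequence[StepClass]) -> float:
    """Fraction of per-document results in a failing class (assumed formula)."""
    if not results:
        raise ValueError("at least one classification result is required")
    failing = sum(1 for r in results if r is not StepClass.SATISFIES)
    return failing / len(results)


# One result per similar document found for a claim (toy values).
results = [
    StepClass.VERY_LIKELY_FAILS,
    StepClass.SATISFIES,
    StepClass.LIKELY_FAILS,
    StepClass.SATISFIES,
]
rate = non_conformance_rate(results)
```

A weighted variant that counts the "extremely likely" class more heavily than the "likely" class would be an equally plausible reading; the unweighted fraction is used here only because it is the simplest.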

In the above patent requirement suitability prediction device, the document classification unit may use, as training data, a combination of a first learning document vector with a teacher vector indicating that reasons for refusal exist for both novelty and inventive step, a combination of a second learning document vector with a teacher vector indicating that a reason for refusal exists for inventive step but not for novelty, and a combination of a third learning document vector with a teacher vector indicating that no reason for refusal exists for inventive step. The first learning document vector is a first movement document vector corresponding to the difference between a document vector for the claim against which the reasons for refusal were raised in a novelty and inventive step rejected application, that is, a published application for which a notice of reasons for refusal was issued citing the same document for both lack of novelty and lack of inventive step, and a first cited document vector corresponding to a first main cited publication, which is the document cited in those reasons for refusal. The second learning document vector is a second movement document vector corresponding to the difference between a document vector for the claim against which the reason for refusal was raised in an inventive step rejected application, that is, a published application for which a notice of reasons for refusal was issued that raised a reason for refusal for inventive step but not for novelty, and a second cited document vector corresponding to a second main cited publication, which is the document cited as the main publication in that reason for refusal. The third learning document vector is a third movement document vector obtained from the difference between a document vector for claim 1 of either an application without rejection, that is, a published application for which a decision to grant a patent was issued without any notice of reasons for refusal, or an application without inventive step rejection, that is, an application for which a notice of reasons for refusal was issued but no reason for refusal for inventive step was raised, and a non-cited document vector corresponding to a learning most similar document, which is the document found to have the highest similarity in a concept search performed for that application.
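The three kinds of training pairs described above can be sketched as follows. The one-hot encoding of the teacher vectors, the two-dimensional toy vectors, and the names `TrainingExample` and `make_example` are assumptions made for illustration; this passage says only that each teacher vector indicates one of the three examination outcomes.

```python
from typing import List, NamedTuple

Vector = List[float]

# Assumed one-hot teacher vectors for the three outcomes named in the text.
TEACHER_NOVELTY_AND_STEP = [1.0, 0.0, 0.0]  # refused for novelty and inventive step
TEACHER_STEP_ONLY = [0.0, 1.0, 0.0]         # refused for inventive step only
TEACHER_NO_STEP_REFUSAL = [0.0, 0.0, 1.0]   # no refusal for inventive step


class TrainingExample(NamedTuple):
    learning_vec: Vector
    teacher_vec: Vector


def make_example(claim_vec: Vector, reference_vec: Vector,
                 teacher_vec: Vector) -> TrainingExample:
    """Pair (claim vector minus reference vector) with its teacher vector."""
    learning_vec = [c - r for c, r in zip(claim_vec, reference_vec)]
    return TrainingExample(learning_vec, list(teacher_vec))


examples = [
    # Claim vs. the publication cited for both novelty and inventive step.
    make_example([1.0, 0.5], [1.0, 0.5], TEACHER_NOVELTY_AND_STEP),
    # Claim vs. the main publication cited for inventive step only.
    make_example([1.0, 0.5], [0.5, 0.5], TEACHER_STEP_ONLY),
    # Claim 1 vs. the most similar (non-cited) document from a concept search.
    make_example([1.0, 0.5], [0.0, 0.0], TEACHER_NO_STEP_REFUSAL),
]
```

The toy values reflect the intuition behind the scheme: the closer the reference is to the claim, the smaller the difference vector and the more severe the associated examination outcome.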

The present invention further provides a patent requirement suitability prediction device comprising: summary data storage means for storing, as summary data, data extracted from draft document data constituting a draft document in which a prediction target invention whose patent requirement suitability is to be predicted is described in one or more claims and which includes a detailed description of the prediction target invention, the data being term data indicating terms that can specify the summary of the prediction target invention and including at least feature part data extracted from the characterizing portion of each claim and problem data extracted from the problem section of the detailed description; patent requirement suitability prediction processing means having a novelty prediction processing unit that searches publication data, which is electronic data of published gazettes, using the summary data stored in the summary data storage means and generates, according to the search result, novelty prediction data indicating whether the prediction target invention satisfies the novelty requirement, an inventive step prediction processing unit that generates inventive step prediction data indicating whether the prediction target invention satisfies the inventive step requirement, and a prediction result file generation unit that generates, using the novelty prediction data and the inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention; and prediction result storage means for storing the prediction result file generated by the prediction result file generation unit. The inventive step prediction processing unit has a main cited invention search unit that searches the prior art inventions specified by the publication data for the main cited invention closest to the prediction target invention, and a document classification unit that classifies document vectors. The main cited invention search unit performs a concept search over the publication data using, as main search document data, the feature part data and the problem data of each claim in the summary data stored in the summary data storage means; extracts, as similar documents, a plurality of documents in descending order of similarity, including the most similar document; and treats each similar document as a main cited document disclosing a main cited invention. The document classification unit is built, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, so as to classify an input summary movement vector into one of three classes, namely a class with an extremely high possibility of not satisfying the inventive step requirement, a class with a high possibility of not satisfying it, and a class satisfying it, and to output a requirement suitability document vector corresponding to the classification result. The summary movement vectors are a plurality of vectors corresponding to the differences between the summary vector of the prediction target invention for each claim and each of the citation candidate vectors corresponding to the similar documents.
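The extraction of similar documents in descending order of similarity can be illustrated with a small ranking routine. The sketch below uses cosine similarity over toy term vectors as a stand-in for the concept search; the source does not state which similarity measure the concept search uses, so the measure, the document identifiers, and the function names are all assumptions.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_similar_documents(query: Vector, corpus: Dict[str, Vector],
                          k: int) -> List[Tuple[str, float]]:
    """Return the k documents most similar to the query, best first."""
    scored = [(doc_id, cosine_similarity(query, vec))
              for doc_id, vec in corpus.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Hypothetical publication vectors keyed by made-up identifiers.
corpus = {"doc_a": [1.0, 0.0], "doc_b": [0.0, 1.0], "doc_c": [1.0, 1.0]}
similar = top_similar_documents([1.0, 0.0], corpus, k=2)
```

The first entry of the result plays the role of the most similar document, and every returned entry would be treated as a main cited document for the subsequent classification step.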

Furthermore, the present invention provides a patent requirement suitability prediction program for causing a computer to function as a patent requirement suitability prediction device. The program causes the computer to function as: summary data storage control means for storing, as summary data, data extracted from draft document data constituting a draft document in which a prediction target invention whose patent requirement suitability is to be predicted is described in one or more claims and which includes a detailed description of the prediction target invention, the data being term data indicating terms that can specify the summary of the prediction target invention and including at least feature part data extracted from the characterizing portion of each claim and problem data extracted from the problem section of the detailed description; designated prior art data storage control means for storing designated prior art data constituting a designated prior art document in which a designated prior art invention is described; and patent requirement suitability prediction processing means having an inventive step prediction processing unit that generates invention-designated inventive step prediction data on whether the prediction target invention satisfies the inventive step requirement in relation to the designated prior art invention, and a prediction result file generation unit that generates, using the invention-designated inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention. The inventive step prediction processing unit has a document classification unit that classifies document vectors. The document classification unit is built, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, so as to classify an input summary movement vector as either satisfying or not satisfying the inventive step requirement and to output a requirement suitability document vector corresponding to the classification result. The summary movement vector is a vector corresponding to the difference between the summary vector of the prediction target invention for each claim and a citation candidate vector corresponding to the designated prior art data.

As described in detail above, the present invention provides a patent requirement suitability prediction device and a patent requirement suitability prediction program that predict the suitability of patent requirements in a manner consistent with examination practice and can thereby effectively reduce the burden of preparing application documents.

FIG. 1 is a system configuration diagram of a patent requirement suitability prediction system including a patent requirement suitability prediction server according to a first embodiment of the present invention.
A block diagram mainly showing the internal configuration of the patent requirement suitability prediction server.
A block diagram mainly showing the internal configuration of a user terminal device.
A functional block diagram showing the main configuration of the patent requirement suitability prediction server according to the first embodiment of the present invention.
A functional block diagram showing the main configuration of the summary data extraction unit.
A functional block diagram showing the main configuration of the patent requirement suitability prediction processing unit according to the first embodiment of the present invention.
A functional block diagram showing the main configuration of the input vector generation unit.
A diagram showing an example of the record layout of the summary data storage unit.
A diagram showing an example of the record layout of the CT data storage unit.
A diagram showing an example of the record layout of the prediction result storage unit.
A diagram showing an example of the network structure of the machine learning unit.
A flowchart showing an example of the operation procedure of the patent requirement suitability prediction process.
A flowchart showing an example of the operation procedure of the patent requirement suitability prediction routine.
A flowchart showing an example of the operation procedure of the novelty and expanded prior application prediction routine.
A flowchart showing an example of the operation procedure of the expanded prior application prediction routine.
A flowchart showing an example of the operation procedure of the inventive step prediction routine.
A flowchart showing an example of the single independent claim routine.
A flowchart showing an example of the operation procedure of the multiple independent claim routine.
A flowchart showing an example of the independent claim search process.
A flowchart showing an example of the main cited invention search process.
A flowchart showing an example of the sub-cited invention search process.
A flowchart showing an example of the dependent claim search process.
A diagram schematically showing the relationship between a patent application under examination and a plurality of patent publications.
A diagram showing an example of the record layout of the independent claim table.
A diagram showing an example of the patent requirement suitability prediction list.
A functional block diagram showing the main configuration of a patent requirement suitability prediction server according to a second embodiment of the present invention.
A functional block diagram showing the main configuration of the patent requirement suitability prediction processing unit according to the second embodiment of the present invention.
A functional block diagram showing the main configuration of the input vector generation unit of the second embodiment.
A diagram showing an example of the record layout of the prediction result storage unit of the second embodiment.
A diagram showing an example of the patent requirement suitability prediction list of the second embodiment.
A functional block diagram showing the main configuration of a patent requirement suitability prediction processing unit according to a modification.
A flowchart showing an example of the operation procedure of the no-search patent requirement suitability prediction routine.
A flowchart showing an example of the operation procedure of the no-search inventive step prediction routine.
A flowchart showing an example of the no-search single independent claim routine.
A flowchart showing an example of the operation procedure of the no-search multiple independent claim routine.

Embodiments of the present invention will be described below with reference to the drawings. The same reference numerals are used for the same elements, and duplicate descriptions are omitted.

First embodiment
(Overall configuration of the patent requirement suitability prediction system)
First, the configuration of the patent requirement suitability prediction system 1, which includes the patent requirement suitability prediction server 10 according to the first embodiment of the present invention, will be described.

FIG. 1 is a system configuration diagram of the patent requirement suitability prediction system 1. As shown in FIG. 1, the patent requirement suitability prediction system 1 has the patent requirement suitability prediction server 10 and a plurality of user terminal devices 30 operated by users (fixed terminal devices 30A, 30B, and 30C in FIG. 1), which are connected to one another via the Internet N1.

The patent requirement suitability prediction server 10 performs data processing according to the patent requirement suitability prediction program. The server predicts whether the invention the user intends to file conforms to the patent requirements (in this embodiment, novelty (Patent Act Article 29, Paragraph 1, Item 3), expanded prior application (Patent Act Article 29-2), and inventive step (Patent Act Article 29, Paragraph 2)). The user terminal device 30 sends data to and receives data from the patent requirement suitability prediction server 10.

In the patent requirement suitability prediction system 1, the invention that the user intends to file is the subject of the prediction of whether it conforms to the patent requirements (patent requirement suitability), and therefore corresponds to the prediction target invention of this embodiment. The patent requirement suitability prediction server 10 predicts patent requirement suitability for the prediction target invention in one of two prediction modes (the search mode or the no-search mode, both described later). In either mode, an artificial intelligence program trained by machine learning on multiple training data derived from examination records predicts whether a reason for refusal for lack of inventive step is likely or unlikely to be found, and patent requirement suitability is predicted on that basis. Because the server makes the prediction in a manner consistent with examination practice, the burden of preparing application documents can be effectively reduced.

Here, the search mode is a mode in which publication data, that is, the electronic data of publications (mainly published patent applications), is searched with respect to the prediction target invention (the main cited invention search and the sub-cited invention search described later), and patent requirement suitability is predicted according to the search results. The no-search mode is a mode in which patent requirement suitability is predicted from the relationship with the designated prior art invention described later, without searching the publication data.

(Configuration of the patent requirement suitability prediction server 10)
Next, the configuration of the patent requirement suitability prediction server 10 will be described with reference to FIG. 2. FIG. 2 is a block diagram centered on the internal configuration of the patent requirement suitability prediction server 10. The server is operated by a specialist business operator that provides services for predicting the patent requirement suitability of prediction target inventions.

The patent requirement suitability prediction server 10 has a CPU (Central Processing Unit) 11, a ROM (Read Only Memory) 12, and a RAM (Random Access Memory) 13. The CPU 11 operates according to programs stored in the ROM 12. It receives, over the main bus 19A, input data obtained from operation of the keyboard 19 and the mouse 20 via the KBC (keyboard controller) 17, exchanges signals with the other components, and controls the operation of the server as a whole. In accordance with the patent requirement suitability prediction program described later, the CPU 11 operates as the draft data generation unit 101, the summary data extraction unit 102, the patent requirement suitability prediction processing unit 103, the target publication extraction unit 104, the prediction result edit processing unit 105, and the designated prior art data generation unit 106, each described later. The ROM 12 stores the control programs executed by the CPU 11, such as the patent requirement suitability prediction program, and permanent data. The RAM 13 stores data and programs used while the CPU 11 operates.

In addition, the patent requirement suitability prediction server 10 has a hard disk drive (HDD) 14, a communication control unit 15, a communication processing unit 16, and a video controller 18.

The hard disk drive 14 holds the storage units and databases (DBs) shown in FIG. 4 that are needed to run the patent requirement suitability prediction program, along with other storage units and DBs. Specifically, it holds a designated prior art data storage unit 151, a prediction target transaction storage unit (prediction target TR storage unit) 152, a summary data storage unit 153, a claim tree data storage unit 154, a target publication storage unit 155, and a prediction result file storage unit 156. Each storage unit and DB is described later.

The communication control unit 15 operates under instruction from the CPU 11 and controls the connection and disconnection of lines used to communicate with the user terminal devices 30 and with a patent office server (not shown). The communication processing unit 16 operates under instruction from the communication control unit 15 and carries out data transmission and reception over the Internet N1.

The video controller 18 controls image display on a display device (not shown), causing it to display screens used for various settings and the like.

The storage units and DBs of the hard disk drive 14 are as follows. The designated prior art data storage unit 151 stores the data needed to identify the user (for example, a member ID) sent from the user terminal device 30, together with the designated prior art data constituting the designated prior art document that describes the designated prior art invention specified by that user. A designated prior art invention is a prior art invention that the user designates for the prediction of patent requirement suitability by the patent requirement suitability prediction server 10; for example, a prior art invention described in a published patent application that the user found through the user's own search.

When designated prior art data is stored in the designated prior art data storage unit 151, the user has designated a prior art invention. In this case, in the no-search mode, the inventive step of the prediction target invention is predicted in relation to that designated prior art invention (that is, whether the invention involves an inventive step in view of the designated prior art invention).

The prediction target TR storage unit 152 stores the draft document data generated by the draft data generation unit 101 (the electronic data constituting the draft document described later). The publication DB 150 stores the electronic data of published patent applications as publication data. In FIG. 4, the publication DB 150 is assumed to be either the database of the patent information platform (J-PlatPat) operated by the National Center for Industrial Property Information and Training (INPIT), or a database storing electronic data downloaded from it. The latter database can be stored on a server (not shown) or on the HDD 14.

A draft document is a document describing the prediction target invention. It contains a part in which the prediction target invention is set out in one or more claims (this embodiment assumes claims written in independent form, though dependent form is also possible), corresponding to the claims attached to the request, and a part in which the prediction target invention is described in detail (referred to in this embodiment as the “detailed description”, corresponding to the “Detailed Description of the Invention” in the specification attached to the request). The detailed description includes at least a part stating the title of the prediction target invention (corresponding to “Title of the Invention”) and a problem part (stating the problem that the prediction target invention is to solve, corresponding to “Problem to be Solved by the Invention”). In this embodiment it also includes a part corresponding to “Mode for Carrying Out the Invention” and a part corresponding to the technical field. The draft document data is sent from the user terminal device 30 to the patent requirement suitability prediction server 10 by encrypted communication (for example, encrypted communication using SSL).

The summary data storage unit 153 stores the summary data extracted and generated by the summary data extraction unit 102. Summary data is term data indicating terms that can identify the gist of the prediction target invention, and includes at least the characterizing part data and the problem data described later.

As shown for example in FIG. 8, the summary data storage unit 153 stores records each having a data type area 153a, an item number area 153b, and a term storage unit 153c. The data type area 153a stores data (the “data type”) indicating which part of the draft document the data in each record comes from. In this embodiment, four data types are defined: “C”, “P”, “T”, and “D”. “C” denotes data from the claims, “P” data from the problem part, “T” data from the technical field part, and “D” data from the embodiment part. The item number area 153b stores the claim number. The term storage unit 153c has a term area 153c1, an expansion degree area 153c2, and an essential flag area 153c3. FIG. 8 provides 15 such combinations, but there may be more or fewer. The term area 153c1, the expansion degree area 153c2, and the essential flag area 153c3 store, respectively, a term used to identify the gist, the expansion degree (Ed) described later, and the essential flag (Ef).
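The record layout described above can be sketched as a pair of Python data classes. This is only an illustration under stated assumptions: the class and field names (`SummaryRecord`, `TermEntry`, and so on) are hypothetical, the essential flag is modeled as a boolean rather than the character “X”, and using `None` for the claim number of non-claim records is an assumption not stated in the text.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# The four data types named in the text:
# claims, problem part, technical field part, embodiment part.
DATA_TYPES = {"C", "P", "T", "D"}


@dataclass
class TermEntry:
    term: str                  # term area 153c1
    expansion_degree: int = 0  # expansion degree Ed, area 153c2
    essential: bool = False    # essential flag Ef, area 153c3 ("X" when True)


@dataclass
class SummaryRecord:
    data_type: str               # data type area 153a
    claim_number: Optional[int]  # item number area 153b (None is an assumption)
    terms: List[TermEntry] = field(default_factory=list)  # term storage unit 153c

    def __post_init__(self) -> None:
        if self.data_type not in DATA_TYPES:
            raise ValueError(f"unknown data type: {self.data_type!r}")


# Claim 1 record holding the term "パンチ" with Ed = 5, as in the FIG. 8 example.
record = SummaryRecord("C", 1, [TermEntry("パンチ", expansion_degree=5)])
```

A list of `TermEntry` objects is used here instead of a fixed set of 15 slots, since the text notes that the number of combinations may be larger or smaller.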

As an example, FIG. 8 shows the summary data for the case where the invention described in Japanese Patent Application Laid-Open No. 2008-62282 is the prediction target invention (upper half) and for the case where the invention described in Japanese Patent Application Laid-Open No. 2011-186735 is the prediction target invention (lower half). The former illustrates a case with a single claim written in independent form (independent claim); the latter illustrates a case with multiple independent claims.

The claim tree data storage unit (CT data storage unit) 154 stores the claim tree data (also called CT data) described later. As shown for example in FIG. 9(A), the CT data storage unit 154 stores records each including an independent division area 154a, a number area 154b, a MAX division area 154c, a dependent claim area 154d, and a search flag area 154e.

The independent division area 154a stores an independent division indicating whether each claim in the draft document is written in independent form (independent claim) or in dependent form (dependent claim): a space for an independent claim and “D” for a dependent claim. The number area 154b stores the number of each claim (claim number). The MAX division area 154c stores the MAX division. Where multiple dependent claims cite the same independent claim, the MAX division is set to “M” for the highest-numbered of them (the maximum dependent claim) and to a space otherwise.

FIG. 9(A) shows the CT data for the case where the invention described in Japanese Patent Application Laid-Open No. 2008-62282 is the prediction target invention. In that publication, claim 9 is the maximum dependent claim among the dependent claims, so “M” is set in the MAX division of the record with claim number “9” and a space is set in the MAX division of every other claim number. FIG. 9(B) shows the CT data for the case where the invention described in Japanese Patent Application Laid-Open No. 2011-186735 is the prediction target invention. In that publication, claims 1 and 6 are independent claims and claims 5 and 7 are the maximum dependent claims, so “M” is set in the MAX division of the records with claim numbers “5” and “7”, and a space is set in all other MAX divisions.

The dependent claim area 154d stores the number of the claim that a dependent claim cites. The search flag area 154e stores the search flag, that is, an indication of whether the main cited invention search described later has been executed. A space indicates that the search has not yet been executed, and “9” indicates that it has.
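As a rough illustration of how CT data records of this shape might be derived, the following Python sketch builds one record per claim from a mapping of claim number to cited claim number. The names `CTRecord`, `root_of`, and `build_ct_records` are hypothetical, and the dependency structure in the example is invented: it only reproduces the stated outcome of FIG. 9(B) (claims 1 and 6 independent, claims 5 and 7 maximum dependent claims). Resolving indirect dependencies back to their independent claim is also an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class CTRecord:
    independent_division: str   # area 154a: " " independent, "D" dependent
    number: int                 # area 154b: claim number
    max_division: str           # area 154c: "M" for a maximum dependent claim
    cited_claim: Optional[int]  # area 154d: claim cited by a dependent claim
    search_flag: str = " "      # area 154e: " " before the search, "9" after


def root_of(claim: int, parents: Dict[int, Optional[int]]) -> int:
    """Follow citations upward until an independent claim is reached."""
    while parents[claim] is not None:
        claim = parents[claim]
    return claim


def build_ct_records(parents: Dict[int, Optional[int]]) -> List[CTRecord]:
    # Highest-numbered dependent claim under each independent claim.
    largest: Dict[int, int] = {}
    for claim, parent in parents.items():
        if parent is not None:
            root = root_of(claim, parents)
            largest[root] = max(largest.get(root, 0), claim)
    max_claims = set(largest.values())
    return [
        CTRecord(
            " " if parents[c] is None else "D",
            c,
            "M" if c in max_claims else " ",
            parents[c],
        )
        for c in sorted(parents)
    ]


# Hypothetical dependency structure with independent claims 1 and 6.
parents = {1: None, 2: 1, 3: 2, 4: 1, 5: 4, 6: None, 7: 6}
records = build_ct_records(parents)
max_numbers = [r.number for r in records if r.max_division == "M"]
```

With this input, `max_numbers` holds the claim numbers 5 and 7, matching the MAX divisions described for FIG. 9(B).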

The target publication storage unit 155 stores the publication data that is subject to the main cited invention search and the sub-cited invention search (search target publication data). The prediction result storage unit 156 stores the prediction result file described later, such as that shown in FIG. 10.

Next, the summary data extraction unit 102 and the patent requirement suitability prediction processing unit 103 are described. As shown in FIG. 5, the summary data extraction unit 102 has a candidate extraction unit 111, a main part data extraction unit 112, a CT data generation unit 113, a text analysis / term extraction unit 114, an expansion degree / essential requirement analysis unit 115, a dependency analysis unit 116, a pattern data extraction unit 117, and a file generation unit 118. For convenience of illustration, FIG. 5 shows the detailed description data storage unit (detailed description TR) 160, the claims data storage unit (claims TR) 161, and the main part data storage unit (main part data TR) 162 inside the summary data extraction unit 102, but these are provided in the HDD 14, which is the data storage means.

The candidate extraction unit 111 reads the draft document data stored in the prediction target TR storage unit 152, skips unnecessary data, and extracts the data needed to create the summary data. It divides the extracted data into detailed description data (specification data) and claims data, and stores them in the detailed description TR 160 and the claims TR 161, respectively. Here, the words “前記” (“said”), “該” (“the”), and “当該” (“the relevant”), as well as paragraph numbers, are skipped.
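The skipping step can be sketched with two regular expressions. This is a minimal sketch, not the extraction logic of the candidate extraction unit 111 itself: the function name `strip_skipped` is hypothetical, and the paragraph-number pattern assumes the bracketed four-digit form (for example 【０００１】) commonly seen in Japanese specifications, which the text does not specify.

```python
import re

# Demonstratives skipped by the candidate extraction step.
SKIP_WORDS = re.compile(r"前記|当該|該")

# Assumed paragraph-number format, e.g. 【０００１】 or 【0001】.
# In Python 3, \d also matches full-width digits in str patterns.
PARAGRAPH_NUMBER = re.compile(r"【\d{4}】")


def strip_skipped(text: str) -> str:
    """Remove paragraph numbers and the skipped demonstratives from text."""
    text = PARAGRAPH_NUMBER.sub("", text)
    return SKIP_WORDS.sub("", text)


cleaned = strip_skipped("【０００１】前記パンチと当該ダイ")
```

Here `cleaned` is "パンチとダイ". A real implementation would probably work on tokens from a morphological analyzer so that “該” is not removed from inside an unrelated word.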

The main part data extraction unit 112 reads the draft document data stored in the prediction target TR storage unit 152, extracts the data for the portions corresponding to the main parts of the draft document (main part data), and stores the extracted main part data in the main part data storage unit 162. Here, the main part data consists of the data of the part stating the title of the prediction target invention (title of the invention), the data of the part corresponding to the technical field, and the data of each sentence in the problem part that contains the character string “本発明” (“the present invention”) or “この発明” (“this invention”).

The CT data generation unit 113 reads the draft document data stored in the prediction target TR storage unit 152, reads the data in the claims part, generates the claim tree data (CT data) described above, and stores it in the CT data storage unit 154.

The text analysis / term extraction unit 114 takes text data from the detailed description TR 160 and from the claims TR 161, extracts feature words from each (claim by claim for the claims), and outputs the feature words in order of importance. To do so it runs, for example, an index string extraction process such as morphological analysis or N-gram extraction, examines the frequency of occurrence of each word and the co-occurrence frequency of words, and outputs the feature words according to the result.
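As a simplified stand-in for the index string extraction mentioned above, the following Python sketch ranks character N-grams by frequency of occurrence. It is an assumption-laden illustration: the function name `top_ngrams` is hypothetical, co-occurrence frequency is not modeled, and ties are broken by string order only to make the result deterministic.

```python
from collections import Counter
from typing import List, Tuple


def top_ngrams(text: str, n: int = 2, k: int = 5) -> List[Tuple[str, int]]:
    """Return the k most frequent character n-grams with their counts."""
    grams = Counter(text[i:i + n] for i in range(len(text) - n + 1))
    # Sort by descending count, then by the n-gram itself for stable output.
    return sorted(grams.items(), key=lambda item: (-item[1], item[0]))[:k]


ranked = top_ngrams("パンチパンチ", n=2, k=2)
```

For this input the two highest-ranked bigrams are "パン" and "ンチ", each occurring twice.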

The expansion degree / essential requirement analysis unit 115 reads the claims data from the claims TR 161, determines, for each feature word extracted by the text analysis / term extraction unit 114, its expansion degree and whether it qualifies as an essential requirement, and outputs the result. The expansion degree (Ed) is data indicating the number of claims in which a feature word is used. In patent application documents, to cover as broad a scope of the inventive idea as possible, more important matters are generally stated broadly in claim 1 (or another independent claim) and then narrowed (made more specific) step by step in the lower claims. A larger expansion degree (Ed) is therefore taken to indicate higher importance, which makes the expansion degree useful for grasping the gist of the invention. For example, in FIG. 8 the term “パンチ” (“punch”) is set in the “term 1” slot of the term area 153c1 for data type “C”, item number “1”. Because this term appears in claims 1, 2, 3, 4, and 5 in the claims of Japanese Patent Application Laid-Open No. 2008-62282, “5” is set in the expansion degree area 153c2.
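The expansion degree can be illustrated by counting the claims whose text contains a given term. The sketch below assumes simple substring matching, and the claim texts are invented placeholders rather than the wording of the cited publication; the function name `expansion_degree` is hypothetical.

```python
from typing import Dict


def expansion_degree(term: str, claims: Dict[int, str]) -> int:
    """Count the claims whose text contains the term (Ed)."""
    return sum(1 for text in claims.values() if term in text)


# Hypothetical claim texts: "パンチ" appears in claims 1 to 5 but not in claim 6.
claims = {n: f"パンチを備える請求項{n}の打ち抜き型" for n in range(1, 6)}
claims[6] = "ダイを備える打ち抜き型"

ed = expansion_degree("パンチ", claims)
```

Here `ed` is 5, the same value that the FIG. 8 example records for the term “パンチ”.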

Whether a term qualifies as an essential requirement depends on whether it appears in the characterizing part of a claim, and is indicated by the essential flag (Ef). In this embodiment, the characterizing part of each claim is its final paragraph or the sentence containing the character string “ことを特徴とする” (“characterized in that”). The data of that part is the characterizing part data, and the terms it contains are treated as satisfying the essential requirement. For a term extracted from the characterizing part, “X” is set in the essential flag (Ef) to indicate that the essential requirement is satisfied.
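The essential flag rule can be sketched as follows. Several points are assumptions made for illustration: paragraphs are taken to be newline-separated, a paragraph containing the marker string takes precedence over the final paragraph, an unset flag is represented by a space, and the claim text is invented. The function names are hypothetical.

```python
MARKER = "ことを特徴とする"


def characterizing_part(claim_text: str) -> str:
    """Return the paragraph with the marker, or else the final paragraph."""
    paragraphs = [p for p in claim_text.splitlines() if p.strip()]
    if not paragraphs:
        return ""
    marked = [p for p in paragraphs if MARKER in p]
    return marked[0] if marked else paragraphs[-1]


def essential_flag(term: str, claim_text: str) -> str:
    """Return "X" when the term occurs in the characterizing part."""
    return "X" if term in characterizing_part(claim_text) else " "


# Hypothetical claim text written as three paragraphs.
claim = "パンチと、\nダイとを備え、\n前記パンチがガイドされることを特徴とする打ち抜き型。"
flag_punch = essential_flag("パンチ", claim)
flag_die = essential_flag("ダイ", claim)
```

In this example “パンチ” occurs in the characterizing part and receives “X”, while “ダイ” occurs only in the preamble and does not.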

The dependency analysis unit 116 performs dependency parsing on the main part data stored in the main part data storage unit 162 and outputs the result to the pattern data extraction unit 117.

The pattern data extraction unit 117 takes the analysis result of the dependency analysis unit 116 and outputs character strings each formed from the noun immediately preceding the hiragana particle “を” and its corresponding verb, together with the first-listed title of the invention (lead title). For example, in the sentence containing the character string “本発明” in the “Problem to be Solved by the Invention” section of Japanese Patent Application Laid-Open No. 2008-62282, the noun and verb pairs around “を” are “調整” (adjustment) and “行わず” (without performing), “同心性” (concentricity) and “得る” (obtain), and “精密打ち抜き型” (precision punching die) and “提供” (provide). These are output from the pattern data extraction unit 117. In this embodiment, among the data output from the pattern data extraction unit 117, the data extracted from the problem part corresponds to the problem data, and can take the form of, for example, the records of data type “P” in FIG. 8.

The file generation unit 118 generates summary data from the data output by the text analysis / term extraction unit 114, the expansion degree / essential requirement analysis unit 115, and the pattern data extraction unit 117, and stores it in the summary data storage unit 153. Data of types “C” and “D” is generated from the outputs of the text analysis / term extraction unit 114 and the expansion degree / essential requirement analysis unit 115, and data of types “P” and “T” is generated from the output of the pattern data extraction unit 117.

As shown in FIG. 6, the patent requirement suitability prediction processing unit 103 has a novelty / expanded prior application prediction processing unit 125 and an inventive step prediction processing unit 126. The novelty / expanded prior application prediction processing unit 125 uses the summary data stored in the summary data storage unit 153 as search terms to run a full-text search of the search target publication data in the target publication storage unit 155 and, according to the result, outputs novelty / expanded prior application prediction data Nd to the prediction result file generation unit 127. Its functions and operation procedure are described in detail later.

The inventive step prediction processing unit 126 has a cited invention search unit 131, an input vector generation unit 132, a machine learning unit 133, and a designated prediction data generation unit 134. The cited invention search unit 131 has a main cited invention search unit that performs the main cited invention search and a sub-cited invention search unit that performs the sub-cited invention search, both described later. According to the results of the main and sub-cited invention searches, the cited invention search unit 131 outputs inventive step prediction data Vd1 to the prediction result file generation unit 127, and outputs claim summary data ied for the searched claim, together with concept search data Vd2, to the input vector generation unit 132. The concept search data Vd2 includes the publication data of the document found by the concept search to have the highest similarity (the most similar document). The functions and operation procedure of the cited invention search unit 131 are described in detail later.

As shown in FIG. 7, the input vector generation unit 132 has a summary vector generation unit 132a, a citation candidate vector generation unit 132b, and a movement vector generation unit 132c.

The summary vector generation unit 132a takes the claim summary data ied, extracts its feature words, weights each word, and generates a summary vector EV reflecting the wording of each claim. In the search mode described later, the citation candidate vector generation unit 132b takes the publication data of the most similar document contained in the concept search data Vd2, extracts its feature words, weights each word, and generates a document vector (citation candidate vector) RfV for the most similar document. In the no-search mode, it takes the designated prior art data Vs2, extracts its feature words, weights each word, and generates a document vector (citation candidate vector) RfV for the designated prior art data. The movement vector generation unit 132c computes the difference between the summary vector EV and the citation candidate vector RfV and generates a summary movement vector V3 corresponding to the difference between the two document vectors.
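The relationship between EV, RfV, and V3 can be sketched in a few lines of Python. The weighting scheme is not specified in the text, so this sketch assumes raw term frequency over a shared vocabulary and takes V3 as EV minus RfV. The function names, the vocabulary, and the term lists are hypothetical.

```python
from collections import Counter
from typing import List, Sequence


def document_vector(terms: Sequence[str], vocabulary: Sequence[str]) -> List[float]:
    """Build a term-frequency vector over a fixed vocabulary (assumed weighting)."""
    counts = Counter(terms)
    return [float(counts[t]) for t in vocabulary]


def movement_vector(ev: Sequence[float], rfv: Sequence[float]) -> List[float]:
    """Element-wise difference EV - RfV, standing in for V3."""
    return [a - b for a, b in zip(ev, rfv)]


# Hypothetical feature words for a claim and for a citation candidate.
vocabulary = ["punch", "die", "guide"]
ev = document_vector(["punch", "die", "guide", "punch"], vocabulary)
rfv = document_vector(["punch", "die"], vocabulary)
v3 = movement_vector(ev, rfv)
```

In this toy example the nonzero components of `v3` fall on "punch" and "guide", the terms whose weights differ between the claim and the citation candidate.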

The most similar document is the one judged most similar by the concept search of the main cited invention search unit, so it is presumed to be the document most likely to be cited as disclosing the main cited invention in the examination of the inventive step of the prediction target invention. Accordingly, by obtaining the citation candidate vector RfV with the most similar document as the citation candidate and computing its difference from the summary vector EV to obtain the summary movement vector V3, a document vector is generated that reflects the differences between the prediction target invention and the invention disclosed in the most similar document.

The machine learning unit 133 is the document classification unit according to the embodiment of the present invention. Through machine learning (supervised learning) using the training data described below (also called learning patterns), it is built to classify the summary movement vector V3 into a class that satisfies the inventive step requirement or a class that does not (a class with no reason for refusal or a class with one), and to output an output signal (requirement suitability document vector) V4 corresponding to the classification result. In this embodiment, the learning pattern can be the HL pattern described next.

The HL pattern consists of two combinations: a first learning document vector paired with a teacher vector indicating that there is a reason for refusal for lack of inventive step (for example, a vector in which only the dimension corresponding to the correct class is "1" and all others are "0"), and a second learning document vector paired with a teacher vector indicating that there is no such reason for refusal (for example, a vector in which only a different dimension is "1" and all others are "0").

The first learning document vector is a first movement document vector. It is taken from published applications for which the Patent Office issued a first notice of reasons for refusal (1st action) that raised a reason for refusal for lack of inventive step, that is, a finding that the requirement of Article 29(2) of the Patent Act was not satisfied (inventive-step-rejected applications). The vector corresponds to the difference between the document vector of the claim against which that reason was raised (as the claim stood when the notice was issued) and the document vector (cited document vector) of cited document 1 in that notice (the main cited document, cited as the primary publication).

The second learning document vector is a second movement document vector. It is taken from published applications for which a decision to grant was issued without any 1st action (applications without refusal), or for which a 1st action was issued but raised no reason for refusal for lack of inventive step (applications without inventive step refusal). The vector corresponds to the difference between the document vector of claim 1 (as it stood when the notice of reasons for refusal was issued) and the document vector (non-cited document vector) of the document found to have the highest similarity in a concept search run on that application (the most similar document for learning).
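The way the HL pattern pairs movement vectors with teacher vectors can be sketched as follows. This is an illustration under assumptions: the document vectors are toy values, the order of the two teacher dimensions is arbitrary, and `build_hl_patterns` is a hypothetical name.

```python
REFUSAL = [1, 0]      # teacher vector: reason for refusal for lack of inventive step
NO_REFUSAL = [0, 1]   # teacher vector: no such reason for refusal

def difference(claim_vector, reference_vector):
    """Movement document vector: claim vector minus reference document vector."""
    return [c - r for c, r in zip(claim_vector, reference_vector)]

def build_hl_patterns(rejected, not_rejected):
    """rejected: (claim vector, main cited document vector) pairs;
    not_rejected: (claim 1 vector, most similar document vector) pairs."""
    patterns = []
    for claim_vec, cited_vec in rejected:
        patterns.append((difference(claim_vec, cited_vec), REFUSAL))        # first learning vector
    for claim_vec, similar_vec in not_rejected:
        patterns.append((difference(claim_vec, similar_vec), NO_REFUSAL))   # second learning vector
    return patterns

patterns = build_hl_patterns(
    rejected=[([0.5, 0.5, 0.0], [0.5, 0.25, 0.0])],
    not_rejected=[([0.25, 0.0, 0.75], [0.0, 0.0, 0.25])],
)
```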

By learning repeatedly with the HL learning patterns described above, the machine learning unit 133 classifies the summary movement vector V3 into either the class with or the class without a reason for refusal for lack of inventive step, and outputs the requirement suitability document vector V4 corresponding to that class. The former corresponds to a high likelihood, and the latter to a low likelihood, that a reason for refusal for lack of inventive step would be issued if the prediction target invention were filed.

Because the machine learning unit 133 only needs to classify the input summary movement vector V3 into the class with or the class without a reason for refusal for lack of inventive step and output the corresponding requirement suitability document vector V4, a learning algorithm called a support vector machine (SVM) can be applied to it. An SVM finds the decision boundary that maximizes the distance (margin) between the boundary and the training samples closest to it.
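To make the margin concrete, the sketch below computes the geometric margin of two candidate linear boundaries over the same labeled samples. It does not train an SVM; it only shows the quantity an SVM maximizes. The samples, the labels (+1 and -1), and the name `geometric_margin` are assumptions made for illustration.

```python
import math

def geometric_margin(w, b, samples):
    """Smallest signed distance from any labeled sample to the boundary w.x + b = 0.
    Labels are +1 or -1; a positive result means every sample lies on its correct side."""
    norm = math.sqrt(sum(wi * wi for wi in w))
    return min(y * (sum(wi * xi for wi, xi in zip(w, x)) + b) / norm for x, y in samples)

samples = [([2.0, 0.0], +1), ([3.0, 1.0], +1), ([-2.0, 0.0], -1), ([-4.0, 1.0], -1)]
margin_a = geometric_margin([1.0, 0.0], 0.0, samples)    # boundary at x1 = 0
margin_b = geometric_margin([1.0, 0.0], -1.0, samples)   # boundary shifted to x1 = 1
```

Both boundaries separate the samples, but the first leaves the larger margin, so a maximum-margin learner would prefer it.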

A neural network modeled on the neural circuitry of the brain can also be applied to the information processing of the machine learning unit 133. Neural networks include hierarchical (layered) neural networks and interconnected neural networks. For example, a perceptron, which is a hierarchical neural network, can be applied as the learning algorithm of the machine learning unit 133.

The perceptron is a hierarchical network of three layers called the S layer, the A layer, and the R layer (not shown), with only one-way connections from the S layer to the A layer and from the A layer to the R layer. When the HL learning patterns described above are given and the output vector produced for a first or second learning document vector differs from the corresponding teacher vector, the connection weights are corrected according to the error. Learning ends when the error between the output vector and the teacher vector falls to or below a fixed value.

However, the perceptron's learning algorithm may never terminate when the learning patterns are not linearly separable. To let the machine learning unit 133 acquire a nonlinear decision boundary through learning, it is therefore preferable to apply a BP (backpropagation) network, which is widely used among hierarchical neural networks and can learn a nonlinear decision surface with few misclassifications.
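The termination problem can be seen in a small sketch. It assumes a single threshold unit trained with the classic error-correction rule and an epoch cap added so that the loop always stops; the S, A, and R layer structure is not modeled, and `train_perceptron` is a hypothetical name.

```python
def train_perceptron(patterns, learning_rate=1.0, max_epochs=100):
    """Train one threshold unit; returns (weights, bias, converged)."""
    n = len(patterns[0][0])
    weights, bias = [0.0] * n, 0.0
    for _ in range(max_epochs):
        errors = 0
        for x, target in patterns:
            output = 1 if sum(w * xi for w, xi in zip(weights, x)) + bias > 0 else 0
            delta = target - output
            if delta != 0:
                # Correct the weights in proportion to the error on this pattern.
                errors += 1
                weights = [w + learning_rate * delta * xi for w, xi in zip(weights, x)]
                bias += learning_rate * delta
        if errors == 0:            # every pattern classified correctly: learning ends
            return weights, bias, True
    return weights, bias, False    # epoch cap reached without an error-free pass

and_patterns = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]   # linearly separable
xor_patterns = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]   # not linearly separable

_, _, and_converged = train_perceptron(and_patterns)
_, _, xor_converged = train_perceptron(xor_patterns)
```

With the AND patterns the loop reaches an error-free pass and stops. With the XOR patterns no weight setting classifies all four patterns, so only the epoch cap ends the loop.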

As shown in FIG. 11, the BP network has an input layer, an output layer, and an intermediate layer between them, and the weights of all connections between units can be learned by a learning algorithm called the error backpropagation algorithm. In this algorithm, the input signal propagates through the input layer, the intermediate layer, and the output layer, while the error signal propagates in the opposite direction, and the weights are adjusted accordingly.
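A three-layer network trained by error backpropagation can be sketched as follows. This is a minimal illustration, not the embodiment's implementation: the layer sizes, the bias handling, the sigmoid units, the squared-error measure, the learning coefficient `rho`, the random initialization, and the toy patterns are all assumptions chosen for the example.

```python
import math
import random

def sigmoid(u):
    return 1.0 / (1.0 + math.exp(-u))

def forward(x, w_hidden, w_output):
    """Propagate x forward; the last input of each layer is a constant bias of 1."""
    hidden = [sigmoid(sum(w * v for w, v in zip(row, x + [1.0]))) for row in w_hidden]
    output = [sigmoid(sum(w * v for w, v in zip(row, hidden + [1.0]))) for row in w_output]
    return hidden, output

def pattern_error(x, teacher, w_hidden, w_output):
    """Squared error between the output vector and the teacher vector for one pattern."""
    _, output = forward(x, w_hidden, w_output)
    return sum((o - b) ** 2 for o, b in zip(output, teacher))

def train_step(x, teacher, w_hidden, w_output, rho):
    """One per-pattern update: the error signal flows from the output layer back to the hidden layer."""
    hidden, output = forward(x, w_hidden, w_output)
    delta_out = [2.0 * (o - b) * o * (1.0 - o) for o, b in zip(output, teacher)]
    delta_hid = [
        h * (1.0 - h) * sum(d * row[j] for d, row in zip(delta_out, w_output))
        for j, h in enumerate(hidden)
    ]
    for k, row in enumerate(w_output):
        for j, v in enumerate(hidden + [1.0]):
            row[j] -= rho * delta_out[k] * v
    for j, row in enumerate(w_hidden):
        for i, v in enumerate(x + [1.0]):
            row[i] -= rho * delta_hid[j] * v

random.seed(0)
n_in, n_hidden, n_out = 3, 4, 2
w_hidden = [[random.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(n_hidden)]
w_output = [[random.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)] for _ in range(n_out)]

# Toy movement vectors with one-hot teacher vectors ([1, 0] = refusal, [0, 1] = no refusal).
patterns = [([0.0, 0.25, 0.0], [1, 0]), ([0.25, 0.0, 0.5], [0, 1])]
before = sum(pattern_error(x, b, w_hidden, w_output) for x, b in patterns)
for _ in range(2000):
    for x, b in patterns:
        train_step(x, b, w_hidden, w_output, rho=0.5)
after = sum(pattern_error(x, b, w_hidden, w_output) for x, b in patterns)
```

Comparing `before` and `after` shows how far the total error over the toy patterns has fallen through repeated per-pattern updates.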

When a learning pattern x_p (x_0, x_1, ..., x_n) is input to the BP network shown in FIG. 11, the j-th unit (0 ≤ j ≤ n) in a given layer receives weighted signals from the units in the preceding layer that are connected to unit j. Letting t_ip be the signal from the i-th unit (0 ≤ i ≤ n) in the preceding layer and w_ij the weight, the input to unit j is given by Formula 1, and the output of unit j, with f as the threshold function, is given by Formula 2.

Formula 1

Formula 2
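The formula images are not reproduced in this text. Based only on the definitions in the preceding paragraph (signal t_ip, weight w_ij, threshold function f), Formulas 1 and 2 plausibly take the following form. This is a reconstruction, not the published notation, and the symbol u_jp for the input to unit j is introduced here for convenience.

```latex
\begin{align}
  u_{jp} &= \sum_{i} w_{ij}\, t_{ip} \tag{1} \\
  t_{jp} &= f\!\left(u_{jp}\right) = f\!\left(\sum_{i} w_{ij}\, t_{ip}\right) \tag{2}
\end{align}
```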

The error D_p for the learning pattern x_p is defined as the sum of squares of the differences between the output of each unit k in the output layer and the teacher signal b_kp, as in Formula 3. These errors D_p are summed over all learning patterns to obtain D in Formula 4, and learning in the machine learning unit 133 proceeds by adjusting the connection weights between units so that D is minimized. In this case, the weights are adjusted by Formula 5 each time an individual learning pattern is input, where w_ij is the weight before the update, w'_ij is the weight after the update, and ρ is the learning coefficient. This is called stochastic steepest descent (stochastic gradient descent). The sigmoid function shown in Formula 6 is used as the input/output function of each unit.

Formula 3

Formula 4

Formula 5

Formula 6
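The formula images are not reproduced in this text. From the verbal definitions above (sum of squared differences between the output of unit k and the teacher signal b_kp, summation over all patterns, a per-pattern update with learning coefficient ρ, and a sigmoid unit function), Formulas 3 to 6 plausibly read as follows. This is a reconstruction under those definitions. The published formulas may differ in constant factors, such as a factor of 1/2 in the error or a gain parameter in the sigmoid, and writing the output of unit k as t_kp is an assumption.

```latex
\begin{align}
  D_p &= \sum_{k} \left(t_{kp} - b_{kp}\right)^2 \tag{3} \\
  D &= \sum_{p} D_p \tag{4} \\
  w'_{ij} &= w_{ij} - \rho\, \frac{\partial D_p}{\partial w_{ij}} \tag{5} \\
  f(u) &= \frac{1}{1 + e^{-u}} \tag{6}
\end{align}
```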

In the no-search mode, the designated prediction data generation unit 134 receives the summary data stored in the summary data storage unit 153 and the designated prior art data stored in the designated prior art data storage unit 151, outputs the claim summary data ied of the prediction target invention for each claim and the designated prior art data Vs2 to the input vector generation unit 132, and outputs the inventive step prediction data Vs1 to the prediction result file generation unit 127. The inventive step prediction data Vs1 contains, for each claim, the designated prior art data Vs2 together with a search flag "VY" indicating that a main cited invention has been found. This data and the requirement suitability document vector V4 described later correspond to the invention-designated inventive step prediction data according to this embodiment.

(Configuration of the user terminal device 30)
As shown in FIG. 1, the user terminal device 30 has a connection to the Internet N1 and can communicate with the patent requirement suitability prediction server 10. The user terminal device 30 is assumed to be a desktop (or portable notebook) personal computer, but may also be a tablet terminal.

As shown in FIG. 3, the user terminal device 30 has a CPU 31, a ROM 32, a RAM 33, a data storage unit 34, and a liquid crystal display unit 35. It also has a voice conversion processing unit 36, a communication control unit 37, a communication processing unit 38a, a wireless communication unit 38b, a speaker 39, and a microphone 40.

The CPU 31 operates according to the programs stored in the ROM 32 and controls the overall operation of the user terminal device 30. The ROM 32 stores programs executed by the CPU 31, such as a communication control program for data communication. The RAM 33 stores data and the like needed for the CPU 31 to execute programs.

The data storage unit 34 stores various data. The liquid crystal display unit 35 is an image display means that has an LCD (Liquid Crystal Display) and its drive unit and displays images such as characters, figures, and symbols. The voice conversion processing unit 36 decompresses voice data and outputs it to the speaker 39, and it converts the analog voice signal input from the microphone 40 into digital voice data, compresses it, and passes it to the communication processing unit 38a. The communication control unit 37 operates on instructions from the CPU 31 and controls the connection and disconnection of the line used for data communication. The communication processing unit 38a operates on instructions from the communication control unit 37 and sends and receives data over the Internet N1. The wireless communication unit 38b is a wireless communication means that sends and receives data wirelessly under the control of the communication control unit 37. The speaker 39 is an audio output means that outputs sound, and the microphone 40 takes in sound such as the user's speech and converts it into an electrical signal.

(Operation of the patent requirement suitability prediction system)
Next, the patent requirement suitability prediction process performed by the patent requirement suitability prediction server 10 is described with reference to FIG. 4, FIGS. 12 to 22, and FIGS. 32 to 35.

FIG. 4 is a functional block diagram showing the main configuration of the patent requirement suitability prediction server 10 that implements the patent requirement suitability prediction process. In the server 10, the CPU 11 follows the patent requirement suitability prediction program and, while accessing the various files and databases stored in the publication DB 150, the designated prior art data storage unit 151, the summary data storage unit 153, and so on, operates as the draft data generation unit 101, the summary data extraction unit 102, the patent requirement suitability prediction processing unit 103, the prediction result editing processing unit 105, and the designated prior art data generation unit 106. The patent requirement suitability prediction process is executed in this way. The patent requirement suitability prediction program is a program that causes the patent requirement suitability prediction server 10 to function as the draft data generation unit 101, the summary data extraction unit 102, the patent requirement suitability prediction processing unit 103, the prediction result editing processing unit 105, the designated prior art data generation unit 106, and the like.

When the patent requirement suitability prediction server 10 performs the patent requirement suitability prediction process, the CPU 11 follows the patent requirement suitability prediction program and operates according to the flowchart shown in FIG. 12. FIG. 12 is a flowchart showing an example of the operating procedure of the CPU 11 for the patent requirement suitability prediction process under that program. In FIG. 12, FIG. 13, and other figures, "S" is an abbreviation for step.

When the CPU 11 starts operating under the patent requirement suitability prediction program, it proceeds to step 1 and performs user authentication, for example by checking the user ID and password that the user entered on the user terminal device 30. The CPU 11 then proceeds to step 2 and performs the point balance check, in which it confirms whether the user's point balance is at or above a fixed value. If the balance is insufficient, it either ends the patent requirement suitability prediction process or, for example, sends a message reporting the insufficient balance.

When the CPU 11 advances to step 3, invention data reception and decryption are executed. In step 3, the CPU 11 operates the communication processing unit 16 and so on to receive invention data from the user terminal device 30 over encrypted communication (for example, encrypted communication using SSL). It also operates as the draft data generation unit 101, decrypts the draft document data received, sets the machine date as the filing date, and stores the result in the prediction target TR storage unit 152. If the received data includes designated prior art data, the CPU 11 decrypts that data and, operating as the designated prior art data storage control means, activates the designated prior art data generation unit 106 to store the decrypted designated prior art data in the designated prior art data storage unit 151. When processing then advances to step 4, the number of draft document data items received is set in the draft counter MAX, and the draft counter is set to "0".

Next, the CPU 11 advances to step 5 and determines whether the prediction mode is the search mode or the no-search mode according to whether designated prior art data is stored in the designated prior art data storage unit 151. If designated prior art data is stored, the CPU 11 operates as the prediction processing control unit and advances to step 6A to predict requirement suitability in the no-search mode. If no designated prior art data is stored, the CPU 11 advances to step 6 to predict requirement suitability in the search mode.

On advancing to step 6, the CPU 11 determines whether the prediction end condition is satisfied. If it is, the CPU 11 proceeds to step 8; otherwise it proceeds to step 7. In step 7, the CPU 11 executes the patent requirement suitability prediction routine described later. In step 8, it executes end processing and finishes the patent requirement suitability prediction process for that user. In the search mode, the patent requirement suitability prediction process is thus executed automatically and continuously until the prediction end condition is satisfied.

When processing advances to step 6A, whether the prediction end condition is satisfied is determined in the same way as in step 6. If it is satisfied, processing proceeds to step 8; otherwise it proceeds to step 9, where the CPU 11 executes the no-search patent requirement suitability prediction routine described later and then returns to step 6A. In the no-search mode, the no-search patent requirement suitability prediction process is thus executed automatically and continuously until the prediction end condition is satisfied.
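The mode selection and the two loops in steps 5 to 9 can be traced with a short sketch. Two points are assumptions: the description does not state what the prediction end condition is, so a countdown of pending batches stands in for it, and the return from step 7 to step 6 is inferred from the statement that processing continues until the end condition holds. The name `run_prediction` is hypothetical.

```python
def run_prediction(has_designated_prior_art, pending_batches):
    """Trace steps 5 to 9: choose the mode, then loop until the end condition holds."""
    trace = []
    # Step 5: designated prior art data stored -> no-search mode, otherwise search mode.
    mode = "no-search" if has_designated_prior_art else "search"
    check_step, routine_step = ("6A", "9") if mode == "no-search" else ("6", "7")
    remaining = pending_batches
    while True:
        trace.append(check_step)       # step 6 or 6A: test the prediction end condition
        if remaining == 0:             # assumed end condition: nothing left to predict
            trace.append("8")          # step 8: end processing
            return mode, trace
        trace.append(routine_step)     # step 7 or 9: run the corresponding prediction routine
        remaining -= 1

mode, trace = run_prediction(has_designated_prior_art=True, pending_batches=1)
```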

When the CPU 11 proceeds to step 7, it executes the patent requirement suitability prediction routine according to the flowchart shown in FIG. 13.

(Patent requirement suitability prediction routine)
When the CPU 11 starts the patent requirement suitability prediction routine, it proceeds to step 11 and adds "1" to the draft counter. In step 12, it determines whether the draft counter is greater than the draft counter MAX. If it is not, processing advances to step 13; if it is, processing advances to step 16.

On advancing to step 13, the CPU 11 operates as the target publication extraction unit 104, extracts publication data using the filing date of the draft document data (the machine date set in step 3) as the reference, and stores the extracted data in the target publication storage unit 155 as search target publication data. The target publication extraction unit 104 extracts the publication data whose filing date is earlier than the filing date of the draft document data. The CPU 11 also operates as the summary data extraction unit 102 to generate the summary data and CT data described above and stores them in the summary data storage unit 153 and the CT data storage unit 154, respectively.

The CPU 11 then advances to step 14 and executes the novelty and expanded prior application prediction routine described later, and then proceeds to step 15 and executes the inventive step prediction routine. After that, the CPU 11 returns to step 11 and repeats the same processing. In step 16, the CPU 11 operates as the prediction result editing processing unit 105 and edits and outputs the prediction result list L1 described later. It then executes the point consumption process of step 17, reducing the point balance according to the number of draft document data items for which patent requirement suitability was predicted. The patent requirement suitability prediction routine then ends.
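The counter-driven loop of steps 11 to 17 can be sketched as follows. This is an illustration under assumptions: the record layouts, the one-point-per-draft charge, and the function names are hypothetical, and the prediction steps 14 and 15 are left out.

```python
from datetime import date

def extract_target_publications(publications, draft_filing_date):
    """Step 13: keep publications filed earlier than the draft's filing date."""
    return [p for p in publications if p["filing_date"] < draft_filing_date]

def patent_requirement_routine(drafts, publications, point_balance):
    """Steps 11 to 17; returns (prediction result list, remaining point balance)."""
    draft_counter, draft_counter_max = 0, len(drafts)      # values set in step 4
    results = []
    while True:
        draft_counter += 1                                  # step 11
        if draft_counter > draft_counter_max:               # step 12
            break
        draft = drafts[draft_counter - 1]
        targets = extract_target_publications(publications, draft["filing_date"])  # step 13
        # Steps 14 and 15 (novelty and inventive step prediction) are omitted in this sketch.
        results.append({"draft": draft["name"], "search_targets": len(targets)})
    prediction_list = list(results)                         # step 16: edit and output the list
    point_balance -= len(results)                           # step 17: assumed one point per draft
    return prediction_list, point_balance

publications = [{"filing_date": date(2015, 5, 1)}, {"filing_date": date(2017, 6, 1)}]
drafts = [{"name": "draft-1", "filing_date": date(2017, 3, 30)}]
prediction_list, balance = patent_requirement_routine(drafts, publications, point_balance=10)
```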

(No-search patent requirement suitability prediction routine)
As shown in FIG. 32, when the CPU 11 starts the no-search patent requirement suitability prediction routine, it executes steps 11 and 12 in the same way as in the patent requirement suitability prediction routine and then executes step 13A. In step 13A, the CPU 11 operates as the summary data extraction unit 102 to generate the summary data and CT data described above and stores them in the summary data storage unit 153 and the CT data storage unit 154, respectively. The CPU 11 then advances to step 125, executes the no-search inventive step prediction routine described later, and returns to step 11. When processing advances from step 12 to steps 16 and 17, those steps are likewise executed in the same way as in the patent requirement suitability prediction routine.

(Novelty and expanded prior application prediction routine)
Returning to FIG. 13, when the CPU 11 advances to step 14, it operates as the novelty and expanded prior application prediction processing unit 125 described above and executes the novelty and expanded prior application prediction routine according to the flowcharts shown in FIGS. 14 and 15.

When the CPU 11 starts the novelty and expanded prior application prediction routine, it advances to step 21, sets the document counter (document ct) and document MAX to "0", and sets the item number counter (item number ct) to "1". Processing then advances to step 22, where a full-text search of the search target publication data in the target publication storage unit 155 is performed using, as search terms, the summary data stored in the summary data storage unit 153 that corresponds to item number ct, and the number of documents hit is set in document MAX. Because the item number counter was set to "1" in step 21, the search terms are taken from the data whose item number area 153b is "1", that is, the summary data of claim 1.

Processing then advances to step 23, where it is determined whether step 22 hit any documents (whether document MAX is 1 or more). If there were hits, processing advances to step 24; otherwise the novelty and expanded prior application prediction routine ends.

In step 24, "1" is added to the document counter. In step 25, it is determined whether the document counter is less than or equal to document MAX; if so, processing advances to step 26, and otherwise to step 29. In step 26, it is determined whether the publication date of the hit document (hit document publication date) is earlier than the filing date of the draft document data (target filing date), that is, whether hit document publication date < target filing date holds. If so, processing advances to step 27, and otherwise to step 28. In step 27, the novelty and expanded prior application prediction data Nd is generated so as to include the novelty flag "N1", which indicates lack of novelty. Processing then returns to step 24 and the same processing is repeated.

In step 28, the expanded prior application prediction routine described later is executed. In step 29, the CT data storage unit 154 is consulted to determine whether there is another independent claim. If there is, processing advances to step 30; otherwise the novelty and expanded prior application prediction data Nd is output in step 31 and the routine ends. In step 30, the CPU 11 sets the document counter and document MAX to "0" and sets the item number counter to a claim number greater than "1". Processing then returns to step 22 and the same processing is repeated.

The CPU 11 executes the expanded prior application prediction routine according to the flowchart shown in FIG. 15. When the routine starts, processing advances to step 41, where it is determined whether the filing date of the hit document (document filing date) is earlier than the target filing date, that is, whether document filing date < target filing date holds. If so, processing advances to step 42; otherwise the routine ends. In step 42, it is determined whether the inventor of the prediction target invention and that of the hit document are the same. If they are not, processing advances to step 43; if they are, the routine ends. In step 43, it is determined whether the applicant of the prediction target invention and that of the hit document are the same. If they are not, processing advances to step 44; if they are, the routine ends. In step 44, the CPU 11 generates the novelty and expanded prior application prediction data Nd so as to include the expanded prior application flag "F1", which indicates that the expanded prior application requirement (the requirement prescribed in Article 29-2 of the Patent Act) is not satisfied. The routine then ends.
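The per-document decisions in steps 26 and 27 and in steps 41 to 44 can be condensed into one function. This is a sketch under assumptions: the dictionary fields are hypothetical, and inventors and applicants are compared as single strings, whereas real records may list several names.

```python
from datetime import date

def classify_hit(hit, target):
    """Return the flag a single hit document would raise, or None."""
    if hit["publication_date"] < target["filing_date"]:      # step 26
        return "N1"                                            # step 27: lack of novelty
    if not hit["filing_date"] < target["filing_date"]:       # step 41
        return None
    if hit["inventor"] == target["inventor"]:                 # step 42: same inventor
        return None
    if hit["applicant"] == target["applicant"]:               # step 43: same applicant
        return None
    return "F1"                                                # step 44: Article 29-2 not satisfied

target = {"filing_date": date(2017, 3, 30), "inventor": "A", "applicant": "X"}
published_before = {"publication_date": date(2016, 1, 1), "filing_date": date(2015, 1, 1),
                    "inventor": "B", "applicant": "Y"}
filed_before_only = {"publication_date": date(2018, 6, 1), "filing_date": date(2016, 12, 1),
                     "inventor": "B", "applicant": "Y"}
same_applicant = dict(filed_before_only, applicant="X")
filed_later = {"publication_date": date(2018, 10, 1), "filing_date": date(2017, 4, 1),
               "inventor": "B", "applicant": "Y"}

flags = [classify_hit(h, target)
         for h in (published_before, filed_before_only, same_applicant, filed_later)]
```

A document published before the target filing date raises the novelty flag. A document filed earlier but published later raises the expanded prior application flag only when both the inventor and the applicant differ.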

When the novelty/expanded prior application prediction routine ends as described above, the process proceeds from step 14 to step 15 in FIG. 13, and the CPU 11 operates as the inventive step prediction processing unit 126 and executes the inventive step prediction routine. The CPU 11 executes the inventive step prediction routine according to the flowcharts shown in FIGS. 16 to 22.

(Inventive step prediction routine)
When the CPU 11 starts the inventive step prediction routine, it proceeds to step 51. For each record in the CT data storage unit 154 whose independent section in the independent section area 154a is a space, it obtains the claim number from the number area 154b and sets it in the number area (No area) 165b of the independent claim table 165 described later. In the subsequent step 52, the CPU 11 obtains, from the CT data storage unit 154, the claim numbers in the number area 154b of the records whose search flag in the search flag area 154e is a space and whose MAX section in the MAX section area 154c is "M", finds the minimum value (MIN) of the obtained claim numbers, and sets it in the MAX counter. As in FIG. 9(B), when there are a plurality of records whose MAX section is "M", the smallest of their claim numbers is set in the MAX counter.

The CPU 11 then proceeds to step 53 and searches the No area 165b of the independent claim table 165. In the subsequent step 54, it determines whether there is a claim number larger than "1", and the process branches on the result: if there is no such claim number, the process proceeds to step 55; if there is, the process proceeds to step 56. Step 55 corresponds to the processing for the case where the claims in the draft document of the invention to be predicted include only one independent claim (single independent claim routine), and step 56 corresponds to the processing for the case where there are a plurality of independent claims (multiple independent claim routine). The former corresponds, for example, to the case where the invention to be predicted is the invention disclosed in Japanese Patent Application Laid-Open No. 2008-62282, and the latter, for example, to the case where it is the invention disclosed in Japanese Patent Application Laid-Open No. 2011-186735.
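Steps 51 to 54 amount to a small selection over the claim records. The sketch below illustrates that selection under assumptions: the dictionary keys (`number`, `independent`, `max_section`, `search_flag`) and the marker "D" used in the usage example for a non-independent claim are hypothetical stand-ins for the areas 154a to 154e, whose concrete encoding is not given here.

```python
def plan_inventive_step_routine(ct_records):
    """Return (independent claim numbers, MAX counter, routine name)."""
    # Step 51: claim numbers of records whose independent section is a space.
    independent = [r["number"] for r in ct_records if r["independent"] == " "]
    # Step 52: smallest claim number among unsearched records whose MAX section is "M".
    marked = [
        r["number"]
        for r in ct_records
        if r["search_flag"] == " " and r["max_section"] == "M"
    ]
    max_counter = min(marked) if marked else None
    # Steps 53-54: an independent claim number larger than 1 means several independent claims.
    routine = "multiple" if any(n > 1 for n in independent) else "single"
    return independent, max_counter, routine
```

For example, a draft with claims 1 to 3 in which only claim 1 is independent and claim 3 carries "M" yields `([1], 3, "single")`.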

As shown in FIG. 24, the independent claim table 165 has a counter area 165a, a No area 165b, and a search flag area 165c. The counter area 165a stores a numerical value corresponding to the number of stored data items. The No area 165b stores the numbers of the independent claims. The search flag area 165c stores a search flag. FIG. 24 shows, as an example, the case where the invention to be predicted is the invention disclosed in Japanese Patent Application Laid-Open No. 2011-186735.

When the CPU 11 proceeds to step 55, it executes the single independent claim routine according to the flowchart shown in FIG. 17. When the CPU 11 starts the single independent claim routine, it proceeds to step 61, obtains the claim numbers from the number area 154b of the records in the CT data storage unit 154 whose search flag in the search flag area 154e is a space, and sets the minimum value (MIN) of those claim numbers in the item number counter. In the subsequent step 62, the independent claim search process described later is performed. In the subsequent step 63, it is determined whether the search flag is "VX" or "VY". If it is, the process proceeds to step 64; otherwise, the single independent claim routine ends.

When the CPU 11 proceeds to step 64, it adds "1" to the item number counter. In the subsequent step 65, it is determined whether the item number counter is less than or equal to the MAX counter set in step 52. If it is, the process proceeds to step 66 and the dependent claim search process described later is executed; otherwise, the independent claim search process ends.

The CPU 11 executes the multiple independent claim routine according to the flowchart shown in FIG. 18. When the CPU 11 starts the processing, it proceeds to step 52 and performs the same processing as described above, and then proceeds to step 55 and executes the single independent claim routine in the same manner as described above. The CPU 11 then proceeds to step 67, where it is determined whether the CT data storage unit 154 contains a record whose search flag in the search flag area 154e is a space (that is, a record for which the search process has not yet been performed). If there is such a record, the process returns to step 52 and the same processing is executed; otherwise, the multiple independent claim routine ends.

The CPU 11 also executes the independent claim search process according to the flowchart shown in FIG. 19. In the independent claim search process, the CPU 11 operates as the cited invention search unit 131 and performs the main cited invention search and the sub-cited invention search for the independent claim.

When the CPU 11 starts the independent claim search process, it proceeds to step 71 and executes the main cited invention search process described later. In the subsequent step 72, it is determined whether a main cited invention was found in the main cited invention search process (whether the main cited document described later has been set). If a main cited invention was found, the process proceeds to step 73; if not, the process proceeds to step 76. In step 73, the sub-cited invention search process described later is executed, and in the following step 74 it is determined whether a sub-cited invention was found in the sub-cited invention search process (whether the sub-cited document described later has been set). If a sub-cited invention was found, the process proceeds to step 75; if not, the process proceeds to step 77.

When the CPU 11 proceeds to step 75, it sets "VX" in the search flag of the corresponding claim number; in step 77, it sets "VY" in the search flag. When the CPU 11 proceeds to step 76, it sets "9", which indicates that the search has been completed, in the search flag Ef in the search flag area 154e for the record, among the records stored in the CT data storage unit 154, whose claim number in the number area 154b matches the item number counter. The CPU 11 also generates the inventive step prediction data Vd1 so that it includes the search flag that was set, and outputs it to the prediction result file generation unit 127. In addition, the CPU 11 outputs the claim summary data ied and the concept search data Vd2 corresponding to the search result to the input vector generation unit 132. In this case, the claim summary data ied can be the summary data of the claim that was searched, but it may instead be the data of the searched claim of the invention to be predicted that is stored in the prediction target TR storage unit 152. The search flag is set to "VX" or "VY" when a main cited invention is found. Once a main cited invention is found, the invention is likely to be judged not to satisfy the inventive step requirement, so whether a reason for refusal for lack of inventive step is found depends largely on whether a main cited invention is found. By including such a search flag, the inventive step prediction data Vd1 indicates whether the inventive step requirement is satisfied.
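The branching of steps 71 to 77 reduces to a two-stage lookup. The following is a minimal sketch under assumptions: `find_main` and `find_sub` are hypothetical callables standing in for the main cited invention search and the sub-cited invention search, and the returned dictionary is an illustrative container, not the format of the data Vd1.

```python
def independent_claim_search(find_main, find_sub):
    """Run the main search, then the sub search, and derive the search flag."""
    main_doc = find_main()  # step 71
    if main_doc is None:  # step 72: no main cited invention, so no flag is set
        return {"flag": None, "main": None, "sub": []}
    sub_docs = find_sub(main_doc)  # step 73
    # Steps 74, 75, 77: "VX" when a sub-cited invention was also found, else "VY".
    flag = "VX" if sub_docs else "VY"
    return {"flag": flag, "main": main_doc, "sub": list(sub_docs)}
```

The caller would still mark the claim as searched ("9") and copy the flag into the prediction data, which the sketch leaves out.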

The CPU 11 executes the main cited invention search process according to the flowchart shown in FIG. 20. The main cited invention search process searches for the main cited invention that is closest to the invention to be predicted.

When the CPU 11 starts the main cited invention search process, it proceeds to step 81 and sets "0" in the expansion degree counter tc. The CPU 11 then proceeds to step 82, reads the following two kinds of data from the summary data storage unit 153, and sets them as the main search document data (the document data used when searching for the main cited invention by concept search). The first is taken from the record whose data type is "C" and whose number in the item number area 153b corresponds to the item number counter (the minimum independent claim number was set in the item number counter in step 61): the terms whose essential flag Ef is "X" and whose expansion degree Ed equals the maximum expansion degree (expansion degree MAX) minus the expansion degree counter tc (for example, if the expansion degree MAX is "5", the terms whose expansion degree Ed is 5 - tc). The second is the problem data, that is, the data of the records whose data type is "P".

In the subsequent step 83, the CPU 11 performs the search for the main cited invention; that is, it performs a concept search on the search target publication data stored in the target publication storage unit 155, using the main search document data as the input document. In this concept search, feature words are extracted and weighted for the main search document data and for each document being searched, a vector (document vector) is generated for each document, and the inner product of the vectors is computed to obtain a similarity. Next, in step 84, it is determined from the result of the concept search in step 83 whether the largest similarity is greater than or equal to a certain value. If it is, the process proceeds to step 85; otherwise, the process proceeds to step 87. In step 85, it is determined whether there were a plurality of documents whose similarity is greater than or equal to the certain value. If not, the process proceeds to step 86; if so, the process proceeds to step 89. In the main cited invention search process, a main cited document is set only when there is a document whose similarity is greater than or equal to the certain value.
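The similarity computation in step 83 can be illustrated with a toy example. The description states only that feature words are extracted and weighted and that an inner product is taken; the whitespace tokenisation, the term-frequency weighting, and the length normalisation below are assumptions chosen to keep the sketch short.

```python
import math
from collections import Counter


def document_vector(text):
    """Build a length-normalised term-frequency vector (assumed weighting)."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values()))
    if norm == 0:
        return {}
    return {term: c / norm for term, c in counts.items()}


def inner_product(u, v):
    """Inner product of two sparse vectors stored as dictionaries."""
    if len(u) > len(v):
        u, v = v, u  # iterate over the shorter vector
    return sum(weight * v.get(term, 0.0) for term, weight in u.items())


def rank_by_similarity(query_text, documents):
    """Return (doc_id, similarity) pairs, most similar first."""
    query = document_vector(query_text)
    scored = [(doc_id, inner_product(query, document_vector(text)))
              for doc_id, text in documents.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

With this normalisation the similarity lies between 0 and 1, so the "certain value" of step 84 would be a threshold in that range.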

In step 86, the hit document is set as the main cited document (the prior art document in which the main cited invention is disclosed), and the main cited invention search process ends. In step 87, "1" is added to the expansion degree counter tc, and in the following step 88 it is determined whether the expansion degree MAX minus the expansion degree counter tc is "0" or less. If it is "0" or less, the main cited invention search process ends; otherwise, the process returns to step 82 and the above processing is repeated.

In this way, the concept search is first performed with the more important terms, namely those whose expansion degree Ed equals the expansion degree MAX. In a concept search, a plurality of documents can be extracted according to their similarity, but when the highest similarity does not reach the certain value, it is highly likely that the document does not qualify as the main cited document. For this reason, when no document with a similarity greater than or equal to the certain value is found, the concept search is executed again, this time including terms whose expansion degree Ed is smaller than the expansion degree MAX.

In step 89, the document with the highest similarity (also called the most similar document) is set as the main cited document, and the main cited invention search process then ends.
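Steps 81 to 89 form a retry loop that relaxes the query one expansion level at a time. The sketch below follows the literal wording of step 82 (terms whose Ed equals MAX - tc); whether lower-level terms replace or are added to the earlier ones is not settled by the text, so this choice is an assumption. The `concept_search` callable, the `(term, expansion_degree)` tuples, and the `threshold` parameter are hypothetical.

```python
def main_cited_search(terms, problem_text, expansion_max, concept_search, threshold):
    """Return the id of the main cited document, or None if none qualifies.

    terms: list of (term, expansion_degree) pairs, assumed already limited
           to terms whose essential flag is "X".
    concept_search: callable taking a list of query strings and returning
           a list of (doc_id, similarity) pairs.
    """
    tc = 0  # step 81
    while expansion_max - tc > 0:  # step 88 exit condition
        level = expansion_max - tc
        # Step 82: terms at the current expansion level plus the problem data.
        query = [term for term, ed in terms if ed == level] + [problem_text]
        scored = concept_search(query)  # step 83
        hits = [(doc_id, sim) for doc_id, sim in scored if sim >= threshold]  # step 84
        if hits:
            # Steps 85, 86, 89: one hit is taken as is; among several, the most similar wins.
            return max(hits, key=lambda hit: hit[1])[0]
        tc += 1  # step 87
    return None
```

Returning `None` corresponds to the case in which no main cited document is set.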

The CPU 11 executes the sub-cited invention search process according to the flowchart shown in FIG. 21. The sub-cited invention search process searches for a sub-cited invention that includes the difference between the invention to be predicted and the main cited invention, and it is executed only when a main cited invention has been found in the main cited invention search process.

When the CPU 11 starts the sub-cited invention search process, it proceeds to step 91 and reads two kinds of data from the summary data storage unit 153: the terms, in the record whose data type is "C" and whose number in the item number area 153b corresponds to the item number counter, that are not included in the main search document data (unused search data), and the data of the records whose data type is "P". It sets these as the sub-search terms (the keywords used when searching for the sub-cited invention by full-text search).

In the subsequent step 92, the CPU 11 performs the search for the sub-cited invention; that is, it performs a full-text search on the search target publication data stored in the target publication storage unit 155, using the sub-search terms as search keywords. In the subsequent step 93, it is determined whether any document was hit in step 92. If so, the process proceeds to step 94; otherwise, the process proceeds to step 96. In step 94, it is determined whether a plurality of documents were hit. If only one document was hit, the process proceeds to step 95; if a plurality were hit, the process proceeds to step 98.

In step 95, the hit document is set as the sub-cited document (the prior art document in which the sub-cited invention is disclosed), and the sub-cited invention search process ends. In step 96, the sub-search terms are changed and the full-text search is performed again. Here, the sub-search terms are changed to the unused search data (the terms, in the record whose data type is "C" and whose number in the item number area 153b corresponds to the item number counter, that are not included in the main search document data) and the data of the records whose data type is "T". In the next step 97, it is determined whether any document was hit. If so, the process proceeds to step 94; if not, the sub-cited invention search process ends. In step 98, for each hit document, the number of matches with the terms whose essential flag Ef is "X" in the record whose data type is "C" and whose number in the item number area 153b corresponds to the item number counter is counted, and the match count is set in the sub-citation file. In the next step 99, the sub-citation file is sorted in descending order of match count, and in the subsequent step 100 the first three entries of the sub-citation file are set as the sub-cited documents, after which the sub-cited invention search process ends. Thus, when a plurality of documents are hit in the sub-cited invention search, the extent to which each of them discloses the characteristic part of the invention to be predicted is examined through the number of matches with the terms whose essential flag Ef is "X", and the documents with the most matches are set as the sub-cited documents.
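The ranking in steps 98 to 100 can be sketched in a few lines. Counting a match as a plain substring occurrence is an assumption made for the illustration; the description does not state how a term is matched against a document. The function and parameter names are likewise hypothetical.

```python
def rank_sub_cited(hit_documents, essential_terms, limit=3):
    """Pick up to `limit` documents with the most essential-term matches.

    hit_documents: dict mapping a document id to its full text.
    essential_terms: terms whose essential flag is "X".
    """
    # Step 98: count how many essential terms each hit document contains.
    sub_citation_file = [
        (doc_id, sum(1 for term in essential_terms if term in text))
        for doc_id, text in hit_documents.items()
    ]
    # Step 99: sort in descending order of match count (ties keep input order).
    sub_citation_file.sort(key=lambda entry: entry[1], reverse=True)
    # Step 100: the first three entries become the sub-cited documents.
    return [doc_id for doc_id, _ in sub_citation_file[:limit]]
```

Because the sort is stable, documents with equal match counts keep the order in which the full-text search returned them.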

The CPU 11 executes the dependent claim search process according to the flowchart shown in FIG. 22. The dependent claim search process is executed only when the search flag is "VX" or "VY" (that is, when a main cited invention has been found). When the CPU 11 starts the dependent claim search process, the process proceeds to step 111, where the terms whose essential flag Ef is "X" are read from the record in the summary data storage unit 153 whose data type is "C" and whose number in the item number area 153b corresponds to the item number counter (in step 64, "1" is successively added to the item number counter, starting from the minimum independent claim number), and those terms are set as the dependent search terms (the keywords used when searching by full-text search for the invention described in the dependent claim). In the next step 112, the CPU 11 performs a full-text search on the main cited document using the dependent search terms as search keywords, and checks whether the invention described in the dependent claim is disclosed in the main cited document.

In the next step 113, it is determined whether any document was hit. If so, the process proceeds to step 114; otherwise, the process proceeds to step 116. In step 114, "VX" is set in the search flag of the corresponding claim number, and the hit document is set as the main cited document of that claim number. The process then proceeds to step 115, where "9" is set in the search flag Ef in the search flag area 154e for the record, among the records stored in the CT data storage unit 154, whose claim number in the number area 154b matches the item number counter, after which the dependent claim search process ends. In step 116, a full-text search is performed on the sub-cited documents using the dependent search terms, and in the next step 117 it is determined whether any document was hit. If so, step 118 is executed and the process then proceeds to step 115; if not, the dependent claim search process ends. In step 118, "VY" is set in the search flag of the corresponding claim number, and the hit document is set as the sub-cited document of that claim number.
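Steps 112 to 118 check the main cited document first and fall back to the sub-cited documents. In the sketch below, treating a full-text "hit" as every dependent search term occurring in the document text is an assumption, as are the tuple layout and the returned dictionary.

```python
def contains_all(text, terms):
    """Assumed hit criterion: every search term occurs in the text."""
    return all(term in text for term in terms)


def dependent_claim_search(dependent_terms, main_doc, sub_docs):
    """main_doc is (doc_id, text); sub_docs is a list of (doc_id, text)."""
    # Steps 112-114: a hit in the main cited document gives "VX".
    if contains_all(main_doc[1], dependent_terms):
        return {"flag": "VX", "hit": main_doc[0], "search_flag": "9"}  # step 115
    # Steps 116-118: otherwise a hit in a sub-cited document gives "VY".
    for doc_id, text in sub_docs:
        if contains_all(text, dependent_terms):
            return {"flag": "VY", "hit": doc_id, "search_flag": "9"}  # step 115
    # No hit: the process ends without setting a flag or marking the record.
    return {"flag": None, "hit": None, "search_flag": " "}
```

As described for step 117, the record is marked "9" only on the two hit paths.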

When the novelty/expanded prior application prediction routine and the inventive step prediction routine have been executed as described above, the novelty/expanded prior application prediction data Nd and the inventive step prediction data Vd1 are output to the prediction result file generation unit 127 according to the respective results. In addition, the requirement suitability document vector V4 is output from the machine learning unit 133, so the prediction result file generation unit 127 uses these to generate the prediction result file shown in FIG. 10 and stores it in the prediction result storage unit 156.

As shown in FIG. 10, the prediction result file stores, for each invention to be predicted, data for the following items: draft document number, claim, main search document data, sub-search terms, search flag, hit documents, and machine prediction. The machine prediction is data corresponding to the requirement suitability document vector V4 from the machine learning unit 133, and indicates whether the possibility of finding a reason for refusal for lack of inventive step citing the main cited document and sub-cited documents found in the inventive step prediction routine is high or low ("H" if high, "L" if low).

The prediction result editing unit 105 reads the prediction result file, edits and outputs a patent requirement suitability prediction list L1 as shown in FIG. 25, and transmits it to the user terminal device 30. The patent requirement suitability prediction list L1 shows the draft document number of the invention to be predicted and the claim number, together with whether the novelty (expanded prior application) and inventive step requirements are satisfied and the documents on which that prediction is based (main cited document, sub-cited documents). "X" is entered when the invention is predicted not to satisfy the novelty (expanded prior application) or inventive step requirement, and "A" is entered when it is predicted to satisfy the requirement. These are determined from the search flag in the prediction result file.

When the invention is predicted not to satisfy the inventive step requirement, the entry "X" (both a main cited document and sub-cited documents found) or "Y" (only a main cited document found) is accompanied by "H" or "L" (FIG. 25 illustrates only the case where "H" is appended). This follows the requirement suitability document vector V4 of the machine learning unit 133: "H" indicates that a reason for refusal for lack of inventive step based on the main cited document is highly likely to be issued, and "L" indicates that it is unlikely.
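The way the list entries follow from the search flag and the machine prediction can be sketched as a simple lookup. The mapping of "VX" to "X" and "VY" to "Y" is inferred from the parenthetical explanations above rather than stated outright, and the tuple returned here is a hypothetical representation; the actual layout of FIG. 25 is not reproduced.

```python
def inventive_step_entry(search_flag, machine_prediction):
    """Return (label, machine mark) for one claim in the prediction list.

    search_flag: "VX", "VY", or anything else when no main cited document was found.
    machine_prediction: "H" or "L" derived from the machine learning output.
    """
    # Inferred mapping: "VX" (main and sub found) -> "X", "VY" (main only) -> "Y".
    labels = {"VX": "X", "VY": "Y"}
    label = labels.get(search_flag)
    if label is None:
        # No main cited document: the requirement is predicted to be satisfied.
        return ("A", None)
    # "H" or "L" accompanies only the entries predicted not to satisfy the requirement.
    return (label, machine_prediction)
```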

(Search-less inventive step prediction routine)
When the CPU 11 starts the search-less inventive step prediction routine, it executes steps 51, 52, 53, and 54 in the same manner as in the inventive step prediction routine, as shown in FIG. 33. In step 54, the process branches depending on whether there is a claim number larger than "1": if there is none, the process proceeds to step 126; if there is, the process proceeds to step 127. Step 126 is the search-less single independent claim routine, and step 127 is the search-less multiple independent claim routine. The former is executed according to FIG. 34, and the latter according to FIG. 35.

(Search-less single independent claim routine)
When the CPU 11 starts the search-less single independent claim routine, it executes step 61 in the same manner as in FIG. 17, as shown in FIG. 34, and then operates as the designated prediction data generation unit 134 and executes steps 130 and 131. When the CPU 11 proceeds to step 130, it reads from the summary data storage unit 153 the record whose item number 153b corresponds to the minimum value MIN and the records whose data type is "P", and generates the claim summary data ied. In the subsequent step 131, the CPU 11 reads the designated prior art data stored in the designated prior art data storage unit 151, generates the inventive step prediction data Vs1 and the designated prior art data Vs2s, and outputs them to the prediction result file generation unit 127 and the input vector generation unit 132, respectively. In the subsequent step 132, "9" is set in the search flag Ef in the search flag area 154e, and the search-less single independent claim routine ends.

(Search-less multiple independent claim routine)
When the CPU 11 starts the search-less multiple independent claim routine, it proceeds to step 126 and executes the search-less single independent claim routine in the same manner as described above, as shown in FIG. 35. The process then proceeds to step 67, which is the same as in FIG. 18, where it is determined whether the CT data storage unit 154 contains a record whose search flag in the search flag area 154e is a space. If there is such a record, the process returns to step 126 and the search-less single independent claim routine is executed again in the same manner; if there is none, the search-less multiple independent claim routine ends.

As described above, in the search mode, the patent requirement suitability prediction server 10 according to the embodiment of the present invention generates summary data from the draft document data of the draft document in which the invention to be predicted is described, and uses that summary data to perform the main cited invention search and the sub-cited invention search. The main cited invention search uses the summary data to find, among prior art inventions, the main cited invention closest to the invention to be predicted, that is, the one that shares its basic framework, and it is performed in line with the Patent Act and the Examination Guidelines for Patent and Utility Model. The sub-cited invention search is performed, when a main cited invention has been found in the main cited invention search, as a full-text search using terms that specify the problem to be solved by the invention or the technical field, and this is likewise in line with the Patent Act and the Examination Guidelines for Patent and Utility Model. Therefore, in the patent requirement suitability prediction server 10 according to the embodiment of the present invention, predictions about whether the patent requirements are satisfied are made in a manner consistent with examination practice, so the burden of preparing the application documents for a patent application can be effectively reduced.

As described above, the patent requirement suitability prediction processing unit 103 has the machine learning unit 133, and the machine learning unit 133 is built as an artificial intelligence program trained with learning data based on past examination results.

Based on actual figures for 2014 (Heisei 26), somewhat more than 320,000 patent applications are filed per year, and a first action (1st action) has already been issued for part of them and for many earlier applications. Among these are many applications for which the notice of reasons for refusal pointed out a reason for refusal for lack of inventive step (inventive step rejection applications).

For an inventive step rejection application, the examination result means that, although there was a difference between the invention described in the claims at the time of examination and the main cited invention, the examiner did not find that the difference alone established an inventive step. By contrast, there are also patent applications for which a decision to grant was issued without any 1st action, and applications for which a notice of reasons for refusal was issued but did not point out a reason for refusal for lack of inventive step (applications without inventive step rejection).

For example, as shown in FIG. 23, suppose there is a patent application Pd under examination with filing date t_0. The inventions that can serve as the main cited invention or a sub-cited invention against patent application Pd are inventions that were publicly known, publicly worked, described in a publication, or made available on the Internet. They are mainly the inventions disclosed in patent publications of applications already published before the filing date t_0 (rf1 to rf6 in FIG. 23).

Suppose that, as a result of examination, the invention disclosed in publication rf6 is found to be the main cited invention. In that case, although there was a difference between the invention of patent application Pd and the invention disclosed in publication rf6, the distance dp corresponding to that difference is considered not to have been large enough to affirm the inventive step of the invention of patent application Pd. Conversely, if the invention disclosed in publication rf6 had been found not to be the main cited invention, the distance dp is considered to have been large enough to affirm the inventive step of the invention of application Pd.

If one could determine how large the difference is when inventive step is affirmed and how large it is when inventive step is denied, that would serve as objective material for judging suitability for the patent requirements. In view of the above, an effective way to do so is to determine the size of the distance dp corresponding to the difference between two inventions. The machine learning unit 133 determines this by learning training data based on past examination results, and gives an indication of whether inventive step is likely or unlikely to be denied.

In training the machine learning unit 133, the invention of the present application treats the distance dp as the difference between two document vectors. To learn the distance dp both for cases with and for cases without a reason for refusal for lack of inventive step, training uses the training data based on the HL pattern described earlier.

When predicting suitability for the patent requirements, summary data is obtained for the invention to be predicted and is used in a concept search to find the most similar document. The most similar document has the document vector with the highest similarity to the document vector of the invention to be predicted (more precisely, the document vector obtained from the matter recited in the independent claim and the like), so among published applications it is regarded as the most likely to become the main cited document.

The difference between the citation candidate vector RfV obtained from the most similar document and the summary vector EV obtained from the summary data of the invention to be predicted is computed to generate the summary movement vector V3. V3 is input to the machine learning unit 133, which outputs whether or not there is a reason for refusal for lack of inventive step citing the main cited document found by the main cited invention search. This makes it possible to estimate whether a reason for refusal for lack of inventive step is likely or unlikely to be found.
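The generation of the summary movement vector can be illustrated with a minimal sketch. The description only says that V3 is "the difference" between RfV and EV; the direction EV - RfV, the function name movement_vector, and the sample values are assumptions made for illustration.

```python
from typing import List, Sequence


def movement_vector(ev: Sequence[float], rfv: Sequence[float]) -> List[float]:
    """Element-wise difference EV - RfV (the sign convention is an assumption)."""
    if len(ev) != len(rfv):
        raise ValueError("EV and RfV must have the same dimension")
    return [e - r for e, r in zip(ev, rfv)]


# Hypothetical three-dimensional document vectors.
ev = [0.5, 0.0, 1.0]    # summary vector of the invention to be predicted
rfv = [0.25, 0.5, 1.0]  # citation candidate vector of the most similar document
v3 = movement_vector(ev, rfv)
```

A vector such as v3 would then be passed to the trained classifier; the classifier itself is outside this sketch.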

As described above, because the patent requirement suitability prediction server 10 includes the machine learning unit 133, it predicts suitability for the patent requirements in a way that reflects the examination record of the Patent Office. Judgments that previously had to rely solely on the experience and intuition of experts such as examiners and patent attorneys can now draw on the judgment of artificial intelligence, so the prediction results can be made objective, and the efficiency of rights acquisition work can be expected to improve through a lighter burden of preparing application documents.

In no-search mode in particular, suitability for the patent requirements is predicted in relation to the designated prior art invention, so the prediction indicates whether the invention to be predicted satisfies the inventive step requirement in view of the prior art invention that the user designated. When application documents are prepared, they are written so that the difference between the invention to be filed and the prior art described in documents found in the prior art search is clear, but it is often uncertain whether that difference really gives the invention an inventive step. Moreover, a reason for refusal for lack of inventive step is sometimes pointed out even though the application documents were prepared so as to establish inventive step. In this respect, the machine learning unit 133 of the patent requirement suitability prediction server 10 gives an indication of whether the inventive step requirement is met, and using that indication at the document preparation stage can be expected to improve the quality of the application documents as well as reduce the preparation burden.

In addition, the inventive step prediction processing unit 126 performs a concept search for the main cited invention search and a full-text search for the sub-cited invention search. Whether a reason for refusal for lack of inventive step exists depends largely on whether a main cited invention is found. If the main cited invention search were a full-text search, several documents could be hit and the main cited invention (main cited document) might not be identifiable. In a concept search, by contrast, similar documents are ordered by the similarity obtained from the inner product of document vectors, so the main cited document can be identified by selecting the document with the highest similarity. Combining the main cited document found in this way with the prediction by the machine learning unit 133 makes it possible to predict whether a reason for refusal for lack of inventive step citing that document is likely or unlikely. Performing a full-text search for the sub-cited invention search also makes clear whether any sub-cited document exists.
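The ordering of documents by inner-product similarity can be sketched as follows. This is only an illustration under stated assumptions: the function names, the dictionary layout of the corpus, and the sample vectors are hypothetical, and the description does not specify whether the vectors are normalized before the inner product is taken.

```python
from typing import Dict, List, Sequence, Tuple


def inner_product(a: Sequence[float], b: Sequence[float]) -> float:
    """Plain inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def rank_by_similarity(
    query: Sequence[float], corpus: Dict[str, Sequence[float]]
) -> List[Tuple[str, float]]:
    """Return (document id, similarity) pairs in descending order of similarity."""
    scored = [(doc_id, inner_product(query, vec)) for doc_id, vec in corpus.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical document vectors for three published applications.
corpus = {
    "rf1": [1.0, 0.0, 0.0],
    "rf2": [0.5, 0.0, 0.0],
    "rf6": [1.0, 1.0, 0.0],
}
query = [1.0, 1.0, 0.0]  # document vector of the invention to be predicted

ranking = rank_by_similarity(query, corpus)
main_cited = ranking[0][0]  # the most similar document is taken as the main cited document
```

Because every document receives a score, a single best candidate can always be selected, which is the property the paragraph above contrasts with a full-text search.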

The above description shows, as a more preferred embodiment, the case in which the cited invention search unit 131 of the inventive step prediction processing unit 126 has a main cited invention search unit and a sub-cited invention search unit. As noted above, when the main cited invention search finds a main cited invention, a reason for refusal for lack of inventive step is likely to be found. Therefore, once a main cited invention is found, the input vector generation unit 132 and the machine learning unit 133 may be operated to output the requirement suitability document vector V4 without performing the sub-cited invention search. Even in this case, using the inventive step prediction data Vd1 on the main cited document set in step 89 together with the requirement suitability document vector V4 generated by the machine learning unit 133 yields a prediction consistent with examination practice, and because the prediction draws on the judgment of artificial intelligence, the result can be made objective. Accordingly, it suffices for the inventive step prediction processing unit 126 to have the main cited invention search unit, and it need not have the sub-cited invention search unit, although an inventive step prediction processing unit 126 that has the sub-cited invention search unit is more preferable, as described above.

Second Embodiment
Next, a patent requirement suitability prediction server 200 according to a second embodiment is described with reference to FIGS. 26 to 30. As shown in FIG. 26, the patent requirement suitability prediction server 200 differs from the patent requirement suitability prediction server 10 described above in that it has a patent requirement suitability prediction processing unit 203, a prediction result editing processing unit 205, and a prediction result storage unit 256 in place of the patent requirement suitability prediction processing unit 103, the prediction result editing processing unit 105, and the prediction result storage unit 156, and in that it outputs a prediction result list L2 instead of the prediction result list L1.

As shown in FIG. 27, the patent requirement suitability prediction processing unit 203 differs from the patent requirement suitability prediction processing unit 103 in that it has an inventive step prediction processing unit 226 and a prediction result file generation unit 227 in place of the inventive step prediction processing unit 126 and the prediction result file generation unit 127.

The inventive step prediction processing unit 226 differs from the inventive step prediction processing unit 126 in that it has a cited invention search unit 231 and a machine learning unit 233 in place of the cited invention search unit 131 and the machine learning unit 133, and in that the input vector generation unit 132 operates differently.

The inventive step prediction processing unit 126 of the first embodiment sets only the most similar document as the main cited document through the main cited invention search (step 89). The inventive step prediction processing unit 226 of the second embodiment instead extracts, in descending order of similarity, n documents including the most similar document (n is an integer of 2 or more) as similar documents and sets each of them as a main cited document. In addition, the machine learning unit 233 classifies each of the n summary movement vectors V3_1 to V3_n into one of three classes described later: S class, H class, and L class.

Like the cited invention search unit 131, the cited invention search unit 231 has a main cited invention search unit and a sub-cited invention search unit, but its main cited invention search unit operates differently and outputs different data. In the cited invention search unit 131, the main cited invention search unit sets the most similar document as the main cited document (step 89 of the main cited invention search process described above). In step 89, the main cited invention search unit of the cited invention search unit 231 instead extracts, in descending order of similarity, n documents including the most similar document as similar documents and sets them as the main cited documents (doc_1 to doc_n). Because of this difference, in step 76 of the independent claim search process described above, the CPU 11 generates n items of inventive step prediction data Vd1_1 to Vd1_n, one for each similar document, and outputs them to the prediction result file generation unit 227. Also in the independent claim search process, the CPU 11 outputs the claim summary data ied to the input vector generation unit 132, and outputs n items of concept search data Vd2_1 to Vd2_n, one for each similar document, to the input vector generation unit 132.

Unlike the machine learning unit 133, the machine learning unit 233 is built, through machine learning (supervised learning) that uses the SHL pattern described next as the learning pattern, to classify each of the n summary movement vectors V3_1 to V3_n for the similar documents into the S class, the H class, or the L class, and to output n output signals (requirement suitability document vectors V4_1 to V4_n) corresponding to the classification results. The S class, H class, and L class correspond, respectively, to the class with an extremely high possibility of not meeting the inventive step requirement, the class with a high possibility of not meeting it, and the class that meets it (that is, for the invention to be predicted, the classes in which the possibility of finding a reason for refusal for lack of inventive step is extremely high, high, and none).

The SHL pattern is a combination of the following three patterns S, H, and L.
Pattern S: the learning document vector is the first learning document vector, combined with a teacher vector indicating that there are reasons for refusal for both lack of novelty and lack of inventive step (for example, a vector in which only the dimension corresponding to the correct class is "1" and the others are "0").
Pattern H: the learning document vector is the second learning document vector, combined with a teacher vector indicating that there is a reason for refusal for lack of inventive step but none for lack of novelty (for example, a vector in which only a dimension different from the above, corresponding to the correct class, is "1" and the others are "0").
Pattern L: the learning document vector is the third learning document vector, combined with a teacher vector indicating that there is no reason for refusal for lack of inventive step (for example, a vector in which only a dimension different from the two above is "1" and the others are "0").
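The teacher vectors of the three patterns can be pictured as one-hot vectors, one dimension per class. The sketch below is illustrative only; the dimension order (S, H, L) and the helper names are assumptions, since the description only requires that each class use a different dimension.

```python
from typing import List, Sequence, Tuple

# Assumed dimension order; the description does not fix which dimension maps to which class.
CLASSES = ("S", "H", "L")


def teacher_vector(label: str) -> List[int]:
    """One-hot teacher vector: 1 in the dimension of the correct class, 0 elsewhere."""
    if label not in CLASSES:
        raise ValueError(f"unknown class: {label}")
    return [1 if c == label else 0 for c in CLASSES]


def training_pair(
    learning_vector: Sequence[float], label: str
) -> Tuple[List[float], List[int]]:
    """Pair a learning document vector with the teacher vector of its pattern."""
    return list(learning_vector), teacher_vector(label)


# Hypothetical learning document vector labeled with pattern H.
pair_h = training_pair([0.25, -0.5, 0.0], "H")
```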

The first learning document vector is a first moving document vector. It is obtained from a published application (a novelty and inventive step rejection application) for which, as a result of examination by the Patent Office, a first notice of reasons for refusal (1st action) was issued that pointed out reasons for refusal for lack of novelty and lack of inventive step citing the same document (reasons that the requirements of Article 29, paragraph 1, item 3 and paragraph 2 of the same Article of the Patent Act are not met). The vector corresponds to the difference between the document vector for the claim against which those reasons were pointed out (as of issuance of the notice) and the document vector (first cited document vector) for cited document 1 at that time, namely the first main cited publication cited as the primary publication.

The second learning document vector is a second moving document vector. It is obtained from a published application (an inventive step rejection application) for which, as a result of examination by the Patent Office, a first notice of reasons for refusal (1st action) was issued that did not point out a reason for refusal for lack of novelty (a reason that the requirement of Article 29, paragraph 1, item 3 of the Patent Act is not met) but did point out a reason for refusal for lack of inventive step (a reason that the requirement of paragraph 2 of the same Article is not met). The vector corresponds to the difference between the document vector for the claim against which that reason was pointed out (as of issuance of the notice) and the document vector (second cited document vector) for cited document 1 at that time, namely the second main cited publication cited as the primary publication.

The third learning document vector is a third moving document vector. It is obtained from a published application that is either an application for which a decision to grant was issued without any 1st action (no-rejection application) or an application for which a 1st action was issued but did not point out a reason for refusal for lack of inventive step (application without inventive step rejection). The vector corresponds to the difference between the document vector for claim 1 (as of issuance of the notice of reasons for refusal) and the document vector (non-cited document vector) for the document found to have the highest similarity in a concept search performed for that application (the most similar document for learning).

Like the machine learning unit 133, the machine learning unit 233 can apply a neural network, which models information processing on the neural circuitry of the brain. Among such networks, a BP (backpropagation) network is preferred.

As shown in FIG. 28, the input vector generation unit 132 of the inventive step prediction processing unit 126 has a summary vector generation unit 132a, a citation candidate vector generation unit 132b, and a movement vector generation unit 132c. Of these, the citation candidate vector generation unit 132b and the movement vector generation unit 132c operate differently. Specifically, the citation candidate vector generation unit 132b receives the n items of concept search data Vd2_1 to Vd2_n from the cited invention search unit 231. It takes the publication data of each similar document contained in them, extracts the feature words, weights each word, and generates n document vectors (citation candidate vectors) RfV_1 to RfV_n, one for each similar document. The movement vector generation unit 132c computes the difference between the summary vector EV and each of the citation candidate vectors RfV_1 to RfV_n, and generates n summary movement vectors V3_1 to V3_n corresponding to the differences between the two document vectors.
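The flow from similar documents to n summary movement vectors might look like the following sketch. It is not the method of the embodiment: whitespace tokenization, raw term counts as word weights, a fixed shared vocabulary, the direction EV - RfV, and all names and sample texts are assumptions chosen to keep the example small.

```python
from collections import Counter
from typing import List, Sequence


def document_vector(text: str, vocabulary: Sequence[str]) -> List[float]:
    """Weight each vocabulary word by its raw count in the text (assumed weighting)."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocabulary]


def movement_vectors(
    ev: Sequence[float], rfvs: Sequence[Sequence[float]]
) -> List[List[float]]:
    """One movement vector EV - RfV_i per citation candidate vector."""
    return [[e - r for e, r in zip(ev, rfv)] for rfv in rfvs]


# Hypothetical feature-word vocabulary and texts.
vocabulary = ["sensor", "motor", "control"]
ev = document_vector("sensor control control", vocabulary)  # summary vector EV
similar_docs = ["sensor motor", "motor control control"]    # n = 2 similar documents
rfvs = [document_vector(text, vocabulary) for text in similar_docs]
v3_list = movement_vectors(ev, rfvs)
```

Each entry of v3_list would be classified separately, giving one class per similar document.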

As described above, the similar documents are extracted by the concept search of the main cited invention search unit in descending order from the highest similarity, so each of them is presumed to have a high probability of being cited as the document disclosing the main cited invention in the examination of the invention to be predicted. Therefore, by treating each similar document as a citation candidate, obtaining the citation candidate vectors RfV_1 to RfV_n, and computing their differences from the summary vector EV, summary movement vectors V3_1 to V3_n are generated that correspond to the differences between the invention to be predicted and the inventions disclosed in the similar documents.

The prediction result file generation unit 227 differs from the prediction result file generation unit 127 in that it generates a prediction result file having the layout shown in FIG. 29 and stores it in the prediction result storage unit 256, and in that it operates as the non-conformance rate calculation unit according to the embodiment of the present invention and calculates the non-conformance rate Vr for the invention to be predicted.

The non-conformance rate Vr is the possibility that the invention to be predicted does not meet the inventive step requirement, that is, the possibility that a reason for refusal for lack of inventive step will be found for it. The requirement suitability document vectors V4_1 to V4_n output from the machine learning unit 233 express the possibility of not meeting the inventive step requirement as extremely high, high, or none, so they indicate the possibility of finding such a reason for refusal in relation to each similar document. The prediction result file generation unit 227 therefore uses them to calculate the non-conformance rate Vr for the invention to be predicted, following a non-conformance rate calculation rule. In the present embodiment, this rule is set in a calculation rule table (not shown) so that the value of Vr is determined by the numbers of S class, H class, and L class results among V4_1 to V4_n. The rule can be, for example, as follows.

Two or more S class: Vr ≥ 85%
One S class, and the H class count is 50% or more: Vr ≥ 75%
One S class, and the H class count is less than 50%: Vr ≥ 65%
No S class, and the H class count is 50% or more: Vr ≥ 50%
No S class, and the H class count is less than 50%: Vr ≥ 40%
No S class and no H class: Vr ≥ 15%

Under this rule, for example, with five similar documents (n = 5), Vr ≥ 85% if two are S class. With one S class, Vr ≥ 75% if three are H class but Vr ≥ 65% if two are H class. With no S class and no H class (all L class), Vr ≥ 15%.
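The example rule can be written as a small lookup function. This is a sketch, not the calculation rule table of the embodiment: the function name is hypothetical, the H class percentage is assumed to be taken over all n classification results (which matches the n = 5 example), and the case with neither S nor H is checked before the "less than 50%" case because the two rows overlap as written.

```python
from typing import Sequence


def non_conformance_lower_bound(classes: Sequence[str]) -> int:
    """Lower bound of Vr in percent for a list of per-document classes ("S", "H", "L")."""
    n = len(classes)
    if n == 0:
        raise ValueError("at least one classification result is required")
    s_count = sum(1 for c in classes if c == "S")
    h_count = sum(1 for c in classes if c == "H")
    h_is_half_or_more = 2 * h_count >= n  # H class count is 50% or more of n (assumption)

    if s_count >= 2:
        return 85
    if s_count == 1:
        return 75 if h_is_half_or_more else 65
    if h_count == 0:
        return 15  # all results are L class
    return 50 if h_is_half_or_more else 40


# The n = 5 cases from the text.
two_s = non_conformance_lower_bound(["S", "S", "L", "L", "L"])
one_s_three_h = non_conformance_lower_bound(["S", "H", "H", "H", "L"])
one_s_two_h = non_conformance_lower_bound(["S", "H", "H", "L", "L"])
all_l = non_conformance_lower_bound(["L"] * 5)
```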

As shown in FIG. 29, the prediction result file generated by the prediction result file generation unit 227 differs from the one generated by the prediction result file generation unit 127 in that it contains the non-conformance rate Vr.

The prediction result editing processing unit 205 differs from the prediction result editing processing unit 105 in that it reads the prediction result file and edits and outputs a patent requirement suitability prediction list L2 such as the one shown in FIG. 30. The list L2 differs from the patent requirement suitability prediction list L1 in that an OA rate is added. The OA rate indicates the possibility that, for the invention to be predicted, a reason for refusal for violating a patent requirement (novelty or inventive step) will be found during examination and a notice of reasons for refusal stating it will be issued. It shows a value corresponding to the non-conformance rate Vr described above.

As described above, in the patent requirement suitability prediction server 200 according to the second embodiment, the cited invention search unit 231 in the inventive step prediction processing unit 226 of the patent requirement suitability prediction processing unit 203 sets n similar documents, including the most similar document, as main cited documents. In addition, the machine learning unit 233 classifies the summary movement vectors V3_1 to V3_n for the similar documents into the three classes S, H, and L, and outputs requirement suitability document vectors V4_1 to V4_n corresponding to the classification results.

In the patent requirement suitability prediction server 10 according to the first embodiment, suitability for the patent requirements is predicted in a manner consistent with examination practice and in a way that reflects the examination record of the Patent Office. The same applies to the patent requirement suitability prediction server 200.

However, in the patent requirement suitability prediction server 10, the main cited invention search extracts only the most similar document as the main cited document. Because the most similar document is the one found by the concept search to have the highest similarity to the invention to be predicted, it is considered the most likely to be cited as the main cited document in actual examination. Even so, the most similar document is not necessarily cited in actual examination. The similarity of the most similar document may differ only slightly from that of the document with the next highest similarity (the next similar document), and the next similar document may well be more appropriate as the main cited document. Therefore, if the main cited invention search extracts a plurality of documents including the most similar document and the machine learning unit 233 classifies the document vectors for all of them, suitability for the patent requirements is predicted with the next similar document also taken into account. As a result, the prediction accuracy of the patent requirement suitability prediction server 200 is higher than that of the patent requirement suitability prediction server 10.

In actual examination practice, when a reason for refusal for lack of inventive step is found for a patent application, that reason and a reason for refusal for lack of novelty are sometimes raised by citing the same document (the document cited in this case is also referred to as a novelty/inventive step rejection cited document). In such a case, the examiner has judged that there is no difference between the invention according to a claim of the patent application and the invention disclosed in the novelty/inventive step rejection cited document. Accordingly, when the summary movement vectors V3_1 to V3_n include a document vector classified into the S class, a reason for refusal for lack of inventive step is considered more likely to be found than otherwise. Therefore, by constructing the machine learning unit 233 through machine learning with the SHL pattern, so that the distance dp viewed from novelty/inventive step rejected applications and the distance dp viewed from inventive step rejected applications can be distinguished, it becomes possible to make predictions that distinguish cases where a reason for refusal for lack of inventive step is extremely likely to be found from cases where it is not. This can be expected to improve the prediction accuracy of the patent requirement suitability prediction server 200 and to further improve work efficiency.
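One way to prepare training labels that keep these two situations apart is sketched below in Python. The sketch assumes that S denotes a document cited for both a novelty rejection and an inventive step rejection, H a document cited for an inventive step rejection only, and L any other document. This reading of the SHL pattern, as well as the record structure and all names, is an assumption made for illustration and is not specified in the text above.

```python
from dataclasses import dataclass, field


@dataclass
class ExaminationRecord:
    """Documents cited against one application (hypothetical structure)."""
    novelty_citations: set = field(default_factory=set)
    inventive_step_citations: set = field(default_factory=set)


def shl_label(record, doc_id):
    """Assign an assumed S/H/L training label to one cited document."""
    cited_for_novelty = doc_id in record.novelty_citations
    cited_for_inventive_step = doc_id in record.inventive_step_citations
    if cited_for_novelty and cited_for_inventive_step:
        return "S"  # same document cited for both reasons for refusal
    if cited_for_inventive_step:
        return "H"  # cited for lack of inventive step only
    return "L"      # not cited against inventive step


# Hypothetical record: D1 cited for both reasons, D2 for inventive step only.
record = ExaminationRecord(
    novelty_citations={"D1"},
    inventive_step_citations={"D1", "D2"},
)
labels = {doc: shl_label(record, doc) for doc in ("D1", "D2", "D3")}
```

Keeping S and H as separate labels is what would let a trained classifier report the "extremely likely" case separately from the merely "likely" one.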

(Modification 1)
In the inventive step prediction processing unit 226 described above, a single machine learning unit 233 classifies the n summary movement vectors V3_1 to V3_n. However, as in the inventive step prediction processing unit 227 of the patent requirement suitability prediction processing unit 213 shown in FIG. 31, a plurality of machine learning units 233_1 to 233_n corresponding to the summary movement vectors V3_1 to V3_n may be provided, each of which classifies the corresponding one of the summary movement vectors V3_1 to V3_n. Although not shown, a plurality of input vector generation units 132 may also be provided according to the number of items of concept search data Vd2_1 to Vd2_n. With these configurations, the machine learning units 233_1 to 233_n or the input vector generation units 132 execute their processing in parallel, so the processing time can be shortened. Also, although not shown, when a plurality of items of designated prior art data are stored in the designated prior art data storage unit 151 (that is, when a plurality of items of designated prior art data have been received), the designated prediction data generation unit 134 may generate a plurality of items of inventive step prediction data Vs1 and designated prior art data Vs2 accordingly.
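The parallel arrangement of Modification 1 can be sketched in Python with the standard `concurrent.futures` module, under stated assumptions. The `classify` function below is only a stand-in for a trained machine learning unit: it thresholds the Euclidean norm of a movement vector, and both the rule and the thresholds are hypothetical. Only the dispatch pattern, one classification task per summary movement vector, is the point of the sketch.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def classify(movement_vector):
    """Stand-in classifier (hypothetical rule): threshold on the vector norm."""
    norm = math.sqrt(sum(x * x for x in movement_vector))
    if norm < 0.2:
        return "S"
    if norm < 0.6:
        return "H"
    return "L"


def classify_all(movement_vectors, max_workers=4):
    """Classify every movement vector concurrently, preserving input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map returns results in the order of the inputs.
        return list(pool.map(classify, movement_vectors))


# Hypothetical summary movement vectors V3_1 to V3_3.
vectors = [[0.1, 0.0, 0.0], [0.3, 0.4, 0.0], [1.0, 1.0, 1.0]]
results = classify_all(vectors)
```

For CPU-bound classifiers written in pure Python, `ProcessPoolExecutor` from the same module may be the better fit, since threads in CPython do not run Python bytecode in parallel.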

(Modification 2)
The embodiments described above take as an example the case where the patent requirement suitability prediction program is installed on the patent requirement suitability prediction servers 10 and 200, so that the servers 10 and 200 function as patent requirement suitability prediction devices. The present invention is also applicable to the case where the user terminal device 30 functions as the patent requirement suitability prediction device. In this case, the patent requirement suitability prediction program described above is modified in accordance with at least the following changes 1) and 2), and the modified program is downloaded from the patent requirement suitability prediction server 10 or 200 to the user terminal device 30 and installed on it.

Change 1) Image data for input operations, such as entering a designated number, is displayed on the user terminal device 30 without being transmitted from the patent requirement suitability prediction server 10 or 200 to the user terminal device 30.
Change 2) The user terminal device 30 outputs the patent requirement suitability prediction list.

The above is a description of embodiments of the present invention and does not limit the apparatus and method of the invention; various modifications can readily be implemented. Apparatuses and methods configured by appropriately combining the components, functions, features, or method steps of the embodiments are also included in the present invention.

For example, the user terminal device need not be a high-function mobile phone or a tablet-type terminal device; it may be a notebook computer or a PDA. The patent requirement suitability prediction program executed by the CPU 11 can be recorded on various recording media such as magnetic recording media, CD-ROMs, and DVDs, and can also be downloaded from a server (not shown) via a network.

In the above embodiments, the user terminal device 30 transmits the draft document data to the patent requirement suitability prediction server 10. Because the prediction target invention specified by the draft document data has not yet been filed, it must be kept from becoming publicly known. To that end, it is preferable to provide the user terminal device 30 with summary data extraction means (corresponding to the summary data extraction unit 102) that extracts summary data from the draft document data, so that the user terminal device 30 generates the summary data and transmits it to the patent requirement suitability prediction server 10 by encrypted communication. In this case, the patent requirement suitability prediction server 10 need only have the summary data storage unit 153 for storing the received summary data, and need not have the summary data extraction unit 102.
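The client-side arrangement described above can be sketched in Python as follows. The sketch assumes a hypothetical draft structure in which each claim already has its characteristic part separated out, and it passes those text segments through unchanged rather than reducing them to term data. Encryption is assumed to be provided by the transport (for example, HTTPS) and is not shown. All field names and sample text are illustrative only.

```python
import json


def build_summary_payload(draft):
    """Build the JSON payload to send, containing summary data only.

    The full detailed description is deliberately left out so that the
    unfiled draft never leaves the user terminal device.
    """
    summary = {
        "characteristic_parts": [
            claim["characteristic_part"] for claim in draft["claims"]
        ],
        "problem": draft["problem"],
    }
    return json.dumps(summary, ensure_ascii=False)


# Hypothetical draft held on the user terminal device.
draft = {
    "claims": [
        {
            "preamble": "A widget comprising a base,",
            "characteristic_part": "a hinge that locks at two angles",
        }
    ],
    "problem": "conventional widgets cannot hold an intermediate angle",
    "detailed_description": "Full confidential description of the embodiment.",
}

payload = build_summary_payload(draft)
```

With this split, the server side only needs to store what it receives, which matches the remark that the server can omit its own extraction unit.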

By applying the present invention, predictions of patent requirement suitability are made in a manner consistent with examination practice, and the burden of preparing application documents can be effectively reduced. The present invention can be used in the field of patent requirement suitability prediction devices and patent requirement suitability prediction programs.

DESCRIPTION OF SYMBOLS: 1 ... patent requirement suitability prediction system; 10, 200 ... patent requirement suitability prediction server; 11, 31 ... CPU; 30 ... user terminal device; 101 ... draft text data generation unit; 102 ... summary data extraction unit; 103, 203, 213 ... patent requirement suitability prediction processing unit; 105 ... prediction result editing processing unit; 106 ... designated prior art data generation unit; 125 ... novelty / enlarged prior application prediction processing unit; 126, 226, 227 ... inventive step prediction processing unit; 132 ... input vector generation unit; 132a ... summary vector generation unit; 132b ... citation candidate vector generation unit; 132c ... movement vector generation unit; 133, 233 ... machine learning unit; 134 ... designated prediction data generation unit; 151 ... designated prior art data storage unit; 153 ... summary data storage unit; 154 ... CT data storage unit; 156, 256 ... prediction result storage unit; L1, L2 ... prediction result list.

Claims

A patent requirement suitability prediction device comprising:
summary data storage means for storing, as summary data, data extracted from draft document data constituting a draft document that includes one or more claims describing a prediction target invention, which is the subject of patent requirement suitability prediction, and a detailed description describing the prediction target invention, the extracted data being term data indicating terms capable of specifying the gist of the prediction target invention and including at least characteristic part data extracted from the characteristic part of each claim and problem data extracted from the problem section of the detailed description;
designated prior art data storage means for storing designated prior art data constituting a designated prior art document in which a designated prior art invention is described; and
patent requirement suitability prediction processing means having an inventive step prediction processing unit that generates invention-designated inventive step prediction data on the suitability of the inventive step of the prediction target invention in relation to the designated prior art invention, and a prediction result file generation unit that generates, using the invention-designated inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention,
wherein the inventive step prediction processing unit has a document classification unit that classifies document vectors, and
the document classification unit is constructed, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, to classify an input summary movement vector according to whether or not it conforms to the inventive step requirement and to output a requirement suitability document vector according to the classification result, the summary movement vector being a vector corresponding to the difference between the summary vector of the prediction target invention according to each claim and the citation candidate vector according to the designated prior art data.

A patent requirement suitability prediction device comprising:
summary data storage means for storing, as summary data, data extracted from draft document data constituting a draft document that includes one or more claims describing a prediction target invention, which is the subject of patent requirement suitability prediction, and a detailed description describing the prediction target invention, the extracted data being term data indicating terms capable of specifying the gist of the prediction target invention and including at least characteristic part data extracted from the characteristic part of each claim and problem data extracted from the problem section of the detailed description;
designated prior art data storage means for storing designated prior art data constituting a designated prior art document in which a designated prior art invention is described; and
patent requirement suitability prediction processing means having a novelty prediction processing unit that searches publication gazette data, which is electronic data of published gazettes, using the summary data stored in the summary data storage means and generates novelty prediction data indicating the suitability of the novelty requirement of the prediction target invention according to the search result, an inventive step prediction processing unit that generates inventive step prediction data indicating the suitability of the inventive step requirement, and a prediction result file generation unit that generates, using the novelty prediction data and the inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention,
wherein the inventive step prediction processing unit has a main cited invention search unit that searches for a main cited invention closest to the prediction target invention among the prior art inventions specified by the publication gazette data, and a document classification unit that classifies document vectors,
the main cited invention search unit performs a concept search on the publication gazette data using, as main search document data, the characteristic part data and the problem data of each claim from the summary data stored in the summary data storage means, extracts as similar documents a plurality of documents in descending order of similarity, including the most similar document having the highest similarity, and treats each similar document as a main cited document in which a main cited invention is disclosed,
the document classification unit is constructed, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, to classify an input summary movement vector into one of a class with an extremely high possibility of not conforming to the inventive step requirement, a class with a high possibility of not conforming to it, and a class conforming to it, and to output a requirement suitability document vector according to the classification result, the summary movement vector being a plurality of vectors corresponding to the differences between the summary vector of the prediction target invention according to each claim and the citation candidate vectors according to the respective similar documents, and
the patent requirement suitability prediction processing means further has:
a non-conformance rate calculation unit that calculates, according to the plurality of requirement suitability document vectors output from the document classification unit, a non-conformance rate indicating the possibility that the prediction target invention does not conform to the inventive step requirement; and
a prediction processing control unit that, when the designated prior art data is stored in the designated prior art data storage means, controls the inventive step prediction processing unit so that it generates, in place of the inventive step prediction data, invention-designated inventive step prediction data on the suitability of the inventive step of the prediction target invention in relation to the designated prior art invention.

A patent requirement suitability prediction device comprising:
summary data storage means for storing, as summary data, data extracted from draft document data constituting a draft document that includes one or more claims describing a prediction target invention, which is the subject of patent requirement suitability prediction, and a detailed description describing the prediction target invention, the extracted data being term data indicating terms capable of specifying the gist of the prediction target invention and including at least characteristic part data extracted from the characteristic part of each claim and problem data extracted from the problem section of the detailed description;
patent requirement suitability prediction processing means having a novelty prediction processing unit that searches publication gazette data, which is electronic data of published gazettes, using the summary data stored in the summary data storage means and generates novelty prediction data indicating the suitability of the novelty requirement of the prediction target invention according to the search result, an inventive step prediction processing unit that generates inventive step prediction data indicating the suitability of the inventive step requirement, and a prediction result file generation unit that generates, using the novelty prediction data and the inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention; and
prediction result storage means for storing the prediction result file generated by the prediction result file generation unit,
wherein the inventive step prediction processing unit has a main cited invention search unit that searches for a main cited invention closest to the prediction target invention among the prior art inventions specified by the publication gazette data, and a document classification unit that classifies document vectors,
the main cited invention search unit performs a concept search on the publication gazette data using, as main search document data, the characteristic part data and the problem data of each claim from the summary data stored in the summary data storage means, extracts as similar documents a plurality of documents in descending order of similarity, including the most similar document having the highest similarity, and treats each similar document as a main cited document in which a main cited invention is disclosed, and
the document classification unit is constructed, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, to classify an input summary movement vector into one of a class with an extremely high possibility of not conforming to the inventive step requirement, a class with a high possibility of not conforming to it, and a class conforming to it, and to output a requirement suitability document vector according to the classification result, the summary movement vector being a plurality of vectors corresponding to the differences between the summary vector of the prediction target invention according to each claim and the citation candidate vectors according to the respective similar documents.

A patent requirement suitability prediction program for causing a computer to function as a patent requirement suitability prediction device, the program causing the computer to function as:
summary data storage control means for storing, as summary data, data extracted from draft document data constituting a draft document that includes one or more claims describing a prediction target invention, which is the subject of patent requirement suitability prediction, and a detailed description describing the prediction target invention, the extracted data being term data indicating terms capable of specifying the gist of the prediction target invention and including at least characteristic part data extracted from the characteristic part of each claim and problem data extracted from the problem section of the detailed description;
designated prior art data storage control means for storing designated prior art data constituting a designated prior art document in which a designated prior art invention is described; and
patent requirement suitability prediction processing means having an inventive step prediction processing unit that generates invention-designated inventive step prediction data on the suitability of the inventive step of the prediction target invention in relation to the designated prior art invention, and a prediction result file generation unit that generates, using the invention-designated inventive step prediction data, a prediction result file indicating the patent requirement suitability of the prediction target invention,
wherein the inventive step prediction processing unit has a document classification unit that classifies document vectors, and
the document classification unit is constructed, by machine learning using a plurality of training data each including a learning document vector and a teacher vector, to classify an input summary movement vector according to whether or not it conforms to the inventive step requirement and to output a requirement suitability document vector according to the classification result, the summary movement vector being a vector corresponding to the difference between the summary vector of the prediction target invention according to each claim and the citation candidate vector according to the designated prior art data.