JP2017167725A

JP2017167725A - Patent requirement adequacy prediction device and patent requirement adequacy prediction program

Info

Publication number: JP2017167725A
Application number: JP2016051019A
Authority: JP
Inventors: 和之白井; Kazuyuki Shirai
Original assignee: Individual
Current assignee: Individual
Priority date: 2016-03-15
Filing date: 2016-03-15
Publication date: 2017-09-21
Anticipated expiration: 2036-03-15
Also published as: JP6188172B1

Abstract

PROBLEM TO BE SOLVED: To effectively reduce a monitoring burden on patent application by predicting the adequacy of patent requirements using contents in conformity to examination practice.SOLUTION: A patent requirement adequacy prediction server includes: gazette data extraction means; gist data extraction means for extracting, regarding a prediction object application identified by prediction object data, gist data allowing the gist of a prediction object invention to be determined from prediction object data; gist data storage means for storing the gist data; and patent requirement adequacy prediction processing means including a novelty prediction processing unit for generating novelty prediction data indicating the requirement adequacy of novelty of the prediction object invention, an inventiveness prediction processing unit for generating inventiveness prediction data indicating the requirement adequacy of inventiveness, and a prediction result file generation unit for generating a prediction result file indicating the patent requirement adequacy of the prediction object invention by using the novelty prediction data and the inventiveness prediction data.SELECTED DRAWING: Figure 4

Description

本発明は、出願公開された特許出願に係る発明について、新規性、進歩性といった特許要件に適合しているか否かの適否を予測する特許要件適否予測装置および特許要件適否予測プログラムに関する。 The present invention relates to a patent requirement conformity prediction apparatus and a patent requirement conformity prediction program for predicting whether or not a patent application such as novelty and inventive step is suitable for the invention relating to a patent application that has been published.

従来、電力需要や株価の予測、商品の購買予測、不動産の将来価格の予測といった様々な場面で予測が行われ、そのための装置や方法も数多く提案されている。発明を特許出願して権利化する権利化業務に関しても、出願された発明に関する特許可能性（特許性、パテンタビリティともいう）を予測する技術が提案されている（例えば、特許文献１参照）。特許文献１には、次のような特許性予測装置が記載されている。この装置は、審査結果通知済の特許出願（既通知出願）を特許データベースから取得して、既通知出願の請求項についての情報量と、類似する先行出願の数を検出し、既通知出願を対象とする回帰分析を実行して、これらから算出した登録予見式にしたがい、審査結果未通知出願の特許性の予測値を算出する。 Conventionally, prediction is performed in various scenes such as prediction of power demand and stock price, prediction of purchase of goods, and prediction of future price of real estate, and many apparatuses and methods have been proposed. A technology for predicting patentability (also referred to as patentability or patentability) relating to an applied invention has also been proposed with respect to a right-giving operation for applying a patent for an invention (see, for example, Patent Document 1). Patent Document 1 describes the following patentability prediction apparatus. This device obtains a patent application (notified application) that has been notified of examination results from the patent database, detects the amount of information about the claims of the already-notified application, and the number of similar prior applications, and The target regression analysis is executed, and the predicted value of the patentability of the application for which the examination result is not notified is calculated according to the registration prediction formula calculated from these.

また、従来、特許可能性の予測のほか、発明の特許性や発明の質、特許出願や特許権の価値を評価する装置や方法も提案されている（例えば、特許文献２，３，４，５，６参照）。 Conventionally, in addition to prediction of patentability, devices and methods for evaluating the patentability and quality of an invention, the value of patent applications and patent rights have also been proposed (for example, Patent Documents 2, 3, 4, and 4). 5 and 6).

特開２００９−２３８０７４号公報JP 2009-238074 A 特開２０１５−２０７１９４号公報JP2015-207194A 特開２０００−１８１９６６号公報JP 2000-181966 A 特開２０００−１３２６０６号公報JP 2000-132606 A 特開２０１５−１８７８８３号公報Japanese Patent Laying-Open No. 2015-187883 特開２００７−１０８８０３号公報JP 2007-108803 A

上記のとおり、従来技術によれば特許出願について特許可能性を予測することや特許権を評価することが可能である。 As described above, according to the prior art, it is possible to predict the patentability and evaluate the patent right for a patent application.

しかし、前述の従来技術、例えば、特許文献１記載の特許性予測装置では、特許性の予測が、既通知出願の請求項についての情報量や、類似する先行出願数といった情報に基づき算出された登録予見式にしたがって行われる。この予測は、請求項の広狭、技術分野の疎密および特許性との間の統計的な相関関係に基づいて行われ、特許法や特許・実用新案審査基準に基づくものではなかった。そのため、特許文献１記載の特許性予測装置では、特許実務に沿った予測結果が得られないおそれが高いという課題があった。 However, in the above-described conventional technology, for example, the patentability prediction apparatus described in Patent Document 1, the patentability prediction is calculated based on information such as the amount of information about the claims of a previously notified application and the number of similar prior applications. This is done according to the registration preview formula. This prediction was made based on a statistical correlation between the breadth of claims, the density of technical fields, and patentability, and was not based on patent law or patent / utility model examination standards. Therefore, the patentability prediction apparatus described in Patent Document 1 has a problem that there is a high possibility that a prediction result in accordance with the patent practice cannot be obtained.

ところで、一般に、他社がどのような特許出願をし、どのような特許権を取得しようとしているのかは、自社にとって重大な関心事である。そのため、特許権に関する権利化業務の中には、自社のみならず、他社の特許出願を監視して、特許権の成否を予測する場面が少なからず存在している。その予測のためには、出願書類を読み込み、先行技術調査を行い、特許要件の適否を判断しなければならないから、とりわけ他社出願の監視負担は相当なものである。 By the way, in general, what kind of patent applications other companies are applying for and what kind of patent rights they are trying to acquire is a serious concern for the company. For this reason, there are not a few cases in which patent rights are obtained by monitoring not only the company's own patent application but also patent applications of other companies. In order to make such predictions, it is necessary to read application documents, conduct prior art searches, and determine whether the patent requirements are appropriate.

この点、従来技術のような特許性予測装置を用いれば、審査結果未通知出願の特許性が予測されるため、権利化業務の中でその予測結果を活用する場面も考えられる。 In this regard, if a patentability prediction apparatus such as the prior art is used, the patentability of an application for which an examination result has not been notified can be predicted.

しかし、特許要件の審査は、特許庁審査官が特許法や特許・実用新案審査基準に沿って行うものであるから（特許法47条）、これらが考慮されない特許性の予測は、審査実務に適合していないおそれがあり、したがって、特許出願の監視負担を軽減するのに有効であるとはいえない。 However, the examination of patent requirements is conducted by the JPO examiner in accordance with the Patent Law and Patent / Utility Model Examination Standards (Patent Law Article 47). May not be compatible and therefore not effective in reducing the monitoring burden of patent applications.

したがって、特許要件に適合しているか否かの予測が審査実務に適合した内容で行われることによって、特許出願の監視負担をできる限り有効に軽減することが望まれていた。 Therefore, it has been desired to reduce the monitoring burden of a patent application as effectively as possible by predicting whether or not it conforms to patent requirements with a content that conforms to examination practice.

本発明は、上記課題を解決するためになされたもので、特許要件の適否に関する予測が審査実務に適合した内容で行われ、特許出願の監視負担を有効に軽減し得る特許要件適否予測装置および特許要件適否予測プログラムを提供することを目的とする。 The present invention has been made in order to solve the above-mentioned problems, and the prediction regarding the propriety of the patent requirement is performed with the content suitable for the examination practice, and the patent requirement conformity prediction device capable of effectively reducing the monitoring burden of the patent application and It is an object of the present invention to provide a patent requirement conformity prediction program.

上記課題を解決するため、本発明は、公開特許公報の電子データが公開公報データとして格納されている公開公報データ記憶部から、指定された番号に応じた予測対象データを抽出する公報データ抽出手段と、その公報データ抽出手段によって抽出された予測対象データによって特定される予測対象出願について、その予測対象出願に開示されている予測対象発明の要旨を特定し得る用語を示す用語データであって、少なくとも各請求項の特徴部分から抽出された特徴部分データおよび発明が解決しようとする課題の欄から抽出された課題データを含むデータを要旨データとして予測対象データから抽出する要旨データ抽出手段と、その要旨データ抽出手段によって抽出された要旨データを記憶する要旨データ記憶手段と、要旨データ記憶手段に記憶されている要旨データを用いて公開公報データを検索し、その検索結果に応じて、予測対象発明の新規性の要件適否を示す新規性予測データを生成する新規性予測処理部と、予測対象発明の進歩性の要件適否を示す進歩性予測データを生成する進歩性予測処理部と、その新規性予測データおよび進歩性予測データを用いて予測対象発明の特許要件適否を示す予測結果ファイルを生成する予測結果ファイル生成部とを有する特許要件適否予測処理手段とを有し、進歩性予測処理部は、公開公報データによって特定される先行技術発明のうち、予測対象発明に最も近い主引用発明を検索する主引用発明検索部と、文書ベクトルの分類を行う文書分類部とを有し、主引用発明検索部は、要旨データ記憶手段に記憶されている要旨データのうちの各請求項の特徴部分データおよび課題データを主検索文書データとして公開公報データを対象とする概念検索を行い、その概念検索の結果、最も類似度が高いとされた最類似文献を主引用発明が開示されている主引用文献とし、文書分類部は、学習文書ベクトルと教師ベクトルとを含む複数の訓練データを用いた機械学習によって、入力される要旨移動ベクトルを進歩性の要件に適合するか否かのいずれかに分類してその分類結果に応じた要件適否文書ベクトルを出力するように構築され、その要旨移動ベクトルは、予測対象出願の各請求項に応じた要旨ベクトルと、最類似文献に応じた引用候補ベクトルとの差分に応じたベクトルである特許要件適否予測装置を特徴とする。 In order to solve the above problems, the present invention provides a publication data extraction means for extracting prediction target data corresponding to a designated number from a publication gazette data storage unit in which electronic data of a published patent publication is stored as publication gazette data. And, for the prediction target application specified by the prediction target data extracted by the publication data extraction means, term data indicating terms that can specify the gist of the prediction target invention disclosed in the prediction target application, Summary data extraction means for extracting from the data to be predicted as summary data data including at least feature part data extracted from the feature part of each claim and problem data extracted from the column of the problem to be solved by the invention; and Abstract data storage means for storing abstract data extracted by the abstract data extraction means, and abstract data storage means A novelty prediction processing unit that searches public gazette data using summary data stored in the database, and generates novelty prediction data indicating whether the novelty requirement of the invention to be predicted is appropriate according to the search result; An inventive step prediction processing unit for generating inventive step prediction data indicating whether or not the requirement of the subject invention is necessary, and a prediction result file indicating the suitability of the patent requirement of the subject invention using the novelty prediction data and the inventive step prediction data A patent requirement suitability prediction processing unit having a prediction result file generation unit to generate, and the inventive step prediction processing unit is a main cited invention closest to the prediction target invention among the prior art inventions specified by the publication data A main citation invention search unit and a document classification unit for classifying document vectors. The main citation invention search unit stores summary data stored in the summary data storage means. The main reference invention is the most similar document that has the highest similarity as a result of the concept search. Whether the document classification unit conforms to the requirement for inventive step by machine learning using a plurality of training data including a learning document vector and a teacher vector. It is constructed so as to be classified into any of the above and a requirement conformity document vector corresponding to the classification result is output, and the abstract movement vector is the abstract vector corresponding to each claim of the prediction target application and the most similar document It is characterized by a patent requirement suitability prediction device which is a vector corresponding to a difference from a citation candidate vector corresponding to.

また、本発明は、公開特許公報の電子データが公開公報データとして格納されている公開公報データ記憶部から、指定された番号に応じた予測対象データを抽出する公報データ抽出手段と、その公報データ抽出手段によって抽出された前記予測対象データによって特定される予測対象出願について、その予測対象出願に開示されている予測対象発明の要旨を特定し得る用語を示す用語データであって、少なくとも各請求項の特徴部分から抽出された特徴部分データおよび発明が解決しようとする課題の欄から抽出された課題データを含むデータを要旨データとして予測対象データから抽出する要旨データ抽出手段と、その要旨データ抽出手段によって抽出された要旨データを記憶する要旨データ記憶手段と、要旨データ記憶手段に記憶されている要旨データを用いて公開公報データを検索し、その検索結果に応じて、予測対象発明の新規性の要件適否を示す新規性予測データを生成する新規性予測処理部と、予測対象発明の進歩性の要件適否を示す進歩性予測データを生成する進歩性予測処理部と、その新規性予測データおよび進歩性予測データを用いて予測対象発明の特許要件適否を示す予測結果ファイルを生成する予測結果ファイル生成部とを有する特許要件適否予測処理手段とを有し、進歩性予測処理部は、公開公報データによって特定される先行技術発明のうち、予測対象発明に最も近い主引用発明を検索する主引用発明検索部と、文書ベクトルの分類を行う文書分類部とを有し、主引用発明検索部は、要旨データ記憶手段に記憶されている要旨データのうちの各請求項の特徴部分データおよび課題データを主検索文書データとして公開公報データを対象とする概念検索を行い、その概念検索の結果、類似度の降順にその類似度が最も高い最類似文献を含む複数の文献を類似文献として抽出し、かつ各その類似文献を主引用発明が開示されている主引用文献とし、文書分類部は、学習文書ベクトルと教師ベクトルとを含む複数の訓練データを用いた機械学習によって、入力される要旨移動ベクトルを進歩性の要件に適合しない可能性が極めて高いクラス、高いクラス、その進歩性の要件に適合するクラスのいずれかに分類してその分類結果に応じた要件適否文書ベクトルを出力するように構築され、その要旨移動ベクトルは、予測対象出願の各請求項に応じた要旨ベクトルと、各類似文献に応じた引用候補ベクトルそれぞれとの差分に応じた複数のベクトルであり、特許要件適否予測処理手段は、文書分類部から出力される複数の要件適否文書ベクトルに応じて、予測対象出願について、進歩性の要件に適合しない可能性を示す非適合率を算出する非適合率算出部を更に有する特許要件適否予測装置を提供する。 Further, the present invention provides a publication data extraction means for extracting prediction target data corresponding to a designated number from a publication gazette data storage unit in which electronic data of a published patent publication is stored as publication publication data, and publication data Term data that indicates terms that can identify the gist of the invention to be predicted disclosed in the application to be predicted, which is specified by the data to be predicted extracted by the extraction means, and at least each claim Abstract data extracting means for extracting data including characteristic part data extracted from the characteristic parts of the present invention and problem data extracted from the problem column to be solved by the invention from the data to be predicted as abstract data, and the abstract data extracting means The abstract data storage means for storing the abstract data extracted by the means and the abstract data storage means The novelty prediction processing unit that searches the public gazette data using the abstract data and generates the novelty prediction data indicating the suitability of the novelty requirement of the prediction target invention according to the search result, and the progress of the prediction target invention A prediction result generation unit that generates the inventive step prediction data indicating whether or not the requirements of the invention are appropriate, and a prediction result file that generates the prediction result file indicating the suitability of the patent requirements of the invention to be predicted using the novelty prediction data and the inventive step prediction data A patent requirement propriety prediction processing means having a file generation unit, and the inventive step prediction processing unit searches for a main cited invention that is closest to the prediction target invention among the prior art inventions specified by the publication data. The invention includes a citation invention search unit and a document classification unit that classifies document vectors, and the main citation invention search unit includes a summary data stored in the summary data storage unit. A concept search is performed on public gazette data using the collection part data and task data as the main search document data, and as a result of the concept search, a plurality of documents including the most similar document with the highest similarity in descending order of similarity Extracted as similar documents, and each similar document as the main cited document in which the main cited invention is disclosed, the document classification unit, by machine learning using a plurality of training data including a learning document vector and a teacher vector, The input summary movement vector is classified into either a class that is highly unlikely to meet the inventive step requirement, a high class, or a class that meets the inventive step requirement, and the requirement conformance document vector according to the classification result The abstract movement vector includes an abstract vector corresponding to each claim of the application to be predicted, and a citation candidate vector corresponding to each similar document. The patent requirement conformity prediction processing means does not conform to the inventive step requirement for the prediction target application according to the plurality of requirement conformity document vectors output from the document classification unit. There is provided a patent requirement conformity prediction apparatus further including a non-conformance rate calculation unit for calculating a non-conformance rate indicating possibility.

上記特許要件適否予測装置の場合、文書分類部は、訓練データとして、学習文書ベクトルが第１の学習文書ベクトルで教師ベクトルが新規性および進歩性の拒絶理由有りを示すベクトルとの組み合わせと、学習文書ベクトルが第２の学習文書ベクトルで教師ベクトルが進歩性の拒絶理由有りで新規性の拒絶理由無しを示すベクトルとの組み合わせと、学習文書ベクトルが第３の学習文書ベクトルで教師ベクトルが進歩性の拒絶理由無しを示すベクトルとの組み合わせが用いられ、第１の学習文書ベクトルは、すでに公開されている公開済出願の中で拒絶理由通知が発行された出願であって、その拒絶理由通知で同じ文献を引用して新規性および進歩性無しの拒絶理由が指摘されていた新規性・進歩性拒絶出願のその拒絶理由が指摘されていた請求項に応じた文書ベクトルと、その拒絶理由で引用されていた文献である第１の主引用刊行物に応じた第１の引用文書ベクトルとの差分に応じた第１の移動文書ベクトルであり、第２の学習文書ベクトルは、公開済出願の中で拒絶理由通知が発行された出願であって、その拒絶理由通知で新規性の拒絶理由は指摘されていないが進歩性の拒絶理由が指摘されていた進歩性拒絶出願のその拒絶理由が指摘されていた請求項に応じた文書ベクトルと、その拒絶理由で主たる刊行物として引用されていた第２の主引用刊行物に応じた第２の引用文書ベクトルとの差分に応じた第２の移動文書ベクトルであり、第３の学習文書ベクトルは、公開済出願の中で拒絶理由通知が発行されずに特許査定が発行された拒絶無し出願または拒絶理由通知が発行された出願であって、その拒絶理由通知で進歩性の拒絶理由が指摘されていなかった進歩性拒絶無し出願の請求項１に応じた文書ベクトルと、拒絶無し出願または進歩性拒絶無し出願を対象とする概念検索の結果、最も類似度が高いとされる学習用最類似文献に応じた非引用文書ベクトルとの差分から求めた第３の移動文書ベクトルである特許要件適否予測装置することができる。 In the case of the above-mentioned patent requirement conformity prediction apparatus, the document classification unit uses training data as a combination of a learning document vector as a first learning document vector and a teacher vector as a vector indicating that there is a reason for rejection of novelty and inventive step, The combination of the document vector is the second learning document vector and the teacher vector is a vector indicating that there is a reason for refusal of inventive step and there is no reason for refusal of novelty, and the learning document vector is the third learning document vector and the teacher vector is inventive. Is used in combination with a vector indicating that there is no reason for refusal, and the first learning document vector is an application for which a notice of reason for refusal has been issued among already published applications, The reason for refusal of a novelty / inventive step rejection application was pointed out by citing the same document and the reason for refusal without novelty and inventive step was pointed out A first moving document vector corresponding to a difference between a document vector corresponding to a request and a first cited document vector corresponding to a first main cited publication which is a document cited for the reason for refusal The second learning document vector is an application for which a notice of reason for refusal has been issued in the published application, and the reason for refusal of novelty is not pointed out in the notice of reason for refusal, but the reason for refusal of inventive step is pointed out The document vector corresponding to the claim in which the reason for refusal of the inventive stepped rejection application was pointed out, and the second corresponding to the second main cited publication cited as the main publication for the reason for refusal A second moving document vector corresponding to a difference from the cited document vector, and the third learning document vector is a non-rejection application in which a patent decision is issued without a notice of reason for refusal being issued in the published application or Issuing reason for rejection A document vector according to claim 1 of a non-inventive refusal application for which the reason for refusal of refusal was not pointed out in the notice of reasons for refusal, and an application without refusal or an application without inventive step refusal As a result of the concept search, it is possible to obtain a patent requirement conformity prediction apparatus that is a third moving document vector obtained from a difference from a non-cited document vector corresponding to the most similar document for learning that has the highest similarity. .

さらに、本発明は、コンピュータを特許要件適否予測装置として機能させるための特許要件適否予測プログラムであって、そのコンピュータを公開特許公報の電子データが公開公報データとして格納されている公開公報データ記憶部から、指定された番号に応じた予測対象データを抽出する公報データ抽出手段と、その公報データ抽出手段によって抽出された予測対象データによって特定される予測対象出願について、その予測対象出願に開示されている予測対象発明の要旨を特定し得る用語を示す用語データであって、少なくとも各請求項の特徴部分から抽出された特徴部分データおよび発明が解決しようとする課題の欄から抽出された課題データを含むデータを要旨データとして予測対象データから抽出する要旨データ抽出手段と、その要旨データ抽出手段によって抽出された要旨データを記憶させる要旨データ記憶制御手段と、要旨データ記憶制御手段の制御によって記憶されている要旨データを用いて公開公報データを検索し、その検索結果に応じて、予測対象発明の新規性の要件適否を示す新規性予測データを生成する新規性予測処理部と、予測対象発明の進歩性の要件適否を示す進歩性予測データを生成する進歩性予測処理部と、その新規性予測データおよび進歩性予測データを用いて予測対象発明の特許要件適否を示す予測結果ファイルを生成する予測結果ファイル生成部とを有する特許要件適否予測処理手段として機能させ、進歩性予測処理部が公開公報データによって特定される先行技術発明のうち、予測対象発明に最も近い主引用発明を検索する主引用発明検索部と、文書ベクトルの分類を行う文書分類部とを有し、主引用発明検索部が要旨データ記憶制御手段の制御によって記憶されている要旨データのうちの各請求項の特徴部分データおよび課題データを主検索文書データとして公開公報データを対象とする概念検索を行い、その概念検索の結果、最も類似度が高いとされた最類似文献を主引用発明が開示されている主引用文献とし、文書分類部が、学習文書ベクトルと教師ベクトルとを含む複数の訓練データを用いた機械学習によって、入力される要旨移動ベクトルを進歩性の要件に適合するか否かのいずれかに分類してその分類結果に応じた要件適否文書ベクトルを出力するように構築され、その要旨移動ベクトルは、予測対象出願の各請求項に応じた要旨ベクトルと、最類似文献に応じた引用候補ベクトルとの差分に応じたベクトルである特許要件適否予測プログラムを提供する。 Furthermore, the present invention is a patent requirement suitability prediction program for causing a computer to function as a patent requirement suitability prediction device, wherein the computer is a public gazette data storage unit in which electronic data of a published patent gazette is stored as public gazette data. From the publication data extraction means for extracting the prediction target data corresponding to the designated number, and the prediction target application specified by the prediction target data extracted by the publication data extraction means, the prediction target application is disclosed. Term data indicating terms that can identify the gist of the invention to be predicted, and at least feature part data extracted from the feature part of each claim and problem data extracted from the problem column to be solved by the invention Abstract data extraction means for extracting the data including the abstract data from the prediction target data, and its The gazette data storage control means for storing the gist data extracted by the essence data extraction means and the gazette data stored by the control of the gist data storage control means are used to search the public gazette data, and according to the search result A novelty prediction processing unit that generates novelty prediction data indicating whether the novelty requirement of the prediction target invention is appropriate; and an inventive step prediction processing unit that generates inventive step prediction data indicating whether the inventive step requirement is appropriate And a prediction result file generation unit that generates a prediction result file that indicates whether or not the patent requirement of the invention to be predicted is satisfied by using the novelty prediction data and the inventive step prediction data. Among the prior art inventions whose processing unit is specified by the publication gazette data, the main citation invention test for searching the main citation invention closest to the prediction target invention Characteristic data and problem data of each claim of the gist data stored in the main citation invention search unit under the control of the gist data storage control means The main search document data is used as a main search document data, and a concept search is performed on public gazette data. As a result of the concept search, the most similar document with the highest similarity is set as the main reference document in which the main cited invention is disclosed, and the document The classification unit classifies the input summary movement vector according to the inventive step requirement by machine learning using a plurality of training data including a learning document vector and a teacher vector, and classifies the classification. It is constructed so as to output a requirement conformity document vector according to the result, and its summary movement vector is a summary vector corresponding to each claim of the prediction target application and a reference corresponding to the most similar document. A patent requirement suitability prediction program that is a vector corresponding to a difference from a candidate vector for use is provided.

以上詳述したように、本発明によれば、特許要件の適否に関する予測が審査実務に適合した内容で行われ、特許出願の監視負担を有効に軽減し得る特許要件適否予測装置および特許要件適否予測プログラムが得られるる。 As described above in detail, according to the present invention, the prediction regarding the propriety of the patent requirement is performed with the content suitable for the examination practice, and the patent requirement conformity prediction device and the patent requirement propriety that can effectively reduce the monitoring burden of the patent application. A prediction program is obtained.

本発明の第１の実施の形態に係る特許要件適否予測サーバを含む特許要件適否予測システムのシステム構成図である。1 is a system configuration diagram of a patent requirement suitability prediction system including a patent requirement suitability prediction server according to a first embodiment of the present invention. 特許要件適否予測サーバの内部の構成を中心に示すブロック図である。It is a block diagram which mainly shows the internal structure of a patent requirement suitability prediction server. ユーザ端末装置の内部の構成を中心に示すブロック図である。It is a block diagram which mainly shows the internal structure of a user terminal device. 本発明の第１の実施の形態に係る特許要件適否予測サーバの主要な構成を示す機能ブロック図である。It is a functional block diagram which shows the main structures of the patent requirement suitability prediction server which concerns on the 1st Embodiment of this invention. 要旨データ抽出部の主要な構成を示す機能ブロック図である。It is a functional block diagram which shows the main structures of an abstract data extraction part. 本発明の第１の実施の形態に係る特許要件適否予測処理部の主要な構成を示す機能ブロック図である。It is a functional block diagram which shows the main structures of the patent requirement suitability prediction process part which concerns on the 1st Embodiment of this invention. 入力ベクトル生成部の主要な構成を示す機能ブロック図である。It is a functional block diagram which shows the main structures of an input vector production | generation part. 要旨データ記憶部のレコードレイアウトの一例を示す図である。It is a figure which shows an example of the record layout of a summary data storage part. ＣＴデータ記憶部のレコードレイアウトの一例を示す図である。It is a figure which shows an example of the record layout of CT data storage part. 予測結果記憶部のレコードレイアウトの一例を示す図である。It is a figure which shows an example of the record layout of a prediction result memory | storage part. 機械学習部のネットワーク構造の一例を示す図である。It is a figure which shows an example of the network structure of a machine learning part. 特許要件適否予測処理の動作手順の一例を示すフローチャートである。It is a flowchart which shows an example of the operation | movement procedure of a patent requirement suitability prediction process. 特許要件適否予測ルーチンの動作手順の一例を示すフローチャートである。It is a flowchart which shows an example of the operation | movement procedure of a patent requirement suitability prediction routine. 新規性・拡大先願予測ルーチンの動作手順の一例を示すフローチャートである。It is a flowchart which shows an example of the operation | movement procedure of a novelty and expansion prior application prediction routine. 拡大先願予測ルーチンの動作手順の一例を示すフローチャートである。It is a flowchart which shows an example of the operation | movement procedure of an expansion prior application prediction routine. 進歩性予測ルーチンの動作手順の一例を示すフローチャートである。It is a flowchart which shows an example of the operation | movement procedure of an inventive step prediction routine. 単一独立項ルーチンの一例を示すフローチャートである。It is a flowchart which shows an example of a single independent term routine. 複数独立項ルーチンの動作手順の一例を示すフローチャートである。It is a flowchart which shows an example of the operation | movement procedure of a multiple independent term routine. 独立項検索処理の一例を示すフローチャートである。It is a flowchart which shows an example of an independent term search process. 主引用発明検索処理の一例を示すフローチャートである。It is a flowchart which shows an example of main citation invention search processing. 副引用発明検索処理の一例を示すフローチャートである。It is a flowchart which shows an example of a subcitation invention search process. 従属項検索処理の一例を示すフローチャートである。It is a flowchart which shows an example of a dependent term search process. 審査対象となる特許出願と、複数の特許公開公報との関係を模式的に示した図である。It is the figure which showed typically the relationship between the patent application used as examination object, and several patent publications. 独立項テーブルのレコードレイアウトの一例を示す図である。It is a figure which shows an example of the record layout of an independent item table. 特許要件適否予測リストの一例を示す図である。It is a figure which shows an example of a patent requirement suitability prediction list. 本発明の第２の実施の形態に係る特許要件適否予測サーバの主要な構成を示す機能ブロック図である。It is a functional block diagram which shows the main structures of the patent requirement suitability prediction server which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施の形態に係る特許要件適否予測処理部の主要な構成を示す機能ブロック図である。It is a functional block diagram which shows the main structures of the patent requirement suitability prediction process part which concerns on the 2nd Embodiment of this invention. 同じく入力ベクトル生成部の主要な構成を示す機能ブロック図である。It is a functional block diagram which similarly shows the main structures of an input vector production | generation part. 同じく予測結果記憶部のレコードレイアウトの一例を示す図である。It is a figure which similarly shows an example of the record layout of a prediction result memory | storage part. 同じく特許要件適否予測リストの一例を示す図である。It is a figure which similarly shows an example of a patent requirement suitability prediction list. 変形例に係る特許要件適否予測処理部の主要な構成を示す機能ブロック図である。It is a functional block diagram which shows the main structures of the patent requirement suitability prediction process part which concerns on a modification.

以下、本発明の実施の形態について図面を参照しつつ説明する。なお、同一要素には同一符号を用い、重複する説明は省略する。 Embodiments of the present invention will be described below with reference to the drawings. In addition, the same code | symbol is used for the same element and the overlapping description is abbreviate | omitted.

第１の実施の形態
(特許要件適否予測システムの全体構成）
まず、本発明の第１の実施の形態に係る特許要件適否予測サーバ１０を含む特許要件適否予測システム１の構成について説明する。 First embodiment
(Overall structure of patent requirement conformity prediction system)
First, the configuration of the patent requirement suitability prediction system 1 including the patent requirement suitability prediction server 10 according to the first embodiment of the present invention will be described.

図１は特許要件適否予測システム１のシステム構成図である。図１に示すように、特許要件適否予測システム１は、特許要件適否予測サーバ１０と、ユーザが操作する複数のユーザ端末装置３０（図１では、固定端末装置３０Ａ、３０Ｂ、３０Ｃ）とを有し、これらがインターネットＮ１を介して互いに接続される構成を有している。 FIG. 1 is a system configuration diagram of a patent requirement suitability prediction system 1. As shown in FIG. 1, the patent requirement suitability prediction system 1 has a patent requirement suitability prediction server 10 and a plurality of user terminal devices 30 (in FIG. 1, fixed terminal devices 30A, 30B, and 30C) operated by the user. These are connected to each other via the Internet N1.

特許要件適否予測サーバ１０は、特許要件適否予測プログラムにしたがったデータ処理を行い、公開済の特許出願（以下「公開済出願」という）のうち、ユーザが指定した出願公開番号（または出願番号）に応じた特許出願（予測対象出願）について、特許要件（本実施形態では、新規性（特許法第２９条第１項３号）、拡大先願（特許法第２９条の２）および進歩性（特許法第２９条第２項））に適合しているか否かを予測する。ユーザ端末装置３０は、特許要件適否予測サーバ１０との間でデータの受信または送信を行う。 The patent requirement conformity prediction server 10 performs data processing according to the patent requirement conformity prediction program, and among the published patent applications (hereinafter referred to as “published applications”), the application publication number (or application number) designated by the user. For patent applications (applications to be predicted) according to the requirements, patent requirements (in this embodiment, novelty (Patent Act Article 29, Paragraph 1, Item 3), extended prior application (Patent Act Article 29-2) and inventive step) (Patent Act Article 29 Paragraph 2)) is predicted. The user terminal device 30 receives or transmits data with the patent requirement suitability prediction server 10.

特許要件適否予測システム１では、特許要件適否予測サーバ１０が、予測対象出願について、後述する主引用発明検索および副引用発明検索を行うとともに、進歩性違反の拒絶理由が見つかる可能性が高いか低いかを審査実績に基づく複数の訓練データで機械学習を行った人工知能プログラムで予測して、特許要件の適否を予測する。特許要件適否予測サーバ１０によって、特許要件の適否の予測が審査実務に適合した内容で行われるため、特許出願の監視負担を有効に軽減することができる。 In the patent requirement suitability prediction system 1, the patent requirement suitability prediction server 10 performs a main citation invention search and a sub-citation invention search, which will be described later, for a prediction target application, and also has a high or low possibility of finding a reason for refusal of inventive step violation. This is predicted by an artificial intelligence program that performs machine learning with a plurality of training data based on examination results, and predicts the suitability of patent requirements. Since the patent requirement conformity prediction server 10 predicts the suitability of the patent requirement with the content suitable for the examination practice, the monitoring burden of the patent application can be effectively reduced.

（特許要件適否予測サーバ１０の構成）
次に、図２を参照して特許要件適否予測サーバ１０の構成について説明する。図２は、特許要件適否予測サーバ１０の内部の構成を中心に示すブロック図である。特許要件適否予測サーバ１０は、公開済出願の特許要件適否の予測に関するサービスを提供する専門事業者が運用するサーバである。 (Configuration of Patent Requirements Compliance Predicting Server 10)
Next, the configuration of the patent requirement suitability prediction server 10 will be described with reference to FIG. FIG. 2 is a block diagram centering on the internal configuration of the patent requirement suitability prediction server 10. The patent requirement conformity prediction server 10 is a server operated by a specialized business operator that provides a service relating to prediction of patent requirement conformance of a published application.

特許要件適否予測サーバ１０は、ＣＰＵ（Central Processing Unit）１１と、ＲＯＭ(Read Only Memory)１２と、ＲＡＭ（Random Access Memory）１３とを有している。ＣＰＵ１１は、ＲＯＭ１２に記憶されているプログラムにしたがい作動して、ＫＢＣ（Key board controller）１７を介してキーボード１９やマウス２０の操作入力で得られる入力データをメインバス１９Ａを介して入力する一方、他の構成要素との信号の入出力を行い、特許要件適否予測サーバ１０全体の動作制御を行う。ＣＰＵ１１は、後述する特許要件適否予測プログラムにしたがい、後述する公報データ抽出部１０１、要旨データ抽出部１０２、特許要件適否予測処理部１０３、対象公報抽出部１０４、予測結果編集処理部１０５としての動作を行う。ＲＯＭ１２には、特許要件適否予測プログラム等のＣＰＵ１１が実行する制御プログラムと、恒久的なデータが記憶されている。ＲＡＭ１３にはＣＰＵ１１が作動する際に用いるデータやプログラムが記憶されている。 The patent requirement conformity prediction server 10 includes a CPU (Central Processing Unit) 11, a ROM (Read Only Memory) 12, and a RAM (Random Access Memory) 13. The CPU 11 operates in accordance with a program stored in the ROM 12 and inputs input data obtained by operation input of the keyboard 19 and the mouse 20 via the KBC (Key board controller) 17 via the main bus 19A. Input / output of signals to / from other components is performed to control the operation of the patent requirement conformity prediction server 10 as a whole. The CPU 11 operates as a gazette data extraction unit 101, a gist data extraction unit 102, a patent requirement suitability prediction processing unit 103, a target gazette extraction unit 104, and a prediction result editing processing unit 105, which will be described later, according to a patent requirement suitability prediction program described later. I do. The ROM 12 stores a control program executed by the CPU 11 such as a patent requirement suitability prediction program and permanent data. The RAM 13 stores data and programs used when the CPU 11 operates.

そのほか特許要件適否予測サーバ１０は、ハードディスク装置（Hard disk drive,ＨＤＤ）１４と、通信制御部１５と、通信処理部１６と、ビデオコントローラ１８とを有している。 In addition, the patent requirement compliance prediction server 10 includes a hard disk drive (HDD) 14, a communication control unit 15, a communication processing unit 16, and a video controller 18.

ハードディスク装置１４には、特許要件適否予測プログラムの実行に必要な図４に示す各種記憶部またはＤＢ（database）と、その他の記憶部またはＤＢが形成されている。ハードディスク装置１４には、指定ナンバトランザクション記憶部（指定ナンバＴＲ記憶部）１５１と、予測対象トランザクション記憶部（予測対象ＴＲ記憶部）１５２と、要旨データ記憶部１５３と、クレームツリーデータ記憶部１５４と、対象公報記憶部１５５と、予測結果ファイル記憶部１５６とが形成されている。各記憶部またはＤＢについては後述する。 The hard disk device 14 is formed with various storage units or DBs (database) shown in FIG. 4 and other storage units or DBs necessary for executing the patent requirement suitability prediction program. The hard disk device 14 includes a designated number transaction storage unit (designated number TR storage unit) 151, a prediction target transaction storage unit (prediction target TR storage unit) 152, a gist data storage unit 153, and a claim tree data storage unit 154. A target publication storage unit 155 and a prediction result file storage unit 156 are formed. Each storage unit or DB will be described later.

通信制御部１５は、ＣＰＵ１１の指示にしたがい作動して、ユーザ端末装置３０や、図示しない特許庁サーバとの通信を行うための回線の接続および切断を制御する。通信処理部１６は、通信制御部１５の指示にしたがい作動して、インターネットＮ１を介して行われるデータの送受信を実行する。 The communication control unit 15 operates in accordance with an instruction from the CPU 11 to control connection and disconnection of a line for performing communication with the user terminal device 30 and a patent office server (not shown). The communication processing unit 16 operates according to an instruction from the communication control unit 15 and executes data transmission / reception performed via the Internet N1.

ビデオコントローラ１８は、図示しないディスプレイ装置における画像表示を制御して、各種の設定に用いられる画面等を表示させる。 The video controller 18 controls image display on a display device (not shown) to display a screen used for various settings.

そして、ハードディスク装置１４の各種記憶部またはＤＢについて説明すると次のとおりである。指定ナンバＴＲ記憶部１５１には、ユーザ端末装置３０から送信されるユーザ特定に必要なデータ（例えば、会員ＩＤ）と、そのユーザが指定した出願公開番号または出願番号（これらを「指定番号」という）とが記憶されている。予測対象ＴＲ記憶部１５２には、公報データ抽出部１０１が公開公報ＤＢ１５０から指定番号に応じて抽出した予測対象出願の出願書類電子データ（予測対象データ）が記憶されている。公開公報ＤＢ１５０は、公開特許公報の電子データが公開公報データとして格納されている。公開公報ＤＢ１５０として、図４では、工業所有権情報・研修館により運営されている特許情報プラットフォーム（Ｊ−ＰｌａｔＰａｔ）のデータベースまたはそこからダウンロードした電子データを記憶しているデータベースを想定している。後者のデータベースは、図示しないサーバに格納することができるし、ＨＤＤ１４に格納してもよい。 The various storage units or DBs of the hard disk device 14 will be described as follows. In the designated number TR storage unit 151, data (for example, a member ID) necessary for user identification transmitted from the user terminal device 30 and an application publication number or application number designated by the user (these are referred to as “designated numbers”). ) Is stored. The prediction target TR storage unit 152 stores application document electronic data (prediction target data) of the prediction target application extracted by the gazette data extraction unit 101 from the publication gazette DB 150 according to the designated number. The public gazette DB 150 stores the electronic data of the published patent gazette as public gazette data. As the public gazette DB 150, FIG. 4 assumes a database of a patent information platform (J-PlatPat) operated by the industrial property information / training hall or a database storing electronic data downloaded therefrom. The latter database can be stored in a server (not shown) or may be stored in the HDD 14.

要旨データ記憶部１５３には、要旨データ抽出部１０２が抽出・生成した要旨データが記憶されている。要旨データは、予測対象出願に開示されている発明（予測対象発明）の要旨を特定し得る用語を示す用語データであって、少なくとも後述する特徴部分データと課題データとが含まれている。 The summary data storage unit 153 stores summary data extracted and generated by the summary data extraction unit 102. The gist data is term data indicating terms that can specify the gist of the invention (prediction target invention) disclosed in the prediction target application, and includes at least characteristic part data and problem data described later.

要旨データ記憶部１５３には、例えば図８に示すように、データ種別エリア１５３ａ、項番エリア１５３ｂ、用語記憶部１５３ｃを有するレコードが記憶されている。データ種別エリア１５３ａは、各レコードに記憶されているデータが予測対象出願のどの部分のデータであるのかを示すデータ（「データ種別」という）が記憶されている。本実施の形態では、データ種別として、"Ｃ"、"Ｐ"、"Ｔ"、"Ｄ"の４種類が設定されている。"Ｃ"は特許請求の範囲のデータ、"Ｐ"は発明が解決しようとする課題の欄のデータ、"Ｔ"は技術分野の欄のデータ、"Ｄ"は発明の実施の形態の欄のデータをそれぞれ示している。項番エリア１５３ｂには、請求項の番号が記憶されている。用語記憶部１５３ｃは、用語エリア１５３ｃ１、展開度エリア１５３ｃ２および必須フラグエリア１５３ｃ３を有している。図８では、これらの組み合わせが１５通り用意されているが、この組み合わせは１５通りより多くてもよいし、少なくてもよい。そして、用語エリア１５３ｃ１，展開度エリア１５３ｃ２，必須フラグエリア１５３ｃ３には、それぞれ要旨の特定に用いられる用語、後述する展開度（Ｅｄ）、必須フラグ（Ｅｆ）が記憶されている。 For example, as shown in FIG. 8, the abstract data storage unit 153 stores a record having a data type area 153a, an item number area 153b, and a term storage unit 153c. The data type area 153a stores data (referred to as “data type”) indicating which part of the prediction target application the data stored in each record is. In the present embodiment, four types, “C”, “P”, “T”, and “D”, are set as data types. “C” is data in the claims, “P” is data in the column of the problem to be solved by the invention, “T” is data in the column of the technical field, and “D” is in the column of the embodiment of the invention Each data is shown. The number of the claim is stored in the item number area 153b. The term storage unit 153c has a term area 153c1, a development degree area 153c2, and an essential flag area 153c3. In FIG. 8, 15 combinations of these are prepared, but there may be more or less than 15 combinations. In the term area 153c1, the development degree area 153c2, and the essential flag area 153c3, terms used for specifying the gist, a later-described development degree (Ed), and an essential flag (Ef) are stored.

図８では、一例として、特開２００８−６２２８２号公報から抽出した要旨データ（上半分のデータ）と、特開２０１１−１８６７３５号公報から抽出した要旨データ（下半分のデータ）とが記載されている。前者は、独立形式で記載されている請求項（独立項）が１つの場合、後者は独立項が複数の場合の例示である。 In FIG. 8, as an example, summary data (upper half data) extracted from JP 2008-62282 A and summary data (lower half data) extracted from JP 2011-186735 A are described. Yes. The former is an example when there is one claim (independent claim) described in an independent form, and the latter is an example when there are a plurality of independent claims.

クレームツリーデータ記憶部（ＣＴデータ記憶部）１５４には、後述するクレームツリーデータ（claim tree データ、ＣＴデータともいう）が記憶されている。ＣＴデータ記憶部１５４には、例えば図９（Ａ）に示すように、独立区分エリア１５４ａと、ナンバエリア１５４ｂと、ＭＡＸ区分エリア１５４ｃと、従属項エリア１５４ｄと、サーチフラグエリア１５４ｅを含むレコードが記憶されている。 The claim tree data storage unit (CT data storage unit) 154 stores claim tree data (also referred to as claim tree data or CT data) to be described later. In the CT data storage unit 154, for example, as shown in FIG. 9A, a record including an independent division area 154a, a number area 154b, a MAX division area 154c, a dependent item area 154d, and a search flag area 154e. It is remembered.

独立区分エリア１５４ａには、予測対象出願の各請求項が独立項か、従属形式で記載されている請求項（従属項）のいずれであるかを示す独立区分（独立項がスペース、従属項が"Ｄ"）が記憶されている。ナンバエリア１５４ｂには各請求項の番号（請求項ナンバ）が記憶されている。ＭＡＸ区分エリア１５４ｃには、ＭＡＸ区分が記憶されている。ＭＡＸ区分には、同じ独立項を引用する従属項が複数あった場合の最も番号の大きい請求項（最大従属項）に"Ｍ"、それ以外にスペースがセットされている。 In the independent division area 154a, an independent division indicating whether each claim of the prediction target application is an independent claim or a claim (dependent claim) described in a dependent format (independent claim is space, dependent claim is "D") is stored. In the number area 154b, the number of each claim (claim number) is stored. The MAX division area 154c stores a MAX division. In the MAX section, “M” is set in the claim with the largest number (maximum dependent claim) when there are a plurality of dependent claims that refer to the same independent term, and a space is set in addition to that.

図９（Ａ）は、特開２００８−６２２８２号公報のＣＴデータを示しているが、該公報では、従属項の中で請求項９が最大従属項なので、請求項ナンバが"９"のレコードのＭＡＸ区分に"Ｍ"、それ以外の請求項ナンバのＭＡＸ区分にスペースがセットされている。また、図９（Ｂ）は、特開２０１１−１８６７３５号公報のＣＴデータを示しているが、該公報では、請求項１、請求項６が独立項であり、請求項５、請求項７が最大従属項なので、請求項ナンバが"５"のレコードと、"７"のレコードのＭＡＸ区分に"Ｍ"、これら以外のＭＡＸ区分にスペースがセットされている。 FIG. 9A shows CT data of Japanese Patent Application Laid-Open No. 2008-62282. In this publication, since claim 9 is the maximum dependent claim among the dependent claims, the record whose claim number is “9”. "M" is set in the MAX section, and a space is set in the MAX section of the other claim numbers. FIG. 9B shows CT data of Japanese Patent Application Laid-Open No. 2011-186735. In this publication, claims 1 and 6 are independent claims, and claims 5 and 7 are Since it is the maximum dependent term, “M” is set in the MAX division of the record whose claim number is “5” and “7”, and a space is set in the MAX division other than these.

従属項エリア１５４ｄには、従属項が引用している請求項の番号が記憶されている。サーチフラグエリア１５４ｅには、サーチフラグ、すなわち、後述する主引用発明検索が実行済である否かの区分が記憶されている。スペースは主引用発明検索の実行前、"９"は実行済を示している。 In the dependent claim area 154d, the number of the claim cited by the dependent claim is stored. The search flag area 154e stores a search flag, that is, a classification indicating whether or not a later-described main citation invention search has been executed. A space indicates before execution of the main cited invention search, and “9” indicates execution completed.

対象公報記憶部１５５には、主引用発明検索および副引用発明検索の対象とされる公開公報データ（検索対象公報データ）が記憶されている。予測結果記憶部１５６には、図１０に示すような後述する予測結果ファイルが記憶されている。 The target publication storage unit 155 stores public gazette data (search target publication data) to be searched for the main citation invention search and the sub-citation invention search. The prediction result storage unit 156 stores a prediction result file described later as shown in FIG.

続いて、要旨データ抽出部１０２、特許要件適否予測処理部１０３について説明する。要旨データ抽出部１０２は、図５に示すように、候補抽出部１１１、要部データ抽出部１１２、ＣＴデータ生成部１１３、テキスト分析・用語抽出部１１４、展開度・必須要件分析部１１５、係り受け解析部１１６、パターンデータ抽出部１１７およびファイル生成部１１８を有している。なお、図示の都合上、図５では、詳細な説明データ記憶部（詳細な説明ＴＲ）１６０、請求の範囲データ記憶部（請求の範囲ＴＲ）１６１、要部データ記憶部（要部データＴＲ）１６２が要旨データ抽出部１０２に含まれているが、これらはデータ記憶手段であるＨＤＤ１４に設けられている。 Next, the summary data extraction unit 102 and the patent requirement suitability prediction processing unit 103 will be described. As shown in FIG. 5, the summary data extraction unit 102 includes a candidate extraction unit 111, a main data extraction unit 112, a CT data generation unit 113, a text analysis / term extraction unit 114, a development level / essential requirement analysis unit 115, A receiving analysis unit 116, a pattern data extraction unit 117, and a file generation unit 118 are included. For convenience of illustration, in FIG. 5, a detailed explanation data storage unit (detailed description TR) 160, a claim data storage unit (claims TR) 161, a main data storage unit (main data TR) 162 are included in the summary data extraction unit 102, and these are provided in the HDD 14 serving as data storage means.

候補抽出部１１１は、予測対象ＴＲ記憶部１５２に記憶されている予測対象データを読み込み、そこから不要なデータをスキップ（読み飛ばし）して要旨データ作成に必要なデータを抽出し、抽出後のデータを詳細な説明データ（明細書データ）と、特許請求の範囲データとに分けて、それぞれ詳細な説明ＴＲ１６０、請求の範囲ＴＲ１６１に記憶させる。ここでは、「前記」、「該」、「当該」と、段落番号がスキップされる。 The candidate extraction unit 111 reads the prediction target data stored in the prediction target TR storage unit 152, skips unnecessary data from the prediction target data, extracts data necessary for the creation of summary data, The data is divided into detailed explanation data (specification data) and claims data, and stored in the detailed explanation TR160 and claims TR161, respectively. Here, “the above”, “the”, “the” and the paragraph numbers are skipped.

要部データ抽出部１１２は、予測対象ＴＲ記憶部１５２に記憶されている予測対象データを読み込み、そこから明細書の要部に相当する部分のデータ（要部データ）を抽出し、抽出した要部データを要部データ記憶部１６２に記憶させる。ここでは、要部データとして、明細書中の「発明の名称」の欄のデータ（発明の名称）および「技術分野」の欄のデータと、「発明が解決しようとする課題」の欄の「本発明」または「この発明」の文字列を含む一文のデータとを抽出する。 The main part data extraction unit 112 reads the prediction target data stored in the prediction target TR storage unit 152, extracts the data (main part data) corresponding to the main part of the specification, and extracts the extracted main data. The part data is stored in the main part data storage unit 162. Here, as the main part data, the data in the “Title of Invention” column (name of the invention) and the data in the “Technical Field” column of the specification, and the “Problem to be Solved by the Invention” column are displayed. One sentence data including the character string of the present invention or the present invention is extracted.

ＣＴデータ生成部１１３は、予測対象ＴＲ記憶部１５２に記憶されている予測対象データを読み込み、そのうちの特許請求の範囲の欄に記載されているデータを読み込んで前述したクレームツリーデータ（ＣＴデータ）を生成し、それＣＴデータ記憶部１５４に記憶させる。 The CT data generation unit 113 reads the prediction target data stored in the prediction target TR storage unit 152, reads the data described in the column of the claims, and the above-described claim tree data (CT data) Is generated and stored in the CT data storage unit 154.

テキスト分析・用語抽出部１１４は、詳細な説明ＴＲ１６０、請求の範囲ＴＲ１６１からそれぞれテキストデータを入力し、そのそれぞれについて、特徴語を抽出して（特許請求の範囲は請求項ごと）、各特徴語を重要とされる順序で出力する。この場合、例えば、形態素解析またはＮ−Ｇｒａｍなどの索引文字列抽出処理を実行して、各単語の出現頻度、各単語の共起頻度を調べ、その結果に応じて各特徴語を出力する。 The text analysis / term extraction unit 114 inputs text data from the detailed description TR160 and the claims TR161, extracts feature words for each of them (the claims are for each claim), and each feature word Are output in the important order. In this case, for example, index character string extraction processing such as morphological analysis or N-Gram is executed to check the appearance frequency of each word and the co-occurrence frequency of each word, and output each feature word according to the result.

展開度・必須要件分析部１１５は、請求の範囲ＴＲ１６１から特許請求の範囲データを読み込んで，テキスト分析・用語抽出部１１４で抽出された各特徴語について、展開度と、必須要件に該当するか否かとを調べ、その結果を出力する。ここで、展開度（Ｅｄ）とは、各特徴語がいくつの請求項に展開されているのか（用いられているのか）、展開されている請求項の個数を示すデータである。一般に、特許出願の出願書類では、できるだけ広い範囲の発明思想がカバーされるように、より重要な事項が請求項１（または他の独立項）に広い範囲で記載され、そこから下位の請求項に段階的に範囲を縮小（具体化）されながら記載されることが多い。そのため、展開度（Ｅｄ）が大きいほど、重要度がより高いと考えられるので、展開度は発明の要旨を把握するのに有益な情報と考えられる。例えば、図８に示すデータ種別"Ｃ"、項番"１"の用語エリア１５３ｃ１が"用語１"のエリアに「パンチ」という用語がセットされているが、この「パンチ」という用語は、特開２００８−６２２８２号公報の特許請求の範囲において、請求項１、２、３、４、５に記載されているので、展開度エリア１５３ｃ２に"５"がセットされている。 The development level / essential requirement analysis unit 115 reads the claim data from the claim range TR161, and for each feature word extracted by the text analysis / term extraction unit 114, whether the development level and the essential requirements are met. Check whether or not, and output the result. Here, the degree of expansion (Ed) is data indicating how many claims each feature word is expanded (used) and the number of claims expanded. In general, in an application for a patent application, more important matters are set forth in claim 1 (or other independent claims) in a broad range so that the broadest possible inventive concept is covered, and claims subordinate thereto. In many cases, it is described while the range is reduced (embraced) step by step. Therefore, it is considered that the greater the degree of development (Ed), the higher the degree of importance. Therefore, the degree of development is considered to be useful information for grasping the gist of the invention. For example, the term “punch” is set in the area of “term 1” in the term area 153c1 of data type “C” and item number “1” shown in FIG. In the scope of claims of Japanese Patent Application Laid-Open No. 2008-62282, since it is described in claims 1, 2, 3, 4, and 5, "5" is set in the expansion degree area 153c2.

必須要件に該当するか否かは、各請求項の特徴部分に記載されているか否かであって、必須フラグ（Ｅｆ）によって示されている。本件出願に係る発明の実施の形態では、各請求項における最終段落、または「ことを特徴とする」の文字列を含む一文を各請求項の特徴部分としていて、そのデータが特徴部分データであり、ここに含まれている用語が必須要件を満たすものとしている。特徴部分から抽出された用語には、必須要件を満たすことを示す"Ｘ"が必須フラグ（Ｅｆ）にセットされる。 Whether or not it meets the essential requirements is whether or not it is described in the characteristic part of each claim and is indicated by an essential flag (Ef). In the embodiment of the invention according to the present application, the last paragraph in each claim or a sentence including the character string of “characteristic” is used as the feature part of each claim, and the data is the feature part data. , The terms included here shall meet the essential requirements. For the term extracted from the feature portion, “X” indicating that the essential requirement is satisfied is set in the essential flag (Ef).

係受け解析部１１６は、要部データ記憶部１６２に記憶されている要部データについて係受け解析を行い、その結果をパターンデータ抽出部１１７に出力する。 The dependency analysis unit 116 performs dependency analysis on the main data stored in the main data storage unit 162 and outputs the result to the pattern data extraction unit 117.

パターンデータ抽出部１１７は、係受け解析部１１６の解析結果を入力して、ひらがなの「を」の直前の名詞と、それに対応した動詞の組み合わせとなる文字列と、発明の名称のうち、先頭に記載されているもの（筆頭名称）を出力する。例えば、特開２００８−６２２８２号公報の「発明が解決しようとする課題」の欄の「本発明」の文字列を含む一文の中に、ひらがなの「を」の直前の名詞と、それに対応した動詞の組み合わせとして、「調整」および「行わず」と、「同心性」および「得る」と、「精密打ち抜き型」および「提供」がある。これらがパターンデータ抽出部１１７から出力される。本実施の形態において、パターンデータ抽出部１１７から出力されるデータのうち、「発明が解決しようとする課題」の欄から抽出されたデータが課題データに相当していて、例えば図８のデータ種別"Ｐ"のレコードのようなデータとすることができる。 The pattern data extraction unit 117 inputs the analysis result of the dependency analysis unit 116, and the noun immediately before the hiragana character “O”, the character string that is the combination of the corresponding verb, and the beginning of the name of the invention (First name) is output. For example, in a sentence including the character string “present invention” in the column “Problem to be Solved by” of Japanese Patent Application Laid-Open No. 2008-62282, the noun immediately before “O” in hiragana and the corresponding Verb combinations include “adjustment” and “do not do”, “concentricity” and “obtain”, “precision punching” and “provide”. These are output from the pattern data extraction unit 117. In the present embodiment, among the data output from the pattern data extraction unit 117, the data extracted from the “problem to be solved by the invention” column corresponds to the problem data. For example, the data type of FIG. Data such as a record of “P” can be used.

ファイル生成部１１８はテキスト分析・用語抽出部１１４、展開度・必須要件分析部１１５およびパターンデータ抽出部１１７から出力されるデータを用いて要旨データを生成し、要旨データ記憶部１５３に記憶させる。この場合、テキスト分析・用語抽出部１１４および展開度・必須要件分析部１１５の出力データを用いて、データ種別"Ｃ"、"Ｄ"のデータが生成され、パターンデータ抽出部１１７の出力データを用いて、データ種別"Ｐ"、"Ｔ"のデータが生成される。 The file generation unit 118 generates summary data using data output from the text analysis / term extraction unit 114, the development level / essential requirement analysis unit 115, and the pattern data extraction unit 117, and stores the summary data in the summary data storage unit 153. In this case, data of the data types “C” and “D” is generated using the output data of the text analysis / term extraction unit 114 and the development level / essential requirement analysis unit 115, and the output data of the pattern data extraction unit 117 is used as the output data. As a result, data of data types “P” and “T” is generated.

特許要件適否予測処理部１０３は、図６に示すように、新規性・拡大先願予測処理部１２５と、進歩性予測処理部１２６とを有している。新規性・拡大先願予測処理部１２５は、要旨データ記憶部１５３に記憶されている要旨データを検索タームに用いて対象公報記憶部１５５の検索対象公報データの全文検索を行い、その結果にしたがい、新規性・拡大先願予測データＮｄを予測結果ファイル生成部１２７に出力する。新規性・拡大先願予測処理部１２５の機能、動作手順については、後に詳しく説明する。 As shown in FIG. 6, the patent requirement suitability prediction processing unit 103 includes a novelty / expansion prior application prediction processing unit 125 and an inventive step prediction processing unit 126. The novelty / expanded prior application prediction processing unit 125 uses the summary data stored in the summary data storage unit 153 as a search term to perform a full text search of the search target publication data in the target publication storage unit 155, and according to the result. The novelty / expansion prior application prediction data Nd is output to the prediction result file generation unit 127. The function and operation procedure of the novelty / expansion earlier application prediction processing unit 125 will be described in detail later.

進歩性予測処理部１２６は、引用発明検索部１３１と、入力ベクトル生成部１３２と、機械学習部１３３とを有している。引用発明検索部１３１は、後述する主引用発明検索を行う主引用発明検索部および副引用発明検索を行う副引用発明検索部を有している。また、引用発明検索部１３１は、主引用発明検索および副引用発明検索の結果にしたがい、進歩性予測データＶｄ１を予測結果ファイル生成部１２７に出力し、検索の対象となった請求項に応じた請求項要旨データｉｅｄと概念検索データＶｄ２を入力ベクトル生成部１３２に出力する。概念検索データＶｄ２には、概念検索の結果、最も類似度が高いとされた文献（最類似文献）の公開公報データが含まれている。引用発明検索部１３１の機能、動作手順については、後に詳しく説明する。 The inventive step prediction processing unit 126 includes a cited invention search unit 131, an input vector generation unit 132, and a machine learning unit 133. The cited invention search unit 131 has a main cited invention search unit that performs a main cited invention search and a subcited invention search unit that performs a subcited invention search, which will be described later. In addition, the cited invention search unit 131 outputs the inventive step prediction data Vd1 to the prediction result file generation unit 127 according to the results of the main cited invention search and the sub-cited invention search, and responds to the claim that is the target of the search. Claim summary data ied and concept search data Vd 2 are output to input vector generation section 132. The concept search data Vd2 includes publication gazette data of a document (the most similar document) that is determined to have the highest similarity as a result of the concept search. The function and operation procedure of the cited invention search unit 131 will be described in detail later.

入力ベクトル生成部１３２は、図７に示すように、要旨ベクトル生成部１３２ａと、引用候補ベクトル生成部１３２ｂと、移動ベクトル生成部１３２cとを有している。 As shown in FIG. 7, the input vector generation unit 132 includes a summary vector generation unit 132a, a citation candidate vector generation unit 132b, and a movement vector generation unit 132c.

要旨ベクトル生成部１３２ａは、請求項要旨データｉｅｄを入力してその特徴語を抽出し、各語に応じた重み付けを行って、各請求項の記載に応じた要旨ベクトルＥＶを生成する。引用候補ベクトル生成部１３２ｂは、概念検索データＶｄ２に含まれる最類似文献の公開公報データを入力してその特徴語を抽出し、各語に応じた重み付けを行って最類似文献に応じた文書ベクトル（引用候補ベクトル）ＲｆＶを生成する。移動ベクトル生成部１３２ｃは、要旨ベクトルＥＶと、引用候補ベクトルＲｆＶとの差分を計算して、双方の文書ベクトルの差分に応じた要旨移動ベクトルＶ３を生成する。 The summary vector generation unit 132a receives the claim summary data ied, extracts the feature words, performs weighting according to each word, and generates the summary vector EV according to the description of each claim. The citation candidate vector generation unit 132b inputs public gazette data of the most similar document included in the concept search data Vd2, extracts the feature word, performs weighting according to each word, and performs document weighting according to the most similar document (Quotation candidate vector) RfV is generated. The movement vector generation unit 132c calculates a difference between the abstract vector EV and the citation candidate vector RfV, and generates an abstract movement vector V3 according to the difference between the two document vectors.

最類似文献は、主引用発明検索部による概念検索の結果、最も類似度が高いとされた文献であるため、予測対象発明の進歩性の審査で主引用発明の開示文献として引用される確率が最も高いと推測される。そのため、最類似文献を引用候補として引用候補ベクトルＲｆＶを求め、これと要旨ベクトルＥＶとの差分を計算して要旨移動ベクトルＶ３を求めれば、予測対象発明と、最類似文献に開示されている発明との相違に応じた文書ベクトル（要旨移動ベクトルＶ３に相当する）が生成される。 Since the most similar document is a document having the highest similarity as a result of the concept search by the main citation invention search unit, the probability of being cited as a disclosure document of the main citation invention in the inventive step examination of the prediction target invention is high. Presumed to be the highest. Therefore, if the citation candidate vector RfV is obtained using the most similar document as a citation candidate and the difference between the citation candidate vector RfV is calculated and the gist movement vector V3 is obtained, the invention to be predicted and the invention disclosed in the most similar document A document vector (corresponding to the abstract movement vector V3) corresponding to the difference is generated.

機械学習部１３３は、本発明の実施の形態にかかる文書分類部であって、次のような訓練データ（学習パターンともいう）を用いた機械学習（教師付き学習）によって、後述する要旨移動ベクトルＶ３を進歩性の要件に適合するクラスと適合しないクラス（拒絶理由が無いクラスと有るクラス）に分類し、その分類結果に応じた出力信号（要件適否文書ベクトル）Ｖ４を出力するように構築されている。本発明の実施の形態の場合、学習パターンは次に述べるＨＬパターンとすることができる。 The machine learning unit 133 is a document classification unit according to the embodiment of the present invention, and a summary movement vector described later by machine learning (supervised learning) using the following training data (also referred to as a learning pattern). It is constructed to classify V3 into a class that conforms to the requirement of inventive step and a class that does not conform (class that has no reason for refusal) and output an output signal (requirement conformity document vector) V4 according to the classification result. ing. In the embodiment of the present invention, the learning pattern can be an HL pattern described below.

ＨＬパターンは、学習文書ベクトルが第１の学習文書ベクトルで教師ベクトルが進歩性の拒絶理由有りを示すベクトル（例えば、正解のクラスに対応した次元だけが"１"で、他が"０"のベクトル）との組み合わせと、学習文書ベクトルが第２の学習文書ベクトルで教師ベクトルが進歩性の拒絶理由無しを示すベクトル（例えば、上記とは別の次元だけが"１"で、他が"０"のベクトル）との組み合わせのパターンである。 In the HL pattern, the learning document vector is the first learning document vector, and the teacher vector is a vector indicating that there is a reason for rejection of the inventive step (for example, only the dimension corresponding to the correct class is “1” and the others are “0”). Vector), the learning document vector is the second learning document vector, and the teacher vector is a vector indicating that there is no reason for rejection of the inventive step (for example, only a dimension different from the above is “1” and the others are “0”). Is a combination pattern with "vector".

第１の学習文書ベクトルは、公開済出願の中で特許庁の審査の結果、初めての拒絶理由通知（１ｓｔアクション）が発行された出願であって、その１ｓｔアクションで進歩性違反の拒絶理由（特許法第２９条第２項の要件を満たしていないとする拒絶理由）が指摘されていた出願（進歩性拒絶出願）の該拒絶理由が指摘されていた（拒絶理由通知発行時点の）請求項に応じた文書ベクトルと、そのときの引用文献１（主たる刊行物として引用されていた主引用文献）に応じた文書ベクトル（引用文書ベクトル）との差分に応じた第１の移動文書ベクトルである。 The first learning document vector is an application for which the first notice of reasons for refusal (1st action) has been issued as a result of examination by the Patent Office in a published application, and the reason for refusal of inventive step violation ( Claims (at the time of issuance of the notice of reasons for refusal) that pointed out the reasons for refusal of the application (reasonable refusal application) for which the reason for refusal that the requirements of Article 29 (2) of the Patent Act were not satisfied Is a first moving document vector corresponding to a difference between a document vector corresponding to the document vector and a document vector (cited document vector) corresponding to the cited document 1 (main cited document cited as the main publication) at that time. .

第２の学習文書ベクトルは、公開済出願の中で審査の結果、１ｓｔアクションが発行されずに特許査定が発行された出願（拒絶無し出願）または１ｓｔアクションは発行されたがその拒絶理由に進歩性違反の拒絶理由が指摘されていなかった出願（進歩性拒絶無し出願）の（拒絶理由通知が発行された時点の）請求項１に応じた文書ベクトルと、それら拒絶無し出願または進歩性拒絶無し出願を対象とする概念検索の結果、最も類似度が高いとされる文献（学習用最類似文献）に応じた文書ベクトル（非引用文書ベクトル）との差分に応じた第２の移動文書ベクトルである。 The second learning document vector is the result of examination in the published application that the patent action is issued without the first action being issued (the application without refusal) or the first action is issued but the reason for refusal is advanced Document vectors according to claim 1 (at the time of the notice of reasons for refusal) of applications for which no reason for refusal of sex violation has been pointed out (applications without inventive refusal), and those applications without refusal or inventive refusal As a result of the concept search for the application, the second moving document vector corresponding to the difference from the document vector (non-cited document vector) corresponding to the document (the most similar document for learning) that has the highest similarity is used. is there.

機械学習部１３３は、上記のようなＨＬパターンの学習パターンで学習を繰り返し行うことにより、要旨移動ベクトルＶ３を進歩性の拒絶理由有りまたは無しのいずれかのクラスに分類し、その分類したクラスに応じた要件適否文書ベクトルＶ４を出力する。前者は、予測対象出願について、進歩性違反の拒絶理由が発行される可能性が高い場合、後者は低い場合に相当する。 The machine learning unit 133 classifies the abstract movement vector V3 into either a class with or without a reason for refusal of inventive step by repeatedly performing learning with the learning pattern of the HL pattern as described above, and the class is classified into the classified class. A corresponding requirement conformity document vector V4 is output. The former corresponds to a case where the reason for refusal of inventive step violation is likely to be issued for the forecast application, and the latter corresponds to a low case.

機械学習部１３３は、入力される要旨移動ベクトルＶ３を進歩性の拒絶理由が有るクラスと無いクラスに分類して、その分類結果に応じた要件適否文書ベクトルＶ４を出力すればよいので、機械学習部１３３にサポートベクターマシーン（ＳＶＭ）と呼ばれる学習アルゴリズムを適用することができる。サポートベクターマシーン（ＳＶＭ）によれば、決定境界との距離（マージン）が最大になるように、決定境界を得ることができる。 The machine learning unit 133 may classify the inputted abstract movement vector V3 into a class having a reason for refusal of inventive step and a class having no reason for refusal of inventive step, and output a requirement conformity document vector V4 according to the classification result. A learning algorithm called a support vector machine (SVM) can be applied to the unit 133. According to the support vector machine (SVM), the decision boundary can be obtained so that the distance (margin) from the decision boundary is maximized.

また、機械学習部１３３の情報処理に脳神経回路網をモデルにしたニューラルネットワークを適用することができる。ニューラルネットワークには、階層型ニューラルネットワークと、相互結合型ニューラルネットワークがある。たとえば、機械学習部１３３の学習アルゴリズムとして、階層型ニューラルネットワークのパーセプトロンを適用することができる。 In addition, a neural network modeled on the brain neural network can be applied to the information processing of the machine learning unit 133. Neural networks include hierarchical neural networks and interconnected neural networks. For example, a perceptron of a hierarchical neural network can be applied as a learning algorithm of the machine learning unit 133.

パーセプトロンはＳ層、Ａ層、Ｒ層と呼ばれる３層からなる階層型ネットワークで構成され（図示せず）、Ｓ層からＡ層、Ａ層からＲ層という片方向の結合だけが存在している。前述のＨＬパターンの学習パターンが与えられると、第１の学習文書ベクトルまたは第２の学習文書ベクトルが入力されたときの出力ベクトルがそれぞれの教師ベクトルと異なっていたときに、その誤差に応じて結合の重みが修正され、出力ベクトルと教師ベクトルとの誤差が一定値以下になったときに学習が終了する。 The perceptron is composed of a three-layer hierarchical network called S layer, A layer, and R layer (not shown), and there is only one-way coupling from S layer to A layer and from A layer to R layer. . Given the learning pattern of the HL pattern described above, when the output vector when the first learning document vector or the second learning document vector is input is different from each teacher vector, according to the error. The learning ends when the weight of the connection is corrected and the error between the output vector and the teacher vector becomes a certain value or less.

しかしながら、パーセプトロンでは、学習パターンが線形分離不可能な場合にアルゴリズムが停止しないおそれがある。そのため、機械学習部１３３が学習によって非線形な決定境界を獲得できるようにするため、階層型ニューラルネットワークの中で応用例が多く、誤識別の少ない非線形識別面が学習できるＢＰ（バックプロパゲーション）ネットワークを適用することが好ましい。 However, the perceptron may not stop the algorithm when the learning pattern cannot be linearly separated. Therefore, in order for the machine learning unit 133 to acquire a non-linear decision boundary by learning, there are many application examples in a hierarchical neural network, and a BP (back propagation) network that can learn a non-linear discriminant plane with few misidentifications. Is preferably applied.

ＢＰネットワークは、図１１に示すように、入力層および出力層と、その間の中間層とを有し、誤差逆伝播アルゴリズムと呼ばれる学習アルゴリズムによって、ユニット間のすべての結合の重みが学習可能になっている。誤差逆伝播アルゴリズムでは、入力信号が入力層、中間層、出力層と伝わり、その一方、誤差信号が逆に伝わることによって、重み調整が行われる。 As shown in FIG. 11, the BP network has an input layer and an output layer, and an intermediate layer therebetween, and a learning algorithm called an error back-propagation algorithm makes it possible to learn the weights of all the connections between units. ing. In the error back-propagation algorithm, the input signal is transmitted to the input layer, the intermediate layer, and the output layer, while the error signal is transmitted in the reverse direction, whereby weight adjustment is performed.

そして、図１１に示すＢＰネットワークに、学習パターンｘ_ｐ（ｘ_０、ｘ_１・・・ｘ_ｎ）が入力されたとき、ある階層のｊ（０≦ｊ≦ｎ）番目のユニットには、そのユニットｊとの結合を有する１階層前のユニットから重み付きの信号が入力される。そこで、１階層前のｉ（０≦ｉ≦ｎ）番目のユニットからの信号をｔ_ｉｐ，重みをｗ_ｉｊとすると、ユニットｊへの入力は、式１のようになり、ユニットｊの出力は、閾値関数をｆとして、式２のようになる。 Then, when a learning pattern x _p (x ₀ , x ₁ ... X _n ) is input to the BP network shown in FIG. 11, the j (0 ≦ j ≦ n) -th unit in a certain hierarchy A weighted signal is input from the unit one layer before having a connection with the unit j. Therefore, if the signal from the i-th unit (0 ≦ i ≦ n) of the previous layer is t _ip and the weight is w _ij , the input to the unit j is given by Equation 1, and the output of the unit j is Suppose f is a threshold function and Equation 2 is obtained.

式１

Formula 1

式２

Formula 2

学習パターンｘ_ｐに対する誤差Ｄ_ｐは、出力層のユニットｋの出力と、教師信号ｂ_ｋｐの差の２乗和で定義されるから、以下の式３のようになる。この誤差Ｄ_ｐをすべての学習パターンに対して足しあげて式４のＤを求め、そのＤが最小になるように、ユニット間の結合重みが調整されて機械学習部１３３における学習が行われる。この場合、個々の学習パターンが入力されるごとに、式５によって重みが調整される。ｗ_ｉｊは更新前の重み、ｗ'_ijは更新後の重み、ρは学習係数である。これは確率的最急降下法と呼ばれる。なお、ユニットの入出力関数は式６に示すシグモイド関数が用いられる。 Error D _p for learning pattern x _p is the output of unit k of output layer, since is defined by the sum of squares of the difference between the teacher signal b _kp, so equation 3 below. The error D _p 3GS sum for all learning patterns sought D of formula 4, as its D is minimized, is adjusted connection weights between units is learning in machine learning unit 133 is carried out. In this case, each time an individual learning pattern is input, the weight is adjusted by Expression 5. w _ij is a weight before update, w ′ _ij is a weight after update, and ρ is a learning coefficient. This is called stochastic steepest descent. Note that the sigmoid function shown in Equation 6 is used as the unit input / output function.

式３

Formula 3

式４

Formula 4

式５

Formula 5

式６

Equation 6

（ユーザ端末装置３０の構成）
ユーザ端末装置３０は、図１に示すように、インターネットＮ１への接続環境を備え、特許要件適否予測サーバ１０と通信を行うことができる。なお、ユーザ端末装置３０は、据え置き型（または持ち運び可能なノート型）のパーソナルコンピュータを想定しているが、タブレット型の端末装置でもよい。 (Configuration of the user terminal device 30)
As shown in FIG. 1, the user terminal device 30 includes a connection environment to the Internet N <b> 1 and can communicate with the patent requirement suitability prediction server 10. The user terminal device 30 is assumed to be a stationary (or portable notebook type) personal computer, but may be a tablet-type terminal device.

ユーザ端末装置３０は、図３に示すように、ＣＰＵ３１、ＲＯＭ３２、ＲＡＭ３３、データ記憶部３４、液晶表示部３５を有している。また、ユーザ端末装置３０は、音声変換処理部３６、通信制御部３７、通信処理部３８ａ、無線通信部３８ｂ、スピーカ３９およびマイク４０を有している。 As illustrated in FIG. 3, the user terminal device 30 includes a CPU 31, a ROM 32, a RAM 33, a data storage unit 34, and a liquid crystal display unit 35. In addition, the user terminal device 30 includes a voice conversion processing unit 36, a communication control unit 37, a communication processing unit 38a, a wireless communication unit 38b, a speaker 39, and a microphone 40.

ＣＰＵ３１は、ＲＯＭ３２に記憶されているプログラムにしたがい作動してユーザ端末装置３０全体の動作制御を司る。ＲＯＭ３２はＣＰＵ３１が実行するプログラム、例えば、データ通信を行うための通信制御プログラムが記憶されている。ＲＡＭ３３には、ＣＰＵ３１によるプログラムの実行に必要なデータ等が記憶される。 The CPU 31 operates according to a program stored in the ROM 32 and controls operation of the entire user terminal device 30. The ROM 32 stores a program executed by the CPU 31, for example, a communication control program for performing data communication. The RAM 33 stores data necessary for the CPU 31 to execute the program.

データ記憶部３４には種々のデータが記憶されている。液晶表示部３５は、ＬＣＤ（Liquid Crystal Display）とその駆動部を有し、文字、図形、記号などの画像表示を行う画像表示手段である。音声変換処理部３６は、音声データを伸張してスピーカ３９に出力する一方、マイク４０から入力するアナログ音声信号をデジタルの音声データに変換および圧縮して、通信処理部３８ａに入力する。通信制御部３７はＣＰＵ３１の指示を受けて作動し、データ通信を行うための回線の接続および切断を制御する。通信処理部３８ａは、通信制御部３７の指示にしたがい作動して、インターネットＮ１を介して行われるデータの送受信を実行する。無線通信部３８ｂは通信制御部３７の制御にしたがい、無線によるデータの送受信を実行する無線通信手段である。スピーカ３９は、音声を出力する音声出力手段であり、マイク４０はユーザの会話内容等の音声を入力し、電気信号に変換する。 Various data are stored in the data storage unit 34. The liquid crystal display unit 35 includes an LCD (Liquid Crystal Display) and its driving unit, and is an image display unit that displays images of characters, figures, symbols, and the like. The voice conversion processing unit 36 decompresses the voice data and outputs it to the speaker 39, while converting and compressing the analog voice signal input from the microphone 40 into digital voice data and inputs the digital voice data to the communication processing unit 38a. The communication control unit 37 operates in response to an instruction from the CPU 31 and controls connection and disconnection of a line for performing data communication. The communication processing unit 38a operates according to an instruction from the communication control unit 37, and executes transmission / reception of data performed via the Internet N1. The wireless communication unit 38b is a wireless communication unit that performs wireless data transmission / reception under the control of the communication control unit 37. The speaker 39 is an audio output means for outputting audio, and the microphone 40 inputs audio such as the user's conversation content and converts it into an electrical signal.

(特許要件適否予測システムの動作内容）
次に、図４とともに図１２から図２２までを参照して、特許要件適否予測サーバ１０による特許要件適否予測処理の動作内容について説明する。 (Operations of the patent requirement conformity prediction system)
Next, referring to FIGS. 12 to 22 together with FIG. 4, the operation content of the patent requirement suitability prediction process by the patent requirement suitability prediction server 10 will be described.

ここで、図４は、特許要件適否予測処理を実現する特許要件適否予測サーバ１０の主要な構成を示す機能ブロック図である。特許要件適否予測サーバ１０では、ＣＰＵ１１が特許要件適否予測プログラムにしたがい、公開公報ＤＢ１５０、要旨データ記憶部１５３等に記憶されている各種ファイルやＤＢにアクセスしながら、公報データ抽出部１０１、要旨データ抽出部１０２、特許要件適否予測処理部１０３、予測結果編集処理部１０５としての動作を行う。これにより、特許要件適否予測処理が実行される。なお、特許要件適否予測プログラムは、特許要件適否予測サーバ１０を公報データ抽出部１０１、要旨データ抽出部１０２、特許要件適否予測処理部１０３、予測結果編集処理部１０５等として機能させるためのプログラムである。 Here, FIG. 4 is a functional block diagram showing a main configuration of the patent requirement suitability prediction server 10 for realizing the patent requirement suitability prediction process. In the patent requirement suitability prediction server 10, the CPU 11 follows the patent requirement suitability prediction program while accessing various files and DBs stored in the publication gazette DB 150, summary data storage unit 153, etc. The extraction unit 102, the patent requirement suitability prediction processing unit 103, and the prediction result editing processing unit 105 are operated. Thereby, a patent requirement suitability prediction process is executed. The patent requirement suitability prediction program is a program for causing the patent requirement suitability prediction server 10 to function as the gazette data extraction unit 101, the gist data extraction unit 102, the patent requirement suitability prediction processing unit 103, the prediction result editing processing unit 105, and the like. is there.

そして、特許要件適否予測サーバ１０が特許要件適否予測処理を行うときは、ＣＰＵ１１が特許要件適否予測プログラムにしたがい図１２に示すフローチャートに沿った動作を行う。図１２は、特許要件適否予測プログラムにしたがったＣＰＵ１１の特許要件適否予測処理の動作手順の一例を示すフローチャートである。なお、図１２、図１３等において"Ｓ"とはステップを略記したものである。 When the patent requirement suitability prediction server 10 performs the patent requirement suitability prediction process, the CPU 11 performs an operation according to the flowchart shown in FIG. 12 according to the patent requirement suitability prediction program. FIG. 12 is a flowchart showing an example of the operation procedure of the patent requirement suitability prediction process of the CPU 11 according to the patent requirement suitability prediction program. In FIG. 12, FIG. 13, etc., “S” is an abbreviation of step.

ＣＰＵ１１は、特許要件適否予測プログラムにしたがい動作を開始すると、ステップ１に進み、ユーザ認証処理を行う。ここでは、ユーザがユーザ端末装置３０を用いて入力したユーザＩＤおよびパスワードを確認する等してユーザ認証処理を行う。次に、ＣＰＵ１１は、ステップ２に進み、ポイント残高確認処理を行う。ここでは、ユーザのポイント残高が一定値以上あるかどうかをＣＰＵ１１が確認し、ポイント残高不足であれば、特許要件適否予測処理を終了するか、ポイント残高不足を知らせるメッセージの送信などを行う。 When starting the operation according to the patent requirement suitability prediction program, the CPU 11 proceeds to step 1 and performs user authentication processing. Here, the user authentication process is performed by confirming the user ID and password input by the user using the user terminal device 30. Next, the CPU 11 proceeds to step 2 to perform a point balance confirmation process. Here, the CPU 11 checks whether or not the user's point balance is equal to or greater than a certain value. If the point balance is insufficient, the patent requirement suitability prediction process is terminated or a message notifying that the point balance is insufficient is transmitted.

続いて、ＣＰＵ１１は、ステップ３に進み、指定番号の入力画面データをユーザ端末３０に送信して、ユーザの指定番号の入力があるまで待機する。ここで、指定番号の入力があると、その指定番号を指定ナンバＴＲ記憶部１５１に記憶させ、入力のあった指定番号の件数（例えば３件）を公報カウンタＭＡＸにセットし、公報カウンタに"０"をセットする。 Subsequently, the CPU 11 proceeds to step 3, transmits the input screen data of the designated number to the user terminal 30, and waits for the input of the designated number of the user. Here, when a designated number is input, the designated number is stored in the designated number TR storage unit 151, and the number of designated numbers entered (for example, 3) is set in the publication counter MAX. Set 0 ".

次に、ＣＰＵ１１は、処理をステップ４に進めて予測終了条件が成立しているか否かを判定する。ここで、ＣＰＵ１１は予測終了条件が成立しているときはステップ６に進むが、そうでないときはステップ５に進む。ＣＰＵ１１はステップ５に進むと、後述する特許要件適否予測ルーチンを実行するが、ステップ６に進むと、終了処理を実行し、そのユーザに対する特許要件適否予測処理を終了する。このようにすることで、予測終了条件が成立している限り、特許要件適否予測処理が自動的かつ継続的に実行される。 Next, the CPU 11 proceeds to step 4 to determine whether or not a prediction end condition is satisfied. Here, the CPU 11 proceeds to step 6 when the prediction end condition is satisfied, but proceeds to step 5 otherwise. When proceeding to step 5, the CPU 11 executes a patent requirement suitability prediction routine which will be described later. When proceeding to step 6, the CPU 11 executes a termination process and terminates the patent requirement conformity prediction process for the user. By doing so, as long as the prediction end condition is satisfied, the patent requirement suitability prediction process is automatically and continuously executed.

そして、ＣＰＵ１１はステップ５に進むときは、図１３に示すフローチャートに沿って特許要件適否予測ルーチンを実行する。 Then, when proceeding to Step 5, the CPU 11 executes a patent requirement suitability prediction routine according to the flowchart shown in FIG.

ＣＰＵ１１は特許要件適否予測ルーチンを開始すると、ステップ１１に進み、公報カウンタに"１"を加算する。続くステップ１２では、公報カウンタが公報カウンタＭＡＸよりも大きいか否かを判定し、大きくなければ処理をステップ１３に進めるが、そうでなければ（公報カウンタが公報カウンタＭＡＸより大きいとき）はステップ１６に処理を進める。 When starting the patent requirement suitability prediction routine, the CPU 11 proceeds to step 11 and adds “1” to the publication counter. In the following step 12, it is determined whether the publication counter is larger than the publication counter MAX. If not, the process proceeds to step 13. If not (when the publication counter is larger than the publication counter MAX), step 16 is executed. Proceed with the process.

ＣＰＵ１１は、ステップ１３に処理を進めると、公報データ抽出部１０１としての動作を行って指定番号に応じたデータ抽出を行うとともに、対象公報抽出部１０４としての動作を行って公開公報データの抽出を行い、抽出したデータをそれぞれ予測対象ＴＲ記憶部１５２、対象公報記憶部１５５に記憶させる。また、ＣＰＵ１１は、要旨データ抽出部１０２としての動作を行って前述した要旨データおよびＣＴデータを生成し、それぞれ要旨データ記憶部１５３、ＣＴデータ記憶部１５４に記憶させる。対象公報抽出部１０４は、出願日が予測対象出願の出願日よりも前の公開公報データを抽出する。 When the CPU 11 proceeds to step 13, the operation as the publication data extraction unit 101 is performed to perform data extraction according to the designated number, and the operation as the target publication extraction unit 104 is performed to extract the publication publication data. The extracted data is stored in the prediction target TR storage unit 152 and the target publication storage unit 155, respectively. Further, the CPU 11 performs the operation as the summary data extraction unit 102 to generate the above-described summary data and CT data, and stores them in the summary data storage unit 153 and the CT data storage unit 154, respectively. The target gazette extraction unit 104 extracts public gazette data whose filing date is earlier than the filing date of the prediction target application.

続いてＣＰＵ１１は、ステップ１４に処理を進めて後述する新規性・拡大先願予測ルーチンを実行してからステップ１５に進み、進歩性予測ルーチンを実行する。その後、ＣＰＵ１１は、ステップ１１に戻って上記同様の処理を繰り返す。ステップ１６では、ＣＰＵ１１が予測結果編集処理部１０５としての動作を行い、後述する予測結果リストＬ１を編集出力する。その後、ステップ１７のポイント消費処理を実行して、特許要件適否予測を行った公開済出願の件数に応じて、ポイント残高を減らす。その後、特許要件適否予測ルーチンが終了する。 Subsequently, the CPU 11 proceeds to step 14 to execute a novelty / expansion prior application prediction routine described later, and then proceeds to step 15 to execute the inventive step prediction routine. Thereafter, the CPU 11 returns to step 11 and repeats the same processing as described above. In step 16, the CPU 11 performs an operation as the prediction result editing processing unit 105 to edit and output a prediction result list L <b> 1 described later. Thereafter, the point consumption process of step 17 is executed to reduce the point balance according to the number of published applications for which the patent requirement conformity prediction has been performed. Thereafter, the patent requirement suitability prediction routine ends.

そして、ＣＰＵ１１は、ステップ１４に処理を進めると、前述した新規性・拡大先願予測処理部１２５としての動作を行い、図１４、図１５に示すフローチャートに沿って、新規性・拡大先願予測ルーチンを実行する。 When the CPU 11 proceeds to step 14, the CPU 11 operates as the novelty / expansion prior application prediction processing unit 125 described above, and follows the flowcharts shown in FIGS. Run the routine.

この場合、ＣＰＵ１１は、新規性・拡大先願予測ルーチンをスタートするとステップ２１に処理を進め、文献カウンタ（文献ｃｔ）および文献ＭＡＸに"０"をセットし、項番カウンタ（項番ｃｔ）に"１"をセットする。続いてステップ２２に処理が進み、要旨データ記憶部１５３に記憶されている要旨データの項番ｃｔに応じたデータを検索タームに用いて、対象公報記憶部１５５の検索対象公報データについて全文検索が行われ、ヒットした文献の件数が文献ＭＡＸにセットされる。この場合、ステップ２１で項番カウンタに"１"がセットされているので、項番エリア１５３ｂが"１"のデータ、すなわち請求項１の要旨データを用いて検索タームが設定される。 In this case, when the CPU 11 starts the novelty / expansion prior application prediction routine, the CPU 11 proceeds to step 21, sets “0” to the document counter (document ct) and document MAX, and sets the item number counter (item ct). Set “1”. Subsequently, the process proceeds to step 22, and a full-text search is performed on the search target publication data in the target publication storage section 155 using data corresponding to the item number ct of the summary data stored in the summary data storage section 153 as a search term. The number of hit documents is set in the document MAX. In this case, since “1” is set in the item number counter in step 21, the search term is set using the data in which the item number area 153b is “1”, that is, the summary data of claim 1.

続いて処理がステップ２３に進み、ステップ２２でヒットした文献があったか否か（文献ＭＡＸが１以上か否か）が判定され、ヒットした文献があればステップ２４に処理が進み、そうでなければ新規性・拡大先願予測ルーチンが終了する。 Subsequently, the process proceeds to step 23, where it is determined whether or not there is a document hit in step 22 (whether or not document MAX is 1 or more). If there is a hit document, the process proceeds to step 24; The novelty / expansion prior application prediction routine ends.

ステップ２４では、文献カウンタに"１"が加算され、続くステップ２５では、文献カウンタが文献ＭＡＸ以下であるか否かが判定され、これが成立しているときはステップ２６に処理が進み、そうでなければステップ２９に処理が進む。ステップ２６では、ヒットした文献の出願公開日（ヒット文献公開日）が予測対象出願の出願日（対象出願日）よりも小さいか否か（ヒット文献公開日＜対象出願日が成立するか否か）が判定され、これが成立しているときはステップ２７に処理が進み、そうでなければステップ２８に処理が進む。ステップ２７では、新規性無しを示す新規性フラグ"Ｎ１"を含むように新規性・拡大先願予測データＮｄが生成される。その後、ステップ２４に戻り、上記同様の処理が繰り返えされる。 In step 24, “1” is added to the document counter, and in subsequent step 25, it is determined whether or not the document counter is equal to or less than document MAX. If this is true, the process proceeds to step 26, and so on. If not, the process proceeds to step 29. In step 26, whether the application publication date (hit document publication date) of the hit document is smaller than the filing date (target application date) of the prediction target application (whether hit document publication date <target application date is satisfied). ) Is determined, and if this is true, the process proceeds to step 27; otherwise, the process proceeds to step 28. In step 27, novelty / expansion prior application prediction data Nd is generated so as to include a novelty flag “N1” indicating no novelty. Thereafter, the process returns to step 24 and the same processing as described above is repeated.

そして、ステップ２８では、後述する拡大先願予測ルーチンが実行される。ステップ２９では、ＣＴデータ記憶部１５４を参照して、他の独立項が有るか否かが判定され、ほかの独立項があるときはステップ３０に処理が進み、そうでなければステップ３１で新規性・拡大先願予測データＮｄが出力された後、新規性・拡大先願予測ルーチンが終了する。ステップ３０では、ＣＰＵ１１が文献カウンタおよび文献ＭＡＸに"０"をセットし、項番カウンタに"１"よりも大きい請求項ナンバがセットされる。その後、処理がステップ２２に戻り、上記同様の処理が繰り返される。 In step 28, an enlargement-first application prediction routine, which will be described later, is executed. In step 29, it is determined whether or not there is another independent term by referring to the CT data storage unit 154. If there is another independent term, the process proceeds to step 30; After the sex / expansion prior application prediction data Nd is output, the novelty / expansion prior application prediction routine ends. In step 30, the CPU 11 sets "0" in the document counter and document MAX, and a claim number larger than "1" is set in the item number counter. Thereafter, the process returns to step 22 and the same process as described above is repeated.

一方、ＣＰＵ１１は、図１５に示すフローチャートに沿って拡大先願予測ルーチンを実行する。拡大先願予測ルーチンがスタートすると、ステップ４１に処理が進み、ヒットした文献の出願日（文献出願日）が対象出願日よりも前であるか否か（文献出願日＜対象出願日が成立するか否か）が判定され、これが成立しているときはステップ４２に処理が進むが、そうでなければ拡大先願予測ルーチンを終了する。ステップ４２では、予測対象出願とヒットした文献とで発明者が同一であるか否かが判定され、これが成立していないときはステップ４３に処理が進むが、成立していれば拡大先願予測ルーチンを終了する。ステップ４３では、予測対象出願とヒットした文献とで出願人が同一であるか否かが判定され、これが成立していないときはステップ４４に処理が進むが、成立していれば拡大先願予測ルーチンを終了する。そして、ＣＰＵ１１は、ステップ４４に処理を進めると、拡大先願の要件（特許法第２９条の２に規定される要件）を満たしていないことを示す拡大先願フラグ"Ｆ１"を含むように新規性・拡大先願予測データＮｄを生成する。その後、拡大先願予測ルーチンが終了する。 On the other hand, the CPU 11 executes an enlargement-first application prediction routine according to the flowchart shown in FIG. When the extended earlier application prediction routine starts, the process proceeds to step 41, and whether or not the filing date of the hit document (document filing date) is earlier than the target filing date (reference filing date <subject filing date is satisfied) If this is true, the process proceeds to step 42; otherwise, the enlargement earlier application prediction routine is terminated. In step 42, it is determined whether or not the inventor is the same between the prediction target application and the hit document. If this is not true, the process proceeds to step 43. End the routine. In step 43, it is determined whether or not the applicant is the same between the prediction target application and the hit document. If this is not the case, the process proceeds to step 44. End the routine. Then, when the CPU 11 proceeds the process to step 44, the CPU 11 includes an enlarged first application flag “F1” indicating that the requirement of the enlarged first application (required in Article 29-2 of the Patent Law) is not satisfied. The novelty / expansion prior application prediction data Nd is generated. Thereafter, the enlargement destination application prediction routine ends.

以上で新規性・拡大先願予測ルーチンが終了すると、図１３において処理がステップ１４からステップ１５に進み、ＣＰＵ１１が進歩性予測処理部１２６としての動作を行い、進歩性予測ルーチンを実行する。ＣＰＵ１１は、図１６〜図２２に示すフローチャートに沿って進歩性予測ルーチンを実行する。 When the novelty / expansion prior application prediction routine is completed as described above, the process proceeds from step 14 to step 15 in FIG. 13, and the CPU 11 performs the operation as the inventive step prediction processing unit 126 to execute the inventive step prediction routine. The CPU 11 executes an inventive step prediction routine according to the flowcharts shown in FIGS.

ＣＰＵ１１は、進歩性予測ルーチンをスタートするとステップ５１に処理を進め、ＣＴデータ記憶部１５４から、独立区分エリア１５４ａの独立区分がスペースのレコードにつき、そのナンバエリア１５４ｂの請求項ナンバを取得して、後述する独立項テーブル１６５のナンバエリア（Ｎｏエリア）１６５ｂにセットする。続くステップ５２では、ＣＴデータ記憶部１５４から、サーチフラグエリア１５４ｅのサーチフラグがスペースで、ＭＡＸ区分エリア１５４ｃのＭＡＸ区分が"Ｍ"のレコードからそのナンバエリア１５４ｂの請求項ナンバを取得したうえで、取得した請求項ナンバの最小値（ＭＩＮ）を求め、それをＭＡＸカウンタにセットする。図９（Ｂ）のように、ＭＡＸ区分が"Ｍ"のレコードが複数あるときはそのうちの最も小さい請求項ナンバがＭＡＸカウンタにセットされる。 When starting the inventive step predictive routine, the CPU 11 advances the process to step 51, obtains the claim number of the number area 154b for the record of the independent section area 154a from the CT data storage unit 154, and It is set in the number area (No area) 165b of the independent item table 165 described later. In the subsequent step 52, the claim number of the number area 154b is obtained from the CT data storage unit 154 from a record in which the search flag of the search flag area 154e is a space and the MAX section of the MAX section area 154c is "M". Then, the minimum value (MIN) of the obtained claim number is obtained and set in the MAX counter. As shown in FIG. 9B, when there are a plurality of records whose MAX classification is “M”, the smallest claim number is set in the MAX counter.

そして、ＣＰＵ１１は、ステップ５３に処理を進めて独立項テーブル１６５のＮｏエリア１６５ｂをサーチし、続くステップ５４で、"１"よりも大きい請求項ナンバがあるか否かを判定し、"１"よりも大きい請求項ナンバがあるか否かで処理が分岐する。この場合、"１"よりも大きい請求項ナンバがなければ処理がステップ５５に進み、あれば処理がステップ５６に進む。ステップ５５は予測対象出願に含まれる請求項の中で独立項が１つだけの場合の処理（単一独立項ルーチン）、ステップ５６は独立項が複数の場合の処理（複数独立項ルーチン）に相当している。前者は例えば予測対象出願が特開２００８−６２２８２号公報に開示されている出願の場合、後者は例えば予測対象出願が特開２０１１−１８６７３５号公報に開示されている出願の場合に相当している。 Then, the CPU 11 proceeds to step 53 to search the No area 165b of the independent item table 165, and in the subsequent step 54, determines whether there is a claim number larger than “1”, and “1”. Processing branches depending on whether there is a claim number larger than that. In this case, if there is no claim number greater than “1”, the process proceeds to step 55, and if it is, the process proceeds to step 56. Step 55 is a process when there is only one independent term in the claims included in the predicted application (single independent term routine), and step 56 is a process when there are multiple independent terms (multiple independent term routine). It corresponds. For example, the former corresponds to a case where the prediction target application is an application disclosed in Japanese Patent Application Laid-Open No. 2008-62282, and the latter corresponds to a case where the prediction target application is an application disclosed in Japanese Patent Application Laid-Open No. 2011-186735, for example. .

独立項テーブル１６５は、図２４に示すように、カウンタエリア１６５ａ，Ｎｏエリア１６５ｂおよびサーチフラグエリア１６５ｃを有している。カウンタエリア１６５ａには、記憶されるデータの件数に応じた数値が記憶されている。Ｎｏエリア１６５ｂには、独立項の番号が記憶される。サーチフラグエリア１６５ｃにはサーチフラグが記憶されている。図２４には、一例として、予測対象出願が特開２０１１−１８６７３５号公報に開示されている出願の場合が示されている。 As shown in FIG. 24, the independent term table 165 has a counter area 165a, a No area 165b, and a search flag area 165c. The counter area 165a stores a numerical value corresponding to the number of stored data. In the No area 165b, the number of the independent term is stored. A search flag is stored in the search flag area 165c. FIG. 24 shows, as an example, a case where the prediction target application is an application disclosed in Japanese Patent Application Laid-Open No. 2011-186735.

そして、ＣＰＵ１１は、ステップ５５に処理を進めると、図１７に示すフローチャートに沿って単一独立項ルーチンを実行する。ＣＰＵ１１は、単一独立項ルーチンを開始すると、ステップ６１に処理を進め、ＣＴデータ記憶部１５４から、サーチフラグエリア１５４ｅのサーチフラグがスペースのレコードについて、そのナンバエリア１５４ｂから請求項ナンバを取得して、そのうちの最小値（ＭＩＮ）を項番カウンタにセットする。続くステップ６２では、後述する独立項検索処理が行われる。続くステップ６３で検索フラグ（検索ｆｌａｇ）が"ＶＸ"または"ＶＹ"であるか否かが判定され、検索フラグが"ＶＸ"または"ＶＹ"であれば処理がステップ６４に進み、そうでなければ単一独立項ルーチンを終了する。 Then, when the CPU 11 proceeds with the process to step 55, the CPU 11 executes a single independent term routine according to the flowchart shown in FIG. When the CPU 11 starts the single independent term routine, the process proceeds to step 61, and the claim number is acquired from the number area 154b for the record in which the search flag in the search flag area 154e is space from the CT data storage unit 154. Then, the minimum value (MIN) is set in the item number counter. In the subsequent step 62, an independent term search process to be described later is performed. In subsequent step 63, it is determined whether or not the search flag (search flag) is “VX” or “VY”. If the search flag is “VX” or “VY”, the process proceeds to step 64; The single independent term routine is terminated.

ＣＰＵ１１は、ステップ６４に処理を進めると項番カウンタに"１"を加算する。続くステップ６５では、項番カウンタが、ステップ５２でセットしたＭＡＸカウンタ以下であるか否かが判定され、項番カウンタがＭＡＸカウンタ以下ならステップ６６に処理を進めて後述する従属項検索処理が実行されるが、そうでなければ独立項検索処理が終了する。 When the CPU 11 advances the process to step 64, it adds “1” to the item number counter. In the next step 65, it is determined whether or not the item number counter is equal to or less than the MAX counter set in step 52. If the item number counter is equal to or less than the MAX counter, the process proceeds to step 66 to execute a dependent item search process described later. Otherwise, the independent term search process ends.

そして、ＣＰＵ１１は図１８に示すフローチャートに沿って、複数独立項ルーチンを実行する。ＣＰＵ１１は処理をスタートすると、ステップ５２に処理を進め、前述同様の処理を実行し、その後、ステップ５５に進んで、上記同様にして単一独立項ルーチンを実行する。その後、ＣＰＵ１１は処理をステップ６７に進め、ＣＴデータ記憶部１５４に、サーチフラグエリア１５４ｅのサーチフラグがスペースのレコードがあるか否か（すなわち、検索処理が行われていないレコードがあるか否か）が判定され、あればステップ５２に戻って上記同様の処理が実行されるが、そうでなければ複数独立項ルーチンが終了する。 Then, the CPU 11 executes a plurality of independent term routines according to the flowchart shown in FIG. When the CPU 11 starts the process, the process proceeds to step 52, executes the same process as described above, and then proceeds to step 55 to execute the single independent term routine in the same manner as described above. Thereafter, the CPU 11 advances the process to step 67 to determine whether or not there is a record in the CT data storage unit 154 where the search flag in the search flag area 154e is a space (that is, whether or not there is a record for which search processing has not been performed). ) Is determined, and if it is determined, the process returns to step 52 and the same processing as described above is executed. Otherwise, the plural independent term routine is terminated.

また、ＣＰＵ１１は図１９に示すフローチャートに沿って、独立項検索処理を実行する。独立項検索処理では、ＣＰＵ１１が引用発明検索部１３１としての動作を行い、独立項について主引用発明検索および副引用発明検索を行う。 Further, the CPU 11 executes an independent term search process according to the flowchart shown in FIG. In the independent term search processing, the CPU 11 performs the operation as the cited invention search unit 131, and performs the main cited invention search and the sub cited invention search for the independent term.

ＣＰＵ１１は、独立項検索処理を開始すると、ステップ７１に処理を進めて後述する主引用発明検索処理を実行する。続くステップ７２では、主引用発明検索処理で主引用発明があったか否か（後述する主引用文献がセットされているか否か）が判定され、主引用発明があればステップ７３に処理が進むが、主引用発明がなければステップ７６に処理が進む。続くステップ７３では、後述する副用発明検索処理が実行され、そのあとのステップ７４で、副引用発明検索処理で副引用発明があったか否か（後述する副引用文献がセットされているか否か）が判定される。副引用発明があればステップ７５に処理が進み、副引用発明がなければステップ７７に処理が進む。 When starting the independent term search process, the CPU 11 advances the process to step 71 to execute a main citation invention search process to be described later. In the following step 72, it is determined whether or not there is a main citation invention in the main citation invention search process (whether or not a main citation document described later is set). If there is a main citation invention, the process proceeds to step 73. If there is no main cited invention, the process proceeds to step 76. In the subsequent step 73, a sub-invention search process to be described later is executed, and in a subsequent step 74, whether or not there has been a sub-citation invention in the sub-citation invention search process (whether a sub-citation document to be described later is set). Is determined. If there is a secondary citation invention, the process proceeds to step 75, and if there is no secondary citation invention, the process proceeds to step 77.

ＣＰＵ１１はステップ７５に処理を進めると、該当する請求項ナンバの検索フラグ（検索ｆｌａｇ）に"ＶＸ"をセットし、ステップ７７では、検索フラグ（検索ｆｌａｇ）に"ＶＹ"をセットする。また、ＣＰＵ１１はステップ７６に処理を進めると、ＣＴデータ記憶部１５４に記憶されているレコードのうち、ナンバエリア１５４ｂの請求項ナンバが項番カウンタに一致しているレコードについて、サーチフラグエリア１５４ｅのサーチフラグＥｆにサーチ済み（検索済み）を示す"９"をセットする。また、ＣＰＵ１１はセットされた検索フラグを含むように進歩性予測データＶｄ１を生成して、それを予測結果ファイル生成部１２７に出力する。また、ＣＰＵ１１は検索結果に応じた請求項要旨データｉｅｄと概念検索データＶｄ２を入力ベクトル生成部１３２に出力する。この場合、請求項要旨データｉｅｄは、検索の対象となった請求項の要旨データとすることができるが、予測対象ＴＲ記憶部１５２に記憶されている予測対象出願の検索の対象となった請求項のデータでもよい。検索フラグは、主引用発明が見つかった場合に"ＶＸ"または"ＶＹ"がセットされるが、主引用発明が見つかると、それによって、進歩性の要件を満たさないと判断される可能性が高いため、進歩性違反の拒絶理由が見つかるか否かは主引用発明が見つかるか否かに大きく左右される。進歩性予測データＶｄ１は、このような検索フラグを含むことによって、進歩性の要件適否を示すものとなる。 When the processing proceeds to step 75, the CPU 11 sets “VX” to the search flag (search flag) of the corresponding claim number, and sets “VY” to the search flag (search flag) in step 77. Further, when the CPU 11 advances the process to step 76, among the records stored in the CT data storage unit 154, the records in which the claim number in the number area 154b matches the item number counter are stored in the search flag area 154e. A search flag Ef is set to “9” indicating that the search has been completed (searched). Further, the CPU 11 generates the inventive step prediction data Vd1 so as to include the set search flag, and outputs it to the prediction result file generation unit 127. Further, the CPU 11 outputs the claim summary data ied and the concept search data Vd2 corresponding to the search result to the input vector generation unit 132. In this case, the claim summary data ied can be the summary data of the claim that is the subject of the search, but the claim that is the subject of the search of the prediction target application stored in the prediction target TR storage unit 152 It may be term data. The search flag is set to “VX” or “VY” when the main cited invention is found, but if the main cited invention is found, it is likely that the search flag does not satisfy the inventive step requirement. Therefore, whether or not the reason for refusal of the inventive step violation is found depends greatly on whether or not the main cited invention is found. By including such a search flag, the inventive step prediction data Vd1 indicates the suitability of the inventive step requirement.

そして、ＣＰＵ１１は図２０に示すフローチャートに沿って、主引用発明検索処理を実行する。主引用発明検索処理は、予測対象発明に最も近い主引用発明を検索する処理である。 Then, the CPU 11 executes main citation invention search processing according to the flowchart shown in FIG. The main citation invention search process is a process of searching for the main citation invention closest to the prediction target invention.

ＣＰＵ１１は、主引用発明検索処理を開始すると、ステップ８１に処理を進めて、展開度カウンタｔｃに"０"をセットする。続いてＣＰＵ１１は、ステップ８２に処理を進め、要旨データ記憶部１５３から、次のデータを読みだして主検索文書データ（主引用発明を概念検索で検索するときの文書データ）を設定する。１つは、データ種別が"Ｃ"で、項番エリア１５３ｂの番号が項番カウンタに相当するレコード（項番カウンタには、ステップ６１で独立項の最小値がセットされている）から必須フラグＥｆが"Ｘ"で、展開度Ｅｄが展開度の最大値（展開度ＭＡＸ）−展開度カウンタｔｃの用語（例えば、展開度ＭＡＸが"５"なら、展開度Ｅｄが"５"−ｔｃの用語）であり、もう１つは、課題データ、すなわち、データ種別が"Ｐ"のレコードのデータである。 When the CPU 11 starts the main cited invention search process, the CPU 11 proceeds to step 81 and sets “0” in the development degree counter tc. Subsequently, the CPU 11 advances the processing to step 82, reads the next data from the summary data storage unit 153, and sets main search document data (document data when searching the main cited invention by concept search). One is a mandatory flag from a record in which the data type is “C” and the number of the item number area 153b corresponds to the item number counter (the item number counter is set with the minimum value of the independent item in step 61). Ef is “X”, and the degree of development Ed is the maximum value of the degree of development (expansion degree MAX) −the term of the degree of development counter tc (for example, if the degree of development MAX is “5”, the degree of development Ed is “5” −tc. The other is problem data, that is, data of a record whose data type is “P”.

続くステップ８３では、ＣＰＵ１１が主引用発明の検索処理、すなわち、主検索文書データを入力文書に用いて、対象公報記憶部１５５に記憶されている検索対象公報データについて概念検索を行う。この概念検索では、主検索文書データと、検索される文書それぞれを特徴語の抽出、重み付けを行う等してそれぞれの文書に応じたベクトル（文書ベクトル）が生成され、各ベクトルの内積が求められて類似度が算出される。次にステップ８４では、ステップ８３の概念検索の結果から、最も大きい類似度が一定値以上になっているか否かが判定され、一定値以上の場合はステップ８５に処理が進むが、そうでなければステップ８７に処理が進む。ステップ８５では、類似度が一定値以上の文献が複数あったか否かが判定され、なければステップ８６に処理が進み、あれば処理がステップ８９に進む。主引用発明検索処理では、類似度が一定値以上の文献があったときだけ主引用文献がセットされる。 In the following step 83, the CPU 11 performs a concept search for the search target publication data stored in the target publication storage unit 155 using the search process of the main cited invention, that is, the main search document data as the input document. In this conceptual search, a vector (document vector) corresponding to each document is generated by extracting and weighting the main search document data and each document to be searched, and the inner product of each vector is obtained. Thus, the similarity is calculated. Next, in step 84, it is determined from the result of the concept search in step 83 whether or not the largest similarity is a certain value or more, and if it is more than a certain value, the process proceeds to step 85. If so, the process proceeds to step 87. In step 85, it is determined whether or not there are a plurality of documents whose similarity is equal to or greater than a certain value. If there are no documents, the process proceeds to step 86; In the main citation invention search process, the main citation is set only when there is a document whose similarity is a certain value or more.

ステップ８６では、ヒットした文献が主引用文献（主引用発明が開示されている先行技術文献）にセットされて主引用発明検索処理が終了する。ステップ８７では、展開度カウンタｔｃに"１"が加算され、その後のステップ８８では、展開度ＭＡＸ−展開度カウンタｔｃが"０"以下であるか否かが判定され、"０"以下なら主引用発明検索処理を終了するが、そうでなければステップ８２に戻って上記の処理を繰り返す。 In step 86, the hit document is set as the main cited document (prior art document in which the main cited invention is disclosed), and the main cited invention search process ends. In step 87, “1” is added to the expansion degree counter tc, and then in step 88, it is determined whether or not the expansion degree MAX−expansion degree counter tc is “0” or less. The cited invention search process is terminated. If not, the process returns to step 82 and the above process is repeated.

こうすることで、はじめに展開度Ｅｄが展開度ＭＡＸのより重要な用語で概念検索が行される。概念検索では、文献の類似度に応じて、複数の文献が抽出され得るが、最も高い類似度が一定値に達していないときは、その文献が主引用文献に該当しないおそれが高い。そのため、類似度が一定値以上の文献が見つからなかった場合に展開度Ｅｄが展開度ＭＡＸよりも小さい用語を含めて再び概念検索が実行される。 In this way, first, a concept search is performed with terms that have a degree of expansion Ed more important than the degree of expansion MAX. In the concept search, a plurality of documents can be extracted according to the similarity of documents, but when the highest similarity does not reach a certain value, there is a high possibility that the document does not correspond to the main cited document. For this reason, when a document with a similarity equal to or greater than a certain value is not found, the concept search is executed again including a term whose expansion degree Ed is smaller than the expansion degree MAX.

ステップ８９では、類似度の最も大きい文献（最類似文献ともいう）を主引用文献にセットし、その後、主引用発明検索処理が終了する。 In step 89, the document with the highest degree of similarity (also referred to as the most similar document) is set as the main cited document, and then the main cited invention search process ends.

そして、ＣＰＵ１１は図２１に示すフローチャートに沿って、副引用発明検索処理を実行する。副引用発明検索処理は、予測対象発明と主引用発明との相違点を含む副引用発明を検索する処理であり、主引用発明検索処理で主引用発明が見つかったときだけ実行される。 Then, the CPU 11 executes the secondary citation invention search process according to the flowchart shown in FIG. The sub-cited invention search process is a process for searching a sub-cited invention including a difference between the prediction target invention and the main-cited invention, and is executed only when the main-cited invention is found in the main-cited invention search process.

ＣＰＵ１１は、副引用発明検索処理を開始すると、ステップ９１に処理を進めて、要旨データ記憶部１５３から、データ種別が"Ｃ"で、項番エリア１５３ｂの番号が項番カウンタに相当するレコードの主検索文書データに含まれていない用語（検索未使用データ）と、データ種別が"Ｐ"のレコードのデータとを読み出し、それらを副検索ターム（副引用発明を全文検索で検索するときのキーワード）に設定する。 When the CPU 11 starts the sub-cited invention search process, the process proceeds to step 91, and from the summary data storage unit 153, the data type is “C” and the number of the item number area 153b corresponds to the item number counter. Terms that are not included in the main search document data (unused search data) and data of records whose data type is “P” are read out, and they are sub-search terms (keywords for searching the sub-cited invention by full-text search) ).

続くステップ９２では、ＣＰＵ１１が副引用発明の検索処理、すなわち、副検索タームを検索キーワードに用いて、対象公報記憶部１５５に記憶されている検索対象公報データについて全文検索を行う。続くステップ９３では、ステップ９２でヒットした文献があったか否かが判定され、ヒットした文献があればステップ９４に処理が進み、そうでなければ処理がステップ９６に進む。ステップ９４では、ヒットした文献が複数あったか否かが判定され、ヒットした文献が複数なければステップ９５に処理が進み、ヒットした文献が複数あれば処理がステップ９８に進む。 In the subsequent step 92, the CPU 11 performs a full-text search on the search target publication data stored in the target publication storage section 155 using the search process of the subcitation invention, that is, the sub search term as a search keyword. In the following step 93, it is determined whether or not there is a document hit in step 92. If there is a hit document, the process proceeds to step 94, and if not, the process proceeds to step 96. In step 94, it is determined whether or not there are a plurality of hit documents. If there are no hit documents, the process proceeds to step 95. If there are a plurality of hit documents, the process proceeds to step 98.

ステップ９５では、ヒットした文献が副引用文献（副引用発明が開示されている先行技術文献）にセットされて副引用発明検索処理が終了する。ステップ９６では、副検索タームが変更されて再び全文検索が行われる。ここでは、副検索タームが、データ種別が"Ｃ"で、項番エリア１５３ｂの番号が項番カウンタに相当するレコードの主検索文書データに含まれていない検索未使用データと、データ種別が"Ｔ"のレコードのデータに変更される。次のステップ９７でヒットした文献があったか否かが判定され、ヒットした文献があればステップ９４に処理が進み、なければ副引用発明検索処理が終了する。さらに、ステップ９８では、ヒットした文献のそれぞれについて、データ種別が"Ｃ"で、項番エリア１５３ｂの番号が項番カウンタに相当するレコードの必須フラグＥｆが"Ｘ"の用語との一致数がカウントされ、その一致数が副引用ファイルにセットされる。次のステップ９９で副引用ファイルが一致数の降順にソートされ、続くステップ１００で副引用ファイルの先頭から３件が副引用文献にセットされ、その後、副引用発明検索処理が終了する。副引用発明検索で複数の文献がヒットしたときは、そのそれぞれについて、予測対象発明の特徴部分がどの程度開示されているのかが、必須フラグＥｆが"Ｘ"の用語との一致数で調べられ、その一致数の多い文献が副引用文献にセットされる。 In step 95, the hit document is set in the sub-cited document (prior art document in which the sub-cited invention is disclosed), and the sub-cited invention search process ends. In step 96, the sub-search term is changed and the full-text search is performed again. Here, the sub-search term is the data type “C”, the search number unused data not included in the main search document data of the record corresponding to the item number counter with the data type “C”, and the data type “ It is changed to the data of the record of “T”. In step 97, it is determined whether or not there is a hit document. If there is a hit document, the process proceeds to step 94. If not, the sub-cited invention search process is terminated. Further, in step 98, for each of the hit documents, the number of matches with the term having the data type “C” and the number of the item number area 153b corresponding to the item number counter is “X”. It is counted and the number of matches is set in the subcitation file. In the next step 99, the sub-cited files are sorted in descending order of the number of matches, and in the subsequent step 100, three sub-cited files from the top of the sub-cited file are set in the sub-cited document, and then the sub-cited invention search process ends. When a plurality of documents are hit in the sub-cited invention search, the extent to which the feature part of the prediction target invention is disclosed is examined for each of them by the number of matches with the term whose essential flag Ef is “X”. A document with a large number of matches is set as a sub-cited document.

そして、ＣＰＵ１１は図２２に示すフローチャートに沿って、従属項検索処理を実行する。従属項検索処理は、検索フラグ（検索ｆｌａｇ）が"ＶＸ"または"ＶＹ"であったとき（主引用発明がみつかったとき）だけ実行される。ＣＰＵ１１が従属項検索処理を開始すると、ステップ１１１に処理が進み、要旨データ記憶部１５３から、データ種別が"Ｃ"で、項番エリア１５３ｂの番号が項番カウンタに相当するレコード（項番カウンタには、ステップ６４で独立項の最小値に順次"１"が加算される）から必須フラグＥｆが"Ｘ"の用語が読み出され、それが従属検索ターム（従属項に記載されている発明を全文検索で検索するときのキーワード）に設定される。次のステップ１１２でＣＰＵ１１が従属検索タームを検索キーワードに用いて、主引用文献について全文検索を行い、従属項に記載されている発明が主引用文献に開示されているか否かを調べる。 And CPU11 performs a dependent term search process according to the flowchart shown in FIG. The dependent term search process is executed only when the search flag (search flag) is “VX” or “VY” (when the main cited invention is found). When the CPU 11 starts the subordinate item search process, the process proceeds to step 111, and a record (item number counter) from the summary data storage unit 153 corresponding to the item number counter with the data type “C” and the item number area 153b number. In step 64, the term having the required flag Ef of “X” is read from the minimum value of the independent term in order at step 64, and this is the dependent search term (invention described in the dependent term). Is set as a keyword when searching in full-text search. In the next step 112, the CPU 11 performs a full-text search for the main cited document using the subordinate search term as a search keyword, and checks whether the invention described in the dependent claim is disclosed in the main cited document.

次のステップ１１３で、ヒットした文献があったか否かが判定され、ヒットした文献があればステップ１１４に処理が進み、そうでなければ処理がステップ１１６に処理が進む。ステップ１１４では、該当する請求項ナンバの検索フラグ（検索ｆｌａｇ）に"ＶＸ"がセットされ、ヒットした文献が該当する請求項ナンバの主引用文献にセットされる。その後、処理がステップ１１５に進み、ＣＴデータ記憶部１５４に記憶されているレコードのうち、ナンバエリア１５４ｂの請求項ナンバが項番カウンタに一致するレコードについて、サーチフラグエリア１５４ｅのサーチフラグＥｆに"９"がセットされ、その後、従属項検索処理が終了する。また、ステップ１１６では、従属検索タームで副引用文献について全文検索が行われ、次のステップ１１７で、ヒットした文献があったか否かが判定される。ヒットした文献があればステップ１１８を実行したあとステップ１１５に進み、なければ従属項検索処理が終了する。ステップ１１８では、該当する請求項ナンバの検索フラグ（検索ｆｌａｇ）に"ＶＹ"がセットされ、ヒットした文献が該当する請求項ナンバの副引用文献にセットされる。 In step 113, it is determined whether there is a hit document. If there is a hit document, the process proceeds to step 114. If not, the process proceeds to step 116. In step 114, "VX" is set in the search flag (search flag) of the corresponding claim number, and the hit document is set as the main cited reference of the corresponding claim number. Thereafter, the process proceeds to step 115, and among the records stored in the CT data storage unit 154, the record whose claim number in the number area 154b matches the item number counter is set in the search flag Ef in the search flag area 154e. 9 "is set, and then the dependent term search process ends. Further, in step 116, a full-text search is performed for the sub-cited document by the subordinate search term, and in the next step 117, it is determined whether or not there is a hit document. If there is a hit document, step 118 is executed and then the process proceeds to step 115. If not, the dependent term search process is terminated. In step 118, “VY” is set in the search flag (search flag) of the corresponding claim number, and the hit document is set in the sub-citation document of the corresponding claim number.

以上のようにして、新規性・拡大先願予測ルーチンと、進歩性予測ルーチンとが実行されると、それぞれの結果に応じて、新規性・拡大先願予測データＮｄと、進歩性予測データＶｄ１とが予測結果ファイル生成部１２７に出力される。また、機械学習部１３３から要件適否文書ベクトルＶ４が出力されるので、これらを用いて予測結果ファイル生成部１２７が図１０に示した予測結果ファイルを生成し、予測結果記憶部１５６に記憶させる。 As described above, when the novelty / expansion prior application prediction routine and the inventive step prediction routine are executed, the novelty / expansion prior application prediction data Nd and the inventive step prediction data Vd1 according to the respective results. Is output to the prediction result file generation unit 127. Further, since the requirement adequacy document vector V4 is output from the machine learning unit 133, the prediction result file generation unit 127 generates a prediction result file illustrated in FIG. 10 using these, and stores the prediction result file in the prediction result storage unit 156.

予測結果ファイルは、図１０に示すように、公開番号、請求項、主検索文書データ、副検索ターム、検索フラグ、ヒット文献、マシン予測の各項目のデータが予測対象出願ごとに記憶されている。マシン予測とは、機械学習記憶部１３３からの要件適否文書ベクトルＶ４に応じたデータであって、進歩性予測ルーチンで見つかった主引用文献および副引用文献を引用した進歩性違反の拒絶理由が見つかる可能性が高いか低いか（高い場合は"Ｈ"、低い場合は"Ｌ"）を示している。 As shown in FIG. 10, the prediction result file stores the data of each item of publication number, claim, main search document data, sub search term, search flag, hit document, and machine prediction for each prediction target application. . The machine prediction is data corresponding to the requirement conformity document vector V4 from the machine learning storage unit 133, and the reason for refusal of the inventive step violation that cited the main cited document and the sub cited document found in the inventive step prediction routine is found. The possibility is high or low (“H” when high, “L” when low).

また、予測結果編集部１０５が予測結果ファイルを読み込み、図２５に示すような特許要件適否予測リストＬ１を編集および出力して、ユーザ端末装置３０に送信する。特許要件適否予測リストＬ１には、予測対象出願の出願公開番号、請求項の番号とともに、新規性（拡大先願）、進歩性の要件適否がその根拠となる文献（主引用文献、副引用文献）とともに示されている。新規性（拡大先願）、進歩性の要件に適合しない（満たさない）と予測される場合は、"Ｘ"、適合する（満たす）と予測される場合は"Ａ"が記載される。これらは、予測結果ファイルの検索フラグで判断される。 Further, the prediction result editing unit 105 reads the prediction result file, edits and outputs a patent requirement suitability prediction list L 1 as shown in FIG. 25, and transmits it to the user terminal device 30. The patent requirement propriety prediction list L1 includes the application publication number of the application to be predicted, the number of claims, and documents on which the suitability of requirements of novelty (expanded prior application) and inventive step is based (main cited document, sub cited document) ). “X” is described when it is predicted that the requirement of novelty (expanded prior application) and inventive step will not be met (does not meet), and “A” is recorded when it is predicted that it will meet (satisfy). These are determined by the search flag of the prediction result file.

進歩性の要件に適合しないと予測される場合の"Ｘ"（主引用文献、副引用文献有り）、"Ｙ"（主引用文献のみ有り）、には、"Ｈ"、"Ｌ"が併記される（図２５では、"Ｈ"が併記されるばあいのみ例示）これは、機械学習部１３３の要件適否文書ベクトルＶ４にしたがったもので、"Ｈ"は主引用文献で進歩性違反の拒絶理由が発行される可能性が高い場合、"Ｌ"は低い場合を示している。 “H” and “L” are also written in “X” (main citation and sub-citation) and “Y” (only main citation) when predicted not to meet the inventive step requirement. (In FIG. 25, only when “H” is written together) This is in accordance with the requirement adequacy document vector V4 of the machine learning unit 133, and “H” is a violation of inventive step in the main cited document. When the possibility of refusal is likely to be issued, “L” indicates a low case.

以上のように、本発明の実施の形態にかかる特許要件適否予測サーバ１０では、予測対象出願の記載事項から要旨データを生成し、これを用いて主引用発明検索、副引用発明検索を行っている。主引用発明検索は、予測対象出願とその骨格において共通する、すなわち、先行技術発明のうち、予測対象発明に最も近い主引用発明を要旨データで探し出す処理であり、特許法や特許・実用新案審査基準に沿って行われる。副引用発明検索は、主引用発明検索で主引用発明が見つかった場合に、発明が解決しようとする課題や、技術分野を特定する用語を用いた全文検索で行われており、これも特許法や特許・実用新案審査基準に沿って行われる。したがって、本発明の実施の形態にかかる特許要件適否予測サーバ１０では、特許要件の適否に関する予測が審査実務に適合した内容で行われるので、特許出願の監視負担を有効に軽減することができる。 As described above, the patent requirement conformity prediction server 10 according to the embodiment of the present invention generates summary data from the description items of the prediction target application, and uses this to perform main citation invention search and sub-citation invention search. Yes. The main citation invention search is common to the prediction target application and its skeleton, that is, the process of searching the main citation invention closest to the prediction target invention among the prior art inventions in the summary data. Patent law and patent / utility model examination Done in line with standards. The sub-cited invention search is performed by a full-text search using a term specifying a problem to be solved by the invention or a technical field when the main cited invention is found by the main cited invention search. And patent and utility model examination standards. Therefore, in the patent requirement conformity prediction server 10 according to the embodiment of the present invention, the prediction regarding the propriety of the patent requirement is performed with the content suitable for the examination practice, so that the monitoring burden of the patent application can be effectively reduced.

一方、前述したように、特許要件適否予測処理部１０３が機械学習部１３３を有しているが、その機械学習部１３３は過去の審査実績に基づく学習データで訓練された人工知能プログラムで構築されている。 On the other hand, as described above, the patent requirement suitability prediction processing unit 103 has the machine learning unit 133, and the machine learning unit 133 is constructed by an artificial intelligence program trained with learning data based on past examination results. ing.

ところで、平成２６年の実績ベースで年間３２万数千件程度の特許出願が出されており、その一部またはそれ以前の多数の特許出願について１ｓｔアクションがすでに発行されている。その中には、拒絶理由通知で進歩性違反の拒絶理由が指摘されている出願（進歩性拒絶出願）が多数存在している。 By the way, about 320,000 or more patent applications are filed annually on a performance basis in 2014, and the 1st action has already been issued for some or many patent applications before that. Among them, there are many applications (progressive rejection applications) in which the reasons for refusal of violation of inventive step are pointed out in the notice of reasons for refusal.

進歩性拒絶出願では、審査結果が、審査時点の請求項に記載された発明と主引用発明とに相違点があったものの、その相違点だけでは、進歩性があるとは審査官によって判断されなかったということを意味している。これに対し、特許出願の中には、１ｓｔアクションが発行されることなく特許査定が発行された出願や、拒絶理由通知が発行されたものの、その理由に進歩性違反の拒絶理由が指摘されていなかった出願（進歩性拒絶無し出願）も存在している。 In the inventive step rejection application, the examination results differed between the invention described in the claim at the time of examination and the main cited invention. However, the examiner determined that there was an inventive step based only on the difference. It means that there was no. On the other hand, in patent applications, applications for which a patent assessment was issued without the first action being issued, or a reason for refusal was issued, but the reasons for refusal of the inventive step violation were pointed out. There are applications that did not exist (applications without inventive step rejection).

そして、例えば図２３に示すように、審査対象となる特許出願Ｐｄがあり、その出願日がｔ_０であったとすると、特許出願Ｐｄに対する主引用発明または副引用発明となりえるのは、公知、公用、文献公知およびインターネット公知の発明であり、主に出願日ｔ_０より前にすでに公開されている出願の特許公開公報（図２３では、ｒｆ１〜ｒｆ６）に開示されている発明である。 Then, for example, as shown in FIG. 23, if there is a patent application Pd to be examined and its filing date is t ₀ , the main cited invention or sub-cited invention for the patent application Pd can be known or publicly used. are known from the literature and the Internet known invention is primarily (in FIG. 23, rf1~rf6) filed t ₀ Patent Laid-open application previously have been published from the invention disclosed in.

ここで、仮に、審査の結果、公報ｒｆ６に開示されている発明が主引用発明に該当すると判断されたとする。すると、その場合、特許出願Ｐｄに係る発明と、その公報ｒｆ６に開示されている発明とに相違点があったものの、その相違点に応じた距離ｄｐが、特許出願Ｐｄに係る発明の進歩性を肯定するに足りる大きさではなかったと考えられる。逆に、公報ｒｆ６に開示されている発明が主引用発明には該当しないと判断されていたとすれば、距離ｄｐが、出願Ｐｄに係る発明の進歩性を肯定するに足りる大きさであったと考えられる。 Here, it is assumed that, as a result of examination, it is determined that the invention disclosed in the publication rf6 corresponds to the main cited invention. In this case, although there is a difference between the invention according to the patent application Pd and the invention disclosed in the publication rf6, the distance dp corresponding to the difference is the inventive step of the invention according to the patent application Pd. It is thought that it was not large enough to affirm. On the other hand, if it is determined that the invention disclosed in the publication rf6 does not fall under the main cited invention, it is considered that the distance dp is large enough to affirm the inventive step of the application related to the application Pd. It is done.

もし、発明の進歩性が肯定されるときの相違がどの程度で、否定されるときの相違がどの程度なのかが割り出せれば、それが特許要件適否の客観的な判断材料になると考えられるが、以上を考慮すると、そのためには、２つの発明の相違に応じた距離ｄｐがどの程度なのかを割り出すのが有効であると考えられる。これを過去の審査実績に基づく訓練データの学習によって割り出し、進歩性が否定されるおそれが高いのか、それとも低いのかの目安を付けるのが機械学習部１３３である。 If it is possible to determine how much the difference is when the inventive step is affirmed and what is the difference when the invention is denied, it can be considered as an objective judgment material for whether or not the patent requirement is met. In view of the above, for that purpose, it is considered effective to determine the distance dp according to the difference between the two inventions. The machine learning unit 133 determines this by learning the training data based on the past examination results, and gives a measure of whether the inventive step is highly likely to be denied or low.

機械学習部１３３の学習において、本件出願にかかる発明（本願発明）では、距離ｄｐを２つの文書ベクトルの差分と捉え、進歩性の拒絶理由有りの場合、無しの場合それぞれの距離ｄｐを学習するため、前述のＨＬパターンによる訓練データで学習が行われている。 In the learning of the machine learning unit 133, in the invention according to the present application (the invention of the present application), the distance dp is regarded as a difference between two document vectors, and the distance dp is learned when there is a reason for refusal of inventive step and when there is no reason for rejection. Therefore, learning is performed using the training data based on the above-described HL pattern.

そして、特許要件の適否を予測する場合は、予測対象出願について、その要旨データを求め、それを用いて概念検索で最類似文献を探し出す。最類似文献は、予測対象出願の文書ベクトル（正確には、独立項の記載事項などから求めた文書ベクトル）に最も類似度が高い文書ベクトルを有しているので、公開済出願の中で主引用文献になる可能性が最も高いと認められる。 Then, when predicting the suitability of patent requirements, the gist data is obtained for the prediction target application, and the most similar document is searched by using the concept data. Since the most similar document has the document vector having the highest similarity to the document vector of the prediction target application (more precisely, the document vector obtained from the description of the independent item), it is the main document in the published application. It is recognized that it is most likely to be cited.

その最類似文献から求めた引用候補ベクトルＲｆＶと、予測対象出願の要旨データから求めた要旨ベクトルＥＶとの差分を求めて要旨移動ベクトルＶ３を生成し、これを機械学習部１３３に入力して、主引用発明検索で見つかった主引用文献を引用する進歩性違反の拒絶理由が有るのか、無いのかが出力されるようにしている。これにより、進歩性違反の拒絶理由が見つかる可能性が高いのか、低いのかの目安を付けることが可能になる。 The difference between the citation candidate vector RfV obtained from the most similar document and the summary vector EV obtained from the summary data of the prediction target application is obtained to generate a summary movement vector V3, which is input to the machine learning unit 133, Whether or not there is a reason for refusal of inventive step violation that cites the main cited document found in the main cited invention search is output. As a result, it is possible to give an indication of whether the reason for refusal of violating the inventive step is high or low.

以上のように、特許要件適否予測サーバ１０では、機械学習部１３３を備えていることによって、特許庁の審査実績を反映させる形で特許要件適否の予測が行われることになる。従前のような審査官や弁理士などの専門家の経験や勘だけに頼らざるを得ない判断結果に人工知能の判断結果を生かせるようになるため、予測結果に客観性を持たせることが可能になり、他社出願の監視負担の軽減や、自社出願の審査請求有無の判断効率化などとも相まって業務効率の向上が期待できる。 As described above, the patent requirement suitability prediction server 10 includes the machine learning unit 133, so that the patent requirement suitability is predicted in a manner that reflects the examination results of the Patent Office. Since the judgment results of artificial intelligence can be used for judgment results that must rely solely on the experience and intuition of experts such as examiners and patent attorneys as before, it is possible to make the prediction results objective Therefore, it can be expected that business efficiency will be improved in combination with reduction of the monitoring burden of other company's applications and more efficient judgment of whether or not to request examination of in-house applications.

また、進歩性予測処理部１２６が主引用発明検索では概念検索を行い、副引用発明検索で全文検索を行っている。進歩性違反の拒絶理由が有るのかどうかは主引用発明が見つかるか否かが大きく左右するが、その主引用発明を探す主引用発明検索で全文検索を行うと、複数の文献がヒットする可能性があり、主引用発明（主引用文献）を特定できない場合がある。この点、概念検索では、文書ベクトルの内積から求めた類似度にしたがい類似している文献が順番付けされるので、最も類似度の高い文献を選ぶことで主引用文献を特定できる。こうして見つけた主引用文献に機械学習部１３３による予測を併用することで、その主引用文献を引用した進歩性違反の拒絶理由が出るおそれが高いのか、低いのかを予測することができる。また、副引用発明検索で全文検索を行うことで、副引用文献があるのかどうかを明確にすることができる。 Further, the inventive step prediction processing unit 126 performs a concept search in the main citation invention search and performs a full text search in the sub citation invention search. Whether or not there is a reason for refusal to violate the inventive step largely depends on whether or not the main cited invention is found, but if a full text search is performed in the main cited invention search to find the main cited invention, there is a possibility that multiple documents will be hit In some cases, the main cited invention (main cited document) cannot be specified. In this regard, in the concept search, similar documents are ordered according to the similarity obtained from the inner product of the document vectors. Therefore, the main cited document can be specified by selecting the document having the highest similarity. By combining the prediction by the machine learning unit 133 with the main citation document found in this way, it is possible to predict whether the reason for refusal of the inventive step violation that cited the main citation document is high or low. In addition, it is possible to clarify whether there is a sub-cited document by performing a full text search by sub-cited invention search.

以上の説明では、より好ましい実施の形態として、進歩性予測処理部１２６の引用発明検索部１３１が主引用発明検索部および副引用発明検索部を有している場合を示している。前述したように、主引用発明検索によって主引用発明が見つかると、進歩性無しの拒絶理由が見つかる可能性が高いから、主引用発明が見つかったら、その後は副引用発明検索を行うことなく入力ベクトル生成部１３２と、機械学習部１３３を作動させて要件適否文書ベクトルＶ４を出力するようにしてもよい。この場合でも、前述のステップ８９でセットされる主引用文献に関する進歩性予測データＶｄ１と、機械学習部１３３により生成される要件適否文書ベクトルＶ４とを併用することで、審査実務に適合した内容の予測が行えるのであって、しかもその予測は人工知能の判断結果を生かしたものとなるから、予測結果に客観性を持たせることができる。したがって、進歩性予測処理部１２６が主引用発明検索部を有していればよく、副引用発明検索部を有していなくてもよいが、上記のように、副引用発明検索部を有する進歩性予測処理部１２６の方がより好ましい。 In the above description, as a more preferred embodiment, a case where the cited invention search unit 131 of the inventive step prediction processing unit 126 has a main cited invention search unit and a sub-cited invention search unit is shown. As described above, when the main cited invention is found by the main cited invention search, there is a high possibility that the reason for refusal without inventive step will be found. The generation unit 132 and the machine learning unit 133 may be operated to output the requirement suitability document vector V4. Even in this case, by using together the inventive step prediction data Vd1 related to the main cited document set in step 89 and the requirement suitability document vector V4 generated by the machine learning unit 133, the content suitable for the examination practice can be obtained. Since the prediction can be performed and the prediction uses the judgment result of the artificial intelligence, the prediction result can have objectivity. Therefore, it is sufficient that the inventive step prediction processing unit 126 has the main citation invention search unit and does not have to have the sub citation invention search unit, but as described above, the progress having the sub citation invention search unit. The sex prediction processing unit 126 is more preferable.

第２の実施の形態
続いて、第２の実施の形態に係る特許要件適否予測サーバ２００について、図２６〜図３０を参照して説明する。特許要件適否予測サーバ２００は、図２６に示すように、前述した特許要件適否予測サーバ１０と比較して、特許要件適否予測処理部１０３、予測結果編集処理部１０５、予測結果記憶部１５６の代わりに特許要件適否予測処理部２０３、予測結果編集処理部２０５、予測結果記憶部２５６を有する点と、予測結果リストＬ１の代わりに予測結果リストＬ２を出力する点とで相違している。 Second Embodiment Subsequently, a patent requirement suitability prediction server 200 according to a second embodiment will be described with reference to FIGS. 26 to 30. As shown in FIG. 26, the patent requirement suitability prediction server 200 is replaced with the patent requirement suitability prediction processing unit 103, the prediction result editing processing unit 105, and the prediction result storage unit 156, as compared with the patent requirement suitability prediction server 10 described above. 3 further includes a patent requirement suitability prediction processing unit 203, a prediction result editing processing unit 205, and a prediction result storage unit 256, and a point that the prediction result list L2 is output instead of the prediction result list L1.

特許要件適否予測処理部２０３は、図２７に示すように、特許要件適否予測処理部１０３と比較して、進歩性予測処理部１２６と予測結果ファイル生成部１２７の代わりに進歩性予測処理部２２６と予測結果ファイル生成部２２７を有する点で相違している。 As shown in FIG. 27, the patent requirement suitability prediction processing unit 203 is different from the patent requirement suitability prediction processing unit 103 in that an inventive step prediction processing unit 226 is used instead of the inventive step prediction processing unit 126 and the prediction result file generation unit 127. And the prediction result file generation unit 227 is different.

そして、進歩性予測処理部２２６は、進歩性予測処理部１２６と比較して、引用発明検索部１３１と機械学習部１３３の代わりに引用発明検索部２３１と機械学習部２３３を有する点と、入力ベクトル生成部１３２の動作が異なる点とで相違している。 And the inventive step prediction processing unit 226 is different from the inventive step prediction processing unit 126 in that it has a cited invention search unit 231 and a machine learning unit 233 instead of the cited invention search unit 131 and the machine learning unit 133, and an input The difference is that the operation of the vector generation unit 132 is different.

前述した第１の実施の形態に係る進歩性予測処理部１２６では、主引用発明検索を行うことによって、最類似文献だけを主引用文献にセットしているが（ステップ８９）、第２の実施の形態に係る進歩性予測処理部２２６では、類似度の降順に最類似文献を含むｎ件の文献（ｎは２以上の整数）を類似文献として抽出し、それら各類似文献を主引用文献にセットしている。また、機械学習部２３３が、ｎ件の要旨移動ベクトルＶ３_１〜Ｖ３_ｎをそれぞれ後述するＳクラス、Ｈクラス、Ｌクラスの３つのクラスに分類する。 In the inventive step predictive processing unit 126 according to the first embodiment described above, only the most similar document is set as the main cited document by performing the main cited invention search (step 89). In the inventive step predictive processing unit 226, n documents including the most similar documents in descending order of similarity (n is an integer of 2 or more) are extracted as similar documents, and these similar documents are used as main cited documents. It is set. In addition, the machine learning unit 233 classifies the _n summary movement vectors V3 _{1 to} V3 _n into three classes of S class, H class, and L class, which will be described later.

引用発明検索部２３１は、引用発明検索部１３１と同様に主引用発明検索部および副引用発明検索部を有しているが、引用発明検索部１３１と比較して、主引用発明検索部の動作が異なり、出力されるデータも異なる。引用発明検索部１３１では、主引用発明検索部が主引用発明検索を行うことによって、最類似文献を主引用文献にセットしているが（前述した主引用発明検索処理のステップ８９）、引用発明検索部２３１の主引用発明検索部は、ステップ８９において、類似度の降順に最類似文献を含むｎ件の文献を類似文献として抽出し、それらを主引用文献（ｄｏｃ_１〜ｄｏｃ_ｎ）にセットする。また、主引用発明検索部の動作が異なることに伴い、前述した独立項検索処理におけるステップ７６でＣＰＵ１１が各類似文献に応じたｎ件の進歩性予測データＶｄ１_１〜Ｖｄ１_ｎを生成して、それらを予測結果ファイル生成部２２７に出力する。また、独立項検索処理において、ＣＰＵ１１は請求項要旨データｉｅｄを入力ベクトル生成部１３２に出力するが、各類似文献に応じたｎ件の概念検索データＶｄ２_１〜Ｖｄ２_ｎを入力ベクトル生成部１３２に出力する。 The cited invention search unit 231 has a main cited invention search unit and a sub-cited invention search unit similar to the cited invention search unit 131, but the operation of the main cited invention search unit compared to the cited invention search unit 131. Are different, and the output data is also different. The cited invention search unit 131 sets the most similar document as the main cited document by the main cited invention search unit performing the main cited invention search (step 89 of the above-described main cited invention search process). In step 89, the main citation invention search unit of the search unit 231 extracts n documents including the most similar documents in descending order of similarity as similar documents, and sets them as the main cited documents (doc _{1 to} doc _n ). To do. The main cited invention with the operation of the search part vary, and generates n matter of inventive step prediction data Vd1 ₁ ~Vd1 _n corresponding the CPU11 at step 76 in independent claim retrieval processing described above to each similar document, They are output to the prediction result file generation unit 227. Further, in the independent claims search process, CPU 11 is output to the claims subject matter data ied to the input vector generating unit 132, a concept search data _Vd2 1 _~Vd2 _n of n matter in accordance with the similar documents to the input vector generator 132 Output.

機械学習部２３３は、機械学習部１３３と比較して、次に述べるＳＨＬパターンを学習パターンに用いた機械学習（教師付き学習）によって、各類似文献に応じたｎ件の要旨移動ベクトルＶ３_１〜Ｖ３_ｎをそれぞれＳクラス、Ｈクラス、Ｌクラスのいずれかに分類し、その分類結果に応じたｎ件の出力信号（要件適否文書ベクトルＶ４_１〜Ｖ４_ｎ）を出力するように構築されている。Ｓクラス、Ｈクラス、Ｌクラスは、それぞれ、進歩性の要件に適合しない可能性が極めて高いクラス、適合しない可能性が高いクラス、適合するクラス（予測対象出願について、進歩性違反の拒絶理由が見つかる可能性が極めて高いクラス、高いクラス、無いクラス）に相当している。 Compared with the machine learning unit 133, the machine learning unit 233 performs n pieces of abstract movement vectors V3 ₁ to V3 ₁ corresponding to each similar document by machine learning (supervised learning) using the SHL pattern described below as a learning pattern. V3 _n is classified into one of S class, H class, and L class, respectively, and n output signals (requirement conformity document vectors V4 _{1 to} V4 _n ) corresponding to the classification result are output. . The S class, the H class, and the L class are classes that are highly likely not to meet the inventive step requirement, classes that are highly unlikely to fit, and conforming classes (the reason for refusal of inventive step violations for the predicted application). Class that is very likely to be found, a high class, a class that is not found).

また、ＳＨＬパターンは、次のパターンＳ、Ｈ、Ｌの３つのパターンの組み合わせである。
パターンＳ：学習文書ベクトルが第１の学習文書ベクトルで教師ベクトルが新規性および進歩性の拒絶理由有りを示すベクトル（例えば、正解のクラスに対応した次元だけが"１"で、他が"０"のベクトル）との組み合わせ
パターンＨ：学習文書ベクトルが第２の学習文書ベクトルで教師ベクトルが進歩性の拒絶理由有りで新規性の拒絶理由無しを示すベクトル（例えば、正解のクラスに対応した上記とは別の次元だけが"１"で、他が"０"のベクトル）との組み合わせ
パターンＬ：学習文書ベクトルが第３の学習文書ベクトルで教師ベクトルが進歩性の拒絶理由無しを示すベクトル（例えば、上記２つとは別の次元だけが"１"で、他が"０"のベクトル）との組み合わせのパターンである。 The SHL pattern is a combination of the following three patterns S, H, and L.
Pattern S: The learning document vector is the first learning document vector, and the teacher vector is a vector indicating that there is a reason for rejection of novelty and inventive step (for example, only the dimension corresponding to the correct class is "1" and the others are "0") Combination pattern H with “vector” of: the learning document vector is the second learning document vector, the teacher vector is a vector indicating the reason for refusal of inventive step and no reason for refusal of novelty (for example, the above corresponding to the correct class) A combination pattern L: a vector in which the learning document vector is the third learning document vector and the teacher vector has no reason for refusal of inventive step. For example, it is a combination pattern with a vector whose only dimension different from the above two is “1” and the other is “0”.

第１の学習文書ベクトルは、公開済出願の中で特許庁の審査の結果、初めての拒絶理由通知（１ｓｔアクション）が発行された出願であって、その１ｓｔアクションで、同じ文献を引用して新規性および進歩性違反の拒絶理由（特許法第２９条第１項第３号および同条第２項の要件を満たしていないとする拒絶理由）が指摘されていた出願（新規性・進歩性拒絶出願）の該拒絶理由が指摘されていた（拒絶理由通知発行時点の）請求項に応じた文書ベクトルと、そのときの引用文献１（主たる刊行物として引用されていた第１の主引用刊行物）に応じた文書ベクトル（第１の引用文書ベクトル）との差分に応じた第１の移動文書ベクトルである。 The first learning document vector is an application for which the first notice of reasons for refusal (1st action) has been issued as a result of examination by the Patent Office in a published application, and the same document is cited in the 1st action. An application (novelty / inventive step) in which the reason for refusal of violation of novelty or inventive step (the reason for refusal that the requirements of Article 29 (1) (iii) and Article 2 (2) of the Patent Act are not met) was pointed out The document vector corresponding to the claim (at the time of issuance of the reason for refusal) where the reason for refusal of the application for refusal was pointed out, and the cited reference 1 at that time (the first main cited publication cited as the main publication) The first moving document vector corresponding to the difference from the document vector corresponding to the object (first quoted document vector).

第２の学習文書ベクトルは、公開済出願の中で特許庁の審査の結果、初めての拒絶理由通知（１ｓｔアクション）が発行された出願であって、その１ｓｔアクションで、新規性の拒絶理由（特許法第２９条第１項第３号の要件を満たしていないとする拒絶理由）は指摘されていないが、進歩性違反の拒絶理由（同条第２項の要件を満たしていないとする拒絶理由）が指摘されていた出願（進歩性拒絶出願）の該拒絶理由が指摘されていた（拒絶理由通知発行時点の）請求項に応じた文書ベクトルと、そのときの引用文献１（主たる刊行物として引用されていた第２の主引用刊行物）に応じた文書ベクトル（第２の引用文書ベクトル）との差分に応じた第２の移動文書ベクトルである。 The second learning document vector is an application in which the first notice of reasons for refusal (1st action) is issued as a result of examination by the Patent Office in the published application, and the reason for refusal of novelty ( The reason for refusal to violate the inventive step (rejection not satisfying the requirement of paragraph 2 of the same Article) is not pointed out, but the reason for refusal of violating the inventive step is not pointed out The document vector corresponding to the claim (at the time of issuance of the reason for refusal) and the cited reference 1 (main publication) where the reason for refusal of the application (reasonable refusal application) was pointed out The second moving document vector corresponding to the difference from the document vector (second cited document vector) corresponding to the second main cited publication cited as.

第３の学習文書ベクトルは、公開済出願の中で審査の結果、１ｓｔアクションが発行されずに特許査定が発行された出願（拒絶無し出願）または１ｓｔアクションは発行されたがその拒絶理由に進歩性違反の拒絶理由が指摘されていなかった出願（進歩性拒絶無し出願）の（拒絶理由通知が発行された時点の）請求項１に応じた文書ベクトルと、それら拒絶無し出願または進歩性拒絶無し出願を対象とする概念検索の結果、最も類似度が高いとされる文献（学習用最類似文献）に応じた文書ベクトル（非引用文書ベクトル）との差分に応じた第３の移動文書ベクトルである。 The third learning document vector is the result of examination in the published application, where the first action is not issued and the patent decision is issued (non-rejection application) or the first action is issued but progresses to the reason for refusal Document vectors according to claim 1 (at the time of the notice of reasons for refusal) of applications for which no reason for refusal of sex violation has been pointed out (applications without inventive refusal), and those applications without refusal or inventive refusal As a result of the concept search for the application, a third moving document vector corresponding to the difference from the document vector (non-cited document vector) corresponding to the document (the most similar document for learning) that is considered to have the highest similarity. is there.

機械学習部２３３は、機械学習部１３３と同様、情報処理に脳神経回路網をモデルにしたニューラルネットワークを適用することができるが、そのうちのＢＰ（バックプロパゲーション）ネットワークを適用することが好ましい。 As with the machine learning unit 133, the machine learning unit 233 can apply a neural network modeled on a cranial nerve network for information processing, and preferably uses a BP (back propagation) network among them.

そして、進歩性予測処理部１２６の入力ベクトル生成部１３２は図２８に示すように、要旨ベクトル生成部１３２ａと、引用候補ベクトル生成部１３２ｂと、移動ベクトル生成部１３２cとを有しているが、そのうちの引用候補ベクトル生成部１３２ｂと、移動ベクトル生成部１３２cの動作が異なっている。すなわち、引用候補ベクトル生成部１３２ｂは、引用発明検索部２３１からｎ件の概念検索データＶｄ２_１〜Ｖｄ２_ｎが入力されるので、そのそれぞれに含まれる各類似文献の公開公報データを入力してその特徴語を抽出し、各語に応じた重み付けを行って各類似文献に応じたｎ件の文書ベクトル（引用候補ベクトル）ＲｆＶ_１〜ＲｆＶ_ｎを生成する。移動ベクトル生成部１３２ｃは、要旨ベクトルＥＶと、各引用候補ベクトルＲｆＶ_１〜ＲｆＶ_ｎとの差分を計算して、双方の文書ベクトルの差分に応じたｎ件の要旨移動ベクトルＶ３_１〜Ｖ３_ｎを生成する。 As shown in FIG. 28, the input vector generation unit 132 of the inventive step prediction processing unit 126 includes a gist vector generation unit 132a, a citation candidate vector generation unit 132b, and a movement vector generation unit 132c. Among them, the operations of the citation candidate vector generation unit 132b and the movement vector generation unit 132c are different. That is, reference candidate vector generation part 132b, since the concept search data Vd2 ₁ ~Vd2 _n of n matter from the cited invention search unit 231 is input, and inputs the publication data of each similar documents included in that each of which Feature words are extracted and weighted according to each word to generate _n document vectors (citation candidate vectors) RfV _{1 to} RfV _n corresponding to each similar document. Movement vector generation unit 132c includes a summary vector EV, the difference between the reference candidate vectors _RfV 1 _~RfV _n by calculating the n matter gist movement vector _V3 1 to V3 _n for corresponding to the difference between both the document vector Generate.

前述したように、各類似文献は、主引用発明検索部による概念検索によって、最も高い類似度からその降順に抽出した文献であるため、そのいずれも予測対象発明の審査で、主引用発明の開示文献として引用される確率が高いと推測される。そのため、各類似文献を引用候補として引用候補ベクトルＲｆＶ_１〜ＲｆＶ_ｎを求め、これらと要旨ベクトルＥＶとの差分を計算して要旨移動ベクトルＶ３_１〜Ｖ３_ｎを求めれば、予測対象発明と、各類似文献に開示されている発明との相違に応じた要旨移動ベクトルＶ３_１〜Ｖ３_ｎが生成される。 As described above, since each similar document is a document extracted in descending order from the highest similarity by the concept search by the main cited invention search unit, all of them are the examination of the prediction target invention, and the disclosure of the main cited invention is disclosed. It is estimated that the probability of being cited as a document is high. Therefore, if the candidate documents RfV _{1 to} RfV _n are obtained by using each similar document as a citation candidate and the difference between the citation candidate vectors RfV _{1 to} RfV _n is calculated to obtain the gist movement vectors V3 _{1 to} V3 _n , the prediction target invention, Abstract movement vectors V3 _{1 to} V3 _n corresponding to differences from the invention disclosed in similar documents are generated.

予測結果ファイル生成部２２７は、予測結果ファイル生成部１２７と比較して、図２９に示したレイアウトを有する予測結果ファイルを生成してそれを予測結果記憶部２５６に記憶させる点と、本発明の実施の形態にかかる非適合率算出部としての動作を行い、予測対象出願に関する非適合率Ｖｒを算出する点とで相違している。 Compared with the prediction result file generation unit 127, the prediction result file generation unit 227 generates a prediction result file having the layout shown in FIG. 29 and stores the prediction result file in the prediction result storage unit 256. The difference is that the operation as the non-conformance rate calculating unit according to the embodiment is performed to calculate the non-conformance rate Vr related to the prediction target application.

非適合率Ｖｒは、予測対象出願についての進歩性の要件に適合しない可能性であって、予測対象出願について、進歩性違反の拒絶理由が見つかる可能性を示している。機械学習部２３３から出力される要件適否文書ベクトルＶ４_１〜Ｖ４_ｎは、進歩性の要件に適合しない可能性が極めて高い、高い、無いといった内容で生成されるので、予測対象発明について、各類似文献との関係でみた進歩性違反の拒絶理由が見つかる可能性を示している。そのため、これらを用いて予測結果ファイル生成部２２７が予測対象出願に関する非適合率Ｖｒを算出する。この場合、予測結果ファイル生成部２２７は、その非適合率Ｖｒを非適合率算出規則にしたがい算出する。非適合率算出規則とは、予測結果ファイル生成部２２７が非適合率Ｖｒを算出する規則であって、本実施の形態では、要件適否文書ベクトルＶ４_１〜Ｖ４_ｎの中に含まれるＳクラス、Ｈクラス、Ｍクラスそれぞれの件数に応じて、非適合率Ｖｒの数値が決定されるように、図示しない算出規則テーブルに設定されている。非適合率算出規則は例えば次のようにすることができる。 The non-conformance rate Vr indicates the possibility of not meeting the inventive step requirement for the prediction target application, and indicates the possibility of finding the reason for refusal of the inventive step violation for the prediction target application. The requirement adequacy document vectors V4 _{1 to} V4 _n output from the machine learning unit 233 are generated with contents that are highly likely to be incompatible with the requirement of inventive step, are high, and are not included. It shows the possibility of finding reasons for refusal of violation of inventive step in relation to the literature. Therefore, using these, the prediction result file generation unit 227 calculates the non-conformance rate Vr related to the prediction target application. In this case, the prediction result file generation unit 227 calculates the non-conformance rate Vr according to the non-conformance rate calculation rule. The non-conformance rate calculation rule is a rule by which the prediction result file generation unit 227 calculates the non-conformance rate Vr. In the present embodiment, the S class included in the requirement conformity document vectors V4 _{1 to} V4 _n , The calculation rule table (not shown) is set so that the numerical value of the non-conformance rate Vr is determined according to the number of cases of the H class and the M class. The non-conformance rate calculation rule can be set as follows, for example.

Ｓクラスが２件以上：Ｖｒ≧８５％
Ｓクラスが１件で、Ｈクラスの件数が５０％以上：Ｖｒ≧７５％
Ｓクラスが１件で、Ｈクラスの件数が５０％未満：Ｖｒ≧６５％
Ｓクラスが０件で、Ｈクラスの件数が５０％以上：Ｖｒ≧５０％
Ｓクラスが０件で、Ｈクラスの件数が５０％未満：Ｖｒ≧４０％
Ｓクラス、Ｈクラスがともに０件：Ｖｒ≧１５％ 2 or more S class: Vr ≧ 85%
There is one S class and the number of H class is 50% or more: Vr ≧ 75%
One S class and less than 50% H class: Vr ≧ 65%
S class is 0, H class is more than 50%: Vr ≧ 50%
S class is 0, H class is less than 50%: Vr ≧ 40%
0 for both S and H classes: Vr ≧ 15%

上記非適合率算出規則によれば、例えば、類似文献が５件（前述の整数ｎが"５"）の場合、Ｓクラスが２件あればＶｒ≧８５％である。また、Ｓクラスが１件で、Ｈクラスが３件ならＶｒ≧７５％になるが、Ｈクラスが２件だとＶｒ≧６５％、Ｓクラス、Ｈクラスがともに０件（全件がＬクラス）だとＶｒ≧１５％になる。 According to the non-conformance rate calculation rule, for example, when there are five similar documents (the above-mentioned integer n is “5”) and there are two S classes, Vr ≧ 85%. Also, if there is one S class and three H classes, Vr ≧ 75%, but if there are two H classes, Vr ≧ 65%, both S class and H class are zero (all cases are L class) ) Vr ≧ 15%.

そして、図２９に示すように、予測結果ファイル生成部２２７が生成する予測結果ファイルは、予測結果ファイル生成部１２７が生成する予測結果ファイルと比較して、非適合率Ｖｒが含まれている点で相違している。 29, the prediction result file generated by the prediction result file generation unit 227 includes a non-conformance rate Vr as compared to the prediction result file generated by the prediction result file generation unit 127. Is different.

また、予測結果編集処理部２０５は、予測結果編集処理部１０５と比較して、予測結果ファイルを読み込み、図３０に示すような特許要件適否予測リストＬ２を編集および出力する点で相違している。特許要件適否予測リストＬ２は、特許要件適否予測リストＬ１と比較して、ＯＡ率が追加されている点で相違している。ＯＡ率とは、予測対象出願について、審査過程で特許要件（新規性または進歩性）に違反する拒絶理由が見つかり、それを示す拒絶理由通知書が発行される可能性を示していて、前述した非適合率Ｖｒに相当する数値が示されている。 Further, the prediction result editing processing unit 205 is different from the prediction result editing processing unit 105 in that it reads a prediction result file, and edits and outputs a patent requirement suitability prediction list L2 as shown in FIG. . The patent requirement suitability prediction list L2 is different from the patent requirement suitability prediction list L1 in that an OA rate is added. The OA rate indicates the possibility that a reason for refusal that violates patent requirements (novelty or inventive step) will be found in the examination process and a notice of reason for refusal indicating this will be issued. A numerical value corresponding to the non-conformity rate Vr is shown.

以上のように、第２の実施の形態に係る特許要件適否予測サーバ２００では、特許要件適否予測処理部２０３の進歩性予測処理部２２６において、引用発明検索部２３１が最類似文献を含むｎ件の類似文献を主引用文献にセットしている。また、機械学習部２３３が各類似文献に応じた要旨移動ベクトルＶ３_１〜Ｖ３_ｎをＳクラス、Ｈクラス、Ｌクラスの３つに分類し、その分類結果に応じた要件適否文書ベクトルＶ４_１〜Ｖ４_ｎを出力する。 As described above, in the patent requirement suitability prediction server 200 according to the second embodiment, the cited invention search unit 231 includes the most similar documents in the inventive step predictive processing unit 226 of the patent requirement suitability prediction processing unit 203. Is set as the main cited document. In addition, the machine learning unit 233 classifies the abstract movement vectors V3 _{1 to} V3 _n corresponding to each similar document into three classes of S class, H class, and L class, and the requirement conformity document vector V4 ₁ to V class according to the classification result. V4 _n is output.

第１の実施の形態に係る特許要件適否予測サーバ１０では、特許要件適否に関する予測が審査実務に適合した内容で、しかも特許庁の審査実績を反映させる形で行われる。この点は、特許要件適否予測サーバ２００も同様である。 In the patent requirement conformity prediction server 10 according to the first embodiment, the prediction regarding patent requirement conformity is performed in a form that conforms to examination practice and reflects examination results of the Patent Office. This also applies to the patent requirement suitability prediction server 200.

しかし、特許要件適否予測サーバ１０では、主引用発明検索で、主引用文献として、最類似文献だけが抽出されるに過ぎなかった。最類似文献は、概念検索の結果、予測対象発明との類似度が最も高いとされた文献であるため、実際の審査の結果、主引用文献として引用される可能性が最も高いと考えられる。とはいえ、最類似文献が実際の審査で引用されるとは限らないし、最類似文献と、その次の類似度の文献（次類似文献）とで類似度の相違がごくわずかでしかなく、その次類似文献の方が最類似文献よりも主引用文献として適切な場合も十分に考えられる。そのため、主引用発明検索で最類似文献を含む複数の文献を抽出し、これらを対象として機械学習部２３３による文書ベクトルの分類を行えば、次類似文献をも考慮に入れた形で特許要件適否に関する予測が行われる。そのため、特許要件適否予測サーバ１０の予測の精度よりも、特許要件適否予測サーバ２００の予測の精度が向上する。 However, in the patent requirement suitability prediction server 10, only the most similar document is extracted as the main cited document in the main cited invention search. Since the most similar document is a document having the highest similarity to the prediction target invention as a result of the concept search, it is considered that the most similar document is most likely cited as the main cited document as a result of the actual examination. However, the most similar document is not always cited in the actual examination, and there is very little difference in similarity between the most similar document and the next similar document (next similar document) It is fully conceivable that the next similar document is more appropriate as the main cited document than the most similar document. Therefore, if a plurality of documents including the most similar documents are extracted in the main cited invention search, and the document vectors are classified by the machine learning unit 233 for these documents, whether or not the patent requirement is met in consideration of the next similar documents. Predictions are made. Therefore, the prediction accuracy of the patent requirement suitability prediction server 200 is improved more than the prediction accuracy of the patent requirement suitability prediction server 10.

また、実際の審査実務では、ある特許出願について、進歩性違反の拒絶理由が見つかるとき、進歩性違反の拒絶理由と新規性違反の拒絶理由とが同じ文献（この場合に引用される文献を新規性・進歩性拒絶引用文献ともいう）を引用して指摘される場合がある。このような場合、その特許出願にかかる請求項にかかる発明と、新規性・進歩性拒絶引用文献に開示されている発明とに相違がないと審査官によって判断されているから、要旨移動ベクトルＶ３_１〜Ｖ３_ｎの中にＳクラスへ分類される文書ベクトルが含まれているときは、進歩性違反の拒絶理由が見つかる可能性がそうでない場合に比べてより高くなっていると考えられる。したがって、新規性・進歩性拒絶出願からみた距離ｄｐと、進歩性拒絶出願からみた距離ｄｐとが区別できるように、ＳＨＬパターンによる機械学習を行って機械学習部２３３を構築しておくことで、進歩性違反の拒絶理由が見つかる可能性が極めて高い場合とそうでない場合とを区別した予測が可能になる。こうすることで、特許要件適否予測サーバ２００による予測精度の向上と、業務効率のより一層の改善が期待できる。 In actual examination practice, when a reason for refusal of inventive step violation is found for a patent application, the reason for refusal of inventive step violation and the reason for refusal of novelty violation is the same document (the document cited in this case is newly May also be pointed out by quoting refusal documents of sex / inventive step rejection). In such a case, since the examiner determines that there is no difference between the invention according to the claims of the patent application and the invention disclosed in the novelty / inventive step rejection citation, the summary movement vector V3 _When document vectors classified into the S class are included in _{1 to} V3 _n , it is considered that the possibility of finding the reason for refusal of the inventive step violation is higher than the case where it is not. Therefore, by constructing the machine learning unit 233 by performing machine learning using the SHL pattern so that the distance dp viewed from the novelty / inventive rejection application and the distance dp viewed from the inventive rejection application can be distinguished, It is possible to make a prediction that distinguishes between cases where the reason for refusal of an inventive step violation is very likely and cases where it is not. By doing so, it is possible to expect improvement in prediction accuracy by the patent requirement suitability prediction server 200 and further improvement in business efficiency.

（変形例１）
上記の進歩性予測処理部２２６では、一つの機械学習部２３３がｎ件の要旨移動ベクトルＶ３_１〜Ｖ３_ｎの分類を行っていたが、図３１に示した特許要件適否予測処理部２１３における進歩性予測処理部２２７のように、要旨移動ベクトルＶ３_１〜Ｖ３_ｎに応じた複数の機械学習部２３３_１〜２３３_ｎを有し、そのそれぞれが要旨移動ベクトルＶ３_１〜Ｖ３_ｎを分類するようにしてもよい。また、図示はしないが、入力ベクトル生成部１３２も、概念検索データＶｄ２_１〜Ｖｄ２_ｎの件数に応じて複数設けてもよい。これらのようにすると、各機械学習部２３３_１〜２３３_ｎまたは各入力ベクトル生成部１３２が並行に処理を実行するので、処理時間を短縮することができる。 (Modification 1)
In the inventive step prediction processing unit 226, one machine learning unit 233 classifies _n summary movement vectors V3 _{1 to} V3 _n . However, the progress in the patent requirement suitability prediction processing unit 213 shown in FIG. as sex prediction processing unit 227 includes a plurality of machine learning unit ₂₃₃ 1 _~233 _n corresponding to the subject matter movement vector _V3 1 to V3 _n, each of which is adapted to classify the subject matter movement vector _V3 1 to V3 _n May be. Further, although not shown, a plurality of input vector generation units 132 may be provided according to the number of concept search data Vd2 _{1 to} Vd2 _n . By doing so, each of the machine learning units 233 _{1 to} 233 _n or each of the input vector generation units 132 executes the processing in parallel, so that the processing time can be shortened.

（変形例２）
以上述べた各実施の形態では、特許要件適否予測サーバ１０，２００に特許要件適否予測プログラムがインストールされることによって、特許要件適否予測サーバ１０，２００が特許要件適否予測装置として機能する場合を例にとって説明している。その他、本発明は、ユーザ端末装置３０が特許要件適否予測装置として機能する場合についても適用がある。この場合、前述した特許要件適否予測プログラムについて少なくとも以下の変更点１）、２）にしたがった変更を行い、その変更後の特許要件適否予測プログラムを特許要件適否予測サーバ１０，２００からユーザ端末装置３０にダウンロードし、ユーザ端末装置３０にインストールすればよい。 (Modification 2)
In each of the embodiments described above, an example in which the patent requirement suitability prediction server 10 or 200 functions as a patent requirement suitability prediction device by installing the patent requirement suitability prediction program in the patent requirement suitability prediction server 10 or 200 will be described. To explain. In addition, the present invention is also applicable to the case where the user terminal device 30 functions as a patent requirement suitability prediction device. In this case, the above-described patent requirement suitability prediction program is changed according to at least the following changes 1) and 2), and the changed patent requirement suitability prediction program is transferred from the patent requirement suitability prediction servers 10 and 200 to the user terminal device. It may be downloaded to 30 and installed in the user terminal device 30.

変更点１）指定ナンバなどの入力操作を行うための画像データを特許要件適否予測サーバ１０，２００からユーザ端末装置３０に送信することなくユーザ端末装置３０に表示させる。
変更点２）特許要件適否予測リストをユーザ端末装置３０が出力する。 Modification 1) Image data for performing an input operation such as a designated number is displayed on the user terminal device 30 without being transmitted from the patent requirement suitability prediction servers 10 and 200 to the user terminal device 30.
Modification 2) The user terminal device 30 outputs a patent requirement suitability prediction list.

以上の説明は、本発明の実施の形態についての説明であって、この発明の装置及び方法を限定するものではなく、様々な変形例を容易に実施することができる。また、各実施形態における構成要素、機能、特徴あるいは方法ステップを適宜組合わせて構成される装置又は方法も本発明に含まれるものである。 The above description is the description of the embodiment of the present invention, and does not limit the apparatus and method of the present invention, and various modifications can be easily implemented. In addition, an apparatus or a method configured by appropriately combining components, functions, features, or method steps in each embodiment is also included in the present invention.

例えば、ユーザ端末装置は高機能携帯電話機や、タブレット型の端末装置ではなく、ノートパソコンや、ＰＤＡでもよい。なお、ＣＰＵ１１が実行する特許要件適否予測プログラムは、磁気記録媒体、ＣＤ−ＲＯＭ，ＤＶＤ等の各種記録媒体に記録することができるし、ネットワークを介して図示しないサーバからダウンロードすることもできる。 For example, the user terminal device may be a notebook computer or a PDA instead of a high-function mobile phone or a tablet-type terminal device. The patent requirement suitability prediction program executed by the CPU 11 can be recorded on various recording media such as a magnetic recording medium, a CD-ROM, and a DVD, and can also be downloaded from a server (not shown) via a network.

本発明を適用することにより、特許要件の適否に関する予測が審査実務に適合した内容で行われ、監視負担を有効に軽減することができる。本発明は、特許要件適否予測装置および特許要件適否予測プログラムの分野で利用することができる。 By applying the present invention, prediction regarding the suitability of patent requirements is made with contents that are suitable for examination practice, and the monitoring burden can be effectively reduced. The present invention can be used in the field of patent requirement suitability prediction apparatus and patent requirement suitability prediction program.

１…特許要件適否予測システム、１０，２００…特許要件適否予測サーバ、１１，３１…ＣＰＵ、３０…ユーザ端末装置、１０１…抽出部、１０２…要旨データ抽出部、１０３，２０３，２１３…特許要件適否予測処理部、１０５…予測結果編集処理部、１２５…新規性・拡大先願予測処理部、１２６，２２６，２２７…進歩性予測処理部、１３２…入力ベクトル生成部、１３２ａ…要旨ベクトル生成部、１３２ｂ…引用候補ベクトル生成部、１３２ｃ…移動ベクトル生成部、１３３，２３３…機械学習部、１５３…要旨データ記憶部、１５４…ＣＴデータ記憶部、１５６，２５６…予測結果記憶部、Ｌ１、Ｌ２…予測結果リスト。 DESCRIPTION OF SYMBOLS 1 ... Patent requirement suitability prediction system, 10,200 ... Patent requirement suitability prediction server, 11, 31 ... CPU, 30 ... User terminal device, 101 ... Extraction unit, 102 ... Abstract data extraction unit, 103, 203, 213 ... Patent requirement Appropriateness prediction processing unit, 105 ... prediction result editing processing unit, 125 ... novelty / expanded prior application prediction processing unit, 126, 226, 227 ... inventive step prediction processing unit, 132 ... input vector generation unit, 132a ... abstract vector generation unit , 132b ... citation candidate vector generation unit, 132c ... movement vector generation unit, 133, 233 ... machine learning unit, 153 ... gist data storage unit, 154 ... CT data storage unit, 156, 256 ... prediction result storage unit, L1, L2 … Predicted result list.

係り受け解析部１１６は、要部データ記憶部１６２に記憶されている要部データについて係り受け解析を行い、その結果をパターンデータ抽出部１１７に出力する。 Dependency analysis unit 116 performs receiving dependency analysis principal part data stored in the main part data storage unit 162, and outputs the result to the pattern data extraction unit 117.

パターンデータ抽出部１１７は、係り受け解析部１１６の解析結果を入力して、ひらがなの「を」の直前の名詞と、それに対応した動詞の組み合わせとなる文字列と、発明の名称のうち、先頭に記載されているもの（筆頭名称）を出力する。例えば、特開２００８−６２２８２号公報の「発明が解決しようとする課題」の欄の「本発明」の文字列を含む一文の中に、ひらがなの「を」の直前の名詞と、それに対応した動詞の組み合わせとして、「調整」および「行わず」と、「同心性」および「得る」と、「精密打ち抜き型」および「提供」がある。これらがパターンデータ抽出部１１７から出力される。本実施の形態において、パターンデータ抽出部１１７から出力されるデータのうち、「発明が解決しようとする課題」の欄から抽出されたデータが課題データに相当していて、例えば図８のデータ種別"Ｐ"のレコードのようなデータとすることができる。 The pattern data extraction unit 117 inputs the analysis result of the dependency analysis unit 116, and includes a noun immediately preceding “Hi” in hiragana, a character string that is a combination of the corresponding verb, and the beginning of the names of the inventions. (First name) is output. For example, in a sentence including the character string “present invention” in the column “Problem to be Solved by” of Japanese Patent Application Laid-Open No. 2008-62282, the noun immediately before “O” in hiragana and the corresponding Verb combinations include “adjustment” and “do not do”, “concentricity” and “obtain”, “precision punching” and “provide”. These are output from the pattern data extraction unit 117. In the present embodiment, among the data output from the pattern data extraction unit 117, the data extracted from the “problem to be solved by the invention” column corresponds to the problem data. For example, the data type of FIG. Data such as a record of “P” can be used.

次に、ＣＰＵ１１は、処理をステップ４に進めて予測終了条件が成立しているか否かを判定する。ここで、ＣＰＵ１１は予測終了条件が成立しているときはステップ６に進むが、そうでないときはステップ５に進む。ＣＰＵ１１はステップ５に進むと、後述する特許要件適否予測ルーチンを実行するが、ステップ６に進むと、終了処理を実行し、そのユーザに対する特許要件適否予測処理を終了する。このようにすることで、予測終了条件が成立するまで、特許要件適否予測処理が自動的かつ継続的に実行される。
Next, the CPU 11 proceeds to step 4 to determine whether or not a prediction end condition is satisfied. Here, the CPU 11 proceeds to step 6 when the prediction end condition is satisfied, but proceeds to step 5 otherwise. When proceeding to step 5, the CPU 11 executes a patent requirement suitability prediction routine which will be described later. When proceeding to step 6, the CPU 11 executes a termination process and terminates the patent requirement conformity prediction process for the user. By doing so , the patent requirement suitability prediction process is automatically and continuously executed until the prediction end condition is satisfied.

Claims

Gazette data extracting means for extracting prediction target data corresponding to a designated number from a published gazette data storage unit in which electronic data of a published patent gazette is stored as published gazette data;
Term data that indicates terms that can specify the gist of the invention to be predicted disclosed in the application to be predicted, which is specified by the data to be predicted extracted by the publication data extraction means, and is at least Summary data extraction means for extracting from the prediction target data, data including feature part data extracted from the feature part of each claim and problem data extracted from the column of the problem to be solved by the invention;
Summary data storage means for storing the summary data extracted by the summary data extraction means;
The publication data is searched using the summary data stored in the summary data storage means, and novelty prediction data indicating whether the novelty requirement of the prediction target invention is appropriate is generated according to the search result. Novelty prediction processing unit, inventive step prediction processing unit for generating inventive step prediction data indicating whether or not the inventive step requirements are appropriate, and the prediction target invention using the novelty prediction data and the inventive step prediction data A patent result suitability prediction processing means having a prediction result file generation unit for generating a prediction result file indicating whether or not the patent requirement is suitable,
The inventive step prediction processing unit includes a main citation invention search unit that searches for a main citation invention closest to the prediction target invention among prior art inventions specified by the publication gazette data, and a document classification that classifies document vectors And
The main cited invention search unit targets the publication gazette data with the feature portion data and the problem data of each claim among the summary data stored in the summary data storage means as main search document data. Performing a concept search, and as a result of the concept search, the most similar document with the highest similarity is set as the main cited document in which the main cited invention is disclosed,
The document classification unit classifies an input summary movement vector as to whether or not it meets the requirement of inventive step by machine learning using a plurality of training data including a learning document vector and a teacher vector. It is constructed so as to output a requirement suitability document vector according to the classification result, and the summary movement vector includes a summary vector according to each claim of the prediction target application, and a citation candidate vector according to the most similar document. Patent requirement conformity prediction device which is a vector corresponding to the difference between the two.

Gazette data extracting means for extracting prediction target data corresponding to a designated number from a published gazette data storage unit in which electronic data of a published patent gazette is stored as published gazette data;
Term data that indicates terms that can specify the gist of the invention to be predicted disclosed in the application to be predicted, which is specified by the data to be predicted extracted by the publication data extraction means, and is at least Summary data extraction means for extracting from the prediction target data, data including feature part data extracted from the feature part of each claim and problem data extracted from the column of the problem to be solved by the invention;
Summary data storage means for storing the summary data extracted by the summary data extraction means;
The publication data is searched using the summary data stored in the summary data storage means, and novelty prediction data indicating whether the novelty requirement of the prediction target invention is appropriate is generated according to the search result. Novelty prediction processing unit, inventive step prediction processing unit for generating inventive step prediction data indicating whether or not the inventive step requirements are appropriate, and the prediction target invention using the novelty prediction data and the inventive step prediction data A patent result suitability prediction processing means having a prediction result file generation unit for generating a prediction result file indicating whether or not the patent requirement is suitable,
The inventive step prediction processing unit includes a main citation invention search unit that searches for a main citation invention closest to the prediction target invention among prior art inventions specified by the publication gazette data, and a document classification that classifies document vectors And
The main cited invention search unit targets the publication gazette data with the feature portion data and the problem data of each claim among the summary data stored in the summary data storage means as main search document data. A concept search is performed, and as a result of the concept search, a plurality of documents including the most similar document having the highest similarity in descending order of similarity are extracted as similar documents, and the main cited invention is disclosed for each similar document. As the main cited reference
The document classification unit is a class that has a high possibility that the input summary movement vector does not conform to the requirement of inventive step by machine learning using a plurality of training data including a learning document vector and a teacher vector, a high class, It is constructed so as to be classified into one of the classes that meet the requirement of the inventive step and to output a requirement conformity document vector according to the classification result, and the gist movement vector corresponds to each claim of the prediction target application A plurality of vectors according to the difference between the abstract vector and each citation candidate vector corresponding to each similar document,
The patent requirement conformity prediction processing means calculates a non-conformance rate indicating a possibility of not conforming to the inventive step requirement for the prediction target application according to a plurality of the requirement conformity document vectors output from the document classification unit. A patent requirement conformity prediction device further comprising a non-conformance rate calculation unit for calculating.

The document classification unit includes a combination of the learning document vector as the training data, the learning document vector being a first learning document vector, the teacher vector being a vector indicating that there is a reason for rejection of novelty and inventive step, and the learning document vector being a first learning document vector. A combination of the learning vector of 2 with a vector indicating that the teacher vector has a reason for refusal of inventive step and no reason for refusal of novelty, and the learning document vector is a third learning document vector and the teacher vector has an inventive step. A combination with a vector indicating no reason for refusal is used,
The first learning document vector is an application in which a notice of reasons for refusal has been issued among published applications that have already been published, and the novel document has no novelty or inventive step by citing the same document in the notice of reasons for refusal The document vector corresponding to the claim for which the reason for refusal was pointed out in the novelty / inventive step refusal application in which the reason for refusal was pointed out, and the first main cited publication which is the above-mentioned document cited for the reason for refusal A first moving document vector corresponding to a difference from the first cited document vector corresponding to the object,
The second learning document vector is an application for which a notice of reason for refusal is issued in the published application, and the reason for refusal of novelty is not pointed out in the notice of reason for refusal, but there is a reason for refusal of inventive step. The document vector corresponding to the claim in which the reason for refusal of the inventive step rejection application pointed out was pointed out, and the second corresponding to the second main cited publication cited as the main publication for the reason for refusal A second moving document vector corresponding to the difference from the cited document vector of
The third learning document vector is a non-rejection application for which a patent decision has been issued without a notice of reason for refusal being issued or an application for which a notice of reason for refusal has been issued in the published application, the notice of reason for refusal And the document vector according to claim 1 of the non-progressive refusal application for which the reason for refusal of inventive step was not pointed out, and the result of the concept search for the non-refusal application or the non-progressive refusal application, The patent requirement conformity prediction apparatus according to claim 2, wherein the third requirement is a third moving document vector obtained from a difference from a non-cited document vector corresponding to the most similar document for learning that is said to be high.

A patent requirement conformity prediction program for causing a computer to function as a patent requirement conformity prediction device, wherein the computer designates a specified number from a published gazette data storage unit in which electronic data of a published patent gazette is stored as published gazette data Gazette data extraction means for extracting prediction target data according to
Term data that indicates terms that can specify the gist of the invention to be predicted disclosed in the application to be predicted, which is specified by the data to be predicted extracted by the publication data extraction means, and is at least Summary data extraction means for extracting from the prediction target data, data including feature part data extracted from the feature part of each claim and problem data extracted from the column of the problem to be solved by the invention;
Summary data storage control means for storing the summary data extracted by the summary data extraction means;
Novelty prediction data which searches the publication gazette data using the summary data stored by the control of the summary data storage control means, and indicates whether the novelty requirement of the prediction target invention is appropriate according to the search result A novelty prediction processing unit that generates the inventive step, an inventive step prediction processing unit that generates the inventive step prediction data indicating whether or not the inventive step meets the requirements of the inventive step, and the novelty prediction data and the inventive step prediction data Function as a patent requirement suitability prediction processing means having a prediction result file generation unit for generating a prediction result file indicating the patent requirement suitability of the invention to be predicted;
Among the prior art inventions in which the inventive step prediction processing unit is specified by the publication data, a main citation invention search unit that searches for a main citation invention that is closest to the prediction target invention, and a document classification unit that classifies document vectors And
The main cited invention search unit targets the publication gazette data with the feature portion data and the problem data of each claim among the summary data stored under the control of the summary data storage control means as main search document data The most similar document with the highest similarity as a result of the concept search is the main cited document in which the main cited invention is disclosed,
The document classification unit classifies the input summary movement vector as one that meets the requirement of inventive step by machine learning using a plurality of training data including a learning document vector and a teacher vector. It is constructed so as to output a requirement suitability document vector according to the classification result, and the summary movement vector includes a summary vector according to each claim of the prediction target application, and a citation candidate vector according to the most similar document. Patent requirement propriety prediction program which is a vector corresponding to the difference between the two.