JP5347909B2

JP5347909B2 - Example device, program, and method of word or combination of words in patent document

Info

Publication number: JP5347909B2
Application number: JP2009251505A
Authority: JP
Inventors: 一成田中
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2009-10-30
Filing date: 2009-10-30
Publication date: 2013-11-20
Anticipated expiration: 2029-10-30
Also published as: JP2011096149A

Abstract

<P>PROBLEM TO BE SOLVED: To present a concrete example assumed by a word or the combination of words used in a claim even if the concrete example assumed by a word or the combination of words used in the claim does not exist in a patent document, concerning a technology for supporting the understanding of a word or the combination of words in the patent document. <P>SOLUTION: An illustrating device includes: a patent document retrieving part 102 for retrieving text data of a description designated by a patent database 101; a rejection reason announcement retrieving part 104 for retrieving text data of a document indicating the reason of a rejection indicated with respect to the description; and a language pair extracting part 106 for extracting from the text data of the document indicating the reason of the rejection, a word or the combination character string of words described in the claim of the description to be an object indicating the concrete example of the word or combination of words and also a word or the combination character string of words indicated in contrast with the word or the combination of words. Extracted data of the pairs of the words or the combinations of words are displayed in a display part 109 by contrast. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

特許文書たとえば請求項中の言葉の理解を支援するための技術に関する。 The present invention relates to a technology for supporting understanding of words in patent documents, for example, claims.

特許文書たとえば請求項は特許の専門家以外でも、発明者が他社特許の侵害を回避するなどの目的で読む必要がある。
しかし、特許文書中の特許請求の範囲に記載される請求項などは一般に、単語又は単語の組合せで記述されることにより或る特定の意味を持つ言葉が抽象的な言葉で書かれることが多く、特許文書に不慣れな発明者が、請求項に書かれている発明を具体的に理解するのは難しい。 Patent documents, such as claims, must be read by non-patent specialists for the purpose of inventors to avoid infringement of other companies' patents.
However, the claims described in the claims of patent documents are generally written in words or combinations of words, so that words having a specific meaning are often written as abstract words. It is difficult for an inventor who is unfamiliar with patent documents to specifically understand the invention described in the claims.

従って、請求項で使っている言葉が想定している具体例を表示することで内容のイメージを容易にし、請求項を理解しやすいようにする支援技術が要請されている。
例えば、図１Ａに示される特許文書中の請求項１の例では、下線部として示されるように、「コンテンツ情報」「個人の意見の開示単位」「意見の対象」「個人の評価」という言葉が使われている。この場合に、これら言葉の具体例を示す言葉を提示することで、請求項が表す内容をイメージすることができ可読性を高めることができる。 Therefore, there is a demand for assistive technology that makes it easy to understand the claims by displaying specific examples assumed by the words used in the claims, thereby facilitating the understanding of the claims.
For example, in the example of claim 1 in the patent document shown in FIG. 1A, the words “content information”, “disclosure unit of individual opinion”, “object of opinion”, “individual evaluation” as indicated by the underlined portion Is used. In this case, by presenting words indicating specific examples of these words, the contents represented by the claims can be imaged and readability can be improved.

請求項の言葉の理解を支援するための第１の従来技術として、シソーラスや同義語辞書などからより具体的な言葉や関連語を表示する技術が知られている。
また、請求項の言葉の理解を支援するための第２の従来技術として、請求項と実施例又は発明を実施するための形態とをリンク付け等で対応付けることによって、請求項の解釈を支援する技術が知られている。（たとえば、特許文献１）
さらに、本文構造解析の結果から参照文献の参考理由を判断し、参照理由を含む参照関係の情報を出力するものもある。（たとえば、特許文献２） As a first conventional technique for supporting the understanding of words in claims, a technique for displaying more specific words and related words from a thesaurus or a synonym dictionary is known.
In addition, as a second conventional technique for supporting the understanding of the words of the claims, the claims are supported by associating the claims with the embodiments or embodiments for carrying out the invention by linking or the like. Technology is known. (For example, Patent Document 1)
In addition, there is also a method for determining a reference reason for a reference document from a result of text structure analysis, and outputting reference relationship information including the reference reason. (For example, Patent Document 2)

特開２００２−１４９７０４号公報JP 2002-149704 A 特開２０００−３５７１７０号公報JP 2000-357170 A

しかし、前述した第１の従来技術では、請求項に書かれている言葉は、抽象的で、分野などにより解釈が異なることもある。このため、一般的なシソーラスの下位語では、発明者が意図した例になっているとは限らないという問題点を有していた。 However, in the first prior art described above, the words written in the claims are abstract and may be interpreted differently depending on the field. For this reason, a general thesaurus subordinate has a problem that it is not always an example intended by the inventor.

また、前述した第２の従来技術では、１件の特許の中に明示的な例が書かれていない場合には、請求項の解釈を効果的に支援することはできないという問題点を有していた。例えば、図１Ｂに示される例では、請求項１において、「個人の評価」という言葉が使われている。しかし、発明の詳細な説明の部分である［発明の開示］を見ても、「個人の評価（例えば良い評価又は悪い評価）」という程度の記述しかない。このため、このような例では、具体的なイメージを掴みづらい。 Further, the second prior art described above has a problem that it is not possible to effectively support the interpretation of a claim unless an explicit example is written in one patent. It was. For example, in the example shown in FIG. 1B, the term “individual evaluation” is used in claim 1. However, looking at [Disclosure of the Invention] which is a part of the detailed description of the invention, there is only a description of “individual evaluation (for example, good evaluation or bad evaluation)”. For this reason, it is difficult to grasp a specific image in such an example.

更に、たとえ発明の詳細な説明の中に請求項中の言葉の具体例が書いてあったとしても、請求項の各項に対応する実施例又は発明を実施するための形態までは特定できるが、具体例の書き方は様々である。このため、前述した第２の従来技術では、機械的に言葉と言葉の関係などの細かい対応付けまでを行うのは難しいという問題点を有していた。 Further, even if specific examples of words in the claims are written in the detailed description of the invention, it is possible to identify the embodiments corresponding to the respective claims or the modes for carrying out the invention. There are various ways of writing specific examples. For this reason, the second prior art described above has a problem that it is difficult to perform a fine association such as a relationship between words mechanically.

そこで、本発明の課題では、特許文書中に請求項で使っている言葉が想定している具体例がない場合であっても、請求項で使っている言葉が想定している具体例を提示可能とすることである。 Therefore, in the subject of the present invention, even if there is no specific example that the word used in the claim assumes in the patent document, a specific example that the word used in the claim assumes is presented. It is possible.

本発明の一つの態様の特許文書中の言葉(単語または単語の組み合わせ)の例示装置によれば、特許文書中の言葉の理解を支援するための装置において、特許文書に含まれる言葉の具体例を示す対象となる明細書を特定する情報を受け付ける受け付け部と、明細書に対して示された拒絶の理由を示す文書を記録した拒絶理由データベースから、言葉の具体例を示す対象となる明細書に対して示された、拒絶の理由を示す文書のテキストデータを、明細書を特定する情報に基づいて検索する拒絶理由を示す文書の検索部と、検索された拒絶の理由を示す文書のテキストデータから、あらかじめ記憶された対表現パターンと同じ対表現パターンの文字列を検出し、対表現パターンの文字列に基づいて、特許文書に記載された文字列とその文字列に対比して示される文字列とをそれぞれ含む対となる文字列を検出し、その対となる文字列から不要文字列を削除することにより、言葉の具体例を示す対象となる明細書のテキストデータに記載された言葉の文字列と、その言葉に対比して示される具体例の文字列の対を抽出する抽出部と、抽出された対となる言葉の文字列の組を出力する出力部とを具備する。 According to an example device for words (words or combinations of words) in a patent document according to one aspect of the present invention, a specific example of words included in a patent document in an apparatus for supporting understanding of words in a patent document A specification that is a target of a specific example of a word from a reception unit that receives information specifying a specification that is a target to indicate, and a rejection reason database that records a document that indicates the reason for rejection indicated for the specification The document search section showing the reason for refusal that searches the text data of the document showing the reason for refusal based on the information specifying the specification, and the text of the document showing the reason for refusal searched. A character string having the same pair expression pattern as the previously stored pair expression pattern is detected from the data, and based on the character string of the pair expression pattern, the character string described in the patent document and the character string are matched. By detecting a pair of character strings each including the character string shown in the above, and deleting unnecessary character strings from the paired character strings, An extraction unit that extracts a character string of a written word, a pair of character strings of a specific example shown in contrast to the word, and an output unit that outputs a set of character strings of the extracted paired words It has.

対象特許の明細書が明確に例を示していない場合も、その特許の内容を深く理解していると思われる審査官の解釈を利用することで、特許文書たとえば請求項中の言葉が具体的に何を表しているかを知ることが可能となる。この結果、特許文書たとえば請求項の記述方法に馴染みの無い利用者であっても特許文書たとえば請求項を理解することを支援することができる。 Even if the specification of the subject patent does not clearly show an example, the words in the patent document, for example, the claims are made concrete by using the interpretation of the examiner who seems to have a deep understanding of the content of the patent. It is possible to know what is represented in As a result, even a user who is unfamiliar with the description method of a patent document such as a claim can assist in understanding the patent document such as a claim.

請求項の記述例を示す図である。It is a figure which shows the example of a description of a claim. 従来技術の問題点の説明図である。It is explanatory drawing of the problem of a prior art. 第１の実施形態の構成図である。It is a block diagram of 1st Embodiment. 第１の実施形態の説明図である。It is explanatory drawing of 1st Embodiment. 拒絶理由通知のパターンを示す図である。It is a figure which shows the pattern of notification of reasons for refusal. 第１の実施形態のフローチャートである。It is a flowchart of a 1st embodiment. 図４のステップＳ４０３の説明図である。It is explanatory drawing of step S403 of FIG. 図４のステップＳ４０４（言葉の対を抽出する処理）の詳細フローチャートである。5 is a detailed flowchart of step S404 (processing for extracting word pairs) in FIG. 表現パターン辞書の例を示す図である。It is a figure which shows the example of an expression pattern dictionary. 図３（ａ）の例１に対応する第１の実施形態の動作説明図である。It is operation | movement explanatory drawing of 1st Embodiment corresponding to Example 1 of Fig.3 (a). 図３（ｂ）の例２に対応する第１の実施形態の動作説明図である。It is operation | movement explanatory drawing of 1st Embodiment corresponding to the example 2 of FIG.3 (b). 図３（ｃ）の例３に対応する第１の実施形態の動作説明図である。It is operation | movement explanatory drawing of 1st Embodiment corresponding to Example 3 of FIG.3 (c). 図３（ｄ）の例４（例外ルール）に対応する第１の実施形態の動作説明図である。It is operation | movement explanatory drawing of 1st Embodiment corresponding to Example 4 (exception rule) of FIG.3 (d). 表示部の表示例を示す図である。It is a figure which shows the example of a display of a display part. 特許データベース１０１の例を示す図である。It is a figure which shows the example of the patent database 101. FIG. 拒絶理由データベース１０５の例を示す図である。It is a figure which shows the example of the reason database 105 for refusal. パターンマッチング処理の具体的な動作を示す詳細フローチャートである。It is a detailed flowchart which shows the specific operation | movement of a pattern matching process. 係り受け解析による不要文字削除処理を示すフローチャートである。It is a flowchart which shows the unnecessary character deletion process by dependency analysis. 係り受け解析処理の説明図である。It is explanatory drawing of a dependency analysis process. 第２の実施形態の構成図である。It is a block diagram of 2nd Embodiment. 第２の実施形態のフローチャートである。It is a flowchart of a 2nd embodiment. 図１５のステップＳ１５０１（どちらが具体例かを判定する処理）の詳細フローチャート（その１）である。FIG. 16 is a detailed flowchart (part 1) of step S1501 in FIG. 15 (processing for determining which is a specific example); 図１６のステップＳ１６０１（方法１の処理）の詳細フローチャートである。FIG. 17 is a detailed flowchart of step S1601 (process of method 1) in FIG. 方法１における言葉対テーブルの内容例を示す図である。6 is a diagram illustrating an example of contents of a word pair table in Method 1. FIG. 図１６のステップＳ１６０３（方法２の処理）の詳細フローチャートである。FIG. 17 is a detailed flowchart of step S1603 (process of method 2) in FIG. 方法２における言葉対テーブルの内容例を示す図である。10 is a diagram illustrating an example of contents of a word pair table in Method 2. FIG. 図１５のステップＳ１５０１（どちらが具体例かを判定する処理）の詳細フローチャート（その２）である。FIG. 16 is a detailed flowchart (part 2) of step S1501 in FIG. 15 (processing for determining which is a specific example); 図２１のステップＳ２１０３の処理（請求項と実施例等の対応関係を解析する処理）の詳細フローチャートである。FIG. 22 is a detailed flowchart of the processing in step S2103 in FIG. 21 (processing for analyzing the correspondence between claims and examples). 言葉対データベースの例を示す図である。It is a figure which shows the example of a word pair database. 第３の実施形態の構成図（その１）である。It is a block diagram (the 1) of 3rd Embodiment. 第３の実施形態の構成図（その２）である。It is a block diagram (the 2) of 3rd Embodiment. 第３の実施形態のフローチャート（その１）である。It is a flowchart (the 1) of a 3rd embodiment. 第３の実施形態のフローチャート（その２）である。It is a flowchart (the 2) of a 3rd embodiment. 実施形態のシステムを実現するハードウェアの構成図である。It is a hardware block diagram which implement | achieves the system of embodiment.

以下、本発明を実施するための形態について図面を参照しながら詳細に説明する。
図２Ａは、特許文書中の言葉(単語または単語の組み合わせ)の例示装置の第１の実施形態の構成図である。第１の実施形態のシステムは、特許データベース１０１、特許文書検索部１０２、出願番号抽出部１０３、拒絶理由通知検索部１０４、拒絶理由データベース１０５、言葉対抽出部１０６、表現パターン辞書１０７、具体例付加対象特定部１０８、及び表示部１０９を備える。 Hereinafter, embodiments for carrying out the present invention will be described in detail with reference to the drawings.
FIG. 2A is a block diagram of a first embodiment of an example device for words (words or combinations of words) in a patent document. The system according to the first embodiment includes a patent database 101, a patent document search unit 102, an application number extraction unit 103, a rejection reason notification search unit 104, a rejection reason database 105, a word pair extraction unit 106, an expression pattern dictionary 107, and a specific example. An addition target specifying unit 108 and a display unit 109 are provided.

図２Ｂは、第１の実施形態の基本動作説明図である。
第１の実施形態では、特許データベース１０１（図１）から検索される特許文書に対して、審査官が拒絶理由通知等の拒絶の理由を示す文書で書いている言葉の対応関係を利用して、他の特許で書かれている具体例が抽出される。例えば、特許文書に対応する図２Ｂに示される公報データ２０１に対する拒絶理由２０２を説明する。公報データ２０１に対応する拒絶理由２０２は、公報データ２０１中の「出願番号」項目に記載されている出願番号によって拒絶理由データベース１０５（図２Ａ）を検索することで、得ることができる。拒絶理由２０２に書かれている『引用文献２に記載の「中傷や風説」が請求項１に係る発明の「個人の評価」に相当し、』といった表現を利用して、「個人の評価」の具体例は、「中傷や風説」であるという情報が得られる。この結果、図２Ｂの２０３として示されるような表示がなされる。 FIG. 2B is an explanatory diagram of a basic operation of the first embodiment.
In the first embodiment, a correspondence relationship between words written in a document indicating a reason for refusal, such as a notice of reasons for refusal, is used for a patent document searched from the patent database 101 (FIG. 1). Specific examples written in other patents are extracted. For example, the reason for refusal 202 for the publication data 201 shown in FIG. 2B corresponding to the patent document will be described. The reason for rejection 202 corresponding to the publication data 201 can be obtained by searching the rejection reason database 105 (FIG. 2A) by the application number described in the “application number” item in the publication data 201. “Individual evaluation” is described in the reason for refusal 202 by using the expression “The slander and the rumor described in the cited reference 2 corresponds to“ individual evaluation ”of the invention of claim 1”. As for a specific example of the above, it is possible to obtain the information that it is “defamatory or rude”. As a result, a display as shown by 203 in FIG. 2B is performed.

ここで、「言葉」とは、単語または単語の組合せを指し、この単語または単語の組み合わせは複合語、係り受け語、複数文節の組合せ等を含む。
ここで、拒絶理由通知等の拒絶の理由を示す文書は審査官が書いているため公報に比べれば形が整っており、比較的扱いやすいが、それでも様々なバリエーションがあり、単純なパターンでは対応する言葉の対を抽出することができない。拒絶理由通知等の記述パターンとしては、例えば、図３（ａ）〜（ｄ）の例１、例２、例３、例４に示されるような、４つの書くパターン（書き方）がある。 Here, “word” refers to a word or a combination of words, and this word or combination of words includes a compound word, a dependency word, a combination of multiple phrases, and the like.
Here, the document indicating the reason for refusal, such as a notice of reasons for refusal, is written by the examiner, so it has a better shape than the gazette and is relatively easy to handle. I can't extract word pairs. There are four writing patterns (writing methods) as shown in Examples 1, 2, 3, and 4 in FIGS.

図３（ａ）の例１は、引用文献の言葉「ネットワークトポロジ」に括弧書きで対応することを示す「相当」の前に本願の言葉「通信状態情報」を書くパターンである。
図３（ｂ）の例２は、対応することを示す「相当」の前に引用文献の言葉「掲示板」、本願の言葉「コンテンツ」の順に対応する言葉を書くパターンである。 Example 1 in FIG. 3A is a pattern in which the word “communication state information” of the present application is written before “equivalent” indicating that the word “network topology” in the cited document corresponds to the word “network topology” in parentheses.
Example 2 in FIG. 3B is a pattern in which the words corresponding to the quoted word “bulletin board” and the word “content” of the present application are written in the order of “equivalent” indicating corresponding.

図３（ｃ）の例３は、引用文献の言葉「クライアント計算機、Ａ書店、コンビニエンスチェーン」をまとめて書く。そして、それに対応する本願の言葉「第１の情報処理装置、第３の情報処理装置、第２の情報処理装置」を対応する順に書き、対応することを示す「相当」を本願の言葉の後に書くパターンである。 In Example 3 of FIG. 3C, the words “client computer, A bookstore, convenience chain” of the cited document are collectively written. Then, the corresponding words “first information processing device, third information processing device, second information processing device” of the present application are written in the corresponding order, and “corresponding” indicating the correspondence is indicated after the words of the present application. It is a writing pattern.

図３（ｄ）の例４は、本願の言葉「メタレベルタスク」が先に書かれ、対応することをと示す「相当する」の後に、引用文献の言葉「移動の指示」を書くパターンである。
上記例１から例４において、本願の言葉、引用文献の言葉、相当はそれぞれワークメモリ内に文字列データ（以下単に「文字列」と記述する）として格納され、文字列の各格納位置として書くパターンすなわち記述パターンが決定される。なお、例１から例３は、いずれも引用文献の言葉が本願の言葉より先に現れるが、例４では本願の言葉が引用文献の言葉より先に出現する書き方の例外パターンである。すなわち、その書くパターンのテキストデータがワークメモリに展開されて、逐次走査される場合、例１ないし例３では文字列位置からみて引用文献に続いて本願の順にスキャンされ、例４では本願に続いて引用文献の順にスキャンされる。 Example 4 in FIG. 3D is a pattern in which the word “meta-level task” of the present application is written first and “corresponding” indicating that it corresponds, followed by the word “movement instruction” of the cited document. .
In Examples 1 to 4 above, the words of the present application, the words of cited references, and the like are stored as character string data (hereinafter simply referred to as “character strings”) in the work memory, and are written as the storage positions of the character strings. A pattern or descriptive pattern is determined. In all of Examples 1 to 3, the words of the cited document appear before the words of the present application. In Example 4, however, there are exceptional patterns of writing in which the words of the present application appear before the words of the cited document. That is, when text data of the pattern to be written is developed in the work memory and sequentially scanned, in Examples 1 to 3, scanning is performed in the order of the present application following the cited document as viewed from the character string position, and in Example 4, the present application is continued. Are scanned in the order of cited references.

図４は、図２Ａに示される構成を有する特許文書中の単語または単語の組み合わせの例示装置の第１の実施形態の制御動作を示す動作フローチャートである。この動作フローチャートにおける各処理は、例えばＣＰＵ（中央演算処理装置）とメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 FIG. 4 is an operation flowchart showing the control operation of the first embodiment of the example device for the word or word combination in the patent document having the configuration shown in FIG. 2A. Each process in this operation flowchart is executed by a computer system equipped with, for example, a CPU (Central Processing Unit), a memory, and an external storage device.

まず、特許データベース１０１（図２Ａ）が検索されることにより、利用者によって指定された特許文書（以下これを「表示特許」と呼ぶ）のファイルが検索される（ステップＳ４０１）。この処理は、図２Ａの特許文書検索部１０２の機能を実現する。特許データベース１０１は、記憶装置上に記憶されている。図１２Ａは、特許データベース１０１のデータ構成例を示す図である。図１２Ａに示すように、特許データベース１０１に登録される各特許文書のレコードデータは、出願番号、公開番号に続いて、要約インデックス、請求項インデックス、実施例インデックス、発明の名称データ、及び特許文書の本文の各フィールドから構成される。なお、実施例のインデックスは、発明を実施するための形態のインデックスであってもよい。利用者は、出願番号、公開番号、要約中の言葉、請求項中の言葉、実施例中の言葉、発明の名称に含まれる言葉等のキーワードデータによって、特許文書のファイルのメモリワーク領域に展開した特許データベースを検索することができる。出願番号が検索指定された場合には、特許データベース１０１中の出願番号フィールドに検索指定された出願番号の文字列と一致するレコードがあるか否かが検索される。公開番号が検索指定された場合には、特許データベース１０１中の公開番号フィールドに検索指定された公開番号の文字列と一致するレコードがあるか否かが検索される。要約、請求項、実施例、又は発明の名称に含まれる言葉が検索指定された場合には、特許データベース１０１中の要約、請求項、実施例のインデックスフィールド、又は発明の名称フィールド内に検索指定された言葉の文字列が登録されているか否かが検索される。この結果、検索されたレコードの本文フィールドに登録されているテキストデータが抽出される。 First, by searching the patent database 101 (FIG. 2A), a file of a patent document (hereinafter referred to as “displayed patent”) designated by the user is searched (step S401). This process realizes the function of the patent document search unit 102 in FIG. 2A. The patent database 101 is stored on a storage device. FIG. 12A is a diagram illustrating a data configuration example of the patent database 101. As shown in FIG. 12A, the record data of each patent document registered in the patent database 101 includes an application number, a publication number, a summary index, a claim index, an embodiment index, an invention name data, and a patent document. It consists of each field of the body. The index of the embodiment may be an index for implementing the invention. The user expands the patent document file to the memory work area by using keyword data such as application number, publication number, abstract word, claim word, example word, word included in the name of the invention, etc. Patent databases can be searched. When the application number is specified by search, it is searched whether there is a record that matches the character string of the application number specified for search in the application number field in the patent database 101. When the public number is specified for search, it is searched whether there is a record that matches the character string of the public number specified for search in the public number field in the patent database 101. When a word included in the abstract, claim, embodiment, or invention name is specified by search, the search specification is made in the abstract, claim, embodiment index field, or invention name field in the patent database 101. It is searched whether or not the character string of the entered word is registered. As a result, the text data registered in the body field of the retrieved record is extracted.

次に、拒絶理由通知等を検索するために、ステップＳ４０１で検索された特許文書から出願番号を示す文字列が抽出される（ステップＳ４０２）。具体的には、ステップＳ４０１にて検出が行われたレコードの出願番号フィールドに登録されている文字列が抽出される。この処理は、図１の出願番号抽出部１０３の機能を実現する。 Next, in order to search for notice of reasons for refusal, etc., a character string indicating the application number is extracted from the patent document searched in step S401 (step S402). Specifically, the character string registered in the application number field of the record detected in step S401 is extracted. This process realizes the function of the application number extraction unit 103 in FIG.

次に、ステップＳ４０１で検索された特許文書である表示特許に関して、拒絶理由データベース１０５（図１）が検索される。拒絶理由データベース１０５は、例えば後述する図２７の外部記憶装置３００５等の記憶装置のファイルに記憶されている。具体的には、表示特許についての拒絶理由通知等及び、表示特許を引用している拒絶理由通知のファイルがワークメモリに展開されたものが検索される（図４のステップＳ４０３）。図１２Ｂは、拒絶理由データベース１０５のデータ構成例を示す図である。図１２Ｂに示すように拒絶理由データベース１０５に登録される各拒絶理由通知等のレコードは、引用元出願番号、引用先情報、及び拒絶理由通知等の本文の各フィールドから構成される。表示特許についての拒絶理由通知等が検索される場合には、ステップＳ４０２で抽出された出願番号が、拒絶理由データベース１０５の引用元出願番号フィールドの登録内容と一致するレコードが検索される。表示特許を引用している拒絶理由通知等が検索される場合には、ステップＳ４０２で抽出された出願番号が、拒絶理由データベース１０５の引用先情報フィールドに含まれるレコードが検索される。この結果、検索されたレコードの本文フィールドに登録されているテキストデータが抽出される。この処理は、図２Ａの拒絶理由通知検索部１０４の機能を実現する。ここで拒絶理由データベースには、拒絶理由通知のほか、拒絶の理由を示す文書、例えば拒絶査定、審決、審査前置報告等が含まれる。以下の説明では、これらを合わせた「拒絶の理由を示す文章」を「拒絶理由通知等」と呼ぶことにする。 Next, the rejection reason database 105 (FIG. 1) is searched for the display patent which is the patent document searched in step S401. The rejection reason database 105 is stored in a file of a storage device such as an external storage device 3005 in FIG. More specifically, a search is made for a file in which a notification of reasons for refusal for a displayed patent and a notification of reasons for refusal quoting the displayed patent are expanded in the work memory (step S403 in FIG. 4). FIG. 12B is a diagram illustrating a data configuration example of the rejection reason database 105. As shown in FIG. 12B, each record of reasons for refusal or the like registered in the reason for rejection database 105 is made up of each field of the text of the citation source application number, citation destination information, and the reason for refusal. When searching for a notice of reason for refusal for the displayed patent, etc., a record in which the application number extracted in step S402 matches the registered content of the citation source application number field of the reason for rejection database 105 is searched. When a notice of reason for refusal that cites the displayed patent is retrieved, a record in which the application number extracted in step S402 is included in the citation destination information field of the reason database 105 is retrieved. As a result, the text data registered in the body field of the retrieved record is extracted. This process realizes the function of the rejection reason notification search unit 104 in FIG. 2A. In this case, the rejection reason database includes a document indicating the reason for refusal, for example, a rejection decision, a trial decision, a pre-examination report, etc., in addition to a notification of the reason for refusal. In the following description, the “text indicating the reason for refusal” combined with these will be referred to as “notification of reason for refusal”.

次に、ステップＳ４０３で検索された拒絶理由通知等とステップＳ４０１で得られている表示特許（特許文書）とから、後述する図５Ｂの動作フローチャートで示される制御動作に従って、言葉の対のデータが抽出される（ステップＳ４０４）。この処理は、図２Ａの言葉対抽出部１０６の機能を実現する。 Next, from the notification of reasons for refusal searched in step S403 and the display patent (patent document) obtained in step S401, the data of the word pair is obtained in accordance with the control operation shown in the operation flowchart of FIG. Extracted (step S404). This process realizes the function of the word pair extraction unit 106 in FIG. 2A.

次に、ステップＳ４０４で言葉の対のデータが抽出できたか否かが判定される（ステップＳ４０５）。
ステップＳ４０５の判定がＹＥＳならば、ステップＳ４０１で得られている表示特許中の特許請求の範囲における請求項の文字列において、具体例が付加される対象となる言葉の文字列が特定される（ステップＳ４０６）。即ち、ステップＳ４０４の言葉の対の抽出処理によって得られる各言葉の対の文字列に含まれる各抽象的な言葉の文字列と対となる言葉の各文字列が、請求項の文字列上で特定される。この処理は、図２Ａの具体例付加対象特定部１０８の機能を実現する。 Next, it is determined whether or not the word pair data has been extracted in step S404 (step S405).
If the determination in step S405 is YES, the character string of the word to which a specific example is added is specified in the character string of the claim in the display patent claim obtained in step S401 ( Step S406). That is, each character string of a word that is paired with each character string of each abstract word included in the character string of each word pair obtained by the word pair extraction process in step S404 is represented on the character string of the claim. Identified. This process realizes the function of the specific example addition target specifying unit 108 in FIG. 2A.

そして、ステップＳ４０６で特定された請求項中の各具体例付加対象の言葉とそれぞれに対応する具体例とが対比されて、利用者のコンピュータのディスプレイに表示される（ステップＳ４０７）。または、具体例付加対象にリンクを設定しておき、利用者にクリックされた時に具体例を表示するようにしてもいい。この処理は、図２Ａの表示部１０９の機能を実現する。ただし、ここで述べた具体例は、請求項中の各具体例付加対象の言葉と対応する言葉の例示であり、最終的に言葉が具体例か否かの判断は第２の実施例を参照して後述する。 Then, the specific example addition target words in the claims identified in step S406 are compared with the corresponding specific examples and displayed on the display of the user's computer (step S407). Alternatively, a link may be set as a specific example addition target, and a specific example may be displayed when the user clicks. This process realizes the function of the display unit 109 in FIG. 2A. However, the specific examples described here are exemplifications of words corresponding to the words to be added to the specific examples in the claims, and finally refer to the second embodiment to determine whether the words are specific examples. Will be described later.

前述のステップＳ４０４で言葉の対のデータが抽出できずステップＳ４０５の判定がＮＯならば、言葉の対の表示は行われず、利用者によって指定された現在の表示特許に対する請求項中の言葉の例示処理を終了する。 If the word pair data cannot be extracted in step S404 and the determination in step S405 is NO, the word pair is not displayed, and the words in the claims for the current display patent designated by the user are exemplified. The process ends.

上述のように、ステップS４０３において、表示特許についての拒絶理由通知及び表示特許を引用している拒絶理由通知の２種類の拒絶理由通知等が検索されるのは、以下の理由による。即ち、表示特許についての拒絶理由通知等では、図５Ａ（ａ）に示されるように、例えば『引用文献１の「中傷や風説」が請求項１の「個人の評価」に相当』という記述形式において、「請求項１」が表示特許（ステップＳ４０１で検索された特許文書）の請求項１を指している。一方、表示特許を引用している拒絶理由通知等では、図５（ｂ）に示されるように、例えば『引用文献２の「個人の評価」が請求項２の「中傷や風説」に相当』という記述形式において、「引用文献２」が表示特許（ステップＳ４０１で検索された特許文書）を指している。 As described above, in step S403, two types of reasons for refusal notification, such as a notice of reason for refusal for the displayed patent and a notice of reason for refusal that cites the indicated patent, are searched for the following reason. That is, in the notice of reasons for refusal of the displayed patent, as shown in FIG. 5A (a), for example, a description format “Correspondent Document 1“ Smallness and Rumor ”Corresponds to“ Individual Evaluation ”in Claim 1” "Claim 1" indicates claim 1 of the displayed patent (patent document searched in step S401). On the other hand, in the notice of reason for refusal that cites the displayed patent, as shown in FIG. 5 (b), for example, “individual evaluation” in cited reference 2 corresponds to “deflection or custom” in claim 2. In the description format, “Cited document 2” indicates the displayed patent (patent document searched in step S401).

そして、どちらの記述形式でも、「中傷や風説」という具体的な言葉と「個人の評価」という抽象的な言葉は、引用文献特定パターン「引用文献」及び本願特定パターン「請求項」で区切られた部分から抽出されている。ここでいう「本願」とは、拒絶理由通知等でいう「本願」の意味である。 In both description formats, the concrete word “defamatory or rude” and the abstract word “individual evaluation” are separated by the cited document specific pattern “cited document” and the present application specific pattern “claim”. It is extracted from the part. The “present application” here means the “present application” in the notice of reasons for refusal or the like.

しかし、図５Ａ（ａ）の記述形式では、「請求項１」を含む表示特許に関する拒絶理由通知等が「引用文献１」を引用する場合である。そして、本願特定パターンである「請求項１」から後の部分に含まれる言葉「個人の評価」が、表示特許中に含まれる抽象的な言葉である。また、本願特定パターンである「請求項１」から前の部分に含まれる言葉「中傷や風説」が、具体的な言葉即ち具体例である。 However, in the description format of FIG. 5A (a), the notice of reasons for refusal regarding the display patent including “claim 1” or the like refers to “cited document 1”. The word “individual evaluation” included in the part after “claim 1”, which is the specific pattern of the present application, is an abstract word included in the display patent. Further, the word “defamatory or rumor” included in the previous part from “Claim 1” which is the specific pattern of the present application is a specific word, that is, a specific example.

一方、図５Ａ（ｂ）の記述形式では、逆に、「請求項２」を含む他の特許に関する拒絶理由通知等が、表示特許である「引用文献２」を引用する場合である。そして、本願特定パターンである「請求項２」から後の部分に含まれる言葉「中傷や風説」が、具体的な言葉即ち具体例である。また、本願特定パターンである「請求項２」から前の部分に含まれる言葉「個人の評価」が、表示特許中に含まれる抽象的な言葉である。 On the other hand, in the description format of FIG. 5A (b), conversely, the notice of reasons for refusal regarding other patents including “Claim 2” and the like refer to “Cited document 2” which is a display patent. The word “defamatory or rumor” included in the part after “claim 2” which is the specific pattern of the present application is a specific word, that is, a specific example. Further, the word “individual evaluation” included in the part before “Claim 2” which is the present application specific pattern is an abstract word included in the display patent.

このように、表示特許についての拒絶理由通知等（図５Ａ（ａ））と、表示特許を引用している拒絶理由通知等（図５Ａ（ｂ））の何れにおいても、拒絶理由通知等に含まれる引用文献特定パターンと本願特定パターンに基づいて、次の処理へ言葉の対が抽出される。 Thus, both the notice of reasons for refusal for the displayed patent (FIG. 5A (a)) and the notice of reasons for refusal that cite the indicated patent (FIG. 5A (b)) are included in the notice of reasons for refusal. Based on the cited document specific pattern and the present application specific pattern, word pairs are extracted for the next processing.

しかし、表示特許についての拒絶理由通知等（図５Ａ（ａ））と、表示特許を引用している拒絶理由通知等（図５Ａ（ｂ））とでは、表示特許の中の抽象的な言葉と拒絶理由通知等の中の具体的な言葉をそれぞれ抽出する部分の位置関係が逆になっている。 However, the notice of reason for refusal for the displayed patent (FIG. 5A (a)) and the notice of reason for refusal etc. (FIG. 5A (b)) citing the indicated patent are the abstract words in the displayed patent. The positional relationship of the parts for extracting specific words in the notice of reasons for refusal is reversed.

以上のように、表示特許についての拒絶理由通知等（図５Ａ（ａ））と、表示特許を引用している拒絶理由通知等（図５Ａ（ｂ））の何れからでも、表示特許中の抽象的な言葉と拒絶理由通知等中の具体的な言葉の対応関係すなわち言葉の対を抽出できる。このため、ステップＳ４０３では、拒絶理由データベース１０５（図２Ａ）から図５Ａ（ａ）及び図５Ａ（ｂ）の両方の拒絶理由通知等が検索される。 As described above, the abstract in the display patent can be obtained from either the notification of the reason for refusal regarding the display patent (FIG. 5A (a)) or the notification of the reason for refusal quoting the display patent (FIG. 5A (b)). The correspondence between specific words and specific words in the notice of reasons for refusal, that is, word pairs can be extracted. For this reason, in step S403, the rejection reason notifications in both FIG. 5A (a) and FIG. 5A (b) are retrieved from the rejection reason database 105 (FIG. 2A).

次に第１の実施形態における言葉の対を抽出するステップＳ４０４における言葉の対の抽出処理の概略の手順を述べる。以下の手順１、２、３−１又は３−２に基づくコンピュータによるパターンマッチング処理と位置検索処理と不要文字削除処理により、上述の例１から例４までのパターンがそれぞれ判別されながら、本願についての記述部分と引用文献の記述部分が判別、分割されて、請求項で使っている言葉と引用文献中の具体例の言葉の対が抽出される。 Next, an outline procedure of the word pair extraction processing in step S404 for extracting word pairs in the first embodiment will be described. About the present application while the patterns from Example 1 to Example 4 described above are discriminated by pattern matching processing, position search processing, and unnecessary character deletion processing by a computer based on the following procedures 1, 2, 3-1, or 3-2. And the description part of the cited document are discriminated and divided, and a pair of a word used in the claim and a specific example word in the cited document is extracted.

手順１：
利用者から指定された特許文書ファイルに対応して検索された拒絶の理由を示す文書（文字列）から、「相当」などの対表現パターン文字列が存在する１文を複数の文字パターンのパターンマッチング処理により抽出する。ここで対表現パターンとは、「相当」のように、その前の言葉と後の言葉が意味的に同じあるいは対となっていることを示す文字列であって、あらかじめ決められたものが辞書に格納されている。一文とは、例えば句点の次の文字から始まり、次の句点までの文字列である。対表現パターンについてはそのテーブルのデータ構成を図６を参照して後述する。 Step 1 :
From a document (character string) indicating the reason for refusal retrieved corresponding to a patent document file designated by the user, a sentence having a pair expression pattern character string such as “equivalent” is converted into a plurality of character pattern patterns. Extract by matching process. Here, the pair expression pattern is a character string indicating that the previous word and the subsequent word are semantically the same or paired, as in “equivalent”, and a predetermined one is a dictionary. Stored in A sentence is, for example, a character string starting from the character next to a punctuation and extending to the next punctuation. Regarding the pair expression pattern, the data structure of the table will be described later with reference to FIG.

手順２：
手順１で抽出された１文が、本願の言葉が先に書かれ対表現パターン文字列「相当」の後に引用文献の言葉を書く書き方を有するか（例外ルール→図３（ｄ）の例４）、否か（通常処理→図３（ａ）の例１、図３（ｂ）の例２、図３（ｃ）の例３）を判別する。 Step 2 :
Whether one sentence extracted in step 1 has a way of writing the word of the cited document after the word of the present application is written first and the paired expression pattern character string “equivalent” (exception rule → example 4 in FIG. 3D) ) Or not (normal processing → example 1 in FIG. 3A, example 2 in FIG. 3B, example 3 in FIG. 3C).

手順３−１：（通常処理→図３（ａ）の例１、図３（ｂ）の例２、図３（ｃ）の例３）：
引用文献特定パターン例えば「引用文献」と本願特定パターン例えば「本願」によって、手順１で抽出された１文の記述部分を分割する。 Procedure 3-1: (Normal Processing → Example 1 in FIG. 3A, Example 2 in FIG. 3B, Example 3 in FIG. 3C):
The description part of one sentence extracted in the procedure 1 is divided by the cited document specific pattern such as “cited document” and the present application specific pattern such as “this application”.

本願特定パターンの前を「引用文献の言葉」文字列として抽出し、本願特定パターンの後を「本願の言葉」文字列として抽出する。
各抽出部分の文字列において、不要文字列を削除する。 The text before the application specific pattern is extracted as a character string of “citation document”, and the text after the application specific pattern is extracted as a character string of “word of this application”.
In the character string of each extracted part, the unnecessary character string is deleted.

本願特定パターン、引用文献特定パターン、不要文字列については、そのテーブルのデータ構成を図６を参照して後述する。 Regarding the application specific pattern, cited reference specific pattern, and unnecessary character string, the data structure of the table will be described later with reference to FIG.

手順３―２：（例外ルール→図３（ｄ）の例４）：
対表現パターン文字列、例えば「相当」の文字列によって、手順１で抽出された１文文字列の記述部分を分割する。即ち、上記対表現パターン文字列の文字列の後を「引用文献の言葉」文字列として抽出し、対表現パターン文字列の前を「本願の言葉」文字列として抽出する。
各抽出部分のデータにおいて、不要文字列を削除する。 Procedure 3-2: (Exception rule → Example 4 in FIG. 3D):
The description portion of the single sentence character string extracted in the procedure 1 is divided by a pair expression pattern character string, for example, a character string of “equivalent”. That is, the character string of the pair expression pattern character string is extracted as a “word of reference document” character string, and the character string before the counter expression pattern character string is extracted as a “word of the present application” character string.
Delete unnecessary character strings in the data of each extracted part.

次に、図４のステップＳ４０４における言葉の対の抽出処理（図２Ａの言葉対抽出部１０６の機能）の詳細について、図５Ｂに示される動作フローチャートに従って説明する。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 Next, the details of the word pair extraction process (the function of the word pair extraction unit 106 in FIG. 2A) in step S404 in FIG. 4 will be described according to the operation flowchart shown in FIG. 5B. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device.

図４のステップＳ４０３で前述したように、表示特許のテキストデータとそれに対応する拒絶理由文書のテキストデータから言葉の対のデータが抽出される場合には、引用文献特定パターンと本願特定パターンとが判定されながら、抽出処理が実行される。そして、図３の（ａ）から（ｄ）で示される例１から例４までのパターンが前述した手順１、２、３−１又は３−２に基づくパターンマッチング処理、位置検出処理により判別されながら、本願についての記述部分と引用文献の記述部分が判別、分割される。以下、図３の（ａ）から（ｄ）で示される例１から例４までのパターンを例示しながら、図５Ｂの動作フローチャートの処理を説明する。 As described above in step S403 of FIG. 4, when word pair data is extracted from the text data of the displayed patent and the text data of the reason document for rejection corresponding thereto, the cited document specific pattern and the present application specific pattern are The extraction process is executed while being determined. The patterns from Example 1 to Example 4 shown in (a) to (d) of FIG. 3 are discriminated by the pattern matching process and the position detection process based on the above-described procedures 1, 2, 3-1, or 3-2. However, the description part of the present application and the description part of the cited document are discriminated and divided. Hereinafter, the process of the operation flowchart of FIG. 5B will be described with reference to the patterns from Example 1 to Example 4 shown in (a) to (d) of FIG. 3.

まず、拒絶理由通知等のテキストデータにおいて、対表現パターン文字列、例えば「相当」の文字列が存在するか否かがパターンマッチング処理によって判定される（ステップＳ５０１）。ステップＳ５０１の具体的処理としては、拒絶理由データベースの先頭から１文字ずつ読み込まれながら、対表現パターン文字列のマッチングを検索するパターンマッチング処理が実行される。今、図２Ａの表現パターン辞書１０７内には、例えば図６（ａ）に例示されるような対表現パターンテーブルが記憶されている。この対表現パターンテーブルには、拒絶理由通知等における本願特許中の言葉と引用文献中の言葉とを対として結びつける「相当」「対応」「等価」「等しい」等の対表現パターンが登録されている。ステップＳ５０１では、拒絶理由通知等のファイルにおいて、対表現パターンテーブルに登録されている例えば「相当」等の文字列が存在するか否かが判定される。なお、表現パターン辞書１０７は、例えば後述する図２７の外部記憶装置３００５等の記憶装置に記憶されている。 First, it is determined by pattern matching processing whether or not a pair expression pattern character string, for example, a character string of “equivalent” exists in text data such as a rejection reason notice (step S501). As a specific process of step S501, a pattern matching process for searching for a matching of a pair expression pattern character string is executed while reading one character at a time from the top of the rejection reason database. Now, in the expression pattern dictionary 107 of FIG. 2A, for example, a counter expression pattern table as illustrated in FIG. 6A is stored. In this counter-expression pattern table, counter-expression patterns such as “equivalent”, “correspondence”, “equivalent”, “equal”, etc., that link the words in the patent of the present application and the words in the cited document in the notice of reasons for refusal are registered Yes. In step S501, it is determined whether or not a character string such as “equivalent” registered in the counter expression pattern table exists in the file of the rejection reason notice or the like. The expression pattern dictionary 107 is stored in a storage device such as an external storage device 3005 in FIG.

対表現パターン文字列のデータがなくステップＳ５０１の判定がＮＯならば、言葉の対のデータの抽出は行われず図５Ｂの動作フローチャートの処理を終了する。
ステップＳ５０１の判定がＹＥＳならばまず、上述した手順１として、拒絶理由通知のテキストデータにおいて、例えば「相当」という対表現パターン文字列が存在する１文の文字列が抽出されて文字列Ｙとされる（ステップＳ５０２）。この文字列Ｙは、ワークメモリ（例えば後述する図２７のメモリ３００２に対応する）上に保持される。例えば、前述した図３（ａ）に示される例１の記述パターンでは、図７（ａ）に示される１文が抽出される。また、図３（ｂ）に示される例２の記述パターンでは、図８（ａ）に示される１文のデータが抽出される。また、図３（ｃ）に示される例１の記述パターンでは、図９（ａ）に示される１文のデータが抽出される。更に、図３（ｄ）に示される例４の例外記述パターンでは、図１０（ａ）に示される１文のデータが抽出される。 If there is no pair expression pattern character string data and the determination in step S501 is NO, the word pair data is not extracted and the processing of the operation flowchart of FIG. 5B is terminated.
If the determination in step S501 is YES, first, as procedure 1 described above, in the text data of the reason for refusal, for example, a single sentence character string in which the counter-expression pattern character string “equivalent” exists is extracted, and the character string Y and (Step S502). This character string Y is held on a work memory (for example, corresponding to a memory 3002 in FIG. 27 described later). For example, in the description pattern of Example 1 shown in FIG. 3A described above, one sentence shown in FIG. 7A is extracted. Further, in the description pattern of Example 2 shown in FIG. 3B, one sentence data shown in FIG. 8A is extracted. Further, in the description pattern of Example 1 shown in FIG. 3C, one sentence data shown in FIG. 9A is extracted. Further, in the exception description pattern of Example 4 shown in FIG. 3D, one sentence data shown in FIG. 10A is extracted.

次に、上述の手順２として、文字列Ｙにおいて、文字列パターン「には」に続いて、本願の言葉が第１の文字列として先に書かれ、対表現パターン文字列「相当する」の文字列の後に、引用文献の言葉が第２の文字列として書かれる記述パターンがあるか否かが判定される（ステップＳ５０３）。これは、前述した図３（ｄ）の例４に対応する例外ルールパターンであるか否かを判定する処理である。具体的には、ステップＳ５０２は、ワークメモリ上の文字列Ｙがスキャンされながら、例えば「×××には・・・に相当する〜」という記述パターンがあるか否かを判定する処理である。更に具体的には、まず、ワークメモリ上の文字列Ｙの先頭から１文字ずつ読み込まれながら、パターンマッチングにより、パターン「には」がまず検索される。そして、そのパターンがマッチしたら、文字列Ｙの「には」以降でパターン「に相当する」が検索される。例えば図３（ａ）、（ｂ）、又は（ｃ）に示される例１、例２、例３の記述パターンでは、ステップＳ５０３の判定はＮＯとなる。また、図３（ｄ）に示される例４の例外記述パターンでは、ステップＳ５０３の判定はＹＥＳとなる。 Next, as the above-mentioned procedure 2, in the character string Y, the word of the present application is first written as the first character string following the character string pattern “to”, and the counter-expression pattern character string “corresponds”. It is determined whether there is a description pattern in which the word of the cited document is written as the second character string after the character string (step S503). This is a process of determining whether or not the exception rule pattern corresponds to Example 4 in FIG. Specifically, step S502 is a process of determining whether or not there is a description pattern such as “xxx corresponds to ...” while the character string Y on the work memory is scanned. . More specifically, first, the pattern “to” is first searched by pattern matching while reading one character at a time from the beginning of the character string Y on the work memory. If the pattern matches, the pattern “corresponding to” is searched after “in” of the character string Y. For example, in the description patterns of Example 1, Example 2, and Example 3 shown in FIGS. 3A, 3B, or 3C, the determination in step S503 is NO. In the exception description pattern of Example 4 shown in FIG. 3D, the determination in step S503 is YES.

上述の記述パターンがなくステップＳ５０３の判定がＮＯの場合には、手順３−１に対応するデータ処理としてとして、ステップＳ５０４からＳ５１２の一連の処理が実行される。 If there is no description pattern described above and the determination in step S503 is NO, a series of processing from steps S504 to S512 is executed as data processing corresponding to procedure 3-1.

まず、文字列Ｙに引用文献特定パターンがあるか否かが判定される（ステップＳ５０４）。ここで、前述した図２Ａの表現パターン辞書１０７には、例えば図６（ｂ）に例示される引用文献特定パターンテーブルが記憶されている。ステップＳ５０４では、ワークメモリ上の文字列Ｙがスキャンされながら、表現パターン辞書１０７内の図６（ｂ）に例示される引用文献特定パターンテーブルに登録されている引用文献特定パターンの文字列が含まれるか否かが判定される。具体的には、文字列Ｙの先頭から１文字ずつ読み込まれながら、パターンマッチングにより引用文献特定パターンテーブル中の各引用文献特定パターンを各検索パターンとする各パターンマッチング処理が実行される。ここで、文字列Ｙから引用文献特定パターンが検出される毎に、その引用文献特定パターンの末尾以降の文字列Ｙの領域で更に引用文献特定パターンを検出するパターンマッチング処理が繰り返し実行される。これにより、図３（ｂ）に示されるように、文字列Ｙ中に含まれる複数の引用文献特定パターンを検出できる。 First, it is determined whether or not there is a cited document specific pattern in the character string Y (step S504). Here, in the expression pattern dictionary 107 of FIG. 2A described above, for example, a cited document specific pattern table illustrated in FIG. 6B is stored. In step S504, the character string Y of the cited document specific pattern registered in the cited document specific pattern table illustrated in FIG. 6B in the expression pattern dictionary 107 is included while the character string Y on the work memory is scanned. It is determined whether or not. Specifically, each pattern matching process using each cited document specific pattern in the cited document specific pattern table as a search pattern is performed by pattern matching while reading one character at a time from the beginning of the character string Y. Here, every time a cited document specific pattern is detected from the character string Y, a pattern matching process for further detecting the cited document specific pattern in the area of the character string Y after the end of the cited document specific pattern is repeatedly executed. Thereby, as shown in FIG. 3B, a plurality of cited document specific patterns included in the character string Y can be detected.

文字列Ｙに引用文献特定パターンが含まれずステップＳ５０４の判定がＮＯならば、図５Ｂの動作フローチャートの処理を終了する。
ステップＳ５０４の判定がＹＥＳならば、ワークメモリ上の文字列Ｙが記憶アドレスの逆順にスキャンされながら、以下のパターンマッチング処理が実行される。即ち、最も後ろに出現する検索された引用文献特定パターンの後ろから「相当」等の対表現パターン文字列又は「相当」等の対表現パターン文字列がない場合は文字列Ｙの末尾までが、文字列Ｚとして抽出される。この文字列Ｚも、ワークメモリ上に保持される。そして、ワークメモリ上の文字列Ｙがスキャンされながら、ワークメモリ上の文字列Ｚに一致する文字列と上述の検索された引用文献特定パターンに一致する文字列が削除される。その結果、新たな文字列Ｙがワークメモリ上に得られる（ステップＳ５０５）。例えば、図３（ａ）に示される例１の記述パターンでは「引用文献２には、アドホックネットワークにおいて調停端末がネットワークトポロジ（本願の「通信状態情報」に相当）の変化を検知し、．．．」（図７（a））から、図７（ｂ）に示される部分「２には、アドホックネットワークにおいて調停端末がネットワークトポロジ（本願の「通信状態情報」に相当」が文字列Ｚとして抽出される。また、新たな文字列Ｙは「）の変化を検知し、・・・」となる。図３(ｂ)に示される例２の記述パターンでは、図８（ｂ）に示される部分が文字列Ｚとして抽出される。また、新たな文字列Ｙは「．．．引用文献１に記載の「掲示板」が請求項１に係る発明の「コンテンツ情報」に相当し、引用文献１に記載の「記事」が請求項１に係る発明の「個人の意見の開示単位」に相当し、する。」となる。図３（ｃ）に示される例３の記述パターンでは、図９（ｂ）に示される部分が文字列Ｚとして抽出される。また、新たな文字列Ｙは「．．．する。」となる。 If the character string Y does not include the cited document specific pattern and the determination in step S504 is NO, the process of the operation flowchart in FIG. 5B is terminated.
If the determination in step S504 is YES, the following pattern matching process is executed while the character string Y on the work memory is scanned in the reverse order of the storage address. That is, when there is no pair expression pattern character string such as “equivalent” or a pair expression pattern character string such as “equivalent” from the back of the searched cited document specific pattern that appears most recently, the end of the character string Y is Extracted as a character string Z. This character string Z is also held on the work memory. Then, while the character string Y on the work memory is scanned, the character string that matches the character string Z on the work memory and the character string that matches the searched cited document specific pattern are deleted. As a result, a new character string Y is obtained on the work memory (step S505). For example, in the description pattern of Example 1 shown in FIG. 3A, “ cited document 2 states that the arbitrating terminal detects a change in the network topology (corresponding to“ communication state information ”in the present application ) in the ad hoc network; . . "(FIG. 7 (a)), the portion" 2 "shown in FIG. 7 (b) shows that the arbitration terminal in the ad hoc network has a network topology (corresponding to" communication status information "in this application ) as a character string Z. In addition, the new character string Y becomes “... In the description pattern of Example 2 shown in FIG. 3B, the portion shown in FIG. Further, in the new character string Y, “... bulletin board” described in cited document 1 corresponds to “content information” of the invention according to claim 1, and “article” described in cited document 1 is claimed in claim 1. This corresponds to the “unit of disclosure of personal opinions” of the invention. " In the description pattern of Example 3 shown in FIG. 3C, the portion shown in FIG. 9B is extracted as the character string Z. Further, the new character string Y becomes “...

次に、ワークメモリ上の文字列Ｚに本願特定パターンがあるか否かが判定される（ステップＳ５０６）。ここで、前述した図６の表現パターン辞書１０７には、例えば図６（ｃ）に例示される本願特定パターンテーブルが記憶されている。ステップＳ５０６では、ワークメモリ上の文字列Ｚがスキャンされながら、表現パターン辞書１０７内の図６（ｃ）に例示される本願特定パターンテーブルに登録されている本願特定パターンの文字列が含まれるか否かが判定される。具体的には、文字列Ｚの先頭から１文字ずつ読み込まれながら、パターンマッチング処理により、本願特定パターンテーブルに登録されている各本願特定パターンの文字列を各検索パターンとする各パターンマッチング処理が実行される。例えば図３（ａ）に示される例１の記述パターンでは、図７（ｂ）に示される文字列Ｚに「本願」という本願特定パターンが含まれるため、ステップＳ５０６の判定はＹＥＳとなる。図３（ｂ）に示される例２の記述パターンでは、図７（ｂ）に示される文字列Ｚに「請求項」という本願特定パターンが含まれるため、ステップＳ５０６の判定はＹＥＳとなる。図３（ｃ）に示される例２の記述パターンでは、図９（ｂ）に示される文字列Ｚに「本願請求項」という本願特定パターンが含まれるため、ステップＳ５０６の判定はＹＥＳとなる。 Next, it is determined whether or not the application specific pattern is present in the character string Z on the work memory (step S506). Here, in the expression pattern dictionary 107 of FIG. 6 described above, for example, the application specific pattern table illustrated in FIG. 6C is stored. In step S506, whether the character string of the application specific pattern registered in the application specific pattern table illustrated in FIG. 6C in the expression pattern dictionary 107 is included while the character string Z on the work memory is scanned. It is determined whether or not. Specifically, each pattern matching process using each character string of each application specific pattern registered in the application specific pattern table as a search pattern is performed by pattern matching while reading one character at a time from the beginning of the character string Z. Executed. For example, in the description pattern of Example 1 shown in FIG. 3A, the present application specific pattern of “this application” is included in the character string Z shown in FIG. 7B, so the determination in step S506 is YES. In the description pattern of Example 2 shown in FIG. 3B, the character string Z shown in FIG. 7B includes the application specific pattern “claim”, so the determination in step S506 is YES. In the description pattern of Example 2 shown in FIG. 3C, the character string Z shown in FIG. 9B includes the present application specific pattern “claim of the present application”, so the determination in step S <b> 506 is YES.

文字列Ｚに本願特定パターンがなくステップＳ５０６の判定がＮＯならば、その文字列Ｚからは言葉の対の文字列は抽出されず、残りの文字列Ｙに対して、ステップＳ５０４からＳ５０６までの処理が繰り返される。 If there is no application specific pattern in the character string Z and the determination in step S506 is NO, the character string of the word pair is not extracted from the character string Z, and steps S504 to S506 are performed for the remaining character string Y. The process is repeated.

ステップＳ５０６の判定がＹＥＳならば、ワークメモリ上の文字列Ｚにおいて、検索された本願特定パターンに対応する文字列の前の文字列部分が、文字列αとして抽出される（ステップＳ５０７）。文字列αは、ワークメモリ上に記憶される。この文字列αは、引用文献の言葉に対応する。例えば図３（ａ）に示される例１の記述パターンでは、図７（ｂ）の文字列Ｚから、本願特定パターン「本願」の前の部分「２には、アドホックネットワークにおいて調停端末がネットワークトポロジ(」が、文字列α即ち引用文献の言葉として抽出される。即ち、図７（ｃ）の如くである。図３（ｂ）に示される例２の記述パターンでは、図８（ｂ）の文字列Ｚから、本願特定パターン「請求項」の前の部分「１に記載の「悪口」が」が、文字列α即ち引用文献の言葉として抽出される。即ち、図８（ｃ）の如くである。図３（ｃ）に示される例３の記述パターンでは、図９（ｂ）の文字列Ｚから、本願特定パターン「本請求項」の前の部分「１のクライアント計算機、Ａ書店、コンビニエンスチェーンが各々」が、文字列α即ち引用文献の言葉として抽出される。即ち、図９（ｃ）の如くである。 If the determination in step S506 is YES, the character string portion before the character string corresponding to the searched specific pattern in the character string Z on the work memory is extracted as the character string α (step S507). The character string α is stored on the work memory. This character string α corresponds to the words of the cited document. For example, in the description pattern of Example 1 shown in FIG. 3A, from the character string Z in FIG. 7B, the arbitration terminal in the ad hoc network has a network topology in the portion “2” before the specific pattern “this application”. ("Is extracted as the character string α, that is, the word of the cited document, that is, as shown in Fig. 7 (c). In the description pattern of Example 2 shown in Fig. 3 (b), in Fig. 8 (b) From the character string Z, the portion ““ bad words ”described in 1” in front of the specific pattern “claim” of the present application is extracted as the character string α, that is, the word of the cited document, that is, as shown in FIG. In the description pattern of Example 3 shown in Fig. 3 (c), from the character string Z in Fig. 9 (b), the part "1 client computer, A bookstore, `` Each convenience chain is `` '' It is extracted as a leaf, that is, as shown in FIG.

続いて、ワークメモリ上の文字列Ｚにおいて、本願特定パターンに対応する文字列の後の文字列部分が、文字列βとして抽出される（ステップＳ５０８）。文字列βも、ワークメモリ上に記憶される。なお、図５Ｂ中では、「文字列β」は、「文字列β」と表示されている。この文字列βは、本願の言葉に対応する。例えば図３（ａ）に示される例１の記述パターンでは、図７（ｂ）の文字列Ｚから、本願特定パターン「本願」の後の部分「の「通信状態情報」に相当」が、文字列β即ち本願の言葉として抽出される。即ち、図７（ｃ）の如くである。図３（ｂ）に示される例２の記述パターンでは、図８（ｂ）の文字列Ｚから、本願特定パターン「請求項」の後の部分「１に係る発明の「個人の評価」に相当」が、文字列β即ち本願の言葉として抽出される。即ち、図８（ｃ）の如くである。図３（ｃ）に示される例３の記述パターンでは、図９（ｂ）の文字列Ｚから、本願特定パターン「本請求項」の後の部分「１の第１の情報処理装置、第３の情報処理装置、第２の情報処理装置に相当」が、文字列β即ち本願の言葉として抽出される。即ち、図９（ｃ）の如くである。 Subsequently, in the character string Z on the work memory, the character string portion after the character string corresponding to the specific pattern of the present application is extracted as the character string β (step S508). The character string β is also stored on the work memory. In FIG. 5B, “character string β” is displayed as “character string β”. This character string β corresponds to the word of the present application. For example, in the description pattern of Example 1 shown in FIG. 3A, the character string Z in FIG. It is extracted as the column β, that is, the word of the present application. That is, it is as shown in FIG. In the description pattern of Example 2 shown in FIG. 3B, the character string Z in FIG. 8B corresponds to the “individual evaluation” of the portion “1” of the invention according to the specific pattern “claim” of the present application. "Is extracted as the character string β, that is, the word of the present application. That is, it is as shown in FIG. In the description pattern of Example 3 shown in FIG. 3C, from the character string Z in FIG. 9B, the part “1st first information processing device, 3rd” after the specific pattern “this claim” of the present application. Is equivalent to the second information processing apparatus ”is extracted as the character string β, that is, the word of the present application. That is, it is as shown in FIG.

次に、ワークメモリ上の文字列α（引用文献の言葉）と文字列β（本願の言葉）のそれぞれにおいて、かぎ括弧と句読点による分割が行われる（ステップＳ５０９）。これは、言葉の並列記載を分割するための処理である。具体的には、文字列αと文字列βのそれぞれにおいて、先頭から１文字ずつ読み込まれながら、かぎ括弧と句読点を各検索パターンとする各パターンマッチング処理が実行され、各文字列上でのかぎ括弧と句読点のアドレスが検出される。このアドレスを区切りとして、各文字列が分割される。例えば図３（ａ）に示される例１の記述パターンでは、図７（ｃ）の文字列α（引用文献の言葉）「２には、アドホックネットワークにおいて調停端末がネットワークトポロジｌ」と文字列β（本願の言葉）「の通信状態に相当」から、図７（ｄ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）が得られる。図３（ｂ）に示される例２の記述パターンでは、図８（ｃ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）から、図８（ｄ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）が得られる。図３（ｃ）に示される例３の記述パターンでは、図９（ｃ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）から、図９（ｄ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）が得られる。特にこの図９（ｃ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）では、各言葉の並列記載が行われている。このため、これらの言葉が図９（ｄ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）として示されるように分割されることになる。 Next, the character string α (words of the cited document) and the character string β (words of the present application) on the work memory are divided by brackets and punctuation marks (step S509). This is a process for dividing the parallel description of words. Specifically, in each of the character string α and the character string β, each pattern matching process is performed by using the brackets and the punctuation marks as the respective search patterns while reading one character at a time from the beginning, and the key on each character string is executed. Parenthesis and punctuation addresses are detected. Each character string is divided using this address as a delimiter. For example, in the description pattern of Example 1 shown in FIG. 3A, the character string α (words of the cited reference) “2” in FIG. 7C is “a network topology l in the ad hoc network” and the character string β. From (corresponding to the communication state) of (the word of the present application), the character string α (word of the cited document) and the character string β (word of the present application) of FIG. In the description pattern of Example 2 shown in FIG. 3B, the character string α in FIG. 8D is obtained from the character string α (word of the cited document) and the character string β (word in the present application) in FIG. (Word of cited document) and character string β (word of the present application) are obtained. In the description pattern of Example 3 shown in FIG. 3C, the character string α in FIG. 9D is obtained from the character string α (word of the cited document) and the character string β (word in the present application) in FIG. (Word of cited document) and character string β (word of the present application) are obtained. In particular, in the character string α (words of the cited document) and the character string β (words of the present application) in FIG. 9C, the words are described in parallel. For this reason, these words are divided as shown as a character string α (words of the cited document) and a character string β (words of the present application) in FIG.

次に、ワークメモリ上の文字列α（引用文献の言葉）と文字列β（本願の言葉）のそれぞれにおいて、不要文字列に対応する文字列が削除される（ステップＳ５１０）。ここで、前述した図２Ａの表現パターン辞書１０７には、例えば図６（ｄ）に例示される不要文字列テーブルが記憶されている。ステップＳ５１０では、ワークメモリ上の文字列αと文字列βのそれぞれがスキャンされながら、図６（ｄ）に例示される不要文字列テーブルの各登録パターンに該当する不要文字列がパターンマッチングにより検索され削除される。また、ステップＳ５１０では、後述する係り受け解析処理により、ワークメモリ上の文字列αと文字列βのそれぞれにおいて、係り受け解析を実行して不要文字列が削除される。 Next, the character string corresponding to the unnecessary character string is deleted in each of the character string α (word of the cited document) and the character string β (word of the present application) on the work memory (step S510). Here, in the above-described expression pattern dictionary 107 in FIG. 2A, for example, an unnecessary character string table illustrated in FIG. 6D is stored. In step S510, an unnecessary character string corresponding to each registered pattern in the unnecessary character string table illustrated in FIG. 6D is searched by pattern matching while each of the character string α and the character string β on the work memory is scanned. And deleted. In step S510, dependency analysis is performed on each of the character string α and the character string β on the work memory, and unnecessary character strings are deleted by a dependency analysis process described later.

ステップＳ５１０では、ワークメモリ上の文字列αと文字列βのそれぞれにおいて、例えば、最後の複合語に係っていない文節も、不要文字列として削除される。図３（ａ）に示される例１の記述パターンにおいて、図７（ｄ）の文字列αから不要文字列テーブルに基づいて不要文字列が削除された後の文字列「アドホックネットワークにおいて調停端末がネットワークトポロジ」を考える。この文字列から、最後の複合語「ネットワークトポロジ」に係っていない文節「アドホックネットワークにおいて調停端末が」の部分が削除される。この結果、図７（ｅ）の文字列αが得られる。 In step S510, in each of the character string α and the character string β on the work memory, for example, a clause not related to the last compound word is also deleted as an unnecessary character string. In the description pattern of Example 1 shown in FIG. 3A, the character string “adjustment terminal in ad hoc network” after the unnecessary character string is deleted based on the unnecessary character string table from the character string α in FIG. Consider “network topology”. From this character string, the phrase “the arbitration terminal is in an ad hoc network” not related to the last compound word “network topology” is deleted. As a result, the character string α shown in FIG.

この結果、図７（ｅ）の文字列α「ネットワークトポロジ」が得られる。また、図７（ｄ）の本願の言葉を示す文字列β「の通信状態情報に相当」から、先頭のひらがな１文字「の」と記号「「」、文字列「相当」と末尾のひらがな１文字「に」と記号「」」が削除される。この結果、図７（ｅ）の文字列β「通信状態情報」が得られる。図３（ｂ）に示される例２の記述パターンでは、図８（ｄ）の文字列αから、先頭の数字「１」とひらがな１文字「に」、及び「記載」、ひらがな１文字「の」、記号「「」「」」、末尾のひらがな１文字「が」が削除される。この結果、図８（ｅ）の文字列αが得られる。また、図８（ｄ）の文字列βから、先頭の数字「１」とひらがな１文字「に」、及び文字列「係る」「発明」、ひらがな１文字「の」、記号「「」、文字列「相当」、ひらがな１文字「に」、記号「」」が削除される。この結果、図８（ｅ）の文字列βが得られる。図３（ｃ）に示される例３の記述パターンでは、図９（ｄ）の文字列α中の「１のクライアント計算機」から先頭の数字「１」とひらがな１文字「の」が削除される。また、文字列α中の「Ａ書店」から記号「Ａ」が削除される。更に、文字列α中の「コンビニエンスチェーンが各々」から文字列「各々」と末尾のひらがな１文字「が」が削除される。この結果、図９（ｅ）の文字列αが得られる。また、図９（ｄ）の文字列β中の「１の第１の情報処理装置」から先頭の数字「１」とひらがな１文字「の」が削除される。また、文字列β中の「第２の情報処理装置に相当」から文字列「相当」と末尾のひらがな１文字「に」が削除される。この結果、図９（ｅ）の文字列βが得られる。 As a result, the character string α “network topology” shown in FIG. Further, from the character string β “corresponding to the communication state information of the character string β indicating the word of the present application in FIG. 7D, the first hiragana character“ no ”and the symbol“ “”, the character string “equivalent” and the last hiragana 1 The characters “ni” and symbol “” ”are deleted. As a result, the character string β “communication state information” in FIG. In the description pattern of Example 2 shown in FIG. 3B, from the character string α in FIG. 8D, the first numeral “1”, one hiragana character “ni”, “description”, one hiragana character “ ", The symbol" "" "", and the last hiragana character "ga" are deleted. As a result, the character string α shown in FIG. Also, from the character string β in FIG. 8D, the first digit “1” and one hiragana character “ni”, the character string “related” “invention”, one hiragana character “no”, the symbol ““ ”, the character The column “equivalent”, the single hiragana character “ni”, and the symbol “” ”are deleted. As a result, the character string β shown in FIG. In the description pattern of Example 3 shown in FIG. 3C, the first numeral “1” and one hiragana character “NO” are deleted from “1 client computer” in the character string α of FIG. 9D. . Also, the symbol “A” is deleted from “A bookstore” in the character string α. Furthermore, the character string “each” and the last hiragana character “ga” are deleted from “the convenience chain is each” in the character string α. As a result, the character string α shown in FIG. 9E is obtained. In addition, the first numeral “1” and one hiragana character “NO” are deleted from “1 first information processing apparatus” in the character string β in FIG. 9D. In addition, the character string “equivalent” and the last hiragana character “ni” are deleted from “equivalent to the second information processing apparatus” in the character string β. As a result, the character string β shown in FIG. 9E is obtained.

以上のようにして最終的に、ワークメモリ上の文字列α（引用文献の言葉）から得られた言葉とワークメモリ上の文字列β（本願の言葉）から得られた言葉が対応付けられ、ワークメモリ上に言葉の対を示す文字列として記憶される（ステップＳ５１１）。例えば図３（ａ）に示される例１の記述パターンでは、図７（ｅ）の文字列α中の言葉「ネットワークトポロジ」と文字列β中の言葉「通信状態情報」が対応付けられる。図３（ｂ）に示される例２の記述パターンでは、図８（ｅ）の文字列α中の言葉「悪口」と文字列β中の言葉「個人の評価」が対応付けられる。図３（ｃ）に示される例３の記述パターンでは、図９（ｅ）の文字列α中の言葉「クライアント計算機」と文字列β中の言葉「第１の情報処理装置」が対応付けられる。また、図９（ｅ）の文字列α中の言葉「書店」と文字列β中の言葉「第３の情報処理装置」が対応付けられる。更に、図９（ｅ）の文字列α中の言葉「コンビニエンスチェーン」と文字列β中の言葉「第２の情報処理装置」が対応付けられる。 Finally, the word obtained from the character string α (word of the cited document) on the work memory and the word obtained from the character string β (word of the present application) on the work memory are associated with each other as described above. It is stored on the work memory as a character string indicating a word pair (step S511). For example, in the description pattern of Example 1 shown in FIG. 3A, the word “network topology” in the character string α in FIG. 7E is associated with the word “communication state information” in the character string β. In the description pattern of Example 2 shown in FIG. 3B, the word “bad words” in the character string α and the word “individual evaluation” in the character string β in FIG. In the description pattern of Example 3 shown in FIG. 3C, the word “client computer” in the character string α in FIG. 9E is associated with the word “first information processing device” in the character string β. . Further, the word “bookstore” in the character string α in FIG. 9E is associated with the word “third information processing device” in the character string β. Furthermore, the word “convenience chain” in the character string α in FIG. 9E is associated with the word “second information processing device” in the character string β.

以上のステップＳ５０７からＳ５１１までの一連の処理によって、ワークメモリ上の文字列Ｚに対する処理が終了すると、ワークメモリ上の残りの文字列Ｙが空であるか否かが判定される（ステップＳ５１２）。そして、ステップＳ５１２の判定がＮＯならば、ステップＳ５０４に戻って、ワークメモリ上の文字列Ｙが再びスキャンされながら、引用文献特定パターンの文字列があるか否かが判定される。例えば、図３（ａ）に示される例１の記述パターンでは、ステップＳ５０５の処理により、残りの文字列Ｙは「．．．）の変化を検知し、・・・」となっている。このためステップＳ５１２の判定はＮＯとなる。しかし、文字列Ｙには引用文献特定パターンが含まれないため、ステップＳ５０４の判定はＮＯとなって、図５Ｂの動作フローチャートの処理を終了する。また、図３（ｃ）に示される例３の記述パターンでは、ステップＳ５０５の処理により、残りの文字列Ｙは「．．．する。」となっているため、この場合もステップＳ５１２の判定はＮＯとなる。しかし、文字列Ｙには引用文献特定パターンが含まれないため、ステップＳ５０４の判定はＮＯとなって、図５Ｂの動作フローチャートの処理を終了する。 When the process for the character string Z on the work memory is completed by the series of processes from step S507 to S511, it is determined whether or not the remaining character string Y on the work memory is empty (step S512). . If the determination in step S512 is NO, the process returns to step S504 to determine whether there is a character string of the cited document specific pattern while the character string Y on the work memory is scanned again. For example, in the description pattern of Example 1 shown in FIG. 3A, the remaining character string Y is detected as “... Therefore, the determination in step S512 is NO. However, since the character string Y does not include the cited document specifying pattern, the determination in step S504 is NO, and the process of the operation flowchart in FIG. Further, in the description pattern of Example 3 shown in FIG. 3C, the remaining character string Y is “...” By the process of step S505. No. However, since the character string Y does not include the cited document specifying pattern, the determination in step S504 is NO, and the process of the operation flowchart in FIG.

一方、図３（ｂ）に示される例２の記述パターンでは、ステップＳ５０５の処理により、残りの文字列Ｙは「．．．引用文献１に記載の「掲示板」が請求項１に係る発明の「コンテンツ情報」に相当し、引用文献１に記載の「記事」が請求項１に係る発明の「個人の意見の開示単位」に相当し、する。」となっている。即ち、文字列Ｙには引用文献特定パターンが含まれている。このためステップＳ５１２の判定がＮＯとなった後、ステップＳ５０４の判定がＹＥＳとなる。 On the other hand, in the description pattern of Example 2 shown in FIG. 3B, the “character board” described in the cited reference 1 is the remaining character string Y in the process of step S505. It corresponds to “content information”, and “article” described in cited document 1 corresponds to “disclosure unit of personal opinion” of the invention according to claim 1. " That is, the character string Y includes a cited document specifying pattern. For this reason, after the determination in step S512 is NO, the determination in step S504 is YES.

この結果、新たに実行されるステップＳ５０５において、「１に記載の「記事」が請求項１に係る発明の「個人の意見の開示単位」に相当」が文字列Ｚとして抽出される。そして、新たな文字列Ｙが「．．．引用文献１に記載の「掲示板」が請求項１に係る発明の「コンテンツ情報」に相当し、」となる。更に、文字列Ｚに対して、ステップＳ５０６の判定がＹＥＳとなってステップＳ５０７からＳ５１１までの一連の処理が実行され、言葉の対のデータが抽出される。 As a result, in step S505 newly executed, “an article described in 1 corresponds to“ an individual opinion disclosure unit ”of the invention according to claim 1” is extracted as a character string Z. Then, the new character string Y becomes "..." Bulletin board "described in cited document 1 corresponds to" content information "of the invention according to claim 1." Further, for the character string Z, the determination in step S506 is YES, and a series of processing from step S507 to S511 is executed, and word pair data is extracted.

更に、ステップＳ５１２の判定がＮＯ、ステップＳ５０４の判定が再びＹＥＳとなった後、新たに実行されるステップＳ５０５において、「１に記載の「掲示板」が請求項１に係る発明の「コンテンツ情報」に相当」が文字列Ｚとして抽出される。そして、新たな文字列Ｙが「．．．し、する」となる。更に、文字列Ｚに対して、ステップＳ５０６の判定がＹＥＳとなってステップＳ５０７からＳ５１１までの一連の処理が事項され、言葉の対が抽出される。 Further, after the determination in step S512 is NO and the determination in step S504 is YES again, in step S505 that is newly executed, “the“ bulletin board ”described in 1” is the “content information” of the invention according to claim 1. "Corresponding to" is extracted as the character string Z. Then, the new character string Y becomes “... Furthermore, for the character string Z, the determination in step S506 is YES, and a series of processing from step S507 to S511 is performed, and word pairs are extracted.

最後に、ステップＳ５１２の判定がＮＯ、残りの文字列Ｙ「．．．し、する」に対してステップＳ５０４の判定がＮＯとなって、図５Ｂの動作フローチャートの処理を終了する。 Finally, the determination in step S512 is NO, the determination in step S504 is NO for the remaining character string Y “..., Do”, and the processing of the operation flowchart in FIG.

次に、図５ＢのステップＳ５０３で、文字列Ｙに、本願の言葉が先に書かれ対表現パターン文字列のデータ「相当する」の後に引用文献の言葉を書く記述パターン「×××には・・・に相当する〜」が存在すると判定された場合には、ステップＳ５１３からＳ５１７の一連の処理が実行される。これは、図３（ｄ）の例４に対応する前述した手順３−２に対応するデータ処理である。 Next, in step S503 of FIG. 5B, the description pattern “xxx” includes the word of the present application written in the character string Y and the word of the cited document after the data “corresponding” of the pair expression pattern character string. When it is determined that there is a “...” corresponding to..., A series of processing from step S513 to S517 is executed. This is data processing corresponding to the above-described procedure 3-2 corresponding to the example 4 in FIG.

まず、ワークメモリ上の文字列Ｙがスキャンされながら、対表現パターン文字列のデータの後の文字列部分が、文字列αとして抽出される（ステップＳ５１３）。この文字列αは、ワークメモリ上に記憶される。この文字列αは、引用文献の言葉に対応する。例えば図３（ｄ）に示される例４の例外記述パターンでは、図１０（ｂ）の文字列Ｙから、対表現パターン文字列の「に相当する」の後の部分「「移動の指示」（段落［０１００］）に基づくスケジューリングも開示されている。」が、文字列α即ち引用文献の言葉として抽出される。即ち、図１０（ｃ）の如くである。 First, while the character string Y on the work memory is scanned, the character string portion after the data of the pair expression pattern character string is extracted as the character string α (step S513). This character string α is stored on the work memory. This character string α corresponds to the words of the cited document. For example, in the exception description pattern of Example 4 shown in FIG. 3D, from the character string Y in FIG. 10B, the part “corresponding” of the pair expression pattern character string ““ movement instruction ”( Scheduling based on paragraph [0100]) is also disclosed. "Is extracted as the character string α, that is, the word of the cited document. That is, it is as shown in FIG.

続いて、ワークメモリ上の文字列Ｙにおいて、対表現パターン文字列「に相当する」に対応する文字列のデータの前の文字列部分が、文字列βとして抽出される（ステップＳ５１４）。この文字列βも、ワークメモリ上に記憶される。この文字列βは、本願の言葉に対応する。例えば図３（ｄ）に示される例４の例外記述パターンでは、図１０（ｂ）の文字列Ｙから、対表現パターン文字列「に相当する」の前の部分「1にはまた、メタレベルタスクに」が、文字列β即ち引用文献の言葉として抽出される。即ち、図１０（ｃ）の如くである。 Subsequently, in the character string Y on the work memory, the character string portion before the character string data corresponding to the pair expression pattern character string “corresponds to” is extracted as the character string β (step S514). This character string β is also stored on the work memory. This character string β corresponds to the word of the present application. For example, in the exception description pattern of Example 4 shown in FIG. 3D, the meta level task is also changed from the character string Y in FIG. "" Is extracted as the character string β, that is, the word of the cited document. That is, it is as shown in FIG.

次に、ワークメモリ上の文字列α（引用文献の言葉）と文字列β（本願の言葉）のそれぞれにおいて、かぎ括弧と句読点による分割が行われる（ステップＳ５１５）。これは、前述したステップＳ５０９の処理と同様の、言葉の並列記載を分割するための処理である。例えば図３（ｄ）に示される例４の例外記述パターンでは、図１０（ｃ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）から、図１０（ｄ）の文字列α（引用文献の言葉）と文字列β（本願の言葉）が得られる。 Next, the character string α (words of the cited document) and the character string β (words of the present application) on the work memory are divided by brackets and punctuation marks (step S515). This is a process for dividing the parallel description of words, similar to the process of step S509 described above. For example, in the exception description pattern of Example 4 shown in FIG. 3D, the character of FIG. 10D is obtained from the character string α (word of the cited document) and the character string β (word of the present application) of FIG. A sequence α (words of the cited document) and a character string β (words of the present application) are obtained.

次に、ワークメモリ上の文字列α（引用文献の言葉）と文字列β（本願の言葉）のそれぞれがスキャンされながら、文字列パターンマッチングに基づいて、図６（ｄ）に例示される不要文字列テーブルの各登録パターンに該当する不要文字列が検索され削除されまたは係り受け解析により不要文字列が削除される（ステップＳ５１６）。例えば図３（ｄ）に示される例４の例外記述パターンでは、図１０（ｄ）の文字列αの「する「移動の指示」から図１０（ｅ）の文字列α「移動の指示」が得られる。また、図１０（ｄ）の文字列β「１にはまた」「メタレベルタスク」から図１０（ｅ）の文字列β「メタレベルタスク」が得られる。 Next, while the character string α (words of the cited document) and the character string β (words of the present application) on the work memory are scanned, the unnecessary character illustrated in FIG. Unnecessary character strings corresponding to each registered pattern in the character string table are searched and deleted, or unnecessary character strings are deleted by dependency analysis (step S516). For example, in the exception description pattern of Example 4 shown in FIG. 3D, the character string α “movement instruction” in FIG. can get. Further, the character string β “meta-level task” in FIG. 10E is obtained from the “meta-level task” in the character string β “1” in FIG.

以上のようにして最終的に、ワークメモリ上の文字列α（引用文献の言葉）から得られた言葉とワークメモリ上の文字列β（本願の言葉）から得られた言葉が対応付けられ、ワークメモリ上に言葉の対を示すデータとして記憶される（ステップＳ５１７）。例えば図３（ｄ）に示される例４の例外記述パターンでは、図１０（ｅ）の文字列α中の言葉「移動の指示」と文字列β中の言葉「メタレベルタスク」が対応付けられる。 Finally, the word obtained from the character string α (word of the cited document) on the work memory and the word obtained from the character string β (word of the present application) on the work memory are associated with each other as described above. It is stored as data indicating word pairs on the work memory (step S517). For example, in the exception description pattern of Example 4 shown in FIG. 3D, the word “move instruction” in the character string α in FIG. 10E and the word “meta-level task” in the character string β are associated with each other.

以上説明した第１の実施形態の処理で、図５ＢのステップＳ５１１又はＳ５１７にて文字列α（引用文献の言葉）から得られた言葉と文字列β（本願の言葉）から得られた言葉が対応付けられ、ワークメモリ上に言葉の対を示すデータとして記憶される。これを受けて、その言葉の対のデータに含まれる各抽象的な言葉のデータと意味的に同じあるいは対となる言葉の各データ部分が、表示特許の特許請求の範囲の請求項のデータ上で特定される。
以上のステップＳ５１３からＳ５１７までの一連の処理によって、ワークメモリ上の文字列Ｙに対する処理が終了し、図５Ｂの動作フローチャートの処理を終了する。 In the processing of the first embodiment described above, words obtained from the character string α (words of the cited document) and words obtained from the character string β (words of the present application) in step S511 or S517 in FIG. Corresponding and stored as data indicating word pairs on the work memory. In response to this, each data portion of the word that is semantically the same as or paired with each abstract word data included in the word pair data is added to the claim data in the claims of the displayed patent. Specified by
Through the series of processes from step S513 to S517, the process for the character string Y on the work memory is completed, and the process of the operation flowchart of FIG. 5B is completed.

図１１は表示部の表示例を示す。表示特許の請求項１１０１とともに、請求項に記載された言葉、たとえば「コンテンツ情報」と引用文献に記載された言葉、たとえば「掲示板」等との対応関係１１０２が表示される。 FIG. 11 shows a display example of the display unit. Along with the claim 1101 of the displayed patent, the correspondence 1102 between the words described in the claims, for example, “content information” and the words described in the cited document, for example, “bulletin board” is displayed.

図１３Ａは、図５のステップＳ５０１、Ｓ５０３、Ｓ５０４、Ｓ５０６、Ｓ５０９、Ｓ５１０、Ｓ５１３、Ｓ５１５、Ｓ５１８、又はＳ５１９の処理におけるパターンマッチング処理及び文字列位置検出処理の具体的な動作を示す動作フローチャートである。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 FIG. 13A is an operation flowchart showing specific operations of the pattern matching process and the character string position detection process in the process of steps S501, S503, S504, S506, S509, S510, S513, S515, S518, or S519 of FIG. is there. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device.

図１３Ａでは、まず、検索を行いたいパターン文字列のデータがワークメモリ又は記憶装置からワークメモリ上の所定領域に読み込まれる（ステップＳ６０１）。例えば、図５ＢのステップＳ５０１では、このパターン文字列は、記憶装置上の表現パターン辞書１０７（図２Ａ）中の図７（ａ）に示される対表現パターンテーブルから読み込まれる各対表現パターンの文字列である。 In FIG. 13A, first, pattern character string data to be searched is read from a work memory or a storage device into a predetermined area on the work memory (step S601). For example, in step S501 of FIG. 5B, the pattern character string is a character of each pair expression pattern read from the pair expression pattern table shown in FIG. 7A in the expression pattern dictionary 107 (FIG. 2A) on the storage device. Is a column.

次に、パターンマッチング処理の対象となる文字列であるマッチング対象文字列のデータがワークメモリ又は記憶装置からワークメモリ上の所定領域に読み込まれる（ステップＳ６０２）。例えば、図５ＢのステップＳ５０１では、マッチング対象文字列は、記憶装置上の拒絶理由データベース１０５から読み込まれる拒絶理由通知等のファイルである。 Next, data of a matching target character string, which is a character string to be subjected to pattern matching processing, is read from a work memory or a storage device into a predetermined area on the work memory (step S602). For example, in step S501 of FIG. 5B, the matching target character string is a file such as a rejection reason notice read from the rejection reason database 105 on the storage device.

次に、ワークメモリ上の変数領域に保持されるパターンポインタｐ１に、ステップＳ６０１でワークメモリ上の所定領域に読み込まれたパターン文字列の先頭文字のアドレスがセットされる（ステップ６０３）。 Next, the address of the first character of the pattern character string read in the predetermined area on the work memory in step S601 is set to the pattern pointer p1 held in the variable area on the work memory (step 603).

次に、ワークメモリ上の変数領域に保持されるマッチング基点ポインタｐ２とマッチングポインタｐ３に、ステップＳ６０２でワークメモリ上の所定領域に読み込まれたマッチング対象文字列の先頭文字のアドレスがセットされる（ステップＳ６０４、Ｓ６０５）。 Next, the address of the first character of the character string to be matched read in the predetermined area in the work memory in step S602 is set in the matching base point pointer p2 and the matching pointer p3 held in the variable area on the work memory ( Steps S604 and S605).

次に、マッチングポインタｐ３のアドレスが１ずつインクリメントされながら（ステップＳ６０７）、パターンポインタｐ１で示されるパターン文字列中の先頭文字と一致する文字が、マッチング対象文字列中から検索される（ステップＳ６０６→Ｓ６０７→Ｓ６０８→Ｓ６０６の繰返し処理）。 Next, while the address of the matching pointer p3 is incremented by 1 (step S607), a character that matches the first character in the pattern character string indicated by the pattern pointer p1 is searched from the matching target character string (step S606). (Repeat processing of S607 → S608 → S606).

なお、マッチングポインタｐ３がインクリメントされた結果、ｐ３の値がワークメモリ上の所定領域に読み込まれているマッチング対象文字列の末尾のアドレスを超えた場合には、マッチング無しが出力され図１３Ａの処理を終了する（ステップＳ６０８→Ｓ６０９）。 As a result of incrementing the matching pointer p3, if the value of p3 exceeds the end address of the character string to be matched read into a predetermined area on the work memory, no matching is output and the processing in FIG. 13A is performed. Is finished (steps S608 → S609).

パターンポインタｐ１で示されるパターン文字列中の先頭文字がマッチング対象文字列中で一致しステップＳ６０６の判定がＹＥＳになると、以下のステップＳ６１０からＳ６１７の一連の処理が実行される。ここでは、マッチング対象文字列中で上記一致位置から続く文字列がパターン文字列の２文字目以降の全てと一致するか否かが判定される。 When the first character in the pattern character string indicated by the pattern pointer p1 matches in the character string to be matched and the determination in step S606 becomes YES, the following series of processing from step S610 to S617 is executed. Here, it is determined whether or not the character string continuing from the matching position in the matching target character string matches all the second and subsequent characters of the pattern character string.

まず、ワークメモリ上のマッチング基点ポインタｐ２にステップＳ６０６で一致が検出されたときのマッチングポインタｐ３のアドレスがセットされる（ステップＳ６１０）。
次に、ワークメモリ上のパターンポインタｐ１のアドレス値が＋１される（ステップＳ６１１）。 First, the address of the matching pointer p3 when a match is detected in step S606 is set to the matching base point pointer p2 on the work memory (step S610).
Next, the address value of the pattern pointer p1 on the work memory is incremented by 1 (step S611).

次に、パターンポインタｐ１のアドレス値が、ワークメモリ上の所定領域に読み込まれているパターン文字列の末尾のアドレスを超えたか否かが判定される（ステップＳ６１２）。 Next, it is determined whether or not the address value of the pattern pointer p1 has exceeded the end address of the pattern character string read into the predetermined area on the work memory (step S612).

ステップＳ６１２の判定がＮＯならば、マッチングポインタｐ３のアドレス値が＋１される（ステップＳ６１３）。
次に、マッチングポインタｐ３のアドレス値が、ワークメモリ上の所定領域に読み込まれているマッチング対象文字列の末尾のアドレスを超えたか否かが判定される（ステップＳ６１５）。 If the determination in step S612 is NO, the address value of the matching pointer p3 is incremented by 1 (step S613).
Next, it is determined whether or not the address value of the matching pointer p3 exceeds the end address of the matching target character string read into the predetermined area on the work memory (step S615).

ステップＳ６１４の判定がＮＯならば、マッチングポインタｐ３が指すマッチング対象文字列中のアドレス位置の文字が、パターンポインタｐ１が指すパターン文字列中のアドレス位置の文字と一致するか否かが判定される（ステップＳ６１７）。つまり、ステップＳ６０６がＹＥＳとなった後に最初にステップＳ６１７が実行されるときには、マッチング対象文字列中でステップＳ６０６で一致位置が検出された位置から数えて２文字目が、パターン文字列の２文字目と一致するか否かが判定される。 If the determination in step S614 is NO, it is determined whether or not the character at the address position in the matching target character string pointed to by the matching pointer p3 matches the character at the address position in the pattern character string pointed to by the pattern pointer p1. (Step S617). That is, when step S617 is first executed after YES in step S606, the second character counted from the position where the matching position is detected in step S606 in the matching target character string is the two characters of the pattern character string. It is determined whether or not the eye matches.

ステップＳ６１７の判定がＹＥＳならば、ステップＳ６１１に戻って再びステップＳ６１７までの一連の処理が実行される。ここでは、パターンポインタｐ１とマッチングポインタｐ３の各アドレス値が＋１される（ステップＳ６１１、Ｓ６１４）。この結果、更にマッチング対象文字列中でステップＳ６０６により一致位置が検出された位置から数えて３文字目が、パターン文字列の３文字目と一致するか否かが判定される。 If the determination in step S617 is YES, the process returns to step S611 and the series of processes up to step S617 is executed again. Here, the address values of the pattern pointer p1 and the matching pointer p3 are incremented by 1 (steps S611 and S614). As a result, it is further determined whether or not the third character counted from the position where the matching position is detected in step S606 in the matching target character string matches the third character of the pattern character string.

以上の処理の繰返しにより、マッチング対象文字列中でステップＳ６０６で一致が検出された位置から続く文字列がパターン文字列の２文字目以降の全てと一致するか否かが判定される。 By repeating the above processing, it is determined whether or not the character string that continues from the position where a match is detected in step S606 in the matching target character string matches all the second and subsequent characters of the pattern character string.

上記一連の処理で、ステップＳ６１１でインクリメントされたパターンポインタｐ１の値がワークメモリ上の所定領域に読み込まれているパターン文字列の末尾のアドレスを超えた場合は、マッチング対象文字列上でパターン文字列との一致が検出されたことになる。この場合には、マッチング対象文字列中でステップＳ６０６により一致が検出された位置を示すマッチング基点ポインタｐ２のアドレス値がマッチング結果として出力され、図１３Ａの処理を終了する（ステップＳ６１２→Ｓ６１３）。 In the above series of processing, if the value of the pattern pointer p1 incremented in step S611 exceeds the end address of the pattern character string read into the predetermined area on the work memory, the pattern character on the matching target character string A match with the column is detected. In this case, the address value of the matching base point pointer p2 indicating the position where the match is detected in step S606 in the matching target character string is output as the matching result, and the processing in FIG. 13A is terminated (steps S612 → S613).

一方、ステップＳ６１４でインクリメントされたマッチングポインタｐ３の値が、ワークメモリ上の所定領域に読み込まれているマッチング対象文字列の末尾のアドレスを超えた場合には、マッチング無しが出力され図１３Ａの処理を終了する（ステップＳ６１５→Ｓ６１６）。 On the other hand, if the value of the matching pointer p3 incremented in step S614 exceeds the end address of the matching target character string read into the predetermined area on the work memory, no matching is output and the processing in FIG. 13A is performed. Is finished (step S615 → S616).

上記一連の処理で、マッチングポインタｐ３が指すマッチング対象文字列中のアドレス位置の文字が、パターンポインタｐ１が指すパターン文字列中のアドレス位置の文字と一致せずステップＳ６１７の判定がＮＯとなったとなった場合には、再検索が実行される。即ち、パターンポインタｐ１に、ステップＳ６０１でワークメモリ上の所定領域に読み込まれたパターン文字列の先頭文字のアドレスがセットされる（ステップ６１８）。また、マッチングポインタｐ３に、マッチング基点ポインタｐ２が指すステップＳ６０６で一致が検出された位置の次の位置に対応するアドレス値がセットされる（ステップＳ６１９）。その後、ステップＳ６０６の処理に戻り、マッチングポインタｐ３のアドレスが上記一致位置の次の位置から１ずつインクリメントされながら（ステップＳ６０７）、以下の処理が実行される。即ち、パターンポインタｐ１で示されるパターン文字列中の先頭文字と一致する文字が、マッチング対象文字列中から再検索される（ステップＳ６０６→Ｓ６０７→Ｓ６０８→Ｓ６０６の繰返し処理）。 In the above-described series of processing, the character at the address position in the matching target character string pointed to by the matching pointer p3 does not match the character at the address position in the pattern character string pointed to by the pattern pointer p1, and the determination in step S617 is NO. If it becomes, re-search is executed. That is, the address of the first character of the pattern character string read into the predetermined area on the work memory in step S601 is set in the pattern pointer p1 (step 618). Further, the address value corresponding to the position next to the position where the match is detected in step S606 indicated by the matching base point pointer p2 is set in the matching pointer p3 (step S619). Thereafter, the processing returns to step S606, and the following processing is executed while the address of the matching pointer p3 is incremented by 1 from the position next to the matching position (step S607). That is, the character that matches the first character in the pattern character string indicated by the pattern pointer p1 is searched again from the character string to be matched (steps S606 → S607 → S608 → S606 repetitive processing).

そして、パターンポインタｐ１で示されるパターン文字列中の先頭文字がマッチング対象文字列中で再度一致しステップＳ６０６の判定がＹＥＳになると、以下のステップＳ６１０からＳ６１７の一連の処理が実行される。これにより、マッチング対象文字列中で上記一致位置から続く文字列がパターン文字列の２文字目以降の全てと一致するか否かが再度判定される。 When the first character in the pattern character string indicated by the pattern pointer p1 matches again in the matching target character string and the determination in step S606 is YES, the following series of processing from step S610 to S617 is executed. Thereby, it is determined again whether or not the character string continuing from the matching position in the matching target character string matches all the second and subsequent characters of the pattern character string.

以上の一連の処理の繰返しにより、マッチング対象文字列中でパターン文字列が検索され、マッチングに成功した場合には、その一致の先頭位置がマッチング結果として出力される（ステップＳ６１３）。 By repeating the above-described series of processing, a pattern character string is searched for in the matching target character string. If matching is successful, the head position of the match is output as a matching result (step S613).

次に３ステップによる係り受け解析による不要文字削除処理を図１３Ｂのフローチャート及び図１３Ｃの形態素の辞書（ａ）、品詞の接続確率（ｂ）、文節の接続ルール（ｃ）、文節の接続確率（ｄ）の各テーブルを参照して説明する。なお、この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 Next, unnecessary character deletion processing by dependency analysis in three steps is performed in the flowchart of FIG. 13B and the morpheme dictionary (a), part-of-speech connection probability (b), phrase connection rule (c), phrase connection probability ( This will be described with reference to each table of d). Each process in the operation flowchart is executed by a computer system including, for example, a CPU, a memory, and an external storage device.

ステップＳ１３０１：まず、形態素解析が実行される。名詞、助詞、動詞などの品詞と文字列を要素とする形態素の辞書（ａ）と、品詞の接続確率を定義したテーブル（ｂ）を用いて、最も確率が高い形態素列を選択する。形態素の辞書（ａ）と、品詞の接続確率を定義したテーブル（ｂ）は、例えばワークメモリ又は記憶装置上に記憶されている。ここで例えば、1文が「引用文献に記載の掲示板が請求項に係る発明のコンテンツ情報に相当する。」である場合について説明する。以下のような形態素列の候補について確率が計算され、最も確率が高い形態素列が選択される。
（１）引用（サ変名詞）、文献（名詞）、に（助詞）、記載（サ変名詞）、の（助詞）．．．
(２)引用（サ変名詞）、文（名詞）、献（未登録語）、に（助詞）、記載（サ変名詞）．．． Step S1301: First, morphological analysis is executed. The morpheme string having the highest probability is selected using a dictionary (a) of morphemes whose elements are parts of speech and character strings such as nouns, particles, and verbs, and a table (b) defining connection probabilities of parts of speech. The morpheme dictionary (a) and the table (b) defining the connection probabilities of parts of speech are stored, for example, on a work memory or a storage device. Here, for example, a case where one sentence is “The bulletin board described in the cited document corresponds to the content information of the claimed invention” will be described. Probabilities are calculated for the following morpheme sequence candidates, and the morpheme sequence with the highest probability is selected.
(1) Citation (sa variable noun), literature (noun), (particle), description (sa variable noun), (particle). . .
(2) Citation (sa variable noun), sentence (noun), dedication (unregistered word), ni (particle), description (sa variable noun). . .

ステップＳ１３０２：次に、文節分割の処理を行なう。すなわち文節の接続ルール（ｃ）によって形態素を接続して文節を生成する。
上記の例では、以下のような文節が生成される。
「引用文献に」「記載の」など Step S1302: Next, phrase division processing is performed. That is, a phrase is generated by connecting morphemes according to the phrase connection rule (c).
In the above example, the following clause is generated.
“In cited document”, “Described”, etc.

ステップＳ１３０３：次に、係り受け解析を行う。即ち、
（１）文節の接続確率（ｄ）と制約ルールによって、最も確率が高い係り受け先を選択する。
(２）接続確率（ｄ）は文節のタイプや近接関係などにより定義する。
例えば、上記の例では、文節のタイプは以下のようになり、引用文献に → 「に」格の名詞節、記載の → 「の」格の名詞節となる。
（３）近接関係の処理を行なう。例えば、以下のように求められ、「引用文献に」と「記載の」は隣接、「引用文献に」と「掲示板が」は１文節間隔である。 Step S1303: Next, dependency analysis is performed. That is,
(1) The dependency destination with the highest probability is selected according to the connection probability (d) of the clause and the constraint rule.
(2) The connection probability (d) is defined by the phrase type and proximity relationship.
For example, in the above example, the types of clauses are as follows: in the cited document, a noun clause with a “Ni” case, and with a description, a noun clause with a “no” case.
(3) Perform proximity processing. For example, it is calculated as follows, “in cited document” and “description” are adjacent, and “in cited document” and “bulletin board” are one-segment intervals.

これに対し、以下のような制約条件を満たし、かつ、確率が最大になる係り先を求める。

・最後以外の文節は後方に必ず一つの係り先文節を持つ
・係り受け関係は交差しない

最終的には、上記の例から以下のような係り受け組が生成される。

・「引用文献に−記載の」「記載の−掲示板が」「掲示板が−相当する」など On the other hand, a dependency destination that satisfies the following constraints and has the maximum probability is obtained.

・ The clauses other than the last one always have one dependency clause behind. ・ The dependency relationship does not intersect.

Finally, the following dependency pair is generated from the above example.

・ "In the cited document-described", "described in-bulletin board", "bulletin board-equivalent", etc.

上述の係り受け解析処理では、係り受け組は、辞書とのパターンマッチングと、マッチングしたパターンに付けられている確率の計算によって生成される。なお、上述の係り受け解析は、文字列α又はβのみからではなく、元の文字列Ｙを使って全体的な係り受け関係が抽出されるように実行されてもよい。 In the dependency analysis process described above, the dependency pair is generated by pattern matching with the dictionary and calculation of the probability attached to the matched pattern. Note that the above-described dependency analysis may be executed so that the entire dependency relationship is extracted using the original character string Y, not only from the character string α or β.

以上説明した特許文書中の単語または単語の組み合わせの例示装置の第１の実施形態によって、発明者が表示特許中で明確に例を示していない場合も、その表示特許の内容を深く理解していると思われるその表示特許に対応する拒絶理由通知等を作成した審査官の解釈が利用される。これにより、請求項中の言葉が具体的に何を表しているかを知ることができ、請求項の記述方法に馴染みの無い利用者であっても請求項を理解することができるようになる。 Even if the inventor has not clearly shown an example in the displayed patent, the first embodiment of the example of the word or the combination of words in the patent document described above can provide a deep understanding of the contents of the displayed patent. The interpretation of the examiner who prepared the notice of reasons for refusal corresponding to the indicated patent is used. Thereby, it is possible to know what the words in the claims specifically represent, and even a user who is not familiar with the claim description method can understand the claims.

以上の説明では、請求項の中で使われている言葉に具体例を表示するものとして説明したが、同様の処理を請求項以外の言葉にも対しても適用することで、例えば、実施例や発明を実施するための形態の中で使われている言葉に具体例を表示することもできる。 In the above description, specific examples have been described as being used in the words used in the claims. However, by applying the same processing to words other than the claims, for example, the embodiment Specific examples can also be displayed in the words used in the embodiments for carrying out the invention.

図１４は、特許文書中の単語または単語の組み合わせの例示装置の第２の実施形態の構成図である。図１４において、図１に示される第１の実施形態と同じ処理を行う部分には、同じ番号が付されている。 FIG. 14 is a configuration diagram of a second embodiment of an example device for words or combinations of words in a patent document. In FIG. 14, the same numbers are assigned to parts that perform the same processing as in the first embodiment shown in FIG. 1.

第２の実施形態の構成は、図１に示される第１の実施形態の構成に加えて、以下の構成を含む。即ち、言葉対データベース１４０２を参照することにより、言葉対抽出部１０６において抽出された各言葉の対を構成する２つの言葉のうちどちらか具体例であるかを判定する具体例判定部１４０１を含む。具体例判定部１４０１は更に、特許文書解析部１４０１−１、言葉の出現位置特定部１４０１−２、特許文書検索ヒット件数取得部１４０１−３、及び判定部１４０１−４を含む。 The configuration of the second embodiment includes the following configuration in addition to the configuration of the first embodiment shown in FIG. That is, a specific example determination unit 1401 that determines which one of the two words constituting each word pair extracted by the word pair extraction unit 106 is a specific example by referring to the word pair database 1402 is included. . The specific example determination unit 1401 further includes a patent document analysis unit 1401-1, a word appearance position specifying unit 1401-2, a patent document search hit number acquisition unit 1401-3, and a determination unit 1401-4.

第１の実施形態において、言葉の対のみからでは、その２つの言葉のうちどちらが具体例なのかを単純に判別することができない場合がある。
言葉の対にまで情報を落とさないで拒絶理由の記述を利用すれば、言葉が使われているセクション（明細書中の請求項や実施例又は発明を実施するための形態などのセクション）の情報が書かれている場合もあり、これを使って、どちらが具体的かを推定できる可能性もあるが、必ずしもセクションの情報が書かれるとは限らない。 In the first embodiment, it may not be possible to simply determine which of the two words is a specific example from only a pair of words.
If you use the description of the reason for refusal without dropping the information to the pair of words, the information of the section in which the word is used (the section in the description, such as claims, examples or forms for carrying out the invention) May be used to estimate which is specific, but the section information is not always written.

例えば、下記のような拒絶理由通知等においては、引用文献の言葉の方が具体的である。
拒絶理由通知等の記述例１：
『引用文献１に記載の「掲示板」が請求項１に係る発明の「コンテンツ情報」に相当し、引用文献１に記載の「記事」が請求項１に係る発明の「個人の意見の開示単位」に相当し、引用文献１に記載の「悪口」が請求項１に係る発明の「個人の評価」に相当する。』
拒絶理由通知等の記述例２：
『引用文献２には、アドホックネットワークにおいて調停端末がネットワークトポロジ（本願の「通信状態情報」に相当）の変化を検知し、』 For example, in the following notice of reasons for refusal, the words of cited documents are more specific.
Example of notification of reasons for refusal 1:
“The“ bulletin board ”described in the cited document 1 corresponds to the“ content information ”of the invention according to claim 1, and the“ article ”described in the cited document 1 is the“ disclosure unit of the individual opinion ”of the invention according to claim 1. The “bad word” described in the cited document 1 corresponds to “individual evaluation” of the invention according to claim 1. ]
Example 2 of notification of reasons for refusal:
“In Cited Document 2, an arbitration terminal detects a change in network topology (corresponding to“ communication status information ”in the present application) in an ad hoc network.”

一方、下記のような拒絶理由通知においては、本願（又は請求項に係る発明）の言葉の方が具体的である。
拒絶理由通知等の記述例３：
『引用文献１に記載の発明の「信号処理回路１４」は、請求項１に係る発明の「経路選択手段」に相当し、』
拒絶理由通知等の記述例４：
『引用文献２に記載の「板バネ状部材」は、魚体の後部を内部から支えているといえることから、当該「板バネ状部材」は、本願の請求項２に係る発明の「肋骨状部分」に相当するといえる。』 On the other hand, in the notification of reasons for refusal as described below, the words of the present application (or the claimed invention) are more specific.
Example 3 of notification of reasons for refusal:
“The“ signal processing circuit 14 ”of the invention described in the cited document 1 corresponds to the“ route selection means ”of the invention according to claim 1,”
Example 4 of notification of reasons for refusal:
“The“ plate spring-like member ”described in the cited document 2 can be said to support the rear part of the fish body from the inside. Therefore, the“ plate spring-like member ”is the“ rib-like shape ”according to claim 2 of the present application. It can be said that it corresponds to “part”. ]

ここで一般的に、請求項で使われる言葉は、実施例又は発明を実施するための形態（以下、「実施例等」と呼ぶ）で使われる言葉に比べて抽象的である可能性が高い。そこで、第２の実施形態では、具体例判定部１４０１が、言葉対抽出部１０６で抽出された言葉の対を構成する各言葉について、各言葉がどのセクションで使われているかという情報を解析し、その解析結果に基づいて、言葉の対を構成する各言葉のどちらが具体例かを判定する。 Here, in general, a word used in a claim is more likely to be abstract than a word used in an embodiment or a mode for carrying out the invention (hereinafter referred to as “example etc.”). . Therefore, in the second embodiment, the specific example determination unit 1401 analyzes information on which section each word is used for each word constituting the word pair extracted by the word pair extraction unit 106. Based on the analysis result, it is determined which of the words constituting the word pair is a specific example.

更に具体的には、具体例判定部１４０１は、以下の２つの方法を組み合わせた判定処理を実行する。 More specifically, the specific example determination unit 1401 executes a determination process in which the following two methods are combined.

方法１：特許データベース１０１（図１２Ａ参照）の中で拒絶理由通知における引用文献の特許文書と本願の特許文書について、抽出された言葉の対を構成する各言葉が使われている出現位置（セクション）を特定することによる判定処理が実施される。すなわち、出現位置（セクション）が［特許請求の範囲］のセクションかあるいはそれ以外のセクション例えば［発明の実施する形態］、［発明の概要］、［要約書］等のセクションかが判定される。 Method 1: In the patent database 101 (see FIG. 12A), with respect to the patent document of the cited document in the notice of reasons for refusal and the patent document of the present application, the appearance position (section) where each word constituting the extracted word pair is used A determination process is performed by specifying (). That is, it is determined whether the appearance position (section) is a section of [Claims] or other sections such as [Embodiment of the Invention], [Summary of Invention], [Summary], and the like.

方法２：方法１で何れかの特許文書の請求項と発明を実施する形態等の両方で使用されている言葉がある場合に、抽出された言葉の対を構成する各言葉によって特許データベース１０１（図１２Ａ参照）に登録されている特許文書集合全体が検索される。そして、各検索の結果である各ヒット件数に基づいて判定処理が実施される。 Method 2: When there is a word used in both the claim of any patent document and the mode for carrying out the invention in Method 1, the patent database 101 ( The entire patent document set registered in FIG. 12A) is searched. Then, determination processing is performed based on the number of hits as a result of each search.

図１５は、図１４に示される構成を有する特許文書中の単語または単語の組み合わせの例示装置の第２の実施形態の制御動作を示す動作フローチャートである。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。図１５において、図４に示される第１の実施形態と同じ処理を行う部分には、同じ番号Ｓ４０１、Ｓ４０２、Ｓ４０３、Ｓ４０５、Ｓ４０６が付されている。第２の実施形態の制御動作では、第１の実施形態での制御動作に加えて更に、以下の制御動作が実行される。即ち、図１５のステップＳ４０４にて言葉の対のデータが抽出できてステップＳ４０５の判定がＹＥＳとなった言葉の対のデータ毎に、各言葉の対を構成する２つの言葉のうちどちらが具体例かを判定する処理が実行される（ステップＳ１５０１）。このステップＳ１５０１の処理は、図１４の具体例判定部１４０１−４の機能を実現する。 FIG. 15 is an operation flowchart showing the control operation of the second embodiment of the example device for words or word combinations in the patent document having the configuration shown in FIG. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device. In FIG. 15, the same numbers S401, S402, S403, S405, and S406 are assigned to parts that perform the same processing as in the first embodiment shown in FIG. 4. In the control operation of the second embodiment, the following control operation is executed in addition to the control operation of the first embodiment. That is, for each word pair data for which the word pair data can be extracted in step S404 in FIG. 15 and the determination in step S405 is YES, which of the two words constituting each word pair is a specific example. The process which determines whether is performed (step S1501). The processing in step S1501 realizes the function of the specific example determination unit 1401-4 in FIG.

図１６は、図１５のステップＳ１５０１の具体的な制御動作を示す動作フローチャートである。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。なお、ここでの処理は、図１５のステップＳ４０４にて抽出された言葉の対のデータ毎に繰り返し実行される。そして、その結果得られた具体例は、後述する言葉の対テーブルに登録される。 FIG. 16 is an operation flowchart showing a specific control operation in step S1501 of FIG. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device. The processing here is repeatedly executed for each pair of words extracted in step S404 in FIG. The specific example obtained as a result is registered in a word pair table to be described later.

まず、前述した方法１による判定処理、即ち拒絶理由通知における引用文献の特許文書と本願の特許文書の各本文中で、抽出された言葉の対を構成する各言葉が使われている出現位置（セクション）を特定することによる判定処理が実施される（ステップＳ１６０１）。この処理は、図１４の特許文書解析部１４０１−１及び言葉の出現位置特定部１４０１−２の機能を実現する。 First, in the determination process according to the above-described method 1, that is, in each text of the patent document of the cited document and the patent document of the present application in the notice of reasons for refusal, the appearance position where each word constituting the pair of extracted words is used ( A determination process is performed by specifying (section) (step S1601). This process realizes the functions of the patent document analysis unit 1401-1 and the word appearance position specifying unit 1401-2 in FIG.

次に、方法１により判定ができたか否かが判断される（ステップＳ１６０２）。この処理は、図１４の判定部１４０１−４の機能の一部を実現する。
方法１により判定ができた場合には、図１６の動作フローチャートの処理を終了する。一方、方法１により判定ができなかった場合には、前述した方法２による判定処理が実行される（ステップＳ１６０３）。即ち、抽出された言葉の対を構成する各言葉によって特許データベース１０１（図１２Ａ参照）に登録されている所定の特許文書集合全体が検索される。そして、各検索の結果である各ヒット件数に基づいて判定処理が実施される。この処理は、図１４の特許文書検索ヒット件数取得部１４０１−３の機能及び判定部１４０１−４の一部の機能を実現する。 Next, it is determined whether or not the determination has been made by the method 1 (step S1602). This process realizes a part of the function of the determination unit 1401-4 in FIG.
If the determination can be made by the method 1, the process of the operation flowchart of FIG. On the other hand, when the determination cannot be made by the method 1, the determination process by the method 2 described above is executed (step S1603). That is, the entire predetermined patent document set registered in the patent database 101 (see FIG. 12A) is searched by each word constituting the extracted word pair. Then, determination processing is performed based on the number of hits as a result of each search. This process realizes the function of the patent document search hit number acquisition unit 1401-3 and the partial function of the determination unit 1401-4 in FIG.

図１７は、図１６のステップＳ１６０１の方法１の詳細な制御動作を示す動作フローチャートである。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 FIG. 17 is an operation flowchart showing the detailed control operation of method 1 in step S1601 of FIG. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device.

まず、図１５のステップＳ４０３にて検索された拒絶理由通知等のファイルにて指定されている引用文献特許文書のファイルと本願特許文書のファイルが、特許データベース１０１から取得される（ステップＳ１７０１）。各特許文書のファイルは、ワークメモリ上に展開される。 First, the file of the cited document patent document and the file of the patent document of the present application specified in the file such as the notice of reason for rejection searched in step S403 of FIG. 15 are acquired from the patent database 101 (step S1701). Each patent document file is developed on the work memory.

次に、ステップＳ１７０１にてワークメモリ上に取得された各特許文書のファイルにおいて、セクション構造が解析される（ステップＳ１７０２）。具体的には、各特許文書のファイルにおいて、墨付き括弧タグで囲まれた各見出しに対応する文字列が検索されることにより、特許請求の範囲中の各請求項のセクションと、明細書中の実施例を含む実施の形態等が記述されているセクションが解析される。 Next, the section structure is analyzed in each patent document file acquired on the work memory in step S1701 (step S1702). Specifically, by searching for a character string corresponding to each heading enclosed in black bracket tags in each patent document file, a section of each claim in the claims and a description A section in which an embodiment including the example is described is analyzed.

次に、ステップＳ１７０１にて取得された各特許文書において、言葉の対の抽出処理（図１５のステップＳ４０４）にて抽出された言葉の対を構成する言葉Ａの文字列と言葉Ｂの文字列の出現する位置（セクション）が特定される（ステップＳ１７０３）。このとき、ステップＳ１７０２でのセクションの解析結果が参照される。なお、言葉Ａは、拒絶理由通知等にて指定されている引用文献特許文書に関連付けて抽出された言葉である。また、言葉Ｂは、拒絶理由通知等にて指定されている本願特許文書に関連付けて抽出された言葉である。なお、ステップ４０４にて抽出された言葉の対のデータは、図２３に示されるデータ構成例を有する言葉対データベース１４０２に登録される。言葉対データベース１４０２は、記憶装置に記憶される。言葉対データベース１４０２の各レコードは、被引用側即ち拒絶理由通知等の中で引用文献として引用される引用文献に関する情報として、引用先情報即ち引用文献を特定する情報（公開番号や文献名等）を登録するフィールドを有する。また、各レコードは、引用先即ち引用文献に関連付けてステップＳ４０４にて抽出された引用先での言葉を登録するフィールドを有する。更に、各レコードは、引用側即ち拒絶理由通知等の中で本願として引用される特許文書に関する情報として、引用元情報即ち本願を特定する出願番号を登録するフィールドを有する。また、各レコードは、引用元即ち本願に関連付けてステップＳ４０４にて抽出された引用先での言葉を登録するフィールドを有する。加えて、各レコードは、具体例判定結果フィールドを有する。このフィールドの内容は、ステップＳ４０４において抽出された言葉の対が各レコードに登録される時点では空欄である。そして、ステップＳ１５０１の具体例判定処理が実行された結果、引用先での言葉フィールドに登録された言葉と引用元での言葉フィールドに登録された言葉のうち具体例として判定された方の言葉が登録される。 Next, in each patent document acquired in step S1701, the character string of word A and the character string of word B constituting the word pair extracted in the word pair extraction process (step S404 in FIG. 15). Is identified (section) (step S1703). At this time, the analysis result of the section in step S1702 is referred to. The word A is a word extracted in association with the cited document patent document designated in the notice of reasons for refusal or the like. The word B is a word extracted in association with the patent document of the present application designated in the notice of reasons for refusal or the like. The word pair data extracted in step 404 is registered in the word pair database 1402 having the data configuration example shown in FIG. The word pair database 1402 is stored in a storage device. Each record of the word-pair database 1402 includes citation information, that is, information specifying a cited document (publication number, document name, etc.) as information about a cited document cited as a cited document in a cited party, that is, a notice of reason for refusal. Has a field for registering. Each record has a field for registering the word at the quotation destination extracted in step S404 in association with the quotation destination, that is, the cited document. Further, each record has a field for registering citation source information, that is, an application number for identifying the present application, as information relating to a patent document cited as the present application in the citation, that is, a notice of reasons for refusal. Each record has a field for registering a word at the quotation destination extracted in step S404 in association with the quotation source, that is, the present application. In addition, each record has a specific example determination result field. The contents of this field are blank when the word pairs extracted in step S404 are registered in each record. As a result of the execution of the specific example determination process in step S1501, the word determined as a specific example among the words registered in the word field at the quotation destination and the words registered in the word field at the quotation source is determined. be registered.

次に、言葉Ａのデータが引用文献特許文書のテキストデータ中で出現する場所が請求項のセクションのみで、かつ、言葉Ｂのデータが本願特許文書のテキストデータ中で実施例を含む実施の形態等のセクションに出現する（同時に請求項のセクションに出現してもよい）か否かが判定される（ステップＳ１７０４）。 Next, an embodiment in which the word A data appears in the text data of the cited patent document only in the claims section, and the data of the word B includes examples in the text data of the present patent document Or the like (it may appear in the claim section at the same time) or not (step S1704).

ステップＳ１７０４の判定がＹＥＳの場合に、言葉Ｂが具体例であると判定される（ステップＳ１７０５）。そして、図１７の動作フローチャート即ち図１６のステップＳ１６０１の処理を終了する。 If the determination in step S1704 is YES, it is determined that word B is a specific example (step S1705). Then, the operation flowchart of FIG. 17, that is, the process of step S1601 of FIG. 16 ends.

ステップＳ１７０５の判定がＮＯの場合、更に以下の判定が実行される。即ち、言葉Ａのデータが引用特許文書のテキストデータ中で実施例を含む実施の形態等に出現し（同時に請求項のセクションに出現してもよい）、かつ、言葉Ｂのデータが本願特許文書のテキストデータ中で出現する場所が請求項のセクションのみであるか否かが判定される（ステップＳ１７０６）。 If the determination in step S1705 is NO, the following determination is further performed. That is, the data of the word A appears in the embodiments including the examples in the text data of the cited patent document (may appear in the claims section at the same time), and the data of the word B is the patent document of the present application. It is determined whether or not the place that appears in the text data is only the section of the claim (step S1706).

ステップＳ１７０６の判定がＹＥＳの場合に、言葉Ａが具体例であると判定される（ステップＳ１７０７）。そして、図１７の動作フローチャート即ち図１６のステップＳ１６０１の処理を終了する。
ステップＳ１７０６の判定がＮＯの場合には、方法１では判定は行われず、図１７の動作フローチャート即ち図１６のステップＳ１６０１の処理を終了する。 If the determination in step S1706 is YES, it is determined that word A is a specific example (step S1707). Then, the operation flowchart of FIG. 17, that is, the process of step S1601 of FIG. 16 ends.
If the determination in step S1706 is NO, the determination is not made in Method 1, and the operation flowchart in FIG. 17, that is, the process in step S1601 in FIG.

上述の図１７の制御動作の実行時には、図４、ステップ４０４にて抽出された言葉の対のデータは、図１８に示されるデータ構成例を有する言葉対テーブルに登録される。言葉対テーブルは、ワークメモリ上に保持される。言葉対テーブルの各行は、被引用側即ち拒絶理由通知等の中で引用文献特許文書に関連付けてステップＳ４０４にて抽出された言葉Ａを登録するフィールドを有する。また、各行は、その言葉Ａの引用文献特許文書内での出現位置（セクション）をステップＳ１７０３の処理に基づいて登録するフィールドを有する。更に、各行は、引用側即ち拒絶理由通知等の中で本願特許文書に関連付けてステップＳ４０４にて抽出された言葉Ｂを登録するフィールドを有する。加えて、各行は、その言葉Ｂの引用文献特許文書内での出現位置（セクション）をステップＳ１７０３の処理に基づいて登録するフィールドを有する。そして、各行は、出現位置による具体例判定結果を登録するフォールドを有する。このフィールドには、言葉Ａフィールドと言葉Ｂフィールドに登録された言葉のうち、それぞれに対応する出現位置フィールドの内容に基づいてステップＳ１７０４及びＳ１７０６で具体例と判定された結果に基づく言葉が登録される。 When the control operation of FIG. 17 is executed, the word pair data extracted in step 404 in FIG. 4 is registered in the word pair table having the data configuration example shown in FIG. The word pair table is held on the work memory. Each row of the word pair table has a field for registering the word A extracted in step S404 in association with the cited document patent document in the cited side, that is, the reason for rejection. Each line has a field for registering the appearance position (section) of the word A in the cited patent document based on the processing in step S1703. Further, each line has a field for registering the word B extracted in step S404 in association with the patent document of the present application in the citation side, that is, the reason for rejection. In addition, each line has a field for registering the appearance position (section) of the word B in the cited patent document based on the processing in step S1703. Each row has a fold for registering a specific example determination result based on the appearance position. In this field, of the words registered in the word A field and the word B field, words based on the results determined as specific examples in steps S1704 and S1706 based on the contents of the corresponding appearance position fields are registered. The

例えば、図１８の１８０１で示される各行においては、被引用例（引用文献）での言葉Ａ「クライアント計算機、書店、コンビニエンスチェーン」のデータの出現位置が引用文献特許文書のテキストデータ内の実施例等のセクションに現れているため、図１７、ステップＳ１７０４の判定がＮＯとなる。また、引用側（本願）言葉Ｂ「第１の情報処理装置、第３の情報処理装置、第２の情報処理装置」のデータの出現位置が本願特許文書のテキストデータ内の請求項のセクションのみに現れているためステップＳ１７０６の判定がＹＥＳとなる。この結果、これらの行においては、言葉Ａフィールドの言葉が具体例となるので具体例判定結果の登録フィールドに、言葉Ａフィールドに登録されている言葉が登録「クライアント計算機、書店、コンビニエンスチェーン」される。図１８の１８０１で示される各行以外の行においては、言葉Ａのデータ「移動の指示、悪口、板バネ状部分」の出現位置が引用文献特許文書のテキストデータ内の実施例等のセクションに現れているため、ステップＳ１７０４の判定がＮＯとなる。また、言葉Ｂのデータ「メタルベルタスク、個人の評価、肋骨状部分」の出現位置が本願特許文書のテキストデータ内の請求項と実施例等の両方のセクションに現れているため、ステップＳ１７０６の判定もＮＯとなる。このため、１８０１で示される各行以外の行では、言葉Ａ、言葉Ｂともに実施例に出現しているのでどちらが具体例か判定できないので、図１７の動作フローチャートでは、これらの行においては、具体例判定結果の登録フィールドへの登録は行われない。 For example, in each line indicated by 1801 in FIG. 18, the appearance position of the data of the word A “client computer, bookstore, convenience chain” in the cited example (cited document) is an example in the text data of the cited document. And the determination in step S1704 in FIG. 17 is NO. In addition, only the section of the claim in the text data of the patent document has the appearance position of the data of the citation side (this application) word B “first information processing device, third information processing device, second information processing device”. Appears in step S1706, the determination in step S1706 becomes YES. As a result, in these lines, the words in the word A field are specific examples, so the words registered in the word A field are registered in the registration field of the specific example determination result “client computer, bookstore, convenience chain”. The In the lines other than the respective lines indicated by 1801 in FIG. 18, the appearance position of the data of the word A “movement instruction, bad mouth, leaf spring portion” appears in the section of the example etc. in the text data of the cited document. Therefore, the determination in step S1704 is NO. In addition, since the appearance position of the word B data “metal bell task, personal evaluation, rib-like portion” appears in both the claims and the embodiments in the text data of the patent document of the present application, The determination is also NO. For this reason, in the lines other than the lines indicated by 1801, both the word A and the word B appear in the embodiment, and therefore it cannot be determined which is a specific example. Therefore, in the operation flowchart of FIG. Registration of the determination result in the registration field is not performed.

図１９は、図１６のステップＳ１６０１で方法１により判定ができず、図１６のステップＳ１６０２の判定がＮＯとなって実行される、図１６のステップＳ１６０３の方法２の詳細な制御動作を示す動作フローチャートである。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 FIG. 19 shows the detailed control operation of method 2 in step S1603 in FIG. 16, which cannot be determined by method 1 in step S1601 in FIG. 16, but is determined as NO in step S1602 in FIG. It is a flowchart. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device.

まず、図１７のステップＳ１７０１でワークメモリ上に展開された特許文書において、図１７のステップＳ１７０２でのセクションの解析結果に基づいて決定された請求項のセクションのみが検索範囲とされる。そして、その検索範囲が言葉Ａに対応する文字列で検索されたときのヒット件数ＣＡ１が算出される（ステップＳ１９０１）。 First, in the patent document developed on the work memory in step S1701 of FIG. 17, only the section of the claim determined based on the analysis result of the section in step S1702 of FIG. Then, the number of hits CA1 when the search range is searched with the character string corresponding to the word A is calculated (step S1901).

次に、図１７のステップＳ１７０１でワークメモリ上に展開された特許文書において、図１７のステップＳ１７０２でのセクションの解析結果に基づいて決定された実施例等のセクションのみが検索範囲とされる。そして、その検索範囲が言葉Ａに対応する文字列で検索されたときのヒット件数ＣＡ２が算出される（ステップＳ１９０２）。 Next, in the patent document developed on the work memory in step S1701 in FIG. 17, only the section such as the embodiment determined based on the analysis result of the section in step S1702 in FIG. Then, the number of hits CA2 when the search range is searched with the character string corresponding to the word A is calculated (step S1902).

続いて、図１７のステップＳ１７０１でワークメモリ上に展開された文書において、図１７のステップＳ１７０２でのセクションの解析結果に基づいて決定された請求項のセクションのみが検索範囲とされる。そして、その検索範囲が言葉Ｂに対応する文字列で検索されたときのヒット件数ＣＢ１が算出される（ステップＳ１９０３）。 Subsequently, in the document developed on the work memory in step S1701 in FIG. 17, only the section of the claim determined based on the analysis result of the section in step S1702 in FIG. Then, the number of hits CB1 when the search range is searched with the character string corresponding to the word B is calculated (step S1903).

更に、図１７のステップＳ１７０１でワークメモリ上に取得された文書のファイルにおいて、図１７のステップＳ１７０２でのセクションの解析結果に基づいて決定された実施例等のセクションのみが検索範囲とされる。そして、その検索範囲が言葉Ｂに対応する文字列で検索されたときのヒット件数ＣＢ２が算出される（ステップＳ１９０４）。 Furthermore, in the document file acquired on the work memory in step S1701 in FIG. 17, only the section of the embodiment or the like determined based on the analysis result of the section in step S1702 in FIG. Then, the hit number CB2 when the search range is searched with the character string corresponding to the word B is calculated (step S1904).

そして、ＣＡ２／ＣＡ１の値とＣＢ２／ＣＢ１の値が、それぞれ算出され、各算出結果がワークメモリに保持される。ＣＡ２／ＣＡ１は、特許文書中で、言葉Ａが請求項のセクションに対して実施例等のセクションでどの程度多く出現するかを示す値である。同様に、ＣＢ２／ＣＢ１は、本願特許文書中で、言葉Ｂが請求項のセクションに対して実施例等のセクションでどの程度多く出現するかを示す値である。この結果、ワークメモリ上のＣＡ２／ＣＡ１の値よりもワークメモリ上のＣＢ２／ＣＢ１の値のほうが大きいか否かが判定される（ステップＳ１９０５）。 Then, the value of CA2 / CA1 and the value of CB2 / CB1 are calculated, and each calculation result is held in the work memory. CA2 / CA1 is a value indicating how much the word A appears in the section of the embodiment in the patent document with respect to the section of the claims. Similarly, CB2 / CB1 is a value indicating how much the word B appears in the section of the embodiment etc. with respect to the section of the claims in the patent document of the present application. As a result, it is determined whether the value of CB2 / CB1 on the work memory is larger than the value of CA2 / CA1 on the work memory (step S1905).

ステップＳ１９０５の判定がＹＥＳの場合には、言葉Ｂのほうが具体例であると判定される（ステップＳ１９０６）。このケースは、言葉Ｂが特許文書中で実施例等のセクションに現れる割合のほうが、言葉Ａが特許文書中で実施例等のセクションに現れる割合よりも大きい場合である。 If the determination in step S1905 is YES, it is determined that the word B is a specific example (step S1906). In this case, the ratio of the word B appearing in the section of the example etc. in the patent document is larger than the ratio of the word A appearing in the section of the example etc. in the patent document.

一方、ステップＳ１９０５の判定がＮＯの場合には、言葉Ａのほうが具体例であると判定される（ステップＳ１９０７）。このケースは、言葉Ａが特許文書中で実施例等のセクションに現れる割合のほうが、言葉Ｂが特許文書中で実施例等のセクションに現れる割合よりも大きい場合である。
以上の判定は、具体的な言葉のほうが、実施例等のセクション中で使用される確率が高いという経験則に基づいている。 On the other hand, if the determination in step S1905 is NO, the word A is determined to be a specific example (step S1907). In this case, the ratio of the word A appearing in the section of the example etc. in the patent document is larger than the ratio of the word B appearing in the section of the example etc. in the patent document.
The above determination is based on an empirical rule that specific words have a higher probability of being used in sections such as examples.

上述の図１９の制御動作の実行時には、図１７の制御動作においてワークメモリ上の図１８に例示される言葉の対テーブルの具体例判定結果フィールドへの登録が行われなかった行の言葉の対、例えば、移動の指示とメタレベルタスクが、図２０に示されるデータ構成例を有する言葉対テーブルに登録される。この言葉対テーブルは、ワークメモリに保持される。この言葉対テーブルの各行は、被引用側即ち拒絶理由通知等の中で引用文献特許文書に関連付けてステップＳ４０４にて抽出された言葉Ａ（移動の指示、悪口、板バネ状部材）を登録するフィールドを有する。また、各行は、ステップＳ１９０１及びＳ１９０２で算出される各ヒット件数ＣＡ１及びＣＡ２をそれぞれ登録する各フィールドを有する。更に、各行は、引用側即ち拒絶理由通知等の中で本願特許文書に関連付けてステップＳ４０４にて抽出された言葉Ｂ（メタレベルタスク、個人の評価、肋骨状部分）を登録するフィールドを有する。また、各行は、ステップＳ１９０３及びＳ１９０４で算出される各ヒット件数ＣＢ１及びＣＢ２をそれぞれ登録する各フィールドを有する。そして、各行は、ヒット件数による具体例判定結果を登録するフォールドを有する。このフィールドには、ＣＡ２／ＣＡ１及びＣＢ２／ＣＢ１の各割合に基づいてステップＳ１９０５で判定された結果に基づく言葉が登録される。 At the time of execution of the control operation of FIG. 19 described above, the word pair of the line that was not registered in the specific example determination result field of the word pair table illustrated in FIG. 18 on the work memory in the control operation of FIG. For example, the movement instruction and the meta level task are registered in the word pair table having the data configuration example shown in FIG. This word pair table is held in the work memory. Each row of the word pair table registers the word A (movement instruction, bad mouth, leaf spring-like member) extracted in step S404 in association with the cited document patent document in the cited side, that is, the reason for rejection. Has a field. Each row has fields for registering the numbers of hits CA1 and CA2 calculated in steps S1901 and S1902, respectively. Furthermore, each line has a field for registering the word B (meta-level task, personal evaluation, rib-shaped portion) extracted in step S404 in association with the patent document of the present application in the citation side, that is, the reason for rejection. Each row has fields for registering the hit numbers CB1 and CB2 calculated in steps S1903 and S1904, respectively. Each row has a fold for registering a specific example determination result based on the number of hits. In this field, words based on the result determined in step S1905 based on the ratios of CA2 / CA1 and CB2 / CB1 are registered.

例えば、図２０の２００１及び２００２で示される各行においては、ＣＡ２／ＣＡ１の割合のほうがＣＢ２／ＣＢ１の割合よりも大きいため、言葉Ａフィールドに登録されている言葉（移動の指示、悪口）が、具体例判定結果の登録フィールドに登録される。また、図２０の２００３で示される行においては、ＣＢ２／ＣＢ１の割合のほうがＣＡ２／ＣＡ１の割合よりも大きいため、言葉Ｂ（肋骨状部分）フィールドに登録されている言葉が、具体例判定結果の登録フィールドに登録される。 For example, in each row indicated by 2001 and 2002 in FIG. 20, since the ratio of CA2 / CA1 is larger than the ratio of CB2 / CB1, the words registered in the word A field (movement instructions, bad words) It is registered in the registration field of the specific example determination result. Further, in the row indicated by 2003 in FIG. 20, since the ratio of CB2 / CB1 is larger than the ratio of CA2 / CA1, the word registered in the word B (rib-shaped part) field is a specific example determination result. Registered in the registration field.

以上のようにして、方法１により得られた図１８に例示される言葉の対テーブルの内容と方法２により得られた図２０に例示される言葉の対テーブルの内容とに基づいて、各言葉の対に対応する具体例判定結果が得られる。これらの各言葉の対テーブルに得られた具体例判定結果が、図２３に例示される言葉対データベース１４０２（図１４）に登録される。この結果、図１５のステップＳ１５０１での具体例の判定処理が終了する。 As described above, based on the contents of the word pair table illustrated in FIG. 18 obtained by Method 1 and the contents of the word pair table illustrated in FIG. A specific example determination result corresponding to the pair is obtained. The specific example determination results obtained in these word pair tables are registered in the word pair database 1402 (FIG. 14) illustrated in FIG. As a result, the determination process of the specific example in step S1501 in FIG. 15 ends.

その後、図４の第１の実施形態の場合と同様にして、図１５のステップＳ４０１で得られている表示特許中の特許請求の範囲における請求項において、具体例が付加される対象となる言葉が特定される（ステップＳ４０６）。即ち、図２３に例示される言葉対データベース１４０２に今回新たに登録された各言葉の対において、具体例判定結果フィールド（図２３）に登録された言葉と同じ言葉の各部分が、請求項上で特定される。 Thereafter, in the same manner as in the first embodiment of FIG. 4, in the claims in the claims in the display patent obtained in step S401 of FIG. Is identified (step S406). That is, in each word pair newly registered in the word pair database 1402 exemplified in FIG. 23, each part of the same word as the word registered in the specific example determination result field (FIG. 23) is Specified by

そして、図１１の１１０１として例示されるステップＳ４０６で特定された請求項中の各具体例付加対象の言葉とそれぞれに対応する具体例とが対比されて、図１１の１１０２に例示されるようにして、利用者のコンピュータのディスプレイに表示される（ステップＳ４０７）。 Then, the specific example addition target words in the claim specified in step S406 illustrated as 1101 in FIG. 11 are compared with the specific examples corresponding thereto, and are illustrated in 1102 in FIG. Is displayed on the display of the user's computer (step S407).

図２１は、第２の実施形態における図１５のステップＳ１５０１の具体例の判定処理の制御動作のその他の実施形態の動作フローチャートを示す図である。この実施形態では本願と引用文献に限定されない。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 FIG. 21 is a diagram illustrating an operation flowchart of another embodiment of the control operation of the determination processing of the specific example of step S1501 of FIG. 15 in the second embodiment. This embodiment is not limited to the present application and cited documents. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device.

この制御動作では、言葉の対を構成する２つの言葉について、特許データベース１０１中からそれぞれ独立に件数を求めるのではなく、２つの言葉の両方が１件の中に使われている特許文書の集合が作成される。そして、その集合の中での２つの言葉の出現場所によって、どちらの方がより具体的な言葉かが判定される。 In this control operation, a set of patent documents in which both of two words are used in one case, instead of obtaining the number of two words constituting the word pair independently from the patent database 101. Is created. Then, which one is more specific is determined by the appearance location of the two words in the set.

まず、言葉の対の抽出処理（図１５のステップＳ４０４）にて抽出された言葉の対のデータを構成する言葉Ａと言葉Ｂを共に含む特許文書のファイルが、特許データベース１０１（図１４）から検索され、その結果得られる特許集合が作成され、ワークメモリに保持される（ステップＳ２１０１）。即ち、図２１の２１０１の如くである。 First, a patent document file including both the words A and B constituting the word pair data extracted in the word pair extraction process (step S404 in FIG. 15) is obtained from the patent database 101 (FIG. 14). A patent set obtained as a result of the search is created and stored in the work memory (step S2101). That is, it is as 2101 in FIG.

次に、ワークメモリに保持される件数変数Ｄ１及びＤ２に、共に値ゼロが代入される（ステップＳ２１０２）。
次に、図２１のステップＳ２１０１で作成された特許集合中の各特許文書のファイルについて、ステップＳ２１０２からステップＳ２１０８までの処理が繰り返し実行される。 Next, the value zero is assigned to both the number variables D1 and D2 held in the work memory (step S2102).
Next, the processing from step S2102 to step S2108 is repeatedly executed for each patent document file in the patent set created in step S2101 of FIG.

まず、現在の特許文書のファイルにおいて、特許請求の範囲中の各請求項のセクションと実施例等のセクション中の段落との対応関係が解析される（ステップＳ２１０３）。これは、言葉Ａと言葉Ｂを、請求項とそれに対応する実施例の中から選択するためである。 First, in the current patent document file, the correspondence relationship between the section of each claim in the claims and the paragraph in the section of the embodiment or the like is analyzed (step S2103). This is because the words A and B are selected from the claims and the corresponding embodiments.

図２２は、図２１のステップＳ２１０３の解析処理の詳細動作を示す動作フローチャートである。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。 FIG. 22 is an operation flowchart showing the detailed operation of the analysis processing in step S2103 of FIG. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device.

まず、現在の特許文書のファイルから、テキスト情報が読み込まれる（ステップＳ２２０１）。
次に、ステップＳ２２０１で読み込まれたテキスト文書のデータが、墨付括弧タグにより、セクションごとに分割される（ステップＳ２２０２）。 First, text information is read from the current patent document file (step S2201).
Next, the data of the text document read in step S2201 is divided for each section by a black bracket tag (step S2202).

次に、ステップＳ２２０２で分割されたセクションのうち、請求項のセクションの文書のテキストデータ部分が、請求項の各項毎に分割される（ステップＳ２２０３）。
次に、ステップＳ２２０３で分割された請求項の各項から、キーワードデータが抽出される（ステップＳ２２０４）。キーワードデータは、請求子とそれに対応する実施例を見つけるためのものであり、請求項の各項のテキストデータに対して、例えば形態素解析処理を実行することで抽出できる。 Next, among the sections divided in step S2202, the text data portion of the document in the claim section is divided for each item in the claim (step S2203).
Next, keyword data is extracted from each item of the claim divided in step S2203 (step S2204). The keyword data is for finding a biller and an example corresponding to it, and can be extracted by executing, for example, a morphological analysis process on the text data of each item of the claim.

次に、ステップＳ２２０１で読み込まれた実施例等のセクションのテキスト文書のデータが、墨付括弧タグにより、段落ごとに分割される（ステップＳ２２０５）。
次に、ステップＳ２２０５で分割された実施例等の各段落部分から、キーワードが抽出される（ステップＳ２２０６）。キーワードは、実施例等の各段落部分のテキストデータに対して、例えば形態素解析処理を実行することで抽出できる。 Next, the data of the text document in the section of the embodiment or the like read in step S2201 is divided into paragraphs by using a bracket with a black bracket (step S2205).
Next, keywords are extracted from each paragraph part of the embodiment or the like divided in step S2205 (step S2206). The keyword can be extracted by executing, for example, a morphological analysis process on the text data of each paragraph part in the embodiment or the like.

最後にステップＳ２２０３で分割された請求項の各項に対応するデータに対し、ステップＳ２２０４で抽出されたキーワードと最も類似するステップＳ２２０６によるキーワードが含まれるステップＳ２２０５で分割された実施例等の段落のデータが対応付けられる（ステップＳ２２０７）。 Finally, for the data corresponding to each item of the claim divided in step S2203, the keyword in step S2205 including the keyword in step S2206 that is most similar to the keyword extracted in step S2204 Data is associated (step S2207).

以上の図２２の動作フローチャートの処理として、図２１のステップＳ２１０３の処理が実行される。
図２１の動作フローチャートに戻り、ステップＳ２１０３の処理の後、現在の特許文書において、言葉Ａが請求項で使われており、かつ、この請求項に対応する実施例等の段落に言葉Ｂが使われているか否かが判定される（ステップＳ２１０４）。なお、請求項に対応する実施例等の段落は、ステップＳ２１０３によって得られたものが選択される。 The process of step S2103 of FIG. 21 is executed as the process of the operation flowchart of FIG.
Returning to the operation flowchart of FIG. 21, after the processing of step S2103, the word A is used in the claims in the current patent document, and the word B is used in the paragraph of the embodiment corresponding to the claims. It is determined whether or not it has been interrupted (step S2104). The paragraphs such as the example corresponding to the claims are selected in step S2103.

ステップＳ２１０４の判定がＹＥＳならばワークメモリ上の件数変数Ｄ１がインクリメントされ、ＮＯならば件数変数Ｄ１はインクリメントされない（ステップＳ２１０５）。
次に、現在の特許文書において、言葉Ｂが請求項で使われており、かつ、この請求項に対応する実施例等に言葉Ａが使われているか否かが判定される（ステップＳ２１０６）。この場合も、請求項に対応する実施例等の段落は、ステップＳ２１０３によって得られたものが選択される。 If the determination in step S2104 is YES, the number variable D1 on the work memory is incremented, and if NO, the number variable D1 is not incremented (step S2105).
Next, in the current patent document, it is determined whether or not the word B is used in the claim and the word A is used in the embodiment corresponding to the claim (step S2106). In this case as well, paragraphs such as examples corresponding to the claims are selected in step S2103.

ステップＳ２１０６の判定がＹＥＳならば件数変数Ｄ２がインクリメントされ、ＮＯならば件数変数Ｄ２はインクリメントされない（ステップＳ２１０７）。
その後、ステップＳ２１０１で作成された集合内の全ての特許文書のファイルが処理されたか否かが判定される（ステップＳ２１０８）。 If the determination in step S2106 is YES, the number variable D2 is incremented. If NO, the number variable D2 is not incremented (step S2107).
Thereafter, it is determined whether all patent document files in the set created in step S2101 have been processed (step S2108).

ステップＳ２１０８の判定がＮＯならば、ステップＳ２１０３に戻って、処理が繰り返される。
ステップＳ２１０８の判定がＹＥＳになったなら最後に、件数変数Ｄ１が件数変数Ｄ２よりも多いか否かが判定される（ステップＳ２１０９）。 If determination of step S2108 is NO, it will return to step S2103 and a process will be repeated.
If the determination in step S2108 is YES, it is finally determined whether or not the number variable D1 is larger than the number variable D2 (step S2109).

互いに対応する請求項と実施例等の段落で、言葉Ａが請求項側で使われ言葉Ｂが実施例等の側で使われる件数のほうが、言葉Ｂが請求項側で使われ言葉Ａが実施例等の側で使われる件数のより多ければ、言葉Ｂが具体例であると認定される（ステップＳ２０１０）。 In the paragraphs of claims and examples etc. corresponding to each other, the word A is used on the claims side and the word B is used on the side of the examples etc., the word B is used on the claims side and the word A is executed If the number of cases used on the example side is larger, the word B is recognized as a specific example (step S2010).

逆に、言葉Ｂが請求項側で使われ言葉Ａが実施例等の側で使われる件数が、言葉Ａが請求項側で使われ言葉Ｂが実施例等の側で使われる件数以上であれば、言葉Ａが具体例であると認定される（ステップＳ２０１１）。 Conversely, the number of cases where the word B is used on the claim side and the word A is used on the side of the embodiment etc. is greater than the number of cases where the word A is used on the claim side and the word B is used on the side of the embodiment. For example, the word A is recognized as a specific example (step S2011).

以上のようにして図２１及び図２２の動作フローチャートによって各言葉の対に対応する具体例判定結果が得られたら、これらの各言葉の対テーブルに得られた具体例判定結果が、図２３に例示される言葉対データベース１４０２（図１４）に登録される。この結果、図１５のステップＳ１５０１での具体例の判定処理が終了する。 When the specific example determination result corresponding to each word pair is obtained by the operation flowchart of FIGS. 21 and 22 as described above, the specific example determination result obtained in the pair table of each word is shown in FIG. It is registered in the illustrated word pair database 1402 (FIG. 14). As a result, the determination process of the specific example in step S1501 in FIG. 15 ends.

その後の処理は前述した通りである。
以上説明した第２の実施形態によれば、特許文書中で言葉の対を構成する２つの言葉がどのように出現するかを解析することにより、どちらの言葉が具体例に相当するかを的確に判定することが可能となる。 Subsequent processing is as described above.
According to the second embodiment described above, it is possible to accurately determine which word corresponds to a specific example by analyzing how two words constituting a word pair appear in a patent document. Can be determined.

図２４Ａ及び図２４Ｂは、特許文書中の単語または単語の組み合わせの例示装置の第３の実施形態の構成図である。図２４Ａ及び図２４Ｂにおいて、図１４に示される第２の実施形態と同じ処理を行う部分には、同じ番号が付されている。 24A and 24B are configuration diagrams of a third embodiment of an example device for words or combinations of words in a patent document. In FIG. 24A and FIG. 24B, the same number is attached | subjected to the part which performs the same process as 2nd Embodiment shown by FIG.

図１４に示される第２の実施形態では、利用者により指定された特許文書に対して、言葉対抽出部１０６で言葉の対が抽出される。更に、具体例判定部１４０１で、言葉の対を構成する２つの言葉のうち具体例となる言葉が判定されながら、請求項中の言葉の具体例が例示される。この場合、言葉の対は、それを構成する２つの言葉のうちどちらが具体例であるかを示す情報と共に、図２３に例示される言葉対データベース１４０２として蓄積することができる。 In the second embodiment shown in FIG. 14, word pairs are extracted by the word pair extraction unit 106 for the patent document designated by the user. Furthermore, specific examples of the words in the claims are exemplified while the specific example determination unit 1401 determines a specific example of the two words constituting the word pair. In this case, word pairs can be stored as a word pair database 1402 illustrated in FIG. 23 together with information indicating which of the two words constituting the word is a specific example.

そこで、第３の実施形態では、言葉対データベース１４０２を予め蓄積する構成と、利用者による特許文書の指定時には、言葉対データベース１４０２から言葉の対及び具体例を検索しながら例示する構成とが独立して実現される。、
まず、図２４Ａの構成は、図１５の構成に比較して、具体例付加対象特定部１０８及び表示部１０９が無い。そして、この構成の機能を実現する制御動作は、図２５の動作フローチャートによって示される。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。図２５において、図１５に示される第２の実施形態と同じ処理を行う部分には、同じ番号が付されている。 Therefore, in the third embodiment, the configuration in which the word pair database 1402 is stored in advance and the configuration illustrated while searching for word pairs and specific examples from the word pair database 1402 when a user specifies a patent document are independent. And realized. ,
First, the configuration of FIG. 24A does not include the specific example addition target specifying unit 108 and the display unit 109 as compared to the configuration of FIG. The control operation for realizing the function of this configuration is shown by the operation flowchart in FIG. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device. In FIG. 25, the same number is attached | subjected to the part which performs the same process as 2nd Embodiment shown by FIG.

図２５の動作フローチャートでは、ステップＳ４０２からＳ１５０１までの一連の処理は、図１５の場合と同様である。
図２５は、ステップＳ２５０１で特許データベース１０１から最初の特許文書が読み込まれた後、ステップＳ２５０３で全ての特許文書が処理されたと判定されるまで、ステップＳ２５０４で次の特許文書が特許データベース１０１から順次読み込まれる。そして、各特許文書のファイルに対して、ステップＳ４０２からＳ１５０１までの一連の処理によって、第２の実施形態の場合と同様にして、言葉の対の抽出と具体例の判定処理が実行される。そして、言葉の対の抽出と具体例の判定結果が、ステップＳ２５０２において、図２３に例示される構成を有する言葉対データベース１４０２に保存される。 In the operation flowchart of FIG. 25, a series of processing from step S402 to S1501 is the same as in the case of FIG.
In FIG. 25, after the first patent document is read from the patent database 101 in step S2501, the next patent document is sequentially read from the patent database 101 in step S2504 until it is determined in step S2503 that all patent documents have been processed. Is read. Then, word pair extraction and specific example determination processing are executed for each patent document file by a series of processing from step S402 to S1501, as in the case of the second embodiment. Then, the word pair extraction and the determination result of the specific example are stored in the word pair database 1402 having the configuration illustrated in FIG. 23 in step S2502.

以上の処理により、特許データベース１０１中の特許文書のファイルに対して、請求項中に出現する言葉とそれに対応する具体的な言葉の対を、言葉対データベース１４０２として予め蓄積することができる。 As a result of the above processing, a pair of words appearing in claims and specific words corresponding thereto can be stored in advance as a word pair database 1402 in the patent document file in the patent database 101.

図２４Ｂは、利用者により指定された特許文書に対して、言葉対データベース１４０２から言葉の対及び具体例を検索しながら例示する第３の実施形態の構成である。
図２４Ｂにおいて、特許データベース１０１、特許文書検索部１０２、及び出願番号抽出部１０３は、第１の実施形態（図１）又は第２の実施形態（図１４）におけるものと同じである。即ち、利用者により、特許文書が指定されるとその特許文書が特許データベース１０１から検索され、出願番号が抽出される。 FIG. 24B is a configuration of the third embodiment that illustrates a patent document designated by the user while searching for word pairs and specific examples from the word pair database 1402.
In FIG. 24B, the patent database 101, the patent document search unit 102, and the application number extraction unit 103 are the same as those in the first embodiment (FIG. 1) or the second embodiment (FIG. 14). That is, when a patent document is designated by the user, the patent document is searched from the patent database 101, and the application number is extracted.

次に、言葉対検索部２４０１は、指定された特許文書の請求項中から言葉を抽出し、その言葉に対応する具体例を言葉対データベース１４０２から検索する。
具体例付加対象特定部１０８及び表示部１０９の機能は、第１の実施形態（図１）又は第２の実施形態（図１４）におけるものと同じである。 Next, the word pair search unit 2401 extracts words from the claims of the designated patent document, and searches the word pair database 1402 for specific examples corresponding to the words.
The functions of the specific example addition target specifying unit 108 and the display unit 109 are the same as those in the first embodiment (FIG. 1) or the second embodiment (FIG. 14).

図２４Ｂの構成の機能を実現する制御動作は、図２６の動作フローチャートによって示される。この動作フローチャートにおける各処理は、例えばＣＰＵとメモリと外部記憶装置を搭載したコンピュータシステムによって実行される。図２６において、ステップＳ４０１とステップＳ４０２の処理は、図４に示される第１の実施形態又は図１５に示される第２の実施形態の場合と同じである。即ち、特許データベース１０１（図１）が検索されることにより、利用者によって指定された特許文書（表示特許）が検索される（ステップＳ４０１）。次に、ステップＳ４０１で検索された特許文書から出願番号が抽出される（ステップＳ４０２）。 The control operation for realizing the function of the configuration of FIG. 24B is shown by the operation flowchart of FIG. Each process in the operation flowchart is executed by, for example, a computer system including a CPU, a memory, and an external storage device. In FIG. 26, the processing in step S401 and step S402 is the same as that in the first embodiment shown in FIG. 4 or the second embodiment shown in FIG. That is, by searching the patent database 101 (FIG. 1), a patent document (displayed patent) designated by the user is searched (step S401). Next, the application number is extracted from the patent document searched in step S401 (step S402).

次に、図２３に例示されるデータ構成例を有する言葉対データベース１４０２がアクセスされ、ステップＳ４０２で抽出された出願番号によって引用元情報フィールド及び引用先情報フィールドが検索される。そして、レコードが抽出できた場合には、そのレコードの具体例判定結果フィールドの言葉とその言葉と対になる言葉が抽出される（以上、ステップＳ２６０１）。 Next, the word pair database 1402 having the data configuration example illustrated in FIG. 23 is accessed, and the citation source information field and the citation destination information field are searched based on the application number extracted in step S402. When the record can be extracted, the words in the specific example determination result field of the record and the words that are paired with the words are extracted (step S2601).

次に、ステップＳ２６０１にて具体例が見つかったか否かが判定される（ステップＳ２６０２）。
以上のステップＳ２６０１とステップＳ２６０２の処理は、図２４の言葉対検索部２４０１の機能を実現する。 Next, it is determined whether a specific example has been found in step S2601 (step S2602).
The processes in steps S2601 and S2602 described above realize the function of the word pair search unit 2401 in FIG.

ステップＳ２６０２の判定がＹＥＳならば、ステップＳ４０１で得られている表示特許中の特許請求の範囲における請求項において、具体例が付加される対象となる言葉が特定される（ステップＳ４０６）。そして、図８の８０１として例示されるステップＳ４０６で特定された請求項中の各具体例付加対象の言葉とそれぞれに対応する具体例とが対比されて、図１１の１１０２に例示されるようにして、利用者のコンピュータのディスプレイに表示される（ステップＳ４０７）。 If the determination in step S2602 is YES, the word to which a specific example is to be added is specified in the claims in the claims in the displayed patent obtained in step S401 (step S406). Then, the specific example addition target words in the claim specified in step S406 illustrated as 801 in FIG. 8 are compared with the specific examples corresponding to each, as illustrated in 1102 in FIG. Is displayed on the display of the user's computer (step S407).

以上の第３の実施形態により、請求項中の言葉の具体例を例示するための言葉対データベース１４０２を予め作成しておくことが可能となる。この結果、利用者による特許文書の指定時には、その特許文書中の請求項中の言葉の具体例の提示を、言葉対データベース１４０２を参照して高速に実行することが可能となる。 According to the third embodiment described above, it is possible to previously create a word pair database 1402 for illustrating specific examples of words in claims. As a result, when a user designates a patent document, specific examples of words in claims in the patent document can be presented at high speed with reference to the word pair database 1402.

図２７は、上述の各実施形態のシステムを実現できるコンピュータのハードウェア構成の一例を示す図である。
図２７に示されるコンピュータは、ＣＰＵ３００１、メモリ３００２、入出力装置３００３、出力装置３００４、外部記憶装置３００５、可搬記録媒体３００９が挿入される可搬記録媒体駆動装置３００６、及び通信ネットワーク３００７を有し、これらがバス３００８によって相互に接続された構成を有する。同図に示される構成は上記システムを実現できるコンピュータの一例であり、そのようなコンピュータはこの構成に限定されるものではない。 FIG. 27 is a diagram illustrating an example of a hardware configuration of a computer that can realize the system according to each of the above-described embodiments.
A computer shown in FIG. 27 includes a CPU 3001, a memory 3002, an input / output device 3003, an output device 3004, an external storage device 3005, a portable recording medium driving device 3006 into which a portable recording medium 3009 is inserted, and a communication network 3007. These have a configuration in which they are connected to each other by a bus 3008. The configuration shown in the figure is an example of a computer that can implement the above system, and such a computer is not limited to this configuration.

ＣＰＵ３００１は、当該コンピュータ全体の制御を行う。メモリ３００２は、プログラムの実行、データ更新等の際に、外部記憶装置３００５（或いは可搬記録媒体３００９）に記憶されているプログラム又はデータを一時的に格納するＲＡＭ等のメモリである。ＣＵＰ３００１は、プログラムをメモリ３００２に読み出して実行することにより、全体の制御を行う。 The CPU 3001 controls the entire computer. The memory 3002 is a memory such as a RAM that temporarily stores a program or data stored in the external storage device 3005 (or the portable recording medium 3009) when executing a program, updating data, or the like. The CUP 3001 performs overall control by reading the program into the memory 3002 and executing it.

入出力装置３００３は、ユーザによるキーボードやマウス等による入力操作を検出し、その検出結果をＣＰＵ３００１に通知し、ＣＰＵ３００１の制御によって送られてくるデータを表示装置や印刷装置に出力する。 The input / output device 3003 detects an input operation performed by a user using a keyboard, a mouse, or the like, notifies the CPU 3001 of the detection result, and outputs data sent under the control of the CPU 3001 to a display device or a printing device.

外部記憶装置３００５は、例えばハードディスク記憶装置である。主に各種データやプログラムの保存に用いられる。
可搬記録媒体駆動装置３００６は、光ディスクやＳＤＲＡＭ、コンパクトフラッシュ（登録商標）等の可搬記録媒体３００９を収容するもので、外部記憶装置３００５の補助の役割を有する。 The external storage device 3005 is, for example, a hard disk storage device. Mainly used for storing various data and programs.
The portable recording medium driving device 3006 accommodates a portable recording medium 3009 such as an optical disk, SDRAM, or Compact Flash (registered trademark), and has an auxiliary role for the external storage device 3005.

通信インターフェース３００７は、例えばＬＡＮ（ローカルエリアネットワーク）又はＷＡＮ（ワイドエリアネットワーク）の通信回線を接続するための装置である。
第１、第２、又は第３の実施形態によるシステムは、各実施形態の機能を実現する図４、図５Ｂ、図１３Ａ、図１３Ｂ、図１５、図１６、図１７、図１９、図２１、図２２、図２５、又は図２６の各動作フローチャートに対応する各制御プログラムを、ＣＰＵ３００１が実行することで実現される。そのプログラムは、例えば外部記憶装置３００５や可搬記録媒体３００９に記録して配布してもよく、或いは通信インターフェース３００７によりネットワークから取得できるようにしてもよい。図１、図１４、図２４Ａ、図２４Ｂ等の特許データベース１０１や拒絶理由データベース１０５、及び表現パターン辞書１１２、図１４，図２４Ａ、図２４Ｂ等の言葉対データベース１３０２などは、例えば外部記憶装置３００５に記憶して運用される。また、メモリ３００２上には必要に応じて、前述したワークメモリの領域が展開される。 The communication interface 3007 is a device for connecting, for example, a LAN (local area network) or WAN (wide area network) communication line.
The system according to the first, second, or third embodiment is configured as shown in FIGS. 4, 5B, 13A, 13B, 15, 16, 17, 17, 19, and 21 for realizing the functions of the embodiments. This is realized by the CPU 3001 executing each control program corresponding to each operation flowchart of FIG. 22, FIG. 25, or FIG. The program may be recorded and distributed in, for example, the external storage device 3005 or the portable recording medium 3009, or may be acquired from the network by the communication interface 3007. The patent database 101 and rejection reason database 105 shown in FIGS. 1, 14, 24A, and 24B, the expression pattern dictionary 112, the word pair database 1302 shown in FIGS. 14, 24A, and 24B, and the like are, for example, an external storage device 3005. It is memorized and operated. In addition, the above-described work memory area is expanded on the memory 3002 as necessary.

以上の第１〜第３の実施形態に関して、更に以下の付記を開示する。
（付記１）
特許文書中の単語または単語の組み合わせの理解を支援するための装置であって、
前記特許文書に含まれる単語または単語の組み合わせの具体例を示す対象となる明細書を特定する情報を受け付ける受け付け部と、
前記明細書に対して示された拒絶の理由を示す文書を記録した拒絶理由データベースから、前記単語または単語の組み合わせの具体例を示す対象となる明細書に対して示された、拒絶の理由を示す文書のテキストデータを、前記明細書を特定する情報に基づいて検索する拒絶理由を示す文書の検索部と、
前記検索された拒絶の理由を示す文書のテキストデータから、あらかじめ記憶された対表現パターンと同じ対表現パターンの文字列を検出し、前記対表現パターンの文字列に基づいて、前記特許文書に記載された文字列と前記文字列に対比して示される文字列とをそれぞれ含む対となる文字列を検出し、前記対となる文字列から不要文字列を削除することにより、前記単語または単語の組み合わせの具体例を示す対象となる明細書のテキストデータに記載された単語または単語の組み合わせの文字列と、前記単語または単語の組み合わせに対比して示される単語または単語の組み合わせの文字列の対を抽出する抽出部と、
前記抽出された対となる単語または単語の組み合わせの文字列の組を出力する出力部と、
を具備することを特徴とする特許文書中の単語または単語の組み合わせの例示装置。
（付記２）
前記抽出部は、
文字列パターンマッチングの処理と、
文字列における位置検出の処理と、
不要文字辞書とのパターンマッチングと係り受け解析に基づく不要文字列の削除の処理と、
を実行することを特徴とする付記１に記載の特許文書中の単語または単語の組み合わせの例示装置。
（付記３）
前記抽出部は、
前記拒絶の理由を通知する文書に前記対表現パターンの文字列が存在した場合、本願特定パターンと引用文献特定パターンを検出し、これらのパターンの前または後から前記対となる文字列を特定し、あるいは、
文字列パターン「には」に続いて、第１の文字列、文字列パターン「に相当する」、第２の文字列の順に文字列が存在する場合には、前記第１の文字列と前記第２の文字列を前記対となる文字列パターンとして抽出する、
ことを特徴とする付記１に記載の特許文書中の単語または単語の組み合わせの例示装置。
（付記４）
前記不要文字列を削除する処理は、
品詞と文字列を要素とする形態素の辞書と、品詞の接続確率を定義したテーブルを用いて、最も確率が高い形態素列を選択する形態素解析処理と、
文節の接続ルールによって形態素を接続して文節を生成する文節分離処理と、
文節の接続確率と制約ルールによって、最も確率が高い係り受け先を選択する係り受け解析処理とからなる、
ことを特徴とする付記１に記載の特許文書中の単語または単語の組み合わせの例示装置。
（付記５）
前記拒絶の理由を示す文書の記載において引用されている引用文献の明細書のテキストデータと拒絶の対象とされている明細書のテキストデータとにおいて、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせに対応する文字列データの該各明細書のテキストデータ内での出現位置を特定することにより、該対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定する具体例判定部を更に含む、
ことを特徴とする付記１に記載の特許文書中の単語または単語の組み合わせの例示装置。
（付記６）
前記具体例判定部は、前記出現位置を特定するだけでは前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定できない場合に、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせに対応する文字列で明細書のデータを蓄積した特許データベースを検索し、前記明細書の請求項中及び実施例中での前記各単語または単語の組み合わせの出現数に基づいて、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定する、
ことを特徴とする付記５に記載の特許文書中の単語または単語の組み合わせの例示装置。
（付記７）
特許文書中の単語または単語の組み合わせの理解を支援するための装置であって、
前記特許文書に含まれる単語または単語の組み合わせの具体例を示す対象となる明細書のテキストデータを、前記明細書を蓄積したデータベースから順次入力する入力部と、
前記明細書に対して示された拒絶の理由を示す文書を記録した拒絶理由データベースから、前記単語または単語の組み合わせの具体例を示す対象となる明細書に対して示された、拒絶の理由を示す文書のテキストデータを、前記明細書を特定する情報に基づいて検索する拒絶理由を示す文書の検索部と、
前記検索された拒絶の理由を示す文書のテキストデータから、あらかじめ記憶された対表現パターンと同じ対表現パターンの文字列を検出し、前記対表現パターンの文字列に基づいて、前記特許文書に記載された文字列と前記文字列に対比して示される文字列とをそれぞれ含む対となる文字列を検出し、前記対となる文字列から不要文字列を削除することにより、前記単語または単語の組み合わせの具体例を示す対象となる明細書のテキストデータに記載された単語または単語の組み合わせの文字列と前記単語または単語の組み合わせに対比して示される単語または単語の組み合わせの文字列の対を抽出する抽出部と、
前記抽出された対となる単語または単語の組み合わせの文字列の組を蓄積する単語または単語の組み合わせ対データベースと、
を含むことを特徴とする特許文書中の単語または単語の組み合わせの例示装置。
（付記８）
付記７に記載の単語または単語の組み合わせ対データベースを使って特許文書中の単語または単語の組み合わせの理解を支援するための装置であって、
前記特許文書に含まれる単語または単語の組み合わせの具体例を示す対象となる明細書を特定する情報を受け付ける指示部と、
前記特許文書中で前記単語または単語の組み合わせ対データベースに含まれる単語または単語の組み合わせを検索することにより、前記特許文書に含まれる単語または単語の組み合わせの文字列と対となる前記単語または単語の組み合わせの具体例を示す単語または単語の組み合わせの文字列を検索する検索部と、
前記検索された対となる単語または単語の組み合わせの文字列の組を出力する出力部と、
を含むことを特徴とする特許文書中の単語または単語の組み合わせの例示装置。
（付記９）
特許文書中の単語または単語の組み合わせの理解を支援するための装置に、
前記特許文書に含まれる単語または単語の組み合わせの具体例を示す対象となる明細書を特定する情報を受け付け、
前記明細書に対して示された拒絶の理由を示す文書を記録した拒絶理由データベースから、前記単語または単語の組み合わせの具体例を示す対象となる明細書に対して示された、拒絶の理由を示す文書のテキストデータを、前記明細書を特定する情報に基づいて検索し、
前記検索された拒絶の理由を示す文書のテキストデータから、あらかじめ記憶された対表現パターンと同じ対表現パターンの文字列を検出し、前記対表現パターンの文字列に基づいて、前記特許文書に記載された文字列と前記文字列に対比して示される文字列とをそれぞれ含む対となる文字列を検出し、前記対となる文字列から不要文字列を削除することにより、前記単語または単語の組み合わせの具体例を示す対象となる明細書のテキストデータに記載された単語または単語の組み合わせの文字列と、前記単語または単語の組み合わせに対比して示される単語または単語の組み合わせの文字列の対を抽出し、
前記抽出された対となる単語または単語の組み合わせの文字列の組を出力する、
機能を実行させるためのプログラム。
（付記１０）
前記抽出の処理は、
文字列パターンマッチングの処理と、
文字列における位置検出の処理と、
不要文字辞書とのパターンマッチングと係り受け解析に基づく不要文字列の削除の処理と、
を含むことを特徴とする付記９に記載のプログラム。
（付記１１）
前記抽出の処理は、
前記拒絶の理由を通知する文書に前記対表現パターンの文字列が存在した場合、本願特定パターンと引用文献特定パターンを検出し、これらのパターンの前または後から前記対となる文字列を特定し、あるいは、
文字列パターン「には」に続いて、第１の文字列、文字列パターン「に相当する」、第２の文字列の順に文字列が存在する場合には、前記第１の文字列と前記第２の文字列を前記対となる文字列パターンとして抽出する、
処理であることを特徴とする付記９に記載のプログラム。
（付記１２）
前記不要文字列を削除する処理は、
品詞と文字列を要素とする形態素の辞書と、品詞の接続確率を定義したテーブルを用いて、最も確率が高い形態素列を選択する形態素解析処理と、
文節の接続ルールによって形態素を接続して文節を生成する文節分離処理と、
文節の接続確率と制約ルールによって、最も確率が高い係り受け先を選択する係り受け解析処理とからなる、
ことを特徴とする付記９に記載のプログラム。
（付記１３）
前記拒絶の理由を示す文書の記載において引用されている引用文献の明細書のテキストデータと拒絶の対象とされている明細書のテキストデータとにおいて、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせに対応する文字列データの該各明細書のテキストデータ内での出現位置を特定することにより、該対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定する機能を更に実行する、
ことを特徴とする付記９に記載のプログラム。
（付記１４）
前記具体例の判定は、前記出現位置を特定するだけでは前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定できない場合に、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせに対応する文字列で明細書のデータを蓄積した特許データベースを検索し、前記明細書の請求項中及び実施例中での前記各単語または単語の組み合わせの出現数に基づいて、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定する処理である、
ことを特徴とする付記１３に記載のプログラム。
（付記１５）
コンピュータが特許文書中の単語または単語の組み合わせの理解を支援するための方法であって、
前記特許文書に含まれる単語または単語の組み合わせの具体例を示す対象となる明細書を特定する情報を受け付け、
前記明細書に対して示された拒絶の理由を示す文書を記録した拒絶理由データベースから、前記単語または単語の組み合わせの具体例を示す対象となる明細書に対して示された、拒絶の理由を示す文書のテキストデータを、前記明細書を特定する情報に基づいて検索し、
前記検索された拒絶の理由を示す文書のテキストデータから、あらかじめ記憶された対表現パターンと同じ対表現パターンの文字列を検出し、前記対表現パターンの文字列に基づいて、前記特許文書に記載された文字列と前記文字列に対比して示される文字列とをそれぞれ含む対となる文字列を検出し、前記対となる文字列から不要文字列を削除することにより、前記単語または単語の組み合わせの具体例を示す対象となる明細書のテキストデータに記載された単語または単語の組み合わせの文字列と、前記単語または単語の組み合わせに対比して示される単語または単語の組み合わせの文字列の対を抽出し、
前記抽出された対となる単語または単語の組み合わせの文字列の組を出力する、
ことを前記コンピュータが実行をすることを特徴とする特許文書中の単語または単語の組み合わせの例示方法。
（付記１６）
前記抽出の処理は、
文字列パターンマッチングの処理と、
文字列における位置検出の処理と、
不要文字辞書とのパターンマッチングと係り受け解析に基づく不要文字列の削除の処理と、
を含むことを特徴とする付記１５に記載の特許文書中の単語または単語の組み合わせの例示方法。
（付記１７）
前記抽出の処理は、
前記拒絶の理由を通知する文書に前記対表現パターンの文字列が存在した場合、本願特定パターンと引用文献特定パターンを検出し、これらのパターンの前または後から前記対となる文字列を特定し、あるいは、
文字列パターン「には」に続いて、第１の文字列、文字列パターン「に相当する」、第２の文字列の順に文字列が存在する場合には、前記第１の文字列と前記第２の文字列を前記対となる文字列パターンとして抽出する、
処理であることを特徴とする付記１５に記載の特許文書中の単語または単語の組み合わせの例示方法。
（付記１８）
前記不要文字列を削除する処理は、
品詞と文字列を要素とする形態素の辞書と、品詞の接続確率を定義したテーブルを用いて、最も確率が高い形態素列を選択する形態素解析処理と、
文節の接続ルールによって形態素を接続して文節を生成する文節分離処理と、
文節の接続確率と制約ルールによって、最も確率が高い係り受け先を選択する係り受け解析処理とからなる、
ことを特徴とする付記１５に記載の特許文書中の単語または単語の組み合わせの例示方法。
（付記１９）
前記拒絶の理由を示す文書の記載において引用されている引用文献の明細書のテキストデータと拒絶の対象とされている明細書のテキストデータとにおいて、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせに対応する文字列データの該各明細書のテキストデータ内での出現位置を特定することにより、該対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを前記コンピュータが更に判定する、
ことを特徴とする付記１５に記載の特許文書中の単語または単語の組み合わせの例示方法。
（付記２０）
前記具体例の判定は、前記出現位置を特定するだけでは前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定できない場合に、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせに対応する文字列で明細書のデータを蓄積した特許データベースを検索し、前記明細書の請求項中及び実施例中での前記各単語または単語の組み合わせの出現数に基づいて、前記対となる単語または単語の組み合わせの組を構成する各単語または単語の組み合わせのどちらが具体例を示す単語または単語の組み合わせであるかを判定する処理である、
ことを特徴とする付記１９に記載の特許文書中の単語または単語の組み合わせの例示方法。
（付記２１）
特許文書中の単語または単語の組み合わせの理解を支援するための装置であって、
前記特許文書を含む明細書を特定する情報を受け付ける受け付け部と、
拒絶の理由を示す文書を記録した拒絶理由データベースから、前記明細書に対して示された、拒絶の理由を示す文書のテキストを、前記明細書を特定する情報に基づいて検索する拒絶理由を示す文書の検索部と、
前記検索された拒絶の理由を示す文書のテキストから、前記特許文書に記載された単語または単語の組み合わせ文字列と前記明細書以外の文献に記載され、該単語または単語の組み合わせに対比して示される単語または単語の組み合わせの文字列を抽出する抽出部と、
該抽出された対となる単語または単語の組み合わせデータを出力する出力部と、
を具備することを特徴とする特許文書中の単語または単語の組み合わせの例示装置。 Regarding the above first to third embodiments, the following additional notes are further disclosed.
(Appendix 1)
A device for assisting in understanding a word or a combination of words in a patent document,
A receiving unit that receives information for specifying a specification that is a target indicating a specific example of a word or a combination of words included in the patent document;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. A document search unit indicating a reason for refusal to search text data of a document to be displayed based on information specifying the specification;
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A pair of a character string of a word or a combination of words described in text data of a specification to be used as a specific example of a combination and a character string of a word or a combination of words shown in contrast to the word or the combination of words An extraction unit for extracting
An output unit for outputting a pair of character strings of the extracted paired words or word combinations;
A device for exemplifying a word or a combination of words in a patent document.
(Appendix 2)
The extraction unit includes:
String pattern matching processing and
Processing for position detection in strings,
Unnecessary character string deletion processing based on pattern matching with dependency character dictionary and dependency analysis,
The apparatus for exemplifying a word or a combination of words in the patent document according to Supplementary Note 1, wherein:
(Appendix 3)
The extraction unit includes:
When the character string of the pair expression pattern exists in the document notifying the reason for rejection, the application specific pattern and the cited document specific pattern are detected, and the paired character string is specified before or after these patterns. Or
If there is a character string in the order of the first character string, the character string pattern “corresponds to”, and the second character string following the character string pattern “in”, the first character string and the character string pattern Extracting a second character string as the paired character string pattern;
The example apparatus of the word in the patent document of Claim 1, or the combination of a word characterized by the above-mentioned.
(Appendix 4)
The process of deleting the unnecessary character string is as follows:
A morpheme analysis process that selects a morpheme string having the highest probability, using a dictionary of morphemes whose elements are parts of speech and character strings, and a table defining connection probabilities of parts of speech,
Clause separation processing for generating clauses by connecting morphemes according to clause connection rules;
It consists of dependency analysis processing that selects the dependency destination with the highest probability according to the connection probability of the clause and the constraint rule.
The example apparatus of the word in the patent document of Claim 1, or the combination of a word characterized by the above-mentioned.
(Appendix 5)
In the text data of the specification of the cited reference cited in the description of the document indicating the reason for refusal and the text data of the specification to be rejected, the pair of words or combinations of words By identifying the appearance position of the character string data corresponding to each constituting word or combination of words in the text data of each specification, each word constituting the paired word or combination of words or A specific example determination unit that determines which of the word combinations is a word indicating a specific example or a combination of words;
The example apparatus of the word in the patent document of Claim 1, or the combination of a word characterized by the above-mentioned.
(Appendix 6)
The specific example determination unit determines which one of the words or word combinations constituting the pair of words or word combinations that is the pair only by specifying the appearance position is a word or combination of words indicating a specific example. If the determination is not possible, the patent database storing the specification data with the character string corresponding to each word or combination of words constituting the pair of words or combinations of words is searched, and the claim of the specification Which word or word combination constitutes a specific example based on the number of occurrences of each word or word combination in the embodiment and in the embodiment, which word or word combination constitutes the paired word or word combination set To determine if it is a combination of
The example apparatus of the word in the patent document of Claim 5, or the combination of a word characterized by the above-mentioned.
(Appendix 7)
A device for assisting in understanding a word or a combination of words in a patent document,
An input unit for sequentially inputting text data of a specification to be used as a specific example of a word or a combination of words included in the patent document from a database storing the specification;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. A document search unit indicating a reason for refusal to search text data of a document to be displayed based on information specifying the specification;
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A character string pair of a word or a combination of words shown in contrast to the word or the combination of words described in the text data of the description of the specification to be a specific example of the combination An extractor for extracting;
A word or word combination pair database that stores a set of character strings of the extracted pairs of words or word combinations;
An example device for a word or a combination of words in a patent document characterized by comprising:
(Appendix 8)
An apparatus for assisting in understanding a word or a combination of words in a patent document using the word or word combination pair database according to appendix 7,
An instruction unit that receives information for specifying a specification that is a target indicating a specific example of a word or a combination of words included in the patent document;
By searching for the word or word combination contained in the word or word combination pair database in the patent document, the word or word combination included in the patent document is paired with the word or word combination character string. A search unit for searching for a word or a combination of words indicating a specific example of the combination;
An output unit for outputting a set of character strings of the searched word or word combination;
An example device for a word or a combination of words in a patent document characterized by comprising:
(Appendix 9)
A device to help understand words or combinations of words in patent documents,
Receiving information identifying a specification that is a target of a specific example of a word or a combination of words included in the patent document;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. Search the text data of the document to be shown based on the information specifying the specification,
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A pair of a character string of a word or a combination of words described in text data of a specification to be used as a specific example of a combination and a character string of a word or a combination of words shown in contrast to the word or the combination of words Extract
Outputting a set of character strings of the extracted paired words or word combinations;
A program for executing functions.
(Appendix 10)
The extraction process is as follows:
String pattern matching processing and
Processing for position detection in strings,
Unnecessary character string deletion processing based on pattern matching with dependency character dictionary and dependency analysis,
The program according to appendix 9, characterized by including:
(Appendix 11)
The extraction process is as follows:
When the character string of the pair expression pattern exists in the document notifying the reason for rejection, the application specific pattern and the cited document specific pattern are detected, and the paired character string is specified before or after these patterns. Or
If there is a character string in the order of the first character string, the character string pattern “corresponds to”, and the second character string following the character string pattern “in”, the first character string and the character string pattern Extracting a second character string as the paired character string pattern;
The program according to appendix 9, which is a process.
(Appendix 12)
The process of deleting the unnecessary character string is as follows:
A morpheme analysis process that selects a morpheme string having the highest probability, using a dictionary of morphemes whose elements are parts of speech and character strings, and a table defining connection probabilities of parts of speech,
Clause separation processing for generating clauses by connecting morphemes according to clause connection rules;
It consists of dependency analysis processing that selects the dependency destination with the highest probability according to the connection probability of the clause and the constraint rule.
The program according to appendix 9, characterized by:
(Appendix 13)
In the text data of the specification of the cited reference cited in the description of the document indicating the reason for refusal and the text data of the specification to be rejected, the pair of words or combinations of words By identifying the appearance position of the character string data corresponding to each constituting word or combination of words in the text data of each specification, each word constituting the paired word or combination of words or Further executing the function of determining which of the word combinations is a specific example word or a word combination;
The program according to appendix 9, characterized by:
(Appendix 14)
In the determination of the specific example, only by specifying the appearance position, which word or combination of words constituting the pair of words or combination of words is a word or combination of words indicating a specific example. If the determination is not possible, the patent database storing the specification data with the character string corresponding to each word or combination of words constituting the pair of words or combinations of words is searched, and the claim of the specification Which word or word combination constitutes a specific example based on the number of occurrences of each word or word combination in the embodiment and in the embodiment, which word or word combination constitutes the paired word or word combination set Is a process for determining whether the combination is
The program according to appendix 13, characterized by:
(Appendix 15)
A method for a computer to assist in understanding a word or combination of words in a patent document,
Receiving information identifying a specification that is a target of a specific example of a word or a combination of words included in the patent document;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. Search the text data of the document to be shown based on the information specifying the specification,
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A pair of a character string of a word or a combination of words described in text data of a specification to be used as a specific example of a combination and a character string of a word or a combination of words shown in contrast to the word or the combination of words Extract
Outputting a set of character strings of the extracted paired words or word combinations;
An example method of a word or a combination of words in a patent document, wherein the computer executes the above.
(Appendix 16)
The extraction process is as follows:
String pattern matching processing and
Processing for position detection in strings,
Unnecessary character string deletion processing based on pattern matching with dependency character dictionary and dependency analysis,
The method for illustrating a word or a combination of words in the patent document according to supplementary note 15, characterized by including:
(Appendix 17)
The extraction process is as follows:
When the character string of the pair expression pattern exists in the document notifying the reason for rejection, the application specific pattern and the cited document specific pattern are detected, and the paired character string is specified before or after these patterns. Or
If there is a character string in the order of the first character string, the character string pattern “corresponds to”, and the second character string following the character string pattern “in”, the first character string and the character string pattern Extracting a second character string as the paired character string pattern;
The method for illustrating a word or a combination of words in the patent document according to Supplementary Note 15, wherein the method is processing.
(Appendix 18)
The process of deleting the unnecessary character string is as follows:
A morpheme analysis process that selects a morpheme string having the highest probability, using a dictionary of morphemes whose elements are parts of speech and character strings, and a table defining connection probabilities of parts of speech,
Clause separation processing for generating clauses by connecting morphemes according to clause connection rules;
It consists of dependency analysis processing that selects the dependency destination with the highest probability according to the connection probability of the clause and the constraint rule.
The example of the word or the combination of words in the patent document of Additional remark 15 characterized by the above-mentioned.
(Appendix 19)
In the text data of the specification of the cited reference cited in the description of the document indicating the reason for refusal and the text data of the specification to be rejected, the pair of words or combinations of words By identifying the appearance position of the character string data corresponding to each constituting word or combination of words in the text data of each specification, each word constituting the paired word or combination of words or The computer further determines which of the word combinations is a specific example word or combination of words;
The example of the word or the combination of words in the patent document of Additional remark 15 characterized by the above-mentioned.
(Appendix 20)
In the determination of the specific example, only by specifying the appearance position, which word or combination of words constituting the pair of words or combination of words is a word or combination of words indicating a specific example. If the determination is not possible, the patent database storing the specification data with the character string corresponding to each word or combination of words constituting the pair of words or combinations of words is searched, and the claim of the specification Which word or word combination constitutes a specific example based on the number of occurrences of each word or word combination in the embodiment and in the embodiment, which word or word combination constitutes the paired word or word combination set Is a process for determining whether the combination is
An example method of a word or a combination of words in the patent document according to supplementary note 19, characterized by:
(Appendix 21)
A device for assisting in understanding a word or a combination of words in a patent document,
A receiving unit for receiving information specifying a specification including the patent document;
Indicates the reason for refusal by searching the text of the document indicating the reason for refusal based on the information specifying the specification from the reason for refusal database recording the document indicating the reason for refusal. A document search section;
From the text of the document indicating the reason for the retrieved rejection, the word or word combination character string described in the patent document and the document other than the specification are described and shown in contrast to the word or word combination. An extraction unit for extracting a character string of a word or a combination of words,
An output unit for outputting the extracted paired word or word combination data;
A device for exemplifying a word or a combination of words in a patent document.

１０１特許データベース
１０２特許文書検索部
１０３出願番号抽出部
１０４拒絶理由通知検索部
１０５拒絶理由データベース
１０６言葉対抽出部
１０７表現パターン辞書
１０８具体例付加対象特定部
１０９表示部
２０１公報データ
２０２拒絶理由
１４０１具体例判定部
１４０１−１特許文書解析部
１４０１−２言葉の出現位置特定部
１４０１−３特許文書検索ヒット件数取得部
１４０１−４判定部
１４０２言葉対データベース
２４０１言葉対検索部 DESCRIPTION OF SYMBOLS 101 Patent database 102 Patent document search part 103 Application number extraction part 104 Rejection reason notification search part 105 Rejection reason database 106 Word pair extraction part 107 Expression pattern dictionary 108 Specific example addition object specific part 109 Display part 201 Gazette data 202 Rejection reason 1401 Specific Example determination unit 1401-1 Patent document analysis unit 1401-2 Word appearance position specifying unit 1401-3 Patent document search hit number acquisition unit 1401-4 Determination unit 1402 Word pair database 2401 Word pair search unit

Claims

An apparatus for assisting in understanding a word or a combination of words in a patent document,
A receiving unit that receives information for specifying a specification that is a target indicating a specific example of a word or a combination of words included in the patent document;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. A document search unit indicating a reason for refusal to search text data of a document to be displayed based on information specifying the specification;
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A pair of a character string of a word or a combination of words described in text data of a specification to be used as a specific example of a combination and a character string of a word or a combination of words shown in contrast to the word or the combination of words An extraction unit for extracting
An output unit for outputting a pair of character strings of the extracted paired words or word combinations;
A device for exemplifying a word or a combination of words in a patent document.

The extraction unit includes:
When the character string of the pair expression pattern exists in the document notifying the reason for rejection, the application specific pattern and the cited document specific pattern are detected, and the paired character string is specified before or after these patterns. Or
If there is a character string in the order of the first character string, the character string pattern “corresponds to”, and the second character string following the character string pattern “in”, the first character string and the character string pattern Extracting a second character string as the paired character string pattern;
The example device of the word in the patent document of Claim 1, or the combination of a word characterized by the above-mentioned.

The process of deleting the unnecessary character string is as follows:
A morpheme analysis process that selects a morpheme string having the highest probability, using a dictionary of morphemes whose elements are parts of speech and character strings, and a table defining connection probabilities of parts of speech,
Clause separation processing for generating clauses by connecting morphemes according to clause connection rules;
It consists of dependency analysis processing that selects the dependency destination with the highest probability according to the connection probability of the clause and the constraint rule.
The example device of the word in the patent document of Claim 1, or the combination of a word characterized by the above-mentioned.

In the text data of the specification of the cited reference cited in the description of the document indicating the reason for refusal and the text data of the specification to be rejected, the pair of words or combinations of words By identifying the appearance position of the character string data corresponding to each constituting word or combination of words in the text data of each specification, each word constituting the paired word or combination of words or A specific example determination unit that determines which of the word combinations is a word indicating a specific example or a combination of words;
The example device of the word in the patent document of Claim 1, or the combination of a word characterized by the above-mentioned.

A device for assisting in understanding a word or a combination of words in a patent document,
An input unit for sequentially inputting text data of a specification to be used as a specific example of a word or a combination of words included in the patent document from a database storing the specification;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. A document search unit indicating a reason for refusal to search text data of a document to be displayed based on information specifying the specification;
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A character string pair of a word or a combination of words shown in contrast to the word or the combination of words described in the text data of the description of the specification to be a specific example of the combination An extractor for extracting;
A word-pair database that stores a set of character strings of the extracted pairs of words or word combinations;
An example device for a word or a combination of words in a patent document characterized by comprising:

A device to help understand words or combinations of words in patent documents,
Receiving information identifying a specification that is a target of a specific example of a word or a combination of words included in the patent document;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. Search the text data of the document to be shown based on the information specifying the specification,
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A pair of a character string of a word or a combination of words described in text data of a specification to be used as a specific example of a combination and a character string of a word or a combination of words shown in contrast to the word or the combination of words Extract
Outputting a set of character strings of the extracted paired words or word combinations;
A program for executing functions.

A method for a computer to assist in understanding a word or combination of words in a patent document,
Receiving information identifying a specification that is a target of a specific example of a word or a combination of words included in the patent document;
The reason for refusal indicated for the specification subject to the specific example of the word or combination of words from the refusal reason database recording the document showing the reason for refusal indicated for the specification. Search the text data of the document to be shown based on the information specifying the specification,
A character string having the same counter expression pattern as the pre-stored counter expression pattern is detected from the text data of the document indicating the reason for the retrieved rejection, and is described in the patent document based on the character string of the counter expression pattern. By detecting a pair of character strings each including a character string that is displayed and a character string shown in contrast to the character string, and deleting an unnecessary character string from the pair of character strings, A pair of a character string of a word or a combination of words described in text data of a specification to be used as a specific example of a combination and a character string of a word or a combination of words shown in contrast to the word or the combination of words Extract
Outputting a set of character strings of the extracted paired words or word combinations;
An example method of a word or a combination of words in a patent document, wherein the computer executes the above.