JP4806249B2

JP4806249B2 - Document information analysis apparatus and document information analysis method

Info

Publication number: JP4806249B2
Application number: JP2005325036A
Authority: JP
Inventors: 英介須藤; 克哉三室; 誠司高野
Original assignee: Nomura Research Institute Ltd
Current assignee: Nomura Research Institute Ltd
Priority date: 2005-11-09
Filing date: 2005-11-09
Publication date: 2011-11-02
Anticipated expiration: 2025-11-09
Also published as: JP2007133587A

Description

本発明は、例えば、特許文献や学術文献等の文献情報を分析する文献情報分析装置及び文献情報分析方法に関する。 The present invention relates to a document information analysis apparatus and a document information analysis method for analyzing document information such as patent documents and academic documents, for example.

例えば、文献情報としての特許文献を調査することにより、研究開発の方向性を探ったり、事業計画の立案等を行うことができる。ところで、特許公開公報のような特許文献の場合は、例えば、いわゆる親出願と分割出願、国内優先権主張出願と国内優先権主張の基礎となった出願等のように、複数の文献が密接に関係している場合がある。 For example, by investigating patent documents as document information, it is possible to find out the direction of research and development and to make business plans. By the way, in the case of a patent document such as a patent publication, a plurality of documents are closely connected, such as a so-called parent application and divisional application, a domestic priority claim application and an application on which the domestic priority claim is based. May be related.

このような関係にある複数の特許文献を精査することにより、その分割や国内優先権主張の可否等を判断することができる。そこで、引用された文献及びそれを引用する文献の関連性を視覚的に表現させるものが提案されている（特許文献１）。
特開２００４−３４８７７１号公報 By scrutinizing a plurality of patent documents having such a relationship, it is possible to determine whether or not the division or domestic priority claim is possible. Then, what makes the relevance of the cited literature and the literature which cites it express visually is proposed (patent document 1).
JP 2004-348771 A

前記文献に記載の技術によれば、文献単位での関連性はわかるが、各文献がどのように関連しているのかまでを知ることができない。ここで、文書作成ソフトウェアの中には、複数の文献間の相違を分析して、ユーザに知らせる機能を備えるものがある。しかし、このような文献比較機能を備えたソフトウェアは、単純に両文献のテキストコードの順番を解析するに過ぎず、文献間の実質的な相違を検出することはできない。 According to the technique described in the document, it is possible to know the relevance of each document, but it is impossible to know how each document is related. Here, some document creation software has a function of analyzing a difference between a plurality of documents and notifying the user. However, software provided with such a document comparison function simply analyzes the order of the text codes of both documents, and cannot detect a substantial difference between documents.

例えば、２つの段落の順番を入れ替えただけに過ぎない場合でも、そのようなソフトウェアは、各段落が全て変更されたと判断する。従って、ユーザは、関連性のある文献を一頁ずつ確認しながら、一致部分及び相違部分を解析する必要があり、作業性が低く、精査に要する時間及びコストが増大するという問題がある。 For example, even if only the order of two paragraphs is changed, such software determines that all the paragraphs have been changed. Therefore, it is necessary for the user to analyze the matching part and the different part while checking the related documents one page at a time, and there is a problem that the workability is low and the time and cost required for the examination increase.

そこで、本発明の目的は、ユーザにより選択された第１文献情報及び第２文献情報を所定単位のブロックで比較することにより、両者の内容をより適切に対比することができる文献情報分析装置及び文献情報分析方法を提供することにある。本発明の他の目的は、関連性のあるブロック同士を関連を示す表示要素で接続して可視化し、この関連を示す表示要素をユーザが選択するだけで各ブロック間の対比結果を容易に確認することができる文献情報分析装置及び文献情報分析方法を提供することにある。本発明のさらなる目的は、後述する実施形態の記載から明らかになるであろう。 Therefore, an object of the present invention is to compare the first document information and the second document information selected by the user with a block of a predetermined unit, and thereby compare the contents of the two more appropriately. It is to provide a literature information analysis method. Another object of the present invention is to connect and visualize related blocks with a display element indicating the relation, and the user can easily confirm the comparison result between the blocks only by selecting the display element indicating the relation. An object of the present invention is to provide a literature information analysis apparatus and a literature information analysis method that can be used. The further objective of this invention will become clear from description of embodiment mentioned later.

上記課題を解決すべく、本発明の一つの観点に従う文献情報分析装置は、それぞれ電子化された複数の文献情報をそれぞれ記憶する文献情報記憶部と、記憶された各文献情報のうちユーザにより選択された第１文献情報及び第２文献情報を文献情報記憶部からそれぞれ取得し、第１文献情報及び前記第２文献情報をそれぞれ所定単位のブロックとして処理させる文献情報取得部と、少なくとも第１文献情報及び第２文献情報に含まれる単語群に基づいて決定される座標系において、各ブロックの座標をそれぞれ算出するブロック座標算出部と、各ブロックの座標に基づいて各ブロックを配置し、かつ、各ブロックのうち第１文献情報に由来する所定の第１ブロックと第２文献情報に由来する所定の第２ブロックとを、両者の関連を示す表示要素で接続することによりマップ情報を生成するマップ生成部と、生成された前記マップ情報を表示させるマップ表示部と、を備える。
第１文献情報の全体及び第２文献情報の全体をぞれぞれ一つずつの所定単位のブロックとして扱うこともできるし、または、第１文献情報を分割してなる複数の情報及び前記第２文献情報を分割してなる複数の情報を、それぞれ一つずつの所定単位のブロックとして扱うこともできる。なお、文献情報の全体を一つのブロックとして扱う場合、例えば、このブロックを「全体ブロック」と呼ぶことができ、文献情報を複数に分割してそれぞれを一つずつのブロックとして扱う場合は、これらのブロックを「部分ブロック」または「サブブロック」と呼ぶことができる。 In order to solve the above-described problem, a literature information analyzing apparatus according to one aspect of the present invention includes a literature information storage unit that stores a plurality of digitized literature information, and a user's selection among each stored literature information. A first document information acquisition unit that acquires the first document information and the second document information obtained from the document information storage unit, and processes each of the first document information and the second document information as a block of a predetermined unit, and at least the first document In the coordinate system determined based on the word group included in the information and the second document information, a block coordinate calculation unit that calculates the coordinates of each block, and arranges each block based on the coordinates of each block, and Of each block, a predetermined first block derived from the first document information and a predetermined second block derived from the second document information are displayed to indicate the relationship between the two. In comprising a map generator for generating map information by connecting a map display unit for displaying the generated the map information.
The whole of the first document information and the whole of the second document information can be treated as a block of a predetermined unit each, or a plurality of information obtained by dividing the first document information and the first information A plurality of pieces of information obtained by dividing two pieces of document information can be handled as a predetermined unit block. When handling the entire document information as one block, for example, this block can be referred to as an “entire block”. When the document information is divided into a plurality of blocks and treated as one block, these These blocks can be called “partial blocks” or “sub-blocks”.

ここで、文献情報としては、技術文献情報、学術論文情報、特許文献情報等を挙げることができる。特許文献情報としては、例えば、特許公開公報や実用新案登録公報等のように、特許出願または実用新案登録出願に関する文献情報を挙げることができる。所定単位としては、段落単位または請求項単位のいずれか又は両方を使用可能である。 Here, examples of the document information include technical document information, academic paper information, and patent document information. Examples of patent document information include document information relating to patent applications or utility model registration applications such as patent publications and utility model registration gazettes. As the predetermined unit, either a paragraph unit or a claim unit or both can be used.

ユーザは、第１文献情報及び第２文献情報をそれぞれ選択することができるが、いずれか一方または両方の選択に際して、支援を与えることもできる。例えば、各文献情報に含まれている書誌的事項（出願人名、発明者名、特記事項、出願番号、基礎出願番号、出願日、公開日等）や従来技術に関する事項（先行特許文献を特定する公報番号等）に基づいて、互いに関連する文献情報を自動的に抽出することができる。このような関連する文献情報を自動的に抽出する関連文献抽出手段を設けることにより、ユーザは、第１文献情報を指定するだけでよく、操作性が向上する。第１文献情報に関連する（対応する）第２文献情報は、関連文献抽出手段によって自動的に抽出される。 The user can select the first document information and the second document information, respectively, but can also provide support when selecting either one or both. For example, bibliographic items (applicant name, inventor name, special remarks, application number, basic application number, application date, publication date, etc.) included in each document information and prior art items (prior patent documents are specified) Bibliographic information related to each other can be automatically extracted based on a publication number or the like. By providing the related document extracting means for automatically extracting such related document information, the user only has to specify the first document information, and the operability is improved. The second document information related to (corresponding to) the first document information is automatically extracted by the related document extracting means.

このように、ユーザは、直接的にまたは間接的に、複数の文献情報を選択することができる。選択された各文献情報は、所定単位のブロックにそれぞれ分割される。ユーザが指定した範囲についてのみ、各文献情報をそれぞれブロックに分割してもよい。例えば、特許文献の場合、特許請求の範囲のみを請求項単位で分割したり、実施形態の記載のみを段落単位で分割することができる。 In this way, the user can select a plurality of document information directly or indirectly. Each selected document information is divided into predetermined units of blocks. Each document information may be divided into blocks only for the range designated by the user. For example, in the case of patent documents, only the claims can be divided in units of claims, or only the description of the embodiment can be divided in units of paragraphs.

そして、ブロック座標算出部は、所定の座標系における各ブロックの座標をそれぞれ算出する。所定の座標系は、例えば、選択された各文献情報にそれぞれ含まれる単語群の出現頻度等を解析することにより、決定可能である。あるいは、文献情報の全体を一つのブロックとして扱う場合、複数の文献情報から抽出されるキーワードの組合せ及び出現頻度を主成分分析することにより、座標系を決定することができる。座標軸を２つ設定すれば２次元の座標系を得ることができ、座標軸を３つ設定すれば３次元の座標系を得ることができる。ブロック座標算出部は、例えば、第１文献情報及び第２文献情報をそれぞれ分割してなるブロックを用いる場合において、各ブロックにそれぞれ含まれる各単語の座標の平均値を求め、この平均値をそのブロックの座標として決定する。 Then, the block coordinate calculation unit calculates the coordinates of each block in a predetermined coordinate system. The predetermined coordinate system can be determined, for example, by analyzing the appearance frequency of a word group included in each selected document information. Alternatively, when the entire document information is handled as one block, the coordinate system can be determined by performing principal component analysis on keyword combinations and appearance frequencies extracted from a plurality of document information. If two coordinate axes are set, a two-dimensional coordinate system can be obtained, and if three coordinate axes are set, a three-dimensional coordinate system can be obtained. For example, in the case where a block formed by dividing the first document information and the second document information is used, the block coordinate calculation unit obtains an average value of the coordinates of each word included in each block, and calculates the average value as the average value. Determined as block coordinates.

以下、主として、第１文献情報及び第２文献情報をそれぞれ複数のブロックに分割する場合を例に挙げて説明する。マップ生成部は、各ブロックをそれぞれの座標に配置し、さらに、所定のブロック同士を、両者の関連を示す表示要素で接続することにより、マップ情報を生成する。両者の関連を示す表示要素としては、例えば、線や楕円、長方形、三角形等の各種表示要素を挙げることができる。関連するブロック同士を線で接続する場合、マップ情報の表示エリアを有効に利用することができる。これにより、所定の座標系に各ブロックがそれぞれの位置に配置され、かつ、所定のブロック同士が表示要素で接続されて関連性が可視化されたマップ情報が生成される。ここで、留意すべきは、第１文献情報に由来する第１ブロックと、第２文献情報に由来する第２ブロックとが接続される点である。即ち、第１ブロック同士、または第２ブロック同士が接続されることはない。 Hereinafter, a case where the first document information and the second document information are each divided into a plurality of blocks will be mainly described as an example. A map generation part arrange | positions each block to each coordinate, and also produces | generates map information by connecting predetermined blocks with the display element which shows both relationship. Examples of display elements indicating the relationship between the two include various display elements such as lines, ellipses, rectangles, and triangles. When connecting related blocks with a line, the display area of map information can be used effectively. As a result, map information is generated in which each block is arranged at each position in a predetermined coordinate system, and the predetermined blocks are connected to each other by the display element to visualize the relationship. Here, it should be noted that the first block derived from the first document information and the second block derived from the second document information are connected. That is, the first blocks or the second blocks are not connected.

本発明の実施形態では、マップ生成部は、第１ブロックと第２ブロックが所定距離以上離れている場合に、これら第１ブロック及び第２ブロック同士を、両者の関連を示す表示要素で接続する。第１ブロックと第２ブロックとが所定距離以上離れていない場合、即ち、両ブロックが近接している場合、両者の内容がほぼ同一であると判断することが可能である。従って、この場合は、各ブロックを接続せず、有意な相違が見られるブロック同士のみを接続する。 In the embodiment of the present invention, when the first block and the second block are separated from each other by a predetermined distance or more, the map generation unit connects the first block and the second block with a display element indicating the relationship between the first block and the second block. . When the first block and the second block are not separated by a predetermined distance or more, that is, when both blocks are close to each other, it is possible to determine that the contents of both are almost the same. Therefore, in this case, the blocks are not connected, and only the blocks where a significant difference is seen are connected.

本発明の実施形態では、マップ生成部は、所定距離以上離間している第１ブロックと第２ブロックとの少なくともいずれか一方が複数存在する場合、その中で最も近い第１ブロックと第２ブロックとを、両者の関連を示す表示要素で接続させる。そのブロックに最も近いブロックとの関連性が高いと考えられるためである。 In the embodiment of the present invention, when there are a plurality of at least one of the first block and the second block that are separated by a predetermined distance or more, the map generation unit is closest to the first block and the second block. Are connected with a display element indicating the relationship between the two. This is because it is considered highly relevant to the block closest to that block.

本発明の実施形態では、関連を示す表示要素がユーザによって選択された場合、この選択された関連を示す表示要素によって接続されている第１ブロックの内容と第２ブロックの内容とを予め設定された所定のレベルで対比し、この対比結果を出力させるブロック対比部を備える。 In the embodiment of the present invention, when the display element indicating the relation is selected by the user, the contents of the first block and the contents of the second block connected by the display element indicating the selected relation are preset. In addition, a block comparison unit is provided that compares at a predetermined level and outputs the comparison result.

ここで、所定のレベルとしては、例えば、単語レベル、句レベル、文レベルを挙げることができる。これらのレベルのうち、いずれを使用するかは、ユーザが自由に選択することができる。 Here, examples of the predetermined level include a word level, a phrase level, and a sentence level. The user can freely select which of these levels to use.

ブロック対比部は、関連を示す表示要素で接続された第１ブロックの内容と第２ブロックの内容との相違部分を対比結果として出力させることもできるし、両ブロックの共通部分を対比結果として出力させることもできる。 The block comparison unit can output the difference between the content of the first block and the content of the second block connected by the display element indicating the relationship as a comparison result, or output the common part of both blocks as the comparison result It can also be made.

本発明の他の観点に従うプログラムは、コンピュータを、文献情報記憶部に電子化されて記憶された複数の文献情報の中からユーザにより選択された第１文献情報及び第２文献情報をそれぞれ取得し、第１文献情報及び第２文献情報をそれぞれ所定単位のブロックに分割する文献情報分割手段と、第１文献情報及び第２文献情報に含まれる単語群に基づいて決定される座標系において、各ブロックの座標をそれぞれ算出するブロック座標算出手段と、各ブロックの座標に基づいて各ブロックを配置し、かつ、各ブロックのうち第１文献情報に由来する所定の第１ブロックと第２文献情報に由来する所定の第２ブロックとを、両者の関連を示す表示要素で接続することによりマップ情報を生成するマップ生成手段として、それぞれ機能させる。 A program according to another aspect of the present invention acquires a first document information and a second document information selected by a user from a plurality of document information stored electronically in a document information storage unit. In the coordinate system determined based on the word information included in the first document information and the second document information, the document information dividing means for dividing the first document information and the second document information into blocks of predetermined units, respectively, Block coordinate calculating means for calculating the coordinates of each block, each block is arranged based on the coordinates of each block, and the predetermined first block and second document information derived from the first document information among each block The predetermined second blocks that are derived are functioned as map generation means for generating map information by connecting them with display elements indicating the relationship between them.

本発明のさらに別の観点に従う文献情報分析方法は、文献情報記憶部に記憶された複数の文献情報の中からユーザによって選択された第１文献情報及び第２文献情報をそれぞれ取得するステップと、第１文献情報及び第２文献情報をそれぞれ所定単位のブロックに分割するステップと、第１文献情報及び第２文献情報に含まれる単語群に基づいて決定される座標系において、各ブロックの座標をそれぞれ算出するステップと、各ブロックの座標に基づいて、各ブロックを座標系に従うマップ上にそれぞれ配置させるステップと、マップに配置された各ブロックのうち第１文献情報に由来する所定の第１ブロックと第２文献情報に由来する所定の第２ブロックとを、両者の関連を示す表示要素で接続させるすることによりマップ情報を生成するステップと、生成されたマップ情報を表示させるステップと、ユーザによって両者の関連を示す表示要素が選択された場合には、この選択された関連を示す表示要素によって接続されている第１ブロックの内容と第２ブロックの内容とを予め設定された所定のレベルで対比し、この対比結果を出力させるステップとを含む。 The document information analysis method according to still another aspect of the present invention includes a step of acquiring first document information and second document information selected by a user from among a plurality of document information stored in a document information storage unit, In the coordinate system determined based on a word group included in the first document information and the second document information, the step of dividing the first document information and the second document information into blocks of predetermined units, respectively, A step of calculating, a step of arranging each block on a map according to the coordinate system based on the coordinates of each block, and a predetermined first block derived from the first document information among the blocks arranged on the map Map information is generated by connecting a predetermined second block derived from the second document information with a display element indicating a relation between the two and the second document information. The step of displaying the generated map information and the display element showing the relation between the two when the user selects the contents of the first block connected by the display element showing the selected relation And comparing the contents of the second block with a predetermined level set in advance and outputting the comparison result.

なお、複数の文献情報をブロック単位で比較する前に、比較対象の各文献情報間の関連性を予め自動的に検出し、両文献情報を両者の関連性を示す接続線で接続しておくこともできる。例えば、ユーザによって選択された複数の文献情報間の関係（分割前と分割後の出願、変更前と変更後の出願、補正前と補正後の出願、引用・被引用等）を各文献情報に含まれる書誌事項等に基づいて自動的に判定し、この判定された関係に応じた線種（色彩や線の種類、線の太さ等）の接続線を自動的に選択し、各文献情報を接続線で接続して表示させることができる。そして、ユーザが手動操作で接続線を選択すると、この接続線で結ばれた各文献情報がブロック単位で比較されるようにしてもよい。 Before comparing a plurality of document information in units of blocks, the relationship between each document information to be compared is automatically detected in advance, and both document information is connected by a connection line indicating the relationship between the two. You can also. For example, the relationship between a plurality of document information selected by the user (pre-division and post-division applications, pre-change and post-change applications, pre-correction and post-correction applications, citations and citations, etc.) Judgment is automatically made based on the bibliographic items included, and the connection line of the line type (color, line type, line thickness, etc.) corresponding to the determined relationship is automatically selected, and each document information Can be connected and displayed. And when a user selects a connection line by manual operation, each literature information connected with this connection line may be compared in a block unit.

以下、図面に基づき、本発明の実施の形態を説明する。本実施形態に係る文献情報分析装置１は、マップ制御部１０と、対比制御部２０及び情報蓄積部３０を備えたコンピュータ装置として構成可能である。そして、文献情報分析装置１は、例えば、インターネットやLAN（Local Area Network）等の通信ネットワークを介して、クライアント端末２と双方向通信可能に接続されている。クライアント端末２は、例えば、パーソナルコンピュータや携帯情報端末（携帯電話を含む）等として構成可能である。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. The document information analysis apparatus 1 according to the present embodiment can be configured as a computer apparatus including a map control unit 10, a comparison control unit 20, and an information storage unit 30. The document information analysis apparatus 1 is connected to the client terminal 2 so as to be capable of bidirectional communication via a communication network such as the Internet or a LAN (Local Area Network). The client terminal 2 can be configured as, for example, a personal computer or a portable information terminal (including a mobile phone).

マップ制御部１０は、情報蓄積部３０に記憶されている多数の文献情報に基づいて、技術マップ１００を生成し、出力する。生成されたマップ１００は、通信ネットワークを介して、クライアント端末２に送信される。マップ１００は、クライアント端末２内に保存可能としてもよいし、クライアント端末２内に保存不能としてもよい。マップ制御部１０の詳細はさらに後述する。 The map control unit 10 generates and outputs a technical map 100 based on a large number of document information stored in the information storage unit 30. The generated map 100 is transmitted to the client terminal 2 via the communication network. The map 100 may be stored in the client terminal 2 or may not be stored in the client terminal 2. Details of the map control unit 10 will be described later.

対比制御部２０は、マップ１００に表示されている接続線がユーザによって選択された場合に、その接続線で結ばれている各ブロック１１１，１１２の内容を比較し、その結果をマップ１００上に表示させる。対比制御部２０の詳細はさらに後述する。 When the connection line displayed on the map 100 is selected by the user, the comparison control unit 20 compares the contents of the blocks 111 and 112 connected by the connection line, and the result is displayed on the map 100. Display. Details of the contrast control unit 20 will be described later.

情報蓄積部３０は、例えば、特許公開公報や登録公報、あるいは、科学技術論文等のような文献情報を多数記憶している。 The information storage unit 30 stores a large number of document information such as patent publications, registration bulletins, and scientific and technical papers.

マップ１００の構成を説明する。マップ１００は、ユーザから指示された目的に添って生成されるものである。ユーザは、例えば、文献番号やサーチワード等を指定することにより、情報蓄積部３０に記憶されている多数の文献情報群の中から、少なくとも２つ以上の文献情報を選ぶことができる。例えば、ユーザは、いわゆる親出願と分割出願、国内優先権主張出願と基礎出願、変更出願と変更前の出願、実用新案登録に基づく特許出願と実用新案登録出願、引用れた出願とこの出願を引用している出願等のように、互いに関連する複数の文献情報をそれぞれ選択することができる。 The configuration of the map 100 will be described. The map 100 is generated according to the purpose instructed by the user. For example, the user can select at least two or more document information from among a large number of document information groups stored in the information storage unit 30 by designating a document number, a search word, or the like. For example, a user can submit a so-called parent application and divisional application, domestic priority claim application and basic application, change application and application before change, patent application based on utility model registration and utility model registration application, cited application and this application. A plurality of pieces of literature information related to each other can be selected as in the cited application.

後述のように、相互に関連する文献情報のうちいずれか一方がユーザによって選択された場合、この選択された文献情報に関連する他方の文献情報を自動的に選択する構成としてもよい。以下の説明では、ユーザによって選択された一方の文献情報を第１文献情報と、他方の文献情報を第２文献情報と呼ぶ。 As will be described later, when any one of the mutually related document information is selected by the user, the other document information related to the selected document information may be automatically selected. In the following description, one document information selected by the user is referred to as first document information, and the other document information is referred to as second document information.

ユーザにより選択された互いに関連する複数の文献情報は、マップ制御部１０によって解析され、マップ１００が生成される。生成されたマップ１００は、クライアント端末２に提供される。 A plurality of pieces of document information related to each other selected by the user are analyzed by the map control unit 10 and a map 100 is generated. The generated map 100 is provided to the client terminal 2.

次に、マップ１００の構成を説明する。等高線１０１Ａ，１０１Ｂは、マップ１００に含まれる単語の分布密度を示す表示要素である。また、マップ１００には、複数のブロック１１１，１１２が表示されている。第１ブロック１１１は、第１文献情報を所定単位で分割することにより生成されたものである。第２ブロック１１２は、第２文献情報を所定単位で分割することにより生成されたものである。所定単位としては、特許文献の場合を例に挙げると、段落単位、請求項単位、実施例単位等を用いることができる。 Next, the configuration of the map 100 will be described. The contour lines 101A and 101B are display elements indicating the distribution density of words included in the map 100. The map 100 displays a plurality of blocks 111 and 112. The first block 111 is generated by dividing the first document information by a predetermined unit. The second block 112 is generated by dividing the second document information by a predetermined unit. As the predetermined unit, for example, in the case of patent documents, a paragraph unit, a claim unit, an example unit, and the like can be used.

そして、各ブロック１１１，１１２のうち互いに関連するブロック同士は、接続線１１３によって接続されている。この接続線１１３により、ユーザは、そのブロック間に関連性が存在することを容易に把握することができる。ユーザが、接続線１１３をクリック等して選択すると、対比制御部２０は、その接続線１１３で結ばれたブロックの内容を対比し、対比結果を表示させる。 Of the blocks 111 and 112, the blocks related to each other are connected by a connection line 113. With this connection line 113, the user can easily grasp that there is a relationship between the blocks. When the user selects the connection line 113 by clicking or the like, the comparison control unit 20 compares the contents of the blocks connected by the connection line 113 and displays the comparison result.

このように、本実施形態では、複数の文献情報の全体をそのまま対比するのではなく、各文献情報をブロック１１１，１１２に分割し、関連性のあるブロック同士のみを接続線１１３で接続してユーザに通知する。ユーザは、所望の接続線１１３を個別に選択することにより、特定のブロック間の対比結果を得ることができる。以下、本実施形態の詳細をさらに説明する。 In this way, in this embodiment, rather than comparing the entire plurality of document information as they are, each document information is divided into blocks 111 and 112, and only relevant blocks are connected by the connection line 113. Notify the user. The user can obtain a comparison result between specific blocks by individually selecting a desired connection line 113. Hereinafter, details of the present embodiment will be further described.

図２は、文献情報分析装置１の機能構成を示すブロック図である。マップ制御部１０は、例えば、マップ生成条件入力受付部１１と、文献情報分割部１２と、ブロック座標算出部１３と、ブロック間類似度判定部１４と、マップ生成部１５及びマップ表示部１６とを含んで構成することができる。
また、情報蓄積部３０の一例として、文献データベース（図中、データベースを「DB」と略記）３１と、単語データベース３２と、インデックスデータベース３３とを設けることができる。
文献データベース３１は、複数の文献情報を記憶するものである。単語データベース３２は、辞書として使用されるものである。インデックスデータベース３３は、各文献にどのようなキーワードが含まれているかを管理するためのものである。 FIG. 2 is a block diagram illustrating a functional configuration of the document information analysis apparatus 1. The map control unit 10 includes, for example, a map generation condition input reception unit 11, a document information division unit 12, a block coordinate calculation unit 13, an inter-block similarity determination unit 14, a map generation unit 15, and a map display unit 16. Can be configured.
In addition, as an example of the information storage unit 30, a document database (in the figure, database is abbreviated as “DB”) 31, a word database 32, and an index database 33 can be provided.
The document database 31 stores a plurality of document information. The word database 32 is used as a dictionary. The index database 33 is for managing what keywords are included in each document.

マップ生成条件入力受付部１１は、ユーザから指定されるマップ生成条件の入力を受け付けるためのものである。受け付けるとは、例えば、ユーザの指定するマップ生成条件を示す情報を通信インターフェースを介して電子情報として受信し、メモリ等に記憶させることを意味する。 The map generation condition input receiving unit 11 is for receiving an input of a map generation condition designated by the user. Accepting means, for example, receiving information indicating map generation conditions designated by the user as electronic information via a communication interface and storing the information in a memory or the like.

文献情報分割部１２は、ユーザによって選択された第１文献情報及び第２文献情報を、ユーザから指定された条件に基づいて、それぞれ分割する。ユーザは、例えば、段落単位や請求項単位等の分割単位を指定することができる。なお、各文献情報は、同一レベル（単位）で分割される。 The document information dividing unit 12 divides the first document information and the second document information selected by the user based on conditions specified by the user. For example, the user can specify a division unit such as a paragraph unit or a claim unit. Each document information is divided at the same level (unit).

ブロック座標算出部１３は、文献データベース（図中、データベースを「DB」と略記）３１及び単語データベース３２を検索することにより、各ブロックに含まれている単語を抽出する。例えば、ブロック座標算出部１３は、そのブロック内の各単語の座標の平均値を算出することにより、そのブロックの座標を算出する。 The block coordinate calculation unit 13 extracts a word included in each block by searching a literature database 31 (in the figure, the database is abbreviated as “DB”) 31 and a word database 32. For example, the block coordinate calculation unit 13 calculates the coordinates of the block by calculating the average value of the coordinates of each word in the block.

ブロック間類似度判定部１４は、各ブロック１１１，１１２の座標を比較し、両者の類似度を判別する。これにより、実質的に同一内容であると判定可能なブロック同士を接続の対象から排除し、実質的に同一ではないが関連するブロック同士のみを接続対象として抽出することができる。 The inter-block similarity determination unit 14 compares the coordinates of the blocks 111 and 112, and determines the similarity between them. As a result, blocks that can be determined to have substantially the same contents can be excluded from the connection targets, and only related blocks that are not substantially the same but can be extracted as connection targets.

マップ生成部１５は、各ブロック１１１，１１２をそれぞれの座標に配置すると共に、互いに関連するブロック同士を接続線１１３で接続することにより、マップ１００の論理的構造を生成する。 The map generation unit 15 generates the logical structure of the map 100 by arranging the blocks 111 and 112 at the respective coordinates and connecting the blocks related to each other by the connection line 113.

マップ表示部１６は、生成されたマップ１００の構造に応じて、所定の表示要素をそれぞれ割り当てることにより、ユーザが視認可能なマップ１００を生成し、クライアント端末２に提供する。所定の表示要素としては、例えば、各ブロックを示すための図形要素、等高線１０１Ａ等、マップ１００に対する操作を行うためのメニュー表示部などを挙げることができる。 The map display unit 16 generates a map 100 that can be visually recognized by the user by allocating predetermined display elements according to the structure of the generated map 100, and provides the map 100 to the client terminal 2. Examples of the predetermined display element include a graphic element for indicating each block, a contour display 101A, a menu display unit for performing an operation on the map 100, and the like.

ユーザは、クライアント端末２の端末画面に表示されたマップ１００を閲覧することにより、選択した文献情報の記述内容（ブロック）がどのように分布しており、どの記述内容同士が互いに関連しているのか等を容易に読みとることができる。 The user browses the map 100 displayed on the terminal screen of the client terminal 2 to determine how the description contents (blocks) of the selected document information are distributed and which description contents are related to each other. Can be easily read.

対比制御部２０は、例えば、接続線指定情報入力受付部２１と、ブロック対比部２２と、対比結果出力部２３とを含んで構成することができる。 The comparison control unit 20 can be configured to include, for example, a connection line designation information input reception unit 21, a block comparison unit 22, and a comparison result output unit 23.

接続線指定情報入力受付部２１は、ユーザから入力される接続線指定情報を電子的に受け付けて記憶する。例えば、ユーザによるマップ１００上の操作は、イベントハンドラによってイベントとして検出することができる。 The connection line designation information input receiving unit 21 electronically receives and stores connection line designation information input from the user. For example, an operation on the map 100 by the user can be detected as an event by an event handler.

ブロック対比部２２は、ユーザの選択する接続線１１３によって接続されているブロック１１１，１１２の内容をテキストマイニング等の手法を用いて対比する。この対比結果は、対比結果出力部２３によって、マップ１００上に表示される。 The block comparison unit 22 compares the contents of the blocks 111 and 112 connected by the connection line 113 selected by the user using a technique such as text mining. This comparison result is displayed on the map 100 by the comparison result output unit 23.

対比結果表示部２４は、例えば、対比対象のブロック間の一致部分と不一致部分とが明確に区別できるような形態で、対比結果を表示する。例えば、下線や取消線、フォント種別、大きさ、色等を適宜変えることにより、どの部分が一致しており、どの部分が一致していないのかを区別して表示させることができる。 For example, the comparison result display unit 24 displays the comparison result in such a form that a matched portion and a mismatched portion between blocks to be compared can be clearly distinguished. For example, by appropriately changing the underline, strikethrough, font type, size, color, etc., it is possible to distinguish and display which part matches and which part does not match.

図３は、文献情報分析装置１及びクライアント端末２のハードウェア構成の概略を示す構成説明図である。文献情報分析装置１は、上述のように、サーバコンピュータ等のように構成可能である。 FIG. 3 is a configuration explanatory diagram showing an outline of the hardware configuration of the document information analysis apparatus 1 and the client terminal 2. As described above, the document information analysis apparatus 1 can be configured as a server computer or the like.

文献情報分析装置１は、例えば、通信インターフェース（図中、インターフェースを「I/F」と略記）１００１と、CPU（Central Processing Unit）１００２と、ROM（Read Only Memory）１００３と、RAM（Random Access Memory）１００４と、補助記憶装置１００５を備えて構成することができる。 The document information analyzer 1 includes, for example, a communication interface (in the figure, the interface is abbreviated as “I / F”) 1001, a CPU (Central Processing Unit) 1002, a ROM (Read Only Memory) 1003, and a RAM (Random Access). Memory) 1004 and an auxiliary storage device 1005 can be provided.

補助記憶装置１００５には、例えば、OS（Operating System）の他に、文献データベース３１と、単語データベース３２と、表示制御プログラム１１１０と、主成分分析プログラム１１２０と、構造解析プログラム１１３０と、検索プログラム１１４０及びウェブサーバプログラム１１５０を、それぞれ記憶させることができる。 In the auxiliary storage device 1005, for example, in addition to the OS (Operating System), the document database 31, the word database 32, the display control program 1110, the principal component analysis program 1120, the structure analysis program 1130, and the search program 1140. And the web server program 1150 can be stored.

文献データベース３１には、上述のように、特許公開公報等の文献情報が予め記憶されている。単語データベース３２には、例えば、助詞や接続詞等のようなキーワードに不適切な単語、同義語、類義語等が予め記憶されている。インデックスデータベース３３は、各文献に含まれているキーワードを管理する。表示制御プログラム１１１０は、マップ１００の描画処理等を行うプログラムである。主成分分析プログラム１１２０は、主成分分析を行うプログラムである。構造解析プログラム１１３０は、例えば、テキストマイニング等の手法に基づいて、テキストデータの構造を解析するプログラムである。検索プログラム１１４０は、入力された検索条件に基づいて、文献データベース３１等を検索するプログラムである。ウェブサーバプログラム１１５０は、ウェブサーバ機能を実現するプログラムである。 As described above, the document database 31 stores document information such as patent publications in advance. In the word database 32, for example, words, synonyms, synonyms, and the like inappropriate for keywords such as particles and conjunctions are stored in advance. The index database 33 manages keywords included in each document. The display control program 1110 is a program that performs drawing processing of the map 100 and the like. The principal component analysis program 1120 is a program for performing principal component analysis. The structure analysis program 1130 is a program for analyzing the structure of text data based on a technique such as text mining. The search program 1140 is a program that searches the literature database 31 and the like based on the input search conditions. The web server program 1150 is a program that realizes a web server function.

クライアント端末２の構成を説明する。クライアント端末２は、インターネット等の通信ネットワークCNを介して文献情報分析装置１に接続されており、例えば、通信インターフェース２００１と、CPU２００２と、ROM２００３と、RAM２００４及び補助記憶装置２００５を備えて構成可能である。 The configuration of the client terminal 2 will be described. The client terminal 2 is connected to the document information analysis apparatus 1 via a communication network CN such as the Internet. For example, the client terminal 2 can be configured to include a communication interface 2001, a CPU 2002, a ROM 2003, a RAM 2004, and an auxiliary storage device 2005. is there.

なお、以上の構成は例示であって、本発明はこれに限定されない。例えば、プログラムに代えて、プログラマブル・ロジック・デバイス等のようなハードウェア回路を用いて、各機能の少なくとも一部を実現可能な場合もある。 In addition, the above structure is an illustration and this invention is not limited to this. For example, it may be possible to realize at least a part of each function by using a hardware circuit such as a programmable logic device instead of the program.

文献情報分析装置１（以下、分析装置１とも呼ぶ）を用いた文献情報分析方法について説明する。図４は、全体動作の概要を示すフローチャートである。なお、以下に述べる各フローチャートも同様であるが、各フローチャートは、処理の概要を示しており、実際のプログラムとは相違する。なお、後述のように、ユーザによって直接的または間接的に選択された複数の文献情報間の関係を自動的に判定し、判定された関係に応じた線種の接続線で両文献情報を接続して表示しておくこともできる。 A document information analysis method using the document information analysis apparatus 1 (hereinafter also referred to as the analysis apparatus 1) will be described. FIG. 4 is a flowchart showing an overview of the overall operation. The flowcharts described below are also the same, but each flowchart shows an outline of the process and is different from an actual program. In addition, as described later, the relationship between a plurality of document information selected directly or indirectly by the user is automatically determined, and both document information is connected by a connection line of a line type corresponding to the determined relationship. It can also be displayed.

まず、ユーザは、クライアント端末２のユーザインターフェースを介して、比較対象となる文献情報を指定する（Ｓ１０）。例えば、ユーザは、第１文献情報及び第２文献情報のそれぞれの文献番号等を指定することができる。あるいは、ユーザは、いずれか一方の文献番号のみを指定することもできる。 First, the user designates document information to be compared via the user interface of the client terminal 2 (S10). For example, the user can designate the respective document numbers of the first document information and the second document information. Alternatively, the user can specify only one of the document numbers.

分析装置１は、クライアント端末２から指定条件を取得すると（Ｓ１１）、この指定条件に基づいて文献データベース３１を検索し、指定条件に合致する文献情報をそれぞれ抽出する（Ｓ１２）。 When the analyzer 1 acquires the specified condition from the client terminal 2 (S11), the analyzer 1 searches the document database 31 based on the specified condition, and extracts the document information that matches the specified condition (S12).

抽出された文献情報の文献番号等は、分析装置１からクライアント端末２に送信され（Ｓ１３）、クライアント端末２の画面に表示される（Ｓ１４）。ユーザは、クライアント端末２の画面を介して、解析（対比）の実行を指示する（Ｓ１５）。 The document number etc. of the extracted document information is transmitted from the analyzer 1 to the client terminal 2 (S13) and displayed on the screen of the client terminal 2 (S14). The user instructs execution of analysis (contrast) via the screen of the client terminal 2 (S15).

この指示を与える際に、ユーザは、対比条件を指定することもできる。対比条件としては、例えば、第１文献情報及び第２文献情報のどの部分をどの単位で対比するか等の条件を挙げることができる。 When giving this instruction, the user can also specify a comparison condition. Examples of the comparison condition include conditions such as which part of the first document information and the second document information is compared in which unit.

分析装置１は、ユーザから指示を受信すると、抽出された第１文献情報及び第２文献情報に基づいて、マップ１００を生成する。詳細は後述するが、分析装置１は、第１文献情報及び第２文献情報をそれぞれ複数のブロックに分割し、各ブロックをマップ上に配置すると共に、互いに関連するブロック同士を接続線１１３で接続させる。生成されたマップ１００は、クライアント端末２に送信される（Ｓ１７）。 When receiving an instruction from the user, the analysis apparatus 1 generates a map 100 based on the extracted first document information and second document information. Although details will be described later, the analysis apparatus 1 divides each of the first document information and the second document information into a plurality of blocks, arranges each block on the map, and connects the blocks related to each other through the connection line 113. Let The generated map 100 is transmitted to the client terminal 2 (S17).

ユーザは、クライアント端末２の画面を介して、マップ１００を閲覧し、ブロックの配置具合や接続線１１３の設定状態等を確認する。そして、ユーザは、詳細な対比を希望する場合、所望のブロック間を結ぶ接続線１１３を指定する（Ｓ１９）。 The user browses the map 100 via the screen of the client terminal 2 and confirms the arrangement state of the blocks, the setting state of the connection line 113, and the like. When the user desires a detailed comparison, the user designates the connection line 113 that connects the desired blocks (S19).

分析装置１は、ユーザによる接続線の指定を検出すると、ブロック間の内容を対比し（Ｓ２０）、この対比結果をクライアント端末２に送信する（Ｓ２１）。ユーザは、マップ１００に表示された表示部２４を閲覧することにより、所望のブロック１１１，１１２の内容がどの程度相違しているか（共通しているか）を確認することができる。 When the analysis apparatus 1 detects the designation of the connection line by the user, the contents of the blocks are compared (S20), and the comparison result is transmitted to the client terminal 2 (S21). The user can confirm how much the contents of the desired blocks 111 and 112 are different (common) by browsing the display unit 24 displayed on the map 100.

図５は、各文献情報を複数のブロックに分割等してマップ１００を生成するための処理を示すフローチャートである。この処理は、図４中のＳ１６に対応する。まず、分析装置１は、第１文献情報及び第２文献情報にそれぞれ含まれる単語群に基づいて、各単語の出現頻度等から単語の座標軸を２つ決定する（Ｓ３１）。
なお、Ｓ３１における座標軸算出処理は、一般的な多変量解析の一手法である主成分分析を利用している。例えば、座標算出のために使用する基礎データとして、各文献（または各ブロック）のそれぞれについて、対象の単語が出現しているか否かを「1（出現）」「0（非出現）」のマトリックスを利用して算出し、この結果算出される「主成分負荷量」と呼ばれる指標を実際の座標とする。 FIG. 5 is a flowchart showing a process for generating the map 100 by dividing each document information into a plurality of blocks. This process corresponds to S16 in FIG. First, the analysis apparatus 1 determines two coordinate axes of words based on the appearance frequency of each word based on the word groups included in the first document information and the second document information (S31).
Note that the coordinate axis calculation processing in S31 uses principal component analysis, which is a common multivariate analysis technique. For example, as basic data used for coordinate calculation, a matrix of “1 (appearance)” and “0 (non-appearance)” indicates whether or not the target word appears for each document (or each block). An index called “principal component load amount” calculated as a result is used as an actual coordinate.

次に、分析装置１は、第１文献情報を予め設定された所定単位のブロックにそれぞれ分割する（Ｓ３２）。上述のように、例えば、段落単位や請求項単位等のように、各文献情報をそれぞれ分割することができる。ここで、所定単位とは、意味内容上の単位であり、ブロックのデータサイズとは直接関係しない点に留意すべきである。各ブロックのデータサイズは、それぞれに含まれているデータ量によって相違する。 Next, the analyzer 1 divides the first document information into predetermined units of blocks (S32). As described above, each document information can be divided, for example, in units of paragraphs or units of claims. Here, it should be noted that the predetermined unit is a unit on the semantic content and is not directly related to the data size of the block. The data size of each block differs depending on the amount of data included in each block.

次に、分析装置１は、第１文献情報を分割してなる第１ブロック群の各ブロックについて、以下のＳ３４〜Ｓ３７をそれぞれ実行する（Ｓ３３）。分析装置１は、一つの第１ブロック１１１を選択して、この第１ブロック１１１の内容を構文解析し（Ｓ３４）、その第１ブロック１１１に含まれている単語を全て抽出する（Ｓ３５）。 Next, the analyzer 1 executes the following S34 to S37 for each block of the first block group obtained by dividing the first document information (S33). The analysis apparatus 1 selects one first block 111, parses the contents of the first block 111 (S34), and extracts all the words included in the first block 111 (S35).

分析装置１は、そのブロック１１１に含まれている各単語の座標をそれぞれ求めて、その平均値を算出する。そして、分析装置１は、この算出された平均値を、そのブロック１１１の座標として使用する（Ｓ３６）。分析装置１は、算出された座標位置に、そのブロック１１１を所定の表示形態で配置させる（Ｓ３７）。所定の表示形態とは、例えば、そのブロック１１１を示す図形の形状や色彩等を意味する。以上の処理を第１ブロック群の全ブロック１１１について繰り返すことにより、第１文献情報はブロック単位で分割されて、マップ１００に分散して表示される。 The analysis device 1 obtains the coordinates of each word included in the block 111 and calculates the average value. Then, the analyzer 1 uses the calculated average value as the coordinates of the block 111 (S36). The analysis apparatus 1 places the block 111 in a predetermined display form at the calculated coordinate position (S37). The predetermined display form means, for example, the shape or color of a graphic showing the block 111. By repeating the above processing for all the blocks 111 in the first block group, the first document information is divided into blocks and displayed dispersed on the map 100.

次に、分析装置１は、第１ブロック１１１と同じ単位で、第２文献情報を複数のブロック１１２にそれぞれ分割する（Ｓ３８）。分析装置１は、第２文献情報を分割してなる第２ブロック群の各ブロック１１２について、以下のＳ４０〜Ｓ４７をそれぞれ実行する（Ｓ３９）。 Next, the analyzer 1 divides the second document information into a plurality of blocks 112 in the same unit as the first block 111 (S38). The analysis apparatus 1 executes the following S40 to S47 for each block 112 of the second block group obtained by dividing the second document information (S39).

分析装置１は、一つの第２ブロック１１２を選択して、この第２ブロック１１２の内容を構文解析し（Ｓ４０）、そこに含まれている単語を全て抽出する（Ｓ４１）。分析装置１は、そのブロック１１２に含まれている各単語の座標の平均値を算出することにより、その第２ブロック１１２の座標を決定する（Ｓ４２）。 The analysis apparatus 1 selects one second block 112, parses the contents of the second block 112 (S40), and extracts all the words contained therein (S41). The analysis apparatus 1 determines the coordinates of the second block 112 by calculating the average value of the coordinates of each word included in the block 112 (S42).

分析装置１は、算出された座標位置に、その第２ブロック１１２を所定の表示形態で配置させる（Ｓ４３）。以上の処理を第２ブロック群の全ブロック１１２について繰り返すことにより、第２文献情報はブロック単位で分割されて、マップ１００に分散して表示される。 The analysis apparatus 1 arranges the second block 112 in a predetermined display form at the calculated coordinate position (S43). By repeating the above processing for all the blocks 112 in the second block group, the second document information is divided into blocks and displayed dispersed on the map 100.

次に、接続線設定処理が実行される。分析装置１は、その第２ブロック１１２の座標から所定範囲内に第１ブロック１１１が存在するか否かを判定する（Ｓ４４）。第２ブロック１１２から所定範囲内に第１ブロック１１１が存在する場合（S44:YES）、その第２ブロック１１２と第１ブロック１１１の内容はほぼ同一であると判断できる。従って、この場合は、接続線１１３を設定せずに、次の第２ブロック１１２の処理に移る。 Next, connection line setting processing is executed. The analyzer 1 determines whether or not the first block 111 exists within a predetermined range from the coordinates of the second block 112 (S44). When the first block 111 exists within the predetermined range from the second block 112 (S44: YES), it can be determined that the contents of the second block 112 and the first block 111 are substantially the same. Therefore, in this case, the process proceeds to the next second block 112 without setting the connection line 113.

これに対し、その第２ブロック１１２の位置から所定範囲内に第１ブロック１１１が存在しない場合（S44:NO）、第２ブロック１１２と実質的に同一内容の第１ブロック１１１が、その第２ブロック１１２の近傍に位置しない場合である。即ち、その第２ブロック１１２とは内容の異なる第１ブロック１１１が、その第２ブロック１１２の周辺に存在すると判断可能な場合である（Ｓ４５）。 On the other hand, when the first block 111 does not exist within the predetermined range from the position of the second block 112 (S44: NO), the first block 111 having substantially the same content as the second block 112 is the second block 112. This is a case where it is not located near the block 112. That is, it is possible to determine that the first block 111 having a different content from the second block 112 exists around the second block 112 (S45).

そこで、分析装置１は、その第２ブロック１１２から最も近い第１ブロック１１１を検索し（Ｓ４６）、検索された最短距離の第１ブロック１１１と第２ブロック１１２とを接続線１１３で接続する（Ｓ４７）。 Therefore, the analysis apparatus 1 searches for the first block 111 closest to the second block 112 (S46), and connects the searched first block 111 and the second block 112 with the shortest distance through the connection line 113 ( S47).

ここで、分析装置１は、各ブロック１１１，１１２間の距離に応じて、接続線１１３の表示形態を変更させることができる。例えば、接続線表示形態管理テーブルＴ１に、ブロック間の距離と線種との関係を予め登録しておけば、分析装置１は、各ブロック間の距離に応じた線種で、接続線１１３を表示させることができる。ここにいう「線種」とは、線の種類（実線、点線、一点鎖線等）、線の太さ、線の色のいずれか一つまたは複数を含む概念である。 Here, the analyzer 1 can change the display form of the connection line 113 according to the distance between the blocks 111 and 112. For example, if the relationship between the distance between the blocks and the line type is registered in advance in the connection line display form management table T1, the analysis apparatus 1 sets the connection line 113 with the line type corresponding to the distance between the blocks. Can be displayed. Here, the “line type” is a concept including one or more of a line type (solid line, dotted line, alternate long and short dash line, etc.), line thickness, and line color.

次に、図６は、ブロック間の内容を対比するブロック間対比処理を示すフローチャートである。この処理は、図５中のＳ２０に対応する。分析装置１は、マップ１００上の接続線１１３のいずれかがユーザによって指定されたか否かを判定する（Ｓ５１）。 Next, FIG. 6 is a flowchart showing an inter-block comparison process for comparing contents between blocks. This process corresponds to S20 in FIG. The analysis apparatus 1 determines whether any of the connection lines 113 on the map 100 is designated by the user (S51).

接続線１１３がユーザによって指定されると（S51:YES）、分析装置１は、その指定された接続線１１３で結ばれている第１ブロック１１１と第２ブロック１１３とをそれぞれ特定する（Ｓ５２）。分析装置１は、第１ブロック１１１の内容及び第２ブロック１１２の内容をそれぞれ取得する（Ｓ５３，Ｓ５４）。 When the connection line 113 is designated by the user (S51: YES), the analysis apparatus 1 specifies the first block 111 and the second block 113 that are connected by the designated connection line 113 (S52). . The analyzer 1 acquires the contents of the first block 111 and the contents of the second block 112 (S53, S54).

そして、分析装置１は、第１ブロック１１１の内容と第２ブロック１１２の内容とを比較し（Ｓ５５）、その一致部分（共通部分）及び相違部分（差分）をそれぞれ抽出して一時的に記憶する（Ｓ５６）。分析装置１は、抽出された一致部分及び相違部分をマップ１００に表示させる（Ｓ５７）。 Then, the analysis apparatus 1 compares the contents of the first block 111 and the contents of the second block 112 (S55), and extracts and temporarily stores the matching part (common part) and the different part (difference). (S56). The analysis apparatus 1 displays the extracted matching part and different part on the map 100 (S57).

図７は、対比結果表示部２４による表示例を示す説明図である。例えば、各ブロック１１１，１１２の原文を最上段に表示し、次の段に両ブロック１１１，１１２の一致部分を表示し、最後の段に両ブロック１１１，１１２の相違部分を表示することができる。 FIG. 7 is an explanatory diagram illustrating a display example by the comparison result display unit 24. For example, the original text of each block 111, 112 can be displayed at the top, the matching part of both blocks 111, 112 can be displayed at the next level, and the difference between both blocks 111, 112 can be displayed at the last level. .

なお、これに限らず、例えば、下線や取消線等を用いて、各ブロック１１１，１１２の原文中に一致部分及び相違部分を同時に表示させる構成でもよい。 However, the present invention is not limited to this, and for example, a configuration may be used in which a matching portion and a different portion are simultaneously displayed in the original text of each of the blocks 111 and 112 by using an underline or strikethrough.

また、対比するレベルは、単語レベル、フレーズレベル、センテンスレベルのいずれかまたは複数をユーザが自由に設定することができる。 In addition, the user can freely set one or more of a word level, a phrase level, and a sentence level as levels to be compared.

次に、図８は、クライアント端末２の画面に表示されるマップ生成メニュー画面Ｇ１の一例を示す説明図である。例えば、メニュー画面Ｇ１には、文献指定部Ｇ２と、比較範囲指定部Ｇ３と、ブロック単位指定部Ｇ４と、関連文献検索ボタンＢ１と、実行指示ボタンＢ２及び取消ボタンＢ３とを含めることができる。 Next, FIG. 8 is an explanatory diagram showing an example of the map generation menu screen G1 displayed on the screen of the client terminal 2. For example, the menu screen G1 can include a document specifying part G2, a comparison range specifying part G3, a block unit specifying part G4, a related document search button B1, an execution instruction button B2, and a cancel button B3.

文献指定部Ｇ２には、ユーザが第１文献情報及び第２文献情報の文献番号をそれぞれ直接入力することができる。あるいは、ユーザは、いずれか一方の文献番号のみを入力した後、関連文献検索ボタンＢ１を操作することにより、その入力済の文献番号に関連する文献番号を分析装置１に自動的に検索させてセットさせることもできる。 The document designation part G2 allows the user to directly input the document numbers of the first document information and the second document information. Alternatively, after the user inputs only one of the document numbers, the user operates the related document search button B1 to cause the analyzer 1 to automatically search for a document number related to the input document number. It can also be set.

比較範囲指定部Ｇ３では、各文献情報のどの部分について比較するかを指定する。例えば、文献の全体、特許請求の範囲、発明の詳細な説明、課題を解決するための手段、発明の効果、要約等のように、ユーザは所望の範囲を指定することができる。なお、比較範囲は複数指定することができる。 The comparison range designating part G3 designates which part of each document information is to be compared. For example, the user can specify a desired range such as the whole literature, claims, detailed explanation of the invention, means for solving the problem, effects of the invention, summary, and the like. A plurality of comparison ranges can be specified.

ブロック単位指定部Ｇ４では、ブロックの生成単位を指定する。例えば、請求項単位、段落単位、ページ単位のように、ユーザは所望の単位をいずれか一つまたは複数選択することができる。 The block unit designation unit G4 designates a block generation unit. For example, the user can select any one or a plurality of desired units such as a claim unit, a paragraph unit, and a page unit.

そして、各文献情報を特定するための情報（文献番号）や比較範囲及びブロック単位を指定した後、ユーザが実行指示ボタンＢ２を操作すると、分析装置１は、上述した処理を実行してマップ１００を生成する。 When the user operates the execution instruction button B2 after designating information (reference number) for identifying each piece of document information, a comparison range, and a block unit, the analysis apparatus 1 executes the above-described processing to execute the map 100. Is generated.

本実施例は上述の構成を採用するため、以下の効果を奏する。本実施例では、第１文献情報及び第２文献情報をそれぞれ所定単位のブロックに分割し、互いに関連する第１ブロック１１１と第２ブロック１１２を接続線１１３で接続してマップ１００に表示させる構成とした。従って、各文献情報の全体を単純に対比する場合に比べて、ブロック単位で各文献情報の構造を把握でき、かつ、各文献情報のどの部分がどのように関連しているのかを視覚を通じて直感的に把握することが可能となり、使い勝手が向上する。 Since the present embodiment employs the above-described configuration, the following effects can be obtained. In the present embodiment, the first document information and the second document information are each divided into predetermined units of blocks, and the first block 111 and the second block 112 that are related to each other are connected by the connection line 113 and displayed on the map 100. It was. Therefore, it is possible to grasp the structure of each piece of document information in block units and to intuitively understand which part of each piece of document information is related to how compared to the case of simply comparing the whole pieces of each piece of document information. It becomes possible to grasp it and to improve usability.

本実施例では、第１ブロック１１１と第２ブロック１１２が所定距離以上離れている場合に、これら第１ブロック１１１及び第２ブロック１１２同士を、接続線１１３で接続して表示させる構成とした。従って、互いに関連し、かつ、内容の相違するブロック同士だけを接続線１１３で対応付けることができ、使い勝手が向上する。 In the present embodiment, when the first block 111 and the second block 112 are separated by a predetermined distance or more, the first block 111 and the second block 112 are connected by the connection line 113 and displayed. Therefore, only the blocks that are related to each other and that have different contents can be associated with each other by the connection line 113, and the usability is improved.

本実施例では、第１ブロック１１１と第２ブロック１１２とが所定距離以上離間している場合に、最も近い第１ブロック１１１と第２ブロック１１２とを接続線１１３で接続して表示させる構成とした。従って、互いに関連する可能性が高いブロック同士を接続して可視化することができ、使い勝手が向上する。 In the present embodiment, when the first block 111 and the second block 112 are separated from each other by a predetermined distance or more, the closest first block 111 and the second block 112 are connected by the connection line 113 and displayed. did. Therefore, blocks that are highly likely to be related to each other can be connected and visualized, improving usability.

本実施例では、接続線１１３がユーザによって選択された場合、この接続線１１３で接続されている第１ブロック１１１の内容と第２ブロックの内容１１２とを予め設定された所定のレベルで対比し、この対比結果を出力させる構成とした。従って、ユーザは、接続線１１３を選択するだけで、所望のブロック間の類似度合を簡単に把握することができ、使い勝手が向上する。 In this embodiment, when the connection line 113 is selected by the user, the contents of the first block 111 and the contents 112 of the second block connected by the connection line 113 are compared at a predetermined level set in advance. The comparison result is output. Therefore, the user can easily grasp the degree of similarity between desired blocks simply by selecting the connection line 113, and usability is improved.

本実施例では、各ブロックの一致部分または相違部分のいずれかまたは両方を表示可能な構成とした。従って、各文献情報の共通部分や相違部分をブロック単位で個別に精査することができ、使い勝手が向上する。 In the present embodiment, either one or both of the matching part and the different part of each block can be displayed. Therefore, common parts and different parts of each document information can be individually examined in units of blocks, and usability is improved.

図９〜図１２に基づいて、本発明の第２実施例を説明する。前記第１実施例では、文献情報を複数のブロックに分割し、ブロック単位で比較する場合を説明したが、本実施例では、各文献情報をブロック単位で比較する前に、比較対象の文献情報間の関連性を区別して表示させる。 A second embodiment of the present invention will be described with reference to FIGS. In the first embodiment, the case where the document information is divided into a plurality of blocks and compared in units of blocks has been described. However, in this embodiment, before comparing each document information in units of blocks, the document information to be compared is compared. Differentiate the relationship between them.

図９は、本実施例の全体概要を示す説明図である。本実施例では、文献情報分析装置１の前処理として、ユーザにより選択された文献情報間の関連性を視覚化する。
関連性分析装置３００は、文献情報の関連性を検出し、その関連性に応じた接続線で文献情報間を接続して表示させるものである。関連性分析装置３００は、文献情報分析装置１と同一のコンピュータ装置に設けることもできるし、あるいは文献情報分析装置１とは別のコンピュータ装置に設けることもできる。 FIG. 9 is an explanatory diagram showing an overall outline of the present embodiment. In the present embodiment, as the preprocessing of the document information analyzing apparatus 1, the relationship between the document information selected by the user is visualized.
The relevance analysis apparatus 300 detects relevance of document information, and connects and displays the document information with connection lines corresponding to the relevance. The relevance analysis apparatus 300 can be provided in the same computer apparatus as the document information analysis apparatus 1, or can be provided in a computer apparatus different from the document information analysis apparatus 1.

「文献情報分析装置１の前処理装置」として表現可能な関連性分析装置３００は、例えば、文献選択部３１０と、関連性検出部３２０と、接続線選択部３３０と、表示画面３５０を生成する表示出力部３４０と、選択操作検出部３６０とを備えて構成できる。
関連性分析装置３００は、文献情報分析装置１と同様に、文献データベース３１，単語データベース３２及びインデックスデータベース３３を使用可能である。即ち、これら各データベース３１〜３３は、両分析装置１，３００間で共用されている。 The relevance analysis device 300 that can be expressed as “a preprocessing device of the document information analysis device 1” generates, for example, a document selection unit 310, a relationship detection unit 320, a connection line selection unit 330, and a display screen 350. A display output unit 340 and a selection operation detection unit 360 can be provided.
The relevance analysis apparatus 300 can use the document database 31, the word database 32, and the index database 33, similarly to the document information analysis apparatus 1. In other words, these databases 31 to 33 are shared between both analyzers 1 and 300.

文献選択部３１０は、ユーザから指定された条件に基づいて、文献データベース３１（または文献データベース３１及びインデックスデータベース３３）を検索することにより、ユーザの望む文献情報を抽出するものである。例えば、ユーザは、特許公開公報番号や特許出願番号等のような文献番号を指定することにより、複数の文献情報１１１Ａ，１１２Ａ，１１３Ａを抽出させることができる。 The document selection unit 310 extracts the document information desired by the user by searching the document database 31 (or the document database 31 and the index database 33) based on conditions specified by the user. For example, the user can extract a plurality of document information 111A, 112A, 113A by designating a document number such as a patent publication number or a patent application number.

関連性検出部３２０は、抽出された各文献情報１１１Ａ，１１２Ａ，１１３Ａの関係を判定する。例えば、関連性判定部３２０は、各文献情報１１１Ａ，１１２Ａ，１１３Ａに含まれる書誌事項に基づいて、第１文献情報１１１Ａに関連する第２文献情報１１２Ａを抽出することができる。 The relevance detection unit 320 determines the relationship between the extracted pieces of document information 111A, 112A, and 113A. For example, the relevance determination unit 320 can extract the second document information 112A related to the first document information 111A based on the bibliographic items included in each document information 111A, 112A, 113A.

書誌事項には、例えば、特許分類、出願人氏名、発明者氏名、代理人氏名、出願番号、公開番号、登録番号、出願日、公開日、登録日、特別な出願の表示（優先権主張、分割変更の表示、新規性喪失の例外規定の適用等）、請求項の数、審査請求の有無、先行技術文献の番号等が含まれている。従って、関連性検出部３２０は、書誌事項を解析することにより、互いに関連する複数の文献情報１１１Ａ，１１２Ａを抽出可能である。
図９の表示画面３５０では、第１文献情報１１１Ａを基準として、複数の第２文献情報１１２Ａが第１文献情報１１１Ａに関連している様子を示している。なお、第３文献情報１１３Ａは、第１文献情報１１１Ａと直接的な関連性（例えば、法的関係）はなく、共通するキーワード等を有するだけの文献である。 Bibliographic items include, for example, patent classification, applicant name, inventor name, agent name, application number, publication number, registration number, application date, publication date, registration date, special application indication (priority claim, Display of divisional change, application of exception provision for loss of novelty, etc.), the number of claims, presence / absence of examination request, number of prior art documents, etc. Therefore, the relevance detection unit 320 can extract a plurality of document information 111A and 112A related to each other by analyzing bibliographic items.
The display screen 350 in FIG. 9 shows a state in which a plurality of second document information 112A is related to the first document information 111A with the first document information 111A as a reference. Note that the third document information 113A is a document that does not have a direct relationship (for example, a legal relationship) with the first document information 111A and has only a common keyword or the like.

接続線選択部３３０は、関連性検出部３２０によって検出された関連性に基づいて、接続線表示形態管理テーブルＴ１を参照することにより、文献情報間の関連性に応じた線種の接続線１１３Ａを選択する。 The connection line selection unit 330 refers to the connection line display form management table T1 based on the relevance detected by the relevance detection unit 320, thereby connecting the connection line 113A of the line type according to the relevance between the document information. Select.

表示出力部３４０は、選択された接続線１１３Ａを用いて、各文献情報１１１Ａ，１１２Ａを接続することにより、表示画面３５０を生成する。生成された表示画面３５０は、通信ネットワークを介してクライアント端末に提供される。 The display output unit 340 generates the display screen 350 by connecting the document information 111A and 112A using the selected connection line 113A. The generated display screen 350 is provided to the client terminal via the communication network.

ユーザがクライアント端末を介して、接続線１１３Ａを選択操作すると、この選択操作は、選択操作検出部３６０によって検出される。選択操作検出部３６０は、ユーザの選択操作を検出すると、文献情報分析装置１に分析開始を指示する。これにより、文献情報分析装置１は、上述のように、各文献情報１１１Ａ，１１２Ａを複数のブロックに分割し、対応するブロック同士を接続し、その内容を対比する。 When the user selects the connection line 113A via the client terminal, the selection operation is detected by the selection operation detection unit 360. When the selection operation detection unit 360 detects a user's selection operation, the selection operation detection unit 360 instructs the literature information analysis apparatus 1 to start analysis. Thereby, the literature information analysis apparatus 1 divides each literature information 111A, 112A into a plurality of blocks as described above, connects corresponding blocks, and compares the contents.

図１０は、接続線表示形態管理テーブルＴ１の一例を示す説明図である。このテーブルＴ１には、例えば、複数のテーブルＴ１Ａ，Ｔ１Ｂの少なくともいずれか一方を含めることができる。
一方のテーブルＴ１Ａは、文献情報１１１Ａ，１１２Ａ間の距離に応じた線種を選択するためのものである。例えば、ユーザによって抽出された多数の文献情報群に基づいて設定される座標系において、第１文献情報１１１Ａ，第２文献情報１１２Ａの座標をそれぞれ求め、各文献情報１１１Ａ，１１２Ａ間の距離を算出可能な場合に、テーブルＴ１Ａを使用することができる。このテーブルＴ１Ａでは、例えば、距離に応じて線の太さや種類が異なるように設定されている。 FIG. 10 is an explanatory diagram showing an example of the connection line display form management table T1. The table T1 can include, for example, at least one of a plurality of tables T1A and T1B.
One table T1A is for selecting a line type according to the distance between the document information 111A and 112A. For example, in the coordinate system set based on a large number of document information groups extracted by the user, the coordinates of the first document information 111A and the second document information 112A are obtained, and the distance between the document information 111A and 112A is calculated. Table T1A can be used if possible. In this table T1A, for example, the line thickness and type are set to be different depending on the distance.

他方のテーブルＴ１Ｂは、文献情報１１１Ａ，１１２Ａの関連性に応じた線種を選択するためのものである。関連性としては、上述のように、分割出願や変更出願等のような法的関係を挙げることができる。このテーブルＴ１Ｂでは、両文献情報の関連に応じて、異なる色彩の接続線１１３Ａが設定されている。 The other table T1B is for selecting a line type according to the relevance of the document information 111A and 112A. As described above, as described above, a legal relationship such as a divisional application or a change application can be cited. In this table T1B, connection lines 113A of different colors are set according to the relationship between the two document information.

なお、線の太さ及び色彩に限らず、例えば、接続線１１３Ａを点滅させたり、アニメーション表示させてもよい。また、どのような表示形態で接続線１１３Ａを表示させるかは、ユーザがカスタマイズ可能としてもよい。 For example, the connection line 113A may be blinked or displayed as an animation without being limited to the thickness and color of the line. In addition, the display form of the connection line 113A may be customizable by the user.

次に、図１１は、本実施例の全体動作を示すフローチャートである。
まず、ユーザは、クライアント端末２のユーザインターフェースを介して、検索条件を入力する（Ｓ６０）。検索条件は、例えば、「光触媒」等のようなキーワードを指定することにより、あるいは特許分類コードや文献の公開日等を指定することにより、行うことができる。 Next, FIG. 11 is a flowchart showing the overall operation of this embodiment.
First, the user inputs search conditions via the user interface of the client terminal 2 (S60). The search condition can be performed, for example, by specifying a keyword such as “photocatalyst” or by specifying a patent classification code, a publication date of a document, or the like.

関連性分析装置３００は、クライアント端末２から検索条件を取得すると（Ｓ６１）、この検索条件に基づいてインデックスデータベース３３及び文献データベース３１を検索し、検索条件に合致する文献情報を全て抽出する（Ｓ６２）。より具体的には、ユーザから指定されたキーワードを有する文献情報を抽出するために、インデックスデータベース３３が使用される。そして、指定されたキーワードを有する文献情報が特定されると、その文献情報が文献データベース３１から読み出される。そして、抽出された結果は、関連性分析装置３００からクライアント端末２に送信され（Ｓ６３）、クライアント端末２の画面に表示される（Ｓ６４）。 When acquiring the search condition from the client terminal 2 (S61), the relevance analysis apparatus 300 searches the index database 33 and the document database 31 based on the search condition, and extracts all the document information that matches the search condition (S62). ). More specifically, the index database 33 is used to extract document information having a keyword designated by the user. When document information having the specified keyword is specified, the document information is read from the document database 31. Then, the extracted result is transmitted from the relevance analysis device 300 to the client terminal 2 (S63) and displayed on the screen of the client terminal 2 (S64).

ユーザは、クライアント端末２の画面を介して、抽出された文献の総数や文献名称等を確認し、少なくとも一つの文献情報（第１文献情報１１１Ａ）を選択する（Ｓ６５）。
関連性分析装置３００は、書誌事項を解析することにより、ユーザによって選択された第１文献情報１１１Ａに関連する第２文献情報１１２Ａを全て抽出する（Ｓ６６）。 The user confirms the total number of extracted documents, document names, and the like via the screen of the client terminal 2, and selects at least one document information (first document information 111A) (S65).
The relevance analyzing apparatus 300 extracts all the second document information 112A related to the first document information 111A selected by the user by analyzing the bibliographic items (S66).

関連性分析装置３００は、抽出された各第２文献情報１１２Ａと第１文献情報１１１Ａとの関連性に応じた接続線１１３Ａを選択し（Ｓ６８）、各文献情報１１１Ａ，１１２Ａ同士を接続して（Ｓ６９）、クライアント端末の画面に表示させる（Ｓ７０）。 The relevance analysis apparatus 300 selects the connection line 113A according to the relevance between the extracted second document information 112A and the first document information 111A (S68), and connects the document information 111A and 112A to each other. (S69), it is displayed on the screen of the client terminal (S70).

ユーザがいずれかの接続線１１３Ａをクリック操作等して選択すると（Ｓ７１）、関連性分析装置３００は、文献情報分析装置１に分析開始を指示する（Ｓ７２）。これにより、第１実施例で述べた処理が開始され、マップ１００が生成される。 When the user selects one of the connection lines 113A by clicking or the like (S71), the relevance analysis apparatus 300 instructs the literature information analysis apparatus 1 to start analysis (S72). Thereby, the processing described in the first embodiment is started, and the map 100 is generated.

なお、Ｓ６０〜Ｓ６４は省略可能である。即ち、ユーザは、第１文献情報１１１Ａ及び第２文献情報１１２Ａの文献番号をそれぞれ明示して指定することもできる。また、ユーザは、接続線１１３Ａの選択操作に代えて、表示画面３５０に表示されるメニューを操作することにより、第１実施例で述べた処理に移行させることもできる。 Note that S60 to S64 can be omitted. That is, the user can explicitly specify the document numbers of the first document information 111A and the second document information 112A. Further, the user can shift to the processing described in the first embodiment by operating a menu displayed on the display screen 350 instead of the selection operation of the connection line 113A.

このように構成される本実施例では、関連する文献情報１１１Ａ，１１２Ａ間の関連性を視覚を通じて直感的に判断することができる。そして、両文献情報１１１Ａ，１１２Ａ間の関係を把握した上で、各文献情報１１１Ａ，１１２Ａをブロック単位で比較し、詳細に一致点や相違点を検討することができ、使い勝手が向上する。 In this embodiment configured as described above, it is possible to intuitively determine the relationship between the related document information 111A and 112A through vision. Then, after grasping the relationship between the two document information 111A and 112A, each document information 111A and 112A can be compared in block units, and the coincidence point or difference can be examined in detail, thereby improving the usability.

なお、本発明は、上述した実施の形態に限定されない。当業者であれば、本発明の範囲内で、種々の追加や変更等を行うことができる。例えば、第１実施例では、第１文献情報及び第２文献情報に含まれる単語群から座標系を決定する場合を述べたが、これに限らず、より多数の文献情報に含まれる単語群に基づいて座標系を決定してもよい。
また、第１実施例では、文献情報を複数のブロックに分割して対比する場合を述べたが、これに限らず、文献情報の全体を一つのブロックとして扱い、文献全体同士を対比させることもできる。 The present invention is not limited to the above-described embodiment. A person skilled in the art can make various additions and changes within the scope of the present invention. For example, in the first embodiment, the case where the coordinate system is determined from the word group included in the first document information and the second document information is described. However, the present invention is not limited to this, and the word group included in a larger number of document information. A coordinate system may be determined based on this.
In the first embodiment, the case where the document information is divided into a plurality of blocks and compared is described. However, the present invention is not limited to this, and the entire document information may be treated as one block and the entire documents may be compared. it can.

本発明の実施形態に係る文献情報分析装置の全体を示す説明図である。It is explanatory drawing which shows the whole literature information analyzer which concerns on embodiment of this invention. 文献情報分析装置の機能ブロック図である。It is a functional block diagram of a literature information analysis device. 文献情報分析装置のハードウェア及びソフトウェアの構成概要を示す説明図である。It is explanatory drawing which shows the hardware and software structure outline | summary of a literature information analyzer. 文献情報分析方法の全体動作を示すフローチャートである。It is a flowchart which shows the whole operation | movement of a literature information analysis method. 文献情報を複数のブロックに分割してマップを生成するための処理を示すフローチャートである。It is a flowchart which shows the process for dividing | segmenting literature information into a some block and producing | generating a map. ブロック間の内容を対比させるための処理を示すフローチャートである。It is a flowchart which shows the process for comparing the content between blocks. ブロック間の対比結果の一例を示す説明図である。It is explanatory drawing which shows an example of the comparison result between blocks. マップ生成条件を指定するためのメニュー画面の一例を示す説明図である。It is explanatory drawing which shows an example of the menu screen for designating map generation conditions. 本発明の第２実施例に係る関連性分析装置を備えた文献情報分析装置の全体を示す説明図である。It is explanatory drawing which shows the whole literature information analyzer provided with the relevance analyzer which concerns on 2nd Example of this invention. 接続線表示形態管理テーブルを示す説明図である。It is explanatory drawing which shows a connection line display form management table. 全体動作を示すフローチャートである。It is a flowchart which shows whole operation | movement.

Explanation of symbols

１…文献情報分析装置、２…クライアント端末、１０…マップ制御部、１１…マップ生成条件入力受付部、１２…文献情報分割部、１３…ブロック座標算出部、１４…ブロック間類似度判定部、１５…マップ生成部、１６…マップ表示部、２０…対比制御部、２１…接続線指定情報入力受付部、２２…ブロック対比部、２３…対比結果出力部、２４…対比結果表示部、３０…情報蓄積部、３１…文献データベース、３２…単語データベース、１００…マップ、１０１Ａ，１０１Ｂ…等高線、１１１，１１２…ブロック、３００…関連性分析装置、３１０…文献選択部、３２０…関連性検出部、３３０…接続線選択部、３４０…表示出力部、３５０…表示画面、３６０…選択操作検出部
DESCRIPTION OF SYMBOLS 1 ... Literature information analyzer, 2 ... Client terminal, 10 ... Map control part, 11 ... Map generation condition input reception part, 12 ... Literature information division part, 13 ... Block coordinate calculation part, 14 ... Inter-block similarity determination part, DESCRIPTION OF SYMBOLS 15 ... Map production | generation part, 16 ... Map display part, 20 ... Comparison control part, 21 ... Connection line designation | designated information input reception part, 22 ... Block comparison part, 23 ... Comparison result output part, 24 ... Comparison result display part, 30 ... Information storage unit, 31 ... literature database, 32 ... word database, 100 ... map, 101A, 101B ... contour, 111, 112 ... block, 300 ... relevance analyzer, 310 ... literature selection unit, 320 ... relevance detection unit, 330 ... Connection line selection unit, 340 ... Display output unit, 350 ... Display screen, 360 ... Selection operation detection unit

Claims

A document information storage unit that stores a plurality of document information, each of which is a patent document related to an electronic patent application or utility model registration application ,
The first document information and the second document information selected according to the map generation condition input from the user among the stored document information are respectively acquired from the document information storage unit, and the first document information and the second document information are acquired. A document information acquisition unit that divides and processes document information into blocks of a predetermined unit that is either a paragraph unit or a claim unit or both based on the map generation condition ,
Each of the blocks included in each block in a coordinate system determined by using principal component analysis based on the frequency of appearance of each word or each block of word information included in the first document information and the second document information and the block coordinate calculation unit configured to calculate an average value of coordinates in the coordinate system of the word as the coordinates of the blocks,
The respective blocks are arranged based on the coordinates of the respective blocks, and the second of the blocks is separated from a predetermined first block derived from the first document information by a predetermined distance in the coordinate system. a map generating unit for generating map information by a predetermined second block derived from literature data, connected by Viewing elements,
A map display unit for displaying the generated map information;
A bibliographic information analyzer.

When there are a plurality of at least one of the first block and the second block that are separated by the predetermined distance or more, the map generation unit determines the closest first block and second block among them. Bibliography analyzer according to claim 1 which is connected in front Symbol Display element.

If the previous SL Display element is selected by the user, the content and the second block of the content and the word level of the first block being connected by the selected table shown elements, phrase level, among the sentence level The literature information analysis apparatus according to claim 2 , further comprising: a block comparison unit that compares at least one level selected by the user and outputs a comparison result.

Said block comparison unit, document information analyzer according to different part of the content of content and the second block of the connected first block in the previous Symbol Display element according to claim 3 for output as the comparison result.

Said block comparison unit, document information analyzing apparatus according to the intersection of the content before Symbol Display element connected the first block with the content of the second block in claim 3 for output as the comparison result.

Before SL document information obtaining section, among the first document information and the second document information is selected, the literature of claim 2 for dividing only the range selected by the user into blocks of the predetermined unit Information analysis device.

Among the plurality of document information stored in the document information storage unit, the document information having a predetermined legal relationship is determined based on the bibliographic items of each document information, and the document information having the legal relationship is determined. A pre-processing unit for displaying by connecting with a connection line according to the legal relationship;
When the connection line is selected by a user, the document information acquisition unit converts each document information connected by the connection line from the document information storage unit as the first document information and the second document information, respectively. get,
The literature information analyzer according to any one of claims 1 to 6.

Computer
The first document information and the second document information selected by the map generation condition input from the user from among a plurality of document information that is a patent document or a patent document relating to a utility model registration application electronically stored in the document information storage unit Document information division for acquiring document information and dividing the first document information and the second document information into blocks of a predetermined unit that is either a paragraph unit or a claim unit or both based on the map generation condition. Means,
Each word included in each block in the coordinate system determined by using principal component analysis based on the frequency of appearance of each word information or block for each word included in the first document information and the second document information and block coordinates calculating means for calculating respective average values of the coordinates as the coordinates of the respective blocks in the coordinate system of
The respective blocks are arranged based on the coordinates of the respective blocks, and the second of the blocks is separated from a predetermined first block derived from the first document information by a predetermined distance in the coordinate system. and a predetermined second block derived from literature data, a map generating means for generating map information by connecting with Viewing elements,
Each program that functions.

The first document information and the second document selected by the computer based on the map generation condition input from the user from among a plurality of document information that is a patent application or a patent document related to the utility model registration application stored in the document information storage unit Obtaining each information step;
The computer dividing the first document information and the second document information into blocks of a predetermined unit which is either a paragraph unit or a claim unit or both based on the map generation condition ;
The computer, in the coordinate system is determined by using a principal component analysis based on the appearance frequency of each document information or each block of words contained in the first document information and the second document information, wherein in each block calculating respective average values of coordinates in the coordinate system of each word contained as the coordinates of the blocks,
The computer placing each block on a map according to the coordinate system based on the coordinates of each block;
The computer has a predetermined first block derived from the second document information that is separated from the predetermined first block derived from the first document information by a predetermined distance in the coordinate system among the blocks arranged on the map. and generating the map information by the two blocks, connected by Viewing elements,
The computer displaying the generated map information; and
Said computer, said if the selected pre-Symbol Display element by the user, the selected Viewing elements by the connected with the contents of the first block is the second block of the content and the word level, phrasal, compared with at least one of the level selected by the user out of the sentence level, document information analyzing method for performing the step of outputting the comparison result.