JP2006092091A - Document structuring device and document structuring method

Info

Publication number: JP2006092091A
Application number: JP2004274712A
Authority: JP
Inventors: Kazuo Ishida; 和生石田; Meguri Takada; 巡高田; Tetsuo Ishita; 哲夫井下
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2004-09-22
Filing date: 2004-09-22
Publication date: 2006-04-06

Abstract

PROBLEM TO BE SOLVED: To provide a document structuring device that reduces the cost of producing the extraction rules or document-structure logical models otherwise prepared for each target document, and that improves structure extraction accuracy merely through users browsing the document.

SOLUTION: The document structuring device has: a structure recognition means 102 that recognizes a document structure from stored document data 101a and stores it as document structure information 103a; a distribution means 104 that distributes the stored document data to a client terminal 105, acquires a history of the parts browsed by a user, and stores it as browsing history information 106a; and a correlation calculation means 107 that calculates the correlation between document structure blocks from the document structure information 103a and the browsing history information 106a. The structure recognition means 102 receives the correlation between the document structure blocks calculated by the correlation calculation means 107 and re-recognizes the document structure.

COPYRIGHT: (C)2006, JPO&NCIPI

Description

The present invention relates to an apparatus and method for recognizing document structure from document image data and converting it into structured text, and more particularly to a document structuring apparatus and document structuring method that improve document structure recognition results by using the browsing history of document viewers.

When paper documents are digitized for distribution and reuse, it has been proposed to perform character recognition and recognition of the document structure (the semantic structure that makes up a document, such as titles, chapters, sections, and paragraphs), and then to use the structure information for search (for example, documents whose title contains "Internet") and for partial distribution (for example, distributing only the second paragraph of a document). Using structure information improves search accuracy in the former case and communication efficiency in the latter, but the degree of improvement depends heavily on the document structure recognition result.

Various methods have been proposed for recognizing document structure from document data. For example, Patent Document 1 recognizes document structure using the layout information of the document. Patent Documents 2 and 3 recognize document structure by pattern matching between extraction rules and recognized character strings. Patent Document 4 proposes a method of automatically adding structure information by using the operation history (copy, paste, and so on) recorded when a document is created with a word processor or similar tool.

On the other hand, Patent Document 5 describes a method for generating a term relation network that sufficiently reflects the intentions of document users. Based on the display history of documents shown on a document display device by a document user, the method analyzes the display transition relationships between the displayed document parts, calculates the degree of relevance between terms extracted from document parts that have a display transition relationship, and generates a term relation network linking terms whose relevance is at or above a predetermined value. However, Patent Document 5 does not describe using the display history for the analysis of the document structure itself.

Cited patent documents: JP-A-11-184894; JP-A-8-287189; JP-A-9-319747; JP-A-10-222506; JP-A-10-222532

The document structuring method of Patent Document 4 uses the operation history recorded during document creation, so the system required by Patent Document 4 must be in use from the document creation stage. It therefore cannot be applied to documents that are already complete or that have been output on paper.

By contrast, the document structuring methods of Patent Documents 1, 2, and 3, which extract document structure using a logical model of the document layout or structure extraction rules, do not have that problem. However, recognition accuracy is low unless the logical model or structure extraction rules that serve as the reference for recognition match the target document precisely. Improving accuracy has conventionally required fitting the reference logical model or extraction rules precisely to the target document, which takes considerable labor. Moreover, a logical model or set of extraction rules fitted precisely to one particular target document becomes specialized to that document, so accuracy drops for any document that differs even slightly, and the approach lacks versatility.

An object of the present invention is to provide a novel document structuring apparatus and document structuring method that improve the recognition accuracy of document structure without modifying the reference logical model of the document layout or the structure extraction rules.

The document structuring apparatus of the present invention comprises: document storage means for storing document data; document structure storage means for storing document structure information of the document data; browsing history storage means for storing browsing history information; and a processing unit that acquires the history of browsed locations of the document data when the document data stored in the document storage means is distributed to a document browsing terminal, accumulates that history as browsing history information in the browsing history storage means, calculates the correlation between document structure blocks in the document structure information of the document data using the accumulated browsing history information, and re-recognizes the document structure of the document data using the calculated correlation.

More specifically, the first document structuring apparatus of the present invention comprises: document storage means for storing document data; document structure storage means for storing document structure information; structure recognition means that recognizes the document structure from the document data stored in the document storage means and accumulates it as document structure information in the document structure storage means; browsing history storage means for storing browsing history information; distribution means that refers to the accumulated document structure information and distributes part or all of the document data to a document browsing terminal; browsing history acquisition means that acquires the history of browsed locations of the document distributed to the document browsing terminal and accumulates it as browsing history information in the browsing history storage means; and correlation calculation means that calculates the correlation between document structure blocks in the accumulated document structure information using the accumulated browsing history information. The structure recognition means receives the correlation between document structure blocks calculated by the correlation calculation means and re-recognizes the document structure.

In the second document structuring apparatus of the present invention, the structure recognition means of the first document structuring apparatus updates the accumulated document structure information based on the result of re-recognition.

In the third document structuring apparatus of the present invention, which builds on the first or second document structuring apparatus, the distribution means distributes, in a single delivery, document data of a size corresponding to the display area of the document browsing terminal, and the browsing history acquisition means acquires the information of the document structure blocks contained in the distributed document data as the history of browsed locations of the document.

In the fourth document structuring apparatus of the present invention, which builds on the third document structuring apparatus, the browsing history acquisition means includes in the browsing history information the time at which the document data was distributed.
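The viewport-sized delivery and timestamped history described for the third and fourth apparatuses can be sketched as follows. This is only an illustration under stated assumptions: the `HistoryRecord` type, the `record_delivery` function, the rectangle representation, and the use of epoch seconds are all hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# (left, top, right, bottom) in page coordinates; a hypothetical representation.
Rect = Tuple[int, int, int, int]


@dataclass
class HistoryRecord:
    """One delivery event: which blocks were sent to the terminal, and when."""
    document_id: str
    block_ids: List[str]
    delivered_at: float  # assumed to be seconds since the epoch


def overlaps(a: Rect, b: Rect) -> bool:
    """Return True when the two rectangles share a non-empty area."""
    return a[0] < b[2] and b[0] < a[2] and a[1] < b[3] and b[1] < a[3]


def record_delivery(document_id: str, blocks: Dict[str, Rect],
                    viewport: Rect, now: float) -> HistoryRecord:
    """Build the history record for one viewport-sized delivery.

    Every block that intersects the delivered viewport is treated as a
    browsed location, and the delivery time is stored with it.
    """
    visible = [bid for bid, rect in blocks.items() if overlaps(rect, viewport)]
    return HistoryRecord(document_id, visible, now)
```

Treating any block that overlaps the viewport as "browsed" is a simplification; a stricter variant might require that most of the block be visible.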

In the fifth document structuring apparatus of the present invention, which builds on any of the first to fourth document structuring apparatuses, the correlation calculation means calculates the correlation between document structure blocks each time a fixed number of browsing history records has been obtained, each time a fixed period has elapsed, each time an instruction is received from the document browsing terminal, or at a timing given by any combination of these.

In the sixth document structuring apparatus of the present invention, which builds on the first or second document structuring apparatus, when the multiple browsing history records accumulated in the browsing history storage means for the same document contain several different transitions of browsed location starting from the same document structure block, the correlation calculation means calculates the correlation between document structure blocks based on the transition that accounts for the larger number of records.
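The majority rule for competing transitions can be illustrated with a short sketch. It assumes each browsing history record is simply the ordered list of block identifiers a reader visited; the function name `majority_transitions` and that data layout are hypothetical, and the patent text here does not specify how the correlation value itself is computed.

```python
from collections import Counter, defaultdict
from typing import Dict, List


def majority_transitions(sessions: List[List[str]]) -> Dict[str, str]:
    """Map each starting block to the successor block seen most often.

    `sessions` holds one list of visited block ids per browsing history
    record. When several transitions start from the same block, the one
    that occurs in the most records is kept.
    """
    counts: Dict[str, Counter] = defaultdict(Counter)
    for visited in sessions:
        # Pair every block with the block viewed immediately after it.
        for src, dst in zip(visited, visited[1:]):
            counts[src][dst] += 1
    return {src: c.most_common(1)[0][0] for src, c in counts.items()}
```

For example, if two readers move from block 502 to block 503 and one moves from 502 to 504, the sketch keeps the transition 502 to 503.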

The first document structuring method of the present invention comprises the following steps:
a) the structure recognition means recognizes the document structure from the document data stored in the document storage means and accumulates it as document structure information in the document structure storage means;
b) the distribution means refers to the accumulated document structure information and distributes part or all of the document data to a document browsing terminal;
c) the browsing history acquisition means acquires the history of browsed locations of the document distributed to the document browsing terminal and accumulates it as browsing history information in the browsing history storage means;
d) the correlation calculation means calculates the correlation between document structure blocks in the accumulated document structure information using the accumulated browsing history information; and
e) the structure recognition means receives the correlation between document structure blocks calculated by the correlation calculation means and re-recognizes the document structure.

The second document structuring method of the present invention adds to the first document structuring method the step of: f) the structure recognition means updating the accumulated document structure information based on the result of re-recognition.

In the third document structuring method of the present invention, which builds on the first or second document structuring method, the distribution means distributes, in a single delivery, document data of a size corresponding to the display area of the document browsing terminal, and the browsing history acquisition means acquires the information of the document structure blocks contained in the distributed document data as the history of browsed locations of the document.

In the fourth document structuring method of the present invention, which builds on the third document structuring method, the browsing history acquisition means includes in the browsing history information the time at which the document data was distributed.

In the fifth document structuring method of the present invention, which builds on any of the first to fourth document structuring methods, the correlation calculation means calculates the correlation between document structure blocks each time a fixed number of browsing history records has been obtained, each time a fixed period has elapsed, each time an instruction is received from the document browsing terminal, or at a timing given by any combination of these.

In the sixth document structuring method of the present invention, which builds on the first or second document structuring method, when the multiple browsing history records accumulated in the browsing history storage means for the same document contain several different transitions of browsed location starting from the same document structure block, the correlation calculation means calculates the correlation between document structure blocks based on the transition that accounts for the larger number of records.

"Operation"
In the present invention, the history of browsed locations of document data is acquired when the document data is distributed to a document browsing terminal and is accumulated as browsing history information. The accumulated browsing history information is used to calculate the correlation between document structure blocks in the document structure information of that document data, and the calculated correlation is used to re-recognize the document structure of the document data.

More specifically, the structure recognition means recognizes the document structure, such as document structure blocks (each a coherent region such as a title or a paragraph), from the document data stored in the document storage means and accumulates it as document structure information in the document structure storage means. The distribution means refers to the accumulated document structure information and distributes part or all of the document data to the document browsing terminal. The browsing history acquisition means acquires the history of browsed locations of the document distributed to the document browsing terminal and accumulates it as browsing history information in the browsing history storage means. The correlation calculation means calculates the correlation between document structure blocks in the accumulated document structure information using the accumulated browsing history information. The structure recognition means receives the correlation calculated by the correlation calculation means and re-recognizes the document structure.

Because users of a document browsing terminal tend to keep reading through document structure blocks that continue one another, the continuity between document structure blocks, which is one kind of document structure, can be detected from the history of browsed locations, and the detection result can be used to improve the document structure recognition result. Since the present invention improves the recognition result using browsing history information in this way, it reduces the effort of creating, for each target document, the logical model or extraction rules used by the structure recognition means. And because it does not need the operation history from document creation, it can also be applied to documents that are already complete or that have been output on paper.

The first effect is that the recognition accuracy of document structure can be improved without modifying the reference logical model of the document layout or the structure extraction rules, which reduces the effort of creating a logical model or extraction rules for each target document. This is because the document structure recognition result is improved using the browsing history information of the document.

The second effect is versatility. This is because the document structure recognition result is improved using the browsing history information of the document, which depends little on the particular target document.

The third effect is that the invention can be applied to documents that are already complete or that have been output on paper. This is because the operation history from document creation is not required.

Next, the best mode for carrying out the present invention is described in detail with reference to the drawings.

"First Embodiment"
Referring to FIG. 1, a document structuring apparatus 110 according to the first embodiment of the present invention comprises a document storage unit 101 that stores document data 101a, a document structure storage unit 103 that stores document structure information 103a, a browsing history storage unit 106 that stores browsing history information 106a, and a processing unit 109 connected to these. The processing unit 109 can communicate with one or more client terminals 105, which are document browsing terminals, via a network such as the Internet.

The client terminal 105 is a mobile phone, a personal digital assistant, a personal computer, or the like, and has a function of displaying the document data distributed from the distribution means 104 on a display device and presenting it to the user. It also has a function of requesting the distribution means 104 to distribute the next document data in response to a user operation.

The processing unit 109 is a workstation, a personal computer, or the like, and comprises: a structure recognition means 102 that recognizes the document structure from the document data 101a stored in the document storage unit 101 and accumulates it as document structure information 103a in the document structure storage unit 103; a distribution means 104 that refers to the document structure information 103a accumulated in the document structure storage unit 103 and distributes part or all of the document data 101a stored in the document storage unit 101 to the client terminal 105; a browsing history acquisition means 108 that acquires the history of browsed locations of the document data 101a distributed to the client terminal 105 and accumulates it as browsing history information 106a in the browsing history storage unit 106; and a correlation calculation means 107 that calculates the correlation between document structure blocks in the document structure information 103a accumulated in the document structure storage unit 103, using the browsing history information 106a accumulated in the browsing history storage unit 106. The structure recognition means 102 also receives the correlation between document structure blocks calculated by the correlation calculation means 107 and re-recognizes the document structure.

Next, an outline of the overall operation of this embodiment is described with reference to FIG. 1 and the flowchart of FIG. 2.

The document structuring apparatus 110 first reads from the document storage unit 101 the document data 101a, obtained by digitizing the pages of the document whose structure is to be recognized. Using a known technique such as those of Patent Documents 1 to 3, the structure recognition means 102 divides the document data 101a into several regions (document structure blocks) and recognizes the document structure (title, paragraph, and so on) of each region (step 201 in FIG. 2). The document structure information recognized by the structure recognition means 102 is stored in the document structure storage unit 103.

Next, the document structuring apparatus 110 uses the distribution means 104 to refer to the recognized document structure information in the document structure storage unit 103 and to distribute all or part of the document data 101a stored in the document storage unit 101 to the client terminal 105 (step 202).

Next, the document structuring apparatus 110 uses the browsing history acquisition means 108 to acquire the history of the locations browsed by the user of the client terminal 105 in the distributed document data (step 203). The acquired history of browsed locations is accumulated in the browsing history storage unit 106 as browsing history information 106a.

Next, at a predetermined timing, the document structuring apparatus 110 uses the correlation calculation means 107 to calculate the correlation between the document structure blocks recognized by the structure recognition means 102, based on the browsing history accumulated in the browsing history storage unit 106, and notifies the structure recognition means 102 of the result (step 204). The predetermined timing can be each time n browsing history records are obtained (n being a predetermined integer of 1 or more), every t seconds (t being a predetermined positive number), or any combination of these.
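The timing condition for step 204 can be sketched as a small trigger object. This is a minimal illustration, not the patent's implementation: the class `CorrelationTrigger`, its method names, and the choice to combine the two conditions with a logical OR are assumptions (the text only says "any combination").

```python
from typing import Optional


class CorrelationTrigger:
    """Decide when the correlation should be recalculated.

    Fires when `every_n` new history records have arrived, or when
    `every_t` seconds have passed since the last run, whichever comes first.
    Either condition can be disabled by leaving it as None.
    """

    def __init__(self, every_n: Optional[int] = None,
                 every_t: Optional[float] = None, start: float = 0.0):
        self.every_n = every_n
        self.every_t = every_t
        self.pending = 0        # records received since the last run
        self.last_run = start   # time of the last recalculation

    def on_record(self, now: float) -> bool:
        """Call when a new browsing history record has been stored."""
        self.pending += 1
        return self._check(now)

    def on_tick(self, now: float) -> bool:
        """Call periodically so the time-based condition can fire."""
        return self._check(now)

    def _check(self, now: float) -> bool:
        due_n = self.every_n is not None and self.pending >= self.every_n
        due_t = self.every_t is not None and now - self.last_run >= self.every_t
        if due_n or due_t:
            self.pending = 0
            self.last_run = now
            return True
        return False
```

Passing the current time in as an argument, rather than reading a clock inside the class, keeps the sketch deterministic and easy to test.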

Next, the document structuring apparatus 110 uses the structure recognition means 102 to re-recognize the document structure of the document data 101a using the correlation calculated by the correlation calculation means 107, and updates the document structure information 103a in the document structure storage unit 103 (step 205).

Thereafter, steps 202 to 205 may be repeated any number of times in the same way, using the re-recognized document structure information.
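The flow of steps 201 to 205 can be summarized as a loop. The sketch below is only a skeleton under stated assumptions: the function `structure_document` and its three callable parameters are hypothetical stand-ins for the structure recognition means, the distribution and browsing history acquisition means, and the correlation calculation means.

```python
from typing import Any, Callable


def structure_document(
    document: Any,
    recognize: Callable[[Any, Any], Any],
    deliver_and_log: Callable[[Any, Any], Any],
    correlate: Callable[[Any, Any], Any],
    rounds: int = 1,
) -> Any:
    """Run initial recognition, then `rounds` cycles of refine-by-browsing."""
    # Step 201: initial recognition, with no correlation available yet.
    structure = recognize(document, None)
    for _ in range(rounds):
        # Steps 202-203: distribute the document and collect browsing history.
        history = deliver_and_log(document, structure)
        # Step 204: correlation between blocks from structure and history.
        correlation = correlate(structure, history)
        # Step 205: re-recognize and replace the stored structure.
        structure = recognize(document, correlation)
    return structure
```

Passing the components in as callables keeps the skeleton independent of any particular recognition or correlation technique.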

Next, the effects of this embodiment are described.

In this embodiment, the document structure is re-recognized using the browsing history information of the document. The recognition accuracy of document structure can therefore be improved without modifying the logical model for structure recognition or the extraction rules used by the structure recognition means 102, which reduces the effort of creating a logical model or extraction rules for each target document.

In addition, because this embodiment does not require the operation history from document creation, document structure recognition can be performed even on documents that are already complete or that have been output on paper.

Furthermore, because this embodiment improves the document structure recognition result using the browsing history information of the document, which depends little on the particular target document, it can be applied to a wide variety of documents and is versatile.

Next, a working example of the first embodiment of the present invention is described in detail with reference to the drawings.

Like the document structuring apparatus 110 of the first embodiment shown in FIG. 1, the document structuring apparatus of this example comprises a document storage unit 101 that stores document data 101a, a document structure storage unit 103 that stores document structure information 103a, a browsing history storage unit 106 that stores browsing history information 106a, and a processing unit 109 that is connected to these and is also connected to one or more client terminals 105 through a network such as the Internet. The processing unit 109 comprises: a structure recognition means 102 that recognizes the document structure from the document data 101a stored in the document storage unit 101 and accumulates it as document structure information 103a in the document structure storage unit 103; a distribution means 104 that refers to the document structure information 103a accumulated in the document structure storage unit 103 and distributes part or all of the document data 101a stored in the document storage unit 101 to the client terminal 105; a browsing history acquisition means 108 that acquires the history of browsed locations of the document data 101a distributed to the client terminal 105 and accumulates it as browsing history information 106a in the browsing history storage unit 106; and a correlation calculation means 107 that calculates the correlation between document structure blocks in the document structure information 103a accumulated in the document structure storage unit 103, using the browsing history information 106a accumulated in the browsing history storage unit 106.

The document storage unit 101 stores the document data 101a handled in this example. For instance, it stores image data obtained by scanning each page of a paper document with an input device such as a scanner.

The structure recognition means 102 receives as input the image data of each page stored as document data and performs region segmentation and character recognition on the image data using a known technique such as that shown in Patent Document 2 (step 301 in FIG. 3). For each segmented region, it calculates the scores used to determine that region's document structure, using rules such as those shown in FIG. 4 (step 302). Finally, for each region, it selects the document structure with the highest score as the document structure of that region (step 303). The calculated scores and the determined document structures are stored in the document structure information 103a. FIG. 5 shows an example of the processing performed by the structure recognition means 102. For reference, FIG. 6 shows an example in which the document structure that is the final result of FIG. 5 is expressed as a structured document in XML (eXtensible Markup Language) format.

Referring to FIG. 5, the structure recognition means 102 performs region segmentation and character recognition on the page image to recognize blocks 501 to 505, calculates for each block the scores for determining its document structure using the rules of FIG. 4, and records them in a score table ST, which forms part of the document structure information 103a in the document structure storage unit 103. The score table ST has one row per recognized block, and each row holds information identifying the block together with its title, paragraph, and continuation scores. In this example, block 501 consists of 22-point characters, so rule R1 gives it a title score of 20 points. Blocks 502 to 505 each have two or more lines, so rule R2 gives each a paragraph score of 10 points. In addition, rules R3 and R4 give continuation scores to blocks 503 and 505. The document structure table BT also forms part of the document structure information 103a and records the document structure determined for each block.
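For illustration, the scoring and selection of steps 302 and 303 might be sketched in Python as follows. This is only a sketch under assumptions: the `Block` fields, the function names, and the 20-point font-size threshold standing in for rule R1 are hypothetical, since the actual rules of FIG. 4 are not reproduced in the text. Only the awarded points (20 for a title, 10 for a paragraph) come from the description above.

```python
from dataclasses import dataclass


@dataclass
class Block:
    block_id: int
    font_size_pt: float
    line_count: int


def score_block(block):
    """Build one row of a score table (hypothetical stand-ins for R1 and R2)."""
    scores = {"title": 0, "paragraph": 0}
    if block.font_size_pt >= 20:   # R1: large characters (threshold is assumed)
        scores["title"] += 20
    if block.line_count >= 2:      # R2: two or more lines
        scores["paragraph"] += 10
    return scores


def decide_structure(scores):
    """Step 303: choose the structure with the highest score."""
    return max(scores, key=scores.get)


# Hypothetical input: a 22-point single-line block and a multi-line body block.
blocks = [Block(501, 22, 1), Block(502, 10, 5)]
score_table = {b.block_id: score_block(b) for b in blocks}
structure_table = {bid: decide_structure(s) for bid, s in score_table.items()}
print(structure_table)
```

Keeping the score table separate from the decided structure mirrors the split between the score table ST and the document structure table BT, which is what allows scores to be adjusted later and the structure to be decided again.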

In the above description, the determined document structure is stored in the document structure information 103a. In addition to the determined document structure, however, the coordinate information of each region (block), character size information, character recognition results, and the like may be stored with it. The information may also be stored as a structured document in XML format such as that shown in FIG. 6.

The distribution means 104 refers to the document structure information 103a and distributes all or part of the image data of the document data 101a stored in the document storage unit 101 to the client terminal 105. For example, suppose that the document data has document structure information such as that shown in FIG. 6 and that the client terminal 105 can display at one time a region of the size indicated by the broken line in FIG. 7. To reduce the amount of distributed data and improve the display response of the client terminal, the distribution means 104 then distributes to the client terminal 105 the image data of those parts of blocks 501 and 502 that overlap the display area of the client terminal 105, together with the image data of the part of block 503, which belongs to the same document structure as block 502, that overlaps the display area of the client terminal. When the user of the client terminal 105 requests the image data of an adjacent part, for example by scrolling the screen, the distribution means 104 cuts the image data of that adjacent part out of the document data 101a and distributes it to the client terminal 105.
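As a rough illustration, the choice of which blocks to send together might be sketched in Python as follows. The rectangle coordinates, the `structure_of` mapping, and the function names are assumptions made for this sketch and are not taken from FIG. 6 or FIG. 7. The sketch only selects block identifiers; cropping the image data to the display area is omitted.

```python
def overlaps(a, b):
    """True if two (x0, y0, x1, y1) rectangles share a non-empty area."""
    return max(a[0], b[0]) < min(a[2], b[2]) and max(a[1], b[1]) < min(a[3], b[3])


def blocks_to_deliver(viewport, block_rects, structure_of):
    """Visible blocks plus every block belonging to the same structure as one."""
    visible = {bid for bid, rect in block_rects.items() if overlaps(viewport, rect)}
    wanted = {structure_of[bid] for bid in visible}
    return sorted(bid for bid in block_rects if structure_of[bid] in wanted)


# Hypothetical page layout and structure membership.
block_rects = {
    501: (0, 0, 100, 10),
    502: (0, 10, 50, 60),
    503: (50, 10, 100, 60),
    504: (0, 60, 50, 110),
    505: (50, 60, 100, 110),
}
structure_of = {501: "title-1", 502: "para-1", 503: "para-1",
                504: "para-2", 505: "para-2"}

selected = blocks_to_deliver((0, 0, 50, 40), block_rects, structure_of)
print(selected)
```

In this hypothetical layout the display area touches only blocks 501 and 502, and block 503 is added because it shares a structure with block 502.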

Meanwhile, each time the distribution means 104 distributes image data to the client terminal 105, the browsing history acquisition means 108 acquires, as browsing history information 106a, information on the region located at the centroid coordinates of the distributed area, and stores it in the browsing history storage unit 106. FIG. 8 shows an example of the transition of the centroid coordinates of the distributed area and of the browsing history information 106a stored in the browsing history storage unit 106.
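The centroid-based acquisition could be sketched in Python roughly as follows. The block rectangles, the delivered areas, and all names are hypothetical; FIG. 8 is not reproduced here, so the coordinates were chosen only so that the resulting history matches the blocks the description names for FIG. 8 (502 and 505).

```python
def centroid(rect):
    """Center point of an (x0, y0, x1, y1) rectangle."""
    x0, y0, x1, y1 = rect
    return ((x0 + x1) / 2, (y0 + y1) / 2)


def block_at(point, block_rects):
    """Return the id of the block containing the point, or None."""
    x, y = point
    for bid, (x0, y0, x1, y1) in block_rects.items():
        if x0 <= x < x1 and y0 <= y < y1:
            return bid
    return None


def record_delivery(delivered_rect, block_rects, history):
    """Append the block under the centroid of the delivered area, if any."""
    bid = block_at(centroid(delivered_rect), block_rects)
    if bid is not None:
        history.append(bid)


# Hypothetical block layout and two successive deliveries.
block_rects = {
    502: (0, 10, 50, 60),
    503: (50, 10, 100, 60),
    504: (0, 60, 50, 110),
    505: (50, 60, 100, 110),
}
history = []
for delivered in [(0, 15, 50, 55), (50, 65, 100, 105)]:
    record_delivery(delivered, block_rects, history)
print(history)
```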

The correlation calculation means 107 uses the browsing history information 106a stored in the browsing history storage unit 106 to calculate the correlation between the blocks recognized by the structure recognition means 102. For example, the correlation between blocks that both appear in the browsing history (502 and 505 in the case of the browsing history information shown in FIG. 8) is calculated as +20, and the correlation between a block in the browsing history and a block not in it (for example 502 and 503, or 504 and 505, in the case of FIG. 8) is calculated as −20. Alternatively, the correlation between blocks that appear adjacently in the browsing history (502 and 505 in the case of FIG. 8) may be calculated as +20.
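One possible reading of the first rule can be sketched in Python as follows. The function name and the dictionary representation are assumptions for illustration; only the +20 and −20 values and the example block numbers come from the description. Pairs in which neither block was browsed receive no correlation in this sketch, which is also an assumption.

```python
from itertools import combinations


def correlations(all_blocks, history, bonus=20, penalty=-20):
    """Pairwise correlation: both browsed -> bonus, exactly one browsed -> penalty."""
    viewed = set(history)
    corr = {}
    for a, b in combinations(sorted(all_blocks), 2):
        if a in viewed and b in viewed:
            corr[(a, b)] = bonus
        elif a in viewed or b in viewed:
            corr[(a, b)] = penalty
        # Pairs with neither block in the history are left without a value.
    return corr


corr = correlations([502, 503, 504, 505], history=[502, 505])
print(corr)
```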

The structure recognition means 102 re-recognizes the document structure using the correlations calculated by the correlation calculation means 107. For example, it adds the correlation values calculated by the correlation calculation means 107 to the "continuation" scores in the score table ST and then re-recognizes the document structure. Suppose the correlation calculation means 107 calculates a correlation of +20 between blocks 502 and 505, and of −20 each between blocks 502 and 503 and between blocks 504 and 505. Adding these values to the "continuation" scores in the score table ST of FIG. 5 yields the score table ST shown in FIG. 9. Re-recognizing the document structure from these contents gives the document structure table BT of FIG. 9, in which, as regards continuity between blocks, block 505 is re-recognized as continuing from block 502. FIG. 10 shows an example in which the final result of FIG. 9, obtained by re-recognizing the document structure for the processing example of FIG. 5, is expressed as a structured document in XML format in the same way as FIG. 6.
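The re-recognition step might be sketched in Python as follows. The initial continuation scores (15 points) and the `continues:<block>` key naming are hypothetical, since the values of FIG. 5 and FIG. 9 are not reproduced in the text; the paragraph score of 10 and the correlation values are those given in the description.

```python
def apply_correlations(score_table, corr):
    """Add each pair's correlation to the later block's continuation score."""
    updated = {bid: dict(scores) for bid, scores in score_table.items()}
    for (earlier, later), value in corr.items():
        key = f"continues:{earlier}"
        updated[later][key] = updated[later].get(key, 0) + value
    return updated


def recognize(score_table):
    """Choose, for each block, the structure with the highest score."""
    return {bid: max(scores, key=scores.get) for bid, scores in score_table.items()}


# Hypothetical initial scores: 503 continues 502, and 505 continues 504.
score_table = {
    502: {"paragraph": 10},
    503: {"paragraph": 10, "continues:502": 15},
    504: {"paragraph": 10},
    505: {"paragraph": 10, "continues:504": 15},
}
corr = {(502, 505): 20, (502, 503): -20, (504, 505): -20}

before = recognize(score_table)
after = recognize(apply_correlations(score_table, corr))
print(before)
print(after)
```

With these assumed values, block 505 switches from continuing block 504 to continuing block 502, and block 503 falls back to an independent paragraph, which matches the outcome the description reports for FIG. 9.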

Thereafter, the above processing, which corresponds to steps 202 to 205 in FIG. 2, may be repeated any number of times in the same way, using newly obtained browsing history and the re-recognized document structure information.

Next, the effects of this example will be described.

In this example, the document structure recognition result is repeatedly re-recognized using the browsing history information of the document, which reduces the effort of creating, for each target document, the structure recognition logic model and extraction rules used by the structure recognition means 102. For example, among the score calculation rules shown in FIG. 4, the "continuation" score calculation can be corrected using the browsing history of the document without modifying rule R3 or rule R4.

Also, because this example does not require an operation history from the time the document was created, document structure recognition can be performed even on documents that are already complete or that have been output on paper.

In this example, the distribution means 104 refers to the document structure information 103a and sends the image data of regions belonging to the same document structure together, so image data that the user of the client terminal 105 may need next can be distributed in advance. This also has the effect of improving the response of the client terminal 105. Furthermore, when the document structure is updated using the browsing history of the client terminal 105, for example from FIG. 6 to FIG. 10, the image data distributed together with blocks 501 and 502 changes from block 503 to block 505. In other words, the browsing history of the client terminal 105 changes the image data that the distribution means 104 distributes to the client terminal 105 into image data that is more likely to be needed next at the client terminal 105, which improves the response of the client terminal 105.

In the description of the above example, the document data 101a is image data obtained by converting each page of a paper document using an input means such as a scanner. However, image data generated by software that can produce page images from electronic document data, such as Acrobat, the document editing tool from Adobe, may be stored instead. Also, when the structure recognition means 102 handles data other than image data, for example when it takes text data as input and converts it into a document with embedded LaTeX or troff commands, as does the data conversion tool plain2 available from "http://ftp.debian.or.jp/debian-jp/pool/main/p/plain2/", the document data may be stored in text format. In these cases, electronic document data and text data, and not only paper documents, can be processed by this example.

In the description of the above example, the structure recognition means 102 determines the document structure after performing region segmentation and character recognition on the image data. However, the document structure may instead be determined by region segmentation alone, without character recognition, using a known technique such as the one described in Patent Document 1.

In the description of the above example, the browsing history acquisition means 108 stores, as browsing history information 106a, information on the region located at the centroid of the distributed area. Instead of the centroid coordinates, however, the browsing history may be extracted and stored using the areas of the regions contained in the distributed area. For example, as shown in FIG. 11, when the area distributed to the client is C11, the areas C21, C22, and C23 that blocks C01, C02, and C03 occupy within that area are compared, and block C02, which has the largest area C22, is stored as browsing history. For distribution area C12, if blocks C02 and C05 have the same maximum area, both blocks are stored as browsing history. Similarly, for distribution area C13, block C05 is stored, and browsing history information such as that shown in FIG. 11 is finally obtained. This approach extracts information on the blocks that occupy a large share of the distributed area, so the correlation can be calculated from the blocks the user is most likely to be actually reading.
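The area-based variant could be sketched in Python as follows. The block names follow FIG. 11 as described, but every coordinate is hypothetical and was chosen only so that the three delivered areas reproduce the cases in the text: a single largest block, a tie, and a single block.

```python
def overlap_area(a, b):
    """Area shared by two (x0, y0, x1, y1) rectangles (0 if they do not overlap)."""
    w = min(a[2], b[2]) - max(a[0], b[0])
    h = min(a[3], b[3]) - max(a[1], b[1])
    return w * h if w > 0 and h > 0 else 0


def dominant_blocks(delivered_rect, block_rects):
    """Blocks covering the largest area of the delivered region; ties keep all."""
    areas = {bid: overlap_area(delivered_rect, rect) for bid, rect in block_rects.items()}
    largest = max(areas.values(), default=0)
    if largest == 0:
        return []
    return sorted(bid for bid, area in areas.items() if area == largest)


# Hypothetical layout; C11, C12 and C13 are the delivered areas.
block_rects = {
    "C01": (0, 0, 100, 10),
    "C02": (0, 10, 50, 60),
    "C03": (50, 10, 100, 60),
    "C05": (0, 60, 50, 110),
}
deliveries = {"C11": (0, 0, 60, 40), "C12": (0, 40, 50, 80), "C13": (0, 70, 50, 110)}

history = []
for name, rect in deliveries.items():
    history.extend(dominant_blocks(rect, block_rects))
print(history)
```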

As browsing history information, the time of distribution may be stored along with the region information determined by the centroid position or maximum area described above. In that case, the correlation between blocks may be calculated from the difference between distribution times, giving a positive correlation value to blocks whose time difference is at most a certain value and a negative correlation value to blocks whose time difference is at least another certain value. FIG. 12 shows an example of such calculation rules.
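A time-based rule of this kind might look like the following Python sketch. The thresholds (5 and 60 seconds), the ±20 values, and the restriction to consecutive history entries are all assumptions made for illustration; the actual rules of FIG. 12 are not reproduced in the text.

```python
def time_correlations(timed_history, near_s=5.0, far_s=60.0, bonus=20, penalty=-20):
    """Correlate consecutive history entries by the gap between delivery times."""
    corr = {}
    for (t0, a), (t1, b) in zip(timed_history, timed_history[1:]):
        if a == b:
            continue  # the same block delivered twice says nothing about pairs
        gap = t1 - t0
        pair = (min(a, b), max(a, b))
        if gap <= near_s:        # delivered in quick succession
            corr[pair] = corr.get(pair, 0) + bonus
        elif gap >= far_s:       # delivered after a long pause
            corr[pair] = corr.get(pair, 0) + penalty
    return corr


# Hypothetical history of (delivery time in seconds, block id) entries.
timed_history = [(0.0, 502), (3.0, 505), (120.0, 504)]
corr = time_correlations(timed_history)
print(corr)
```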

In the description of the above example, the distribution means 104 distributes only the image data corresponding to the area displayed on the client terminal 105, but the image data may instead be distributed for the whole document or page by page. In that case the browsing history acquisition means 108 can no longer obtain the history of browsed portions by the method described above. Instead, the client terminal 105 may, for example, acquire an operation history of the user's page movements and screen scrolling and send it as browsing history to the browsing history acquisition means 108, which stores it as browsing history information 106a. Alternatively, the client terminal 105 may acquire the movement of the user's gaze using a known gaze-tracking technique and send the acquired gaze-tracking information as browsing history to the browsing history acquisition means 108, which stores it as browsing history information 106a.

In the description of the above example, the correlation calculation means 107 calculates the correlation from the browsing history information 106a that the browsing history acquisition means 108 has stored in the browsing history storage unit 106. However, when multiple browsing histories from different viewers have been stored for the same document data, the correlation may be calculated after statistically processing the n pieces of browsing history information rather than from each history individually. For example, as shown in FIG. 13, when several paths of distribution-area centroid positions that share the same starting point have been obtained as path 1, path 2, and path 3, the browsing history entries of the paths may be compared with one another in the order of distribution, and the region that appears most often at each position may be used as the final browsing history for the correlation calculation. In the case of FIG. 13, the first browsing history entry of each of paths 1 to 3 is block A02, so block A02 is adopted. Block A02 is likewise adopted for the second and third entries, but in the fourth entry blocks A02, A05, and A03 each appear once. In such a case the most frequent region is regarded as indeterminate and the entry is deleted. In the fifth and sixth entries, block A05 appears twice and block A03 once, so block A05, which appears more often, is adopted. The correlation calculation means 107 ultimately uses browsing history information such as that shown in FIG. 14 for the correlation calculation. Thus, when there are several different transitions of browsed portions starting from the same region, calculating the correlation between document structure blocks from the transition followed by the majority means that structuring uses the most common path in the browsing history. This makes it possible to extract a more general document structure that is less affected by the habits of any particular individual.
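The position-by-position majority vote can be sketched in Python as follows. The three paths are hypothetical reconstructions that agree with the counts given in the description for FIG. 13 (FIG. 13 itself is not reproduced here), and the function name is an assumption.

```python
from collections import Counter
from itertools import zip_longest


def consensus_path(paths):
    """Keep, at each position, the block most paths agree on; drop ties."""
    result = []
    for step in zip_longest(*paths):
        counts = Counter(b for b in step if b is not None).most_common()
        if not counts:
            continue
        if len(counts) > 1 and counts[0][1] == counts[1][1]:
            continue  # no single most frequent block: treat as indeterminate
        result.append(counts[0][0])
    return result


# Hypothetical paths consistent with the counts described for FIG. 13.
path1 = ["A02", "A02", "A02", "A02", "A05", "A05"]
path2 = ["A02", "A02", "A02", "A05", "A05", "A05"]
path3 = ["A02", "A02", "A02", "A03", "A03", "A03"]

merged = consensus_path([path1, path2, path3])
print(merged)
```

The fourth position is dropped because each of the three blocks appears once, so the merged history has five entries rather than six.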

In the description of the above example, the correlation calculated by the correlation calculation means 107 is added to the "continuation" score calculated by the structure recognition means 102 before the document structure is re-recognized. Instead of adding it to the "continuation" score, however, it may be added, for example, to the scores for which values are set in both of the highly correlated blocks. For example, when the scores shown in the upper part of FIG. 15 have been calculated and the correlation between block 503 and block 505 is calculated as +20, 20 is added to the "paragraph" and "continuation from 502" scores, for which values are set in both block 503 and block 505, and nothing is added to the "continuation from 504" score, for which a value is set only in block 505. The scores are thereby updated as shown in the lower part of FIG. 15, and the document structure is re-recognized.
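This alternative update might be sketched in Python as follows. The initial score values and key names are hypothetical, since FIG. 15 is not reproduced in the text; a score is treated as "set" when its key is present in the block's row, which is also an assumption of this sketch.

```python
def add_to_common_scores(score_table, a, b, value):
    """Add value to every score that is set in both block a and block b."""
    updated = {bid: dict(scores) for bid, scores in score_table.items()}
    common = [key for key in updated[a] if key in updated[b]]
    for key in common:
        updated[a][key] += value
        updated[b][key] += value
    return updated


# Hypothetical scores; "continues:504" is set only for block 505.
score_table = {
    503: {"paragraph": 10, "continues:502": 10},
    505: {"paragraph": 10, "continues:502": 5, "continues:504": 10},
}
updated = add_to_common_scores(score_table, 503, 505, 20)
print(updated)
```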

Next, a second embodiment of the present invention will be described in detail with reference to the drawings.

Referring to FIG. 16, the document structuring apparatus 210 according to the second embodiment of the present invention differs from the document structuring apparatus 110 according to the first embodiment shown in FIG. 1 in that it has correlation calculation means 212 in place of the correlation calculation means 107, and in that an operation terminal 211, which is a document browsing terminal, is connected in place of the client terminal 105.

The operation terminal 211 is a mobile phone, a personal digital assistant, a personal computer, or the like. In addition to a function of displaying the document data distributed from the distribution means 104 on a display device and presenting it to the user, and a function of requesting the distribution means 104 to distribute the next document data in response to the user's operation, it has a function of instructing the correlation calculation means 212 when to calculate the correlation between document structure blocks. The user of the operation terminal 211 browses the document data not in order to read it but in order to correct the document structure information.

The correlation calculation means 212 differs from the correlation calculation means 107 of FIG. 1 in that it calculates the correlation between document structure blocks when it receives an instruction from the operation terminal 211; in all other respects it is the same as the correlation calculation means 107.

Next, the overall operation of this embodiment will be described, again using the flowchart of FIG. 2.

The document structuring apparatus 210 first reads from the document storage unit 101 the document data 101a, which is a digitized page of the document whose structure is to be recognized. Using a known technique such as those of Patent Documents 1 to 3, the structure recognition means 102 divides the document data 101a into several regions (document structure blocks) and recognizes the document structure (title, paragraph, and so on) of each region (step 201 in FIG. 2). The document structure information recognized by the structure recognition means 102 is stored in the document structure storage unit 103.

Next, the document structuring apparatus 210 uses the distribution means 104 to refer to the recognized document structure information in the document structure storage unit 103 and to distribute part of the document data 101a stored in the document storage unit 101 to the operation terminal 211 (step 202). This distribution is performed each time the user of the operation terminal 211 requests the document data of an adjacent part, for example by scrolling the screen. The user of the operation terminal 211, who is browsing the document data in order to correct the document structure information, browses so that each successive continuing document structure block is positioned near the center of the display screen.

Next, each time the distribution means 104 distributes document data to the operation terminal 211, the document structuring apparatus 210 uses the browsing history acquisition means 108 to acquire the history of the portions of the distributed document data browsed by the user of the operation terminal 211, for example by the method shown in FIG. 8 (step 203). The acquired history of browsed portions is stored in the browsing history storage unit 106 as browsing history information 106a.

Next, at the time the operation terminal 211 instructs it to start the correlation calculation, the document structuring apparatus 210 uses the correlation calculation means 212 to calculate, from the browsing history stored in the browsing history storage unit 106, the correlation between the document structure blocks recognized by the structure recognition means 102, and notifies the structure recognition means 102 of the result (step 204).

Next, the document structuring apparatus 210 uses the structure recognition means 102 to re-recognize the document structure of the document data 101a using the correlation calculated by the correlation calculation means 212, and updates the document structure information 103a in the document structure storage unit 103 (step 205).

In this embodiment, the operation terminal 211 can specify when the correlation between document structure blocks is calculated. Therefore, when the user of the operation terminal 211 browses the document data in order to correct the document structure information 103a, the correlation calculation can be performed at an appropriate time.

Also, in this embodiment, even a person without the special knowledge or skills needed to create the structure extraction rules used by the structure recognition means 102 can correct the document structure information simply by browsing the document through the operation terminal 211.

As in the first embodiment, the browsing history acquisition means 108 acquires the history of the portions browsed by the user of the operation terminal 211 from the data that the distribution means 104 distributes to the operation terminal 211. Alternatively, the user of the operation terminal 211 may directly specify correlated document structure blocks to the browsing history acquisition means 108, which then stores the specified document structure block information in the browsing history storage unit 106 as browsing history information 106a. In this case, the document structure can be re-recognized simply by specifying document structure blocks.

The embodiments and examples of the present invention have been described above, but the present invention is not limited to these examples, and various additions and modifications are possible. The functions of the document structuring apparatus of the present invention can of course be implemented in hardware, and can also be implemented by a computer and a program. The program is provided recorded on a computer-readable recording medium such as a magnetic disk or semiconductor memory, is read by the computer at startup or a similar time, and controls the operation of the computer so that it functions as the structure recognition means 102, the distribution means 104, the correlation calculation means 107 or 212, and the browsing history acquisition means 108 of the embodiments and examples described above.

The present invention can be applied to document structure recognition apparatuses that recognize a document structure from document data, to document distribution apparatuses that use the recognized document structure, and to programs for implementing such apparatuses on a computer.

FIG. 1 is a block diagram of the document structuring apparatus according to the first embodiment of the present invention.
FIG. 2 is a flowchart showing the operation of the document structuring apparatus according to the first embodiment of the present invention.
FIG. 3 is a flowchart showing the operation of the structure recognition means in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 4 is a diagram showing an example of the rules that the structure recognition means uses to determine scores in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 5 is a diagram showing the processing performed by the structure recognition means in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 6 is a diagram showing an example of the data format of the document structure information in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 7 is an explanatory diagram of the criteria by which the distribution means determines the image data to distribute in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 8 is a diagram showing an example of the browsing history information acquired and stored by the browsing history acquisition means in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 9 is a diagram showing how the scores calculated by the structure recognition means are updated using the correlation calculation means and the document structure is re-recognized in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 10 is a diagram showing an example of the data format of the updated document structure information in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 11 is a diagram showing another example of the browsing history information acquired and stored by the browsing history acquisition means in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 12 is a diagram showing another rule by which the correlation calculation means calculates correlation in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 13 is an explanatory diagram of the statistical processing of browsing histories by the correlation calculation means in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 14 is a diagram showing an example of a browsing history statistically processed by the correlation calculation means in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 15 is a diagram showing another procedure by which the structure recognition means updates scores using the correlation calculated by the correlation calculation means in the document structuring apparatus according to the first embodiment of the present invention.
FIG. 16 is a block diagram of the document structuring apparatus according to the second embodiment of the present invention.

Explanation of symbols

101 ... document storage unit
101a ... document data
102 ... structure recognition means
103 ... document structure storage unit
103a ... document structure information
104 ... distribution means
105 ... client terminal
106 ... browsing history storage unit
106a ... browsing history information
107 ... correlation calculation means
108 ... browsing history acquisition means
109 ... processing unit
110 ... document structuring apparatus
201 ... document structure recognition step
202 ... document data distribution step
203 ... browsing history acquisition step
204 ... correlation calculation step
205 ... document structure re-recognition step
210 ... document structuring apparatus
211 ... operation terminal
212 ... correlation calculation means
301 ... region segmentation and character recognition step
302 ... score calculation step
303 ... document structure selection step
501 to 505 ... recognized blocks

Claims

1. A document structuring apparatus comprising: document storage means for storing document data; document structure storage means for storing document structure information of the document data; browsing history storage means for storing browsing history information; and a processing unit that acquires a history of the browsing locations in the document data when the document data stored in the document storage means is distributed to a document browsing terminal, accumulates the history in the browsing history storage means as browsing history information, calculates the correlation between document structure blocks in the document structure information of the document data using the accumulated browsing history information, and re-recognizes the document structure of the document data using the calculated correlation between the document structure blocks.

2. A document structuring apparatus comprising:
document storage means for storing document data;
document structure storage means for storing document structure information;
structure recognition means for recognizing a document structure from the document data stored in the document storage means and storing it in the document structure storage means as document structure information;
browsing history storage means for storing browsing history information;
distribution means for distributing a part or all of the document data to a document browsing terminal with reference to the stored document structure information;
browsing history acquisition means for acquiring a history of the browsing locations in a document distributed to the document browsing terminal and accumulating it in the browsing history storage means as browsing history information; and
correlation calculation means for calculating the correlation between document structure blocks in the stored document structure information using the accumulated browsing history information,
wherein the structure recognition means receives the correlation between the document structure blocks calculated by the correlation calculation means and re-recognizes the document structure.
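As an illustration of the re-recognition recited above, the following minimal sketch raises rule-based scores by a correlation value and then reselects the best-scoring link for each block. The claims do not state how the scores and the correlation are combined; the additive update, the `weight` parameter, the link-based representation, and the function name `rerecognize` are all assumptions made for this example.

```python
from typing import Dict, Tuple

Link = Tuple[int, int]  # (source block id, candidate following block id)

def rerecognize(candidate_scores: Dict[Link, float],
                correlation: Dict[Link, float],
                weight: float = 1.0):
    """Update candidate link scores with correlations and reselect links.

    ASSUMPTION: the update is additive (score + weight * correlation).
    The source only says that scores are updated using the correlation.
    """
    updated = {link: score + weight * correlation.get(link, 0.0)
               for link, score in candidate_scores.items()}
    best: Dict[int, int] = {}
    for (src, dst), score in updated.items():
        # Keep the highest-scoring destination for each source block.
        if src not in best or score > updated[(src, best[src])]:
            best[src] = dst
    return updated, best
```

With hypothetical scores of 0.25 for the link 501 to 502 and 0.5 for the link 501 to 503, a correlation of 0.5 on the first link changes the selected successor of block 501 from 503 to 502.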

3. The document structuring apparatus according to claim 2, wherein the structure recognition means updates the stored document structure information based on the result of the re-recognition.

4. The document structuring apparatus according to claim 2, wherein the distribution means distributes, in a single distribution, document data of a size corresponding to the display area of the document browsing terminal, and the browsing history acquisition means acquires information on the document structure blocks included in the distributed document data as the history of the browsing locations in the document.

5. The document structuring apparatus according to claim 4, wherein the browsing history acquisition means includes, in the browsing history information, information on the time at which the document data was distributed.
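The history acquisition recited in claims 4 and 5 can be sketched as follows: each distribution covers one display-sized region, and the blocks inside that region are logged together with the distribution time. The rectangle-overlap test, the field names, and the coordinates are assumptions for illustration only; the source does not specify the criterion in this section.

```python
from dataclasses import dataclass
from typing import List, Tuple

@dataclass(frozen=True)
class Block:
    block_id: int
    x: int
    y: int
    w: int
    h: int

@dataclass(frozen=True)
class HistoryEntry:
    timestamp: float              # time at which the data was distributed
    block_ids: Tuple[int, ...]    # document structure blocks in the distributed region

def blocks_in_viewport(blocks: List[Block], vx: int, vy: int, vw: int, vh: int):
    """Return the ids of blocks whose rectangle overlaps the viewport (assumed criterion)."""
    hits = []
    for b in blocks:
        overlaps_x = b.x < vx + vw and vx < b.x + b.w
        overlaps_y = b.y < vy + vh and vy < b.y + b.h
        if overlaps_x and overlaps_y:
            hits.append(b.block_id)
    return tuple(sorted(hits))

def record_distribution(history: List[HistoryEntry], blocks: List[Block],
                        viewport: Tuple[int, int, int, int], timestamp: float):
    """Append one browsing history entry for a single distribution and return it."""
    entry = HistoryEntry(timestamp, blocks_in_viewport(blocks, *viewport))
    history.append(entry)
    return entry
```

Storing block identifiers rather than raw coordinates keeps the history directly comparable with the document structure information, which is what the correlation calculation needs.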

6. The document structuring apparatus according to claim 2, wherein the correlation calculation means calculates the correlation between the document structure blocks each time a certain number of items of browsing history information has been acquired, each time a certain period of time has elapsed, each time an instruction is received from the document browsing terminal, or on any combination of these conditions.
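The three trigger conditions in claim 6 can be combined in a small policy object, sketched below. The class name `RecalcTrigger`, its fields, and the choice to treat the conditions as alternatives (any one suffices) are assumptions; the claim also permits other combinations.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class RecalcTrigger:
    every_n_entries: Optional[int] = None   # recompute after this many new history entries
    every_seconds: Optional[float] = None   # recompute after this much elapsed time
    entries_since: int = 0
    last_run: float = 0.0

    def on_history_entry(self) -> None:
        """Count one newly accumulated browsing history entry."""
        self.entries_since += 1

    def should_run(self, now: float, requested: bool = False) -> bool:
        """Return True if any enabled condition holds (assumed OR combination)."""
        if requested:  # explicit instruction from the browsing terminal
            return True
        if self.every_n_entries is not None and self.entries_since >= self.every_n_entries:
            return True
        if self.every_seconds is not None and now - self.last_run >= self.every_seconds:
            return True
        return False

    def mark_run(self, now: float) -> None:
        """Reset the counters after a correlation calculation."""
        self.entries_since = 0
        self.last_run = now
```

Leaving a field as `None` disables that condition, so the same object covers count-only, time-only, and on-request configurations.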

7. The document structuring apparatus according to claim 2, wherein, when the plurality of items of browsing history information for the same document accumulated in the browsing history storage means contain a plurality of browsing location transitions starting from the same document structure block, the correlation calculation means calculates the correlation between the document structure blocks based on the browsing location transition that predominates among them.
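One way to read claim 7 is as a count of block-to-block transitions across browsing sessions, keeping for each starting block the transition observed most often. The sketch below takes that reading. The session representation (a list of block ids per browsing session), the use of the transition share as the correlation value, and the tie-break toward the lower block id are assumptions, not details given in the claims.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Tuple

def transition_counts(sessions: List[List[int]]) -> Dict[int, Counter]:
    """Count transitions src -> dst between consecutive browsing locations."""
    counts: Dict[int, Counter] = defaultdict(Counter)
    for seq in sessions:
        for src, dst in zip(seq, seq[1:]):
            if src != dst:  # ignore repeated views of the same block
                counts[src][dst] += 1
    return counts

def majority_transitions(sessions: List[List[int]]) -> Dict[int, Tuple[int, float]]:
    """For each source block, return (most frequent destination, its share of transitions)."""
    result = {}
    for src, dsts in transition_counts(sessions).items():
        total = sum(dsts.values())
        # ASSUMPTION: ties are broken in favour of the lower block id.
        dst, n = max(dsts.items(), key=lambda kv: (kv[1], -kv[0]))
        result[src] = (dst, n / total)
    return result
```

For three hypothetical sessions `[501, 502, 503]`, `[501, 502, 504]`, and `[501, 503]`, block 501 is followed by block 502 in two of three cases, so the pair (501, 502) would receive the strongest correlation.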

8. A document structuring method comprising:
a) a step in which structure recognition means recognizes a document structure from document data stored in document storage means and stores it in document structure storage means as document structure information;
b) a step in which distribution means distributes a part or all of the document data to a document browsing terminal with reference to the stored document structure information;
c) a step in which browsing history acquisition means acquires a history of the browsing locations in the document distributed to the document browsing terminal and accumulates it in browsing history storage means as browsing history information;
d) a step in which correlation calculation means calculates the correlation between document structure blocks in the stored document structure information using the accumulated browsing history information; and
e) a step in which the structure recognition means receives the correlation between the document structure blocks calculated by the correlation calculation means and re-recognizes the document structure.

9. The document structuring method according to claim 8, further comprising:
f) a step in which the structure recognition means updates the stored document structure information based on the result of the re-recognition.

10. The document structuring method according to claim 8, wherein the distribution means distributes, in a single distribution, document data of a size corresponding to the display area of the document browsing terminal, and the browsing history acquisition means acquires information on the document structure blocks included in the distributed document data as the history of the browsing locations in the document.

11. The document structuring method according to claim 10, wherein the browsing history acquisition means includes, in the browsing history information, information on the time at which the document data was distributed.

12. The document structuring method according to claim 8, wherein the correlation calculation means calculates the correlation between the document structure blocks each time a certain number of items of browsing history information has been acquired, each time a certain period of time has elapsed, each time an instruction is received from the document browsing terminal, or on any combination of these conditions.

13. The document structuring method according to claim 8, wherein, when the plurality of items of browsing history information for the same document accumulated in the browsing history storage means contain a plurality of browsing location transitions starting from the same document structure block, the correlation calculation means calculates the correlation between the document structure blocks based on the browsing location transition that predominates among them.

14. A program for causing a computer having document storage means for storing document data, document structure storage means for storing document structure information, and browsing history storage means for storing browsing history information to function as:
structure recognition means for recognizing a document structure from the document data stored in the document storage means and storing it in the document structure storage means as document structure information;
distribution means for distributing a part or all of the document data to a document browsing terminal with reference to the stored document structure information;
browsing history acquisition means for acquiring a history of the browsing locations in a document distributed to the document browsing terminal and accumulating it in the browsing history storage means as browsing history information; and
correlation calculation means for calculating the correlation between document structure blocks in the stored document structure information using the accumulated browsing history information,
wherein the structure recognition means receives the correlation between the document structure blocks calculated by the correlation calculation means and re-recognizes the document structure.

15. The program according to claim 14, wherein the structure recognition means updates the stored document structure information based on the result of the re-recognition.

16. The program according to claim 14, wherein the distribution means distributes, in a single distribution, document data of a size corresponding to the display area of the document browsing terminal, and the browsing history acquisition means acquires information on the document structure blocks included in the distributed document data as the history of the browsing locations in the document.

17. The program according to claim 16, wherein the browsing history acquisition means includes, in the browsing history information, information on the time at which the document data was distributed.

18. The program according to claim 14, 15, 16 or 17, wherein the correlation calculation means calculates the correlation between the document structure blocks each time a certain number of items of browsing history information has been acquired, each time a certain period of time has elapsed, each time an instruction is received from the document browsing terminal, or on any combination of these conditions.

19. The program according to claim 14, wherein, when the plurality of items of browsing history information for the same document accumulated in the browsing history storage means contain a plurality of browsing location transitions starting from the same document structure block, the correlation calculation means calculates the correlation between the document structure blocks based on the browsing location transition that predominates among them.