JP3352709B2

JP3352709B2 - Document shaping device and processing method of document shaping device

Info

Publication number: JP3352709B2
Application number: JP25880091A
Authority: JP
Inventors: 美佳福井; 浩司山口; 美和子土井
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1991-10-07
Filing date: 1991-10-07
Publication date: 2002-12-03
Anticipated expiration: 2017-12-03
Also published as: JPH05101039A

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Field of Industrial Application] The present invention relates to a document shaping device that extracts the relationships among multiple pieces of document element data and, based on those relationships, shapes the document into an easy-to-read form, and to a processing method of the document shaping device.

[0002]

[Prior Art] In a system that provides information to users, how multiple pieces of information are arranged and shaped is a key factor in making the information easy to understand.

[0003] For example, a document formatting system must take the clarity of the document into account when allocating text, charts, image data, and the like to a page. In particular, because charts and images need to be allocated near the related text, the user has had to work out and specify the optimal chart placement every time the text was edited or the document was changed to a different format.

[0004] To relieve this burden, there are systems in which the user defines in advance the relationship between the text and the charts it references, and the system then determines chart placement semi-automatically according to where the text is flowed. In such a system, the user sets up a chart area called an anchored frame, fastens it with an anchor to the reference location in the text, and specifies its position, for example that the chart area must always appear below the reference location. When the text moves to another page because of editing or a format change, the anchored frame automatically moves with it. However, in a document that references the same chart from several locations, the anchor for one chart cannot be placed at more than one location in the text, so when the sentence holding the anchor is deleted, the user has to reattach the anchor to another reference location.

[0005] In contrast, a system has also been proposed (see Japanese Patent Application Laid-Open No. 61-21570) that automatically extracts chart reference relationships from phrases in the text that refer to charts, such as "Fig. **" and "Table **", and allocates each chart to the same page as its reference. In this system, when several locations reference the same chart, a sibling relationship is established among those reference locations, and at formatting time the chart is automatically placed near whichever of them is printed first. Thus, even if the sentence containing the first reference location is deleted, the chart can still be allocated automatically near the next remaining reference location. However, because this method assigns no priority among reference locations, formatting a document whose table of contents contains a list of charts causes all the charts to be allocated to the table of contents, and none can be allocated to the reference locations in the body text.

[0006]

[Problems to Be Solved by the Invention] As described above, conventional systems establish no relationships among the data (reference locations) that refer to data such as charts, which makes it difficult to perform processing that spans multiple reference locations. Even where some relationship is established among reference locations, it is merely an ordering that follows the flow of the text. No prioritization reflecting the strength or meaning of each reference is performed, so automatic allocation of charts within a page could not be done adequately.

[0007] Accordingly, an object of the present invention is to provide a document processing device that automatically determines the optimal placement of a chart referenced from multiple pieces of data, based on the logical structure among the referencing data, and thereby reduces the burden on the user.

[0008]

[Means for Solving the Problems] The present invention is a document shaping device that receives document element data serving as the constituent elements of a document, extracts a document structure consisting of a logical structure and a reference structure among the input document element data, allocates the document element data based on the extracted document structure to shape them into the form of a document, and outputs the shaped document. The device is characterized by having an allocation permission determination unit that determines, based on the logical structure among the referencing element data, whether to allocate the referenced element data near the referencing element data.

[0009]

[Operation] According to the present invention, when determining the placement of data referenced from multiple reference locations, the optimal placement of the referenced data can be determined automatically based on the logical structure among the referencing document element data. In addition, by presenting placements of the element data to the user in accordance with the priority determined by the system and letting the user choose, the placement can easily be changed to suit the user's intent. The burden on the user in document editing and formatting can therefore be reduced.

[0010]

[Embodiment] An embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of the present invention.

[0011] The character data, chart data, image data, and other element data that make up a document are input from the input unit 10, which consists of, for example, a keyboard, a mouse, an image scanner, other storage devices, communication devices, and the like, and are sent to the document structure extraction unit 20. The document structure extraction unit 20 consists of a logical structure extraction unit 21 and a reference structure extraction unit 22. First, the logical structure extraction unit 21 extracts the logical structure among the element data. Next, the reference structure extraction unit 22 extracts the reference relationships (reference structure) among the element data.

[0012] The document shaping unit 30 shapes the input document element data using the extracted document structure. The allocation position generation unit 31 allocates each referenced element data near the element data that reference it, based on the reference structure extracted in the preceding stage. For element data referenced from multiple reference locations, the priority determination unit 32 determines the priority among the allocation positions for the respective reference locations, based on the logical structure among the referencing element data. The allocated element data are output as a shaped document from the output unit 40, for example a display or a printer. The operation of the invention is described below using the example of FIG. 2.

[0013] Document element data consisting of text data A and chart data B are input from the input unit 10. For example, text data A is input from the keyboard and chart data B from the scanner. The document structure extraction unit 20 extracts the logical structure and the reference structure from the relationships among the document element data.

[0014] For text data A, the logical structure is expressed as hierarchical relationships among sentences, such as title, author, affiliation, chapter, and section. When the input data contain structure information added in advance by the user, the logical structure extraction unit 21 reads that structure information. When no structure information has been added, the unit may instead hold logical structure analysis rules and extract the structure automatically. This automatic logical structure analysis technique is described in detail in Japanese Patent Application Laid-Open No. 61-190653. The logical structure of the input text data extracted by the logical structure extraction unit 21 is, for example, a hierarchical structure such as that shown in FIG. 3.
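
As a rough illustration of the hierarchical logical structure described above, the following Python sketch models it as a tree of nodes labelled with attribute names. The class `LogicalNode`, the function `attribute_path`, and the sample tree are hypothetical. FIG. 3 is not reproduced in the text, so the actual shape of the structure may differ, and the sentence numbers given to the title and chapter heading are made up for the example.

```python
from __future__ import annotations
from dataclasses import dataclass, field


@dataclass
class LogicalNode:
    """One node of a hierarchical logical structure (hypothetical shape)."""
    attribute: str                                   # e.g. "title", "chapter"
    sentences: list[int] = field(default_factory=list)    # sentence numbers held here
    children: list[LogicalNode] = field(default_factory=list)


def attribute_path(node, sentence_no, path=()):
    """Return the chain of attribute names leading to a sentence, or None."""
    here = path + (node.attribute,)
    if sentence_no in node.sentences:
        return here
    for child in node.children:
        found = attribute_path(child, sentence_no, here)
        if found is not None:
            return found
    return None


# Illustrative tree only; not taken from FIG. 3.
doc = LogicalNode("document", children=[
    LogicalNode("title", sentences=[1]),
    LogicalNode("table of contents", sentences=[16, 21, 22]),
    LogicalNode("chapter", children=[
        LogicalNode("chapter heading", sentences=[30]),
        LogicalNode("chapter paragraph", sentences=[36, 40]),
    ]),
])
```

Looking up a sentence returns its position in the hierarchy, which is the kind of information the later priority decision relies on.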

[0015] The reference structure expresses the reference relationships among the element data, for example references from the text to charts, other chapters, and bibliographic references. Like the logical structure, the reference structure is either read from the input data or extracted automatically, for example by the following procedure. In a technical document, the text specifies reference relationships to each chart by means of reference phrases and the like. In the example of FIG. 2, the sentences from sentence 15 onward that give the descriptions of the drawings, "shown in Fig. 1" in sentence 36, "this figure" in sentence 40, "function comparison table" in sentence 50, and "the following table" in sentence 55 all express references to charts. The reference structure extraction unit 22 holds, for example, a reference phrase dictionary and reference description extraction rules such as those shown in FIG. 4, and extracts from the text the expressions that refer to charts. It then analyzes the referenced chart data and identifies the chart corresponding to each reference expression.

[0016] Here, "Fig. 1" is referenced from three locations: sentences 16, 36, and 40. "Table 1" is referenced from sentences 21 and 50, and "Table 2" from sentences 22 and 55. For reference expressions such as those in sentences 16 and 21, the correspondence with a chart is found by a rule that identifies the chart when the combination of a word such as "figure" or "table" and a chart number matches. For reference locations that give no chart number, such as sentences 40, 50, and 55, the unit has a mechanism that identifies the relevant chart from, for example, the anaphoric relationships of pronouns obtained by context analysis, or the correlation with the description of the chart's contents. For charts that have already been allocated within the text, it is also possible to analyze the distance to the chart and position-specifying expressions such as "the figure below" or "the figure at left". These analyses yield, for example, a reference structure such as that shown in FIG. 5. At this stage the priorities are still undetermined, so that column is left blank.
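
The simplest of the matching rules described above, pairing a word such as "figure" or "table" with a chart number, could be sketched as follows. This is a minimal sketch and not the dictionary or rules of FIG. 4. The pattern `REF_PATTERN`, the function `extract_references`, and the English sample sentences are assumptions made for illustration. References without a number (such as "this figure") are deliberately left unresolved, since the text assigns them to context analysis.

```python
import re
from collections import defaultdict

# Hypothetical pattern: a keyword ("Fig" or "Table") followed by a number.
REF_PATTERN = re.compile(r"\b(Fig|Table)\.?\s*(\d+)", re.IGNORECASE)


def extract_references(sentences):
    """Map each chart label, e.g. ("fig", 1), to the sentence numbers citing it."""
    refs = defaultdict(list)
    for number, text in sentences.items():
        for kind, index in REF_PATTERN.findall(text):
            key = (kind.lower(), int(index))
            if number not in refs[key]:
                refs[key].append(number)
    return dict(refs)


# Made-up sentence texts; only the sentence numbers follow the example in the text.
sentences = {
    16: "Fig. 1: system overview",
    21: "Table 1: function comparison",
    36: "The configuration is as shown in Fig. 1.",
    40: "This figure also shows the data flow.",  # no number: left unresolved here
}

references = extract_references(sentences)
```

Sentence 40 is absent from the result because it carries no chart number; resolving it would need the pronoun or content analysis that the text mentions.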

[0017] The shaping unit 30 allocates the document element data using the extracted logical structure and reference structure, in accordance with format data (regions) such as those shown in FIG. 6. There, each dotted frame represents a text frame into which text is flowed. The format data specify in advance the size of each page, the size of the text frames on the page, the logical attributes to be flowed into each text frame, the flow order, and so on. Following the extracted logical structure, the shaping unit 30 flows the sentences having the logical attribute specified for each text frame into that frame in order. If the format data also describe formatting such as font size and indentation for each logical attribute (title, author name, ..., chapter heading, chapter paragraph), each sentence can be formatted according to the format for its logical attribute.
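
The attribute-driven flow described above might be sketched as below. The function `flow_into_frames` and the frame names are hypothetical, and the sketch ignores page and frame sizes, overflow to later pages, and per-attribute fonts, all of which the format data would also control.

```python
def flow_into_frames(frames, sentences):
    """Assign sentences to text frames by logical attribute.

    frames:    list of (frame_name, accepted_attributes) in flow order.
    sentences: list of (sentence_no, attribute) in document order.
    Returns {frame_name: [sentence_no, ...]}; a sentence whose attribute
    no frame accepts is left unplaced.
    """
    layout = {name: [] for name, _ in frames}
    for number, attribute in sentences:
        for name, accepted in frames:
            if attribute in accepted:
                layout[name].append(number)
                break  # first matching frame in flow order wins
    return layout


# Illustrative format data and sentences (not taken from FIG. 6).
frames = [
    ("title frame", {"title"}),
    ("body frame", {"chapter heading", "chapter paragraph"}),
]
flowed = flow_into_frames(
    frames,
    [(1, "title"), (30, "chapter heading"), (36, "chapter paragraph")],
)
```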

[0018] When a sentence being flowed references a chart, the chart is allocated near that sentence. However, when the same chart is referenced from multiple sentences, as in the example of FIG. 2, this case is detected according to the flowchart shown in FIG. 7 and processing is passed to the priority determination unit 32. Specifically, one sentence is input (step 1) and it is determined whether the sentence references a further chart (step 2). If it does not, the next sentence is input; if it does, it is determined whether the priority for that chart has already been decided (step 3). If it has not, it is next determined whether the chart is referenced from multiple locations (step 4). If so, the priority determination process is performed (step 5); if not, the priority is set to 1 (step 6). In either case processing returns to step 2. The priorities are determined in this way.
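
The detection loop of steps 1 to 6 can be sketched in Python as follows. The function `assign_priorities`, its argument shapes, and the `decide` callback are hypothetical. The callback stands in for the priority determination process of step 5, and the sample callback simply ranks by text order. The chart name `fig3` is made up to exercise step 6.

```python
def assign_priorities(sentences, refs_by_sentence, citing, decide):
    """Walk the sentences and fix a priority table per referenced chart.

    sentences:        sentence numbers in flow order.
    refs_by_sentence: {sentence_no: [chart, ...]}
    citing:           {chart: [sentence_no, ...]} (all locations citing it)
    decide:           callable(chart, locations) -> {sentence_no: rank}
    Returns {chart: {sentence_no: rank}}.
    """
    priorities = {}
    for number in sentences:                              # step 1: take one sentence
        for chart in refs_by_sentence.get(number, []):    # step 2: a chart to handle?
            if chart in priorities:                       # step 3: already decided
                continue
            locations = citing[chart]
            if len(locations) > 1:                        # step 4: cited from several places
                priorities[chart] = decide(chart, locations)   # step 5
            else:
                priorities[chart] = {number: 1}           # step 6: sole reference
    return priorities


def text_order(chart, locations):
    """Placeholder decision: rank the locations by their order in the text."""
    return {s: rank for rank, s in enumerate(sorted(locations), start=1)}


result = assign_priorities(
    [16, 36, 40, 60],
    {16: ["fig1"], 36: ["fig1"], 40: ["fig1"], 60: ["fig3"]},
    {"fig1": [16, 36, 40], "fig3": [60]},
    text_order,
)
```

Because decided charts are skipped in step 3, each chart is ranked once even though several sentences cite it.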

[0019] From the extracted reference structure and logical structure, the priority determination unit 32 creates a table, such as that shown in FIG. 8, that pairs the numbers of the sentences referencing the same chart with the attribute name of each sentence in the logical structure. Priorities that have already been decided are written as they are determined into the priority column of the reference structure of FIG. 5, and that information is also used when determining the priorities for the next chart. The unit holds priority determination rules such as those shown in FIG. 9 and determines the priorities accordingly.

[0020] FIG. 8(a) lists the reference locations of "Fig. 1". By priority determination rule 1-1, the priority of sentence 16 drops to third, as shown in FIG. 8(b). By rule 2-1 or 2-7, the priorities are then determined as shown in FIG. 8(c). Similarly, the priorities for "Table 1" are determined by rule 1-1, and those for "Table 2" by rule 1-1 or by rules 2-2, 2-5, and so on. Based on the priorities determined in this way and written into the reference structure, the allocation position generation unit 31 allocates each chart near the sentence with the first priority.
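
The rules of FIG. 9 are not spelled out in the text, so the following is only a sketch of how rule-based ranking over logical attributes might look, using invented stand-in rules: a reference inside the table of contents ranks last, a reference that names the chart number outranks one that does not, and ties fall back to text order. The function `rank_references` and the field names are assumptions. With these stand-ins, sentence 16 ends up third, as the text states for rule 1-1, but the order of the other two sentences is an assumption.

```python
def rank_references(locations):
    """Rank the sentences citing one chart; rank 1 is the preferred location.

    locations: list of dicts with keys
        "sentence"  (int)  sentence number,
        "attribute" (str)  logical attribute of the sentence,
        "explicit"  (bool) True if the reference names the chart number.
    """
    def sort_key(loc):
        in_toc = loc["attribute"] == "table of contents"
        # False sorts before True: body text first, explicit references first.
        return (in_toc, not loc["explicit"], loc["sentence"])

    ordered = sorted(locations, key=sort_key)
    return {loc["sentence"]: rank for rank, loc in enumerate(ordered, start=1)}


# Sentence numbers follow the text; attributes and flags are illustrative.
fig1_ranks = rank_references([
    {"sentence": 16, "attribute": "table of contents", "explicit": True},
    {"sentence": 36, "attribute": "chapter paragraph", "explicit": True},
    {"sentence": 40, "attribute": "chapter paragraph", "explicit": False},
])
```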

[0021] In this example, all priorities were determined first and the chart was then allocated to the position judged best, but the priorities can also be determined at allocation time. In that case, multiple allocation position candidates with concrete coordinate values can be generated for each reference location, and the priorities can be determined by comparing the actual allocation results. In either case, to take in the user's judgment, the device may have an interface that asks the user for confirmation after allocating automatically according to the priorities. For example, when the user selects "To another reference position" from a menu or command such as that shown in FIG. 10, the chart is re-allocated to the reference location with the second priority. A menu item with a name such as "To the next reference position" re-allocates the chart to the next reference location along the flow of the text, regardless of priority. Selecting a menu item such as "Reference list" can bring up a screen, such as that shown in FIG. 11, on which the user views a list of the reference structure and changes the priorities. For example, the user can easily change the priority of a reference relationship by selecting the line representing that relationship with a mouse or the like and issuing an instruction to change the line's thickness. In the example of FIG. 11 the charts are represented by reduced image icons, but they may instead be represented by character strings such as chart captions.
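
The difference between the two menu commands can be illustrated with a small sketch. The function names `other_reference` and `next_reference` are hypothetical, and wrapping around after the last candidate is an assumption, since the text does not say what happens there. The sample priority table is made up.

```python
def other_reference(priorities, current):
    """'To another reference position': move to the next-ranked location."""
    by_rank = sorted(priorities, key=priorities.get)
    i = by_rank.index(current)
    return by_rank[(i + 1) % len(by_rank)]   # wrap-around is an assumption


def next_reference(priorities, current):
    """'To the next reference position': next location in text order."""
    by_text = sorted(priorities)
    i = by_text.index(current)
    return by_text[(i + 1) % len(by_text)]   # wrap-around is an assumption


# Made-up priority table: {sentence_no: rank}.
ranks = {16: 3, 36: 2, 40: 1}
```

Starting from the first-priority location (sentence 40 in this made-up table), the first command moves the chart to the second-priority location, while the second command ignores rank and follows the text.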

[0022] In addition, if allocation permission rules such as "in a technical document, no chart is allocated to the table of contents" are described in the priority determination unit 32 and used to judge whether allocation is permitted, functions such as not allocating a chart even at a first-priority location can be realized.
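
A permission check of this kind might be layered on top of the priorities as in the sketch below. The function `choose_location` and the document type string are hypothetical. Falling through to the next-ranked location when the first is refused is an assumption, since the text only says that the chart is not allocated there.

```python
def choose_location(priorities, attributes, document_type):
    """Return the best-ranked sentence where the chart may go, or None.

    priorities: {sentence_no: rank}, rank 1 preferred.
    attributes: {sentence_no: logical attribute name}.
    """
    def allowed(sentence_no):
        # Hypothetical encoding of the rule quoted in the text:
        # technical documents get no charts in the table of contents.
        return not (document_type == "technical"
                    and attributes[sentence_no] == "table of contents")

    for sentence_no in sorted(priorities, key=priorities.get):
        if allowed(sentence_no):
            return sentence_no
    return None


# Illustrative data: sentence 16 is ranked first but sits in the table of contents.
toc_first = {16: 1, 36: 2, 40: 3}
kinds = {16: "table of contents", 36: "chapter paragraph", 40: "chapter paragraph"}
```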

[0023] Furthermore, because the relationships among the reference locations of the same chart have been extracted, for a document whose table of contents contains chart descriptions, as in the example of FIG. 2, the page number on which each chart is actually allocated can be added to the chart description in the table of contents.
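
Once each chart has a page, adding that page to its description in the table of contents is a simple lookup. The sketch below assumes a hypothetical function `add_page_numbers`, a made-up caption format, and made-up sample data.

```python
def add_page_numbers(captions, chart_pages):
    """Append the allocated page number to each chart description.

    captions:    {chart: description text in the table of contents}
    chart_pages: {chart: page number where the chart was allocated}
    Descriptions of charts that have no page yet are returned unchanged.
    """
    result = {}
    for chart, text in captions.items():
        if chart in chart_pages:
            result[chart] = f"{text} ... p. {chart_pages[chart]}"
        else:
            result[chart] = text
    return result


# Illustrative data only.
toc = add_page_numbers(
    {"fig1": "Fig. 1 System overview", "table1": "Table 1 Function comparison"},
    {"fig1": 3},
)
```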

[0024]

[Effects of the Invention] As described in detail above, according to the present invention, when determining the placement of data referenced from multiple reference locations, the data can be allocated automatically near the most suitable reference location. This reduces the tedious allocation work involved in displaying, formatting, and editing documents. The more complex a document's reference structure, the harder it is to generate and manage allocations that match that structure, and so the greater the benefit of the invention.

[Brief description of the drawings]

FIG. 1 is a block diagram of an embodiment of the present invention.

FIG. 2 is a diagram showing an example of input document data.

FIG. 3 is a diagram showing an example of the logical structure extracted by the logical structure extraction unit.

FIG. 4 is a diagram showing an example of the reference phrase dictionary and reference description extraction rules used by the reference structure extraction unit.

FIG. 5 is a diagram showing an example of the reference structure extracted by the reference structure extraction unit.

FIG. 6 is a diagram showing an example of the format data used by the shaping unit.

FIG. 7 is a diagram showing the algorithm for detecting cases in which the same chart is referenced from multiple sentences.

FIG. 8 is a diagram showing the priority determination table used by the priority determination unit.

FIG. 9 is a diagram showing an example of the priority determination rules used by the priority determination unit.

FIG. 10 is a diagram showing an example of a menu for entering user instructions.

FIG. 11 is a diagram showing an example of a screen for changing the reference structure.

[Explanation of symbols]

10 ... input unit
20 ... document structure extraction unit
30 ... shaping unit
40 ... output unit

Continuation of front page

(56) References: JP-A-61-21570 (JP, A); JP-A-63-245556 (JP, A); JP-A-1-230168 (JP, A); JP-A-2-3859 (JP, A)

(58) Fields searched (Int. Cl.7, DB name): G06F 17/21 - 17/24

Claims

(57) [Claims]

1. A document shaping device comprising: logical structure extraction means for extracting a hierarchical logical structure among a plurality of document element data that are constituent elements of a document; reference structure extraction means for extracting a reference structure indicating the relationship between document element data that are referenced by other document element data and the reference locations of the document element data that reference them; rule storage means for storing priority determination rules for determining, from the logical structure and the reference structure, the priority of allocation positions of referenced document element data; allocation position priority determination means for determining, when the reference structure extracted by the reference structure extraction means contains a plurality of document element data that reference the same referenced document element data, the priority of the document element data near which the referenced document element data are to be allocated, from the logical structure and the reference structure of the referencing document element data and based on the priority determination rules stored in the rule storage means; document shaping means for shaping the document by allocating the referenced document element data near the document element data determined by the allocation position priority determination means; and document output means for outputting the document shaped by the document shaping means.

2. A processing method of a document shaping device that includes logical structure extraction means, reference structure extraction means, rule storage means for storing priority determination rules for determining, from a logical structure and a reference structure, the priority of allocation positions of referenced document element data, allocation position priority determination means, document shaping means, and document output means, the method comprising the steps of: the logical structure extraction means extracting a hierarchical logical structure among a plurality of document element data that are constituent elements of a document; the reference structure extraction means extracting a reference structure indicating the relationship between document element data that are referenced by other document element data and the reference locations of the document element data that reference them; the allocation position priority determination means determining, when the reference structure extracted by the reference structure extraction means contains a plurality of document element data that reference the same referenced document element data, the priority of the document element data near which the referenced document element data are to be allocated, from the logical structure and the reference structure of the referencing document element data and based on the priority determination rules stored in the rule storage means; the document shaping means shaping the document by allocating the referenced document element data near the document element data determined by the allocation position priority determination means; and the document output means outputting the document shaped by the document shaping means.