JP2001014313A

JP2001014313A - Device and method for document processing, and recording medium

Info

Publication number: JP2001014313A
Application number: JP11189155A
Authority: JP
Inventors: Katashi Nagao; 確長尾
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1999-07-02
Filing date: 1999-07-02
Publication date: 2001-01-19

Abstract

PROBLEM TO BE SOLVED: To enable a user to accurately grasp document contents by performing a read-aloud processing corresponding to a displayed summary sentence. SOLUTION: A summary sentence of an electronic document can be presented to a user through display and voice output and the summary sentence on which a reading-aloud voice is based is nearly equal in document length to or longer than the displayed summary sentence. Namely, when a summary display area WS is too small in size to display a sufficient document as the summary sentence, a document which is long enough for the user to understand the contents is outputted as the reading-aloud voice (F30A) to enable the user to clearly know the summary sentence contents (electronic document contents) irrelevantly to the state of the summary sentence display area WS.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は電子文書処理に関
し、特に電子文書について要約文を作成し、それを表示
及び音声で出力することのできる文書処理装置に関する
ものである。[0001] 1. Field of the Invention [0002] The present invention relates to electronic document processing, and more particularly to a document processing apparatus capable of creating an abstract sentence of an electronic document, and displaying and outputting the summary sentence.

【０００２】[0002]

【従来の技術】従来、インターネットにおいて、ウィン
ドウ形式でハイパーテキスト型情報を提供するアプリケ
ーションサービスとしてＷＷＷ（World Wide Web）が提
供されている。2. Description of the Related Art Conventionally, WWW (World Wide Web) has been provided as an application service for providing hypertext information in a window format on the Internet.

【０００３】ＷＷＷは、文書の作成、公開または共有化
の文書処理を実行し、新しいスタイルの文書の在り方を
示したシステムである。しかし、文書の実際上の利用の
観点からは、文書の内容に基づいた文書の分類や要約と
いった、ＷＷＷを越える高度な文書処理が求められてい
る。このような高度な文書処理には、文書の内容の機械
的な処理が不可欠である。[0003] The WWW is a system for executing document processing for creating, publishing, or sharing a document, and showing the way of a new style document. However, from the viewpoint of practical use of documents, advanced document processing beyond WWW, such as classification and summarization of documents based on the contents of the documents, is required. For such advanced document processing, mechanical processing of the contents of the document is indispensable.

【０００４】しかしながら、文書の内容の機械的な処理
は、以下のような理由から依然として困難である。第１
に、ハイパーテキストを記述する言語であるＨＴＭＬ
（Hyper Text Markup Language）は、文書の表現につい
ては規定するが、文書の内容についてはほとんど規定し
ない。第２に、文書間に構成されたハイパーテキストの
ネットワークは、文書の読者にとって文書の内容を理解
するために必ずしも利用しやすいものではない。第３
に、一般に文章の著作者は読者の便宜を念頭に置かずに
著作するが、文書の読者の便宜が著作者の便宜と調整さ
れることはない。However, mechanical processing of the contents of a document is still difficult for the following reasons. First
HTML, a language that describes hypertext
(Hyper Text Markup Language) specifies the expression of a document, but hardly specifies the content of the document. Second, a network of hypertexts formed between documents is not always easy for a reader of the document to understand the contents of the document. Third
In general, the author of a document writes without considering the convenience of the reader, but the convenience of the reader of the document is not adjusted to the convenience of the author.

【０００５】このように、ＷＷＷは新しい文書の在り方
を示したシステムであるが、文書を機械的に処理しない
ので、高度な文書処理をおこなうことができなかった。
換言すると、高度な文書処理を実行するためには、文書
を機械的に処理することが必要となる。[0005] As described above, the WWW is a system showing the way of a new document. However, since the document is not mechanically processed, advanced document processing cannot be performed.
In other words, in order to perform advanced document processing, it is necessary to process the document mechanically.

【０００６】そこで、文書の機械的な処理を目標とし
て、文書の機械的な処理を支援するシステムが自然言語
研究の成果に基づいて開発されている。自然言語研究に
よる文書処理として、文書の著作者等による文書の内部
構造についての属性情報、いわゆるタグの付与を前提と
した、文書に付与されたタグを利用する機械的な文書処
理が提案されている。Therefore, a system for supporting mechanical processing of documents has been developed based on the results of natural language research, with the goal of mechanical processing of documents. As a document processing based on natural language research, mechanical document processing using tags attached to a document has been proposed on the assumption that attribute information about the internal structure of the document by the author of the document, so-called tags are added. I have.

【０００７】[0007]

【発明が解決しようとする課題】ところで、近年のコン
ピュータの普及や、ネットワーク化の進展に伴い、文章
処理や、文書の内容に依存した索引などで、テキスト文
書の作成、ラベル付け、変更などをおこなう文書処理の
高機能化が求められている。たとえば、ユーザの要望に
応じた文書の要約や、文書の分類等が望まれる。また、
文書やその要約を表示する文書ビューワや表示部の形態
に適した表示スタイルや、表示とともに行われる読み上
げ機能、さらには文書や画像の表示と読み上げの連動な
ど、よりユーザーにとって好適な処理が求められてい
る。特に要約文を表示する際には、表示エリアが小さい
場合は十分な表示ができない場合があるが、そのような
不十分な表示を伴って、文書内容をユーザーに好適に提
示できるようにすることが必要である。By the way, with the recent spread of computers and the progress of networking, text processing, creation of text documents, labeling, change, and the like have been performed by using an index depending on the contents of the documents. There is a demand for enhanced document processing to be performed. For example, it is desired to summarize a document or classify a document according to a user's request. Also,
Processing that is more suitable for users, such as display styles suitable for the form of the document viewer or display unit that displays documents and their summaries, read-aloud functions that accompany display, and interlocking of display of documents and images with read-aloud is required. ing. In particular, when displaying a summary, if the display area is small, it may not be possible to display it sufficiently.However, with such insufficient display, it is necessary to present the document contents to the user appropriately. is necessary.

【０００８】[0008]

【課題を解決するための手段】本発明は、このような事
情に鑑みて提案されたものであって、文書処理装置にお
いて、表示と音声出力を有効に使って、文書内容（要約
文内容）を的確にユーザーに提示できるようにすること
を目的とする。SUMMARY OF THE INVENTION The present invention has been proposed in view of such circumstances, and in a document processing apparatus, a document content (abstract text content) is used by effectively using display and voice output. The purpose of the present invention is to allow the user to be accurately presented.

【０００９】このため本発明の文書処理装置は、要約文
表示エリアのサイズに応じて電子文書の要約文データを
作成する要約作成手段と、要約作成手段で作成される要
約文データの文書長以上の文書長となる読み上げ用要約
文データを作成することのできる読み上げ要約作成手段
とを備えるようにする。つまり、表示用の要約文と、そ
れよりも長い文章としての読み上げ用要約文が生成され
るようにする。また本発明では、要約文表示エリアを有
する表示手段と、音声出力手段と、表示手段における要
約文表示エリアに要約作成手段で作成された要約文デー
タの表示出力を実行させるとともに、音声出力手段によ
り読み上げ要約作成手段で作成された読み上げ用要約文
データに基づく読み上げ音声の出力を実行させることが
できる制御手段とを備えるようにする。即ち、要約文を
表示手段による表示と音声出力手段による読み上げ音声
出力としてユーザーに提示できるようにするとともに、
読み上げ音声の元となる要約文は、表示される要約文と
文書長が同じもしくはそれを越えるものとされるように
する。つまり、表示手段の要約文表示エリアのサイズが
小さく、要約文として十分な文章を表示できない場合
に、読み上げ音声としてユーザーの内容理解に十分な文
書長の文書が出力されるようにする。For this reason, the document processing apparatus of the present invention has a summary creating means for creating summary text data of an electronic document according to the size of the summary text display area, and a document length of the summary text data created by the summary creation means. And a read-out summary creating means capable of creating read-out summary sentence data having the document length. That is, a display summary sentence and a read-out summary sentence as a longer sentence are generated. Further, in the present invention, the display means having the summary sentence display area, the audio output means, and the display and output of the summary sentence data created by the summary creation means in the summary sentence display area of the display means are performed by the audio output means. And a control means for outputting a read-out voice based on the read-out summary sentence data created by the read-out summary creating means. That is, the summary sentence can be presented to the user as a display by the display unit and a read-out voice output by the voice output unit,
The summary sentence that is the basis of the read-out voice is set to have the same or longer document length than the displayed summary sentence. In other words, when the size of the summary sentence display area of the display means is small and a sufficient sentence cannot be displayed as a summary sentence, a document having a document length sufficient for the user to understand the contents is output as a reading voice.

【００１０】また特に、要約文表示エリアのサイズは可
変とされる場合は、読み上げ要約作成手段は、要約文表
示エリアのサイズに応じて文書長を設定して、読み上げ
用要約文データを作成することで、要約文表示エリアの
サイズ状況に応じて最適な読み上げ出力を行うことがで
きるようにする。例えばユーザーが内容把握に十分な程
度の文書長で表示できる場合は、読み上げ用要約文デー
タのサイズは表示される要約文データと同等でよいが、
要約文表示エリアが小さく、表示される要約文データの
文書長が短くなって、内容理解に不十分と考えられる場
合は、より長い文書長で読み上げ用要約文データを作成
する。[0010] In particular, when the size of the summary sentence display area is variable, the reading summary creation means sets the document length in accordance with the size of the summary sentence display area and creates the reading summary sentence data. Thus, it is possible to perform optimal reading-out output according to the size of the summary sentence display area. For example, if the user can display the document with a length sufficient for comprehension of the content, the size of the read-out summary sentence data may be equal to the displayed summary sentence data.
When the summary sentence display area is small and the document length of the displayed summary sentence data is short and it is considered that the content is insufficient for comprehending the contents, the read-out summary sentence data is created with a longer document length.

【００１１】一方、装置によっては要約文表示エリアの
サイズが固定サイズとされる場合があるが、そのサイズ
が小さく要約文データが短いものとならざるを得ない場
合は、読み上げ要約作成手段は、要約作成手段で作成さ
れる要約文データの文書長を越える文書長となる読み上
げ用要約文データを作成し、それが音声として出力され
るようにすることで、ユーザーが十分に文書内容を把握
できるようにする。On the other hand, depending on the device, the size of the summary sentence display area may be a fixed size, but if the size is small and the summary sentence data must be short, the read-out summary creation means may include: The user can sufficiently comprehend the contents of the document by creating the read-out summary sentence data having a document length exceeding the document length of the summary sentence data created by the summary creation means and outputting it as audio. To do.

【００１２】本発明の文書処理方法は、電子文書に対し
て表示部の要約文表示エリアのサイズに応じた要約文デ
ータを作成する要約作成手順と、要約作成手順で作成さ
れる要約文データの文書長以上の文書長となる読み上げ
用要約文データを作成する読み上げ要約作成手順と、表
示部の要約文表示エリアに、要約作成手順で作成された
要約文データに基づく表示出力を行うとともに、音声出
力部から、読み上げ要約作成手順で作成された読み上げ
用要約文データに基づく読み上げ音声の出力を行う出力
手順とが行われるようにする。また本発明の記録媒体
は、このような各手順を有する動作制御プログラムが記
録されているものとする。According to the document processing method of the present invention, a summary creating procedure for creating summary sentence data according to the size of a summary sentence display area of a display unit for an electronic document, and a method for summarizing sentence data created in the summary creating procedure are provided. A reading summary creation procedure that creates reading summary sentence data that is longer than the document length, and a display output based on the summary sentence data created by the summary creation procedure in the summary sentence display area of the display unit, and voice The output unit performs an output procedure for outputting a reading voice based on the reading summary sentence data created in the reading summary creation procedure. Further, the recording medium of the present invention is assumed to record an operation control program having such procedures.

【００１３】[0013]

【発明の実施の形態】以下、本発明の実施の形態につい
て次の順序で説明する。この実施の形態としては、まず
文書処理装置の構成及び各種処理について説明し、その
後、要約文表示エリアが可変とされる例として、この文
書処理装置で実現される表示及び読み上げの動作を説明
する。またその後に、要約文表示エリアが固定とされる
例として、携帯型の文書処理装置（携帯端末装置）の構
成及び動作を説明する。１．文書処理装置の構成２．文書データ構造３．文書データに対する手動分類処理３−１処理手順３−２インデックス作成３−３文書閲覧／分類作成／分類操作３−４分類モデル作成／登録４．文書データに対する自動分類処理４−１処理手順４−２自動分類５．要約作成処理６．読み上げ処理７．表示される要約サイズに応じた読み上げ処理８．要約作成及び読み上げ処理９．携帯端末装置及び文書処理システム構成１０．携帯端末装置による文書受信処理１１．携帯端末装置による文書表示及び読み上げ処理１２．文書処理システム動作Embodiments of the present invention will be described below in the following order. In this embodiment, first, the configuration and various processes of the document processing apparatus will be described, and then, as an example in which the summary display area is variable, display and reading operations realized by this document processing apparatus will be described. . After that, as an example in which the summary sentence display area is fixed, the configuration and operation of a portable document processing device (portable terminal device) will be described. 1. 1. Configuration of document processing device 2. Document data structure 3. Manual classification process for document data 3-1 Processing procedure 3-2 Index creation 3-3 Document browsing / classification creation / classification operation 3-4 Classification model creation / registration 4. Automatic classification processing for document data 4-1 Processing procedure 4-2 Automatic classification Summary creation processing Read aloud processing 7. 7. Read-aloud processing according to the displayed summary size 8. Summarization and reading-out processing 9. Mobile terminal device and document processing system configuration 10. Document reception processing by mobile terminal device 11. Document display and reading-out processing by portable terminal device Document processing system operation

【００１４】１．文書処理装置の構成本発明の実施の形態としての文書処理装置１は、図１に
示すように、制御部１１およびインターフェース１２を
備える本体１０と、ユーザからの入力を受けて本体１０
に送る入力部２０と、外部との信号の送受信を行う通信
部２１と、本体１０からの出力を表示する表示部３０
と、記録媒体３２に対して情報を記録／再生する記録／
再生部３１と、音声出力部３３と、ＨＤＤ（ハードディ
スクドライブ）３４を有している。1. 1. Configuration of Document Processing Apparatus As shown in FIG. 1, a document processing apparatus 1 according to an embodiment of the present invention includes a main body 10 having a control unit 11 and an interface 12, and a main body 10 which receives an input from a user.
, An input unit 20 for sending and receiving signals to and from the outside, and a display unit 30 for displaying an output from the main body 10
Recording / reproduction for recording / reproducing information on / from the recording medium 32
It has a reproduction unit 31, an audio output unit 33, and an HDD (hard disk drive).

【００１５】本体１０は、制御部１１およびインターフ
ェース１２を有し、この文書処理装置１の主要な部分を
構成している。制御部１１は、この文書処理装置１にお
ける処理を実行するＣＰＵ１３と、揮発性のメモリであ
るＲＡＭ１４と、不揮発性のメモリであるＲＯＭ１５と
を有している。ＣＰＵ１３は、たとえばＲＯＭ１５に記
録された手順にしたがって、必要な場合にはデータを一
時的にＲＡＭ１４に格納して、プログラムを実行するた
めの制御をおこなう。この制御部１１の動作としては、
詳しくはそれぞれ後述していくが、供給された文書デー
タに関する分類処理、要約作成処理、読み上げ動作のた
めの音声読み上げ用ファイルの生成処理、及びこれらの
処理に必要な文書解析などがある。そしてこれらの動作
のために必要なプログラムやアプリケーションソフト
が、ＲＯＭ１５や、ＨＤＤ３４、あるいは記録媒体３２
に記憶されている。なお、制御部１１が用いる文書処理
プログラムは上記のようにあらかじめＲＯＭ１５に格納
されたり、あるいは、記録媒体３２やＨＤＤ３４から取
り込むことが考えられるが、例えば通信部２１を介して
インターネット等のネットワークから、外部サーバ等が
提供する文書処理プログラムをダウンロードすることも
考えられる。The main body 10 has a control unit 11 and an interface 12, and constitutes a main part of the document processing apparatus 1. The control unit 11 has a CPU 13 that executes processing in the document processing apparatus 1, a RAM 14 that is a volatile memory, and a ROM 15 that is a non-volatile memory. The CPU 13 temporarily stores data, if necessary, in the RAM 14 according to a procedure recorded in the ROM 15, for example, and performs control for executing the program. The operation of the control unit 11 is as follows.
As will be described later in detail, there are a classification process for the supplied document data, a summary creation process, a process of generating a speech reading file for a reading operation, and a document analysis required for these processes. The programs and application software necessary for these operations are stored in the ROM 15, the HDD 34, or the recording medium 32.
Is stored in The document processing program used by the control unit 11 may be stored in the ROM 15 in advance as described above, or may be fetched from the recording medium 32 or the HDD 34. It is also conceivable to download a document processing program provided by an external server or the like.

【００１６】インターフェース１２は、制御部１１、入
力部２０、通信部２１、表示部３０、記録／再生部３
１、音声出力部３３、ＨＤＤ３４に接続される。そして
インターフェース１２は、制御部１１の制御の下に、入
力部２０からのデータの入力、通信部２１との間のデー
タの入出力、表示部３０へのデータの出力、記録／再生
部３１に対するデータの入出力、音声出力部３３へのデ
ータの出力、ＨＤＤ３４に対するデータの入出力の各動
作をを行う。具体的には制御部１１と上記各部の間での
データの入出力のタイミングを調整したり、データの形
式を変換することなどを行う。The interface 12 includes a control unit 11, an input unit 20, a communication unit 21, a display unit 30, and a recording / reproducing unit 3.
1. Connected to the audio output unit 33 and HDD. Under the control of the control unit 11, the interface 12 inputs data from the input unit 20, inputs and outputs data to and from the communication unit 21, outputs data to the display unit 30, and controls the recording / reproducing unit 31. Each operation of data input / output, data output to the audio output unit 33, and data input / output to the HDD 34 is performed. More specifically, it adjusts the timing of data input / output between the control unit 11 and each of the above units, converts the format of data, and the like.

【００１７】入力部２０は、この文書処理装置１に対す
るユーザの入力を受ける部分である。この入力部２０
は、たとえばキーボードやマウスにより構成される。ユ
ーザは、この入力部２０を用い、キーボードによリキー
ワード等の文字を入力したり、マウスにより表示部３０
に表示されている電子文書のエレメントを選択すること
などができる。なお、以下では文書処理装置１で扱う電
子文書を、「文書データ」又は単に「文書」と称するこ
とにする。また「エレメント」とは文書を構成する要素
であって、たとえば文書、文および語が含まれる。The input section 20 is a section for receiving an input from the user to the document processing apparatus 1. This input unit 20
Is composed of, for example, a keyboard and a mouse. The user uses the input unit 20 to input characters such as keywords using a keyboard, or to display the display unit 30 using a mouse.
For example, the user can select an element of the electronic document that is displayed in. Hereinafter, an electronic document handled by the document processing apparatus 1 is referred to as “document data” or simply “document”. The “element” is an element constituting a document, and includes, for example, a document, a sentence, and a word.

【００１８】通信部２１は、この文書処理装置１に外部
からたとえば通信回線を介して送信される信号を受信し
たり、通信回線に信号を送信する部位である。この通信
部２１は、外部から送信された１又は複数の文書等のデ
ータを受信し、受信したデータを本体１０に送る。もち
ろん通信回線を介して外部装置にデータを送信すること
も行われる。The communication unit 21 is a unit that receives a signal transmitted from the outside to the document processing apparatus 1 via, for example, a communication line, and transmits a signal to the communication line. The communication unit 21 receives one or a plurality of data such as documents transmitted from the outside, and sends the received data to the main body 10. Of course, data is transmitted to an external device via a communication line.

【００１９】表示部３０は、この文書処理装置１の出力
としての文字や画像情報を表示する部位である。この表
示部３０は、たとえば陰極線管（cathode ray tube；CR
T）や液晶表示装置（Liquid crystal display；LCD）な
どにより構成され、たとえば単数または複数のウィンド
ウを表示し、このウィンドウ上に文字、図形等を表示す
る。The display section 30 is a section for displaying characters and image information as an output of the document processing apparatus 1. The display unit 30 is, for example, a cathode ray tube (CR).
T), a liquid crystal display (LCD), or the like, for example, displays one or more windows, and displays characters, figures, and the like on these windows.

【００２０】記録／再生部３１は、たとえばいわゆるフ
ロッピーディスクのような記録媒体３２に対してデータ
の記録／再生をおこなう。なお、ここでは記録媒体３２
の例としてフロッピーディスク（磁気ディスク）を例を
あげているが、もちろん光ディスク、光磁気ディスク、
メモリカードなど、可搬性メディアであれば記録媒体３
２の例として適用できる。そして記録／再生部３１は、
メディアに応じた記録再生装置（ディスクドライブ、カ
ードドライブなど）であればよい。The recording / reproducing section 31 records / reproduces data on / from a recording medium 32 such as a so-called floppy disk. Here, the recording medium 32
An example of a floppy disk (magnetic disk) is given as an example of an optical disk, a magneto-optical disk,
Recording medium 3 if it is a portable medium such as a memory card.
2 can be applied as an example. Then, the recording / reproducing unit 31
Any recording / reproducing device (disk drive, card drive, etc.) according to the media may be used.

【００２１】記録媒体３２が、文書を処理するための文
書処理プログラムが記録されているものである場合は、
記録／再生部３１は、その記録媒体３２から文書処理プ
ログラムを読み出して制御部１１に供給することができ
る。また記録媒体３２に文書データが記録されていれ
ば、記録／再生部３１でそれを読み出して制御部１１に
供給することができる。即ち文書処理装置１にとって、
通信部２１による文書データの受信とは別の文書データ
の入力態様となる。さらに、制御部１１は当該文書処理
装置１で処理した文書データを記録／再生部３１におい
て記録媒体３２に記録させることもできる。When the recording medium 32 has recorded therein a document processing program for processing a document,
The recording / reproducing unit 31 can read out the document processing program from the recording medium 32 and supply it to the control unit 11. If document data is recorded on the recording medium 32, the recording / reproducing unit 31 can read it out and supply it to the control unit 11. That is, for the document processing apparatus 1,
This is an input mode of the document data different from the reception of the document data by the communication unit 21. Further, the control unit 11 can cause the recording / reproducing unit 31 to record the document data processed by the document processing apparatus 1 on the recording medium 32.

【００２２】音声出力部３３は、文書処理装置１の出力
としての文書を、読み上げ音声として出力する部位であ
る。即ち音声出力部３３は、制御部１１が文書情報（後
述する読み上げ用ファイル）に基づいた音声合成処理に
より生成した音声信号が供給された際に、その音声信号
の出力処理を行うことで、表示部３０とともに文書処理
装置１の出力手段として機能する。The voice output section 33 is a section for outputting a document as an output of the document processing apparatus 1 as a reading voice. That is, when a voice signal generated by a voice synthesis process based on document information (a reading file to be described later) is supplied by the control unit 11, the voice output unit 33 performs output processing of the voice signal to perform display. Together with the unit 30, it functions as an output unit of the document processing apparatus 1.

【００２３】ＨＤＤ３４は、文書処理装置１における大
容量の記録領域を提供する。ＨＤＤ３４は、制御部１１
の制御に基づいて情報の記録／再生を行う。このＨＤＤ
３４は、制御部１１で実行される各種処理のためのアプ
リケーションプログラム、例えば音声合成のためのプロ
グラムなどの格納するために用いられたり、例えば当該
文書処理装置１に取り込まれた文書データ等を格納して
おく部位として用いることなどが可能となる。The HDD 34 provides a large-capacity recording area in the document processing apparatus 1. The HDD 34 includes the control unit 11
The recording / reproduction of information is performed based on the control of. This HDD
Reference numeral 34 denotes an application program for various processes executed by the control unit 11, for example, a program for speech synthesis, and the like, and stores, for example, document data and the like taken into the document processing apparatus 1. It can be used as a part to be kept.

【００２４】また本例の文書処理装置１には、携帯端末
リーダ２２を接続することができる。なお、この携帯端
末リーダ２２は例えば本体１０に内蔵されるものであっ
てもよい。この携帯端末リーダ２２は、携帯端末５０を
装着することで、携帯端末５０内に保持されている文書
データを読み出すことのできるものである。即ち制御部
１１は、携帯端末５０が携帯端末リーダ２２に装着され
ている際に、インターフェース１２を介して携帯端末リ
ーダ２２から文書データを受け取ることができる。制御
部１１は、受け取った文書データについては、通信部２
１から入力された文書データと同様に各種の処理を行う
ことができる。携帯端末５０及び携帯端末５０から転送
された文書データに関する処理ついては後に詳しく説明
する。A portable terminal reader 22 can be connected to the document processing apparatus 1 of this embodiment. The portable terminal reader 22 may be built in the main body 10, for example. The portable terminal reader 22 is capable of reading document data held in the portable terminal 50 by mounting the portable terminal 50. That is, the control unit 11 can receive document data from the portable terminal reader 22 via the interface 12 when the portable terminal 50 is mounted on the portable terminal reader 22. The control unit 11 transmits the received document data to the communication unit 2
Various processes can be performed in the same manner as the document data input from step 1. The processing related to the portable terminal 50 and the document data transferred from the portable terminal 50 will be described later in detail.

【００２５】２．文書データ構造続いて、本例における文書データの構造について説明す
る。本例においては、文書処理は、文書に付与された属
性情報であるタグを参照しておこなわれる。本例で用い
られるタグには、文書の構造を示す統語論的（syntacti
c）タグと、多言語間で文書の機械的な内容理解を可能
にするような意味的（semantic）・語用論的タグとがあ
る。2. Document Data Structure Next, the structure of document data in this example will be described. In this example, document processing is performed with reference to a tag that is attribute information assigned to the document. The tags used in this example include syntactic
c) There are tags and semantic / pragmatic tags that enable mechanical understanding of the content of documents between multiple languages.

【００２６】統語論的なタグとしては、文書の内部構造
を記述するものがある。タグ付けによる内部構造は、図
２に示すように、文書、文、語彙エレメント等の各エレ
メントが互いに、通常リンク、参照・被参照リンクによ
りリンクされて構成されている。図中において、白丸
“○”はエレメントを示し、最下位の白丸は文書におけ
る最小レベルの語に対応する語彙エレメントである。ま
た、実線は文書、文、語彙エレメント等のエレメント間
のつながり示す通常リンク（normal link）であり、破
線は参照・被参照による係り受け関係を示す参照リンク
（reference link）である。文書の内部構造は、上位か
ら下位への順序で、文書（documemt）、サブディビジョ
ン（subdivision）、段落（paragraph）、文（sentenc
e）、サブセンテンシャルセグメント（subsentential s
egment）、・・・、語彙エレメントから構成される。こ
のうち、サブディビジョンと段落はオプションである。Some syntactic tags describe the internal structure of a document. As shown in FIG. 2, the internal structure by tagging is configured such that each element such as a document, a sentence, and a vocabulary element is linked to each other by a normal link and a reference / referenced link. In the figure, a white circle “○” indicates an element, and the lowest white circle is a vocabulary element corresponding to the minimum level word in the document. The solid line is a normal link indicating a connection between elements such as a document, a sentence, and a vocabulary element, and the dashed line is a reference link indicating a dependency relationship by reference / reference. The internal structure of a document is document (documemt), subdivision (subdivision), paragraph (paragraph), sentence (sentenc
e), subsentential segment (subsentential s
egment), ..., vocabulary elements. Of these, subdivisions and paragraphs are optional.

【００２７】一方、意味論・語用論的なタグ付けとして
は、多義語の意味のように意味等の情報を記述するもの
がある。本例におけるタグ付けは、ＨＴＭＬ（Hyper Te
xt Markup Language）と同様なＸＭＬ（Extensible Mar
kup Language）の形式によるものである。On the other hand, as the semantic / pragmatic tagging, there is a method of describing information such as a meaning like a meaning of a polysemy. Tagging in this example is performed by HTML (Hyper Te
XML (Extensible Mar) similar to xt Markup Language
kup Language).

【００２８】タグ付けの一例を次に示すが、文書へのタ
グ付けはこの方法に限られない。また、以下では英語と
日本語の文書の例を示すが、タグ付けによる内部構造の
記述は他の言語にも同様に適用することができる。An example of tagging is shown below, but tagging a document is not limited to this method. In the following, examples of English and Japanese documents are shown, but the description of the internal structure by tagging can be similarly applied to other languages.

【００２９】たとえば、“Time flies like an arro
w.”という文については、下記のようなタグ付けをする
ことができる。＜＞が、文書に対して付与されたタグ
である。For example, "Time flies like an arro
The sentence "w." can be tagged as follows. <> is a tag added to the document.

【００３０】＜文＞＜名詞句語義＝“time０”＞time＜／名詞句＞＜動詞句＞＜動詞語義＝“fly１”＞flies＜／動詞＞＜副詞句＞＜副詞語義＝like０＞like＜／副詞＞＜名詞句＞an ＜名詞語義＝“arrow０”＞arrow＜／名詞＞＜／名詞句＞＜／副詞句＞＜／動詞句＞．＜／文＞<Sentence> <noun phrase word meaning = “time0”> time </ noun phrase> <verb phrase> <verb word meaning = “fly1”> flies <// verb> <adverb phrase> <adverb word meaning = like0> like < / Adverb> <noun phrase> an <noun meaning = “arrow0”> arrow </ noun> </ noun phrase> </ adverb phrase> </ verb phrase>. </ Sentence>

【００３１】ここで＜文＞、＜名詞＞、＜名詞句＞、＜
動詞＞、＜動詞句＞、＜副詞＞、＜副詞句＞は、それぞ
れ文、名詞、名詞句、動詞、動詞句、形容詞／副詞（前
置詞句または後置詞句を含む）、形容詞句／副詞句、を
示している。つまり文の統語構造（syntactic structur
e）を表している。Here, <sentence>, <noun>, <noun phrase>, <
Verbs, <verb phrases>, <adverbs>, <adverb phrases> are sentences, nouns, noun phrases, verbs, verb phrases, adjectives / adverbs (including prepositional or postpositional phrases), adjective phrases / adverbial phrases, respectively. Is shown. In other words, the syntactic structur of a sentence
e).

【００３２】これらのタグは、エレメントの先端の直前
および終端の直後に対応して配置される。エレメントの
終端の直後に配置されるタグは、記号“ ／”によりエ
レメントの終端であることを示している。エレメントと
は統語的構成素、すなわち句、節、および文のことであ
る。なお、語義（word sense）＝“time０”は、語“ti
me”の有する複数の意味、すなわち複数の語義のうちの
第０番目の意味を指している。具体的には、語“time”
には少なくとも名詞、形容詞、動詞の意味があるが、こ
こでは語“time”が名詞（＝第０番目の意味）であるこ
とを示している。同様に、語“オレンジ”は少なくとも
植物の名前、色、果物の意味があるが、これらも語義に
よって区別することができる。These tags are arranged immediately before the front end and immediately after the end of the element. The tag placed immediately after the end of the element indicates that it is the end of the element by the symbol "/". Elements are syntactic constructs: phrases, clauses, and sentences. Note that the meaning (word sense) = “time0” corresponds to the word “ti”.
It refers to the plural meanings of “me”, that is, the 0th meaning of the plural meanings.
Has at least the meanings of a noun, adjective, and verb. Here, it indicates that the word "time" is a noun (= the 0th meaning). Similarly, the word "orange" has at least the meaning of plant name, color, and fruit, but these can also be distinguished by their meaning.

【００３３】本例では、文書データについては、図３に
示すように、表示部３０上のウィンドウ１０１におい
て、その統語構造を表示することができる。このウィン
ドウ１０１においては、右半面１０３に語彙エレメント
が、左半面１０２に文の内部構造がそれぞれ表示されて
いる。In this example, the syntactic structure of the document data can be displayed in the window 101 on the display unit 30 as shown in FIG. In this window 101, vocabulary elements are displayed on the right half 103, and the internal structure of the sentence is displayed on the left half 102.

【００３４】例えば図示するようにこのウィンドウ１０
１には、タグ付けにより内部構造が記述された文章「Ａ
氏のＢ会が終わったＣ市で、一部の大衆紙と一般紙がそ
の写真報道を自主規制する方針を紙面で明らかにし
た。」の一部が表示されている。この文書のタグ付けの
例は次のようになる。For example, as shown in FIG.
1 has a sentence “A” in which the internal structure is described by tagging.
In C City, where his B meeting was over, some popular and general newspapers have stated on paper that they will voluntarily regulate their photographic coverage. Is displayed. An example of tagging this document is as follows:

【００３５】＜文書＞＜文＞＜副詞句関係＝“場所”＞＜名詞句＞＜副詞句場所＝“Ｃ市 ”＞＜副詞句関係＝“主語”＞＜名詞句識別子＝“Ｂ会”＞＜副詞句関係＝“ 所属”＞＜人名識別子＝“Ａ氏”＞Ａ氏＜／人名＞の＜／副詞句＞＜組織名識別子＝“Ｂ会”＞Ｂ会＜／組織名＞＜／名詞句＞が＜／副詞句＞終わった＜／副詞句＞＜地名識別子＝“Ｃ市”＞Ｃ市＜／地名＞＜／名詞句＞で、＜／副詞句＞＜副詞句関係＝“主語”＞＜名詞句識別子＝“press” 統語＝“並列”＞＜名詞句＞＜副詞句＞一部の＜／副詞句＞大衆紙＜／名詞句＞と＜名詞＞一般紙＜／名詞＞＜／名詞句＞が＜／副詞句＞＜副詞句関係＝“目的語”＞＜副詞句関係＝“内容” 主語＝“press”＞＜副詞句関係＝“目的語”＞＜名詞句＞＜副詞句＞＜名詞共参照＝“Ｂ会” ＞そ＜／名詞＞の＜／副詞句＞写真報道＜／名詞句＞を＜／副詞句＞自主規制する＜／副詞句＞方針を＜／副詞句＞＜副詞句間係＝“位置”＞紙面で＜／副詞句＞明らかにした。＜／文＞＜／文書＞<Document> <Sentence> <Adverb Phrase Relation = “Place”> <Noun Phrase> <Adverb Phrase Location = “C City”> <Adverb Phrase Relation = “Subject”> <Noun Phrase Identifier = “B Association” > <Adverb phrase relation = “affiliation”> <person name identifier = “Mr. A”> </ adverbular phrase of Mr. A <// person name> The noun phrase ends with </ adverb phrase> </ adverb phrase> <place name identifier = "C city"> C city </ place name> </ noun phrase> where </ adverb phrase> <adverb phrase relation = "subject" "> <Noun phrase identifier =“ press ”syntactic =“ parallel ”> <noun phrase> <adverb phrase> Some </ adverb phrase> popular paper </ noun phrase> and <noun> general paper </ noun> < / Noun phrase> is </ Adverb phrase> <Adverb phrase> Relation = "Object"> <Adverb phrase Relationship = "Content" Subject = "press"> <Adverb phrase Relation = "Object"> <Noun phrase> <Adverb Phrase> <noun Co-reference = "B-kai"> </ noun> </ adverb phrase> Photo coverage </ noun phrase> </ adverb phrase> Self-regulating </ adverb phrase> Policy </ adverb phrase> <adverb phrase> Coordinator = "position"> </ adverb phrase> on paper. </ Text> </ text>

【００３６】このようにタグ付されることで、各一対の
タグ＜＞〜＜／＞によって文書の構造が表現され
る。例えば＜文書＞〜＜／文書＞で１つの文書の範囲が
示され、同様に＜文＞〜＜／文＞で１つの文の範囲が示
される。また例えば、＜名詞句識別子＝“Ｂ会”＞〜
＜／名詞句＞により、「Ａ氏のＢ会」という部分が「Ｂ
会」を識別子とする名詞句として表現される。即ち上記
タグ付により、図３の左半面１０２に示した文の内部構
造が表現される。By being tagged in this way, the structure of the document is represented by each pair of tags <> to </>. For example, <document> to </ document> indicates the range of one document, and similarly, <sentence> to </ sentence> indicates the range of one sentence. Also, for example, <Noun phrase identifier = “B meeting”>
According to </ noun phrase>, the part "Mr. A's B meeting"
It is expressed as a noun phrase with "kai" as the identifier. That is, the internal structure of the sentence shown on the left half surface 102 in FIG. 3 is expressed by the tagging.

【００３７】さらに、この文書においては、「一部の大
衆紙と一般紙」は、統語＝“並列”というタグにより並
列であることが表されている。並列の定義は、係り受け
関係を共有するということである。特に何も指定がない
場合、たとえば、＜名詞句関係＝ｘ＞＜名詞＞Ａ＜／名
詞＞＜名詞＞Ｂ＜／名詞＞＜／名詞句＞は、ＡがＢに依
存関係のあることを表す。関係＝ｘは関係属性を表す。Further, in this document, "part of popular paper and general paper" is expressed in parallel by the tag "syntax =" parallel ". The definition of parallel is to share a dependency relationship. If nothing is specified, for example, <noun phrase relation = x><noun> A </ noun><noun> B </ noun></ noun phrase> indicates that A has a dependency on B Represent. Relation = x represents a relation attribute.

【００３８】関係属性は、統語、意味、修辞についての
相互関俵を記述する。主語、目的語、間接目的語のよう
な文法機能、動作主、被動作者、受益者などのような主
題役割、および理由、結果などのような修辞関係はこの
関係属性により記述される。本例では、主語、目的語、
間接目的語のような比較的容易な文法機能について関係
属性を記述する。The relation attribute describes an interaction between syntactic, meaning, and rhetorical. Grammar functions such as subjects, objects, and indirect objects, subject roles such as an actor, a subject, a beneficiary, etc., and rhetorical relations such as a reason and a result are described by the relation attributes. In this example, the subject, object,
Describe relative attributes for relatively easy grammar functions such as indirect objects.

【００３９】また、この文書においては、“Ａ氏”、
“Ｂ会”、“Ｃ市”のような固有名詞について、地名、
人名、組織名等のタグにより属性が記述されている。こ
れら地名、人名、組織名等のタグが付与されることで、
その語が固有名詞であることが表現される。In this document, "Mr. A",
For proper nouns such as "B Association" and "C City", place names,
Attributes are described by tags such as person names and organization names. By adding tags such as place name, person name, organization name, etc.,
It is expressed that the word is a proper noun.

【００４０】３．文書データに対する手動分類処理３−１処理手順本例の文書処理装置１では、例えば通信部２１（又は記
録／再生部３１）により外部から文書データが取り込ま
れると、その文書データを内容に応じて分類する処理を
行う。なお、以下の説明では、外部からの文書データは
通信部２１を介して取り込まれるとして述べていくが、
その説明は、外部からフロッピーディスク等の可搬性メ
ディアの形態で供給され、記録／再生部３１から文書デ
ータが取り込まれる場合も同様となるものである。3. Manual Classification Processing on Document Data 3-1 Processing Procedure In the document processing apparatus 1 of this example, when document data is taken in from the outside by, for example, the communication unit 21 (or the recording / reproducing unit 31), the document data is converted according to the content. Perform classification processing. In the following description, it is described that document data from the outside is taken in via the communication unit 21.
The description is the same when the document data is supplied from the outside in the form of a portable medium such as a floppy disk and the document data is taken in from the recording / reproducing unit 31.

【００４１】分類処理としては、文書データ内容に応じ
てユーザーが手動で分類する手動分類処理と、文書処理
装置１が自動的に分類する自動分類処理がある。これら
の分類処理は、後述する分類モデルに基づいて行われる
わけであるが、文書処理装置１においては、初期状態で
は分類モデルは存在しない。そのため初期状態にある時
点では、手動分類処理として、分類モデルの作成を含む
分類処理が必要になる。そして、分類モデルが生成され
た後においては、入力された文書データに対して自動分
類処理が可能となるものである。まずここでは、最初に
実行することが必要とされる手動分類処理について説明
する。即ちこの手動分類処理とは、初期状態にある文書
処理装置１が外部から送られた文書データを受信した際
に、ユーザーの操作に基づいて、制御部１１が分類モデ
ルの作成及び文書データの分類を行う動作となる。As the classification process, there are a manual classification process in which the user manually classifies according to the contents of the document data, and an automatic classification process in which the document processing apparatus 1 automatically classifies. These classification processes are performed based on a classification model described later. However, in the document processing apparatus 1, no classification model exists in an initial state. Therefore, at the time of the initial state, a classification process including creation of a classification model is required as a manual classification process. After the classification model is generated, the input document data can be automatically classified. First, a manual classification process that needs to be performed first will be described. That is, the manual classification process means that when the document processing apparatus 1 in the initial state receives document data sent from the outside, the control unit 11 creates a classification model and classifies the document data based on a user operation. Is performed.

【００４２】まず手動分類処理としての全体の処理手順
を図４に示す。なお、各処理ステップの詳細な処理につ
いては後述する。FIG. 4 shows the overall processing procedure as the manual classification processing. The detailed processing of each processing step will be described later.

【００４３】図４のステップＦ１１は、文書処理装置１
の受信部２１による文書受信処理を示している。このス
テップＦ１１では、受信部２１は、たとえば通信回線を
介して送信された１又は複数の文書を受信する。受信部
２１は、受信した文書を文書処理装置の本体１０に送
る。制御部１１は供給された１又は複数の文書データを
ＲＡＭ１４又はＨＤＤ３４に格納する。Step F11 in FIG.
2 shows a document receiving process by the receiving unit 21 of FIG. In step F11, the receiving unit 21 receives one or a plurality of documents transmitted via, for example, a communication line. The receiving unit 21 sends the received document to the main body 10 of the document processing device. The control unit 11 stores the supplied document data or document data in the RAM 14 or the HDD 34.

【００４４】ステップＦ１２では、文書処理装置１の制
御部１１は、受信部２１から送られた複数の文書の特徴
を抽出し、それぞれの文書の特徴情報すなわちインデッ
クスを作成する。制御部１１は、作成したインデックス
を、たとえばＲＡＭ１４又はＨＤＤ３４に記憶させる。
後述するがインデックスは、その文書に特徴的な、固有
名詞、固有名詞以外の語義などを含むものであり、文書
の分類や検索に利用できるものである。In step F12, the control section 11 of the document processing apparatus 1 extracts the features of the plurality of documents sent from the receiving section 21 and creates feature information, that is, an index of each document. The control unit 11 stores the created index in, for example, the RAM 14 or the HDD 34.
As will be described later, the index includes proper nouns, meanings other than proper nouns, and the like, which are characteristic of the document, and can be used for document classification and search.

【００４５】ステップＦ１３の文書閲覧は、ユーザーの
必要に応じて実行される処理である。つまりユーザーの
操作に応じて行われる。なお、このステップＦ１３や次
のステップＦ１４は、ユーザ操作に基づく処理である。
入力された文書データに対しては、ユーザーは所要の操
作を行うことにより、表示部３０の画面上で、その文書
内容を閲覧することができる。そして文書閲覧中は、ユ
ーザーは画面上のアイコン等に対する操作により、例え
ば後述する要約作成などの各種処理を指示できるが、こ
の手動分類処理に関しては、ステップＦ１４として示す
ように、分類項目の作成及び分類操作としての処理に進
むことになる。ステップＦ１４では、ユーザーが分類項
目（なお本明細書では、分類項目のことをカテゴリとも
いう）を設定する操作を行うことに応じて、制御部１１
は分類項目を生成／表示していく。またユーザーが文書
データを、設定された分類項目に振り分けていく操作も
行うことになり、それに応じて制御部１１は文書データ
の振り分け／表示を行うことになる。The document browsing in step F13 is a process executed as required by the user. That is, it is performed according to a user operation. Step F13 and the next step F14 are processes based on a user operation.
The user can view the content of the input document data on the screen of the display unit 30 by performing a required operation. While browsing the document, the user can instruct various processes such as summarization, which will be described later, by operating the icons and the like on the screen. The process will proceed to the classification operation. In step F14, in response to the user performing an operation of setting a classification item (the classification item is also referred to as a category in this specification), the control unit 11
Generates / displays classification items. The user also performs an operation of sorting the document data into the set classification items, and the control unit 11 sorts and displays the document data accordingly.

【００４６】ステップＦ１５では、制御部１１は、ステ
ップＦ１４でユーザーが行った分類項目作成及び分類操
作に応じて、分類モデルを作成する。分類モデルは、文
書を分類する複数の分類項目（カテゴリ）から構成され
るとともに、各カテゴリに対して各文書のインデックス
（ステップＦ１２で作成した各文書のインデックス）を
対応づけることで、分類状態を規定するデータである。
このような分類モデルを生成したら、ステップＦ１６
で、その分類モデルを登録する。即ち制御部１１は、分
類モデルをたとえばＲＡＭ１４に記憶させることで登録
を行う。以上の図４の処理により、文書処理状態１が初
期状態にある時に入力された１又は複数の各文書データ
について、手動分類及び分類モデルの作成が行われたこ
とになる。この図４のステップＦ１２以下の処理につい
て詳しく述べていく。In step F15, the control unit 11 creates a classification model according to the classification item creation and classification operation performed by the user in step F14. The classification model includes a plurality of classification items (categories) for classifying documents, and associates each document index with each category index (index of each document created in step F12) to change the classification state. This is the specified data.
After generating such a classification model, step F16
Then, the classification model is registered. That is, the control unit 11 performs the registration by storing the classification model in, for example, the RAM 14. By the processing of FIG. 4 described above, manual classification and creation of a classification model have been performed for one or a plurality of pieces of document data input when the document processing state 1 is in the initial state. The processing of step F12 and subsequent steps in FIG. 4 will be described in detail.

【００４７】３−２インデックス作成ステップＦ１４では、制御部１１は入力された文書デー
タについてインデックスの作成を行う。まず、或る１つ
の文書データに対して作成されたインデックスの具体例
を示す。3-2 Index Creation In step F14, the control section 11 creates an index for the input document data. First, a specific example of an index created for certain document data will be described.

【００４８】＜インデックス日付＝“AAAA/BB/CC” 時刻＝“DD:EE:FF” 文書アドレス＝ “1234”＞＜ユーザの操作履歴最大要約サイズ＝“100”＞＜選択エレメントの数＝“10”＞ピクチャーテル＜／選択＞・・・＜／ユーザの操作履歴＞＜要約＞減税規模、触れず−Ｘ首相の会見＜／要約＞＜語語義＝“0003” 中心活性値＝“140.6”＞触れず＜／語＞＜語語義＝“0105” 識別子＝“Ｘ” 中心活性値＝“67.2”＞首相＜／語＞＜人名識別子＝“Ｘ” 語語義＝“6103” 中心活性値＝“150.2”＞Ｘ首相＜／語／人名＞＜語語義＝“5301” 中心活性値＝“120.6”＞求めた＜／語＞＜語語義＝“2350” 識別子＝“Ｘ” 中心活性値＝“31.4”＞首相＜／語＞＜語語義＝“9582” 中心活性値＝“182.3”＞強調した＜／語＞＜語語義＝“2595” 中心活性値＝“93.6”＞触れる＜／語＞＜語帝義＝“9472” 中心活性値＝“12.0”＞予告した＜／語＞＜語語義＝“4934” 中心活性値＝“46.7”＞触れなかった＜／語＞＜語語義＝“0178” 中心活性値＝“175.7”＞釈明した＜／語＞＜語語義＝“7248” 識別子＝“Ｘ” 中心活性値＝“130.6”＞私＜／語＞＜語語義＝“3684” 識別子＝“Ｘ” 中心活性値＝“121.9”＞首相＜／語＞＜語語義＝“1824” 中心活性値＝“144.4.”＞訴えた＜／語＞＜語語義＝“7289” 中心活性値＝“176.8”＞見せた＜／語＞＜／インデックス＞<Index Date = "AAAA / BB / CC" Time = "DD: EE: FF" Document Address = "1234"> <User Operation Log Maximum Summary Size = "100"> <Number of Selected Elements = " 10 "> Picturetel </ selection> ...... </ user operation history> <summary> Tax reduction scale, not touching-Prime Minister X's meeting </ summary> <word meaning =“ 0003 ”central activity value =“ 140.6 ” > Do not touch </ word> <word meaning = “0105” identifier = “X” central activity value = “67.2”> Prime Minister </ word> <person name identifier = “X” word meaning = “6103” central activity value = “ 150.2 ”> X Prime Minister </ word / person name> <word meaning =“ 5301 ”central activity value =“ 120.6 ”> determined </ word> <word meaning =“ 2350 ”identifier =“ X ”central activity value =“ 31.4 ”> Prime Minister </ word> <word meaning =“ 9582 ”central activity value =“ 182.3 ”> emphasized </ word> <word meaning =“ 2595 ”central activity value “93.6”> Touch </ word> <word Teijin = “9472” Central activity value = “12.0”> Notified </ word> <word Sense = “4934” Central activity value = “46.7”> No touch < / Word> <word meaning = “0178” central activity value = “175.7”> declared </ word> <word meaning = “7248” identifier = “X” central activity value = “130.6”> I </ word> < Word Meaning = "3684" Identifier = "X" Central Activity Value = "121.9"> Prime Minister </ word> <Word Meaning = "1824" Central Activity Value = "144.4."> Complained </ word> <word Meaning = “7289” Central activity value = “176.8”> Showed </ word> </ index>

【００４９】このインデックスにおいては、＜インデッ
クス＞および＜／インデックス＞は、インデックスの始
端および終端を、＜日付＞および＜時刻＞はこのインデ
ックスが作成された日付および時刻を、＜要約＞および
＜／要約＞はこのインデックスの内容の要約の始端およ
び終端を、それぞれ示している。また、＜語＞および＜
／語＞は語の始端および終端を示している。さらに例え
ば、語義＝“0003”は、第３番目の語義であることを示
している。他についても同様である。上述したように、
同じ語でも複数の意味を持つ場合があるので、それを区
別するために語義ごとに番号が予め決められており、そ
の該当する語義が番号で表されているものである。In this index, <index> and </ index> indicate the start and end of the index, <date> and <time> indicate the date and time when this index was created, <summary> and </ Summary> indicates the start and end of the summary of the contents of this index, respectively. Also, <word> and <
/ Word> indicates the beginning and end of the word. Further, for example, the meaning = “0003” indicates that it is the third meaning. The same applies to other cases. As mentioned above,
Since the same word may have a plurality of meanings, a number is predetermined for each meaning in order to distinguish them, and the corresponding meaning is represented by a number.

【００５０】また、＜ユーザの操作履歴＞および＜／ユ
ーザの操作履歴＞は、ユーザの操作履歴の始端および終
端を、＜選択＞および＜／選択＞は、適訳されたエレメ
ントの始端および終端を、それぞれ示している。最大要
約サイズ＝“100”は、要約の最大のサイズが１００文
字であることを、エレメントの数＝“10”は、選択され
たエレメントの数が１０であることを示している。<User operation history> and </ User operation history> are the start and end of the user's operation history, and <Select> and </ Select> are the start and end of the translated element. Are respectively shown. The maximum summary size = “100” indicates that the maximum size of the summary is 100 characters, and the number of elements = “10” indicates that the number of selected elements is 10.

【００５１】この例のように、インデックスは、その文
書に特徴的な、固有名詞、固有名詞以外の語義などを含
むものである。例えばこのようなインデックスを作成す
るステップＦ１２の処理を、図５〜図８で説明する。な
お、図５は１つの文書データに対するインデックス作成
処理を示しており、従って複数の文書データについて処
理を行う場合は、各文書データについてこの図５の処理
が行われることになる。また図５のステップＦ３１の詳
細な処理を図７に示し、さらに図７のステップＦ４３の
詳細な処理を図８に示している。As in this example, the index includes proper nouns, meanings other than proper nouns, and the like, which are characteristic of the document. For example, the process of step F12 for creating such an index will be described with reference to FIGS. FIG. 5 shows an index creation process for one document data. Therefore, when a process is performed for a plurality of document data, the process of FIG. 5 is performed for each document data. Further, FIG. 7 shows a detailed process of step F31 in FIG. 5, and FIG. 8 shows a detailed process of step F43 in FIG.

【００５２】上述した図４のステップＦ１２のインデッ
クス作成処理としては、まず図５のステップＦ３１の活
性拡散が行われる。この活性拡散とは、文書データにつ
いて、エレメントの中心活性値を文書の内部構造に基づ
いて拡散することで、中心活性値の高いエレメントと関
わりのあるエレメントにも高い中心活性値を与えるよう
な処理である。即ち、文書を構成する各エレメントに対
して初期値としての中心活性値を与えた後、その中心活
性値を、文書の内部構造、具体的にはリンク構造に基づ
いて拡散する。この中心活性値は、タグ付けによる内部
構造に応じて決定されるので、文書の特徴の抽出等に利
用されるものである。制御部１１は、このステップＦ３
１として、活性拡散を行い、活性拡散の結果として得ら
れた各エレメントの中心活性値を、たとえばＲＡＭ１４
に記憶させることになる。In the index creation processing in step F12 in FIG. 4 described above, first, active diffusion in step F31 in FIG. 5 is performed. This active diffusion is a process in which the central activity value of an element is spread based on the internal structure of a document with respect to document data, so that an element related to an element having a high central activity value also has a high central activity value. It is. That is, after giving a central activity value as an initial value to each element constituting the document, the central activity value is diffused based on the internal structure of the document, specifically, the link structure. Since this central activity value is determined according to the internal structure by tagging, it is used for extracting features of a document. The control unit 11 determines in step F3
As 1, the active diffusion is performed, and the central active value of each element obtained as a result of the active diffusion is stored in, for example, the RAM 14.
Will be stored.

【００５３】ステップＦ３１の活性拡散について、図６
〜図８で詳しく説明していく。まずエレメントとエレメ
ントのリンク構造の例を図６に示す。図６においては、
文書を構成するエレメントとリンクの構造の一部とし
て、エレメントＥ１、Ｅ２の周辺を示している。Ｅ１〜
Ｅ８はエレメントの例であり、この中でエレメントＥ
１、Ｅ２に注目して説明する。FIG. 6 shows the active diffusion in step F31.
8 will be described in detail. First, an example of a link structure between elements is shown in FIG. In FIG.
The periphery of elements E1 and E2 is shown as part of the structure of the elements and links that make up the document. E1
E8 is an example of an element, in which element E
1 and E2 will be described.

【００５４】エレメントＥ１の中心活性値はｅ１である
とし、またエレメントＥ２の中心活性値はｅ２であると
する。このエレメントＥ１，Ｅ２は、リンクＬ１２（上
述した通常リンクもしくは参照リンク）にて接続されて
いる。リンクＬ１２のエレメントＥ１に接続する端点を
Ｔ１２、エレメントＥ２に接続する端点をＴ２１とす
る。エレメントＥ１は、さらにエレメントＥ３，Ｅ４，
Ｅ５と、それぞれリンクＬ１３，Ｌ１４，Ｌ１５で接続
されている。各リンクＬ１３，Ｌ１４，Ｌ１５における
エレメントＥ１側の端点をそれぞれＴ１３，Ｔ１４，Ｔ
１５とする。またエレメントＥ２は、エレメントＥ６，
Ｅ７，Ｅ８とも、それぞれリンクＬ２６，Ｌ２７，Ｌ２
８で接続されている。各リンクＬ２６，Ｌ２７，Ｌ２８
におけるエレメントＥ２側の端点をそれぞれＴ２６，Ｔ
２７，Ｔ２８とする。このようなリンク構造の例を用い
ながら、図７、図８の活性拡散処理を説明していく。It is assumed that the central activity value of the element E1 is e1, and the central activity value of the element E2 is e2. The elements E1 and E2 are connected by a link L12 (the normal link or the reference link described above). The end point of the link L12 connected to the element E1 is T12, and the end point of the link L12 connected to the element E2 is T21. Element E1 further includes elements E3, E4,
E5 is connected to links L13, L14, and L15, respectively. The end points of the links L13, L14, and L15 on the element E1 side are denoted by T13, T14, and T, respectively.
It is assumed to be 15. Element E2 is composed of elements E6 and E6.
Links L26, L27, L2 for E7 and E8 respectively
8 are connected. Each link L26, L27, L28
At the element E2 side are denoted by T26 and T26, respectively.
27 and T28. The active diffusion processing of FIGS. 7 and 8 will be described using an example of such a link structure.

【００５５】図７のステップＦ４１で制御部１１は、イ
ンデックス作成対象としての文書データについて活性拡
散を開始するにあたり、まず文書データの全エレメント
について中心活性値の初期設定を行う。中心活性値の初
期値としては、例えば固有名詞や、ユーザーが選択（ク
リック）したエレメント等に高い値を与えるようにす
る。また制御部１１は、参照リンクと通常リンクに関し
て、エレメントを連結するリンクの端点Ｔ(xx)の端点活
性値を０に設定する。制御部１１は、このように付与し
た端点活性値の初期値を、たとえばＲＡＭ１４に記憶さ
せる。In step F41 of FIG. 7, when starting the active diffusion for the document data to be indexed, the control unit 11 first initializes the central active value for all elements of the document data. As the initial value of the central activity value, for example, a high value is given to a proper noun, an element selected (clicked) by a user, or the like. Further, the control unit 11 sets the end point activation value of the end point T (xx) of the link connecting the elements to 0 for the reference link and the normal link. The control unit 11 stores, for example, the RAM 14 with the initial value of the endpoint activation value thus assigned.

【００５６】ステップＦ４２においては、制御部１１
は、文書を構成するエレメントＥｉを計数するカウンタ
の初期化をおこなう。すなわち、エレメントを計数する
カウンタのカウント値ｉを１に設定する。ｉ＝１の場
合、このカウンタは、第１番目のエレメント（例えば図
６のエレメントＥ１）を参照することになる。In step F42, the control unit 11
Initializes a counter for counting the elements Ei constituting the document. That is, the count value i of the counter for counting elements is set to one. If i = 1, this counter will refer to the first element (eg, element E1 in FIG. 6).

【００５７】ステップＦ４３においては、制御部１１
は、カウンタが参照するエレメントについて、新たな中
心活性値を計算する中心活性値更新処理を実行する。こ
の中心活性値更新処理について、エレメントＥ１につい
ての処理を例に挙げながら、図８で詳しく説明する。こ
の中心活性値更新処理は、エレメントについての端点活
性値を更新し、さらに更新された端点活性値と現在の中
心活性値を用いて、新たな中心活性値を算出する処理と
なる。In step F43, the control unit 11
Executes a central activation value update process for calculating a new central activation value for the element referenced by the counter. This center activation value update process will be described in detail with reference to FIG. 8 by taking the process for the element E1 as an example. The center activation value updating process is a process of updating the end point activation value of the element, and calculating a new center activation value using the updated end point activation value and the current center activation value.

【００５８】図８のステップＦ５１では、制御部１１
は、文書を構成するエレメントＥｉ（例えばこの場合Ｅ
１）に一端が接続されたリンクの数を計数するカウンタ
の初期化をおこなう。すなわち、リンクを計数するカウ
ンタのカウント値ｊを１に設定する。ｊ＝１の場合、こ
のカウンタは、エレメントＥｉと接続された第１番目の
リンクＬ（yy）を参照することになる。図６の例では、
エレメントＥ１についての第１のリンクとして例えばリ
ンクＬ１２を参照する。In step F51 of FIG.
Is an element Ei (for example, in this case E
In 1), a counter for counting the number of links having one end connected thereto is initialized. That is, the count value j of the counter for counting the links is set to one. When j = 1, this counter refers to the first link L (yy) connected to the element Ei. In the example of FIG.
For example, the link L12 is referred to as the first link for the element E1.

【００５９】ステップＦ５２で制御部１１は、参照中の
リンク、つまりエレメントＥ１とＥ２を接続するリンク
Ｌ１２について、関係属性のタグを参照することにより
通常リンクであるか否かを判断する。制御部１１は、リ
ンクＬ１２が通常リンクであればステップＦ５３に、一
方リンクＬ１２が参照リンクであればステップＦ５４に
処理を進める。In step F52, the control unit 11 determines whether or not the link being referred to, that is, the link L12 connecting the elements E1 and E2, is a normal link by referring to the tag of the related attribute. If the link L12 is a normal link, the control unit 11 proceeds to step F53, and if the link L12 is a reference link, proceeds to step F54.

【００６０】リンクＬ１２が通常リンクと判断されてス
テップＦ５３に進んだ場合は、制御部１１は、エレメン
トＥ１の通常リンクＬ１２に接続された端点Ｔ１２の新
たな端点活性値を計算する処理をおこなう。端点Ｔ１２
の端点活性値ｔ１２は、リンク先のエレメントＥ２の端
点活性値のうち、リンクＬ１２以外のリンクに接続する
すべての端点の各端点活性値（この場合Ｔ２６、Ｔ２
７、Ｔ２８の各端点活性値ｔ２６、ｔ２７，ｔ２８）
と、エレメントＥ２の中心活性値ｅ２を加算し、この加
算で得た値を、文書に含まれるエレメントの総数で除す
ることにより求められる。制御部１１は、この様な演算
を、ＲＡＭ１４から読み出した各端点活性値および各中
心活性値を用いて行うことで、通常リンクと接続された
端点についての新たな端点活性値を算出し、算出した端
点活性値を、ＲＡＭ１４に記憶させる。つまり端点Ｔ１
２の端点活性値ｔ１２を更新する。If the link L12 is determined to be a normal link and the process proceeds to step F53, the control section 11 performs a process of calculating a new endpoint activation value of the endpoint T12 connected to the normal link L12 of the element E1. Endpoint T12
Are the endpoint activation values of all the endpoints connected to the link other than the link L12 among the endpoint activation values of the linked element E2 (in this case, T26 and T2).
7, each end point activation value t26, t27, t28 of T28)
And the central activation value e2 of the element E2, and the value obtained by this addition is divided by the total number of elements included in the document. The control unit 11 performs such an operation using each endpoint activity value and each center activity value read from the RAM 14, thereby calculating a new endpoint activity value for the endpoint connected to the normal link. The calculated end point activation value is stored in the RAM 14. That is, the end point T1
The end point activation value t12 of No. 2 is updated.

【００６１】一方、ステップＦ５２でリンクＬ１２が参
照リンクであると判断され、ステップＦ５４に進んだ場
合は、同じく制御部１１は、通常リンクＬ１２に接続さ
れたエレメントＥ１の端点Ｔ１２の新たな端点活性値を
計算する処理をおこなうことになるが、端点活性値の算
出のための演算は次のようになる。即ちこの場合は、端
点Ｔ１２の端点活性値ｔ１２は、リンク先のエレメント
Ｅ２の端点活性値のうち、リンクＬ１２以外のリンクに
接続するすべての端点の各端点活性値（この場合Ｔ２
６、Ｔ２７、Ｔ２８の各端点活性値ｔ２６、ｔ２７，ｔ
２８）と、エレメントＥ２の中心活性値ｅ２を加算した
値とする。（つまり除算がない点が上記通常リンクの場
合と異なるものとなる）そして制御部１１は、この様な
演算を、ＲＡＭ１４から読み出した各端点活性値および
各中心活性値を用いて行うことで、参照リンクと接続さ
れた端点についての新たな端点活性値を算出し、算出し
た端点活性値を、ＲＡＭ１４に記憶させる。つまり端点
Ｔ１２の端点活性値ｔ１２を更新する。On the other hand, when it is determined in step F52 that the link L12 is a reference link and the process proceeds to step F54, the control unit 11 similarly sets a new end point activation of the end point T12 of the element E1 connected to the normal link L12. The processing for calculating the value is performed. The calculation for calculating the end point activity value is as follows. That is, in this case, the endpoint activation values t12 of the endpoint T12 are the endpoint activation values of all the endpoints connected to the link other than the link L12 among the endpoint activation values of the linked element E2 (in this case, T2
6, each end point activation value t26, t27, t of T27, T28
28) and the central activation value e2 of the element E2. (That is, the point where there is no division is different from the case of the above-mentioned normal link.) Then, the control unit 11 performs such an operation using each of the endpoint activation values and each of the center activation values read from the RAM 14, A new endpoint activity value is calculated for the endpoint connected to the reference link, and the calculated endpoint activity value is stored in the RAM. That is, the end point activation value t12 of the end point T12 is updated.

【００６２】このようなステップＦ５３又はＦ５４の処
理を行なったら、制御部１１はステップＦ５５での判別
処理を介して（判別結果がＮＯであれば）ステップＦ５
７に進み、カウント値ｊをインクリメントしてステップ
Ｆ５２に戻る。即ち続いて、カウント値ｊ＝２とされる
ことにより、エレメントＥ１についての第２のリンク
（例えばリンクＬ１３）が参照されることになるため、
上記同様にステップＦ５２以降の処理でリンクＬ１３に
接続される端点Ｔ１３の端点活性値ｔ１３が算出／更新
されることになる。After performing the processing in step F53 or F54, the control unit 11 performs the processing in step F5 through the determination processing in step F55 (if the determination result is NO).
Proceeding to 7, the count value j is incremented, and the process returns to step F52. That is, subsequently, by setting the count value j = 2, the second link (for example, link L13) for the element E1 is referred to.
In the same manner as described above, the end point activation value t13 of the end point T13 connected to the link L13 is calculated / updated in the processing after step F52.

【００６３】ステップＦ５５では、制御部１１は、現在
カウント値ｉで参照中のエレメントＥｉ（Ｅ１）につい
て、全てのリンクについての新たな端点活性値が計算さ
れたか否かを判別して処理を分岐するものであるため、
端点活性値の更新処理は、参照中のエレメントＥｉの全
ての端点活性値が更新されるまで行われる。つまりステ
ップＦ５７でカウント値ｊがインクリメントされながら
処理が繰り返されることで、例えばエレメントＥ１につ
いては、端点Ｔ１２，Ｔ１３，Ｔ１４，Ｔ１５について
それぞれ端点活性値ｔ１２，ｔ１３，ｔ１４，ｔ１５が
更新されていき、その全てが更新された時点で、処理は
ステップＦ５５からＦ５６に進むことになる。In step F55, the control unit 11 determines whether or not a new endpoint activation value has been calculated for all the links for the element Ei (E1) currently referred to by the count value i, and branches the processing. Because
The process of updating the endpoint activation values is performed until all endpoint activation values of the element Ei being referred to are updated. That is, by repeating the process while incrementing the count value j in step F57, for the element E1, for example, the endpoint activation values t12, t13, t14, and t15 are updated for the endpoints T12, T13, T14, and T15, respectively. When all of them have been updated, the process proceeds from step F55 to F56.

【００６４】エレメントＥｉについての全ての端点活性
値が求められたことに応じて、ステップＦ５６では、更
新された端点活性値を用いて、エレメントＥｉの新たな
中心活性値ｅｉを算出する。エレメントＥｉの新たな中
心活性値ｅｉは、エレメントＥｉの現在の中心活性値ｅ
ｉとエレメントＥｉのすべての端点の新たな端点活性値
の和で求められる。例えば図６のエレメントＥ１の場合
は、新たな中心活性値ｅ１(new)は、ｅ１(new)＝ｅ１＋
ｔ１２＋ｔ１３＋ｔ１４＋ｔ１５となる。In step F56, a new central activity value ei of the element Ei is calculated by using the updated endpoint activity values in response to the determination of all the endpoint activity values for the element Ei. The new central activation value ei of the element Ei is the current central activation value e of the element Ei.
It is determined by the sum of new endpoint activation values of all endpoints of i and element Ei. For example, in the case of the element E1 in FIG. 6, the new central activation value e1 (new) is e1 (new) = e1 +
t12 + t13 + t14 + t15.

【００６５】制御部１１は、このようにして現在カウン
ト値ｉで参照中のエレメントＥｉの中心活性値ｅｉを算
出する。そして、制御部１１は、計算した新たな中心活
性値ｅｉをＲＡＭ１４に記憶させる。つまりエレメント
Ｅｉの中心活性値ｅｉを更新する。（但しこの時点で
は、後述するステップＦ４５の処理で用いるため、旧中
心活性値も保持しておく）The control unit 11 calculates the central activation value ei of the element Ei being referred to by the current count value i in this way. Then, the control unit 11 causes the RAM 14 to store the calculated new central activation value ei. That is, the central activation value ei of the element Ei is updated. (However, at this point, the old center activation value is also held because it is used in the process of step F45 described later.)

【００６６】図７のステップＦ４３の中心活性値更新処
理として、以上図８に示したような処理が行われるた
ら、制御部１１の処理は図７のステップＦ４４に進み、
制御部１１は、文書中のすべてのエレメントについて中
心活性値更新処理が完了したか否かを判断する。具体的
には、制御部１１は、カウント値ｉが、文書に含まれる
エレメントの総数に達したか否かを判断する。制御部１
１は、すべてのエレメントについて中心活性値更新処理
が完了していないときは、ステップＦ４７に処理を進
め、カウント値ｉをインクリメントしてステップＦ４３
に戻る。例えば上記のようにエレメントＥ１についての
処理が終わった後であれば、カウント値ｉ＝２とされ
て、今度はエレメントＥ２が参照されることになる。そ
してエレメントＥ２について、ステップＦ４３の中心活
性値更新処理（即ち図８の処理）が上記同様に行われ
る。重複説明となるため詳細は述べないが、図６のリン
ク例でいえば、エレメントＥ２の場合は、図８の処理に
おいて端点Ｔ２１，Ｔ２６，Ｔ２７，Ｔ２８の各端点活
性値ｔ２１，ｔ２６，ｔ２７，ｔ２８が更新された後、
新たな中心活性値ｅ２(new)が、ｅ２(new)＝ｅ２＋ｔ２１＋ｔ２６＋ｔ２７＋ｔ２８として算出され、更新されることになる。When the processing shown in FIG. 8 is performed as the central activation value updating processing in step F43 in FIG. 7, the processing of the control unit 11 proceeds to step F44 in FIG.
The control unit 11 determines whether or not the central activation value update processing has been completed for all elements in the document. Specifically, control unit 11 determines whether or not count value i has reached the total number of elements included in the document. Control unit 1
When the central activation value update processing has not been completed for all the elements, the process proceeds to step F47, where the count value i is incremented, and
Return to For example, after the processing for the element E1 is completed as described above, the count value i is set to 2 and the element E2 is referred to this time. Then, for the element E2, the central activation value update processing of step F43 (that is, the processing of FIG. 8) is performed in the same manner as described above. Although the details will not be described because they are redundant description, in the case of the link example of FIG. 6, in the case of the element E2, the end point activation values t21, t26, t27 of the end points T21, T26, T27, T28 in the processing of FIG. After t28 is updated,
A new central activity value e2 (new) is calculated as e2 (new) = e2 + t21 + t26 + t27 + t28, and is updated.

【００６７】図７の処理においては、このようにステッ
プＦ４７でカウント値ｉがインクリメントされて参照エ
レメントが変更されながらステップＦ４３の中心活性値
更新処理が繰り返されることで、文書に含まれる全ての
エレメントの中心活性値が更新されていくことになる。In the process shown in FIG. 7, the count value i is incremented in step F47, and the central activation value updating process in step F43 is repeated while changing the reference element. Will be updated.

【００６８】文書中のすべてのエレメントについて中心
活性値の更新が完了したときは、処理はステップＦ４４
からＦ４５に進むことになる。ステップＦ４５において
は、制御部１１は、文書に含まれるすべてのエレメント
の中心活性値の変化分、すなわち新たに計算された中心
活性値の元の中心活性値に対する変化分について平均値
を計算する。例えば制御部１１は、ＲＡＭ１４に記憶さ
れた旧中心活性値と、更新した新たな中心活性値を、文
書に含まれるすべてのエレメントについて読み出す。そ
して各エレメントについて新中心活性値と旧中心活性値
の差分を求め、その差分の総和をエレメントの総数で除
することにより、すべてのエレメントの中心活性値の変
化分の平均値を計算する。制御部１１は、このように計
算したすべてのエレメントの中心活性値の変化分の平均
値を、たとえばＲＡＭ１４に記憶させる。When the updating of the central activation value has been completed for all the elements in the document, the processing is terminated at step F44.
To F45. In step F45, the control unit 11 calculates the average value of the change in the central activity values of all the elements included in the document, that is, the change in the newly calculated central activity value from the original central activity value. For example, the control unit 11 reads the old central activation value stored in the RAM 14 and the updated new central activation value for all elements included in the document. Then, for each element, the difference between the new central activity value and the old central activity value is obtained, and the sum of the differences is divided by the total number of elements to calculate the average value of the change in the central activity value of all elements. The control unit 11 causes the RAM 14 to store, for example, the average value of the change in the central activity value of all elements calculated in this way.

【００６９】続いてステップＦ４６において制御部１１
は、ステップＦ４５で計算した平均値が、あらかじめ設
定された閾値以内であるか否かを判断する。そして、制
御部１１は、上記平均値が閾値以内である場合は、活性
拡散処理としての一連の行程を終了するが、上記平均値
が閾値以内でないときには、ステップＦ４２にもどっ
て、上述した一連の行程を再び実行する。Subsequently, at step F46, the control unit 11
Determines whether the average calculated in step F45 is within a preset threshold. When the average value is within the threshold value, the control unit 11 ends a series of steps as the active diffusion process. When the average value is not within the threshold value, the control unit 11 returns to step F42 and performs the above-described series of processes. Perform the process again.

【００７０】この一連の活性拡散処理は、中心活性値が
高いエレメントに関連のある（リンクする）エレメント
について、その中心活性値を引き上げていく処理といえ
るものである。ところが、この活性拡散を１回行うのみ
では、インデックス作成処理の目的を考えたときに、本
来中心活性値を引き上げられるべきエレメントの中で、
中心活性値が十分に引き上げられないものが発生する場
合もありうる。例えば、１回の活性拡散では、中心活性
値の初期値が高く設定されたエレメントに直接リンクす
るエレメントについては、或る程度中心活性値が引き上
げられるが、直接リンクしていないエレメントは、それ
がインデックスとして重要なエレメントであっても十分
に中心活性値が引き上げられないことが生ずる。そこ
で、ステップＦ４６の判断を介して、必要に応じて活性
拡散処理を複数回行うようにすることで、全体的に中心
活性値が収束されるようにし、中心活性値が引き上げら
れない重要なエレメントがなるべく生じないようにする
ものである。なお、複数回の活性拡散で、全体的に中心
活性値が収束されていくのは、活性拡散処理で更新され
た各エレメントの中心活性値に基づいて、さらに次の活
性拡散処理で各エレメントの中心活性値が更新されてい
くためである。但し、このような活性拡散処理が多数回
行われすぎると、全エレメントの中心活性値が収束しき
ってほぼ同値となるような事態となり、不適切である。
このため、ステップＦ４５，Ｆ４６の処理として、中心
活性値の変化分の平均値を求めるように、その変化分に
基づいて活性拡散処理の終了タイミングを判断すること
で、インデックス作成に好適な活性拡散が実現されるこ
とになる。This series of active diffusion processing can be said to be processing for raising the central activity value of an element related (linked) to an element having a high central activity value. However, by performing this activation diffusion only once, when considering the purpose of the index creation processing, among the elements that should be able to raise the central activation value,
In some cases, the central activity value may not be sufficiently raised. For example, in one activation diffusion, for an element directly linked to an element for which the initial value of the central activation value is set high, the central activation value is raised to some extent, but for an element that is not directly linked, it is Even if the element is important as an index, the central activity value may not be sufficiently raised. Therefore, by performing the active diffusion process a plurality of times as necessary based on the determination in step F46, the central activity value can be converged as a whole, and important elements that cannot be increased in the central activity value As much as possible. The reason why the central activity value is converged as a whole in the active diffusion process a plurality of times is that the central active value of each element updated in the active diffusion process is based on the central activity value of each element, and further the active activity diffusion process is performed for each element. This is because the central activity value is updated. However, if such activation diffusion processing is performed too many times, the central activation values of all the elements converge and become almost the same, which is inappropriate.
For this reason, in the processing of steps F45 and F46, the end timing of the active diffusion process is determined based on the change so as to obtain the average value of the change of the central active value, so that the active diffusion suitable for index creation is performed. Will be realized.

【００７１】以上の図７、図８のような活性拡散処理
（即ち図５のステップＦ３１）が完了したら、制御部１
１の処理は図５のステップＦ３２に進むことになる。ス
テップＦ３２においては、制御部１１は、ステップＦ３
１で得られた各エレメントの中心活性値に基づいて、中
心活性値があらかじめ設定された閾値を超えるエレメン
トを抽出する。制御部１１は、このように抽出したエレ
メントをＲＡＭ１４に記憶させる。When the active diffusion processing as shown in FIGS. 7 and 8 (ie, step F31 in FIG. 5) is completed, the controller 1
The process 1 proceeds to step F32 in FIG. In step F32, the control unit 11 determines in step F3
Based on the central activity value of each element obtained in step 1, an element whose central activity value exceeds a preset threshold is extracted. The control unit 11 causes the RAM 14 to store the extracted elements.

【００７２】続いてステップＦ３３においては、制御部
１１は、ステップＦ３２にて抽出したエレメントをたと
えばＲＡＭ１４から読み出す。そして制御部１１は、こ
の抽出したエレメントの中からすべての固有名詞を取り
出してインデックスに加える。固有名詞は語義を持た
ず、辞書に載っていないなどの特殊の性質を有するので
固有名詞以外の語とは別に扱うものである。なお語義と
は、前述したように、語の有する複数の意味のうちの各
意味に対応したものである。各エレメントが固有名詞で
あるか否かは、文書に付されたタグに基づいて判断する
ことができる。たとえば、図３に示したタグ付けによる
内部構造においては、“Ａ氏”、“Ｂ会”および“Ｃ
市”は、タグによる関係属性がそれぞれ“人名”、“組
織名”および“地名”であるので固有名詞であることが
分かる。そして、制御部１１は、取り出した固有名詞を
インデックスに加え、その結果をＲＡＭ１４に記憶させ
る。Subsequently, in step F33, the control section 11 reads out the elements extracted in step F32, for example, from the RAM 14. Then, the control unit 11 extracts all proper nouns from the extracted elements and adds them to the index. Proper nouns have special properties such as having no meaning and not appearing in dictionaries, and are therefore treated separately from words other than proper nouns. As described above, the word meaning corresponds to each of a plurality of meanings of the word. Whether or not each element is a proper noun can be determined based on a tag attached to the document. For example, in the internal structure by tagging shown in FIG. 3, "A", "B" and "C"
“City” is a proper noun since the related attributes of the tags are “person name”, “organization name”, and “place name”, and the control unit 11 adds the extracted proper noun to the index, and The result is stored in the RAM 14.

【００７３】次のステップＦ３４においては、制御部１
１は、ステップＦ３２にて抽出したエレメントの中か
ら、固有名詞以外の語義を取り出してインデックスに加
え、その結果をＲＡＭ１４に記憶させる。In the next step F34, the control unit 1
1 extracts the meanings other than proper nouns from the elements extracted in step F32, adds them to the index, and stores the result in the RAM 14.

【００７４】以上の処理により、例えば上記した具体例
のようなインデックスが生成される。即ちインデックス
は、タグ付けされた文書の特徴を発見して、その特徴を
配列したものとなり、その文書の特徴は、文書の内部構
造に応じて拡散処理された中心活性値に基づいて判断さ
れるものとなる。そしてこのようなインデックスは、文
書を代表するような特徴を表す語義および固有名詞を含
むので、所望の文書を参照する際に用いることができ
る。なお、インデックスには、文書の特徴を表す語義お
よび固有名詞とともに、その文書がＲＡＭ１４（又はＨ
ＤＤ３４）において記憶された位置を示す文書アドレス
を含めておく。By the above processing, for example, an index as in the above specific example is generated. That is, the index finds the features of the tagged document and arranges the features, and the features of the document are determined based on the central activity value diffused according to the internal structure of the document. It will be. Since such an index includes meanings and proper nouns representing features representative of a document, it can be used when referring to a desired document. The index includes, in addition to the meanings and proper nouns representing the characteristics of the document, the document in the RAM 14 (or H
The document address indicating the position stored in DD34) is included.

【００７５】３−３文書閲覧／分類作成／分類操作以上の図５〜図８で説明したインデックス作成処理は図
４のステップＦ１２で行われるものとなる。従って図４
の手動分類処理としては、続いてステップＦ１３，Ｆ１
４の処理、即ち上述したようにユーザーによる閲覧及び
手動分類の処理に移る。3-3 Document Browsing / Category Creation / Category Operation The index creation processing described above with reference to FIGS. 5 to 8 is performed in step F12 of FIG. Therefore, FIG.
As for the manual classification processing of step F13 and step F1
The process proceeds to the process of No. 4, that is, the process of browsing and manual classification by the user as described above.

【００７６】上述のように、図４のステップＦ１３にお
いては、ユーザーは表示部３０に表示される文書を閲覧
することができる。またステップＦ１４においては、ユ
ーザーが分類項目を設定する操作や、文書データを、設
定された分類項目に振り分けていく操作を行うことがで
きる。このステップＦ１３，Ｆ１４で行われる操作や、
それに対応する制御部１１の処理及び表示部３０の表示
例は以下のようになる。As described above, in step F13 of FIG. 4, the user can browse the document displayed on the display unit 30. In step F14, the user can perform an operation of setting a classification item and an operation of sorting document data into the set classification item. The operations performed in steps F13 and F14,
The corresponding process of the control unit 11 and a display example of the display unit 30 are as follows.

【００７７】図９、図１０は表示部３０における表示の
具体例を示している。まず図９は、詳しくは後述する分
類モデルに対応した文書分類ウインドウ２０１の表示例
である。即ち、文書分類の表示に用いられるグラフィッ
クユーザインターフェース（graphic user interface；
GUI）の具体例となる。この文書分類ウィンドウ２０１
には、操作用のボタン表示２０２として、画面のウィン
ドウの状態を初期の位置にもどすポジションリセット
（position reset）ボタン２０２ａと、文書の内容を閲
読するブラウザ（browser）を呼び出すブラウザボタン
２０２ｂと、このウィンドウからの脱出（exit）ボタン
２０２ｃとが表示される。FIGS. 9 and 10 show specific examples of the display on the display unit 30. FIG. First, FIG. 9 is a display example of a document classification window 201 corresponding to a classification model described later in detail. That is, a graphic user interface (graphic user interface) used for displaying a document classification.
GUI). This document classification window 201
The operation button display 202 includes a position reset button 202a for returning a screen window state to an initial position, a browser button 202b for calling a browser (browser) for reading the contents of a document, An exit button 202c is displayed.

【００７８】また、この文書分類ウィンドウ３０１は、
分類モデルに対応する分類項目に応じた小ウインドウと
して、文書分類エリア２０３，２０４，２０５・・・が
形成される。文書分類エリア２０３は、“他のトピック
ス”を表示するエリアとされる。この”他のトピック
ス”の文書分類エリア２０３は、まだ分類されていない
文書が提示される領域となる。例えば図４のステップＦ
１１で受信された各文書（つまりこれから分類しようと
する文書）は、この”他のトピックス”の文書分類エリ
ア２０３に提示される。文書分類エリア２０４は、例え
ば”ビジネスニュース”に分類された文書が提示される
領域となる。文書分類エリア２０５は、例えば”政治ニ
ュース”に分類された文書が提示される領域となる。こ
れら以外にも、図中で符号を付していない文書分類エリ
アは、それぞれ特定の分類項目に応じた文書が提示され
る領域となる。The document classification window 301
Document classification areas 203, 204, 205,... Are formed as small windows corresponding to the classification items corresponding to the classification model. The document classification area 203 is an area for displaying “other topics”. The “other topics” document classification area 203 is an area where documents that have not been classified yet are presented. For example, step F in FIG.
Each document received at 11 (that is, a document to be classified) is presented in the "other topics" document classification area 203. The document classification area 204 is an area where documents classified as, for example, “business news” are presented. The document classification area 205 is an area in which documents classified as, for example, “political news” are presented. In addition to these, document classification areas without reference numerals in the figure are areas where documents corresponding to specific classification items are presented.

【００７９】これらの各文書分類エリア２０３，２０４
・・・では、その各文書分類エリアに設定された分類項
目（カテゴリ）に分類された文書が、その文書のアイコ
ンと文書のタイトルにより提示される。タイトルがない
場合には、一文の要約が表示される。また各文書分類エ
リア２０３，２０４・・・の大きさは固定的ではなく、
ユーザーがドラッグ操作などにより各文書分類エリアを
区切る区切枠２１１，２１２，２１３・・・を移動させ
ることにより、各文書分類エリア２０３，２０４・・・
の面積を任意に変更させることができる。文書分類エリ
アの数もユーザーが任意に増減できる。Each of these document classification areas 203 and 204
.., The documents classified into the classification items (categories) set in the respective document classification areas are presented by the icon of the document and the title of the document. If there is no title, a summary of one sentence is displayed. Also, the size of each of the document classification areas 203, 204,... Is not fixed,
When the user moves the delimiting frames 211, 212, 213,... Separating the document classification areas by a drag operation or the like, the document classification areas 203, 204,.
Can be arbitrarily changed. The user can arbitrarily increase or decrease the number of document classification areas.

【００８０】また各文書分類エリア２０３，２０４・・
・のタイトル（例えば「政治ニュース」など）は、ユー
ザーが任意に設定、変更できるものである。なお、この
文書分類エリアの数及び各タイトルは、後述する分類モ
デルの分類項目に応じたものとなる。言い換えれば、ユ
ーザーがこの分類ウインドウ２０１においてマウスやキ
ーボード等による入力部２０からの操作で、文書分類エ
リアの設定や削除、或いはタイトル設定を行うことで、
分類モデルの分類項目の数やタイトルが設定されること
になる。Each of the document classification areas 203, 204,.
The title (for example, "political news") can be arbitrarily set and changed by the user. Note that the number of document classification areas and each title correspond to the classification items of the classification model described later. In other words, the user sets or deletes a document classification area or performs a title setting in the classification window 201 by operating the input unit 20 using a mouse, a keyboard, or the like.
The number and title of the classification items of the classification model are set.

【００８１】図１０は、ユーザーが文書データの内容を
閲覧する閲覧ウインドウ３０１の例を示している。例え
ばユーザーが、図９の分類ウインドウ２０１において或
る文書をクリックして選択した状態としたうえで、ブラ
ウザボタン２０２ｂをクリックすることで、制御部１１
は図１０のように選択された文書を表示する閲覧ウイン
ドウ３０１を開くようにする。FIG. 10 shows an example of a browsing window 301 for a user to browse the contents of document data. For example, the user clicks on a certain document in the classification window 201 of FIG. 9 to select the document, and then clicks the browser button 202b, so that the control unit 11
Opens a browsing window 301 for displaying the selected document as shown in FIG.

【００８２】この閲覧ウインドウ３０１には、文書デー
タファイルのファイル名を表示するファイル名表示部３
０２、そのファイル名の文書データを表示する文書表示
部３０３、文書表示部３０３に表示された文書の要約文
を表示する要約表示部３０４、キーワードの入力／表示
を行うキーワード表示部３０５が設けられる。また操作
用のボタン表示３０６として、要約文の作成を指示する
ための要約作成ボタン３０６ａ、アンドゥ操作（操作取
消）を行うためのアンドゥボタン３０６ｂ、読み上げ動
作を実行させるための読み上げボタン３０６ｃなどが表
示される。In the browsing window 301, a file name display section 3 for displaying the file name of the document data file
02, a document display unit 303 for displaying the document data of the file name, a summary display unit 304 for displaying a summary of the document displayed on the document display unit 303, and a keyword display unit 305 for inputting / displaying a keyword. . As the operation button display 306, a summary creation button 306a for instructing creation of a summary sentence, an undo button 306b for performing an undo operation (operation cancellation), a reading button 306c for executing a reading operation, and the like are displayed. Is done.

【００８３】この様な閲覧ウインドウ３０１において、
ユーザーは文書表示部３０３に表示される文書を閲覧す
ることができる。なお、文書の全体を表示しきれないと
きは、文書の一部が表示される。もちろんスクロール操
作を行うことで、全文を閲覧できる。また、ユーザーは
要約作成ボタン３０６ａをクリックすることで、文書表
示部３０３に表示される文書についての要約文を作成さ
せ、要約表示部３０４に表示させることができる。な
お、要約文作成のための制御部１１の処理については後
述する。さらにユーザーは、読み上げボタン３０６ｃを
クリックすることで、文書表示部３０３に表示されてい
る文書の本文又は要約文についての読み上げを実行させ
ることができる。この読み上げ動作についても後述す
る。In such a browsing window 301,
The user can browse the document displayed on the document display unit 303. If the entire document cannot be displayed, a part of the document is displayed. Of course, you can browse the full text by scrolling. In addition, by clicking the summary creation button 306a, the user can cause a summary sentence for the document displayed on the document display unit 303 to be created and displayed on the summary display unit 304. The process of the control unit 11 for creating the summary sentence will be described later. Further, by clicking the read-aloud button 306c, the user can cause the text or the summary of the document displayed on the document display section 303 to be read aloud. This reading operation will also be described later.

【００８４】以上のような分類ウインドウ２０１、閲覧
ウインドウ３０１は、図４の手動分類処理の際に限ら
ず、ユーザーの操作に応じて随時表示部２０に表示され
るものであるが、図４の手動分類処理に関していえば、
ユーザーは受信した文書の種類や内容を、分類ウインド
ウ２０１、閲覧ウインドウ３０１で確認することができ
るものである。具体的には、図４のステップＦ１１で受
信された１又は複数の文書は、ステップＦ１２でのイン
デックス作成処理の後、図９のような分類ウインドウ２
０１における”他のトピックス”の文書分類エリア２０
３に表示される。この分類ウインドウ２０１において、
ユーザーは、文書分類エリア２０３に表示された各文書
を手動で分類していくことになるが、例えば文書のタイ
トルだけ等では内容がわからない場合は、図１０の閲覧
ウインドウ３０１により文書内容を確認する。そのよう
にユーザの必要に応じて行われる閲覧が図４のステップ
Ｆ１３の処理となる。The classification window 201 and the browsing window 301 as described above are displayed on the display unit 20 at any time according to the user's operation, not limited to the manual classification processing in FIG. As for the manual classification process,
The user can confirm the type and content of the received document in the classification window 201 and the browsing window 301. Specifically, one or a plurality of documents received in step F11 in FIG. 4 are subjected to the index creation processing in step F12, and then the classification window 2 shown in FIG.
01 "other topics" document classification area 20
3 is displayed. In this classification window 201,
The user manually classifies each of the documents displayed in the document classification area 203. If, for example, the content is not known only by the title of the document, the user checks the content of the document by using the browsing window 301 in FIG. . The browsing performed as needed by the user in this manner is the process of step F13 in FIG.

【００８５】ステップＦ１４としては、ユーザーは分類
ウインドウ２０１上において分類項目の追加、更新、削
除等を任意に行うことができ、その操作に応じて、制御
部１１は表示される文書分類エリア２０３、２０４・・
・の表示態様（数、面積、タイトル等）を変更させてい
く。なお、ユーザーによる分類項目（文書分類エリアの
タイトル）の設定／変更は、それが後述する分類モデル
に反映されることになる。In step F14, the user can arbitrarily add, update, or delete a classification item on the classification window 201. In response to the operation, the control unit 11 causes the displayed document classification area 203, 204 ...
・ Change the display mode (number, area, title, etc.). The setting / change of the classification item (title of the document classification area) by the user is reflected in the classification model described later.

【００８６】ユーザーは必要に応じて分類項目の設定を
行った後、文書分類エリア２０３に表示されている各文
書を、各文書分類エリアに振り分けていく。つまりユー
ザーの手動により、文書を分類する。具体的には、”他
のトピックス”の文書分類エリア２０３に表示されてい
る文書のアイコンを、例えば入力部２０のマウスを用
い、所望の分類項目（カテゴリ）に対応する文書分類エ
リアにドラッグすることによりおこなう。例えばユーザ
ーは、「スポーツ」というタイトルの文書分類エリアを
設定したうえで、”他のトピックス”の文書分類エリア
２０３に表示されているスポーツ関連の文書のアイコン
を、“スポーツ”の文書分類エリアにドラッグするよう
な操作を行う。このようにして手動で分類された各文書
のアイコンやタイトルは、以降、そのドラッグされた先
の文書分類エリア内で表示される。After setting the classification items as necessary, the user sorts the documents displayed in the document classification area 203 into the respective document classification areas. That is, the documents are classified manually by the user. Specifically, the icon of the document displayed in the document classification area 203 of “other topics” is dragged to the document classification area corresponding to a desired classification item (category) using, for example, the mouse of the input unit 20. Do it by doing. For example, the user sets a document classification area titled “Sports”, and changes the icon of a sports-related document displayed in the document classification area 203 of “Other topics” to the document classification area of “Sports”. Perform an operation like dragging. The icons and titles of the documents that have been manually classified in this manner are thereafter displayed in the dragged destination document classification area.

【００８７】３−４分類モデル作成／登録以上のようにユーザーによる手動分類操作が行われた
ら、制御部１１は図４のステップＦ１５において、ユー
ザの分類操作に基づいた複数のカテゴリからなる分類モ
デルを作成する。すなわち制御部１１は、各カテゴリに
分類された上記複数の文書のインデックスを集めて、分
類モデルを生成する。そして、分類モデルの各カテゴリ
に上記複数の文書を分類する。3-4 Creation / Registration of Classification Model As described above, when the user performs the manual classification operation, the control unit 11 determines in step F15 in FIG. 4 that the classification model is composed of a plurality of categories based on the user's classification operation. Create That is, the control unit 11 collects indices of the plurality of documents classified into each category and generates a classification model. Then, the plurality of documents are classified into each category of the classification model.

【００８８】分類モデルは、文書を分類する複数の分類
項目（カテゴリ）から構成される。そして各カテゴリに
ついて、分類された文書が示されるデータ形態となる。
各文書については、上記ステップＦ１２などでインデッ
クスが形成されるが、分類モデルは例えば図１１（ａ）
に示すように、各カテゴリについて分類された文書のイ
ンデックスが対応づけられたようなデータ構造となる。
この図１１（ａ）では、カテゴリとして「スポーツ」
「会社」「コンピュータ」・・・等が設定されている
が、これらは上記のように分類ウインドウ２０１におい
てユーザーが設定した分類項目となる。なお、もちろん
ユーザーが設定しなくとも、予め設定されている（つま
り分類ウインドウで文書分類エリアとして表示される）
カテゴリがあってもよい。そして各分類項目にはインデ
ックスＩＤＸ１、ＩＤＸ２・・・が対応づけられるが、
即ち各分類項目には、ユーザーが上記のように分類した
文書のインデックスが対応づけられるものとなる。The classification model is composed of a plurality of classification items (categories) for classifying documents. Then, for each category, a data format is provided in which the classified documents are shown.
An index is formed for each document in step F12 and the like, and the classification model is, for example, as shown in FIG.
As shown in (1), the data structure is such that the indexes of the documents classified for each category are associated with each other.
In FIG. 11A, the category is “sports”.
“Company”, “Computer”, etc. are set, but these are the classification items set by the user in the classification window 201 as described above. Of course, even if the user does not set, it is set in advance (that is, displayed as a document classification area in the classification window).
There may be categories. Each of the classification items is associated with an index IDX1, IDX2,.
That is, the index of the document classified by the user as described above is associated with each classification item.

【００８９】各分類項目に対応づけられるインデックス
は、分類ウインドウ２０１においてその分類項目の文書
分類エリアに表示されている文書のインデックスであ
る。例えばインデックスＩＤＸ１がカテゴリ「スポー
ツ」に対応づけられているのは、ユーザーが、分類ウイ
ンドウ２０１において「スポーツ」をタイトルとする文
書分類エリアを作成し、さらにインデックスＩＤＸ１の
文書のアイコンを、その「スポーツ」をタイトルとする
文書分類エリアにドラッグするという手動分類を行った
ことに基づくものとなる。The index associated with each classification item is the index of a document displayed in the document classification area of the classification item in the classification window 201. For example, the index IDX1 is associated with the category "sports" because the user creates a document classification area with the title "sports" in the classification window 201, and furthermore, sets the icon of the document of the index IDX1 to "sports". Is manually dragged to a document classification area having a title of "".

【００９０】ところで上述のように各文書のインデック
スは、固有名詞、固有名詞以外の語義や文書アドレス等
を含んでいる。そして、例えば図１１（ａ）のように１
つの分類項目には１又は複数のインデックスが対応づけ
られるが、インデックスとして固有名詞、語義、文書ア
ドレス等が含まれるため、分類モデルは図１１（ｂ）の
ようにも表すことができる。As described above, the index of each document includes proper nouns, meanings other than proper nouns, document addresses, and the like. Then, for example, as shown in FIG.
One or a plurality of indices are associated with one classification item. Since the indexes include proper nouns, meanings, document addresses, and the like, the classification model can be represented as shown in FIG. 11B.

【００９１】即ち図１１（ｂ）に示すように、分類モデ
ルは、各カテゴリに対応するカテゴリインデックスとし
て、固有名詞、固有名詞以外の語義、文書アドレスの欄
を有する構造となる。そして分類モデルにおいては、各
カテゴリ「スポーツ」「社会」「コンピュータ」「植
物」「美術」「イベント」に対して、固有名詞“Ａ氏、
・・・”、“Ｂ氏、・・・”、“Ｃ社、Ｇ社、・・
・”、“Ｄ種、・・・”、“Ｅ氏、・・・”および“Ｆ
氏”等の固有名詞が割り当てられる。また、“野球（４
５４６）、グランド（２３４３）、・・・”、“労働
（３１１２）、固有（９８２１）、・・・”、“モパイ
ル（２１０２）、・・・”、“桜１（１１１１１）、オ
レンジ１（９９１１）”、“桜２（１１１１２）、オレ
ンジ２（９９１２）”および“桜３（１１１１３）”等
の語義も各カテゴリに割り当てられる。さらに文書アド
レス“ＳＰ１、ＳＰ２、ＳＰ３、・・・”、“Ｓ０１、
Ｓ０２、Ｓ０３、・・・”、“ＣＯ１、ＣＯ２、ＣＯ
３、・・・”、“ＰＬ１、ＰＬ２、ＰＬ３、・・・”、
“ＡＲ１、ＡＲ２、ＡＲ３、・・・”および“ＥＶ１、
ＥＶ２、ＥＶ３、・・・”も各カテゴリに割り当てられ
る。That is, as shown in FIG. 11B, the classification model has a structure having columns of proper nouns, meanings other than proper nouns, and document addresses as category indexes corresponding to each category. Then, in the classification model, the proper noun “Mr. A” for each category “sports” “society” “computer” “plant” “art” “event”
... "," Mr. B, ... "," Company C, Company G, ...
"," D type, ... "," Mr. E, ... "and" F
A proper noun such as "Mr." is also assigned.
546), ground (2343), ... "," labor (3112), unique (9821), ... "," mopile (2102), ... "," cherry blossom 1 (11111), orange 1 ( 9911) "," Sakura 2 (11112), Orange 2 (9912) ", and" Sakura 3 (11113) "are also assigned to each category. Further, the document addresses" SP1, SP2, SP3,... " “S01,
S02, S03,... "," CO1, CO2, CO
3, ... "," PL1, PL2, PL3, ... ",
"AR1, AR2, AR3, ..." and "EV1,
EV2, EV3,... "Are also assigned to each category.

【００９２】なお、“桜１”“桜２”“桜３”は、
“桜”の第１の語義（１１１１１）、第２の語義（１１
１１２）、第３の語義（１１１１３）を示している。ま
た、“オレンジ１”“オレンジ２”は、“オレンジ”の
第１の語義（９９１１）、第２の語義（９９１２）を示
している。たとえば“オレンジ１”は植物のオレンジを
表し、“オレンジ２”はオレンジ色を表す。固有名詞以
外の場合に語そのものではなく語義を用いるのは、この
様に、同じ語でも複数の意味を有することがあるからで
ある。Note that "Sakura 1", "Sakura 2" and "Sakura 3"
The first meaning (11111) and the second meaning (1111)
112) and the third meaning (11113). “Orange 1” and “Orange 2” indicate the first meaning (9911) and the second meaning (9912) of “orange”. For example, "orange 1" represents a plant orange, and "orange 2" represents an orange color. The reason that the meaning is used instead of the word itself in cases other than proper nouns is that the same word may have a plurality of meanings.

【００９３】図４のステップＦ１５では、ユーザーの手
動分類操作に応じて例えばこの様な分類モデルが生成さ
れる。そしてステップＦ１６として分類モデルが登録、
即ちＲＡＭ１５（又はＨＤＤ３４）に記録される。この
ように分類モデルが生成／登録されることにより、文書
の分類が行われたことになる。In step F15 in FIG. 4, for example, such a classification model is generated according to the manual classification operation by the user. Then, the classification model is registered as Step F16,
That is, it is recorded in the RAM 15 (or the HDD 34). The generation / registration of the classification model in this way means that the classification of the document has been performed.

【００９４】なお、このように図４におけるステップＦ
１５、Ｆ１６として分類モデルの作成／登録が行われた
後は、後述する自動分類処理や、ユーザーの分類項目の
編集、或いは手動分類操作などに応じて、分類モデルは
逐次更新されていくことになる。分類モデルが更新され
ると、分類モデルに更新日時が記録される。図１１に
は、更新日時として“１９９８年１２月１０日１９時５
６分１０秒”が記録されている。Note that step F in FIG.
After the creation / registration of the classification model is performed as F15 and F16, the classification model is sequentially updated in accordance with automatic classification processing described later, editing of a classification item by a user, or manual classification operation. Become. When the classification model is updated, the update date and time are recorded in the classification model. FIG. 11 shows that the update date and time “19:05 on December 10, 1998
6 minutes and 10 seconds "are recorded.

【００９５】４．文書データに対する自動分類処理４−１処理手順本例の文書処理装置１では、上記のように一旦分類モデ
ルが作成された後は、例えば通信部２１により外部から
取り込まれた文書データを、自動的に分類していく自動
分類処理が可能となる。即ち以下説明する自動分類処理
とは、文書処理装置１が外部から送られた文書データを
受信した際に、その文書データを分類モデルに対して分
類していく処理となる。なお、この例では、一つの文書
を受信する毎に以下説明する自動分類処理をおこなうこ
ととするか、複数の所定数の文書を受信する度におこな
ってもよいし、ユーザが図９の画面を開く操作をしたと
きにそれまでに受信した全文書に対して自動分類処理を
おこなうようにしてもよい。4. 4. Automatic Classification Processing for Document Data 4-1 Processing Procedure In the document processing apparatus 1 of the present example, after the classification model is once created as described above, for example, the document data taken in from the outside by the communication unit 21 is automatically converted. The automatic classification process of classifying into is possible. That is, the automatic classification process described below is a process of, when the document processing apparatus 1 receives document data sent from the outside, classifying the document data with respect to a classification model. In this example, the automatic classification process described below may be performed each time one document is received, or may be performed each time a plurality of predetermined documents are received. The automatic classification process may be performed on all the documents received so far when the operation of opening is performed.

【００９６】自動分類処理としての全体の処理手順を図
１２に示す。図１２のステップＦ２１は、文書処理装置
１の受信部２１による文書受信処理を示している。この
ステップＦ２１では、受信部２１は、たとえば通信回線
を介して送信された１又は複数の文書を受信する。受信
部２１は、受信した文書を文書処理装置の本体１０に送
る。制御部１１は供給された１又は複数の文書データを
ＲＡＭ１４又はＨＤＤ３４に格納する。FIG. 12 shows the entire processing procedure as the automatic classification processing. Step F21 in FIG. 12 illustrates a document receiving process by the receiving unit 21 of the document processing apparatus 1. In this step F21, the receiving section 21 receives one or a plurality of documents transmitted via a communication line, for example. The receiving unit 21 sends the received document to the main body 10 of the document processing device. The control unit 11 stores the supplied document data or document data in the RAM 14 or the HDD 34.

【００９７】続いてステップＦ２２に進み、制御部１１
は、ステップＦ２１で取り込まれた文書についてインデ
ックスを作成する。Subsequently, the process proceeds to step F22, where the control unit 11
Creates an index for the document taken in step F21.

【００９８】ステップＦ２３では、制御部１１は、分類
モデルに基づいて、インデックスを付された各文書を、
分類モデルのいずれかのカテゴリに自動分類する。そし
て、制御部１１は、分類の結果をたとえばＲＡＭ１４に
記憶させる。自動分類の詳細については後述する。In step F23, the control unit 11 converts each indexed document based on the classification model into
Automatic classification into one of the categories of the classification model. Then, the control unit 11 causes the RAM 14 to store the result of the classification, for example. Details of the automatic classification will be described later.

【００９９】ステップＦ２４では、制御部１１は、ステ
ップＦ２３での新たな文書の自動分類の結果に基づい
て、分類モデルを更新する。そしてステップＦ２５で
は、制御部１１は、ステップＦ２４で更新された分類モ
デルを登録する。例えば分類モデルをＲＡＭ１４に記憶
させる。In step F24, the control unit 11 updates the classification model based on the result of the automatic classification of the new document in step F23. Then, in step F25, the control unit 11 registers the classification model updated in step F24. For example, the classification model is stored in the RAM 14.

【０１００】以上の図１２の処理により、文書処理状態
１に入力された文書データが、分類モデル上で分類され
るように自動分類処理が行われることになる。すなわち
この自動分類処理においては、受信した文書に対しては
インデックスが作成され、さらに自動分類が行われた
後、そのインデックスを構成している固有名詞、語義、
文書アドレス等が、上記図１１のように分類モデル上で
或るカテゴリーに対応づけられることになる（分類モデ
ルが更新される）。By the above-described processing of FIG. 12, automatic classification processing is performed so that the document data input to the document processing state 1 is classified on the classification model. That is, in this automatic classification process, an index is created for the received document, and after the automatic classification is performed, proper nouns, meanings,
The document address and the like are associated with a certain category on the classification model as shown in FIG. 11 (the classification model is updated).

【０１０１】ステップＦ２１、Ｆ２２の処理は、上述し
た手動分類処理におけるステップＦ１１，Ｆ１２と同様
である。即ちステップＦ２２のインデックス作成処理と
しては、図５〜図８で説明した処理が行われるものであ
り、ここでの繰り返しの説明は避ける。また、ステップ
Ｆ２４の分類モデルの更新は、ステップＦ２３の自動分
類の分類結果に応じてものとなる。以下、上述の手動分
類処理とは異なる処理として、ステップＦ２３の自動分
類について詳細に説明する。The processes in steps F21 and F22 are the same as steps F11 and F12 in the above-described manual classification process. That is, as the index creation processing in step F22, the processing described with reference to FIGS. 5 to 8 is performed, and a repeated description thereof will be omitted. The update of the classification model in step F24 depends on the classification result of the automatic classification in step F23. Hereinafter, automatic classification in step F23 will be described in detail as processing different from the above-described manual classification processing.

【０１０２】４−２自動分類図１２のステップＦ２３での自動分類の詳しい処理を図
１３に示す。図１３のステップＦ６１では、制御部１１
は、分類モデルのカテゴリＣｉに含まれる固有名詞の集
合と、ステップＦ２１で受信した文書から抽出されイン
デックスに入れられた語のうちの固有名詞の集合とにつ
いて、これらの共通集合の数をＰ（Ｃｉ）とする。そし
て制御部１１は、このようにして算出した数Ｐ（Ｃｉ）
をＲＡＭ１４に記憶させる。4-2 Automatic Classification FIG. 13 shows the detailed processing of the automatic classification in step F23 in FIG. In step F61 of FIG.
Defines the number of intersections of the set of proper nouns included in the category Ci of the classification model and the set of proper nouns among the words extracted from the document received in step F21 and put into the index by P ( Ci). Then, the control unit 11 calculates the number P (Ci) thus calculated.
Is stored in the RAM 14.

【０１０３】ステップＦ６２においては、制御部１１
は、その文書のインデックス中に含まれる全語義と、各
カテゴリＣｉに含まれる全語義との語義間関連度を、後
述する図１５に示す語義間関連度の表を参照して、語義
間関連度の総和Ｒ（Ｃｉ）を演算する。すなわち制御部
１１は、分類モデルにおける固有名詞以外の語につい
て、全語義間関連度の総和Ｒ（Ｃｉ）を演算する。そし
て制御部１１は、演算した語義間関連度の総和Ｒ（Ｃ
ｉ）をＲＡＭ１４に記憶させる。In step F62, the control unit 11
Refers to the degree of association between all meanings included in the index of the document and all the meanings included in each category Ci by referring to the table of association between meanings shown in FIG. The total sum R (Ci) of the degrees is calculated. That is, the control unit 11 calculates the total sum R (Ci) of the degrees of association between all the meanings of words other than the proper noun in the classification model. The control unit 11 then calculates the total sum R (C
i) is stored in the RAM 14.

【０１０４】ここで語義間関連度について説明してお
く。語義間関連度は、図１４の処理により文書処理装置
１が備える電子辞書に含まれる語義について予め算出
し、その結果を図１５のように保持しておけばよい。つ
まり、制御部１１が予め一度だけ図１４の処理を実行し
ておくようにすることで、図１３の自動分類処理の際に
用いることができる。Here, the degree of association between meanings will be described. The degree of association between meanings may be calculated in advance for the meanings included in the electronic dictionary included in the document processing apparatus 1 by the processing in FIG. 14 and the result may be stored as shown in FIG. That is, if the control unit 11 executes the process of FIG. 14 once in advance, it can be used in the automatic classification process of FIG.

【０１０５】制御部１１が予め実行しておく図１５の処
理は次のようになる。まずステップＦ７１において、制
御部１１は、電子辞書内の語の語義の説明を用いて、こ
の辞書を使って語義のネットワークを作成する。すなわ
ち、辞書における各語義の説明とこの説明中に現れる語
義との参照関係から、語義のネットワークを作成する。
ネットワークの内部構造は、上述したようなタグ付けに
より記述される。文書処理装置の制御部１１は、たとえ
ばＲＡＭ１４に記憶された電子辞書について、語義とそ
の説明を順に読み出して、ネットワークを作成する。制
御部１４は、このようにして作成した語義のネットワー
クをＲＡＭ１４に記憶させる。The processing of FIG. 15 which is executed in advance by the control unit 11 is as follows. First, in step F71, the control unit 11 uses the description of the meaning of a word in the electronic dictionary and creates a meaning network using this dictionary. That is, a meaning network is created from the reference relation between the explanation of each meaning in the dictionary and the meaning appearing in the explanation.
The internal structure of the network is described by tagging as described above. For example, the control unit 11 of the document processing apparatus sequentially reads meanings and descriptions of the electronic dictionary stored in the RAM 14 and creates a network. The control unit 14 causes the RAM 14 to store the meaning network created in this manner.

【０１０６】なお、上記ネットワークは、文書処理装置
の制御部１１が辞書を用いて作成する他に、受信部２１
にて外部から受信したリ、記録／再生部３１にて記録媒
体３２から再生したりすることにより得ることもでき
る。また上記電子辞書は、受信部２１にて外部から受信
したり、記録／再生部３１にて記録媒体３２から再生し
たりすることにより得ることができる。The above network is not only created by the control unit 11 of the document processing apparatus using a dictionary, but also by the receiving unit 21.
The recording / reproducing section 31 reproduces the data received from the outside from the recording medium 32. Further, the electronic dictionary can be obtained by receiving from the outside by the receiving unit 21 or by reproducing from the recording medium 32 by the recording / reproducing unit 31.

【０１０７】ステップＦ７２においては、ステップＦ７
１で作成された語義のネットワーク上で、各語義のエレ
メントに対応する中心活性値の拡散処理をおこなう。こ
の活性拡散により、各語義に対応する中心活性値は、上
記辞書により与えられたタグ付けによる内部構造に応じ
て与えられる。中心活性値の拡散処理は、図７で説明し
た処理となる。In Step F72, Step F7
The central activity value corresponding to each semantic element is spread on the semantic network created in step 1. By this activity diffusion, the central activity value corresponding to each meaning is given according to the internal structure by tagging given by the dictionary. The diffusion processing of the central activity value is the processing described with reference to FIG.

【０１０８】ステップＦ７３においては、ステップＦ７
１で作成された語義のネットワークを構成するある一つ
の語義Ｓｉを選択し、続くステップＦ７４においては、
この語義Ｓｉに対応する語彙エレメントＥｉの中心活性
値ｅｉの初期値を変化させ、このときの中心活性値の差
分△ｅｉを計算する。In Step F73, Step F7
One semantic Si constituting the semantic network created in 1 is selected, and in the subsequent step F74,
The initial value of the central activity value ei of the vocabulary element Ei corresponding to this meaning Si is changed, and the difference △ ei of the central activity value at this time is calculated.

【０１０９】さらにステップＦ７５においては、ステッ
プＦ７４におけるエレメントＥｉの中心活性値ｅｉの差
分△ｅｉに対応する、他の語義Ｓｊに対応するエレメン
トＥｊの中心活性値ｅｊの差分△ｅｊを求める。ステッ
プＦ７６においては、ステップＦ７５で求めた差分△ｅ
ｊを、ステップＦ７４で求めた△ｅｉで除した商△ｅｊ
／△ｅｉを、語義Ｓｉの語義ｓｊに対する語義間関連度
とする。Further, in step F75, a difference Δej of the central activation value ej of the element Ej corresponding to another meaning Sj corresponding to the difference Δei of the central activation value ei of the element Ei in step F74 is determined. In step F76, the difference Δe obtained in step F75
quotient ｊej obtained by dividing j by △ ei obtained in step F74
/ △ ei is the degree of association between meanings of the meaning Si with respect to the meaning sj.

【０１１０】ステップＦ７７においては、一の語義Ｓｉ
と他の語義Ｓｊとのすべての対について語義間関連度の
演算が終了したか否かについて判断する。すべての語義
の対について語義間関連度の演算が終了していないとき
には、ステップＦ７３にもどり、語義間関連度の演算が
終了していない対について語義間関連度の演算を継続す
る。このようなステップＦ７３からステップＦ７７のル
ープにおいて、制御部１１は、必要な値をたとえばＲＡ
Ｍ１４から順に読み出して、上述したように語義間関連
度を計算する。制御部１１は、計算した語義間関連度を
たとえばＲＡＭ１４に順に記憶させる。そして、すべて
の語義の対について語義間関連度の演算が終了したとき
には、ステップＦ７７から、この一連の処理を終了す
る。In step F77, one meaning Si
It is determined whether or not the calculation of the degree of association between meanings has been completed for all pairs of the word and another meaning Sj. When the calculation of the relevance between meanings is not completed for all pairs of meanings, the process returns to step F73, and the calculation of the relevance between meanings is continued for the pairs for which the calculation of the relevance is not completed. In the loop from step F73 to step F77, the control unit 11 sets a necessary value to, for example, RA
M14 is sequentially read out, and the degree of association between meanings is calculated as described above. The control unit 11 causes the RAM 14 to sequentially store the calculated meaning-to-sense associations, for example. Then, when the calculation of the degree of inter-word meaning is completed for all the word-sense pairs, this series of processing is ended from step F77.

【０１１１】このような語義間関連度の算出は、或る１
つの語義の中心活性値を変化させた時に、それにつられ
て中心活性値が変化する語義を、関連度が高いものとす
る処理といえる。つまりステップＦ７４で或る語義の中
心活性値を変化させると、それに応じて関連する（リン
クされた）語義の中心活性値が変化するものとなるた
め、その変化の度合いを調べれば、或る語義に対する他
の各語義の関連度がわかるものである。（或るエレメン
トＥｉの中心活性値は、上述した活性拡散の説明におい
て述べたように、リンク先のエレメントの中心活性値と
端点活性値が反映されて、そのエレメントＥｉ端点活性
値が更新されたうえで、そのエレメントＥｉの端点活性
値と現在の中心活性値の和から求められるため、リンク
先との関連度が大きいほど中心活性値の変化量は大きく
なる）このような処理を各語義から他の全ての語義に対
して行っていくことで、すべての語義の対（組み合わ
せ）について、関連度を算出することができる。The calculation of the degree of association between meanings is performed by a certain one.
It can be said that, when the central activity value of one meaning is changed, the meaning in which the central activity value changes along with the meaning has a high degree of relevance. That is, if the central activity value of a certain meaning is changed in step F74, the central activity value of the related (linked) meaning changes accordingly, and if the degree of the change is examined, a certain meaning is obtained. The degree of relevance of each other meaning with respect to is understood. (The center activity value of a certain element Ei reflects the center activity value and the endpoint activity value of the linked element as described in the above description of activity diffusion, and the element Ei endpoint activity value is updated. In addition, since it is obtained from the sum of the end point activity value of the element Ei and the current center activity value, the larger the degree of association with the link destination is, the larger the change amount of the center activity value is.) By performing the processing for all other meanings, it is possible to calculate the degree of association for all pairs (combinations) of meanings.

【０１１２】このように計算された語義間関連度は、図
１５に示すように、それぞれの語義と語義の間に定義さ
れる。この図１５の表においては、語義間関連度は０か
ら１までの値をとるように正規化されている。そしてこ
の表においては一例として“コンピュータ”、“テレ
ビ”、“ＶＴＲ”の間の相互の語義間関連度が示されて
いる。“コンピュータ”と“テレビ”の語義間関連度は
０．５５、“コンピュータ” と“ＶＴＲ”の語義間関
連度は０．２５、“テレビ”と“ＶＴＲ”の語義間関連
度は０．６０である。The calculated degree of association between meanings is defined between each meaning as shown in FIG. In the table of FIG. 15, the degree of association between meanings is normalized to take a value from 0 to 1. In this table, as an example, the degree of inter-semantic association between “computer”, “television”, and “VTR” is shown. The relevance between meanings of "computer" and "television" is 0.55, the relevance between meanings of "computer" and "VTR" is 0.25, and the relevance between meanings of "television" and "VTR" is 0.60. It is.

【０１１３】以上のように予め算出されていた語義間関
連度を用いて図１３のステップＦ６２の処理が行われた
ら、続いて制御部１１は、ステップＦ６３として、カテ
ゴリＣｉに対する文書の文書分類間関連度Ｒｅｌ（Ｃ
ｉ）をＲｅｌ（Ｃｉ）＝ｍ１Ｐ（Ｃｉ）＋ｎ１Ｒ（Ｃｉ）として算出する。ここで、係数ｍ１、ｎ１は定数で、そ
れぞれの値の文書分類間関連度への寄与の度合いを表す
ものである。制御部１１は、ステップＦ６１で算出した
共通集合の数Ｐ（Ｃｉ）およびステップＦ６２で算出し
た語義間関連度の総和Ｒ（Ｃｉ）を用いて、上記式の演
算を行い、文書分類間関連度Ｒｅｌ（Ｃｉ）を算出す
る。制御部１１は、このように算出した文書分類間関連
度Ｒｅｌ（Ｃｉ）をＲＡＭ１４に記憶させる。When the processing in step F62 in FIG. 13 is performed using the degree of association between the meanings calculated in advance as described above, the control unit 11 subsequently proceeds to step F63, where the inter-document classification for the category Ci is performed. Relevance Rel (C
i) is calculated as Rel (Ci) = m1P (Ci) + n1R (Ci). Here, the coefficients m1 and n1 are constants and represent the degree of contribution of the respective values to the inter-document-class relevance. The control unit 11 performs the calculation of the above expression using the number P (Ci) of common sets calculated in step F61 and the sum R (Ci) of the degree of association between meanings calculated in step F62, and calculates the degree of association between document classifications. Calculate Rel (Ci). The control unit 11 causes the RAM 14 to store the thus calculated inter-document-class relevance Rel (Ci).

【０１１４】なお、これらの係数ｍ１、ｎ１の値として
は、たとえばｍ１＝１０、ｎ１＝１とすることができ
る。また係数ｍ１、ｎ１の値は、統計的手法を使って推
定することもできる。すなわち、制御部１１は、複数の
係数ｍおよびｎの対について文書分類間関連度Ｒｅｌ
（Ｃｉ）が与えられることで、上記係数を最適化により
求めることができる。The values of these coefficients m1 and n1 can be, for example, m1 = 10 and n1 = 1. The values of the coefficients m1 and n1 can also be estimated using a statistical method. That is, the control unit 11 determines the degree of association Rel between document classifications for a plurality of pairs of coefficients m and n.
Given (Ci), the coefficient can be obtained by optimization.

【０１１５】ステップＦ６４においては、制御部１１
は、カテゴリＣｉに対する文書分類間関連度Ｒｅｌ（Ｃ
ｉ）が最大で、その文書分類間関連度Ｒｅｌ（Ｃｉ）の
値がある閾値を越えているとき、そのカテゴリＣｉに文
書を分類する。すなわち制御部１１は、複数のカテゴリ
に対してそれぞれ文書分類間関連度を作成し、最大の文
書分類間関連度が閣値を越えているときには、文書を最
大の文書分類間関連度を有する上記カテゴリＣｉに分類
する。これにより文書が自動的に所要のカテゴリに分類
されることになる。なお最大の文書分類間関連度が閾値
を越えていないときには、文書の分類はおこなわない。In Step F64, the control unit 11
Is the document class relevance Rel (C
When i) is the largest and the value of the degree of association Rel (Ci) between the document classifications exceeds a certain threshold, the documents are classified into the category Ci. That is, the control unit 11 creates the inter-document-class relevance for each of the plurality of categories, and when the maximum inter-document-class relevance exceeds the cabinet value, the control unit 11 assigns the document with the maximum inter-document-class relevance. Classify into category Ci. As a result, the document is automatically classified into a required category. If the maximum degree of association between document classifications does not exceed the threshold value, no document classification is performed.

【０１１６】以上のような図１３の処理として、図１２
のステップＦ２３の自動分類が行われたら、ステップＦ
２４、Ｆ２５で、それに応じて分類モデルを更新し、登
録することで、一連の自動分類が完了する。即ち文書処
理装置１に受信された文書データは、自動的に分類され
たことになり、ユーザーは例えば図９の分類ウインドウ
２０１において、所要の文書分類エリアにおいて、受信
された文書データを確認できることになる。The processing shown in FIG.
If the automatic classification in step F23 is performed,
The series of automatic classification is completed by updating and registering the classification model accordingly at 24 and F25. That is, the document data received by the document processing apparatus 1 is automatically classified, and the user can confirm the received document data in a required document classification area, for example, in the classification window 201 of FIG. Become.

【０１１７】５．要約作成処理続いて、文書データについての要約文を作成する処理に
ついて述べる。上述したようにユーザーは、文書を選択
して図１０のような閲覧ウインドウ３０１を開くことに
より、文書の本文を閲覧することができる。例えば上述
した手動分類処理におけるステップＦ１３の時点や、そ
の他任意の時点において、図９で説明した分類ウインド
ウ２０１から、閲覧ウインドウ３０１を開くことができ
る。5. Summary Creation Processing Next, processing for creating a summary sentence for document data will be described. As described above, the user can browse the text of the document by selecting the document and opening the browse window 301 as shown in FIG. For example, the viewing window 301 can be opened from the classification window 201 described with reference to FIG.

【０１１８】例えば分類ウインドウ２０１において或る
文書を選択した状態でブラウザボタン２０２ｂをクリッ
クすることで、図１６のように、文書表示部３０３に選
択された文書の本文が表示された閲覧ウインドウ３０１
が開かれる。なお文書表示部３０３に文書全文が表示で
きないときには、その文書の一部が表示される。また要
約文が作成されていない時点では、図１６のように要約
表示部３０４は空白とされる。For example, when a browser button 202b is clicked while a certain document is selected in the classification window 201, a browsing window 301 in which the text of the selected document is displayed in the document display unit 303 as shown in FIG.
Is opened. When the entire text of the document cannot be displayed on the document display unit 303, a part of the document is displayed. At the time when the summary sentence has not been created, the summary display section 304 is blank as shown in FIG.

【０１１９】この閲覧ウインドウ３０１において要約作
成ボタン３０６ａがクリックされると、文書表示部３０
３に表示されている文書についての要約文が作成され、
図１７に示すように要約表示部３０４に表示される。つ
まり制御部１１は、ユーザーの要約作成操作に応じて、
以下説明するような要約文作成処理を行い、作成後、そ
れを表示する制御を行うものとなる。文書から要約を作
成する処理は、文書のタグ付けによる内部構造に基づい
て実行される。なお要約文は、要約表示部３０４のサイ
ズに応じて生成される。そして本文表示部３０３と要約
表示部３０４の面積は、ユーザーが仕切枠３１２を移動
させることで変化させることができる。つまり要約文
は、要約作成が指示された時点での要約表示部３０４の
サイズに応じたサイズ（文書長）で作成されることにな
る。When the user clicks the summary creation button 306a in the browsing window 301, the document display unit 30
A summary sentence for the document displayed in 3 is created,
It is displayed on the summary display section 304 as shown in FIG. That is, the control unit 11 responds to the user's summary creation operation by
A summary sentence creation process as described below is performed, and after creation, control for displaying the summary sentence is performed. The process of creating an abstract from a document is performed based on the internal structure of the document by tagging. The summary sentence is generated according to the size of the summary display unit 304. The area of the text display unit 303 and the summary display unit 304 can be changed by moving the partition frame 312 by the user. In other words, the summary sentence is created in a size (document length) according to the size of the summary display unit 304 at the time when the summary creation is instructed.

【０１２０】要約作成ボタン３０６ａがクリックされる
とにより開始される、制御部１１の要約作成処理を図１
８に示す。FIG. 1 shows a summary creation process of the control unit 11, which is started when the summary creation button 306a is clicked.
FIG.

【０１２１】図１８のステップＦ８１では、制御部１１
は活性拡散を行う。本例においては、活性拡散により得
られた中心活性値を重要度として採用することにより、
文書の要約を行うものである。すなわち、タグ付けによ
る内部構造を与えられた文書においては、活性拡散を行
うことにより、各エレメントにタグ付けによる内部構造
に応じた中心活性値を付与することができる。ステップ
Ｆ８１で行う活性拡散処理は、図６〜図８で説明したも
のと同様の処理となるが、上述したように活性拡散は、
中心活性値の高いエレメントと関わりのあるエレメント
にも高い中心活性値を与えるような処理である。すなわ
ち、活性拡散は、照応（共参照）表現とその先行詞の間
で中心活性値が等しくなり、それ以外では中心活性値が
減衰するような中心活性値についての演算である。この
中心活性値は、タグ付けによる内部構造に応じて決定さ
れるので、タグ付けによる内部構造を考慮した文書の分
析に利用することができる。At step F81 of FIG.
Performs active diffusion. In this example, by adopting the central activity value obtained by the active diffusion as the importance,
Summarizes the document. That is, in a document given an internal structure by tagging, by performing active diffusion, a central activation value corresponding to the internal structure by tagging can be given to each element. The active diffusion process performed in step F81 is the same process as that described with reference to FIGS. 6 to 8, but the active diffusion is performed as described above.
This is a process of giving a high central activity value to an element related to an element having a high central activity value. That is, the active diffusion is an operation on the central activity value such that the central activity value is equal between the anaphor (co-reference) expression and its antecedent, and the central activity value attenuates otherwise. Since the central activity value is determined according to the internal structure by tagging, it can be used for analyzing a document in consideration of the internal structure by tagging.

【０１２２】次にステップＦ８２では、制御部１１は、
表示部３０に表示されている閲覧ウィンドウ３０１の要
約表示部３０４のサイズ、具体的にはこの要約表示部３
０４に表示可能な最大文字数をｗｓと設定する。また制
御部１１は、要約の文字列（要約文を保持する内部レジ
スタ）ｓを初期化して初期値ｓ（０）＝””と設定す
る。制御部１１は、このように設定した、最大文字数ｗ
ｓおよび文字列ｓの初期値ｓ（０）を、ＲＡＭ１４に記
録する。Next, at step F82, the control unit 11
The size of the summary display section 304 of the browsing window 301 displayed on the display section 30, specifically, the summary display section 3
The maximum number of characters that can be displayed in 04 is set as ws. Further, the control unit 11 initializes a summary character string (an internal register holding a summary sentence) and sets an initial value s (0) = “”. The control unit 11 sets the maximum number of characters w
The s and the initial value s (0) of the character string s are recorded in the RAM 14.

【０１２３】ステップＦ８３では、制御部１１は、要約
の骨格の順次の作成をカウントするカウンタのカウント
値ｉを「１」に設定する。そしてステップＦ８４で制御
部１１は、カウンタのカウント値ｉにに基づいて、文章
からｉ番目に平均中心活性値の高い文の骨格を抽出す
る。平均中心活性値とは、一つの文を構成する各エレメ
ントの中心活性値を平均したものである。制御部１１
は、たとえばＲＡＭ１４に記録した文字列ｓ（ｉ−１）
を読み出し、この文字列ｓ（ｉ−１）に対して、抽出し
た文の骨格の文字列を加えて、Ｓ（ｉ）とする。そして
制御部１１は、このようにして得た文字列ｓ（ｉ）をＲ
ＡＭ１４に記録する。初回は、文字列ｓ（ｉ−１）は初
期値ｓ（０）であるので、今回抽出した文の骨格が文字
列Ｓ（ｉ）としてＲＡＭ１４に記憶されることになる。
また以降においてステップＦ８４の処理が行われる場合
は、抽出された文の骨格が文字列Ｓ（ｉ）に、それまで
の文字列Ｓ（ｉ）（つまりその時点では文字列Ｓ（ｉ−
１））に追加されていくものとなる。また同時に、制御
部１１はこのステップＦ８４において、上記文の骨格に
含まれないエレメントの中心活性値順のリストＬ（ｉ）
を作成し、このリストＬ（ｉ）をＲＡＭ１４に記録す
る。In step F83, the control section 11 sets the count value i of a counter for counting the sequential creation of the skeleton of the digest to "1". Then, in step F84, the control unit 11 extracts a skeleton of a sentence having the ith highest average central activity value from the sentence based on the count value i of the counter. The average central activity value is the average of the central activity values of the elements constituting one sentence. Control unit 11
Is, for example, a character string s (i-1) recorded in the RAM 14.
Is read out, and the character string of the skeleton of the extracted sentence is added to this character string s (i-1) to obtain S (i). Then, the control unit 11 converts the character string s (i) thus obtained into R
Record in AM14. In the first time, since the character string s (i-1) has the initial value s (0), the skeleton of the sentence extracted this time is stored in the RAM 14 as the character string S (i).
When the process of step F84 is performed thereafter, the skeleton of the extracted sentence is changed to the character string S (i), and the character string S (i) up to that point (that is, the character string S (i-
1)). At the same time, the control unit 11 determines in this step F84 a list L (i) of the elements not included in the skeleton of the sentence in the order of the central activation value.
And the list L (i) is recorded in the RAM 14.

【０１２４】すなわち、このステップＦ８４において
は、要約のアルゴリズムは、活性拡散の結果を用いて、
平均中心活性値の大きい順に文を選択し、選択された文
の骨格の抽出する。文の骨格は、文から抽出した必須要
素により構成される。必須要素になりうるのは、エレメ
ントの主辞（head）と、主語（subject）、目的語（obj
ect）、間接目的語（indirect object）、所有者（poss
essor）、原因（cause）、条件（condition）または比
較（comparison）の関係属性を有する要素と、等位構造
が必須要素のときにはそれに直接含まれるエレメントと
が必須要素を構成するものである。そして、文の必須要
素をつなげて文の骨格を生成し、要約に加える。That is, in step F84, the summarization algorithm uses the result of activity diffusion to
The sentences are selected in descending order of the average central activity value, and the skeleton of the selected sentence is extracted. The skeleton of a sentence is composed of essential elements extracted from the sentence. Required elements can be the head of the element, the subject, the object (obj
ect), indirect object, owner (poss
An element having a relationship attribute of essor, cause, condition, or comparison, and an element directly included when the coordination structure is an essential element, constitute an essential element. Then, by connecting the essential elements of the sentence, a skeleton of the sentence is generated and added to the summary.

【０１２５】、ステップＦ８５では制御部１１は、文字
列ｓ（ｉ）の長さが、閲覧ウィンドウ３０１の要約表示
部１０４の最大文字数ｗｓより大きいか否かを判断す
る。このステップＦ８５は、要約表示部３０４のサイズ
に応じた要約文を作成するための判断処理となる。In step F85, the control section 11 determines whether or not the length of the character string s (i) is larger than the maximum number of characters ws of the summary display section 104 of the browsing window 301. This step F85 is a determination process for creating a summary sentence according to the size of the summary display unit 304.

【０１２６】制御部１１は、文字列ｓ（ｉ）の長さが最
大文字数ｗｓに達していないときは、処理をステップＦ
８６に進める。ステップＦ８６では制御部１１は、文書
中で、（ｉ＋１）番目に平均中心活性値が高い文のエレ
メントの中心活性値と、上記ステップＦ８４で作成した
リストＬ（ｉ）の最も中心活性値が高いエレメントの中
心活性値を比較する。つまり、上記ステップＦ８４にお
いて要約として採用された文の次に平均中心活性値が高
い文（即ち次に要約文に付加する候補となる文）と、ス
テップＦ８４において要約として採用された文の中で骨
格ではないとして要約からは排除されたエレメントの中
心活性値を比較する。If the length of the character string s (i) has not reached the maximum number of characters ws, the control unit 11 executes the processing in step F
Proceed to 86. In step F86, the control unit 11 determines in the document the central activity value of the element of the sentence having the (i + 1) -th highest average central activity value and the highest central activity value of the list L (i) created in step F84. Compare the central activity values of the elements. That is, a sentence having the second highest average central activity value after the sentence adopted as the summary in step F84 (that is, a sentence which is a candidate to be added to the next summary sentence) and a sentence adopted as the summary in step F84 The central activity values of elements excluded from the summary as not scaffolds are compared.

【０１２７】このステップＦ８６の処理は、要約文とし
ての文字列に次に加える部位を、その直前のステップＦ
８４で採用した文において骨格として採用されなかった
ものから選ぶか、或いは他の文から選ぶかを判断する処
理となる。In the process of step F86, the part to be added next to the character string as the summary sentence is set in the immediately preceding step F86.
In the sentence adopted in S84, the process is to determine whether to select from those not adopted as a skeleton or to select from other sentences.

【０１２８】（ｉ＋１）番目に平均中心活性値が高い文
におけるエレメントの中心活性値よりも、リストＬ
（ｉ）における最も高い中心活性値の方が、中心活性値
が高い値であった場合は、要約文としての文字列に次に
加える部位を、その直前のステップＦ８４で採用した文
において骨格として採用されなかったものから選ぶよう
にする。このため制御部１１の処理はステップＦ８８に
進み、リストＬ（ｉ）における最も中心活性値が高いエ
レメントを、その時点で記憶されている文字列Ｓ（ｉ）
に加え、文字列ＳＳ（ｉ）とする。またこのとき、文字
列ＳＳ（ｉ）に加えたエレメントをリストＬ（ｉ）から
削除する。そして、ステップＦ８９において、文字列Ｓ
Ｓ（ｉ）が、最大文字数ｗｓより大きいか否かを判断
し、大きくなければステップＦ８６に戻る。The list L is higher than the central activity value of the element in the (i + 1) -th sentence having the highest average central activity value.
In the case where the highest central activity value in (i) is a higher central activity value, the portion to be added next to the character string as the summary sentence is set as the skeleton in the sentence adopted in the immediately preceding step F84. Try to choose from those that have not been adopted. For this reason, the process of the control unit 11 proceeds to step F88, in which the element having the highest central activity value in the list L (i) is stored in the character string S (i) stored at that time.
And a character string SS (i). At this time, the element added to the character string SS (i) is deleted from the list L (i). Then, in a step F89, the character string S
It is determined whether S (i) is greater than the maximum number of characters ws. If not, the process returns to step F86.

【０１２９】ステップＦ８６において、（ｉ＋１）番目
に平均中心活性値が高い文のエレメントとして、リスト
Ｌ（ｉ）における最も高い中心活性値よりも中心活性値
が高いエレメントがあった場合は、要約文としての文字
列に次に加える部位を、その直前のステップＦ８４で採
用した文とは別の文から選ぶこととしてステップＦ８７
でカウント値ｉをインクリメントしてステップＦ８４に
戻ることになる。つまりステップＦ８６で、（ｉ＋１）
番目に平均中心活性値が高い文とされた文について、ス
テップＦ８４で骨格を抽出し、それを文字列Ｓ（ｉ）に
加えるようにする。In step F86, if the (i + 1) -th element having the highest central activity value is higher than the highest central activity value in the list L (i), the summary sentence The part to be added next to the character string as is selected from a sentence different from the sentence adopted in step F84 immediately before that step F87.
, The count value i is incremented, and the process returns to step F84. That is, in step F86, (i + 1)
For the sentence having the second highest average central activity value, the skeleton is extracted in step F84 and added to the character string S (i).

【０１３０】以上のように、ステップＦ８４又はステッ
プＦ８８で文の骨格となるエレメントやその他のエレメ
ントとして、中心活性値の高いものを基準として文字列
に加えていきながら、ステップＦ８５又はステップＦ８
９で、文字列Ｓ（ｉ）又はＳＳ（ｉ）を最大文字数ｗｓ
と比較していくことで、最大文字数ｗｓに近いが最大文
字数ｗｓを越えない文字列を作成していくことになる。As described above, in step F84 or step F88, while adding elements to the character string based on the element having the higher central activation value as the element or other element serving as the skeleton of the sentence in step F84 or step F88,
9, the character string S (i) or SS (i) is converted to the maximum number of characters ws
By comparing with, a character string close to the maximum number of characters ws but not exceeding the maximum number of characters ws is created.

【０１３１】例えばステップＦ８５で文字列Ｓ（ｉ）が
最大文字数ｗｓを越えた場合は、制御部１１の処理はス
テップＦ９０に進み、直前のステップＦ８４で骨格を加
える前の文字列Ｓ（ｉ−１）を、要約文とする。つま
り、これはステップＦ８４で文の骨格を加えたことによ
り、最大文字数ｗｓを越えてしまったことになるため、
その骨格を加える前の文字列Ｓ（ｉ−１）が、最大文字
数ｗｓに近いが最大文字数ｗｓを越えない文字列である
と判断して、それを要約文とするものである。For example, if the character string S (i) exceeds the maximum number of characters ws in step F85, the processing of the control unit 11 proceeds to step F90, and the character string S (i- 1) is a summary sentence. In other words, this means that the maximum number of characters ws has been exceeded by adding the skeleton of the sentence in step F84.
The character string S (i-1) before adding the skeleton is determined to be a character string that is close to the maximum number of characters ws but does not exceed the maximum number of characters ws, and is used as a summary sentence.

【０１３２】なお、このため初めてステップＦで文字列
Ｓ（ｉ）を生成した時点（ｉ＝１の時点）で、ステップ
Ｆ８５で、文字列Ｓ（ｉ）が最大文字数ｗｓを越えた場
合は、文字列Ｓ（ｉ−１）は、ステップＦ８２で設定し
た初期値としての文字列Ｓ（０）となるため、実質的に
要約文は作成できなかったことになる。これは、要約表
示部３０４のサイズが小さすぎたことに起因するため、
ユーザーは画面上で要約表示部３０４の面積を広げた上
で、再度、要約作成ボタン３０６ａをクリックして、図
１８の処理が開始されるようにすればよい。If the character string S (i) exceeds the maximum number of characters ws at step F85 when the character string S (i) is first generated at step F (i = 1) at step F85, Since the character string S (i-1) becomes the character string S (0) as the initial value set in step F82, a summary sentence could not be created substantially. This is because the size of the summary display unit 304 is too small.
The user may enlarge the area of the summary display unit 304 on the screen, and then click the summary creation button 306a again to start the processing in FIG.

【０１３３】ステップＦ８５で文字列Ｓ（ｉ）が最大文
字数ｗｓを越えていない場合は、上述のように制御部１
１の処理はステップＦ８６に進み、次に文字列に加える
部分を判断することになる。そして上記のようにステッ
プＦ８８に進んだ場合は、文字列ＳＳｉが最大文字数ｗ
ｓを越えたか否かを判別する。ここで文字列ＳＳｉが最
大文字数ｗｓを越えた場合は、制御部１１の処理はステ
ップＦ９１に進み、直前のステップＦ８８で或るエレメ
ントを加える前の文字列Ｓ（ｉ）を、要約文とすること
になる。つまり、これはステップＦ８８でエレメントを
加えたことにより、最大文字数ｗｓを越えてしまったこ
とになるため、そのエレメントを加える前の文字列Ｓ
（ｉ）が、最大文字数ｗｓに近いが最大文字数ｗｓを越
えない文字列であると判断して、それを要約文とするも
のである。If it is determined in step F85 that the character string S (i) does not exceed the maximum number of characters ws, as described above, the control unit 1
The process of step 1 proceeds to step F86, where a portion to be added to the next character string is determined. When the process proceeds to step F88 as described above, the character string SSi is
s is determined. Here, if the character string SSi exceeds the maximum number of characters ws, the process of the control unit 11 proceeds to step F91, and the character string S (i) before adding a certain element in the immediately preceding step F88 is used as a summary sentence. Will be. In other words, this means that the maximum number of characters ws has been exceeded by adding an element in step F88.
It is determined that (i) is a character string that is close to the maximum number of characters ws but does not exceed the maximum number of characters ws, and is used as a summary sentence.

【０１３４】以上のような処理により、その時点の要約
表示部３０４のサイズに適合した要約文が作成されるこ
とになる。そしてその要約文の内容は、平均中心活性値
の高い１又は複数の文の骨格、及び骨格以外の中心活性
値の高いエレメントが用いられたものとなる。そしてこ
の様に作成された要約文は、ＲＡＭ１４に記憶されると
ともに、図１７のように要約表示部３０４に表示され
る。By the above processing, a summary sentence suitable for the size of the summary display unit 304 at that time is created. The contents of the summary sentence include the skeleton of one or more sentences having a high average central activity value and elements having a high central activity value other than the skeleton. The summary sentence thus created is stored in the RAM 14 and displayed on the summary display unit 304 as shown in FIG.

【０１３５】なお、表示された要約文を見てユーザーが
より詳しい要約文を見たいと思った場合、或いはより短
い要約文を見たいと思った場合は、閲覧ウインドウ３０
１の要約表示部３０４のサイズ（面積）を増減した上
で、再度要約作成ボタン３０６ａをクリックすればよ
い。すると、上述した図１８の処理により、その時点の
要約表示部３０４のサイズに応じた文書長の要約書が作
成され、表示されることになる。When the user looks at the displayed summary and wants to see a more detailed summary, or wants to see a shorter summary, the viewing window 30 is displayed.
After increasing or decreasing the size (area) of the first summary display unit 304, the user may click the summary creation button 306a again. Then, a summary having a document length corresponding to the size of the summary display unit 304 at that time is created and displayed by the processing of FIG. 18 described above.

【０１３６】６．読み上げ処理文書処理装置１は、以上のようにタグ付けされた文書を
受信すると、その本文や要約文を表示してユーザーに提
示できるが、さらに受信した文書を音声でユーザーに提
示することもできる。即ちＣＰＵ１３により、ＲＯＭ１
５やＨＤＤ３４に記録されている電子文書処理プログラ
ムのうちの音声読み上げプログラムを起動することで、
図１９に示すような一連の工程を経ることによって、文
書の読み上げを行うことができる。まずここでは、簡略
化した各工程の説明を行い、その後、具体的な文書例を
用いて、各工程の説明を詳細に行う。6. Read-aloud processing When the document processing apparatus 1 receives a document tagged as described above, the document processing apparatus 1 can display the body and the summary sentence to the user, and can also present the received document to the user by voice. . That is, the ROM 13
5 or an electronic document processing program recorded in the HDD 34 to activate a speech reading program,
The document can be read aloud through a series of steps as shown in FIG. First, here, a simplified description of each step is given, and thereafter, each step is explained in detail using a specific document example.

【０１３７】制御部１１の処理として、図１９のステッ
プＦ１０１は、図４のステップＦ１１（又は図１２のス
テップＦ２１）と同様の文書受信／記憶処理である。上
述ののように、受信した文書（タグ付文書）については
手動又は自動での分類処理が行われるが、それと同様
に、受信した文書について読み上げ処理も実行可能とい
う意味で図１９にステップＦ１０１を記したものであ
る。ここでは特に処理手順として、分類処理と読み上げ
処理の順序その他を規定するものではない。As the processing of the control section 11, step F101 in FIG. 19 is the same document receiving / storing processing as step F11 in FIG. 4 (or step F21 in FIG. 12). As described above, the received document (tagged document) is subjected to manual or automatic classification processing. Similarly, in step S101, FIG. It is written. Here, the order of the classification process and the reading process and the like are not particularly defined as the processing procedure.

【０１３８】なお、文書読み上げ処理の対象となる文書
（受信文書）には、後述するように、音声合成を行うた
めに必要なタグが付与されていることが必要である。但
し、文書処理装置１は、タグ付けされた文章を受信した
うえで、その文書に音声合成を行うために必要なタグを
新たに付与して文書を作成することもできる。以下で
は、このようにして受信又は作成されて用意されたタグ
付き文書を、タグファイルと記す。It should be noted that a document (reception document) to be subjected to the text-to-speech process needs to be provided with a tag necessary for performing speech synthesis, as described later. However, after receiving the tagged text, the document processing apparatus 1 can also create a document by newly adding a tag necessary for performing speech synthesis to the document. Hereinafter, the tagged document prepared by receiving or creating in this manner is referred to as a tag file.

【０１３９】文書処理装置１の文書読み上げ処理として
は続いてステップＦ１０２において、ＣＰＵ１３の制御
のもとに、タグファイルに基づいて読み上げ用ファイル
を生成する。この読み上げ用ファイルは、後述するよう
に、タグファイル中のタグから、読み上げのための属性
情報を導出し、この属性情報を埋め込むことにより生成
される。In the document reading process of the document processing apparatus 1, a reading file is generated based on the tag file under the control of the CPU 13 in step F102. As will be described later, the reading file is generated by deriving attribute information for reading from a tag in the tag file and embedding the attribute information.

【０１４０】続いてステップＦ１０３において文書処理
装置１は、ＣＰＵ１３の制御のもとに、読み上げ用ファ
イルを用いて、音声合成エンジンに適した処理を行う。
なお、この音声合成エンジンは、ハードウェアで構成し
てもよいし、ソフトウェアで実現するようにしてもよ
い。音声合成エンジンをソフトウェアで実現する場合に
は、そのアプリケーションプログラムは、ＲＯＭ１５や
ＨＤＤ３４等に予め記憶されている。Subsequently, in step F103, under the control of the CPU 13, the document processing apparatus 1 performs a process suitable for the speech synthesis engine using the reading file.
The speech synthesis engine may be configured by hardware or may be realized by software. When the speech synthesis engine is realized by software, the application program is stored in the ROM 15 or the HDD 34 in advance.

【０１４１】続いて文書処理装置１はステップＦ１０４
において、ユーザが後述するユーザインターフェースを
用いて行う操作に応じて処理を行う。文書処理装置１
は、このような処理を行うことによって、与えられた文
書を読み上げることができる。これらの各工程につい
て、以下詳細に説明する。Subsequently, the document processing apparatus 1 proceeds to step F104.
In, processing is performed according to an operation performed by a user using a user interface described later. Document processing device 1
Can read out a given document by performing such processing. Each of these steps will be described in detail below.

【０１４２】まず、ステップＦ１０１におけるタグ付け
された文書の受信又は作成について説明する。文書処理
装置１は、例えば通信部２１から文書（音声合成を行う
ために必要なタグが既に付与されている文書）を受信す
る。または、文書処理装置１は、タグ付けされた文書を
受信し、その文書に音声合成を行うために必要なタグを
新たに付与して文書を作成する。First, reception or creation of a tagged document in step F101 will be described. The document processing apparatus 1 receives, for example, a document (a document to which a tag required for performing speech synthesis has already been added) from the communication unit 21. Alternatively, the document processing apparatus 1 receives a tagged document, and newly adds a tag necessary for performing speech synthesis to the document to create the document.

【０１４３】説明上の例として、以下のような、日本
語、及び英語の文書にタグ付けがなされたタグファイル
が、受信又は作成されものとする。As an illustrative example, it is assumed that the following tag file in which Japanese and English documents are tagged is received or created.

【０１４４】まずタグファイルの元となる日本語文書
は、次のような文書とする。「［素敵にエイジング］／８ガン転移、抑えられる！？がんはこの十数年、わが国の死因第一位を占めている。
そめ死亡率は年齢が進むとともに増加傾向にある。高齢
者の健康を考えるとき、がんの問題を避けて通れない。
がんを特徴づけるのは、細胞増殖と転移である。人間の
細胞には、自動車でいえばアクセルに当たり、がんをど
んどん増殖する「がん遺伝子」と、ブレーキ役の「がん
抑制遺伝子」がある。双方のバランスが取れていれば問
題はない。正常な調節機能が失われ、細胞内でブレーキ
が利かない変異が起こると、がんの増殖が始まる。高齢
者の場合、長い年月の間にこの変異が蓄積し、がん化の
条件を備えた細胞の割合が増え、がん多発につながるわ
けだ。ところで、もう一つの特徴、転移という性質がな
ければ、がんはそれほど恐れる必要はない。切除するだ
けで、完治が可能になるからである。転移を抑制するこ
との重要性がここにある。この転移、がん細胞が増える
だけでは発生しない。がん細胞が細胞と細胞の間にある
蛋白（たんぱく）質などを溶かし、自分の進む道をつく
って、血管やリンパ管に入り込む。循環しながら新たな
“住み家”を探して潜り込む、といった複雑な動きをす
ることが、近年解明されつつある。」First, a Japanese document serving as a source of a tag file is as follows. "[Nice aging] / 8 cancer metastases can be suppressed! Cancer has been the leading cause of death in Japan for more than a decade.
Reed mortality is increasing with age. When thinking about the health of the elderly, you cannot avoid cancer.
Characterizing cancer is cell proliferation and metastasis. There are two types of human cells: "cancer genes", which act as accelerators in automobiles and rapidly grow cancer, and "cancer suppressing genes" that act as brakes. There is no problem if both are balanced. Loss of normal regulatory function and nonbreaking mutations in the cell initiate cancer growth. In the elderly, the mutation accumulates over the years, increasing the proportion of cells with cancerous conditions, leading to more cancers. By the way, without the other characteristic, metastasis, cancer doesn't have to be so afraid. This is because a complete cure is possible just by resection. Here is the importance of suppressing metastasis. This metastasis does not occur simply by increasing the number of cancer cells. Cancer cells dissolve proteins (proteins) between cells, create their own way, and enter blood vessels and lymph vessels. In recent years, it has been elucidated that complicated movements such as searching for a new “dwelling house” while circulating are performed. "

【０１４５】タグファイルの元となる英語の文書の例は
次のようなものとする「During its centennial year, The Wall Street Jour
nal will report events of the past century that st
and as milestones of American business history. TH
REE COMPUTERS THAT CHANGED the face of personal co
mputing were Iaunched in 1977. That year the Apple
II, Commodore Pet and Tendy TRS came to market. T
he computers were crude by today's standerds. Appl
e ll owners, for example,had to use their televisi
on sets as screens and storeddata on audiocassette
s.」The following is an example of an English document serving as a tag file: "During its centennial year, The Wall Street Jour"
nal will report events of the past century that st
and as milestones of American business history.TH
REE COMPUTERS THAT CHANGED the face of personal co
mputing were Iaunched in 1977. That year the Apple
II, Commodore Pet and Tendy TRS came to market.T
he computers were crude by today's standerds.Appl
e ll owners, for example, had to use their televisi
on sets as screens and storeddata on audiocassette
s. "

【０１４６】文書処理装置１は、このような日本語又は
英語の文書についてタグが付された文書を受信すると、
分類処理や、図１６、図１７等で説明したようにその本
文を表示したり、要約文を作成して表示することができ
る。When the document processing apparatus 1 receives a document with a tag attached to such a Japanese or English document,
As described with reference to FIG. 16 and FIG. 17 and the like, it is possible to display the main text or create and display a summary sentence.

【０１４７】ここで上記の日本語又は英語の文書は、そ
れそれ、図２１又は図２２に示すようなタグファイルと
して構成されている。日本語文書のタグファイルとして
は、図２１（ａ）に見出しの部分である「〔素敵にエイ
ジング〕／８ガン転移、抑えられる！？」を抜粋したも
のを示し、図２１（ｂ）に、文書中の最後の段落である
「この転移、がん細胞が・・・・近年解明されつつあ
る。」を抜粋したものを示している。残りの段落につい
ては省略してある。なお、実際のタグファイルは、見出
し部分から最後の段落までが１つのファイルとして構成
されている。Here, the above-mentioned Japanese or English documents are each constituted as a tag file as shown in FIG. 21 or FIG. As a tag file of a Japanese document, FIG. 21 (a) shows an excerpt of the heading “[nicely aging] / 8 cancer metastasis, suppressed !?”, and FIG. 21 (b) It shows an excerpt from the last paragraph in the document, "This metastasis, cancer cells are being elucidated in recent years." The remaining paragraphs have been omitted. It should be noted that the actual tag file is composed of one file from the heading to the last paragraph.

【０１４８】図２１（ａ）に示す見出し部分において、
＜見出し＞というタグは、この部分が見出しであること
を示している。この図２１（ａ）（ｂ）に示すタグファ
イルは基本的には、図２を用いて文書データ構造を説明
した際に用いたタグファイル例と同様にタグが付されて
いるものであり、上述した各タグについての細かい説明
は省略するが、所要各所に、音声合成を行うために必要
なタグが付与されているものである。In the heading shown in FIG.
The tag <heading> indicates that this part is a heading. The tag files shown in FIGS. 21A and 21B are basically attached with tags in the same manner as the tag file example used when the document data structure was described with reference to FIG. Although a detailed description of each of the above-described tags is omitted, tags required for performing speech synthesis are added to required locations.

【０１４９】音声合成を行うために必要なタグとして
は、例えばまず、図中「例１」の部分に示すように、
「蛋白（たんぱく）」のように元の文書に読み仮名を示
す情報が与えられているときに付与されるものがある。
すなわち、この場合では、「たんぱくたんぱく」と重複
して読み上げてしまうことを防ぐために、発音＝“nul
l”という読み属性情報が記述されており、「（たんぱ
く）」の部分読み上げを禁止するタグが付与されてい
る。また、音声合成を行うために必要なタグとしては、
図中「例２」「例３」に示すように、「リンパ管」のよ
うな専門用語や「住み家」のように、誤った読み上げを
行う可能性のある難訓部分に付与されるものがある。す
なわち、この場合では、「りんぱくだ」や「すみいえ」
と読み上げてしまうことを防ぐために、それぞれ、発音
＝“りんぱかん”、発音＝“すみか”という読み仮名を
示す読み属性情報が記述されている。For example, as a tag necessary for performing speech synthesis, first, as shown in the portion of “Example 1” in the figure,
Some are given when information indicating a reading pseudonym is given to the original document, such as “protein”.
In other words, in this case, the pronunciation = “nul
Reading attribute information “l” is described, and a tag for prohibiting partial reading of “(tanpaku)” is added. Also, the tags required to perform speech synthesis include:
As shown in the "Example 2" and "Example 3" in the figure, technical terms such as "lymphatic vessels" and difficult-to-read parts such as "dwelling house" that may be read aloud incorrectly There is. In other words, in this case, "Rinpakuda" or "Sumie"
In order to prevent the pronunciation from being read out, pronunciation attribute information indicating the pronunciation pseudonym of pronunciation = “Rinkan” and pronunciation = “Sumika” is described.

【０１５０】一方、図２２に示すタグファイルにおける
音声合成を行うために必要なタグとしては、図中「例
４」として示す部分のように、「ＩＩ」というローマ数
字に対して、発音＝“two”という読み属性情報が記述
されている。これは、「ＩＩ」を「トゥ（two）」と読
み上げさせたい場合に、「セカンド（second）」と読み
上げてしまうことを防ぐために記述されているものであ
る。On the other hand, as a tag necessary for performing speech synthesis in the tag file shown in FIG. 22, as shown in a portion shown as “Example 4” in the figure, for the Roman numeral “II”, the pronunciation = “ Two ”reading attribute information is described. This is described in order to prevent "II" from being read out as "second" when it is desired to be read out as "two".

【０１５１】また、例えば文書内に引用文が含まれてい
る場合、このようなタグファイルには、図示しないが、
その文が引用文であることを示すタグが付与される。さ
らに、タグファイルには、例えば文書内に疑問文がある
場合、図示しないが、その文が疑問文であることを示す
タグが付与される。If a citation is included in a document, for example, such a tag file includes
A tag indicating that the sentence is a quote is attached. Further, when there is a question sentence in the document, for example, a tag indicating that the sentence is a question sentence is added to the tag file, though not shown.

【０１５２】文書処理装置１は、先に図１９に示したス
テップＦ１０１において、例えば以上の例のように、音
声合成を行うために必要なタグが付与された文書を受信
又は作成するものとなる。In step F101 shown in FIG. 19, the document processing apparatus 1 receives or creates a document to which a tag necessary for performing speech synthesis has been added, for example, as in the above example. .

【０１５３】つぎに、図１９のステップＦ１０２におけ
る読み上げ用ファイルの生成について説明する。文書処
理装置１は、タグファイル中のタグから、読み上げめた
めの属性情報を導出し、この属性情報を埋め込むことに
よって読み上げ用ファイルを生成する。具体的には、文
書処理装置１は、文書の段落、文、句の先頭を示すタグ
を見つけ出し、これらのタグに対応して読み上げのため
の属性情報を埋め込む。また文書処理装置は、文書の要
約文を作成した場合には、その要約文に含まれる部分の
先頭を文書から見つけ出し、読み上げの際に音量を増大
させる属性情報を埋め込み、要約文に含まれる部分であ
ることを強調することなどもできる。Next, the generation of a reading file in step F102 of FIG. 19 will be described. The document processing device 1 derives attribute information for reading out from a tag in the tag file, and generates a reading-out file by embedding the attribute information. Specifically, the document processing apparatus 1 finds tags indicating the beginning of a paragraph, a sentence, or a phrase of a document, and embeds attribute information for reading aloud in correspondence with these tags. When the document processing apparatus prepares the summary of the document, the head of the part included in the summary is found from the document, and the attribute information for increasing the volume at the time of reading out is embedded. Can be emphasized.

【０１５４】文書処理装置１は、図２１又は図２２に示
したタグファイルから図２３又は図２４に示すような読
み上げ用ファイルを生成する。なお、図２３（ａ）
（ｂ）は図２１（ａ）（ｂ）に示した部分に対応するも
のである。実際の読み上げ用ファイルは、上述した見出
し部分から最後の段落までが１つのファイルとして構成
されていることは勿論である。The document processing apparatus 1 generates a reading file as shown in FIG. 23 or 24 from the tag file shown in FIG. 21 or FIG. FIG. 23 (a)
(B) corresponds to the portion shown in FIGS. 21 (a) and (b). It goes without saying that the actual reading file is composed of one file from the above-mentioned heading portion to the last paragraph.

【０１５５】図２３に示す読み上げ用ファイルには、文
書の先頭に対応してCom=Lang***という属性情報が埋め
込まれている。この属性情報は、文書を記述している言
語を示す。ここでは、Com=Lang=JPNという属性情報であ
り、文書を記述している言語が日本語であることを示し
ている。文書処理装置においては、この属性情報を参照
することで、文書毎に言語に応じた適切な音声合成エン
ジンを選択することができる。In the read-out file shown in FIG. 23, attribute information Com = Lang *** is embedded corresponding to the head of the document. This attribute information indicates the language describing the document. Here, the attribute information is Com = Lang = JPN, indicating that the language describing the document is Japanese. In the document processing apparatus, by referring to the attribute information, it is possible to select an appropriate speech synthesis engine corresponding to the language for each document.

【０１５６】また、この読み上げ用ファイルには各所
に、Com=begin_p、Com=begin_s、Com=begin_phという属
性情報が埋め込まれている。これらの属性情報は、それ
ぞれ、文書の段落、文及び句の先頭を示す。文書処理装
置１は、上述したタグファイル中のタグに基づいて、こ
れらの段落、文及び句の先頭を識別する。なお、読み上
げ用ファイルにおいて、例えば上述したタグファイル中
の＜形容動詞句＞＜名詞句＞のように、同じレベルの統
語構造を表すタグが連続して表れる部分に対しては、そ
れぞれに対応する数のCom=begin_phが埋め込まれずに、
まとめられて１つのCom=begin_phが埋め込まれる。In addition, attribute information such as Com = begin_p, Com = begin_s, and Com = begin_ph is embedded in this reading file at various places. These pieces of attribute information indicate the beginnings of paragraphs, sentences, and phrases of the document, respectively. The document processing device 1 identifies the beginning of these paragraphs, sentences, and phrases based on the tags in the tag file described above. In the reading file, for example, portions where tags representing syntactic structures of the same level appear continuously, such as <adjective verb phrase><nounphrase> in the above-described tag file, correspond to each. Without the number Com = begin_ph being embedded,
It is put together and one Com = begin_ph is embedded.

【０１５７】さらに、読み上げ用ファイルには、Com=be
gin_p、Com=begin_s、及びCom=begin_phに対応して、そ
れぞれ、Pau=500、Pau=100及びPau=50という属性情報が
埋め込まれている。これらの属性情報は、それぞれ、読
み上げの際に５００ミリ秒、１００ミリ秒及び５０ミリ
秒の休止期間を設けることを示す。すなわち文書処理装
置１が、文章の段落、文及び句の先頭こおいて、それぞ
れ、５００ミリ秒、１００ミリ秒及び５０ミリ秒の休止
期間を設けて文書を音声合成エンジンにより読み上げる
ようにするための情報である。なお、これらの属性情報
は、Com=begin_p、Com=begin_s及びCom=begin_phに対応
して埋め込まれる。そのため、例えばタグファイル中の
＜副詞句＞＜名詞句＞のように、同じレベルの統語構造
を表すタグが連続して表れる部分は、１つの句として捉
えられ、それぞれに対応する数のPau=50が埋め込まれず
に、まとめられて１つのPau=50が埋め込まれる。また、
例えばタグファイル中の＜段落＞＜文＞＜名詞句＞のよ
うに、異なるレベルの統語構造を表すタグが連続して表
れる部分については、それぞれに対応するPau=***が埋
め込まれる。そのため文書処理装置１は、このような部
分を読み上げる際には、例えば文書の段落、文及び句の
それぞれの休止期間を加算して得られる６５０ミリ秒の
休止期間を設けて読み上げるようにする。このように、
文書処理装置１は、段落、文及び句に対応した休止期間
を設けることで、段落、文及び句の切れ目を考慮した違
和感のない読み上げを行うことができる。なお、この休
止期間は、文書の段落、文及び句の先頭において、それ
ぞれ、６００ミリ秒、１００ミリ秒及び５０ミリ秒であ
る必要はなく、適宜変更することができる。Further, the read-out file includes Com = be
Attribute information of Pau = 500, Pau = 100, and Pau = 50 is embedded corresponding to gin_p, Com = begin_s, and Com = begin_ph, respectively. These pieces of attribute information indicate that pause periods of 500 ms, 100 ms, and 50 ms are provided for reading out. That is, the document processing apparatus 1 provides pauses of 500 ms, 100 ms, and 50 ms at the beginning of a paragraph, a sentence, and a phrase of a sentence, respectively, so that the document is read out by the speech synthesis engine. Information. Note that these pieces of attribute information are embedded corresponding to Com = begin_p, Com = begin_s, and Com = begin_ph. Therefore, for example, a portion where tags representing the same level of syntactic structure appear continuously, such as <adverb phrase><nounphrase> in the tag file, is regarded as one phrase, and the corresponding number of Pau = One Pau = 50 is embedded together without being embedded. Also,
For example, Pau = *** is embedded in portions where tags representing different levels of syntactic structure appear continuously, such as <paragraph><sentence><nounphrase> in the tag file. Therefore, when reading out such a portion, the document processing apparatus 1 provides a pause period of 650 milliseconds obtained by adding the pause periods of the paragraphs, sentences, and phrases of the document, for example. in this way,
By providing a pause period corresponding to a paragraph, a sentence, and a phrase, the document processing apparatus 1 can perform reading without discomfort in consideration of a break between the paragraph, the sentence, and the phrase. This pause period does not need to be 600 ms, 100 ms, and 50 ms at the beginning of the paragraph, sentence, and phrase of the document, respectively, and can be changed as appropriate.

【０１５８】さらにまた、読み上げ用ファイルにおいて
は、タグファイル中で記述されている発音＝“null”と
いう読み属性情報に対応して、「（たんぱく）」が除か
れているとともに、発音＝“りんぱかん”、発音＝“す
みか”という読み属性情報に対応して、「リンパ管」、
「住み家」が、それぞれ、「りんぱかん」、「すみか」
に置換されている。文書処理装置１は、このような読み
属性情報を埋め込むことで、音声合成エンジンが参照す
る辞書の不備による読み誤りをすることがないようにし
ている。Further, in the reading-out file, “(protein)” is removed and the pronunciation = “rin” corresponding to the pronunciation attribute information of pronunciation = “null” described in the tag file. According to the reading attribute information of "kan" and pronunciation = "Sumika", "lymphatic vessels"
"Dwellers" are "Rinkan" and "Sumika", respectively.
Has been replaced by By embedding such reading attribute information, the document processing device 1 prevents reading errors due to inadequate dictionaries referenced by the speech synthesis engine.

【０１５９】また、読み上げ用ファイルには、文書内に
含まれた引用文であることを示すタグに基づいて、この
引用文のみを別の音声合成エンジンを用いるように指定
するための属性情報が埋め込まれてもよい。さらに、読
み上げ用ファイルには、疑問文であることを示すタグに
基づいて、その文の語尾のイントネーションを上げるた
めの属性情報が埋め込まれるようにしてもよい。さらに
また、読み上げ用ファイルには、必要に応じて、いわゆ
る「である調」の文体を「ですます調」の文体に変換す
るための属性情報を埋め込むこともできる。なお、この
場合、文書処理装置１は、このような属性情報を読み上
げ用ファイルに埋め込むのではなく、「である調」の文
体を「ですます調」の文体に変換して音声読み上げ用フ
ァイルを生成するようにしてもよい。In the reading file, attribute information for designating only this quotation to use another speech synthesis engine based on a tag indicating that the quotation is included in the document is included. It may be embedded. Further, attribute information for raising the intonation of the ending of the sentence may be embedded in the reading-out file based on a tag indicating that the sentence is a question sentence. Furthermore, in the reading-out file, if necessary, attribute information for converting a so-called "Dama-tona" style to a "Dama-tona" style can be embedded. In this case, instead of embedding such attribute information in the reading-out file, the document processing apparatus 1 converts the style of "Dana-tona" into the style of "Dan-Ma-tona" and converts the speech reading-out file. You may make it generate | occur | produce.

【０１６０】一方、図２４に示す読み上げ用ファイルに
は、文書の先頭に対応してCom=Lang=ENGという属性情報
が埋め込まれており、文書を記述している言語が英語で
あることを示している。また、読み上げ用ファイルに
は、Com=Vol=***という属性情報が埋め込まれている。
この属性情報は、読み上げの時の音量を示す。例えば、
Com=Vol=0は、文書処理装置のデフォルトの音量で読み
上げることを示している。また、Com=Vol=80は、デフォ
ルトの音量を８０％増量した音量で読み上げることを示
している。任意の、Com=Vol=***は、次のCom=Vol=***ま
で有効である。さらに、読み上げ用ファイルにおいて
は、タグファイル中で記述されている発音＝“two”と
いう読み属性情報に対応して、「ＩＩ」が「two」に置
換されている。On the other hand, in the reading file shown in FIG. 24, attribute information of Com = Lang = ENG is embedded corresponding to the head of the document, indicating that the language describing the document is English. ing. The attribute information of Com = Vol = *** is embedded in the file for reading out.
This attribute information indicates the volume at the time of reading. For example,
Com = Vol = 0 indicates that the text is read out at the default volume of the document processing apparatus. Com = Vol = 80 indicates that the default volume is read out at a volume increased by 80%. Any Com = Vol = *** is valid until the next Com = Vol = ***. Further, in the reading file, "II" is replaced by "two" in accordance with the reading attribute information of pronunciation = "two" described in the tag file.

【０１６１】文書処理装置１は、図２０に示す一連の工
程を経ることによって、このような読み上げ用ファイル
を生成する。まず文書処理装置１は、ステップＦ２０１
において、ＣＰＵ１３によって、受信又は作成したタグ
ファイルを解析する。ここで文書処理装置１は、文書を
記述している言語を判別するとともに、文書の段落、文
及び句の先頭や、読み属性情報をタグに基づいて探し出
す。続いて文書処理装置１は、ステップＦ２０２におい
て、ＣＰＵ１３によって、文書を記述している言語に応
じて文書の先頭にCom=Lang=***という属性情報を埋め込
む。The document processing apparatus 1 generates such a reading file through a series of steps shown in FIG. First, the document processing apparatus 1 executes step F201.
The CPU 13 analyzes the received or created tag file. Here, the document processing apparatus 1 determines the language in which the document is described, and searches for the head of the paragraph, sentence, and phrase of the document, and the reading attribute information based on the tag. Subsequently, in step F202, the document processing apparatus 1 embeds attribute information Com = Lang = *** at the head of the document by the CPU 13 in accordance with the language in which the document is described.

【０１６２】次に文書処理装置１は、ステップＦ２０３
において、ＣＰＵ１３によって、文書の段落、文及び句
の先頭を読み上げ用ファイルにおける属性情報に置換す
る。すなわち文書処理装置１は、タグファイル中の＜段
落＞、＜文＞及び＜＊＊＊句＞を、それぞれ、Com=begi
n_p、Com=begin_s及びCom=begin_phに置換する。Next, the document processing apparatus 1 executes step F203.
, The CPU 13 replaces the head of the paragraph, sentence and phrase of the document with the attribute information in the reading file. That is, the document processing apparatus 1 replaces <paragraph>, <sentence>, and <*** phrase> in the tag file with Com = begi
Replace with n_p, Com = begin_s and Com = begin_ph.

【０１６３】さらに文書処理装置１は、ステップＦ２０
４において、ＣＰＵ１３によって、同じレベルの統語構
造が表れて同じCom=begin_***が重複しているものを、
１つのCom=begin_***にまとめる。続いて文書処理装置
１は、ステップＦ２０５において、ＣＰＵ１３によっ
て、Com=begin_***に対応してPau=***を埋め込む。すな
わち文書処理装置１は、Com=begin_pの前にPau=500を埋
め込み、Com=begin_sの前にPau=100を埋め込み、Com=be
gin_phの前にPau=50を埋め込む。そして文書処理装置１
は、ステップＦ２０６において、ＣＰＵ１３によって、
読み属性情報に基づいて、正しい読みに置換する。すな
わち文書処理装置１は、発音＝“null”という読み属性
情報に基づいて、「（たんぱく）」を除去するととも
に、発音＝“りんぱかん”、発音＝“すみか”という読
み属性情報に基づいて、「リンパ管」、「住み家」を、
それぞれ、「りんぱかん」、「すみか」に置換する。Further, the document processing apparatus 1 executes step F20.
In 4, the same level of syntactic structure appears by the CPU 13 and the same Com = begin _ ***
Combine into one Com = begin _ ***. Subsequently, in step F205, the document processing apparatus 1 embeds Pau = *** in correspondence with Com = begin _ *** by the CPU 13. That is, the document processing apparatus 1 embeds Pau = 500 before Com = begin_p, embeds Pau = 100 before Com = begin_s,
Embed Pau = 50 before gin_ph. And the document processing device 1
Is determined by the CPU 13 in step F206.
Replace with correct reading based on reading attribute information. That is, the document processing apparatus 1 removes “(protein)” based on the pronunciation attribute information “pronunciation =“ null ”, and based on the pronunciation attribute information“ pronunciation = “Rinkan” and pronunciation = “Sumika”. "Lymphatic vessels", "houses"
Substitute “Rinkan” and “Sumika”, respectively.

【０１６４】文書処理装置１は、先に図１９に示したス
テップＦ１０２において、この図２０に示す処理を行う
ことによって、読み上げ用ファイルを自動的に生成す
る。文書処理装置１は、生成した読み上げ用ファイルを
ＲＡＭ１４に記憶させる。In step F102 shown in FIG. 19, the document processing apparatus 1 automatically generates a reading file by performing the processing shown in FIG. The document processing device 1 causes the RAM 14 to store the generated reading file.

【０１６５】つぎに、図１９のステップＦ１０３におけ
る読み上げ用ファイルを用いた処理について説明する。
文書処理装置１は、読み上げ用ファイルを用いて、ＲＯ
Ｍ１５やＨＤＤ３４等に予め記憶されている音声合成エ
ンジンに適した処理をＣＰＵ１３の制御のもとに行う。
具体的には、文書処理装置１は、読み上げ用ファイルに
埋め込まれているCom=Lang=***という属性情報に基づい
て、使用する音声合成エンジンを選択する。音声合成エ
ンジンは、言語や男声／女声等の種類に応じて識別子が
付されており、その情報が例えば初期設定ファイルとし
てＨＤＤ３４に記録されている。文書処理装置１は、初
期設定ファイルを参照し、言語に対応した識別子の音声
合成エンジンを選択する。．Next, the processing using the reading file in step F103 of FIG. 19 will be described.
The document processing device 1 uses the read-out file to
Under the control of the CPU 13, processing suitable for the speech synthesis engine stored in advance in the M15, the HDD 34, or the like is performed.
Specifically, the document processing apparatus 1 selects a speech synthesis engine to be used based on the attribute information of Com = Lang = *** embedded in the reading file. The speech synthesis engine is provided with an identifier according to the type of language or male / female voice, and the information is recorded in the HDD 34 as, for example, an initialization file. The document processing device 1 refers to the initialization file and selects a speech synthesis engine having an identifier corresponding to the language. .

【０１６６】また文書処理装置１は、読み上げ用ファイ
ルに埋め込まれているCom=begin_***を音声合成エンジ
ンに適した形式に変換する。例えば文書処理装置１は、
Com=begin_phをMark=10000のように１００００番台の番
号でマーク付けする。またCom=begin_sをMark=1000のよ
うに１０００番台の番号でマーク付けし、Com=begin_p
をMark=100のように１００番台の番号でマーク付けす
る。これは、＜句＞、＜文＞、＜段落＞の先頭が、それ
ぞれ１００００番台、１０００番台、１００番台の番号
で示されることを意味し、このマークによって＜句＞、
＜文＞、＜段落＞の先頭が識別できるようになる。さら
に、読み上げ用ファイルにおいては、音量の属性情報が
Voll=***のようにデフォルトの音量の百分率で表されて
いることから、文書処理装置１は、この属性情報に基づ
いて、百分率の情報を絶対値の情報に変換して求める。Further, the document processing apparatus 1 converts Com = begin _ *** embedded in the file for reading out into a format suitable for the speech synthesis engine. For example, the document processing device 1
Mark Com = begin_ph with a number in the 10000s, such as Mark = 10000. Also mark Com = begin_s with a number in the 1000s, such as Mark = 1000, and Com = begin_p
Is marked with a number in the 100s, such as Mark = 100. This means that the heads of <phrase>, <sentence>, and <paragraph> are indicated by numbers in the 10000s, 1000s, and 100s, respectively.
The head of <sentence> and <paragraph> can be identified. Furthermore, in the reading file, the volume attribute information is
Since the volume is represented by the default volume percentage such as Voll = ***, the document processing apparatus 1 converts the percentage information into absolute value information based on the attribute information and obtains the absolute value information.

【０１６７】文書処理装置１は、先に図１９に示したス
テップＦ１０３において、このような読み上げ用ファイ
ルを用いた処理を行うことによって、読み上げ用ファイ
ルを音声合成エンジンが文書を読み上げることが可能な
形式に変換するものとなる。In step F103 shown in FIG. 19, the document processing apparatus 1 performs processing using such a file for reading out, so that the text-to-speech file can be read out by the speech synthesis engine. Format.

【０１６８】つぎに、図１９のステップＦ１０４におけ
るユーザインターフェースを用いた操作について説明す
る。文書処理装置１は、ユーザが例えば入力部２０のマ
ウス等を操作して先に図１６又は図１７に示した読み上
げボタン３０６ｃをクリックすることによって、音声合
成エンジンを起動する。そして文書処理装置１は、図２
５に示すようなユーザインターフェース用の読み上げウ
インドウ４０１を表示部３０に表示する。Next, the operation using the user interface in step F104 of FIG. 19 will be described. The document processing device 1 activates the speech synthesis engine when the user operates the mouse or the like of the input unit 20 and clicks on the reading button 306c shown in FIG. 16 or FIG. Then, the document processing apparatus 1
A reading window 401 for a user interface as shown in FIG. 5 is displayed on the display unit 30.

【０１６９】この読み上げウインドウ４０１は、図示す
るように、文書を読み上げさせるための再生ボタン４２
０と、読み上げを停止させるための停止ボタン４２１
と、読み上げを一時停止させるための一時停止ボタン４
２２とを有する。また、この読み上げウインドウ４０１
は、文単位で頭出し、早戻し及び早送りさせるための頭
出しボタン４１１、早戻しボタン４１２及び早送りボタ
ン４１３と、段落単位で頭出し、早戻し及び早送りさせ
るための頭出しボタン４１４、早戻しボタン４１５及び
早送りボタン４１６と、句単位で頭出し、早戻し及び早
送りさせるための頭出しボタン４１７、早戻しボタン４
１８及び早送りボタン４１９とを有する。[0169] As shown in the figure, the readout window 401 has a play button 42 for reading out a document.
0 and a stop button 421 for stopping reading out
And pause button 4 to pause reading aloud
22. In addition, the reading window 401
Are a cue button 411, a fast rewind button 412, and a fast forward button 413 for cueing, fast rewinding, and fast forwarding in sentence units, a cueing button 414 for cueing, fast rewinding, and fast forwarding in paragraph units, and a fast rewinding button. A button 415 and a fast forward button 416, and a cue button 417 and a fast reverse button 4 for cueing, fast rewinding, and fast forwarding in phrase units
18 and a fast forward button 419.

【０１７０】さらに、読み上げウインドウ４０１は、読
み上げる対象を全文とするか、上述したようにして作成
された要約文とするかを選択するための選択スイッチ４
２３，４２４を有する。また、読み上げ時に画像を表示
する画像表示エリア４０３が設定され、読み上げている
人のイメージを表示したり、テロップ表示エリア４０２
が設けられ、読み上げ音声に対応して文字がテロップ表
示できるようにされている。Further, the reading window 401 is provided with a selection switch 4 for selecting whether to read the whole sentence or the summary sentence created as described above.
23,424. Further, an image display area 403 for displaying an image at the time of reading is set, and an image of a person who is reading is displayed, and a telop display area 402 is displayed.
Is provided so that characters can be displayed in a telop corresponding to the reading voice.

【０１７１】なお、ここでは図示しないが、例えば、音
声を増減させるためのボタンや読み上げの速さを増減さ
せるためのボタン、男声／女声等の声を変化させるため
のボタン等を有していてもよい。Although not shown here, for example, there are buttons for increasing and decreasing the voice, buttons for increasing and decreasing the reading speed, and buttons for changing the voice such as male / female voice. Is also good.

【０１７２】文書処理装置１は、ユーザがこれらの各種
ボタン／スイッチを例えば入力部２０のマウス等を操作
してクリック／選択することに応じて、音声合成エンジ
ンによる読み上げ動作を行う。例えば、文書処理装置１
は、ユーザが再生ボタン４２０をクリックすることによ
って、文書の読み上げを開始する。具体的には制御部１
１は音声合成処理により生成した音声信号を音声出力部
３３に供給し、音声として出力する。また文書処理装置
１は、停止ボタン４２１、一時停止ボタン４２２のクリ
ックにおいて、読み上げ処理の停止や一時停止を行う。The document processing apparatus 1 performs a reading operation by the speech synthesis engine in response to the user clicking / selecting these various buttons / switches by operating, for example, the mouse of the input unit 20 or the like. For example, the document processing device 1
Starts reading the document by clicking the play button 420 by the user. Specifically, the control unit 1
1 supplies the audio signal generated by the audio synthesis processing to the audio output unit 33 and outputs it as audio. In addition, the document processing apparatus 1 stops or pauses the reading process when the stop button 421 and the pause button 422 are clicked.

【０１７３】また、読み上げの途中でユーザが頭出しボ
タン４１１を押すことによって、現在読み上げている文
の先頭にジャンプして再び読み上げる。頭出しボタン４
１４、４１７についても同様に、それぞれ現在読み上げ
ている段落や句の先頭にジャンプして再び読み上げを行
う。制御部１１は、この頭出しボタン４１１、４１４、
４１７の操作に関しては、上記したマーク付により、ジ
ャンプ先を認識するものとなる。即ち、文に関する頭出
しボタン４１１が操作された際には、制御部１１は、現
在読み上げている文の途中位置から文を遡っていき、１
０００番台のマークをサーチする。そして１０００番台
のマークが検出されたら、そこから読み上げを再開する
ものとなる。段落や句の場合は、それぞれ１００番台、
１００００番台のマークを探して読み上げを再開するも
のとなる。この様な処理によって、例えば文書中でユー
ザが所望の部分を繰り返し再生させたいといった要求に
応えることができる。Further, when the user presses the cue button 411 during the reading, the user jumps to the head of the currently read sentence and reads the sentence again. Cue button 4
In the same manner, for 14 and 417, the program jumps to the beginning of the currently read paragraph or phrase, and is read again. The control unit 11 controls the cue buttons 411, 414,
Regarding the operation 417, the jump destination is recognized by the above-mentioned marking. In other words, when the cue button 411 relating to a sentence is operated, the control unit 11 traces the sentence back from the middle position of the sentence currently being read out.
Search for 000's mark. When a mark in the 1000's range is detected, reading out is resumed from there. In the case of paragraphs and phrases,
Reading is resumed by looking for a mark in the 10000s range. By such processing, for example, it is possible to respond to a request that a user wants to repeatedly reproduce a desired portion in a document.

【０１７４】文書処理装置１は、図１９のステップＦ１
０４において、ユーザがこのような読み上げウインドウ
４０１でのユーザインターフェースを用いた操作を行う
ことに応じて、音声合成エンジンにより文書を読み上げ
る。このようにして文書処理装置１は、所望の文書を音
声合成エンジンにより違和感なく読み上げることができ
る。The document processing device 1 executes the processing in step F1 in FIG.
At 04, in response to the user performing an operation using the user interface in the reading window 401, the text-to-speech engine reads the document. In this manner, the document processing apparatus 1 can read out a desired document by the speech synthesis engine without any discomfort.

【０１７５】ところで読み上げる対象の文書としては文
書データの本文でもよいし、要約文であってもよい。本
文か要約文かは、選択スイッチ４２３，４２４のクリッ
クにより選択されるが、いずれにしても、本文又は要約
文としてのタグファイルについて、図１９のステップＦ
１０２，Ｆ１０３の処理が行われることで、音声合成エ
ンジンによる文書読み上げが可能となる。By the way, the document to be read out may be the body of the document data or an abstract. The text or the summary is selected by clicking the selection switches 423 and 424. In any case, the tag file as the text or the summary is read in step F of FIG.
By performing the processing of steps 102 and F103, the text-to-speech reading by the speech synthesis engine becomes possible.

【０１７６】なお本例では、受信又は作成したタグファ
イルから読み上げ用ファイルを生成するものとしたが、
このような読み上げ用ファイルを生成せずに、タグファ
イルに基づいて直接読み上げを行うようにしてもよい。
この場合、文書処理装置１は、タグファイルを受信又は
作成した後、音声合成エンジンを用い、タグファイルに
付与されている段落、文及び句を示すタグに基づいて、
段落、文及び句の先頭に所定の休止期間を設けて読み上
げる。このようにすることによって、文書処理装置１
は、音声読み上げ用ファイルを生成することなく、タグ
ファイルに基づいて直接読み上げることができる。In this example, the reading file is generated from the received or created tag file.
Instead of generating such a reading file, reading may be directly performed based on the tag file.
In this case, after receiving or creating the tag file, the document processing device 1 uses a speech synthesis engine and, based on tags indicating paragraphs, sentences, and phrases attached to the tag file,
A certain pause is provided at the beginning of a paragraph, sentence or phrase to read out. By doing so, the document processing device 1
Can be read directly based on the tag file without generating a voice reading file.

【０１７７】７．表示される要約サイズに応じた読み上
げ処理続いて本例の文書処理装置で実現される、表示される要
約サイズに応じた読み上げ処理について説明する。上述
してきたように、本例の場合、閲覧ウインドウ３０１の
要約表示部３０４において要約文を表示でき、しかもそ
の要約文は、ユーザーが任意に可変できる要約表示部３
０４のサイズに応じた文書長で生成され、表示されるも
のとなる。これは上述したように、ユーザーが要約文の
サイズを指定できるものとなるが、ユーザが本文等の他
の表示を広くとりたいと思った場合や、他のウインドウ
を同時に表示している場合など、要約表示部３０４のサ
イズを十分にとれないことがある。或いはユーザーがわ
ざわざ要約表示部３０４のサイズを広げることを面倒に
思うこともある。例えばこのようなことから、要約とし
て十分な文書長で表示を行うことができず、結果的に、
文書の概略内容を提示する要約として不十分な表示しか
できないことも発生する。[0177] 7. Speech Processing According to Displayed Summary Size Next, a description will be given of a speech processing according to the displayed summary size, which is realized by the document processing apparatus of this example. As described above, in the case of this example, a summary can be displayed on the summary display 304 of the browsing window 301, and the summary can be arbitrarily changed by the user.
The document is generated and displayed with a document length corresponding to the size of the document 04. As described above, this allows the user to specify the size of the summary sentence.However, when the user wants to make other displays such as the text wider, or when other windows are displayed at the same time, In some cases, the size of the summary display unit 304 may not be sufficient. Alternatively, the user may be bothered to increase the size of the summary display unit 304. For example, because of this, it is not possible to display a summary with a sufficient document length, and as a result,
Insufficient display may occur as a summary showing the outline of the document.

【０１７８】そこで本例では、ユーザーが要約文につい
ての読み上げ処理を求めた場合は、その時点で作成され
ている要約文（即ち要約表示部３０４に表示されている
要約文）の文書長（即ち要約表示部３０４のサイズ）に
応じて、それが短い場合には、より長い文書長の要約文
を作成し、それを読み上げるようにするものである。つ
まり、表示される要約文が短くて内容把握に不十分なも
のであるときには、或る程度十分と思われる要約文を作
成して読み上げることで、ユーザーの文書内容の把握を
補助できるようにする。Therefore, in this example, when the user requests a reading process for the summary sentence, the document length (ie, the summary sentence displayed on the summary display unit 304) of the summary sentence created at that time (ie, the summary sentence). If the length is short according to the size of the summary display unit 304), a summary with a longer document length is created and read out. In other words, when the displayed summary is short and insufficient for understanding the content, it is possible to assist the user in understanding the content of the document by creating and reading out the summary that seems to be somewhat sufficient. .

【０１７９】このための処理を図２６に示して説明して
いく。なお本例では、このような読み上げ処理は、図２
５の読み上げウインドウ４０１において要約文の読み上
げを求める操作を行った際に行われる動作とするが、も
ちろんそれは一例である。The processing for this will be described with reference to FIG. In the present example, such a reading process is performed as shown in FIG.
The operation is performed when an operation for reading out the summary sentence is performed in the reading-out window 401 of No. 5, but this is merely an example.

【０１８０】いま、例えば図１６に示した閲覧ウインド
ウ３０１からユーザーが要約作成ボタン３０６ａをクリ
ックし、上述した図１８の処理で要約表示部３０４のサ
イズに応じた文書長の要約文が作成され、図２７のよう
に表示されているとする。ここでユーザーが読み上げボ
タン３０６ｃをクリックすると、上述した読み上げのた
めの処理が行われ、図２５の読み上げウインドウ４０１
が表示される。このときユーザーが選択スイッチ４２４
をクリックした上で、再生ボタン４２０をクリックし、
つまり要約文の読み上げを求めたとする。すると制御部
１１では図２６に示すような、要約文のサイズに応じた
読み上げ処理が行われる。Now, for example, the user clicks the summary creation button 306a from the browsing window 301 shown in FIG. 16, and a summary of a document length corresponding to the size of the summary display unit 304 is created by the processing of FIG. It is assumed that the display is as shown in FIG. Here, when the user clicks the read-aloud button 306c, the above-described process for the read-aloud is performed, and the read-aloud window 401 of FIG.
Is displayed. At this time, the user selects the switch 424
After clicking, click the play button 420,
That is, it is assumed that the user has read out the summary sentence. Then, the control section 11 performs a reading process according to the size of the summary sentence as shown in FIG.

【０１８１】まず制御部１１はステップＦ３０１で、現
在表示されている要約文のサイズを確認する。要約文は
要約表示部３０４のサイズに応じて生成されるため、こ
のステップＦ３０１では、現在の要約文を生成した際に
行われた図１８の処理における変数ｗｓを参照すればよ
い。そして制御部１１は、この変数ｗｓの値が、予め設
定された基準値ｗｓｔｈ以上であるか否かを判別する。
この基準値ｗｓｔｈは、要約文として或る程度十分な文
書長を表示できる要約表示部３０４のサイズとして設定
されているものである。First, in step F301, the control unit 11 confirms the size of the currently displayed summary sentence. Since the summary is generated according to the size of the summary display unit 304, in this step F301, the variable ws in the processing of FIG. 18 performed when the current summary is generated may be referred to. Then, the control unit 11 determines whether or not the value of the variable ws is equal to or greater than a preset reference value wsth.
The reference value wsth is set as the size of the summary display unit 304 that can display a certain sufficient document length as a summary sentence.

【０１８２】ここで例えば図２７に示した要約文表示エ
リア３０４のサイズとしての変数ｗｓは、基準値ｗｓｔ
ｈ未満であったとすると、制御部１１の処理はステップ
Ｆ３０２に進み、変数ｗｓｒｄを用意して、この変数ｗ
ｓｒｄに変数ｗｓ＋αの値を代入する。なお、「α」の
値の設定については各種考えられるが、それは後述す
る。Here, for example, the variable ws as the size of the summary sentence display area 304 shown in FIG.
If it is less than h, the process of the control unit 11 proceeds to step F302, prepares a variable wsrd, and
The value of the variable ws + α is substituted for srd. There are various possible settings for the value of “α”, which will be described later.

【０１８３】これにより変数ｗｓｒｄは、現在の要約表
示エリア３０４よりも「α」分だけ大きいサイズを示す
値となる。続いて制御部１１はステップＦ３０３で、変
数ｗｓｒｄを用いて、図１８の要約作成処理を実行す
る。この処理については重複説明を避けるが、ここで
は、図１８のステップＦ８２，Ｆ８５，Ｆ８９における
変数ｗｓに代えて、変数ｗｓｒｄが用いられることにな
る。従ってステップＦ３０３の処理により、現在の要約
文より長い文書長の要約文（要約文としてのタグファイ
ル）が生成される。As a result, the variable wsrd becomes a value indicating a size larger by “α” than the current summary display area 304. Subsequently, in step F303, the control unit 11 executes the digest creation processing of FIG. 18 using the variable wsrd. This process will not be described repeatedly, but here, the variable wsrd is used instead of the variable ws in steps F82, F85, and F89 in FIG. Therefore, by the process of step F303, a summary sentence (a tag file as a summary sentence) having a document length longer than the current summary sentence is generated.

【０１８４】続いて制御部１１はステップＦ３０４で、
ステップＦ３０３で生成したタグファイルから読み上げ
用ファイルを生成する。これは図１９のステップＦ１０
２の処理であり、図２０で詳述した処理が行われること
になる。読み上げ用ファイルが生成されたら、制御部１
１の処理はステップＦ３０５に進み、読み上げ用ファイ
ルを用いた処理を行う。つまり図１９のステップＦ１０
３で説明した処理と同様に、読み上げ用ファイルを音声
合成エンジンが文書を読み上げることが可能な形式に変
換する。具体的な処理は上述したとおりである。Subsequently, the control unit 11 determines in step F304 that
A reading-out file is generated from the tag file generated in step F303. This corresponds to step F10 in FIG.
This is the processing of No. 2 and the processing described in detail in FIG. 20 is performed. When the reading file is generated, the control unit 1
The process 1 proceeds to step F305, and performs a process using the reading file. That is, step F10 in FIG.
Similarly to the process described in 3, the speech file is converted into a format in which the speech synthesis engine can read the document. The specific processing is as described above.

【０１８５】ステップＦ３０５の処理が完了すること
で、音声合成エンジンは、ステップＦ３０３で生成した
要約文、つまり現在の要約文よりも長い文章とされた要
約文を読み上げることが可能となる。そこで制御部１１
はステップＦ３０６として比較的詳しい内容の要約文の
読み上げを開始させる。このとき表示としては、図２５
のような読み上げウインドウ４０１のままでもよいし、
例えば図２８に示すように読み上げウインドウ４０１の
一部（操作ボタン部分）のみを閲覧ウインドウ３０１に
重ねるような表示を行うようにしてもよい。図２８のよ
うな表示を行うことにより、表示される要約文はわずか
な文章であっても、読み上げにより、より詳しい要約が
提示されていることがユーザーは認識できる。Upon completion of the processing in step F305, the speech synthesis engine can read out the summary sentence generated in step F303, that is, the summary sentence longer than the current summary sentence. Therefore, the control unit 11
Starts reading out a relatively detailed summary sentence as step F306. At this time, the display shown in FIG.
May be the reading window 401 as in
For example, as shown in FIG. 28, a display may be performed in which only a part (operation button part) of the reading window 401 is superimposed on the browsing window 301. By performing the display as shown in FIG. 28, the user can recognize that a more detailed summary is presented by reading aloud even if the displayed summary is a small sentence.

【０１８６】読み上げ開始後は、ステップＦ３０７とし
て制御部１１はユーザインターフェースの操作に応じた
処理を行うことになる。即ち、図２５又は図２８のよう
に表示される各種ボタン４１１〜４２２を用いたユーザ
ー操作に応じて、再生停止、一時停止、頭出し、早送
り、早戻しなどを行っていく。After the start of the reading, the control section 11 performs a process corresponding to the operation of the user interface as step F307. That is, playback stop, pause, cueing, fast forward, fast rewind, etc. are performed in accordance with a user operation using various buttons 411 to 422 displayed as shown in FIG. 25 or FIG.

【０１８７】ところで、現在の要約文サイズが例えば図
１７に示した場合のように比較的大きいものであって、
ステップＦ３０１で変数ｗｓの値が、基準値ｗｓｔｈ以
上であった場合は、制御部１１の処理はステップＦ３０
８に進み、現在の要約文のタグファイルから読み上げ用
ファイルを生成する。さらにステップＦ３０５で、読み
上げ用ファイルを用いた処理を行う。そしてステップＦ
３０６で読み上げを開始する。即ちこの場合は、現在の
要約文をそのまま音声合成エンジンが文書を読み上げる
ことが可能な形式に変換し、読み上げ音声として出力す
るものとなる。By the way, if the current summary sentence size is relatively large as shown in FIG. 17, for example,
If the value of the variable ws is equal to or greater than the reference value wsth in step F301, the process of the control unit 11 proceeds to step F30.
Proceeding to S8, a reading file is generated from the tag file of the current summary sentence. Further, in step F305, a process using the reading file is performed. And step F
At 306, reading out is started. That is, in this case, the current summary sentence is converted into a format in which the text-to-speech engine can read out the document as it is, and is output as a read-out voice.

【０１８８】以上のような図２６の処理が行われること
により、例え要約文としては不十分な文書長でしか表示
されない場合であっても、読み上げにより、ユーザーは
より詳しい内容の要約文を聞くことができ、文書内容を
的確に把握できるようになる。また、表示されている要
約文が或る程度十分な内容である場合は、その文章をそ
のまま読み上げるようにすることで、読み上げられる要
約内容が必要以上に煩雑になることはない。もちろんそ
の際はステップＦ３０３の処理は不要となるため、処理
は簡易なものとなる。By performing the processing in FIG. 26 as described above, even if the summary text is displayed only with an insufficient document length, the user hears the summary text with more detailed contents by reading aloud. And the content of the document can be accurately grasped. If the displayed summary has a sufficient content to some extent, by reading the sentence as it is, the read summary does not become unnecessarily complicated. Of course, in that case, the process of step F303 is not necessary, so that the process is simple.

【０１８９】なお、ステップＦ３０２の「α」の値、つ
まり読み上げのための要約文の文書長を、どの程度の長
さにするかは、実際の使用上で好適な値が決められれば
よい。また「α」の値自体は固定値としておいてもよい
し、可変値とされてもよい。固定値とする場合は、例え
ば文書長が少なくとも基準値ｗｓｔｈに達する値とする
ことが考えられる。或いはユーザーが固定値としての
「α」の値を任意に設定できるようにしてもよい。可変
値とする場合については、例えばその時点の要約文サイ
ズ、つまり変数ｗｓの値に応じた値が係数演算或いはメ
モリテーブル等を用いて得られるようにすることが考え
られる。例えば、表示されている要約文が短いほど、読
み上げられる要約文が長くなるようにして、表示と読み
上げで相互に補完的に、ユーザーに対する要約内容の提
示が行われるようにすることが考えられる。つまり、表
示されている要約文と、読み上げられる要約文のサイズ
がリニアに、または段階的に反比例関係をもつことにな
るように「α」の値を算出又は選択するものである。The value of “α” in step F302, that is, the length of the digest sentence for reading aloud, may be determined by a value suitable for actual use. Further, the value of “α” may be a fixed value or a variable value. In the case of a fixed value, for example, it is conceivable that the document length is a value that reaches at least the reference value wsth. Alternatively, the user may be able to arbitrarily set the value of “α” as the fixed value. In the case of a variable value, for example, it is conceivable that a value corresponding to the summary sentence size at that time, that is, a value of the variable ws can be obtained by using a coefficient operation or a memory table. For example, it is conceivable that the shorter the displayed summary sentence is, the longer the summary sentence is read out, so that the display of the summary sentence is complementarily provided to the user so that the summary content is presented to the user. That is, the value of “α” is calculated or selected such that the size of the displayed summary sentence and the size of the summary sentence to be read have an inversely proportional relationship linearly or stepwise.

【０１９０】さらに、図２６の処理とは異なるが、例え
ば表示されている要約文が長いものである場合は、表示
される要約文より短い要約文を生成して、それを読み上
げるようにすることも考えられる。これにより、ユーザ
ーは読み上げ音声で非常に概略的な要約内容を把握した
上で、表示内容から、一段と深い内容を確認できるとい
うものとなる。Further, although different from the processing of FIG. 26, for example, when the displayed summary is long, a summary shorter than the displayed summary may be generated and read out. Conceivable. As a result, the user can confirm a very rough summary content in the read-out voice, and can confirm the further deep content from the display content.

【０１９１】ところで、このような表示される要約サイ
ズに応じた読み上げ処理は、図２５の読み上げウインド
ウ４０１から要約文の読み上げを求める操作が行われる
際としたが、要約文の読み上げとして、通常の読み上げ
処理と、図２６の読み上げ処理が選択できるようにして
もよい。例えば読み上げウインドウ４０１上の選択スイ
ッチとして選択スイッチ４２３，４２４の「全文」「要
約文」に加えて「詳細要約文」の選択スイッチを用意
し、「要約文」が選択された場合は、現在の要約文をそ
のまま読み上げるようにするが、「詳細要約文」が選択
された場合に、図２６の処理で読み上げが行われるよう
にするなどが考えられる。また、表示される要約サイズ
に応じた読み上げ処理は、閲覧ウインドウ３０１から直
接指示できるようにしてもよい。例えば閲覧ウインドウ
の要約作成ボタン３０６ａ、読み上げボタン３０６ｃの
他に、要約読み上げボタンを設け、それがクリックされ
た際に、図２６の処理が開始されるようにする例であ
る。By the way, the reading process according to the displayed summary size is performed when the operation of requesting the reading of the summary sentence from the reading window 401 of FIG. 25 is performed. The reading process and the reading process of FIG. 26 may be selectable. For example, as a selection switch on the reading window 401, a selection switch of "detailed summary sentence" is prepared in addition to "all text" and "summary sentence" of the selection switches 423 and 424, and when "summary sentence" is selected, The summary sentence is read aloud as it is, but if the “detailed summary sentence” is selected, the reading may be performed in the process of FIG. 26. The reading process according to the displayed digest size may be directly instructed from the browsing window 301. For example, in this example, a summary reading button is provided in addition to the summary creation button 306a and the reading button 306c of the browsing window, and when the button is clicked, the processing of FIG. 26 is started.

【０１９２】８．要約作成及び読み上げ処理以上の処理は、既に作成されている要約文を読み上げる
際の処理として説明したが、要約作成時に、その要約サ
イズに応じて同時に読み上げを行うことも考えれる。こ
のような処理を、図２９で説明していく。8. Summarization and Reading Process The above process has been described as a process for reading out a summary sentence that has already been created, but it is also conceivable to simultaneously read out a summary at the time of creation according to the size of the summary. Such a process will be described with reference to FIG.

【０１９３】例えばこの図２９の処理は、図１６のよう
に閲覧ウインドウ３０１が表示されている状態で、ユー
ザーが要約作成ボタン３０６ａをクリックした際に実行
されるものとする。まず制御部１１はステップＦ４０１
として、その時点の要約表示部３０４のサイズに応じた
要約文の作成処理を行う。即ち図１８で説明した処理を
実行する。これにより要約文が作成できたら、制御部１
１はステップＦ４０２において、その要約文を表示す
る。即ち、例えば図２７のように要約表示部３０４に要
約文を表示する。For example, it is assumed that the process of FIG. 29 is executed when the user clicks the summary creation button 306a while the browsing window 301 is displayed as shown in FIG. First, the control unit 11 executes step F401
Then, a process of creating a summary sentence according to the size of the summary display unit 304 at that time is performed. That is, the processing described in FIG. 18 is executed. If a summary sentence can be created by this, the control unit 1
1 displays the summary sentence in step F402. That is, a summary sentence is displayed on the summary display unit 304 as shown in FIG. 27, for example.

【０１９４】続いてステップＦ４０３以降で、要約文の
読み上げを実行する。このステップＦ４０３〜Ｆ４０９
は、上記図２６のステップＦ３０１〜３０７の処理と同
様であるため、詳細な説明は省略するが、即ちこのステ
ップＦ４０３〜Ｆ４０９では、作成された要約文サイ
ズ、即ち要約表示部３０４のサイズに相当する変数ｗｓ
が基準値ｗｓｔｈ未満であった場合は、その要約文より
も長い要約文を作成した上で読み上げファイルの生成や
音声合成エンジンに適した形式への変換が行われる。そ
して例えば図２８のような表示とともに、比較的詳しい
要約文の読み上げが開始され、その後はユーザーの操作
に応じた動作が行われる。Subsequently, in step F403 and subsequent steps, the summary sentence is read aloud. Steps F403 to F409
26 is the same as the processing in steps F301 to F307 in FIG. 26, and thus detailed description is omitted. That is, in steps F403 to F409, the size of the generated summary sentence, ie, the size of the summary display unit 304 Variable ws
Is smaller than the reference value wsth, a summary sentence longer than the summary sentence is created, and then a read-out file is generated or converted into a format suitable for a speech synthesis engine. Then, with the display as shown in FIG. 28, for example, the reading of a relatively detailed summary sentence is started, and thereafter, an operation according to the operation of the user is performed.

【０１９５】これにより、生成され表示された要約文が
わずかな文章であっても、読み上げにより、より詳しい
要約内容をユーザーが知ることができる。つまり読み上
げ音声が、表示内容の補足的な内容としてユーザーに提
示される。As a result, even if the generated and displayed summary sentence is a small sentence, the user can know more detailed summary content by reading aloud. That is, the reading voice is presented to the user as supplementary content of the display content.

【０１９６】一方、作成された要約文サイズが比較的大
きいものであった場合、つまりユーザーが要約表示部３
０４のエリアを広げた上で要約作成を求めた場合などで
あって、ステップＦ４０３で変数ｗｓの値が、基準値ｗ
ｓｔｈ以上であった場合は、制御部１１は読み上げを実
行せずに図２９の処理を終了する。これは、表示のみで
要約が十分に表現されていることから、より詳しい要約
内容の読み上げは必要ないとして処理を終えるものであ
る。On the other hand, if the size of the created summary sentence is relatively large, that is, if the user
04, the summarization is requested after the area is expanded, and the value of the variable ws is changed to the reference value w in step F403.
If it is not less than sth, the control unit 11 terminates the processing of FIG. 29 without executing the reading. In this case, since the summary is sufficiently expressed only by the display, it is determined that the reading of the more detailed summary content is not necessary, and the process ends.

【０１９７】但し、図中破線で示すように、ステップＦ
４０３で変数ｗｓの値が基準値ｗｓｔｈ以上であった場
合は、制御部１１の処理がステップＦ４１０に進み、現
在の要約文のタグファイルから読み上げ用ファイルを生
成した上で、ステップＦ４０７以降に進むようにしても
よい。即ち、表示されている要約文内容がそのまま読み
上げられるようにするものである。However, as shown by the broken line in FIG.
If the value of the variable ws is equal to or greater than the reference value wsth in 403, the process of the control unit 11 proceeds to step F410, generates a reading file from the current summary sentence tag file, and proceeds to step F407 and subsequent steps. You may be able to do it. That is, the displayed summary text content can be read out as it is.

【０１９８】以上のような図２９の処理が行われること
により、要約文が作成されユーザーに対して表示される
際に、その要約文のサイズが小さい場合は、読み上げに
よりより詳しい要約内容をユーザーに提示できる。従っ
て、ユーザーは文書の内容を的確に把握できるようにな
る。また、作成され表示される要約文が或る程度十分な
内容である場合は、読み上げは行わないようにするか、
或いはその文章をそのまま読み上げるようにすることで
処理は簡易なものとなる。When the summary sentence is created and displayed to the user by performing the processing of FIG. 29 as described above, if the size of the summary sentence is small, a more detailed summary content is read out to the user. Can be presented. Therefore, the user can accurately grasp the contents of the document. If the summary sentence is created and displayed to a certain extent, do not read it out,
Alternatively, by reading the sentence as it is, the processing is simplified.

【０１９９】なお、ステップＦ４０４の「α」の値の設
定については、図２６の場合と同様に各種の例が考えら
れる。また、表示される要約文より短い要約文を作成し
て読み上げることも可能である。もちろん、図２９のよ
うな動作を開始させるための操作については、多様に考
えられる。It should be noted that various examples can be considered for setting the value of “α” in step F404, as in the case of FIG. It is also possible to create a summary sentence shorter than the displayed summary sentence and read it out. Of course, various operations for starting the operation as shown in FIG. 29 are conceivable.

【０２００】９．携帯端末装置及び文書処理システム構
成以上説明してきた文書処理装置１は、要約文の表示エリ
ア（つまり要約表示部３０４）のサイズを、ユーザーが
任意に可変できるものであるが、要約文を表示できるサ
イズが固定とされた文書処理装置でも、その要約文サイ
ズを考慮して、より詳しい要約文を作成して読み上げを
行うようにすることもできる。そのような文書処理装置
として、携帯端末装置を例に挙げて説明していく。この
携帯端末装置は、図１に携帯端末５０として示したもの
であり、ユーザーは携帯端末５０を単体で使用できるほ
か、上記文書処理装置１と接続してシステム動作を行う
ことができるものである。9. Portable Terminal Device and Document Processing System Configuration In the document processing device 1 described above, the size of the display area of the summary text (that is, the summary display unit 304) can be arbitrarily changed by the user, but the summary text can be displayed. Even in a document processing device having a fixed size, a more detailed summary sentence can be created and read out in consideration of the size of the summary sentence. A portable terminal device will be described as an example of such a document processing device. This portable terminal device is shown as a portable terminal 50 in FIG. 1, and a user can use the portable terminal 50 by itself and can perform a system operation by connecting to the document processing device 1. .

【０２０１】以下では、携帯端末装置（携帯端末５０）
の構成及び動作、さらには携帯端末５０と文書処理装置
１による文書処理システムとしての動作について説明し
ていく。まず図３０、図３１で外観例を述べる。In the following, the portable terminal device (portable terminal 50)
And the operation of the portable terminal 50 and the document processing apparatus 1 as a document processing system will be described. First, an appearance example will be described with reference to FIGS.

【０２０２】図３０は携帯端末５０の外観例を示し、こ
の携帯端末５０は例えばカード型の小型軽量の装置とさ
れる。そして携帯端末５０の上面には、例えば液晶パネ
ルを用いた表示部５１や、所要数の操作部（操作キー）
５２が設けられる。表示部５１の表示エリアのサイズ
は、携帯端末の設計事情により決められるが例えば１行
又は数行程度の文書が表示可能とされる。また操作部５
２としての操作キーの数や操作内容は、携帯端末５０に
設けられる機能に応じて設定される。ここでは例えば、
各種文書の表示を切り換えるための送り／戻し操作や、
不要な文書を消去するための消去操作などが可能とされ
るものとする。FIG. 30 shows an example of the appearance of the portable terminal 50. The portable terminal 50 is, for example, a card-type small and lightweight device. On the upper surface of the portable terminal 50, for example, a display unit 51 using a liquid crystal panel and a required number of operation units (operation keys)
52 are provided. The size of the display area of the display unit 51 is determined by the design circumstances of the mobile terminal, but, for example, one or several lines of a document can be displayed. Operation unit 5
The number and operation contents of the operation keys as 2 are set according to functions provided in the mobile terminal 50. Here, for example,
Forward / back operation for switching the display of various documents,
It is assumed that an erasing operation for erasing unnecessary documents can be performed.

【０２０３】携帯端末５０の側面等の所定の位置には、
リーダ端子５３が設けられ、図１に示した携帯端末リー
ダ５０との間で情報転送が可能とされる。またヘッドホ
ン端子６０が設けられ、ユーザーはヘッドホン７０を接
続することで、後述する読み上げ音声を聞くことができ
る。At a predetermined position such as the side surface of the portable terminal 50,
A reader terminal 53 is provided to enable information transfer with the portable terminal reader 50 shown in FIG. In addition, a headphone terminal 60 is provided, and the user can listen to a reading voice described later by connecting the headphones 70.

【０２０４】図３１は、パーソナルコンピュータ９０の
外観を示すが、これは図１の文書処理装置１をパーソナ
ルコンピュータ９０により実現する例である。この場
合、汎用のパーソナルコンピュータ９０の構成として一
般的であるように、パーソナルコンピュータ本体には、
モニタディスプレイ９１、キーボード９４、マウス９５
等が接続される。またパーソナルコンピュータ本体には
フロッピーディスクドライブ９３、ＣＤ−ＲＯＭドライ
ブ９２等が配されている。モニタディスプレイ９１は図
１の表示部３０に、キーボード９４及びマウス９５は図
１の入力部２０に、フロッピーディスクドライブ９３及
びＣＤ−ＲＯＭドライブ９２は図１の記録再生部３１
に、それぞれ相当するものとなる。また図３１には図示
していないが、図１の通信部２１、本体１０、音声出力
部３３、ＨＤＤ３４としての構成もパーソナルコンピュ
ータ本体内に設けられる。FIG. 31 shows the appearance of the personal computer 90. This is an example in which the document processing apparatus 1 of FIG. In this case, as is generally the configuration of the general-purpose personal computer 90, the personal computer main body includes:
Monitor display 91, keyboard 94, mouse 95
Etc. are connected. In addition, a floppy disk drive 93, a CD-ROM drive 92, and the like are provided in the main body of the personal computer. The monitor display 91 is provided on the display unit 30 shown in FIG. 1, the keyboard 94 and the mouse 95 are provided on the input unit 20 shown in FIG. 1, and the floppy disk drive 93 and the CD-ROM drive 92 are provided on the recording / reproducing unit 31 shown in FIG.
, Respectively. Although not shown in FIG. 31, the configurations of the communication unit 21, the main body 10, the audio output unit 33, and the HDD 34 of FIG. 1 are also provided in the personal computer main body.

【０２０５】図３１にはパーソナルコンピュータ本体に
接続される別体装置として携帯端末リーダ９６を示して
いる。このような携帯端末リーダ９６が図１の携帯端末
リーダ２２の一例となる。この携帯端末リーダ９６に対
しては図３０の携帯端末を装着することができ、その
際、リーダ端子５３が接続されることで、図１に示した
構成により、制御部１０は携帯端末５０から文書データ
を受け取ることができるようになる。FIG. 31 shows a portable terminal reader 96 as a separate device connected to the personal computer main body. Such a portable terminal reader 96 is an example of the portable terminal reader 22 of FIG. The portable terminal reader 96 can be equipped with the portable terminal shown in FIG. 30. At this time, by connecting the reader terminal 53, the control unit 10 can communicate with the portable terminal 50 by the configuration shown in FIG. Document data can be received.

【０２０６】この図３１は、本例としての文書処理シス
テムを実現するハードウエア構成の一例であるが、もち
ろん他にも多様な例が考えられる。まず文書処理装置１
としては、汎用のパーソナルコンピュータを用いずに、
図１の構成を持つ専用の文書処理装置を形成してもよ
い。さらには、図１のようなデスクトップタイプのパー
ソナルコンピュータではなく、携帯可能なノートタイプ
のパーソナルコンピュータでもよいことはもちろんであ
る。また、携帯端末リーダ９６はパーソナルコンピュー
タ本体（もしくは専用の文書処理装置としての機器）に
内蔵されてもよい。さらに携帯端末５０と図１の本体１
０のデータ転送は、携帯端末リーダ９６（２２）等を用
いた有線で行うものの他、例えば赤外線伝送等で無線で
実行できるようにすることも考えられる。FIG. 31 shows an example of a hardware configuration for realizing the document processing system according to the present embodiment. Of course, various other examples are conceivable. First, the document processing device 1
As, without using a general-purpose personal computer,
A dedicated document processing device having the configuration shown in FIG. 1 may be formed. Further, it is needless to say that a portable notebook personal computer may be used instead of the desktop personal computer as shown in FIG. Further, the portable terminal reader 96 may be built in a personal computer main body (or a device as a dedicated document processing device). Further, the portable terminal 50 and the main body 1 of FIG.
The data transfer of 0 may be performed in a wired manner using the portable terminal reader 96 (22) or the like, or may be performed wirelessly by infrared transmission or the like.

【０２０７】図３２に携帯装置５０の構成を示す。携帯
装置５０は、上記した表示部５１、操作部５２、リーダ
端子５３、ヘッドホン端子６０の他に、制御部５４、文
書メモリ５５、受信部５６、表示ドライブ５７、インタ
ーフェース５８、音声出力部５９を備える。制御部５４
はＣＰＵ、ＲＯＭ、ＲＡＭを備えたマイクロコンピュー
タとされ、各種動作処理を行なう。FIG. 32 shows the configuration of the portable device 50. The portable device 50 includes a control unit 54, a document memory 55, a receiving unit 56, a display drive 57, an interface 58, and an audio output unit 59 in addition to the display unit 51, the operation unit 52, the reader terminal 53, and the headphone terminal 60 described above. Prepare. Control unit 54
Is a microcomputer having a CPU, a ROM, and a RAM, and performs various operation processes.

【０２０８】受信部５６は無線電話回線などから文書デ
ータを受信できる部位である。受信した文書データは制
御部５４に転送され、処理される。この受信部５６は、
例えばいわゆるポケットベルと同様の受信機能を備える
ものとされる。文書メモリ５５は、制御部５４の指示に
従って受信した文書データや、制御部５４で生成された
要約文などを格納する部位である。表示ドライブ５７
は、制御部５４からの指示に基づいて、表示部５１にお
ける表示動作を実行させる。制御部５４は文書メモリ５
５に格納されている要約文を表示ドライブ５７により表
示部５１に表示させることができる。The receiving section 56 is a section that can receive document data from a wireless telephone line or the like. The received document data is transferred to the control unit 54 and processed. This receiving unit 56
For example, it has a receiving function similar to a so-called pager. The document memory 55 is a section for storing document data received according to an instruction from the control unit 54, a summary sentence generated by the control unit 54, and the like. Display drive 57
Causes the display unit 51 to execute a display operation based on an instruction from the control unit 54. The control unit 54 controls the document memory 5
5 can be displayed on the display unit 51 by the display drive 57.

【０２０９】インターフェース５８は、リーダ端子５３
及び図１の携帯端末リーダ２２、インターフェース１２
を介して文書処理装置１の制御部１１にデータを転送す
るための処理を行う部位である。転送するデータとは文
書メモリ５５に格納されている文書データである。The interface 58 is connected to the reader terminal 53
And the portable terminal reader 22 and the interface 12 of FIG.
Is a part that performs processing for transferring data to the control unit 11 of the document processing apparatus 1 via the. The data to be transferred is document data stored in the document memory 55.

【０２１０】音声出力部５９は、読み上げ音声としての
音声信号をヘッドホン端子６０から出力するための部位
である。即ち制御部５４には後述する読み上げのための
プログラムや音声合成エンジンが搭載されており、制御
部５４での音声合成処理により得られた音声信号が音声
出力部５９に供給された際に、音声出力部５９はその音
声信号をヘッドホン端子６０から出力する動作を行う。[0210] The audio output section 59 is a section for outputting an audio signal as a reading voice from the headphone terminal 60. That is, the control unit 54 is equipped with a speech reading program and a speech synthesis engine, which will be described later. When the speech signal obtained by the speech synthesis process in the control unit 54 is supplied to the speech output unit 59, the speech The output unit 59 performs an operation of outputting the audio signal from the headphone terminal 60.

【０２１１】１０．携帯端末装置による文書受信処理携帯端末５０による文書データの受信時の制御部５４の
処理を図３３で説明する。受信部５６によって所定のサ
ーバから１又は複数の文書データが受信されると、制御
部５４はステップＦ５０１として、その文書データを内
部のＲＡＭに取り込むとともに、受信した文書データ本
文を文書メモリ５５に格納させる。10. Document Receiving Process by Portable Terminal Device The process of the control unit 54 when the portable terminal 50 receives document data will be described with reference to FIG. When the receiving unit 56 receives one or a plurality of document data from a predetermined server, the control unit 54 fetches the document data into the internal RAM and stores the received document data text in the document memory 55 in step F501. Let it.

【０２１２】なお受信部５６による受信形態としては多
様な例が考えられる。例えばサーバ組織が文書データを
任意の時点で携帯端末５０に送信してくるようにするこ
とが考えられる。この場合、例えば携帯装置５０を所有
するユーザーは、予めサーバ組織に登録しておく。これ
によりサーバ組織はそのユーザーの携帯装置５０に対し
て任意の時点で発呼を行い、回線接続に応じて文書デー
タを送信するものとなる。或いは受信部５６に送信機能
も備えるようにし、ユーザーがサーバに対して回線を接
続して情報送信を要求することで、サーバから文書デー
タが送信されてくるようにしてもよい。さらには、この
携帯装置５０を携帯電話や公衆電話に接続することで、
ユーザーが文書データの送信をサーバに要求できるよう
にすることも考えられる。Various examples are conceivable as the receiving form of the receiving section 56. For example, it is conceivable that the server organization transmits document data to the portable terminal 50 at an arbitrary time. In this case, for example, the user who owns the mobile device 50 registers in advance with the server organization. As a result, the server organization makes a call to the portable device 50 of the user at an arbitrary time and transmits the document data according to the line connection. Alternatively, the receiving unit 56 may be provided with a transmission function, and the user may connect the line to the server and request information transmission, so that document data is transmitted from the server. Furthermore, by connecting the portable device 50 to a mobile phone or a public phone,
It is also conceivable to allow a user to request the server to send document data.

【０２１３】制御部５４は受信した文書データ（本文）
を取り込んだら、ステップＦ５０２で、その文書データ
の要約文を作成する。この要約文の作成処理は、上述し
た文書処理装置１と同様に図１８の処理により実行す
る。但し、携帯装置５０の場合は、表示部５１の表示エ
リアが固定されているため、要約文は、その表示部５１
の表示エリアのサイズに応じた文書長とする。つまり図
１８のステップＦ８２における表示サイズｗｓは、表示
部５１のサイズに応じた固定長とされる。The control unit 54 receives the received document data (text)
In step F502, a summary sentence of the document data is created. This summary sentence creation processing is executed by the processing of FIG. 18 similarly to the above-described document processing apparatus 1. However, in the case of the mobile device 50, since the display area of the display unit 51 is fixed, the summary is displayed on the display unit 51.
The document length is determined according to the size of the display area. That is, the display size ws in step F82 in FIG. 18 is a fixed length according to the size of the display unit 51.

【０２１４】制御部５４は、受信した１又は複数の文書
のそれぞれについて要約文を作成したら、ステップＦ５
０３で、その要約文を文書メモリ５５に記憶させる。When the control unit 54 creates a summary sentence for each of the received one or a plurality of documents, the control unit 54 proceeds to step F5.
At 03, the summary is stored in the document memory 55.

【０２１５】続いて制御部５４は、ステップＦ５０４
で、受信した１又は複数の文書のそれぞれについて、読
み上げ用の要約文を作成する。ステップＦ５０２で作成
した要約文は、表示部５１のサイズに応じた非常に短い
ものとなるが、このステップＦ５０４では、それよりも
長い所定の文書長範囲で、受信した文書について要約文
を作成する。この処理は、ステップＦ５０２と同様に図
１８の処理により実行するが、この場合、図１８のステ
ップＦ８２等における表示サイズｗｓに代えて、ｗｓ＋
αの値を用いる。ここでいうｗｓ＋αとは、図２６のス
テップＦ３０２や図２９のステップＦ４０４で説明した
ものと同様の意味であり、つまりより長い文書長の要約
文を作成するための値である。但し、携帯端末５０の場
合は、要約文の表示エリアは固定であるため、αの値も
固定値でよい。但しユーザーが固定値としての値を変更
できるようにしてもよい。Subsequently, the control unit 54 determines in step F504
Then, for each of the received one or a plurality of documents, a summary for reading out is created. The summary sentence created in step F502 is very short according to the size of the display unit 51. In this step F504, a summary sentence is created for the received document within a predetermined document length range longer than that. . This processing is executed by the processing of FIG. 18 similarly to step F502. In this case, instead of the display size ws in step F82 or the like of FIG.
Use the value of α. Here, ws + α has the same meaning as that described in step F302 in FIG. 26 and step F404 in FIG. 29, that is, a value for creating a summary sentence having a longer document length. However, in the case of the portable terminal 50, since the display area of the summary text is fixed, the value of α may be a fixed value. However, the user may be able to change the fixed value.

【０２１６】ステップＦ５０４で、読み上げ用の要約文
としてのタグファイルが生成されたら、制御部５４はス
テップＦ５０５で読み上げ用ファイルを生成するととも
に、搭載している音声合成エンジンに適したデータ形態
に変換する処理を行う。これは、図２６のステップＦ３
０４，Ｆ３０５と同様の処理である。即ち、このステッ
プＦ５０４で、制御部５４は音声合成エンジンによって
読み上げが可能な形式の読み上げ用ファイルを得ること
ができる。When a tag file as a summary for reading is generated in step F504, the control unit 54 generates a file for reading in step F505 and converts the file into a data format suitable for the installed speech synthesis engine. Perform the following processing. This corresponds to step F3 in FIG.
04, F305. That is, in step F504, the control unit 54 can obtain a reading file in a format that can be read by the speech synthesis engine.

【０２１７】受信した１又は複数の文書のそれぞれにつ
いて、読み上げ用ファイルが生成できたら、ステップＦ
５０６で制御部５４は、その読み上げ用ファイルを文書
メモリ５５に記憶させて、受信時の一連の処理を終え
る。If a reading file has been generated for each of the received one or a plurality of documents, step F
In step 506, the control unit 54 stores the read-out file in the document memory 55, and ends a series of processes at the time of reception.

【０２１８】この様な処理により、文書メモリ５５に
は、例えば図３５（ａ）に示すように各文書についての
本文と要約文と読み上げ用ファイルが記憶されることに
なる。例えば図示するようにそれぞれ文書ナンバが付さ
れた上で、文書の本文（Ａ文書、Ｂ文書・・・）と、各
本文に対応して文書の要約文（Ａ要約、Ｂ要約・・・）
及び読み上げ用ファイル（Ａ読み上げファイル、Ｂ読み
上げファイル・・・）が格納された状態となる。By such processing, the document memory 55 stores, for example, a text, a summary, and a reading file for each document as shown in FIG. For example, as shown in the figure, a document number is assigned to each of the documents, and the text of the document (A document, B document,...) And a summary of the document corresponding to each text (A summary, B summary...)
And read-out files (A read-out file, B read-out file...).

【０２１９】１１．携帯端末装置による文書表示及び読
み上げ処理携帯装置５０には上記のように受信した文書データの本
文、要約文及び読み上げ用ファイルが文書メモリ５５に
格納されることで、ユーザーは、任意の時点で、各文書
の要約文を表示部５１に表示させて、各文書の内容を確
認することができる。そしてその際に表示されている要
約文よりも詳しい内容の読み上げ音声を聞くことができ
る。またユーザーが不要と思った文書は消去させること
もできる。後述するが、ユーザーが文書内容を確認し、
より詳しく内容を知りたい（つまり要約文だけでなく本
文を読みたい）と思ったものについては、携帯装置５０
から文書処理装置１に文書データを転送することで、本
文を閲覧することが可能となるものである。(11) Document Display and Speech Processing by Portable Terminal Device By storing the text, summary sentence, and speech file of the received document data in the document memory 55 in the portable device 50 as described above, the user can By displaying the summary of each document on the display unit 51, the content of each document can be confirmed. Then, the user can hear the read-out voice of the content more detailed than the summary sentence displayed at that time. Documents that the user deems unnecessary can be deleted. As described later, the user checks the document contents,
If you want to know the content in more detail (that is, you want to read the text as well as the summary),
The document data can be transferred to the document processing apparatus 1 to view the text.

【０２２０】文書メモリ５５に格納された各文書の要約
文をユーザーに提示するための制御部５４の処理は図３
４のようになる。なお、提示のためのユーザー操作とし
ては、操作部５２の各操作キーにより、提示開始及び次
文書提示を指示する操作、前文書の提示を指示する操
作、及び消去を指示する操作が可能であるとする。The processing of the control unit 54 for presenting a summary of each document stored in the document memory 55 to the user is shown in FIG.
It looks like 4. As the user operation for presentation, an operation for instructing presentation start and presentation of the next document, an operation for instructing presentation of the previous document, and an operation for instructing erasure can be performed using the operation keys of the operation unit 52. And

【０２２１】ユーザーが要約文の提示開始を求める操作
を行うと、制御部５４の処理は図３４のステップＦ６０
１からＦ６０２に進み、まず文書ナンバを参照する変数
ｎを「１」にセットする。そして制御部５４はステップ
Ｆ６０３で、図３５のように文書メモリ５５に格納され
ている文書データのうち、文書ナンバ（ｎ）の要約文を
読み出し、表示ドライブ５７に送って、その要約文を表
示部５１に表示させる。従ってまず図３５の「Ａ要約」
が表示されることになる。さらに制御部５４はステップ
Ｆ６０４で、文書ナンバ（ｎ）の読み上げ用ファイルを
読み出し、音声合成エンジンによる読み上げ動作を開始
させる。従って音声合成処理による読み上げ音声として
の音声信号が音声出力部５９に送られ、ヘッドホン端子
６０から出力される。When the user performs an operation for requesting the start of presentation of a summary sentence, the control unit 54 executes the process of step F60 in FIG.
From 1, the process proceeds to F <b> 602, and first, a variable n referring to the document number is set to “1”. Then, in step F603, the control unit 54 reads the summary of the document number (n) from the document data stored in the document memory 55 as shown in FIG. 35, sends the summary to the display drive 57, and displays the summary. It is displayed on the unit 51. Therefore, first, “A summary” in FIG.
Will be displayed. Further, in step F604, the control unit 54 reads out the file for reading out the document number (n), and starts the reading operation by the speech synthesis engine. Therefore, an audio signal as a read-out audio by the audio synthesis processing is sent to the audio output unit 59 and output from the headphone terminal 60.

【０２２２】或る要約文の提示、すなわち表示及び読み
上げを行っている期間は、制御部５４は、ステップＦ６
０５，Ｆ６０６，Ｆ６０７でユーザーの操作を監視して
いる。なお、ステップＦ６０４で開始された読み上げ
は、読み上げ用ファイルの全文の読み上げが終了した時
点で終了されるが、まだ読み上げが行われている途中で
あっても、ステップＦ６０５，Ｆ６０６，Ｆ６０７のい
ずれかでユーザー操作が検出された場合は終了される。During the period in which a certain summary sentence is presented, that is, displayed and read aloud, the control unit 54 executes step F6.
At 05, F606, and F607, user operations are monitored. Note that the reading started in step F604 ends when the reading of all the sentences in the reading file ends, but even if reading is still being performed, one of steps F605, F606, and F607 is performed. If a user operation is detected in, the process is terminated.

【０２２３】ユーザーが次文書の提示を求める操作を行
った場合は、制御部５４の処理はステップＦ６０５から
Ｆ６０８に進み、変数ｎをインクリメントしてステップ
Ｆ６０３に戻る。このステップＦ６０３では文書ナンバ
（ｎ）の要約文、つまりこのときは制御部５４は、次の
文書ナンバとして文書メモリ５５に格納されている文書
の要約文の表示処理を行うことになる。さらにステップ
Ｆ６０４で、その表示された要約文に対応する読み上げ
用ファイルに基づく読み上げ動作が開始される。If the user has performed an operation to request the presentation of the next document, the process of the control unit 54 proceeds from step F605 to F608, increments the variable n, and returns to step F603. In step F603, the summary sentence of the document number (n), that is, the control unit 54, in this case, performs display processing of the summary sentence of the document stored in the document memory 55 as the next document number. Further, in step F604, a reading operation based on the reading file corresponding to the displayed summary is started.

【０２２４】またユーザーが前文書の提示を求める操作
を行った場合は、制御部５４の処理はステップＦ６０６
からＦ６０９に進み、変数ｎをデクリメントしてステッ
プＦ６０３に戻る。このステップＦ６０３では文書ナン
バ（ｎ）の要約文、つまりこの場合は制御部５４は、そ
れまで表示していた文書の前の文書ナンバとして、文書
メモリ５５に格納されている文書の要約文の表示処理を
行うことになる。さらにステップＦ６０４で、その表示
された要約文に対応する読み上げ用ファイルに基づく読
み上げ動作が開始される。If the user performs an operation for requesting the presentation of the previous document, the process of the control unit 54 proceeds to step F606.
The process proceeds to F609, decrements the variable n, and returns to step F603. In this step F603, the summary sentence of the document number (n), that is, in this case, the control unit 54 displays the summary sentence of the document stored in the document memory 55 as the document number preceding the displayed document. Processing will be performed. Further, in step F604, a reading operation based on the reading file corresponding to the displayed summary is started.

【０２２５】この様な、次文書、前文書を求める操作に
応じた処理により、ユーザーは、受信され格納されてい
る各文書の要約文を、それぞれ任意に、表示及び読み上
げ音声で確認していくことができる。By the processing corresponding to the operation for obtaining the next document and the previous document, the user can arbitrarily confirm the summary sentence of each received and stored document by display and reading voice. be able to.

【０２２６】なお、図３４の処理フローとしては省略し
たが、ｎ＝１の状態、つまり文書ナンバ１の要約文が表
示されている状態で、ユーザーが前文書を求める操作を
行った場合は、その操作を無効とすればよい。同様に、
文書ナンバとして最後のナンバの文書の要約文が表示さ
れている状態で、ユーザーが次文書を求める操作を行っ
た場合は、その操作を無効とすればよい。但し、文書ナ
ンバ１の要約文が表示されている状態でユーザーが前文
書を求める操作を行った場合は、最後の文書ナンバにつ
いての要約文表示及び読み上げを行うようにしたり、最
後の文書の要約文が表示されている状態で、ユーザーが
次文書を求める操作を行った場合は、最初の文書ナンバ
についての要約文表示及び読み上げを行うようにしても
よい。Although omitted in the processing flow of FIG. 34, if the user performs an operation for obtaining the previous document in a state where n = 1, that is, a state in which the summary sentence of document number 1 is displayed, What is necessary is just to invalidate the operation. Similarly,
When the user performs an operation for obtaining the next document in a state where the summary of the last document is displayed as the document number, the operation may be invalidated. However, if the user performs an operation to obtain the previous document while the summary of document number 1 is displayed, the summary of the last document number is displayed and read out, or the summary of the last document is displayed. When the user performs an operation for obtaining the next document while the sentence is displayed, the summary sentence display and the reading of the first document number may be performed.

【０２２７】また本例では、或る要約文が表示されてい
る際にユーザーがその要約文を読んで（又は読み上げ音
声を聞いて）、その文書データが不要と考えた場合は、
消去操作を行えばよい。消去操作が行われると、制御部
５４の処理はステップＦ６０７からＦ６１０に進み、そ
の文書データの本文と要約文を文書メモリ５５上で消去
させる。また消去に応じて、文書ナンバを更新する。例
えば図３５（ａ）におけるＢ要約が表示されている際
に、ユーザーが消去操作を行ったとすると、制御部５４
はステップＦ６１０として、文書メモリ５５からＢ文書
とＢ要約とＢ読上ファイルのデータを削除し、さらに後
続の文書ナンバを繰り上げることで、例えば図３５
（ｂ）のような状態とする。つまり文書ナンバ「２」の
Ｂ文書としての本文、要約文、読上ファイルが削除され
ることに応じて、Ｃ文書、Ｄ文書をそれぞれ文書ナンバ
「２」「３」に繰り上げる。そして制御部５４の処理は
ステップＦ６０３、Ｆ６０４に戻るため（変数ｎはその
まま）、Ｂ文書の削除に伴って次のＣ文書のＣ要約が表
示され、かつＣ読み上げファイルによる読み上げが開始
される状態となる。In this example, when a user reads a summary (or listens to a reading voice) while a certain summary is being displayed and considers that the document data is unnecessary,
An erasing operation may be performed. When the erasing operation is performed, the process of the control unit 54 proceeds from step F607 to F610, and erases the text and the summary of the document data on the document memory 55. The document number is updated in accordance with the deletion. For example, if the user performs an erasing operation when the B summary in FIG. 35A is displayed, the control unit 54
In step F610, the data of the B document, the B summary, and the B reading file are deleted from the document memory 55, and the subsequent document number is incremented.
The state is as shown in FIG. That is, the C document and the D document are moved up to the document numbers “2” and “3”, respectively, in response to the deletion of the text, the summary sentence, and the reading file as the B document of the document number “2”. Then, since the processing of the control unit 54 returns to steps F603 and F604 (the variable n is kept as it is), the C summary of the next C document is displayed with the deletion of the B document, and the reading by the C reading file is started. Becomes

【０２２８】以上のような図３４の処理により、ユーザ
ーは受信した文書を、その要約文により内容を確認して
いくことができる。特に内容の確認は表示部５１での要
約文表示とともに、読み上げ音声として、表示内容より
も詳しい要約文が読み上げられる。従って、たとえ本例
のように表示部５１が非常に小さい携帯端末５０であっ
て、わずかな文書長の要約しか表示できないものであっ
ても、ユーザーは読み上げ音声を聞くことで、正確に文
書内容を把握できるものとなる。そしてユーザーは、文
書内容を把握した上で不要と判断した文書については容
易に消去していくことができる。また、必要な（消去し
ていない）文書については、その本文は、後にユーザー
がこの携帯端末５０を図３１のようなシステムにより文
書処理装置１に接続することで、内容を見聞きすること
ができる。By the processing of FIG. 34 as described above, the user can confirm the content of the received document by its summary. In particular, when confirming the content, a summary sentence that is more detailed than the displayed content is read aloud as a reading voice along with the display of the summary sentence on the display unit 51. Therefore, even if the display unit 51 is a very small portable terminal 50 as in this example and can display only a short summary of the document length, the user can accurately hear the text content by listening to the reading voice. Can be grasped. Then, the user can easily delete the document determined to be unnecessary after grasping the content of the document. In addition, for the necessary (not erased) document, the user can view and listen to the contents by connecting the portable terminal 50 to the document processing apparatus 1 using a system as shown in FIG. 31 later. .

【０２２９】ところで、この図３４、図３５で説明した
処理では、ユーザーは要約を見聞きして不要と判断した
文書データについては、文書メモリ５５から消去させる
ことができるとしたが、消去ではなく、文書処理装置１
に転送するか否かを選択するというような処理方式も考
えられる。By the way, in the processing described with reference to FIGS. 34 and 35, it is assumed that the user can delete from the document memory 55 the document data determined to be unnecessary by viewing and listening to the summary. Document processing device 1
A processing method of selecting whether or not to transfer the data to the destination is also conceivable.

【０２３０】例えば図３３の受信時の処理として制御部
５４は、受信した文書の本文と要約文と読み上げ用ファ
イルを文書メモリ５５に記憶させる際に、図３６（ａ）
のように転送フラグ＝０と設定して記憶させるようにす
る。そして図３４のような表示及び読み上げ処理の際に
は、ユーザー操作としては、操作部５２の各操作キーに
より、提示開始及び次文書提示を指示する操作、前文書
の提示を指示する操作、及び転送文書として指定する操
作が可能であるとする。For example, as a process at the time of reception shown in FIG. 33, the control unit 54 stores the text of the received document, the summary sentence, and the reading file in the document memory 55 as shown in FIG.
As described above, the transfer flag is set to 0 and stored. At the time of display and reading-out processing as shown in FIG. 34, the user operation includes an operation of instructing presentation start and presentation of the next document, an operation of instructing presentation of the previous document, and It is assumed that an operation to specify a transfer document is possible.

【０２３１】各文書の要約文の表示及び読み上げについ
ては図３４と同様とするが、ユーザーは各要約を確認し
ていった際に、より詳しく内容を知りたい文書（要約
文）を見つけたら、その要約文が表示されている状態
で、転送文書として指定する操作を行うようにする。そ
れに応じて制御部５４は、文書メモリ５５におけるその
文書の転送フラグを「１」にセットする。例えば図３６
（ｂ）は、ユーザーがＢ文書及びＣ文書について、転送
文書として指定した場合の例である。このようにしてユ
ーザーにとって重要な文書を指定できるようにしつつ、
重要でないとされた文書をその時点では消去しないで残
しておく処理方式も考えられる。The display and reading of the summary sentence of each document are the same as in FIG. 34. However, when the user checks each summary and finds a document (summary sentence) whose contents are to be understood in more detail, While the summary is being displayed, an operation of designating the document as a transfer document is performed. In response, the control unit 54 sets the transfer flag of the document in the document memory 55 to “1”. For example, FIG.
(B) is an example in which the user has designated the B document and the C document as transfer documents. In this way, you can specify documents that are important to your users,
There is also a processing method in which a document determined to be insignificant is left without being deleted at that time.

【０２３２】なお、ここではユーザーは重要な文書を選
択する方式としたが、逆に初期状態では転送フラグは全
て「１」としておき、ユーザーが不要指定操作を行うこ
とで、その文書については転送フラグを「０」にするよ
うな処理としてもよい。Here, the method in which the user selects an important document has been adopted. On the contrary, in the initial state, the transfer flags are all set to “1”, and the user performs an unnecessary designation operation to transfer the document. The processing may be such that the flag is set to “0”.

【０２３３】さらには、図３５、図３６で説明した方式
を組み合わせることも考えられる。例えばユーザーは消
去操作と転送指定操作が可能とされるようにし、各要約
文を見聞きしながら消去操作、転送指定操作、或いは操
作しないという３つの状態を選択できるようにする。そ
して消去操作があった場合は図３５の例のように消去を
行い、また転送指定操作があった場合は、図３６のよう
に転送フラグをたてるようにする。これらの操作が行わ
れなかったものは、保留として転送フラグは「０」のま
まであるが、消去もされない状態とするものである。も
ちろんこれ以外にも、表示／読み上げ方式や選択操作方
式は各種考えられる。Further, it is conceivable to combine the methods described with reference to FIGS. 35 and 36. For example, the user can perform an erasing operation and a transfer designation operation, and can select three states of an erasure operation, a transfer designation operation, or no operation while looking at each summary sentence. When an erasing operation is performed, erasing is performed as in the example of FIG. 35, and when a transfer specifying operation is performed, a transfer flag is set as illustrated in FIG. If these operations have not been performed, the transfer flag is set to “0” as a hold, but is not erased. Of course, other than this, various display / speech methods and selection operation methods are conceivable.

【０２３４】１２．文書処理システム動作携帯装置５０における文書データの受信、表示及び読み
上げ、さらには重要な文書の選択は以上のように行われ
るが、ユーザーは重要な文書については、文書処理装置
１を利用して本文を読むことができる。図３１に示した
ように携帯端末５０を携帯端末リーダ９６に装着するこ
とで、文書処理装置１の制御部１１は、携帯端末５０の
文書メモリ５５に記憶されている文書データを読み込む
ことができる。この場合は、文書処理装置１の制御部１
１と、携帯端末５０の制御部５４が相互に通信を行っ
て、文書メモリ５５の文書データを転送していくことに
なるが、例えば制御部１１が直接文書メモリ５５にアク
セスできるような構成をとることも考えられる。12. Operation of the Document Processing System The reception, display and reading of the document data and selection of important documents in the portable device 50 are performed as described above. Can read. By attaching the portable terminal 50 to the portable terminal reader 96 as shown in FIG. 31, the control unit 11 of the document processing apparatus 1 can read the document data stored in the document memory 55 of the portable terminal 50. . In this case, the control unit 1 of the document processing apparatus 1
1 and the control unit 54 of the portable terminal 50 communicate with each other to transfer the document data in the document memory 55. For example, a configuration in which the control unit 11 can directly access the document memory 55 is adopted. It is also possible to take.

【０２３５】転送される文書データは、図３５の例の場
合は、図３５（ｂ）のような状態で消去されていない全
文書についての本文データとなる。また図３６の例の場
合は、制御部５４は、図３６（ｂ）のように転送フラグ
＝１となっている文書を抽出して、制御部１１側に転送
していくことになる。なお、転送するデータは、本文の
みでもよいし、本文と要約文の両方であってもよい。も
ちろん読み上げ用ファイルを転送することも考えられ
る。In the case of the example shown in FIG. 35, the transferred document data is the text data of all the documents which have not been deleted in the state as shown in FIG. 35 (b). In the case of the example of FIG. 36, the control unit 54 extracts a document having the transfer flag = 1 as shown in FIG. 36B and transfers the document to the control unit 11 side. The data to be transferred may be only the text or both the text and the summary. Of course, it is also possible to transfer a reading file.

【０２３６】このようにして１又は複数の文書が転送さ
れるわけであるが、文書処理装置１では、転送されてき
た文書（本文）に対して、上述したような各種処理が可
能となる。即ち制御部１１は、転送されてきた文書デー
タに対して、分類モデルへの分類処理、ウインドウサイ
ズに応じた要約作成処理、本文又は要約文の読み上げ処
理、表示される要約サイズに応じた読み上げ処理などが
実行可能とされる。これによりユーザーは、携帯端末５
０で受信し、また選択した文書について、詳細な本文内
容を閲覧したり、読み上げ音声を聞いたり、また適正に
分類されていることで後の時点で容易に参照できたりす
るようになる。もちろん文書処理装置１（パーソナルコ
ンピュータ９０）のさらなる多様な機能を用いて、文書
データを有効利用できることになる。これらのことから
本例の文書処理システムでは、携帯端末５０により文書
データの入手、内容確認、選択が常時手軽に実行でき、
さらに必要な文書については文書処理装置で高度かつ多
様に処理できることになるため、ユーザーにとって非常
に有用なシステムとなる。As described above, one or a plurality of documents are transferred. In the document processing apparatus 1, the above-described various processes can be performed on the transferred document (text). That is, the control unit 11 performs classification processing on the transferred document data into a classification model, summary creation processing according to the window size, text-to-text or summary sentence reading-out processing, and text-to-speech reading processing according to the displayed summary size. Etc. are made executable. This allows the user to use the mobile terminal 5
In addition, the user can browse the detailed text contents, hear the read-aloud voice, and easily refer to the selected document at a later point in time by being properly classified. Of course, the document data can be effectively used by using various further functions of the document processing apparatus 1 (personal computer 90). From these facts, in the document processing system of the present example, the acquisition, the content confirmation, and the selection of the document data can be easily and always executed by the portable terminal 50.
Further, a required document can be processed in a high-level and various manners by a document processing apparatus, so that the system is very useful for a user.

【０２３７】以上本発明の実施の形態について説明して
きたが、これはあくまで一例であり、文書処理装置１
や、文書処理装置としての携帯端末５０の構成及び処理
例は多様に考えられる。例えば携帯端末５０で要約文だ
けでなく本文も表示又は読上するようにしてもよい。さ
らに、携帯端末５０の形状はカード型に限られるもので
はない。また、要約文の表示エリアが可変とされる文書
処理装置１と、要約文の表示エリアが固定とされる文書
処理装置としての携帯端末５０を例に挙げたが、もちろ
ん表示エリアサイズが可変とされるか固定とされるか
は、据置型の機器／携帯型の機器などの別に対応するも
のではなく、例えば据置型のパーソナルコンピュータで
実現される文書処理装置であっても、要約文の表示エリ
アが固定とされる場合もある。その場合は、上記携帯端
末５０で説明したような動作が適用できる。もちろんそ
の逆もあり得る。The embodiment of the present invention has been described above, but this is merely an example, and the document processing apparatus 1
In addition, various configurations and processing examples of the portable terminal 50 as the document processing device can be considered. For example, the portable terminal 50 may display or read not only the abstract but also the text. Further, the shape of the portable terminal 50 is not limited to the card type. Further, the document processing apparatus 1 in which the display area of the summary is variable and the portable terminal 50 as the document processing apparatus in which the display area of the summary is fixed have been described as an example. Whether it is fixed or fixed does not correspond to a stationary device / portable device or the like. For example, even if the document processing device is implemented by a stationary personal computer, the display of a summary sentence The area may be fixed. In that case, the operation described for the portable terminal 50 can be applied. Of course, the reverse is also possible.

【０２３８】また文書処理装置１や携帯端末５０を構成
する具体的なデバイス例は多様であり、例えば文書処理
装置１における入力部２０を例に挙げれば、キーボード
やマウスだけでなく、タブレット、ライトペン、赤外線
等を利用した無線コマンダ装置等の他のデバイスが考え
られる。There are various specific examples of devices that constitute the document processing apparatus 1 and the portable terminal 50. For example, when the input unit 20 in the document processing apparatus 1 is an example, not only a keyboard and a mouse but also a tablet and a light Other devices, such as a wireless commander device using a pen, infrared light, etc., are conceivable.

【０２３９】また実施の形態においては、通信部２２に
外部から電話回線等を介して文書が送信されるものとし
て説明したが、本発明は、これに限定されるものではな
い。例えば、衛星等を介して文書が送信される場合にも
適用できる他、記録／再生部３１において記録媒体３２
から読み出されたり、ＲＯＭ１５に予め文書が書き込ま
れていてもよい。Further, in the embodiment, the description has been made assuming that the document is transmitted from the outside to the communication unit 22 via a telephone line or the like, but the present invention is not limited to this. For example, the present invention can be applied to a case where a document is transmitted via a satellite or the like.
Or a document may be written in the ROM 15 in advance.

【０２４０】また実施の形態において、文書へのタグ付
けの方法の一例を示したが、本発明がこのタグ付けの方
法に限定されないことはもちろんである。さらに、上述
の実施の形態においては、日本語および英語の文章を例
示したが、本発明がこれらの言語に限られないことはい
うまでもない。このように、本発明は、その趣旨を逸脱
しない範囲で適宜変更が可能であることはいうまでもな
い。Further, in the embodiment, an example of a method of tagging a document has been described, but it is needless to say that the present invention is not limited to this tagging method. Furthermore, in the above-described embodiment, Japanese and English sentences have been exemplified, but it goes without saying that the present invention is not limited to these languages. As described above, it goes without saying that the present invention can be appropriately changed without departing from the spirit of the present invention.

【０２４１】さらにまた、本発明においては、記録媒体
３２として、上述した電子文書処理プログラムが書き込
まれたディスク状記録媒体やテープ状記録媒体等を提供
することが容易に実現できる。即ち本発明の記録媒体が
実現できる。なお、図１に示したＨＤＤ３４としても同
様に本発明の記録媒体とすることができる。そしてその
ような記録媒体３２によれば、上記してきた文書処理方
法を実現するプログラムを提供できることになり、例え
ば汎用のパーソナルコンピュータ等を用いて、本発明の
文書処理装置を容易に実現できる。もちろん記録媒体３
２としては、フロッピーディスクの他に、光ディスク、
光磁気ディスク、磁気テープ、フラッシュメモリ等によ
るメモリカード、メモリチップ等としてもよい。さらに
本発明の文書処理方法を実現するプログラムは、例えば
インターネット等のネットワーク通信を介しても提供す
ることができるものであり、従って、プログラムサーバ
側もしくは通信過程における記録媒体としても本発明は
適用できるものである。Further, in the present invention, it is possible to easily realize, as the recording medium 32, a disk-shaped recording medium or a tape-shaped recording medium in which the above-described electronic document processing program is written. That is, the recording medium of the present invention can be realized. The recording medium of the present invention can be similarly used as the HDD 34 shown in FIG. According to such a recording medium 32, a program for realizing the above-described document processing method can be provided, and the document processing apparatus of the present invention can be easily realized using, for example, a general-purpose personal computer. Of course, recording medium 3
2) In addition to floppy disks, optical disks,
It may be a memory card, a memory chip, or the like using a magneto-optical disk, magnetic tape, flash memory, or the like. Furthermore, the program for realizing the document processing method of the present invention can be provided also via a network communication such as the Internet. Therefore, the present invention can be applied to a program server or a recording medium in a communication process. Things.

【０２４２】[0242]

【発明の効果】以上の説明からわかるように本発明で
は、次のような効果が得られる。即ち本発明の文書処理
装置及び文書処理方法によれば、電子文書の要約文を表
示と音声出力によりユーザーに提示できるようにすると
ともに、読み上げ音声の元となる要約文は、表示される
要約文と文書長が同じもしくはそれ以上とされるように
している。このため、要約文表示エリアのサイズが小さ
く、要約文として十分な文章を表示できない場合に、読
み上げ音声としてユーザーの内容理解に十分な文書長の
文書が出力されるようにすることができ、要約文表示エ
リアの状況に関わらず、ユーザーは要約文内容（電子文
書内容）を明確に知ることができるという効果がある。As will be understood from the above description, the present invention has the following effects. That is, according to the document processing apparatus and the document processing method of the present invention, a digest sentence of an electronic document can be presented to a user by display and voice output, and the digest sentence of the read-out voice is displayed as a digest sentence. And the document length is the same or more. For this reason, when the size of the summary sentence display area is small and a sufficient sentence cannot be displayed as a summary sentence, it is possible to output a document having a document length sufficient for the user to understand the contents as a reading voice, Regardless of the state of the sentence display area, there is an effect that the user can clearly know the contents of the summary sentence (electronic document contents).

【０２４３】特に、表示手段における要約文表示エリア
のサイズは可変とされる場合は、読み上げ要約作成手段
は、要約文表示エリアのサイズに応じて文書長を設定し
て、読み上げ用要約文データを作成することで、要約文
表示エリアのサイズ状況に応じて最適な読み上げ出力を
行うことができる。例えばユーザーが内容把握に十分な
程度の文書長で表示できる場合は、読み上げ用要約文デ
ータのサイズは表示される要約文データと同等とするこ
とで十分であり、かつ頼長異様や区分を作成しなくても
よいため処理も簡単になる。一方、要約文表示エリアが
小さく、表示される要約文データの文書長が短くなっ
て、内容理解に不十分と考えられる場合は、より長い文
書長で読み上げ用要約文データを作成することで、ユー
ザーに対して文書内容を的確に提示できる。In particular, when the size of the summary sentence display area in the display means is variable, the reading summary creation means sets the document length according to the size of the summary sentence display area, and reads out the reading summary sentence data. By creating, it is possible to perform the optimal reading-aloud output according to the size of the summary display area. For example, if the user can display the document with a document length that is sufficient for comprehending the content, it is sufficient to make the size of the summary text for reading aloud the same as the displayed summary text data, and create a question and a section. Since it is not necessary to perform the processing, the processing is simplified. On the other hand, if the summary sentence display area is small and the document length of the displayed summary sentence data is short, and it is considered that the content is insufficient for comprehension of the content, by creating the summary sentence data for reading with a longer document length, Document contents can be accurately presented to the user.

【０２４４】また装置によっては表示手段における要約
文表示エリアのサイズが固定サイズとされる場合がある
が、そのサイズが小さく要約文データが短いものとなら
ざるを得ない場合は、読み上げ要約作成手段は、要約作
成手段で作成される要約文データの文書長を越える文書
長となる読み上げ用要約文データを作成し、それが音声
として出力されるようにすることで、表示手段のサイズ
的な制約に関わらず、常に文書内容をユーザーに的確に
提示できる。これは特に小型或いは携帯型の文書処理装
置などにおいて好適である。Depending on the device, the size of the summary display area in the display means may be fixed, but if the size is small and the summary data must be short, the reading summary creation means may be used. Creates summary text data for reading that has a document length that exceeds the document length of the summary text data created by the summary creation means, and outputs it as audio, thereby limiting the size of the display means. Regardless, the contents of the document can always be accurately presented to the user. This is particularly suitable for a small or portable document processing device.

【０２４５】さらに本発明の記録媒体によれば、本発明
の文書処理方法を実現するプログラムを提供できること
になり、例えば汎用のパーソナルコンピュータ等を用い
て、本発明の文書処理装置を容易に実現できるようにな
る。これにより、一般ユーザーは容易に上記効果を享受
できるものとなる。Further, according to the recording medium of the present invention, a program for realizing the document processing method of the present invention can be provided. For example, the document processing apparatus of the present invention can be easily realized using a general-purpose personal computer or the like. Become like Thus, the general user can easily enjoy the above effect.

[Brief description of the drawings]

【図１】本発明の実施の形態の文書処理装置のブロック
図である。FIG. 1 is a block diagram of a document processing apparatus according to an embodiment of the present invention.

【図２】実施の形態で用いる文書構造の説明図である。FIG. 2 is an explanatory diagram of a document structure used in the embodiment.

【図３】実施の形態の文章構造を表示するウインドウの
説明図である。FIG. 3 is an explanatory diagram of a window for displaying a sentence structure according to the embodiment;

【図４】実施の形態の手動分類処理のフローチャートで
ある。FIG. 4 is a flowchart of a manual classification process according to the embodiment.

【図５】実施の形態のインデックス作成処理のフローチ
ャートである。FIG. 5 is a flowchart of an index creation process according to the embodiment.

【図６】実施の形態のエレメントの活性値の説明図であ
る。FIG. 6 is an explanatory diagram of an activation value of an element according to the embodiment.

【図７】実施の形態の活性拡散処理のフローチャートで
ある。FIG. 7 is a flowchart of an active diffusion process according to the embodiment.

【図８】実施の形態の中心活性値更新処理のフローチャ
ートである。FIG. 8 is a flowchart of a central activation value update process according to the embodiment.

【図９】実施の形態の分類ウインドウの説明図である。FIG. 9 is an explanatory diagram of a classification window according to the embodiment.

【図１０】実施の形態の閲覧ウインドウの説明図であ
る。FIG. 10 is an explanatory diagram of a browsing window according to the embodiment.

【図１１】実施の形態の分類モデルの説明図である。FIG. 11 is an explanatory diagram of a classification model according to the embodiment.

【図１２】実施の形態の自動分類処理のフローチャート
である。FIG. 12 is a flowchart of an automatic classification process according to the embodiment.

【図１３】実施の形態の自動分類のフローチャートであ
る。FIG. 13 is a flowchart of automatic classification according to the embodiment.

【図１４】実施の形態の語義間関連度算出処理のフロー
チャートである。FIG. 14 is a flowchart of a word meaning association degree calculation process according to the embodiment;

【図１５】実施の形態の語義間関連度の説明図である。FIG. 15 is an explanatory diagram of a degree of association between meanings according to the embodiment.

【図１６】実施の形態の閲覧ウインドウの表示例の説明
図である。FIG. 16 is an explanatory diagram of a display example of a browsing window according to the embodiment.

【図１７】実施の形態の閲覧ウインドウの要約文を含む
表示例の説明図である。FIG. 17 is an explanatory diagram of a display example including a summary of a browsing window according to the embodiment;

【図１８】実施の形態の要約作成処理のフローチャート
である。FIG. 18 is a flowchart of a summary creation process according to the embodiment.

【図１９】実施の形態の文書読み上げ処理のフローチャ
ートである。FIG. 19 is a flowchart of a document reading process according to the embodiment.

【図２０】実施の形態の読み上げ用ファイル生成処理の
フローチャートである。FIG. 20 is a flowchart of a reading file generation process according to the embodiment;

【図２１】実施の形態のタグファイル例の説明図であ
る。FIG. 21 is an explanatory diagram of an example of a tag file according to the embodiment;

【図２２】実施の形態のタグファイル例の説明図であ
る。FIG. 22 is an explanatory diagram of an example of a tag file according to the embodiment;

【図２３】実施の形態の読み上げ用ファイル例の説明図
である。FIG. 23 is an explanatory diagram of an example of a reading file according to the embodiment;

【図２４】実施の形態の読み上げ用ファイル例の説明図
である。FIG. 24 is an explanatory diagram of an example of a reading file according to the embodiment;

【図２５】実施の形態の読み上げウインドウの説明図で
ある。FIG. 25 is an explanatory diagram of a reading window according to the embodiment.

【図２６】実施の形態の表示される要約サイズに応じた
読み上げ処理のフローチャートである。FIG. 26 is a flowchart of a reading process according to the displayed summary size according to the embodiment;

【図２７】実施の形態の閲覧ウインドウの表示例の説明
図である。FIG. 27 is an explanatory diagram of a display example of a browsing window according to the embodiment.

【図２８】実施の形態の表示される要約サイズに応じた
読み上げ処理時の閲覧ウインドウの表示例の説明図であ
る。FIG. 28 is an explanatory diagram of a display example of a browsing window during a reading process according to a displayed summary size according to the embodiment;

【図２９】実施の形態の要約生成及びそのサイズに応じ
た読み上げ処理のフローチャートである。FIG. 29 is a flowchart of a summary generation process and a reading process according to the size according to the embodiment.

【図３０】実施の形態の携帯端末装置の外観の説明図で
ある。FIG. 30 is an explanatory diagram of an appearance of the portable terminal device according to the embodiment;

【図３１】実施の形態の文書処理システムの説明図であ
る。FIG. 31 is an explanatory diagram of a document processing system according to an embodiment.

【図３２】実施の形態の携帯端末装置のブロック図であ
る。FIG. 32 is a block diagram of a portable terminal device according to an embodiment.

【図３３】実施の形態の携帯端末装置の文書受信処理の
フローチャートである。FIG. 33 is a flowchart of a document receiving process of the portable terminal device of the embodiment.

【図３４】実施の形態の携帯端末装置の表示及び読み上
げ処理のフローチャートである。FIG. 34 is a flowchart of a display and reading process of the portable terminal device of the embodiment.

【図３５】実施の形態の携帯端末装置の記憶形態の説明
図である。FIG. 35 is an explanatory diagram of a storage form of the portable terminal device of the embodiment.

【図３６】実施の形態の携帯端末装置の記憶形態の説明
図である。FIG. 36 is an explanatory diagram of a storage form of the portable terminal device of the embodiment.

【符号の説明】１文書処理装置、１０本体、１１制御部、１２
インターフェース、１３ＣＰＵ、１４ＲＡＭ、１５
ＲＯＭ、２０入力部、２１通信部、２２携帯端末
リーダ、３０表示部、３１記録再生部、３２記録
媒体、３３音声出力部、３４ＨＤＤ、５０携帯端
末、５１表示部、５２操作部、５３リーダ端子、
５４制御部、５５文書メモリ、５６受信部、５７
表示ドライブ、５９音声出力部、６０ヘッドホン
端子、１０１ウインドウ、２０１分類ウインドウ、
３０１閲覧ウインドウ、３０３文書表示部、３０４
要約表示部、４０１読み上げウインドウ[Description of Signs] 1 Document processing apparatus, 10 main body, 11 control unit, 12
Interface, 13 CPU, 14 RAM, 15
ROM, 20 input unit, 21 communication unit, 22 portable terminal reader, 30 display unit, 31 recording / reproducing unit, 32 recording medium, 33 audio output unit, 34 HDD, 50 portable terminal, 51 display unit, 52 operation unit, 53 reader Terminal,
54 control unit, 55 document memory, 56 receiving unit, 57
Display drive, 59 audio output unit, 60 headphone terminal, 101 window, 201 classification window,
301 browsing window, 303 document display unit, 304
Summary display section, 401 reading-out window

─────────────────────────────────────────────────────
────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成１１年７月７日（１９９９．７．７）[Submission date] July 7, 1999 (July 7, 1999)

【手続補正１】[Procedure amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】０１３２[Correction target item name]

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【０１３２】なお、このため初めてステップＦ８４で文
字列Ｓ（ｉ）を生成した時点（ｉ＝１の時点）で、ステ
ップＦ８５で、文字列Ｓ（ｉ）が最大文字数ｗｓを越え
た場合は、文字列Ｓ（ｉ−１）は、ステップＦ８２で設
定した初期値としての文字列Ｓ（０）となるため、実質
的に要約文は作成できなかったことになる。これは、要
約表示部３０４のサイズが小さすぎたことに起因するた
め、ユーザーは画面上で要約表示部３０４の面積を広げ
た上で、再度、要約作成ボタン３０６ａをクリックし
て、図１８の処理が開始されるようにすればよい。When the character string S (i) exceeds the maximum number of characters ws in step F85 when the character string S (i) is first generated in step F84 (i = 1) at step F85, Since the character string S (i-1) becomes the character string S (0) as the initial value set in step F82, a summary sentence could not be created substantially. This is because the size of the summary display unit 304 is too small. Therefore, the user enlarges the area of the summary display unit 304 on the screen, and then clicks the summary creation button 306a again to return to FIG. What is necessary is just to make a process start.

【手続補正２】[Procedure amendment 2]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】０１３３[Correction target item name]

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【０１３３】ステップＦ８５で文字列Ｓ（ｉ）が最大文
字数ｗｓを越えていない場合は、上述のように制御部１
１の処理はステップＦ８６に進み、次に文字列に加える
部分を判断することになる。そして上記のようにステッ
プＦ８９に進んだ場合は、文字列ＳＳ（ｉ）が最大文字
数ｗｓを越えたか否かを判別する。ここで文字列ＳＳ
（ｉ）が最大文字数ｗｓを越えた場合は、制御部１１の
処理はステップＦ９１に進み、直前のステップＦ８８で
或るエレメントを加える前の文字列Ｓ（ｉ）を、要約文
とすることになる。つまり、これはステップＦ８８でエ
レメントを加えたことにより、最大文字数ｗｓを越えて
しまったことになるため、そのエレメントを加える前の
文字列Ｓ（ｉ）が、最大文字数ｗｓに近いが最大文字数
ｗｓを越えない文字列であると判断して、それを要約文
とするものである。If it is determined in step F85 that the character string S (i) does not exceed the maximum number of characters ws, as described above, the control unit 1
The process of step 1 proceeds to step F86, where a portion to be added to the next character string is determined. When the process proceeds to step F89 as described above, it is determined whether or not the character string SS (i) has exceeded the maximum number of characters ws. Where the string SS
If (i) exceeds the maximum number of characters ws, the process of the control unit 11 proceeds to step F91, and the character string S (i) before adding a certain element in step F88 immediately before is used as a summary sentence. Become. In other words, this means that the maximum number of characters ws has been exceeded by adding the element in step F88. Therefore, the character string S (i) before the addition of the element is close to the maximum number of characters ws but the maximum number of characters ws Is determined to be a character string that does not exceed, and is used as a summary sentence.

【手続補正３】[Procedure amendment 3]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】０２４３[Correction target item name] 0243

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【０２４３】特に、表示手段における要約文表示エリア
のサイズが可変とされる場合は、読み上げ要約作成手段
は、要約文表示エリアのサイズに応じて文書長を設定し
て、読み上げ用要約文データを作成することで、要約文
表示エリアのサイズ状況に応じて最適な読み上げ出力を
行うことができる。例えばユーザーが内容把握に十分な
程度の文書長で表示できる場合は、読み上げ用要約文デ
ータのサイズは表示される要約文データと同等とするこ
とで十分であり、かつより長い要約文を作成しなくても
よいため処理も簡単になる。一方、要約文表示エリアが
小さく、表示される要約文データの文書長が短くなっ
て、内容理解に不十分と考えられる場合は、より長い文
書長で読み上げ用要約文データを作成することで、ユー
ザーに対して文書内容を的確に提示できる。In particular, when the size of the summary sentence display area in the display means is variable, the reading summary creation means sets the document length in accordance with the size of the summary sentence display area, and reads out the reading summary sentence data. by creating, it is possible to perform an optimum speech output depending on the size condition of summary display area. For example, if the user can display the document with a document length long enough to grasp the contents, it is sufficient to make the size of the summary text for reading aloud the same as the displayed summary text data, and create a longer summary text. Since there is no need to do so, the processing is simplified. On the other hand, if the summary sentence display area is small and the document length of the displayed summary sentence data is short, and it is considered that the content is insufficient for comprehension of the content, by creating the summary sentence data for reading with a longer document length, Document contents can be accurately presented to the user.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 13/00 Ｇ１０Ｌ 3/00 Ｅ 21/06 Ｓ ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 13/00 G10L 3/00 E 21/06 S

Claims

[Claims]

1. A summary creating means for creating summary sentence data of an electronic document in accordance with the size of an abstract sentence display area, and a reading-out device having a document length longer than the document length of the summary sentence data created by the summary creating means. A document processing apparatus, comprising: a read-out summary creating unit capable of creating summary sentence data.

2. A display means having the summary sentence display area, a voice output means, and a display of the summary sentence data created by the summary creation means is executed in the summary sentence display area of the display means. Control means capable of causing the voice output means to output a read-out voice based on the read-out summary sentence data created by the read-out summary creation means. 2. The document processing device according to 1.

3. The summary text display area is variable in size, and the reading summary generation means sets a document length in accordance with the size of the summary text display area to generate reading summary text data. The document processing apparatus according to claim 1, wherein:

4. The summary summary display area has a fixed size, and the reading summary creation unit includes a reading summary that has a document length exceeding the document length of the summary sentence data created by the summary creation unit. 2. The document processing apparatus according to claim 1, wherein the document data is created.

5. A summary creating procedure for creating summary sentence data corresponding to the size of a summary sentence display area of a display unit for an electronic document, and a document having a length equal to or longer than the length of the summary sentence created in the summary creating procedure. A reading summary creation procedure for creating a long summary text for reading, and a display output based on the summary sentence data created in the summary creation procedure, in the summary display area of the display unit, and a voice output unit. And an output procedure for outputting a reading voice based on the reading summary sentence data created in the reading summary creating procedure.

6. An area size setting procedure for variably setting the size of the summary display area in the display unit, and in the reading summary creation procedure, the size of the summary display area set in the area size setting procedure is provided. 6. The document processing method according to claim 5, wherein a document length is set in accordance with the summary and the read summary sentence data is created.

7. The summary sentence display area in the display unit has a fixed size, and the read-out summary creation procedure has a document length exceeding the document length of the summary sentence data created in the summary creation procedure. 6. The document processing method according to claim 5, wherein summary text data for reading is created.

8. A summary creating procedure for creating summary sentence data corresponding to the size of a summary sentence display area of a display unit for an electronic document, and a document longer than the document length of the summary sentence created in the summary creating procedure. A reading summary creating procedure for creating a long reading summary sentence data; and a display output based on the summary sentence data created in the summary creating procedure in the summary displaying area of the display unit, and a voice output unit. And an output procedure for executing a read-out voice based on the read-out summary sentence data created in the read-out summary creation procedure.

9. The method according to claim 9, further comprising an area size setting step of variably setting a size of the summary sentence display area on the display unit, wherein the reading summary creation step includes a step of setting the size of the summary sentence display area set in the area size setting step. 9. The recording medium according to claim 8, wherein an operation control program for setting a document length according to the above and generating summary text data for reading out is recorded.

10. The read-out summary creating procedure records an operation control program for creating read-out summary sentence data having a document length exceeding the document length of the summary sentence data created in the summary creating procedure. 9. The recording medium according to claim 8, wherein: