JP2001014305A

JP2001014305A - Method and device for electronic document processing, and recording medium where electronic document processing program is recorded

Info

Publication number: JP2001014305A
Application number: JP11186838A
Authority: JP
Inventors: Katashi Nagao (長尾 確)
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1999-06-30
Filing date: 1999-06-30
Publication date: 2001-01-19

Abstract

PROBLEM TO BE SOLVED: To read aloud an arbitrary electronic document by speech synthesis with high accuracy and without sounding unnatural. SOLUTION: On receiving a tag file, that is, a tagged document (S1), the document processing apparatus derives attribute information for reading aloud from the tags in the tag file and embeds that attribute information to generate a read-aloud file (S2). The apparatus then uses the generated read-aloud file to perform processing suited to the speech synthesis engine (S3), and performs processing in response to operations the user makes through a user interface (S4).

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical Field of the Invention] The present invention relates to an electronic document processing method and an electronic document processing apparatus for processing electronic documents, and to a recording medium on which an electronic document processing program is recorded.

[0002]

[Prior Art] Conventionally, the WWW (World Wide Web) has been provided on the Internet as an application service that presents hypertext information in a window format.

[0003] The WWW is a system that performs document processing for creating, publishing, and sharing documents, and it has shown what a new style of document can be. From the standpoint of practical use, however, there is demand for advanced document processing that goes beyond the WWW, such as classifying and summarizing documents on the basis of their content. Such advanced processing requires that the content of documents be processed by machine.

[0004] Machine processing of document content nevertheless remains difficult, for the following reasons. First, HTML (Hyper Text Markup Language), the language used to describe hypertext, specifies how a document is presented but specifies almost nothing about its content. Second, the hypertext network formed among documents is not necessarily easy for a reader to use in understanding their content. Third, authors generally write without the reader's convenience in mind, and the reader's convenience is never reconciled with the author's.

[0005] Thus, although the WWW is a system that has shown what a new kind of document can be, it does not process documents by machine and therefore cannot perform advanced document processing. In other words, advanced document processing requires machine processing of documents.

[0006] Accordingly, systems that support machine processing of documents are being developed on the basis of natural language research. As one form of document processing arising from that research, machine document processing has been proposed that uses tags attached to a document, on the premise that the author or another party annotates the document with attribute information about its internal structure, that is, with tags.

[0007] Meanwhile, users rely on information retrieval systems such as so-called search engines to find desired information within the vast amount of information available over the Internet. Such a system retrieves information on the basis of specified keywords and presents the results to the user, who then selects the desired information from among them.

[0008] Although information can be retrieved easily in this way, the user must read through the retrieved information, grasp its outline, and judge whether it is what was wanted. This is a heavy burden on the user, particularly when a large amount of information is returned. For this reason, so-called automatic summarization systems, which automatically summarize text information, that is, the content of a document, have recently attracted attention.

[0009] An automatic summarization system creates a summary by reducing the length and complexity of the text while preserving the gist of the original information, that is, of the document. By reading the summary, the user can grasp the outline of the document.

[0010] An automatic summarization system usually treats each sentence or word in the text as a unit, assigns each unit an importance based on some information, and orders the units accordingly. It then gathers the highest-ranked sentences or words to form the summary.
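The rank-and-collect procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the description does not say what information the importance is based on, so the sketch assumes average word frequency, and the function name `summarize` and its parameters are hypothetical.

```python
import re
from collections import Counter


def summarize(text: str, max_sentences: int = 2) -> str:
    """Rank sentences by a simple importance score and keep the top ones."""
    # Naive sentence splitter: break after ., ! or ? followed by whitespace.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    # Assumed importance measure: how often each word occurs in the whole text.
    freq = Counter(re.findall(r"[a-z]+", text.lower()))

    def score(sentence: str) -> float:
        tokens = re.findall(r"[a-z]+", sentence.lower())
        return sum(freq[t] for t in tokens) / len(tokens) if tokens else 0.0

    # Order sentence indices by descending importance and keep the best few.
    ranked = sorted(range(len(sentences)),
                    key=lambda i: score(sentences[i]), reverse=True)
    chosen = sorted(ranked[:max_sentences])  # restore document order
    return " ".join(sentences[i] for i in chosen)
```

Returning the selected sentences in their original order is a design choice of this sketch, made so that the summary reads coherently; the description only requires that the highest-ranked units be collected.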

[0011]

[Problems to Be Solved by the Invention] With the spread of computers and the growth of networking in recent years, more sophisticated document processing is in demand, and in particular a function that reads a document aloud by speech synthesis.

[0012] Speech synthesis, by its nature, generates speech mechanically on the basis of speech analysis results and a simulation of the human speech production mechanism, assembling the elements or phonemes of a given language under digital control.

[0013] However, conventional speech synthesis cannot take breaks in a document into account when reading an arbitrary document aloud, and so cannot read naturally. In addition, the user has had to select an appropriate speech synthesis engine according to the language. Furthermore, the accuracy with which words prone to misreading, such as technical terms and hard-to-read words, are read correctly has depended on the dictionary in use.

[0014] The present invention has been made in view of these circumstances, and its object is to provide an electronic document processing method and an electronic document processing apparatus that can read an arbitrary document aloud by speech synthesis with high accuracy and without sounding unnatural, and a recording medium on which an electronic document processing program is recorded.

[0015]

[Means for Solving the Problems] To achieve the above object, an electronic document processing method according to the present invention is a method for processing an electronic document, characterized by comprising a read-aloud file generation step of generating, on the basis of the electronic document, a read-aloud file for reading the document aloud by speech synthesis.

[0016] This electronic document processing method generates a read-aloud file on the basis of the electronic document and reads the electronic document aloud.

[0017] Also to achieve the above object, an electronic document processing method according to the present invention is a method for processing an electronic document, characterized by comprising: a document input step of inputting an electronic document that has an internal structure in which a plurality of elements are arranged hierarchically and to which tag information indicating that internal structure has been attached in advance; and a document read-aloud step of reading the electronic document aloud by speech synthesis on the basis of the tag information.

[0018] This electronic document processing method takes as input an electronic document to which tag information indicating a hierarchical internal structure of a plurality of elements has been attached in advance, and reads the electronic document aloud directly on the basis of the tag information.

[0019] Further, to achieve the above object, an electronic document processing apparatus according to the present invention is an apparatus for processing an electronic document, characterized by comprising read-aloud file generation means for generating, on the basis of the electronic document, a read-aloud file for reading the document aloud by speech synthesis.

[0020] This electronic document processing apparatus generates a read-aloud file on the basis of the electronic document and uses that file to read the electronic document aloud.

[0021] Furthermore, to achieve the above object, an electronic document processing apparatus according to the present invention is an apparatus for processing an electronic document, characterized by comprising: document input means for inputting an electronic document that has an internal structure in which a plurality of elements are arranged hierarchically and to which tag information indicating that internal structure has been attached in advance; and document read-aloud means for reading the electronic document aloud by speech synthesis on the basis of the tag information.

[0022] This electronic document processing apparatus takes as input an electronic document to which tag information indicating a hierarchical internal structure of a plurality of elements has been attached in advance, and reads the electronic document aloud directly on the basis of the tag information attached to it.

[0023] Also to achieve the above object, a recording medium according to the present invention is a recording medium on which a computer-controllable electronic document processing program for processing an electronic document is recorded, wherein the electronic document processing program is characterized by comprising a read-aloud file generation step of generating, on the basis of the electronic document, a read-aloud file for reading the document aloud by speech synthesis.

[0024] This recording medium provides an electronic document processing program that generates a read-aloud file on the basis of the electronic document and reads the electronic document aloud.

[0025] Further, to achieve the above object, a recording medium according to the present invention is a recording medium on which a computer-controllable electronic document processing program for processing an electronic document is recorded, wherein the electronic document processing program is characterized by comprising: a document input step of inputting an electronic document that has an internal structure in which a plurality of elements are arranged hierarchically and to which tag information indicating that internal structure has been attached in advance; and a document read-aloud step of reading the electronic document aloud by speech synthesis on the basis of the tag information.

[0026] This recording medium provides an electronic document processing program that takes as input an electronic document to which tag information indicating a hierarchical internal structure of a plurality of elements has been attached in advance, and reads the electronic document aloud directly on the basis of the tag information.

[0027]

[Embodiments of the Invention] Specific embodiments to which the present invention is applied are described in detail below with reference to the drawings.

[0028] The document processing apparatus described as an embodiment of the present invention has a function of reading aloud, by means of a speech synthesis engine, a given electronic document or a summary created from that electronic document. In the following description, an electronic document is referred to simply as a document.

[0029] As shown in FIG. 1, the document processing apparatus comprises: a main body 10 having a control unit 11 and an interface 12; an input unit 20 that supplies information entered by the user to the main body 10; a receiving unit 21 that receives external signals and supplies them to the main body 10; a communication unit 22 that handles communication between a server 24 and the main body 10; an audio output unit 30 that outputs information from the main body 10 as sound; a display unit 31 that displays information output from the main body 10; a recording/reproducing unit 32 that records information on and/or reproduces information from a recording medium 33; and a hard disk drive (HDD) 34.

[0030] The main body 10 has the control unit 11 and the interface 12 and forms the principal part of the document processing apparatus.

[0031] The control unit 11 has a CPU (Central Processing Unit) 13 that executes the processing of the document processing apparatus, a RAM (Random Access Memory) 14, which is volatile memory, and a ROM (Read Only Memory) 15, which is non-volatile memory.

[0032] The CPU 13 performs control for executing programs in accordance with programs recorded, for example, in the ROM 15 or on the hard disk. The RAM 14 temporarily stores, as needed, the programs and data that the CPU 13 requires to execute various kinds of processing.

[0033] The interface 12 is connected to the input unit 20, the receiving unit 21, the communication unit 22, the display unit 31, the recording/reproducing unit 32, and the hard disk drive 34. Under the control of the control unit 11, the interface 12 adjusts the timing of data input and output and converts data formats for data input through the input unit 20, the receiving unit 21, and the communication unit 22, for data output to the display unit 31, and for data input to and output from the recording/reproducing unit 32.

[0034] The input unit 20 receives user input to the document processing apparatus and consists of, for example, a keyboard and a mouse. With the input unit 20, the user can, for example, enter a keyword from the keyboard or use the mouse to select an element of a document displayed on the display unit 31. Here, an element is a constituent of a document and includes, for example, a document, a sentence, and a word.

[0035] The receiving unit 21 receives data transmitted to the document processing apparatus from outside, for example over a communication line. It receives a plurality of documents, which are electronic documents, and an electronic document processing program for processing those documents. Data received by the receiving unit 21 is supplied to the main body 10.

[0036] The communication unit 22 consists of, for example, a modem or a terminal adapter and is connected to the Internet 23 over a telephone line. A server 24 storing data such as documents is connected to the Internet 23, and the communication unit 22 can access the server 24 via the Internet 23 and receive data from it. Data received by the communication unit 22 is supplied to the main body 10.

[0037] The audio output unit 30 consists of, for example, a speaker. It receives through the interface 12 the electrical audio signal produced by the speech synthesis engine or the like, as well as various other audio signals, converts them into sound, and outputs the sound.

[0038] The display unit 31 receives character information and image information through the interface 12 and displays it. The display unit 31 consists of, for example, a cathode ray tube (CRT) or a liquid crystal display (LCD); it displays one or more windows and shows characters, graphics, and the like in them.

[0039] Under the control of the control unit 11, the recording/reproducing unit 32 records data on and/or reproduces data from a removable recording medium 33 such as a floppy disk, an optical disc, or a magneto-optical disk. The recording medium 33 holds an electronic document processing program for processing documents and the documents to be processed.

[0040] The hard disk drive 34 records data on and/or reproduces data from a hard disk, which is a large-capacity magnetic recording medium.

[0041] The document processing apparatus receives a desired document and displays it on the display unit 31 as follows.

[0042] First, the user operates the input unit 20 to start a program for communicating over the Internet 23 and enters the URL (Uniform Resource Locator) of the server 24 (a search engine). The control unit 11 then controls the communication unit 22 to access the server 24.

[0043] In response, the server 24 sends the data for a search screen over the Internet 23 to the communication unit 22 of the document processing apparatus. The CPU 13 outputs this data through the interface 12 to the display unit 31, which displays it.

[0044] When the user uses the input unit 20 to enter a keyword or the like on this search screen and requests a search, a search command is sent from the communication unit 22 over the Internet 23 to the server 24 acting as the search engine.

[0045] On receiving the search command, the server 24 executes it and sends the search results over the Internet 23 to the communication unit 22. The control unit 11 controls the communication unit 22 to receive the search results sent from the server 24 and displays part of them on the display unit 31.

[0046] Specifically, when the user enters a keyword such as "TCP" with the input unit 20 and requests a search, the server 24 sends the document processing apparatus various information containing the keyword "TCP", and a document such as the following is displayed on the display unit 31.

[0047] "It is no exaggeration to say that the history of TCP/IP (Transmission Control Protocol/Internet Protocol) is the history of computer networks in North America, or indeed the world. And the history of TCP/IP cannot be told without ARPANET. ARPANET, formally the Advanced Research Project Agency Network, is a packet-switching network for experimentation and research that was built under the sponsorship of the Defence Advanced Research Project Agency (DARPA) of the US Department of Defence (DOD). ARPANET started in 1969 as a very small network that linked the host computers of four universities and research institutes on the west coast of North America over 50 kbps lines. At that time, ENIAC, the world's first computer, had been developed at the University of Pennsylvania in 1945, and in 1964 a series of general-purpose mainframe computers had been developed that were the first to use ICs as logic elements and that shaped the history of third-generation computers; computers had only just been born. Given this historical background, a project like this, which anticipated the future heyday of computer communications, can truly be said to have been uniquely American."

[0048] The internal structure of this document is described by attribute information in the form of tags, as explained below. Document processing in the document processing apparatus is performed with reference to the tags attached to the document. In this embodiment, the document carries not only syntactic tags that indicate its structure but also semantic and pragmatic tags that make machine understanding of the document's content possible across multiple languages.

[0049] Syntactic tagging includes tagging that describes the tree-like internal structure of a document. That is, in this embodiment, as shown in FIG. 2, the internal structure, the individual elements such as the document, sentences, and lexical elements, the normal links, the reference and referenced links, and so on are attached to the document in advance as tags. In the figure, each white circle "○" is an element of the document, such as a lexical item, a segment, or a sentence, and the white circles at the lowest level are lexical elements corresponding to the smallest-level words in the document. The solid lines are normal links, which indicate the connections between elements of the document such as words, phrases, clauses, and sentences. The broken lines are reference links, which indicate dependency relations between referring and referenced elements. From top to bottom, the internal structure of a document consists of the document, subdivisions, paragraphs, sentences, subsentential segments, and so on, down to lexical elements. Of these, subdivisions and paragraphs are optional.
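One way to picture the tree described above is as a small data structure in which normal links are parent-to-child edges and reference links are separate pointers between elements. The following Python sketch is illustrative only; the class name `Element` and its fields are assumptions and do not come from the patent.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Element:
    kind: str                     # e.g. "document", "paragraph", "sentence", "lexical"
    text: str = ""                # surface text; only lexical elements carry it here
    children: List["Element"] = field(default_factory=list)  # normal links
    refers_to: Optional["Element"] = None                     # reference link

    def leaves(self) -> List["Element"]:
        """Return the lexical elements under this element, left to right."""
        if self.kind == "lexical":
            return [self]
        return [leaf for child in self.children for leaf in child.leaves()]


# document -> paragraph (optional level) -> sentences -> lexical elements
arpanet = Element("lexical", "ARPANET")
started = Element("lexical", "started")
it = Element("lexical", "it", refers_to=arpanet)  # reference link to "ARPANET"
grew = Element("lexical", "grew")

doc = Element("document", children=[
    Element("paragraph", children=[
        Element("sentence", children=[arpanet, started]),
        Element("sentence", children=[it, grew]),
    ]),
])
```

Calling `doc.leaves()` walks the normal links down to the lexical elements, while `it.refers_to` follows a reference link across the tree, which mirrors the solid and broken lines of FIG. 2.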

[0050] Semantic and pragmatic tagging, on the other hand, includes tagging related to syntactic structure that indicates dependencies, such as the referent of a pronoun, and tagging that describes semantic information, such as the sense of a polysemous word. The tagging in this embodiment uses the XML (eXtensible Markup Language) format, which is similar to HTML (Hyper Text Markup Language).
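As a rough illustration of how a tag that records the referent of a pronoun could be used by a program, the Python sketch below parses a simplified XML fragment and looks up the antecedent of a pronoun. The `identifier` and `coreference` attributes follow the Japanese tagged example given later in this description; the ASCII tag names with underscores, the shortened fragment, and the helper `antecedent_text` are assumptions made so that the fragment is well-formed XML.

```python
import xml.etree.ElementTree as ET

# Simplified fragment (assumed tag names); the pronoun "そ" co-refers to "Ｂ会".
FRAGMENT = (
    '<sentence>'
    '<noun_phrase identifier="Ｂ会">'
    '<person_name identifier="Ａ氏">Ａ氏</person_name>の'
    '<organization_name identifier="Ｂ会">Ｂ会</organization_name>'
    '</noun_phrase>が終わった'
    '<place_name identifier="Ｃ市">Ｃ市</place_name>で、'
    '<noun coreference="Ｂ会">そ</noun>の写真報道'
    '</sentence>'
)
root = ET.fromstring(FRAGMENT)

# Map each identifier to the first (outermost) element that declares it.
by_id = {}
for el in root.iter():
    ident = el.get("identifier")
    if ident is not None and ident not in by_id:
        by_id[ident] = el


def antecedent_text(el):
    """Return the text of the element that `el` co-refers to, or None."""
    target = by_id.get(el.get("coreference"))
    return "".join(target.itertext()) if target is not None else None


pronoun = root.find(".//noun[@coreference]")
```

Here `antecedent_text(pronoun)` yields the full noun phrase that the pronoun stands for, which a read-aloud or summarization step could substitute for the pronoun when that helps the listener.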

[0051] An example of the internal structure of a tagged document is shown below, although tagging of documents is not limited to this method. The examples below are English and Japanese documents, but the description of internal structure by tagging is equally applicable to other languages.

[0052] For example, the sentence "Time flies like an arrow." can be tagged as follows: <sentence><noun phrase word sense="time0">time</noun phrase><verb phrase><verb word sense="fly1">flies</verb><adjective verb phrase><adjective verb word sense="like0">like</adjective verb><noun phrase>an <noun word sense="arrow0">arrow</noun></noun phrase></adjective verb phrase></verb phrase>.</sentence>

[0053] Here, <sentence>, <noun>, <noun phrase>, <verb>, <verb phrase>, <adjective verb>, and <adjective verb phrase> denote, respectively, a sentence, a noun, a noun phrase, a verb, a verb phrase, an adjective or adverb (including a prepositional or postpositional phrase), and an adjective phrase or adverbial phrase, and together they represent the syntactic structure of the sentence. Tags are placed immediately before the start and immediately after the end of each element. The tag placed immediately after the end of an element marks the end of the element with the symbol "/". Elements denote syntactic constituents, that is, phrases, clauses, and sentences. The word sense "time0" indicates the 0th of the several senses of the word "time". Specifically, "time" can be a noun or a verb, and the tag here shows that it is a noun. Similarly, the word "orange" has at least the senses of a plant name, a color, and a fruit, and these too can be distinguished by word sense.
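To make the role of these tags concrete, the following Python sketch parses a well-formed XML rendering of the "Time flies like an arrow." example and extracts both the plain text and the word senses. It is a sketch under assumptions: the tag names are written with underscores and the attribute is shortened to `sense` so that the string is valid XML, and spaces are placed between words; none of this is prescribed by the patent.

```python
import xml.etree.ElementTree as ET

# Assumed XML rendering of the tagged sentence (underscored tag names).
TAGGED = (
    '<sentence><noun_phrase sense="time0">time</noun_phrase> '
    '<verb_phrase><verb sense="fly1">flies</verb> '
    '<adjective_verb_phrase><adjective_verb sense="like0">like</adjective_verb> '
    '<noun_phrase>an <noun sense="arrow0">arrow</noun></noun_phrase>'
    '</adjective_verb_phrase></verb_phrase>.</sentence>'
)

root = ET.fromstring(TAGGED)

# Plain text for display or reading aloud: concatenate all text nodes in order.
plain = "".join(root.itertext())

# Word senses: every element that carries a "sense" attribute.
senses = {el.text: el.get("sense") for el in root.iter() if el.get("sense")}
```

After this runs, `plain` holds the untagged sentence and `senses` maps each sense-tagged word to its sense label, so a later stage can, for example, choose a pronunciation or a dictionary entry by sense rather than by spelling alone.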

[0054] In a document processing apparatus that uses such documents, the syntactic structure can be displayed in a window 101 of the display unit 31, as shown in FIG. 3. In the window 101, the lexical elements are displayed in the right half 103 and the internal structure of the sentence is displayed in the left half 102. The window 101 can display the syntactic structure not only of documents written in Japanese but also of documents written in any other language, such as English.

[0055] Specifically, the right half 103 of the window 101 here displays part of the following tagged document: 「Ａ氏のＢ会が終わったＣ市で、一部の大衆紙と一般紙がその写真報道を自主規制する方針を紙面で明らかにした。」 ("In City C, where Mr. A's B meeting had ended, some popular newspapers and general newspapers announced in their pages a policy of voluntarily restricting their photo coverage of it."). An example of the tagging of this document is shown next.

【００５６】 <document><sentence>
<adjective verb phrase relation="place"><noun phrase><adjective verb phrase place="City C"><adjective verb phrase relation="subject"><noun phrase identifier="B meeting"><adjective verb phrase relation="possession"><person name identifier="Mr. A">Mr. A</person name>'s</adjective verb phrase><organization name identifier="B meeting">B meeting</organization name></noun phrase></adjective verb phrase>has ended</adjective verb phrase><place name identifier="City C">City C</place name></noun phrase>in,</adjective verb phrase>
<adjective verb phrase relation="subject"><noun phrase identifier="newspaper" syntax="parallel"><noun phrase><adjective verb phrase>some</adjective verb phrase>popular newspapers</noun phrase>and<noun>general newspapers</noun></noun phrase></adjective verb phrase>
<adjective verb phrase relation="object"><adjective verb phrase relation="content" subject="newspaper"><adjective verb phrase relation="object"><noun phrase><adjective verb phrase><noun coreference="B meeting">it</noun>s</adjective verb phrase>photographic coverage</noun phrase></adjective verb phrase>voluntarily restrict</adjective verb phrase>policy</adjective verb phrase>
<adjective verb phrase relation="position">in their pages</adjective verb phrase>announced.
</sentence></document>

【００５７】 In this document, "some popular newspapers and general newspapers" is marked as parallel by the tag syntax="parallel". Parallel elements are defined as elements that share a dependency relation. Where nothing in particular is specified, <noun phrase relation="x"><noun>A</noun><noun>B</noun></noun phrase>, for example, indicates that A depends on B.

【００５８】 The expression relation="x" denotes a relation attribute. This attribute describes interrelations in terms of syntax, semantics, and rhetoric. Grammatical functions such as subject, object, and indirect object; thematic roles such as agent, patient, and beneficiary; and rhetorical relations such as reason and result are all described by this attribute. The relation attribute is written in the form relation=***. In this embodiment, relation attributes are described for relatively simple grammatical functions such as subject, object, and indirect object.

【００５９】 In this document, the attributes of proper nouns such as "Mr. A", "B meeting", and "City C" are described by tags such as place name, person name, and organization name. The words to which these place name, person name, and organization name tags are attached are proper nouns.

【００６０】 The document processing apparatus can receive a document tagged in this way. When the CPU 13 starts the voice reading program among the electronic document processing programs recorded in the ROM 15 or on the hard disk, the apparatus reads the document aloud through the series of steps shown in FIG. 4. Each step is first outlined briefly here and then described in detail using specific document examples.

【００６１】 First, as shown in FIG. 4, the document processing apparatus receives a tagged document in step S1. As described later, this document is assumed to carry the tags needed for speech synthesis. The apparatus can also receive a tagged document and create a document by newly adding the tags needed for speech synthesis. Furthermore, the apparatus may receive an untagged document, tag it (including the tags needed for speech synthesis), and thereby create a tag file. In the following, a tagged document that has been received or created in one of these ways is referred to as a tag file.

【００６２】 Next, in step S2, the document processing apparatus generates a voice reading file from the tag file under the control of the CPU 13. As described later, the voice reading file is generated by deriving attribute information for reading aloud from the tags in the tag file and embedding that attribute information.

【００６３】 Next, in step S3, the document processing apparatus uses the voice reading file to perform, under the control of the CPU 13, processing suited to the speech synthesis engine. The speech synthesis engine may be implemented in hardware or in software. When it is implemented in software, the application program is stored in advance in the ROM 15, on the hard disk, or the like of the document processing apparatus.

【００６４】 Then, in step S4, the document processing apparatus performs processing in response to operations that the user carries out through a user interface described later.

【００６５】 By performing this processing, the document processing apparatus can synthesize speech for a given document and read it aloud. Each of these steps is described in detail below.

【００６６】 First, the reception or creation of the tagged document in step S1 will be described. As mentioned above, the document processing apparatus accesses, for example, the server 24 shown in FIG. 1 and receives a document found by a search based on keywords or the like. The apparatus can also receive a tagged document and create a document by newly adding the tags needed for speech synthesis. Furthermore, it can receive an untagged document, tag it (including the tags needed for speech synthesis), and create a tag file.

【００６７】 Here it is assumed that the apparatus has received or created a tag file in which a Japanese or English document such as that shown in FIG. 5 or FIG. 6 has been tagged. The original document of the tag file shown in FIG. 5 is the following Japanese document.

【００６８】 "[Aging Gracefully] / 8: Cancer metastasis can be suppressed!? Cancer has been the leading cause of death in Japan for more than ten years. The death rate tends to rise with age. When we consider the health of the elderly, the problem of cancer cannot be avoided. Cancer is characterized by cell proliferation and metastasis. Human cells contain 'oncogenes', which correspond to the accelerator of a car and make cancer grow rapidly, and 'tumor suppressor genes', which act as the brake. As long as the two are in balance, there is no problem. When the normal regulatory function is lost and mutations occur that leave the cell's brakes ineffective, the cancer begins to grow. In the elderly, these mutations accumulate over many years, the proportion of cells that meet the conditions for becoming cancerous increases, and cancer becomes more frequent. Incidentally, without the other characteristic, metastasis, cancer would not need to be feared so much, because excision alone would make a complete cure possible. This is why suppressing metastasis is important. Metastasis does not arise merely because cancer cells multiply. Cancer cells dissolve substances such as the proteins (蛋白（たんぱく）質) between cells, open a path for themselves, and enter blood vessels and lymph vessels (リンパ管). It has recently been coming to light that they make complex movements, such as burrowing in search of a new 'home' (住み家) as they circulate."

【００６９】 When the document processing apparatus receives this Japanese document, it displays the document in a window 110 on the display unit 31, as shown in FIG. 5. The window 110 is divided into a display area 120 and a display area 130 in which the document is displayed. The display area 120 contains a document name display section 111 showing the name of the document, a keyword input section 112 for entering keywords, a summary creation execution button 113 for creating a summary of the document as described later, a reading execution button 114 for starting the voice reading, and so on. At the right edge of the display area 130 are a scroll bar 131 and buttons 132 and 133 for moving the scroll bar 131 up and down. The user can scroll the content of the display area 130 vertically by moving the scroll bar 131 directly with, for example, the mouse of the input unit 20, or by pressing the buttons 132 and 133.

【００７０】 The original document of the tag file shown in FIG. 6, on the other hand, is the following English document.

【００７１】 "During its centennial year, The Wall Street Journal will report events of the past century that stand as milestones of American business history. THREE COMPUTERS THAT CHANGED the face of personal computing were launched in 1977. That year the Apple II, Commodore Pet and Tandy TRS came to market. The computers were crude by today's standards. Apple II owners, for example, had to use their television sets as screens and stored data on audiocassettes."

【００７２】 When the document processing apparatus receives this English document, it displays the document in a window 140 on the display unit 31, as shown in FIG. 6. Like the window 110, the window 140 is divided into a display area 150 and a display area 160 in which the document is displayed. The display area 150 contains a document name display section 141 showing the name of the document, a keyword input section 142 for entering keywords, a summary creation execution button 143 for creating a summary of the document, a reading execution button 144 for starting the voice reading, and so on. At the right edge of the display area 160 are a scroll bar 161 and buttons 162 and 163 for moving the scroll bar 161 up and down. The user can scroll the content of the display area 160 vertically by moving the scroll bar 161 directly with, for example, the mouse of the input unit 20, or by pressing the buttons 162 and 163.

【００７３】 The Japanese and English documents shown in FIG. 5 and FIG. 6 are structured as the tag files shown in FIG. 7 and FIG. 8, respectively.

【００７４】 The tag file shown in FIG. 7 shows, in part (A), an excerpt of the heading, "[Aging Gracefully] / 8: Cancer metastasis can be suppressed!?", and, in part (B), an excerpt of the last paragraph, "Metastasis does not arise merely because cancer cells multiply. Cancer cells dissolve substances such as the proteins between cells, open a path for themselves, and enter blood vessels and lymph vessels. It has recently been coming to light that they make complex movements, such as burrowing in search of a new 'home' as they circulate." The remaining paragraphs are omitted. The actual tag file is a single file running from the heading to the last paragraph.

【００７５】 In the heading shown in FIG. 7(A), the tag <heading> indicates that this portion is a heading. The last paragraph shown in FIG. 7(B) carries, among others, tags indicating that the relation attribute is "condition" or "means". The last paragraph shown in FIG. 7(B) also contains examples of the tags needed for speech synthesis mentioned above.

【００７６】 First, among the tags needed for speech synthesis are those attached where the original document itself supplies the reading in kana, as in 「蛋白（たんぱく）」. In this case, to prevent the text from being read redundantly as "tanpaku tanpaku", the reading attribute information pronunciation="null" is described; that is, a tag prohibiting the reading of the portion 「（たんぱく）」 is attached. This tag also carries information indicating that it has a special function.

【００７７】 Tags needed for speech synthesis are also attached to technical terms such as 「リンパ管」 (lymph vessel) and to hard-to-read words such as 「住み家」 (home), which may be read aloud incorrectly. In this case, to prevent them from being misread as "rinpakuda" and "sumiie", reading attribute information giving the kana readings, pronunciation="りんぱかん" (rinpakan) and pronunciation="すみか" (sumika), is described.

【００７８】 The tag file shown in FIG. 8, on the other hand, carries a tag indicating a complement clause and a tag indicating that several sentences are joined in succession to form one sentence. As a tag needed for speech synthesis in this tag file, the reading attribute information pronunciation="two" is described for the Roman numeral "II". It is described so that "II", which is meant to be read as "two", is not read as "second".

【００７９】 Although not illustrated, when a document contains a quotation, for example, such a tag file is given a tag indicating that the sentence is a quotation. Likewise, although not illustrated, when a document contains an interrogative sentence, the tag file is given a tag indicating that the sentence is a question.

【００８０】 In step S1 shown in FIG. 4, the document processing apparatus receives or creates a document to which the tags needed for speech synthesis have been attached in this way.

【００８１】 Next, the generation of the voice reading file in step S2 will be described. The document processing apparatus generates the voice reading file by deriving attribute information for reading aloud from the tags in the tag file and embedding that attribute information.

【００８２】 Specifically, the document processing apparatus finds the tags that indicate the start positions of the paragraphs, sentences, and phrases of the document and embeds attribute information for reading aloud in correspondence with those tags. In addition, when a summary of the document has been created as described later, the apparatus can find in the document the start positions of the portions included in the summary and embed attribute information that raises the volume during reading, thereby emphasizing that those portions belong to the summary.

【００８３】 The document processing apparatus generates a voice reading file such as that shown in FIG. 9 or FIG. 10 from the tag file shown in FIG. 7 or FIG. 8. The voice reading file shown in FIG. 9(A) corresponds to the excerpt of the heading shown in FIG. 7(A), and the voice reading file shown in FIG. 9(B) corresponds to the excerpt of the last paragraph shown in FIG. 7(B). The actual voice reading file is, of course, a single file running from the heading to the last paragraph.

【００８４】 In the voice reading file shown in FIG. 9(A), the attribute information Com=Lang=*** is embedded at the start position of the document. This attribute information indicates the language in which the document is written. Here it is Com=Lang=JPN, indicating that the document is written in Japanese. By referring to this attribute information, the document processing apparatus can select, for each document, a speech synthesis engine appropriate to its language.

【００８５】 The voice reading files shown in FIG. 9(A) and FIG. 9(B) also have the attribute information Com=begin_p, Com=begin_s, and Com=begin_ph embedded in them. These indicate the start positions of a paragraph, a sentence, and a phrase of the document, respectively. The document processing apparatus detects the start positions of at least two of these three units (paragraphs, sentences, and phrases) on the basis of the tags in the tag file described above. Where tags representing syntactic structure of the same level appear in succession in the tag file, such as <adjective verb phrase><noun phrase>, the voice reading file does not receive one Com=begin_ph per tag; the tags are merged and a single Com=begin_ph is embedded.

【００８６】 Furthermore, the attribute information Pau=500, Pau=100, and Pau=50 is embedded in the voice reading file in correspondence with Com=begin_p, Com=begin_s, and Com=begin_ph, respectively. These indicate that pauses of 500 ms, 100 ms, and 50 ms are to be inserted during reading. That is, the document processing apparatus has the speech synthesis engine read the document with pauses of 500 ms, 100 ms, and 50 ms at the start positions of paragraphs, sentences, and phrases, respectively. Because this attribute information is embedded in correspondence with Com=begin_p, Com=begin_s, and Com=begin_ph, a portion in which tags of the same syntactic level appear in succession, such as <adjective verb phrase><noun phrase> in the tag file, is treated as one phrase: a single Pau=50 is embedded rather than one per tag. Where tags of different syntactic levels appear in succession, such as <paragraph><sentence><noun phrase>, the Pau=*** corresponding to each level is embedded. When reading such a portion, the apparatus therefore inserts a pause of, for example, 650 ms, the sum of the paragraph, sentence, and phrase pauses. By providing pauses whose lengths decrease in the order paragraph, sentence, phrase, the apparatus can read aloud naturally, with the breaks between paragraphs, sentences, and phrases taken into account. The pauses at the start positions of paragraphs, sentences, and phrases need not be 500 ms, 100 ms, and 50 ms; they can be changed as appropriate.
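The pause rule described above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions: the dictionary PAUSE_MS and the function pause_before are hypothetical names, and the pause lengths are the example values given in the text (which the text notes may be changed).

```python
# Example pause lengths in milliseconds, taken from the values in the text.
PAUSE_MS = {"begin_p": 500, "begin_s": 100, "begin_ph": 50}


def pause_before(markers):
    """Return the total pause for a run of consecutive start markers.

    Markers of the same level are merged and count once; markers of
    different levels each contribute their own pause.
    """
    return sum(PAUSE_MS[m] for m in set(markers))


print(pause_before(["begin_ph", "begin_ph"]))            # 50
print(pause_before(["begin_p", "begin_s", "begin_ph"]))  # 650
```

Two consecutive phrase tags yield a single 50 ms pause, whereas a paragraph, sentence, and phrase starting together yield 500 + 100 + 50 = 650 ms.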

【００８７】 Furthermore, in the voice reading file shown in FIG. 9(B), 「（たんぱく）」 has been removed in accordance with the reading attribute information pronunciation="null" described in the tag file, and 「リンパ管」 and 「住み家」 have been replaced by 「りんぱかん」 and 「すみか」 in accordance with the reading attribute information pronunciation="りんぱかん" and pronunciation="すみか", respectively. By embedding such reading attribute information, the document processing apparatus avoids misreadings caused by deficiencies in the dictionary that the speech synthesis engine consults.

【００８８】 On the basis of a tag indicating that a sentence in the document is a quotation, attribute information specifying that a different speech synthesis engine be used for that quotation alone may also be embedded in the voice reading file.

【００８９】 Furthermore, on the basis of a tag indicating an interrogative sentence, attribute information for raising the intonation at the end of that sentence may be embedded in the voice reading file.

【００９０】 Furthermore, attribute information for converting the plain "de aru" style of writing into the polite "desu/masu" style can be embedded in the voice reading file as necessary. In this case, instead of embedding such attribute information in the voice reading file, the document processing apparatus may convert the "de aru" style text into the "desu/masu" style when it generates the voice reading file.

【００９１】 In the voice reading file shown in FIG. 10, on the other hand, the attribute information Com=Lang=ENG is embedded at the start position of the document, indicating that the document is written in English.

【００９２】 The attribute information Com=Vol=*** is also embedded in the voice reading file. This attribute information indicates the volume used for reading. For example, Com=Vol=0 indicates reading at the default volume of the document processing apparatus, and Com=Vol=80 indicates reading at the default volume increased by 80%. A given Com=Vol=*** remains in effect until the next Com=Vol=***.
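The scope rule for the volume attribute (each Com=Vol=*** stays in effect until the next one) can be sketched as follows. This is a minimal illustration that assumes the voice reading file has already been split into a flat list of tokens; the function name volumes and the token-list representation are hypothetical and do not come from the source.

```python
def volumes(tokens):
    """Pair each text token with the volume setting in effect for it.

    A Com=Vol=N token changes the current setting; the setting then
    applies to every following token until the next Com=Vol=N.
    """
    current = 0  # Com=Vol=0 denotes the default volume
    result = []
    for token in tokens:
        if token.startswith("Com=Vol="):
            current = int(token[len("Com=Vol="):])
        else:
            result.append((token, current))
    return result


print(volumes(["Com=Vol=0", "a", "Com=Vol=80", "b", "c", "Com=Vol=0", "d"]))
# [('a', 0), ('b', 80), ('c', 80), ('d', 0)]
```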

【００９３】 Furthermore, in the voice reading file, "II" has been replaced by "two" in accordance with the reading attribute information pronunciation="two" described in the tag file.

【００９４】 The document processing apparatus generates such a voice reading file through the series of steps shown in FIG. 11.

【００９５】 First, as shown in FIG. 11, in step S11 the CPU 13 of the document processing apparatus analyzes the received or created tag file. Here the apparatus determines the language in which the document is written and, on the basis of the tags, locates the start positions of the paragraphs, sentences, and phrases of the document as well as the reading attribute information.

【００９６】 Next, in step S12, the CPU 13 embeds Com=Lang=*** at the start position of the document according to the language in which the document is written.

【００９７】 Next, in step S13, the CPU 13 replaces the start positions of the paragraphs, sentences, and phrases of the document with the corresponding attribute information of the voice reading file. That is, the apparatus replaces <paragraph>, <sentence>, and <*** phrase> in the tag file with Com=begin_p, Com=begin_s, and Com=begin_ph, respectively.

【００９８】 Next, in step S14, the CPU 13 merges into a single Com=begin_*** any identical Com=begin_*** markers that are duplicated because syntactic structures of the same level appear in succession.

【００９９】 Next, in step S15, the CPU 13 embeds Pau=*** in correspondence with each Com=begin_***. That is, the apparatus embeds Pau=500 immediately before Com=begin_p, Pau=100 immediately before Com=begin_s, and Pau=50 immediately before Com=begin_ph.
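Steps S12 through S15 can be sketched in Python as shown below. This is a simplified illustration, not the apparatus's actual implementation: it assumes hypothetical single-word tag names written with underscores (for example <noun_phrase>), treats any tag whose name ends in "phrase" as a phrase-level tag, and returns the voice reading file as a list of tokens. The names PAUSE, marker_for, and to_reading_tokens are likewise hypothetical.

```python
import re

# Example pause lengths (ms) for paragraph, sentence, and phrase starts.
PAUSE = {"p": 500, "s": 100, "ph": 50}


def marker_for(tag_name):
    """Map a tag name to a marker level, or None for any other tag."""
    if tag_name == "paragraph":
        return "p"
    if tag_name == "sentence":
        return "s"
    if tag_name.endswith("phrase"):
        return "ph"
    return None


def to_reading_tokens(tagged, lang="JPN"):
    out = [f"Com=Lang={lang}"]      # S12: language at the document start
    pending = []                    # start markers not yet written out
    for tok in re.findall(r"<[^>]+>|[^<]+", tagged):
        if tok.startswith("</"):
            continue                # closing tags produce no marker
        if tok.startswith("<"):
            level = marker_for(tok[1:-1].split()[0])
            if level and level not in pending:  # S13, merged as in S14
                pending.append(level)
            continue
        text = tok.strip()
        if not text:
            continue
        for level in pending:       # S15: pause just before each marker
            out += [f"Pau={PAUSE[level]}", f"Com=begin_{level}"]
        pending = []
        out.append(text)
    return out
```

For the input `<paragraph><sentence><noun_phrase><noun>A</noun></noun_phrase><verb_phrase>B</verb_phrase></sentence></paragraph>`, the sketch emits the paragraph, sentence, and phrase markers (each preceded by its pause) before "A", and a single phrase marker with Pau=50 before "B".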

【０１００】 Then, in step S16, the CPU 13 substitutes the correct readings on the basis of the reading attribute information. That is, the apparatus removes 「（たんぱく）」 on the basis of the reading attribute information pronunciation="null", and replaces 「リンパ管」 and 「住み家」 with 「りんぱかん」 and 「すみか」 on the basis of the reading attribute information pronunciation="りんぱかん" and pronunciation="すみか", respectively.
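The substitution in step S16 can be illustrated with the following Python sketch. The tag syntax used here, `<w pron="...">...</w>`, is a hypothetical stand-in for the reading attribute tags of the tag file, whose exact notation is shown only in the figures; the names PRON_TAG and apply_readings are also hypothetical.

```python
import re

# Hypothetical notation for a span that carries a reading attribute.
PRON_TAG = re.compile(r'<w pron="([^"]*)">(.*?)</w>')


def apply_readings(text):
    """Replace each tagged span with its reading.

    A reading of "null" suppresses the span entirely; any other
    reading replaces the surface text.
    """
    def substitute(match):
        reading = match.group(1)
        return "" if reading == "null" else reading

    return PRON_TAG.sub(substitute, text)


print(apply_readings('蛋白<w pron="null">（たんぱく）</w>質'))  # 蛋白質
print(apply_readings('<w pron="りんぱかん">リンパ管</w>'))      # りんぱかん
print(apply_readings('Apple <w pron="two">II</w>'))             # Apple two
```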

【０１０１】 By performing the processing shown in FIG. 11 in step S2 of FIG. 4, the document processing apparatus automatically generates the voice reading file. The apparatus stores the generated voice reading file in the RAM 14.

[0102] Next, the processing that uses the voice reading file in step S3 in FIG. 4 will be described. Using the voice reading file, the document processing apparatus performs, under the control of the CPU 13, processing suited to the speech synthesis engine stored in advance in the ROM 15, on the hard disk, or the like.

[0103] Specifically, the document processing apparatus selects the speech synthesis engine to be used on the basis of the attribute information Com=Lang=*** embedded in the voice reading file. Each speech synthesis engine is assigned an identifier according to its type, such as language and male or female voice, and this information is recorded on the hard disk as, for example, an initialization file. The document processing apparatus refers to the initialization file and selects the speech synthesis engine whose identifier corresponds to the language.
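A rough sketch of this engine lookup follows. The specification does not give the layout of the initialization file, so the in-memory table keyed by (language, voice), the language codes, and the engine identifiers used here are all hypothetical.

```python
def select_engine(attribute, engines, voice="female"):
    """Pick an engine identifier from a Com=Lang=*** attribute (sketch)."""
    prefix = "Com=Lang="
    if not attribute.startswith(prefix):
        raise ValueError(f"not a language attribute: {attribute!r}")
    language = attribute[len(prefix):]
    # `engines` stands in for the contents of the initialization file.
    return engines[(language, voice)]
```

A table such as `{("JPN", "female"): "tts-ja-f"}` would play the role of the initialization file in this sketch.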

[0104] The document processing apparatus also converts the Com=begin_*** entries embedded in the voice reading file into a format suited to the speech synthesis engine. For example, the document processing apparatus marks Com=begin_p with a number in the 100s, such as Mark=100, marks Com=begin_s with a number in the 1000s, such as Mark=1000, and marks Com=begin_ph with a number in the 10000s, such as Mark=10000.
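The marking scheme can be illustrated with the sketch below. The text only states the number range used for each level; numbering successive markers of the same level consecutively (100, 101, ...) is an assumption of this sketch, as is the token-list representation.

```python
# First mark number for each level, as named in the text.
MARK_BASE = {"Com=begin_p": 100, "Com=begin_s": 1000, "Com=begin_ph": 10000}

def assign_marks(tokens):
    """Replace each Com=begin_*** token with a Mark=N token for its level."""
    next_number = dict(MARK_BASE)  # per-level counters (assumed consecutive)
    out = []
    for tok in tokens:
        if tok in next_number:
            out.append(f"Mark={next_number[tok]}")
            next_number[tok] += 1
        else:
            out.append(tok)
    return out
```

With consecutive numbering, a document with many paragraphs would leave the 100s range, so a real implementation would need a numbering rule that the specification does not spell out here.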

[0105] Furthermore, because the volume attribute information in the voice reading file is expressed, as in Vol=***, as a percentage increase relative to the default volume, the document processing apparatus converts this percentage into an absolute value on the basis of the attribute information.
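As a worked illustration of this conversion, the sketch below reads the percentage from a Vol=*** attribute and applies it to a default volume. The default volume scale and the function name `absolute_volume` are assumptions; the specification does not state what absolute scale the engine uses.

```python
def absolute_volume(attribute, default_volume):
    """Convert Vol=<percent increase> into an absolute volume (sketch)."""
    name, _, value = attribute.partition("=")
    if name != "Vol":
        raise ValueError(f"not a volume attribute: {attribute!r}")
    percent_increase = float(value)
    # An increase of P percent over the default gives default * (100 + P) / 100.
    return default_volume * (100 + percent_increase) / 100
```

Under these assumptions, Vol=80 with a default volume of 50 gives 50 × 180 / 100 = 90.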

[0106] By performing this processing with the voice reading file in step S3 shown in FIG. 4, the document processing apparatus converts the voice reading file into a format from which the speech synthesis engine can read the document aloud.

[0107] Next, operation through the user interface in step S4 in FIG. 4 will be described. The document processing apparatus starts the speech synthesis engine when the user operates, for example, the mouse of the input unit 20 and presses the read-aloud execution button 114 or the read-aloud execution button 144 shown in FIG. 5 or FIG. 6. The document processing apparatus then displays a user interface window 170 such as that shown in FIG. 12 on the display unit 31.

[0108] As shown in the figure, the user interface window 170 has a play button 171 for having the document read aloud, a stop button 172 for stopping the reading, and a pause button 173 for pausing the reading. The user interface window 170 also has a cue button 174, a rewind button 175, and a fast-forward button 176 for cueing, rewinding, and fast-forwarding in units of sentences; a cue button 177, a rewind button 178, and a fast-forward button 179 for doing so in units of paragraphs; and a cue button 180, a rewind button 181, and a fast-forward button 182 for doing so in units of phrases. In addition, the user interface window 170 has selection switches 183 and 184 for selecting whether the text to be read aloud is the full text or a summary created as described later. Although not shown here, the user interface window 170 may also have, for example, buttons for raising or lowering the volume, buttons for raising or lowering the reading speed, and buttons for changing the voice, such as between male and female voices.

[0109] The document processing apparatus performs the read-aloud operation with the speech synthesis engine when the user presses or selects these buttons and switches by operating, for example, the mouse of the input unit 20. For example, the document processing apparatus starts reading the document aloud when the user presses the play button 171, and when the user presses the cue button 174 partway through the reading, it jumps to the start position of the sentence currently being read and reads it again. Thanks to the marking performed in step S3 in FIG. 4, the document processing apparatus can make such jumps in units of marks during reading. That is, when the user presses the rewind button 178 or the fast-forward button 179 using, for example, the mouse of the input unit 20, the document processing apparatus identifies only the marks with numbers in the 100s, such as Mark=100, which indicate the start positions of paragraphs, and jumps to them. Similarly, when the user presses the rewind button 175 or the fast-forward button 176, or the rewind button 181 or the fast-forward button 182, the document processing apparatus identifies only the marks with numbers in the 1000s or in the 10000s, such as Mark=1000 or Mark=10000, which indicate the start positions of sentences or phrases, respectively, and jumps to them. By jumping in units of paragraphs, sentences, and phrases during reading in this way, the document processing apparatus can meet requests such as a user wanting to replay a desired part of a document repeatedly.
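The mark-based jumps can be sketched as below. The sketch assumes that marks are available as (position, mark number) pairs, that each level occupies the number range shown in `LEVEL_RANGES`, and that rewind and fast-forward move to the previous and next unit of the chosen level; none of these details is fixed by the text, and the names are hypothetical.

```python
import bisect

# Assumed number ranges per level, extrapolated from Mark=100 / 1000 / 10000.
LEVEL_RANGES = {
    "paragraph": range(100, 1000),
    "sentence": range(1000, 10000),
    "phrase": range(10000, 100000),
}

def jump_target(marks, current_pos, level, action):
    """Return the position to jump to, looking only at marks of one level."""
    positions = sorted(pos for pos, number in marks if number in LEVEL_RANGES[level])
    if not positions:
        return current_pos  # nothing to jump to at this level
    # Index of the unit that contains the current position.
    here = bisect.bisect_right(positions, current_pos) - 1
    offset = {"cue": 0, "rewind": -1, "forward": 1}[action]
    index = max(0, min(here + offset, len(positions) - 1))
    return positions[index]
```

Filtering by number range is what lets one button row ignore the marks belonging to the other two levels.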

[0110] In step S4, the document processing apparatus reads the document aloud with the speech synthesis engine in response to the user's operations through this user interface. The speech that is read aloud is output from the audio output unit 30.

[0111] In this way, the document processing apparatus can have the speech synthesis engine read a desired document aloud without sounding unnatural.

[0112] Next, read-aloud processing for the case where a summary of a document has been created will be described. First, the processing of summarizing a tagged document to create a summary will be described with reference to FIGS. 13 to 21.

[0113] To create a summary of a document in the document processing apparatus, the user operates the input unit 20 while the document is displayed on the display unit 31 and issues a command to execute the automatic summary creation mode. That is, under the control of the CPU 13, the document processing apparatus drives the hard disk drive 34 and starts the automatic summary creation program among the electronic document processing programs stored on the hard disk. The CPU 13 controls the display unit 31 to display the initial screen of the automatic summary creation program, as shown in FIG. 13. Here, the window 190 displayed on the display unit 31 is divided into a display area 200, in which a document name display section 191 showing the name of the document, a keyword input section 192 into which keywords are entered, a summary creation execution button 193 for creating a summary of the document, and the like are displayed; a display area 210, in which the document is displayed; and a display area 220, in which the summary of the document is displayed.

[0114] The document name display section 191 in the display area 200 shows the name and other details of the document displayed in the display area 210. Keywords for creating the summary of the document are entered into the keyword input section 192 using, for example, the keyboard of the input unit 20. The summary creation execution button 193 is an execution button that, when pressed using, for example, the mouse of the input unit 20, starts the summary creation processing for the document displayed in the display area 210.

[0115] The document is displayed in the display area 210. A scroll bar 211 and buttons 212 and 213 for moving the scroll bar 211 up and down are provided at the right edge of the display area 210. The user can scroll the content displayed in the display area 210 vertically by using, for example, the mouse of the input unit 20 to move the scroll bar 211 up and down directly or to press the buttons 212 and 213. By operating the input unit 20, the user can either select part of the document displayed in the display area 210 to be summarized or have the entire document summarized.

[0116] The summary is displayed in the display area 220. In FIG. 13, no summary has been created yet, so nothing is displayed in the display area 220. The user can change the display range (size) of the display area 220 by operating the input unit 20. Specifically, the user can enlarge the display range (size) of the display area 220 shown in FIG. 13 as shown, for example, in FIG. 14.

[0117] When the user presses the summary creation execution button 193 using, for example, the mouse of the input unit 20 to turn it on, the document processing apparatus executes the processing shown in FIG. 15 under the control of the CPU 13 and starts creating the summary.

[0118] The processing of creating a summary from a document is executed on the basis of the tagging relating to the internal structure of the document. In the document processing apparatus, the size of the display area 220 of the window 190 can be changed, as shown earlier in FIG. 14. When the summary creation execution button 193 is operated after the window 190 has been newly drawn on the display unit 31 or after the size of the display area 220 has been changed, the document processing apparatus, under the control of the CPU 13, creates a summary that fits the display area 220 from the document at least part of which is displayed in the display area 210 of the window 190.

[0119] First, as shown in FIG. 15, in step S21 the document processing apparatus performs, under the control of the CPU 13, processing called active diffusion (spreading activation). In the present embodiment, the document is summarized by adopting the central activation values obtained by active diffusion as degrees of importance. That is, in a document tagged with respect to its internal structure, performing active diffusion gives each element a central activation value that reflects that tagging.

[0120] Here, active diffusion is processing that also gives high central activation values to elements related to elements with high central activation values. That is, under active diffusion, the central activation values of an element expressed anaphorically (anaphora; coreference) and its antecedent become equal, and otherwise the central activation values converge toward the same value. Because these central activation values are determined according to the tagging relating to the internal structure of the document, they can be used for analyzing the document with its internal structure taken into account.

[0121] The document processing apparatus executes active diffusion through the series of steps shown in FIG. 16.

[0122] First, as shown in the figure, in step S41 the document processing apparatus initializes each element under the control of the CPU 13. The document processing apparatus assigns initial central activation values to all elements other than vocabulary elements and to the vocabulary elements. For example, as the initial central activation value it assigns “1” to all elements other than vocabulary elements and “0” to vocabulary elements. The document processing apparatus can also assign non-uniform initial central activation values to the elements in advance so that the bias in the initial values is reflected in the central activation values obtained by active diffusion. For example, by setting a high initial central activation value for elements in which the user is interested, the document processing apparatus can obtain central activation values that reflect the user's interests.

[0123] For reference links, which are links between elements in a dependency relationship of referring and being referred to, and for normal links, which are all other links, the endpoint activation value of each endpoint of a link connecting elements is set to “0”. The document processing apparatus stores the initial endpoint activation values assigned in this way in, for example, the RAM 14.
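The initialization in step S41 can be sketched as follows. The dictionary-based representation, the `interest` override used to model a user's bias, and the function names are assumptions made for illustration only.

```python
def initial_center_values(elements, interest=None):
    """Initial central activation values: 1 for non-vocabulary, 0 for vocabulary.

    `elements` maps an element name to True if it is a vocabulary element.
    `interest` optionally overrides the initial value for chosen elements.
    """
    interest = interest or {}
    values = {}
    for name, is_vocabulary in elements.items():
        default = 0.0 if is_vocabulary else 1.0
        values[name] = interest.get(name, default)
    return values

def initial_endpoint_values(links):
    """Both endpoints of every link start at 0, whatever the link type."""
    values = {}
    for a, b in links:
        values[(a, b)] = 0.0  # endpoint on the side of element a
        values[(b, a)] = 0.0  # endpoint on the side of element b
    return values
```

Raising the initial value of a vocabulary element through `interest` is one way to model the user-interest bias mentioned above.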

[0124] FIG. 17 shows an example of the structure connecting elements. In the figure, an element E_i and an element E_j are shown as part of the structure of elements and links that makes up the document. The elements E_i and E_j have central activation values e_i and e_j, respectively, and are connected by a link L_ij. The endpoint of the link L_ij connected to the element E_i is T_ij, and the endpoint connected to the element E_j is T_ji. In addition to the element E_j connected by the link L_ij, the element E_i is connected by links L_ik, L_il, and L_im to elements E_k, E_l, and E_m (not shown), respectively. In addition to the element E_i connected by the link L_ji, the element E_j is connected by links L_jp, L_jq, and L_jr to elements E_p, E_q, and E_r (not shown), respectively.

[0125] Subsequently, in step S42 in FIG. 16, the document processing apparatus initializes, under the control of the CPU 13, the counter that counts the elements E_i making up the document. That is, the document processing apparatus sets the counter value i of the element counter to “1”. The counter thus refers to the first element E_1.

[0126] Subsequently, in step S43, the document processing apparatus executes, under the control of the CPU 13, link processing that calculates a new central activation value for the element referred to by the counter. This link processing is described in more detail later.

[0127] Subsequently, in step S44, the document processing apparatus determines, under the control of the CPU 13, whether the calculation of new central activation values has been completed for all elements in the document.

[0128] If the document processing apparatus determines that the calculation of new central activation values has been completed for all elements in the document, it proceeds to step S45; if it determines that the calculation has not been completed for all elements, it proceeds to step S47.

[0129] Specifically, the document processing apparatus determines, under the control of the CPU 13, whether the counter value i has reached the total number of elements in the document. If it determines that the counter value i has reached the total number of elements, it treats all elements as having been calculated and proceeds to step S45. If it determines that the counter value i has not reached the total number of elements, it treats the calculation as not yet finished for all elements and proceeds to step S47.

[0130] If the document processing apparatus determines that the counter value i has not reached the total number of elements in the document, then in step S47, under the control of the CPU 13, it increments the counter value i by “1”, making the counter value “i+1”. The counter thus refers to the (i+1)-th element, that is, the next element. The document processing apparatus then returns to step S43, and the calculation of the endpoint activation values and the series of steps that follow are executed for the (i+1)-th element.

[0131] If the document processing apparatus determines that the counter value i has reached the total number of elements in the document, then in step S45, under the control of the CPU 13, it calculates the average of the changes in the central activation values of all elements in the document, that is, the changes of the newly calculated central activation values relative to the original central activation values.

[0132] Under the control of the CPU 13, the document processing apparatus reads out, for all elements in the document, the original central activation values and the newly calculated central activation values stored in, for example, the RAM 14. It divides the sum of the changes of the newly calculated central activation values relative to the original ones by the total number of elements in the document, thereby calculating the average change in central activation value over all elements. The document processing apparatus stores the average calculated in this way in, for example, the RAM 14.
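The averaging in step S45 amounts to the short computation sketched below. Taking the absolute value of each change is an assumption of this sketch (the text speaks only of "changes"), and the dictionary representation and function name are hypothetical.

```python
def mean_change(old_values, new_values):
    """Average change in central activation value over all elements (sketch)."""
    total = sum(abs(new_values[name] - old_values[name]) for name in old_values)
    return total / len(old_values)
```

The result is the quantity that step S46 compares against the preset threshold.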

[0133] Then, in step S46, the document processing apparatus determines, under the control of the CPU 13, whether the average change in the central activation values of all elements calculated in step S45 is within a preset threshold. If it determines that the change is within the threshold, it ends this series of steps. If it determines that the change is not within the threshold, it returns to step S42, sets the counter value i to “1”, and executes again the series of steps that calculate the central activation values of the elements of the document. In the document processing apparatus, the change gradually decreases each time the loop of steps S42 to S46 is repeated.

[0134] The document processing apparatus can perform active diffusion in this way. Next, the link processing executed in step S43 to perform this active diffusion will be described with reference to FIG. 18. The flowchart in the figure shows the processing for one element E_i, but this processing is performed for all elements.

[0135] First, as shown in the figure, in step S51 the document processing apparatus initializes, under the control of the CPU 13, the counter that counts the links having one end connected to an element E_i of the document. That is, the document processing apparatus sets the count value j of the link counter to “1”. This counter thus refers to the first link L_ij connected to the element E_i.

[0136] Subsequently, in step S52, the document processing apparatus determines, under the control of the CPU 13, whether the link L_ij connecting the elements E_i and E_j is a normal link by referring to its relation attribute tag. The document processing apparatus determines whether the link L_ij is a normal link, which indicates a relationship among vocabulary elements corresponding to words, sentence elements corresponding to sentences, paragraph elements corresponding to paragraphs, and the like, or a reference link, which indicates a dependency relationship of referring and being referred to. If it determines that the link L_ij is a normal link, it proceeds to step S53; if it determines that the link L_ij is a reference link, it proceeds to step S54.

[0137] If the document processing apparatus determines that the link L_ij is a normal link, then in step S53 it calculates a new endpoint activation value for the endpoint T_ij of the element E_i connected to the normal link L_ij.

[0138] In step S53, the determination in step S52 has established that the link L_ij is a normal link. The new endpoint activation value t_ij of the endpoint T_ij of the element E_i connected to the normal link L_ij is obtained by adding together the endpoint activation values t_jp, t_jq, and t_jr of all endpoints T_jp, T_jq, and T_jr of the element E_j that are connected to links other than the link L_ij and the central activation value e_j of the element E_j to which the element E_i is connected by the link L_ij, and then dividing the resulting sum by the total number of elements in the document.
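Written as a formula, the normal-link rule of step S53 reads as follows for the configuration of FIG. 17. The symbol N for the total number of elements in the document and the prime on the new value are notation introduced here for illustration.

```latex
t_{ij}' = \frac{1}{N}\Bigl( e_j + \sum_{k \in \{p,\,q,\,r\}} t_{jk} \Bigr)
        = \frac{e_j + t_{jp} + t_{jq} + t_{jr}}{N}
```

The sum runs over the endpoints of E_j on its links other than L_ij, so the contribution that E_i itself sent to E_j is not fed back to E_i.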

[0139] Under the control of the CPU 13, the document processing apparatus reads out the necessary endpoint activation values and central activation value from, for example, the RAM 14. Using the values read out, it calculates the new endpoint activation value of the endpoint connected to the normal link as described above. The document processing apparatus then stores the new endpoint activation value calculated in this way in, for example, the RAM 14.

[0140] If, on the other hand, the document processing apparatus determines that the link L_ij is not a normal link, then in step S54 it calculates the endpoint activation value of the endpoint T_ij of the element E_i connected to the reference link.

[0141] In step S54, the determination in step S52 has established that the link L_ij is a reference link. The endpoint activation value t_ij of the endpoint T_ij of the element E_i connected to the reference link L_ij is obtained by adding together the endpoint activation values t_jp, t_jq, and t_jr of all endpoints T_jp, T_jq, and T_jr of the element E_j that are connected to links other than the link L_ij and the central activation value e_j of the element E_j to which the element E_i is connected by the link L_ij.
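In formula form, the reference-link rule of step S54 for the configuration of FIG. 17 is the same sum as for a normal link but without the division by the total number of elements. The prime marking the new value is notation added here for illustration.

```latex
t_{ij}' = e_j + \sum_{k \in \{p,\,q,\,r\}} t_{jk}
        = e_j + t_{jp} + t_{jq} + t_{jr}
```

Because nothing is divided out, a reference link passes the activation of E_j to E_i at full strength, which fits the earlier statement that an anaphoric element and its antecedent end up with equal central activation values.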

[0142] Under the control of the CPU 13, the document processing apparatus reads out the necessary endpoint activation values and central activation value from the endpoint activation values and central activation values stored in, for example, the RAM 14. Using the values read out, it calculates the new endpoint activation value of the endpoint connected to the reference link as described above. The document processing apparatus then stores the endpoint activation value calculated in this way in, for example, the RAM 14.

[0143] The normal link processing in step S53 and the reference link processing in step S54 are executed for every link L_ij connected to the element E_i referred to by the count value i, as indicated by the loop that runs from step S52 to step S55 and returns to step S52 via step S57. In step S57, the count value j, which counts the links connected to the element E_i, is incremented.

[0144] After performing the processing of step S53 or step S54, the document processing apparatus determines in step S55, under the control of the CPU 13, whether endpoint activation values have been calculated for all links connected to the element E_i. If it determines that endpoint activation values have been calculated for all links, it proceeds to step S56; if it determines that they have not been calculated for all links, it proceeds to step S57.

[0145] When the document processing apparatus determines that endpoint activation values have been calculated for all the links, it updates the central activation value e_i of the element E_i in step S56 under the control of the CPU 13.

[0146] The new value, that is, the updated value, of the central activation value e_i of the element E_i is obtained as the sum of the current central activation value e_i of the element E_i and the new endpoint activation values of all the endpoints of the element E_i: e_i' = e_i + Σ_j t_j'. Here, the prime (') denotes a new value. In other words, the new central activation value is obtained by adding the sum of the new endpoint activation values of the element's endpoints to the element's original central activation value.
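The update rule described above can be sketched in Python as follows. This is only a minimal illustration, assuming the new endpoint activation values t_j' have already been calculated for every link of the element; the function name and the use of a plain sequence of floats are hypothetical and are not taken from the specification.

```python
from typing import Iterable


def update_central_activation(current: float,
                              new_endpoint_values: Iterable[float]) -> float:
    """Return e_i' = e_i + sum_j t_j' for a single element E_i."""
    total = 0.0
    # One iteration per link L_ij of the element (the count value j in the text).
    for t_new in new_endpoint_values:
        total += t_new
    return current + total
```

For example, an element with a current central activation value of 0.5 and two new endpoint activation values of 0.25 each would be updated to 1.0 under this sketch.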

[0147] Under the control of the CPU 13, the document processing apparatus reads the necessary endpoint activation values from, for example, the endpoint activation values and central activation values stored in the RAM 14. The document processing apparatus performs the calculation described above to obtain the central activation value e_i of the element E_i, and then stores the newly calculated central activation value e_i in, for example, the RAM 14.

[0148] In this way, the document processing apparatus calculates a new central activation value for each element in the document, and thereby carries out the spreading activation of step S21 in FIG. 15.

[0149] Next, in step S22 in FIG. 15, under the control of the CPU 13, the document processing apparatus sets W_s to the size of the display area 220 of the window 190 displayed on the display unit 31 shown earlier in FIG. 13, that is, to the maximum number of characters that can be displayed in the display area 220. The document processing apparatus also initializes the summary S under the control of the CPU 13, setting the initial value S_0 = "". This indicates that the summary does not yet contain any character string. The document processing apparatus stores the maximum number of characters W_s and the initial value S_0 of the summary S, set in this way, in, for example, the RAM 14.

[0150] Next, in step S23, under the control of the CPU 13, the document processing apparatus sets to "1" the count value i of a counter that counts the successive creation of sentence skeletons for the summary. That is, it sets i = 1. The document processing apparatus stores the count value i set in this way in, for example, the RAM 14.

[0151] Next, in step S24, under the control of the CPU 13, the document processing apparatus extracts, for the count value i of the counter, the skeleton of the sentence with the i-th highest average central activation value from the text to be summarized. Here, the average central activation value is the average of the central activation values of the elements that make up one sentence. The document processing apparatus reads the summary S_{i-1} stored in, for example, the RAM 14, and appends the character string of the extracted sentence skeleton to S_{i-1} to obtain the summary S_i. It then stores the resulting summary S_i in, for example, the RAM 14. At the same time, the document processing apparatus creates a list l_i of the elements not included in the sentence skeleton, ordered by central activation value, and stores the list l_i in, for example, the RAM 14.

[0152] That is, in step S24, under the control of the CPU 13, the document processing apparatus uses the result of the spreading activation to select sentences in descending order of average central activation value and extracts the skeleton of each selected sentence. The skeleton of a sentence consists of the essential elements extracted from the sentence. The elements that can be essential elements are: the head of an element; elements having a relational attribute of subject, object, indirect object, possessor, cause, condition, or comparison; and, when the element associated with a coordinate structure is an essential element, the elements directly contained in that coordinate structure. The document processing apparatus joins the essential elements of the sentence to generate the sentence skeleton and adds it to the summary.
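The sentence ranking and skeleton extraction of step S24 can be sketched as follows. This is a simplified illustration under stated assumptions: the `Element` class and its field names are hypothetical, each sentence is assumed to be a non-empty flat list of elements, and the coordinate-structure case is omitted for brevity.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Relational attributes that make an element essential, as listed in the text.
ESSENTIAL_RELATIONS = {
    "subject", "object", "indirect object", "possessor",
    "cause", "condition", "comparison",
}


@dataclass
class Element:
    text: str
    activation: float      # central activation value of the element
    relation: str = ""     # relational attribute, if any (hypothetical field)
    is_head: bool = False  # True if the element is a head (hypothetical field)


def average_activation(sentence: List[Element]) -> float:
    """Average central activation value of one non-empty sentence."""
    return sum(e.activation for e in sentence) / len(sentence)


def rank_sentences(sentences: List[List[Element]]) -> List[List[Element]]:
    """Sentences in descending order of average central activation value."""
    return sorted(sentences, key=average_activation, reverse=True)


def split_skeleton(sentence: List[Element]) -> Tuple[List[Element], List[Element]]:
    """Return (skeleton, rest): essential elements in sentence order, and the
    remaining elements sorted by descending central activation (the list l_i)."""
    skeleton, rest = [], []
    for e in sentence:
        essential = e.is_head or e.relation in ESSENTIAL_RELATIONS
        (skeleton if essential else rest).append(e)
    rest.sort(key=lambda e: e.activation, reverse=True)
    return skeleton, rest
```

Keeping the skeleton in sentence order, while sorting only the leftover elements, mirrors the description: the skeleton is joined into text, whereas the list l_i is consulted by activation value.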

[0153] Next, in step S25, under the control of the CPU 13, the document processing apparatus determines whether the length of the summary S_i, that is, its number of characters, exceeds the maximum number of characters W_s of the display area 220 of the window 190.

[0154] If the document processing apparatus determines that the number of characters in the summary S_i exceeds the maximum number of characters W_s, then in step S30, under the control of the CPU 13, it sets the summary S_{i-1} as the final summary and ends the series of processing. When this happens at i = 1, the summary output is S_{i-1} = S_0 = "", so no summary is displayed in the display area 220.

[0155] On the other hand, if the document processing apparatus determines that the number of characters in the summary S_i does not exceed the maximum number of characters W_s, the process proceeds to step S26. There, under the control of the CPU 13, it compares the central activation value of the sentence with the (i+1)-th highest average central activation value against the central activation value of the element with the highest central activation value among the elements in the list l_i created in step S24. If it determines that the central activation value of that sentence is higher than that of the element with the highest central activation value in the list l_i, the process proceeds to step S28. If it determines that it is not higher, the process proceeds to step S27.

[0156] If the document processing apparatus determines that the central activation value of the sentence with the (i+1)-th highest average central activation value is not higher than that of the element with the highest central activation value among the elements in the list l_i, then in step S27, under the control of the CPU 13, it increments the count value i of the counter by "1" and returns the processing to step S24.

[0157] If the document processing apparatus determines that the central activation value of the sentence with the (i+1)-th highest average central activation value is higher than that of the element with the highest central activation value among the elements in the list l_i, then in step S28, under the control of the CPU 13, it adds the element e with the highest central activation value among the elements in the list l_i to the summary S_i to generate SS_i, and deletes the element e from the list l_i. The document processing apparatus then stores the summary SS_i generated in this way in, for example, the RAM 14.

[0158] Next, in step S29, under the control of the CPU 13, the document processing apparatus determines whether the number of characters in the summary SS_i exceeds the maximum number of characters W_s of the display area 220 of the window 190. If it determines that the number of characters in SS_i does not exceed W_s, it repeats the processing from step S26. If it determines that the number of characters in SS_i exceeds W_s, then in step S31, under the control of the CPU 13, it sets the summary S_i as the final summary, displays it in the display area 220, and ends the series of processing. In this way, the document processing apparatus generates a summary that does not exceed the maximum number of characters W_s.

[0159] By performing this series of processing, the document processing apparatus can summarize a tagged document and create a summary. For example, when summarizing the document shown in FIG. 13, the document processing apparatus creates a summary such as that shown in FIG. 19 and displays it in the display area 220.

[0160] That is, the document processing apparatus creates the following summary and displays it in the display area 220: "The history of TCP/IP cannot be told without ARPANET. ARPANET started in 1969 as a small network connecting the host computers of four universities and research institutes on the west coast of North America with 50 kbps lines. At that time, in 1964, a series of general-purpose mainframe computers had been developed. Given this historical background, such a project, which anticipated the future flourishing of computer communications, can truly be said to have been uniquely American."

[0161] With the document processing apparatus, instead of reading through the full text of a document, the user can read this summary to grasp the gist of the text and judge whether it contains the desired information.

[0162] Note that the document processing apparatus does not necessarily have to use spreading activation as described above to assign importance to the elements in a document. For example, as proposed by Zechner, words may be weighted by the tf*idf method, and the sum of the weights of the words appearing in a document may be used as the importance of the document. Details of this method are given in K. Zechner, "Fast generation of abstracts from general domain text corpora by extracting relevant sentences," in Proc. of the 16th International Conference on Computational Linguistics, pp. 986-989, 1996. Methods other than these may also be used to assign importance. Furthermore, importance can also be set on the basis of a keyword entered in the keyword input section 192 of the display area 200.
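The tf*idf alternative mentioned above can be sketched as follows. This is a minimal illustration, not the method of the cited paper: it assumes the plain idf variant log(N / df), documents already split into word tokens, and hypothetical function names.

```python
import math
from collections import Counter
from typing import Dict, List


def tfidf_weights(doc: List[str], corpus: List[List[str]]) -> Dict[str, float]:
    """Weight each word of `doc` by tf * idf, with idf = log(N / df) (assumed variant)."""
    n_docs = len(corpus)
    weights = {}
    for word, count in Counter(doc).items():
        # Document frequency: number of corpus documents containing the word.
        df = sum(1 for d in corpus if word in d)
        # max(df, 1) avoids division by zero if `doc` is not part of `corpus`.
        weights[word] = count * math.log(n_docs / max(df, 1))
    return weights


def importance(tokens: List[str], weights: Dict[str, float]) -> float:
    """Importance of a text: the sum of the weights of the words appearing in it."""
    return sum(weights.get(word, 0.0) for word in tokens)
```

A word that occurs in every document of the corpus receives a weight of zero under this variant, so only the more distinctive words contribute to the importance score.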

[0163] As shown earlier in FIG. 14, the document processing apparatus can enlarge the display range of the display area 220 of the window 190 displayed on the display unit 31. If the display range of the display area 220 is changed while the created summary is displayed in the display area 220, the amount of information in the summary can be changed in accordance with the display range. In this case, the document processing apparatus performs the processing shown in FIG. 20.

[0164] That is, as shown in FIG. 20, in step S61, under the control of the CPU 13, the document processing apparatus waits until the display range of the display area 220 of the window 190 displayed on the display unit 31 is changed in response to the user operating the input unit 20.

[0165] When the display range of the display area 220 is changed, the document processing apparatus proceeds to step S62 and, under the control of the CPU 13, measures the display range of the display area 220.

[0166] The processing performed in the subsequent steps S63 to S65 is the same as the processing performed from step S22 onward in FIG. 15; a summary corresponding to the display range of the display area 220 is created, and the processing ends.

[0167] That is, in step S63, under the control of the CPU 13, the document processing apparatus determines the total number of characters of the summary to be displayed in the display area 220 on the basis of the measured display range of the display area 220 and a character size specified in advance.

[0168] Next, in step S64, under the control of the CPU 13, the document processing apparatus selects sentences or words from the RAM 14 in descending order of importance so that the summary to be created does not exceed the number of characters determined in step S63.

[0169] Then, in step S65, under the control of the CPU 13, the document processing apparatus joins the sentences or words selected in step S64 to create a summary and displays it in the display area 220 of the display unit 31.
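Steps S63 to S65 can be sketched as follows. This is only an illustration under several assumptions that the text does not specify: the character budget is taken as characters per line times number of lines for a square character cell, selection stops at the first unit that no longer fits, and the selected units are joined in their original document order. All names are hypothetical.

```python
from typing import List, Tuple


def max_characters(width_px: int, height_px: int, char_px: int) -> int:
    """Step S63 (assumed rule): characters per line times number of lines."""
    return (width_px // char_px) * (height_px // char_px)


def build_summary(units: List[Tuple[str, float]], limit: int) -> str:
    """Steps S64-S65: choose (text, importance) units by descending importance
    without exceeding `limit` characters, then join them in document order."""
    ranked = sorted(range(len(units)), key=lambda k: units[k][1], reverse=True)
    chosen, used = [], 0
    for k in ranked:
        length = len(units[k][0])
        if used + length > limit:
            break  # assumed: stop at the first unit that would overflow
        chosen.append(k)
        used += length
    return "".join(units[k][0] for k in sorted(chosen))
```

Calling `build_summary` again with a larger `limit`, as would happen after the display area is enlarged, admits lower-ranked units and so yields a longer summary from the same importance values.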

[0170] By performing such processing, the document processing apparatus can create a new summary suited to the display range of the display area 220. For example, when the user enlarges the display range of the display area 220 by dragging with the mouse of the input unit 20, the document processing apparatus creates a new, more detailed summary and displays it in the display area 220 of the window 190, as shown in FIG. 21.

[0171] That is, the document processing apparatus creates the following summary and displays it in the display area 220: "The history of TCP/IP cannot be told without ARPANET. ARPANET is an experimental and research packet-switched network built under the sponsorship of the Defense Advanced Research Projects Agency of the U.S. Department of Defense (DOD). ARPANET started in 1969 as a very small network connecting the host computers of four universities and research institutes on the west coast of North America with 50 kbps lines. At that time, ENIAC, the world's first computer, had been developed at the University of Pennsylvania in 1945, and in 1964 the first series of general-purpose mainframe computers implementing ICs as logic elements had been developed; the computer had only just been born. Given this historical background, such a project, which anticipated the future flourishing of computer communications, can truly be said to have been uniquely American."

[0172] Thus, with the document processing apparatus, when the displayed summary is too brief for the user to grasp the outline of the document, the user can enlarge the display range of the display area 220 to view a more detailed summary containing more information.

[0173] When creating a summary of a document in this way, the document processing apparatus can read the document or the summary aloud through the series of steps shown in FIG. 22 once the CPU 13 starts the text-to-speech program among the electronic document processing programs recorded in the ROM 15 or on the hard disk. Here, the description uses the document shown earlier in FIG. 6 as an example.

[0174] First, as shown in FIG. 22, the document processing apparatus receives a tagged document in step S71. As described above, this document has been given the tags needed for speech synthesis and is configured as the tag file shown in FIG. 8. Alternatively, the document processing apparatus may receive a tagged document and create the document by newly adding the tags needed for speech synthesis. The document processing apparatus may also receive an untagged document, tag it, including the tags needed for speech synthesis, and create a tag file. This step corresponds to step S1 in FIG. 4.

[0175] Next, in step S72, under the control of the CPU 13, the document processing apparatus creates a summary of the document by the method described above. Because the document from which the summary is derived has been tagged as described for step S71, the created summary also carries the tags corresponding to the document.

[0176] Next, in step S73, under the control of the CPU 13, the document processing apparatus generates, on the basis of the tag file, a read-aloud file covering the entire content of the document. This read-aloud file is generated by deriving attribute information for reading aloud from the tags in the tag file and embedding that attribute information.

[0177] At this time, as described above, attribute information of the form Com=Vol=***, which indicates the volume for reading aloud, is embedded in the read-aloud file. Of the entire content of the document, the document processing apparatus embeds, element by element, the attribute information Com=Vol=80 at the start position of each part included in the summary created in step S72, and embeds the attribute information Com=Vol=0 at the start position of each other part. That is, the document processing apparatus reads the parts included in the summary at a volume 80% higher than the default volume. The volume does not have to be the default volume increased by 80% and can be changed as appropriate. In this way, the document processing apparatus can emphasize the parts included in the summary when reading aloud as well, drawing the user's attention to them. This step corresponds to step S2 in FIG. 4.
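The embedding of the volume attribute can be sketched as follows. Only the payloads Com=Vol=80 and Com=Vol=0 come from the text; the square-bracket marker syntax, the function name, and the representation of the document as a list of element strings are hypothetical choices made for this illustration.

```python
from typing import List, Set


def embed_volume(elements: List[str], summary_indices: Set[int],
                 emphasis: int = 80) -> str:
    """Prefix every element with a volume attribute: `emphasis` percent above
    the default for elements that appear in the summary, 0 otherwise."""
    parts = []
    for index, text in enumerate(elements):
        volume = emphasis if index in summary_indices else 0
        # Hypothetical marker syntax; only "Com=Vol=<n>" is given in the text.
        parts.append(f"[Com=Vol={volume}]{text}")
    return "".join(parts)
```

Because the emphasis level is a parameter, a value other than 80 can be supplied, which matches the statement that the increase can be changed as appropriate.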

[0178] Next, in step S74, under the control of the CPU 13, the document processing apparatus uses the read-aloud file to perform processing suited to the speech synthesis engine stored in advance in the ROM 15, on the hard disk, or the like. This step corresponds to step S3 in FIG. 4.

[0179] Then, in step S75, the document processing apparatus performs processing in response to operations the user performs through the user interface described above. This step corresponds to step S4 in FIG. 4. For example, when the user uses the mouse or the like of the input unit 20 to select the selection switch 184 of the user interface window 170 shown earlier in FIG. 12, the document processing apparatus can make the summary created in step S72 the target of reading aloud. In this case, the document processing apparatus can start reading the summary aloud when the user presses the play button 171 using the mouse or the like of the input unit 20. When the user uses the mouse or the like of the input unit 20 to select the selection switch 183 and presses the play button 171, the document processing apparatus starts reading the document aloud as described above. At this time, on the basis of the attribute information embedded in the read-aloud file in step S73, the document processing apparatus reads the parts included in the summary at an increased volume.

[0180] By performing such processing, the document processing apparatus can read aloud a given document or a created summary. When reading a given document aloud, the document processing apparatus can also vary the manner of reading according to the created summary.

[0181] As described above, the document processing apparatus automatically generates a read-aloud file from a given document and can read aloud the document, or a summary created from it, using an appropriate speech synthesis engine.

[0182] The present invention is not limited to the embodiment described above. For example, the tagging of documents and of read-aloud files is, of course, not limited to that described above.

[0183] In the embodiment described above, a document is transmitted to the communication unit 22 from outside over a telephone line, but the present invention is not limited to this. For example, the invention is also applicable when a document is transmitted via a satellite or the like; the document may also be read from the recording medium 33 by the recording/reproducing unit 32, or recorded in advance in the ROM 15.

[0184] Furthermore, in the embodiment described above, a read-aloud file is generated from the received or created tag file, but the document may instead be read aloud directly on the basis of the tag file without generating such a read-aloud file.

[0185] In this case, after receiving or creating the tag file, the document processing apparatus uses the speech synthesis engine to identify paragraphs, sentences, and phrases on the basis of the tags in the tag file that indicate paragraphs, sentences, and phrases, and reads the document aloud with a predetermined pause at the start position of each paragraph, sentence, and phrase. As described above, the tag file carries attribute information for prohibiting reading aloud and attribute information indicating readings (kana) or pronunciations; the document processing apparatus removes the parts for which reading aloud is prohibited and substitutes the correct readings or pronunciations when reading aloud. In addition, when the user operates the user interface described above during reading, the document processing apparatus can cue, fast-forward, or rewind the reading in units of paragraphs, sentences, and phrases on the basis of the tags in the tag file that indicate paragraphs, sentences, and phrases.
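Reading directly from the tag file, as described above, can be sketched as the construction of a simple reading plan. This is an illustration only: the `Unit` class, its field names, the flattening of the tag hierarchy into a list of units, and the pause lengths are all hypothetical, since the text says only that the pauses are predetermined.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple, Union

# Hypothetical pause lengths in milliseconds for each kind of start position.
PAUSE_MS = {"paragraph": 600, "sentence": 400, "phrase": 100}


@dataclass
class Unit:
    level: str                           # "paragraph", "sentence" or "phrase"
    text: str
    pronunciation: Optional[str] = None  # reading or pronunciation attribute
    read_out: bool = True                # False when reading aloud is prohibited


def reading_plan(units: List[Unit]) -> List[Tuple[str, Union[int, str]]]:
    """Turn tagged units into ("pause", ms) and ("speak", text) commands."""
    plan: List[Tuple[str, Union[int, str]]] = []
    for unit in units:
        if not unit.read_out:
            continue  # remove parts for which reading aloud is prohibited
        plan.append(("pause", PAUSE_MS[unit.level]))
        # Substitute the given reading or pronunciation when one is present.
        plan.append(("speak", unit.pronunciation or unit.text))
    return plan
```

Because every unit begins with its own pause command, the positions of the pause entries in the plan could also serve as jump targets for cueing, fast-forwarding, or rewinding by paragraph, sentence, or phrase.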

[0186] In this way, the document processing apparatus can read a document aloud directly on the basis of the tag file without generating a read-aloud file.

[0187] Furthermore, in the present invention, it is also easy to provide, as the recording medium 33, a disk-shaped recording medium, a tape-shaped recording medium, or the like on which the electronic document processing program described above is written.

[0188] In the embodiment described above, the mouse of the input unit 20 was given as an example of a device for operating the various windows displayed on the display unit 31, but the present invention is, needless to say, not limited to this. For example, a tablet, a light pen, or the like can also be used as such a device.

[0189] Furthermore, although Japanese and English documents were given as examples in the embodiment described above, the present invention is of course applicable to any language.

[0190] Thus, it goes without saying that the present invention can be modified as appropriate without departing from its spirit.

[0191]

[Effects of the Invention] As described in detail above, the electronic document processing method according to the present invention includes a read-aloud file generating step of generating, on the basis of an electronic document, a read-aloud file for reading the document aloud by speech synthesis.

[0192] Accordingly, the electronic document processing method according to the present invention makes it possible to read an electronic document aloud by generating a read-aloud file on the basis of the electronic document.

[0193] The electronic document processing method according to the present invention also includes a document input step of inputting an electronic document that has an internal structure in which a plurality of elements are arranged hierarchically and to which tag information indicating this internal structure has been added in advance, and a document reading step of reading the electronic document aloud by speech synthesis on the basis of the tag information.

[0194] Therefore, the electronic document processing method according to the present invention makes it possible to input an electronic document to which tag information indicating an internal structure in which a plurality of elements are hierarchized has been added in advance, and to read the electronic document aloud directly based on the tag information.
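One way to picture reading directly from tag information is to index where each paragraph, sentence, and phrase begins, so that a reader can jump back to the head of the current unit or forward to the next one. The Python sketch below does this over a word list. The tag names and the helper names `index_units`, `rewind`, and `fast_forward` are assumptions made for this example, not names taken from the embodiment.

```python
import bisect
import xml.etree.ElementTree as ET


def index_units(tagged_xml, levels=("paragraph", "sentence", "phrase")):
    """Return (words, starts); starts[level] lists the word offsets
    at which each unit of that level begins."""
    root = ET.fromstring(tagged_xml)
    words = []
    starts = {level: [] for level in levels}

    def visit(elem):
        if elem.tag in starts:
            starts[elem.tag].append(len(words))
        words.extend((elem.text or "").split())
        for child in elem:
            visit(child)
            words.extend((child.tail or "").split())

    visit(root)
    return words, starts


def rewind(starts, level, position):
    """Offset of the head of the unit that contains `position`."""
    i = bisect.bisect_right(starts[level], position) - 1
    return starts[level][max(i, 0)]


def fast_forward(starts, level, position):
    """Offset of the next unit after `position`, or None at the end."""
    i = bisect.bisect_right(starts[level], position)
    return starts[level][i] if i < len(starts[level]) else None


sample = (
    "<document><paragraph>"
    "<sentence><phrase>Read this</phrase><phrase>out loud.</phrase></sentence>"
    "<sentence><phrase>Then stop.</phrase></sentence>"
    "</paragraph></document>"
)
words, starts = index_units(sample)
print(words)
print(starts)
```

Keeping the start offsets sorted per level lets each jump be a binary search, which is a design choice of this sketch rather than something the text prescribes.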

[0195] Further, the electronic document processing apparatus according to the present invention includes voice reading file generating means for generating, based on an electronic document, a voice reading file for reading the electronic document aloud by speech synthesis.

[0196] Therefore, the electronic document processing apparatus according to the present invention can generate a voice reading file based on an electronic document, and can read the electronic document aloud using this voice reading file.

[0197] Furthermore, the electronic document processing apparatus according to the present invention includes document input means for inputting an electronic document that has an internal structure in which a plurality of elements are hierarchized and to which tag information indicating the internal structure has been added in advance, and document reading means for reading the electronic document aloud by speech synthesis based on the tag information.

[0198] Therefore, the electronic document processing apparatus according to the present invention can input an electronic document to which tag information indicating an internal structure in which a plurality of elements are hierarchized has been added in advance, and can read the electronic document aloud directly based on the tag information added to it.

[0199] Further, in the recording medium on which the electronic document processing program according to the present invention is recorded, the electronic document processing program includes a voice reading file generating step of generating, based on an electronic document, a voice reading file for reading the electronic document aloud by speech synthesis.

[0200] Therefore, the recording medium on which the electronic document processing program according to the present invention is recorded can provide an electronic document processing program that generates a voice reading file and reads an electronic document aloud. An apparatus provided with this electronic document processing program is thus able to read an electronic document aloud.

[0201] Further, in the recording medium on which the electronic document processing program according to the present invention is recorded, the electronic document processing program includes a document input step of inputting an electronic document that has an internal structure in which a plurality of elements are hierarchized and to which tag information indicating the internal structure has been added in advance, and a document reading step of reading the electronic document aloud by speech synthesis based on the tag information.

[0202] Therefore, the recording medium on which the electronic document processing program according to the present invention is recorded can provide an electronic document processing program that inputs an electronic document to which tag information indicating an internal structure in which a plurality of elements are hierarchized has been added in advance, and reads the electronic document aloud directly based on the tag information. An apparatus provided with this electronic document processing program is thus able to input an electronic document and read it aloud directly.

[Brief description of the drawings]

FIG. 1 is a block diagram illustrating the configuration of a document processing apparatus shown as an embodiment of the present invention.

FIG. 2 is a diagram showing the internal structure of a document.

FIG. 3 is a diagram explaining the display contents of the display unit, showing a window in which the internal structure of a document is displayed using tags.

FIG. 4 is a flowchart illustrating a series of processes performed when reading a document aloud.

FIG. 5 is a diagram showing an example of a received or created Japanese document, showing a window in which the document is displayed.

FIG. 6 is a diagram showing an example of a received or created English document, showing a window in which the document is displayed.

FIG. 7 is a diagram showing a tag file, which is the tagged version of the Japanese document shown in FIG. 5.

FIG. 8 is a diagram showing a tag file, which is the tagged version of the English document shown in FIG. 6.

FIG. 9 is a diagram showing a voice reading file generated from the tag file shown in FIG. 7.

FIG. 10 is a diagram showing a voice reading file generated from the tag file shown in FIG. 8.

FIG. 11 is a flowchart illustrating a series of processes performed when generating a voice reading file.

FIG. 12 is a diagram showing a user interface window.

FIG. 13 is a diagram showing a window in which a document is displayed.

FIG. 14 is a diagram showing a window in which a document is displayed, in a state where the display area for the summary has been enlarged beyond the display area shown in FIG. 13.

FIG. 15 is a flowchart illustrating a series of processes performed when creating a summary.

FIG. 16 is a flowchart illustrating a series of processes performed when carrying out spreading activation (active diffusion).

FIG. 17 is a diagram showing a connection structure of elements, used to explain the spreading activation process.

FIG. 18 is a flowchart illustrating a series of processes performed when carrying out link processing in spreading activation.

FIG. 19 is a diagram showing a window in which a document and its summary are displayed.

FIG. 20 is a flowchart illustrating a series of processes performed when the display range of the display area for the summary is changed and a new summary is created.

FIG. 21 is a diagram showing a window in which a document and its summary are displayed, in a state where the summary is displayed in the window shown in FIG. 14.

FIG. 22 is a flowchart illustrating a series of processes performed when a summary is created and the document is read aloud.

[Explanation of symbols]

10 main body, 11 control unit, 12 interface, 13 CPU, 14 RAM, 15 ROM, 20 input unit, 21 receiving unit, 22 communication unit, 30 audio output unit, 31 display unit, 32 recording/reproducing unit, 33 recording medium, 34 HDD

Claims

[Claims]

1. An electronic document processing method for processing an electronic document, comprising a voice reading file generating step of generating, based on the electronic document, a voice reading file for reading the electronic document aloud by speech synthesis.

2. The electronic document processing method according to claim 1, further comprising a processing step of performing processing suited to a speech synthesis engine using the voice reading file generated in the voice reading file generating step.

3. The electronic document processing method according to claim 1, wherein the electronic document has an internal structure in which a plurality of elements are hierarchized, and tag information indicating the internal structure is added to the electronic document in advance.

4. The electronic document processing method according to claim 3, wherein tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document is added to the electronic document, and the paragraphs, sentences, and phrases constituting the electronic document are identified based on the tag information indicating the paragraphs, sentences, and phrases.

5. The electronic document processing method according to claim 3, wherein the electronic document is provided with tag information necessary for performing speech synthesis.

6. The electronic document processing method according to claim 5, wherein the tag information necessary for performing the speech synthesis includes attribute information for prohibiting reading out.

7. The electronic document processing method according to claim 5, wherein the tag information necessary for performing the speech synthesis includes attribute information indicating a reading (kana) or a pronunciation.

8. The electronic document processing method according to claim 1, wherein, in the voice reading file generating step, the voice reading file is generated by adding attribute information indicating the language in which the electronic document is written.

9. The electronic document processing method according to claim 1, wherein, in the voice reading file generating step, the voice reading file is generated by adding attribute information indicating the start positions of paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document.

10. The electronic document processing method according to claim 9, wherein, in the voice reading file generating step, when attribute information indicating the same type of syntactic structure appears consecutively among the attribute information indicating the start positions of the paragraphs, sentences, and phrases, the consecutive attribute information is merged into a single item.

11. The electronic document processing method according to claim 9, wherein, in the voice reading file generating step, the voice reading file is generated by adding attribute information indicating that a pause period is to be provided, in correspondence with the attribute information indicating the start positions of the paragraphs, sentences, and phrases.

12. The electronic document processing method according to claim 1, wherein in the step of generating a file for reading out aloud, the part for which reading out is prohibited is removed to generate the file for reading out aloud.

13. The electronic document processing method according to claim 1, wherein in the voice reading file generating step, the voice reading file is generated by substituting correct reading or pronunciation.

14. The electronic document processing method according to claim 1, wherein in the voice reading file generating step, the voice reading file is generated by adding attribute information indicating a volume of voice reading.

15. The electronic document processing method according to claim 2, wherein, in the processing step, a speech synthesis engine is selected based on attribute information that is attached to the voice reading file and indicates the language in which the electronic document is written.

16. The electronic document processing method according to claim 2, wherein, in the processing step, an absolute value of the reading volume is obtained based on attribute information that is attached to the voice reading file and indicates the reading volume.

17. The electronic document processing method according to claim 2, wherein the speech synthesis engine performs cueing, fast-forwarding, or rewinding during reading in units of paragraphs, sentences, and phrases, based on the voice reading file to which attribute information indicating the start positions of paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document has been added.

18. An electronic document processing method for processing an electronic document, comprising: a document input step of inputting an electronic document that has an internal structure in which a plurality of elements are hierarchized and to which tag information indicating the internal structure has been added in advance; and a document reading step of reading the electronic document aloud by speech synthesis based on the tag information.

19. The electronic document processing method according to claim 18, wherein, in the document input step, an electronic document to which tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document has been added is input, and the electronic document is read aloud with a pause provided at the start position of each paragraph, sentence, and phrase based on the tag information indicating the paragraphs, sentences, and phrases.

20. The electronic document processing method according to claim 18, wherein tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document is added to the electronic document, and, in the document reading step, the paragraphs, sentences, and phrases constituting the electronic document are identified based on the tag information indicating the paragraphs, sentences, and phrases.

21. The electronic document processing method according to claim 18, wherein the electronic document is provided with tag information necessary for performing speech synthesis.

22. The electronic document processing method according to claim 21, wherein the tag information necessary for performing the speech synthesis includes attribute information for prohibiting reading out.

23. The electronic document processing method according to claim 21, wherein the tag information necessary for performing the speech synthesis includes attribute information indicating a reading kana or pronunciation.

24. The electronic document processing method according to claim 18, wherein in the document reading step, a part whose reading is prohibited is removed to read the electronic document.

25. The electronic document processing method according to claim 18, wherein in the document reading step, the electronic document is read aloud by replacing with correct reading or pronunciation.

26. The electronic document processing method according to claim 18, wherein, in the document reading step, cueing, fast-forwarding, or rewinding during reading is performed in units of paragraphs, sentences, and phrases, based on tag information indicating the paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document.

27. An electronic document processing apparatus for processing an electronic document, comprising voice reading file generating means for generating, based on the electronic document, a voice reading file for reading the electronic document aloud by speech synthesis.

28. The electronic document processing apparatus according to claim 27, further comprising processing means for performing processing suited to a speech synthesis engine using the voice reading file generated by the voice reading file generating means.

29. The electronic document processing apparatus according to claim 27, wherein the electronic document has an internal structure in which a plurality of elements are hierarchized, and tag information indicating the internal structure is added to the electronic document in advance.

30. The electronic document processing apparatus according to claim 29, wherein tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document is added to the electronic document, and the paragraphs, sentences, and phrases constituting the electronic document are identified based on the tag information indicating the paragraphs, sentences, and phrases.

31. The electronic document processing apparatus according to claim 29, wherein the electronic document is provided with tag information necessary for performing speech synthesis.

32. The electronic document processing apparatus according to claim 31, wherein the tag information necessary for performing the speech synthesis includes attribute information for prohibiting reading out.

33. The electronic document processing apparatus according to claim 31, wherein the tag information necessary for performing the speech synthesis includes attribute information indicating a reading kana or a pronunciation.

34. The electronic document processing apparatus according to claim 27, wherein the voice reading file generating means generates the voice reading file by adding attribute information indicating the language in which the electronic document is written.

35. The electronic document processing apparatus according to claim 27, wherein the voice reading file generating means generates the voice reading file by adding attribute information indicating the start positions of paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document.

36. The electronic document processing apparatus according to claim 35, wherein, when attribute information indicating the same type of syntactic structure appears consecutively among the attribute information indicating the start positions of the paragraphs, sentences, and phrases, the voice reading file generating means merges the consecutive attribute information into a single item.

37. The electronic document processing apparatus according to claim 35, wherein the voice reading file generating means generates the voice reading file by adding attribute information indicating that a pause period is to be provided, in correspondence with the attribute information indicating the start positions of the paragraphs, sentences, and phrases.

38. The electronic document processing apparatus according to claim 27, wherein the voice reading file generating means generates the voice reading file by removing a portion where reading is prohibited.

39. The electronic document processing apparatus according to claim 27, wherein the voice reading file generating means generates the voice reading file by replacing the reading with correct reading or pronunciation.

40. The electronic document processing apparatus according to claim 27, wherein the voice reading file generating means generates the voice reading file by adding attribute information indicating a reading volume.

41. The electronic document processing apparatus according to claim 28, wherein the processing means selects a speech synthesis engine based on attribute information that is attached to the voice reading file and indicates the language in which the electronic document is written.

42. The electronic document processing apparatus according to claim 28, wherein the processing means obtains an absolute value of the reading volume based on attribute information that is attached to the voice reading file and indicates the reading volume.

43. The electronic document processing apparatus according to claim 28, wherein the speech synthesis engine performs cueing, fast-forwarding, or rewinding during reading in units of paragraphs, sentences, and phrases, based on the voice reading file to which attribute information indicating the start positions of paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document has been added.

44. An electronic document processing apparatus for processing an electronic document, comprising: document input means for inputting an electronic document that has an internal structure in which a plurality of elements are hierarchized and to which tag information indicating the internal structure has been added in advance; and document reading means for reading the electronic document aloud by speech synthesis based on the tag information.

45. The electronic document processing apparatus according to claim 44, wherein the document input means inputs an electronic document to which tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document has been added, and the electronic document is read aloud with a pause provided at the start position of each paragraph, sentence, and phrase based on the tag information indicating the paragraphs, sentences, and phrases.

46. The electronic document processing apparatus according to claim 44, wherein tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document is added to the electronic document, and the document reading means identifies the paragraphs, sentences, and phrases constituting the electronic document based on the tag information indicating the paragraphs, sentences, and phrases.

47. The electronic document processing apparatus according to claim 44, wherein the electronic document is provided with tag information necessary for performing speech synthesis.

48. The electronic document processing apparatus according to claim 47, wherein the tag information necessary for performing the speech synthesis includes attribute information for prohibiting reading out.

49. The electronic document processing apparatus according to claim 47, wherein the tag information necessary for performing the speech synthesis includes attribute information indicating a reading (kana) or a pronunciation.

50. The electronic document processing device according to claim 44, wherein said document reading means reads out the electronic document by removing a portion for which reading is prohibited.

51. The electronic document processing apparatus according to claim 44, wherein the document reading means reads the electronic document by replacing the reading with correct reading or pronunciation.

52. The electronic document processing apparatus according to claim 44, wherein the document reading means performs cueing, fast-forwarding, or rewinding during reading in units of paragraphs, sentences, and phrases, based on tag information indicating the paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document.

53. A recording medium on which a computer-controllable electronic document processing program for processing an electronic document is recorded, wherein the electronic document processing program comprises a voice reading file generating step of generating, based on the electronic document, a voice reading file for reading the electronic document aloud by speech synthesis.

54. The recording medium on which the electronic document processing program is recorded according to claim 53, wherein the electronic document processing program further comprises a processing step of performing processing suited to a speech synthesis engine using the voice reading file generated in the voice reading file generating step.

55. The recording medium on which the electronic document processing program is recorded according to claim 53, wherein the electronic document has an internal structure in which a plurality of elements are hierarchized, and tag information indicating the internal structure is added to the electronic document in advance.

56. The recording medium on which the electronic document processing program is recorded according to claim 55, wherein tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document is added to the electronic document, and the paragraphs, sentences, and phrases constituting the electronic document are identified based on the tag information indicating the paragraphs, sentences, and phrases.

57. A recording medium according to claim 55, wherein said electronic document is provided with tag information necessary for performing speech synthesis.

58. A recording medium on which a computer-controllable electronic document processing program for processing an electronic document is recorded, wherein the electronic document processing program comprises: a document input step of inputting an electronic document that has an internal structure in which a plurality of elements are hierarchized and to which tag information indicating the internal structure has been added in advance; and a document reading step of reading the electronic document aloud by speech synthesis based on the tag information.

59. The recording medium on which the electronic document processing program is recorded according to claim 58, wherein, in the document input step, an electronic document to which tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document has been added is input, and the electronic document is read aloud with a pause provided at the start position of each paragraph, sentence, and phrase based on the tag information indicating the paragraphs, sentences, and phrases.

60. The recording medium on which the electronic document processing program is recorded according to claim 58, wherein tag information indicating at least paragraphs, sentences, and phrases among the plurality of elements constituting the electronic document is added to the electronic document, and, in the document reading step, the paragraphs, sentences, and phrases constituting the electronic document are identified based on the tag information indicating the paragraphs, sentences, and phrases.

61. A recording medium according to claim 58, wherein said electronic document is provided with tag information necessary for performing speech synthesis.