JPH11175307A

JPH11175307A - Method and device for reading document aloud

Info

Publication number: JPH11175307A
Application number: JP9341394A
Authority: JP
Inventors: Toshiaki Kobayashi; 俊明小林
Original assignee: COMPUTER CONSULTING KK
Current assignee: COMPUTER CONSULTING KK
Priority date: 1997-12-11
Filing date: 1997-12-11
Publication date: 1999-07-02

Abstract

PROBLEM TO BE SOLVED: To discriminate between the body and quotations of a document into which the quotations are inserted when the document is read aloud by extracting the specific partial documents from the document and reading the extracted partial documents aloud by voice synthesis in a different voice from other document parts. SOLUTION: The document to be read aloud is inputted from a document input means 12. When a code detecting means 13 does not detect a code '>' indicating a quotation at the head of each line of the inputted document, a voice selecting means 16 selects, for example, a male voice as a 1st voice for reading the body aloud and the 1st voice selected by the voice selecting means 16 is synthesized to read the document aloud. The document continues to be read aloud until the code detecting means 13 detects a carriage-return code. When the code detecting means 13 detects the code '>', the voice selecting means 16 selects, for example, a female voice as a 2nd voice for reading the quotation aloud and the 2nd voice selected by the voice selecting means 16 is synthesized to read the document aloud.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、文書読上げ方法
および装置に関し、特に、同一文書内で異なる条件で音
声合成を行い文書の認識率を高めることのできる文書読
上げ方法および装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method and apparatus for reading a document, and more particularly, to a method and apparatus for reading a document which can perform speech synthesis under different conditions in the same document and increase the recognition rate of the document.

【０００２】[0002]

【従来技術】最近、電子ファイルに記述された文書を音
声合成して読上げる技術が多く利用されている。例え
ば、電子メールを音声合成して読上げるシステムでは、
電子メールを受信すると、予めユーザが設定した電話番
号への発呼を行い、電話回線が接続された後、受信した
電子メールを音声合成して読上げてその電子メールの内
容をユーザに伝えるといったことも行われている。2. Description of the Related Art In recent years, a technique for reading a document described in an electronic file by voice synthesis has been widely used. For example, in a system that reads out e-mails by voice synthesis,
When an e-mail is received, a call is made to a telephone number set by the user in advance, and after the telephone line is connected, the received e-mail is voice-synthesized and read out to convey the contents of the e-mail to the user. Has also been done.

【０００３】一方、電子メールにおいては一般の郵便等
で通信するメールと異なり、文書の複写等の加工が容易
であるという特性等のため、受信した文書の一部または
全部を引用文として含ませた文書を返信メールとする等
の使い方がされることが多い。[0003] On the other hand, electronic mail differs from general mail in that it is communicated by mail or the like. For this reason, processing such as copying of a document is easy. Therefore, a part or all of a received document is included as a quotation. In many cases, such documents are used as reply mails.

【０００４】ここで、図７に電子メールの記述例を示
す。図７（ａ）は、ユーザＢからユーザＡへの電子メー
ルを示した図であり、図７（ｂ）は、これに対するユー
ザＡからユーザＢへの返信を示した図である。FIG. 7 shows an example of a description of an electronic mail. FIG. 7A is a diagram showing an e-mail from the user B to the user A, and FIG. 7B is a diagram showing a reply from the user A to the user B.

【０００５】図７（ｂ）に示すように、ユーザＡはユー
ザＢへの返信に受信したメール（図７（ａ）参照）の３
行目から６行目までを引用文として利用し、行頭に
“＞”を付加することで本文と引用文を区別している。[0005] As shown in FIG. 7 (b), the user A receives the reply to the user B (see FIG.
The lines from line 6 to line 6 are used as quotations, and the text and quotation are distinguished by adding “>” to the beginning of the line.

【０００６】ところで、図７（ｂ）に示すような引用文
が記述された電子メールを音声合成により読上げた場
合、引用文の前後で（図７（ｂ）では２行目から３行
目、６行目から７行目）、文章の繋がりがおかしくなり
意味が不明確となってしまうことがある。By the way, when an e-mail in which a quote is described as shown in FIG. 7B is read out by speech synthesis, before and after the quote (in FIG. 7B, the second to third lines, From the sixth line to the seventh line), the connection of sentences may be strange and the meaning may be unclear.

【０００７】また、図８に示すように日本文と英文とが
混在している文書を音声合成で読上げた場合、日本文を
正しく読上げ、英文を「エス・ユー・エム・エム・アー
ル・ワイ」といったようにアルファベットとして読上げ
てしまい、読上げた文書を聞いて意味を理解することが
困難となってしまうことがある。[0007] Further, as shown in FIG. 8, when a document in which Japanese sentences and English sentences are mixed is read out by speech synthesis, the Japanese sentences are read out correctly, and the English sentences are read as "S.U.M.M.R.Y."", And it may be difficult to understand the meaning by listening to the read text.

【０００８】[0008]

【発明が解決しようとする課題】上述のように、音声合
成により文書を読上げる際に、その読上げる文書が引用
文の挿入されている文書や異なる言語が混在している文
書であった場合には、読上げられた文書を聞いて意味を
理解することが困難となってしまうといった問題点があ
った。As described above, when a document is read aloud by speech synthesis, if the read document is a document in which a quote is inserted or a document in which different languages are mixed. Has a problem that it is difficult to understand the meaning by listening to the read text.

【０００９】そこで、この発明は、引用文が挿入されて
いる文書を読上げる際には、本文と引用文を判別できる
ように読上げ、異なる言語が混在する文書に対しても各
文を各言語に対応させて読上げることのできる文書読上
げ方法および装置を提供することを目的とする。Therefore, according to the present invention, when reading out a document in which a quotation is inserted, the text is read out so that the text and the quotation can be distinguished. It is an object of the present invention to provide a text-to-speech method and apparatus that can read aloud in response to the following.

【００１０】[0010]

【課題を解決するための手段】上述した目的を達成する
ため、請求項１の発明では、電子ファイルに記述された
文書を音声合成により読上げる文書読上げ方法におい
て、前記文書から特定の部分文書を抽出し、該抽出した
部分文書を前記音声合成により他の文書部分と異なる音
声で合成して読上げることを特徴とする。According to a first aspect of the present invention, there is provided a document reading method for reading a document described in an electronic file by speech synthesis. It is characterized in that the extracted partial document is synthesized by the above-mentioned speech synthesis with a different voice from other document portions, and is read aloud.

【００１１】また、請求項２の発明では、請求項１の発
明において、前記文書の本文から引用文を抽出し、該抽
出した引用文を前記本文と異なる音声合成を行うことを
特徴とする。According to a second aspect of the present invention, in the first aspect of the present invention, a quotation is extracted from the text of the document, and the extracted quotation is subjected to speech synthesis different from that of the text.

【００１２】また、請求項３の発明では、請求項２の発
明において、前記文書は、受信した電子メールに記述さ
れた文書であり、該文書から引用文を示す符号を検出
し、該検出結果に基づいて前記引用文の抽出行い、該抽
出した引用文を前記本文と異なる声色で音声合成を行う
ことを特徴とする。According to a third aspect of the present invention, in the second aspect of the invention, the document is a document described in a received electronic mail, and a code indicating a quote is detected from the document, and the detection result is obtained. The cited text is extracted on the basis of the extracted text, and the extracted quoted text is subjected to voice synthesis with a voice tone different from that of the text.

【００１３】また、請求項４の発明では、請求項１の発
明において、前記文書を該文書を記述した言語に対応し
て抽出し、前記抽出した文書を該文書の記述言語に対応
した音声合成を行って読上げることを特徴とする。According to a fourth aspect of the present invention, in the first aspect of the present invention, the document is extracted according to a language in which the document is described, and the extracted document is subjected to speech synthesis corresponding to a description language of the document. And read aloud.

【００１４】また、請求項５の発明では、電子ファイル
に記述された文書を音声合成により読上げる文書読上げ
装置において、前記文書から部分文書を抽出する抽出手
段と、該抽出手段により抽出された部分文書を他の部分
文書と異なる音声で合成する音声合成手段とを具備する
ことを特徴とする。According to a fifth aspect of the present invention, there is provided a document reading apparatus for reading a document described in an electronic file by voice synthesis, comprising: an extracting unit for extracting a partial document from the document; and a part extracted by the extracting unit. Voice synthesis means for synthesizing the document with a different voice from the other partial documents.

【００１５】また、請求項６の発明では、請求項５の発
明において、前記抽出手段は、前記文書の本文から引用
文を抽出し、前記音声合成手段は、前記分割手段により
分割された本文と引用文とを異なる音声で音声合成する
ことを特徴とする。According to a sixth aspect of the present invention, in the fifth aspect of the present invention, the extracting means extracts a quotation from the text of the document, and the speech synthesizing means extracts the quotation from the text divided by the dividing means. It is characterized by synthesizing a quotation with a different voice.

【００１６】また、請求項７の発明では、請求項６の発
明において、前記文書は、受信した電子メールに記述さ
れた文書であり、前記分割手段は、前記文書から引用文
を示す符号を検出する符号検出手段を具備し、前記音声
合成手段は、前記分割手段により分割された本文と引用
文とを異なる声色で音声合成することを特徴とする。According to a seventh aspect of the present invention, in the sixth aspect, the document is a document described in a received e-mail, and the dividing means detects a code indicating a quote from the document. The voice synthesizing unit synthesizes the text and the quoted text divided by the dividing unit with different voice colors.

【００１７】また、請求項８の発明では、請求項５の発
明において、前記抽出手段は、前記文書を該文書を記述
した言語に対応して抽出し、前記音声合成手段は、前記
抽出手段により抽出された文書の記述言語に対応する音
声合成を行うことを特徴とする。According to an eighth aspect of the present invention, in the fifth aspect of the invention, the extracting means extracts the document corresponding to a language in which the document is described, and the speech synthesizing means outputs Speech synthesis corresponding to the description language of the extracted document is performed.

【００１８】また、請求項９の発明では、電子ファイル
に記述された文書を音声合成により読上げる文書読上げ
プログラムを記録した媒体において、前記文書から特定
の部分文書を抽出し、該抽出した部分文書を前記音声合
成により他の文書部分と異なる音声で合成して読上げる
ことを特徴とする。According to a ninth aspect of the present invention, a specific partial document is extracted from a document on a medium in which a document reading program for reading a document described in an electronic file by speech synthesis is recorded, and the extracted partial document is extracted. Is synthesized by the above-mentioned voice synthesis with a voice different from that of another document part, and is read aloud.

【００１９】[0019]

【発明の実施の形態】以下、この発明に係わる文書読上
げ方法および装置の一実施例を添付図面を参照して詳細
に説明する。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a block diagram showing an embodiment of a document reading method and apparatus according to the present invention;

【００２０】図１は、文書読上げ装置の構成を示すブロ
ック図である。FIG. 1 is a block diagram showing the configuration of the document reading device.

【００２１】図１において、文書読上げ装置１０は、各
部の制御を行う制御手段１１、読上げる文書を入力する
文書入力手段１２、引用文の開始や終了などを示す符号
を検出する符号検出手段１３、改行コードや文末を示す
ＥＯＦ（ｅｎｄｏｆｆｉｌｅ）等のコードを検出す
る制御コード検出手段１４、音声合成を行って文書を読
上げる音声合成手段１５、音声合成を行う声色を選択す
る声色選択手段１６を具備して構成される。In FIG. 1, a text-to-speech apparatus 10 includes a control unit 11 for controlling each unit, a document input unit 12 for inputting a text to be read, and a code detection unit 13 for detecting a code indicating the start or end of a quoted sentence. Control code detecting means 14 for detecting a line feed code or a code such as EOF (end of file) indicating the end of a sentence; voice synthesizing means 15 for performing voice synthesis to read a document; and voice color selecting means for selecting a voice color for performing voice synthesis. 16 is provided.

【００２２】電子メール等の文書に引用文を挿入する方
法に制限はないが、引用文を行単位で挿入して引用文の
行頭に“＞”等の符号を付加したり、“『”と“』”で
引用文を囲んで本文中に挿入する方法が一般的である。There is no limitation on the method of inserting a quotation in a document such as an electronic mail. However, a quotation is inserted in units of lines, and a mark such as ">" is added to the beginning of the quotation, or "" It is common to insert quotes around the quotes with "".

【００２３】ここで、図２を参照して、行頭に“＞”等
の符号を付加して引用文を挿入した文書を読上げる際の
文書読上げ装置１０の動作を説明する。Referring now to FIG. 2, the operation of the text-to-speech apparatus 10 when reading a document in which a quote such as ">" is added at the beginning of a line and a quote is inserted will be described.

【００２４】図２は、文書読上げ装置の動作の流れを示
すフローチャートである。文書読上げ装置１０が動作を
開始すると（ステップ１０１）、文書入力手段１２から
読上げる文書が入力される。次に、符号検出手段１３が
入力された文書の各行の行頭に引用文を示す符号“＞”
が付加されているか否かを検出し（ステップ１０２）、
符号“＞”が検出されなければ（ステップ１０２でＮ
Ｏ）、声色選択手段１６が本文を読上げる第１声色とし
て、例えば男性の声色を選択し（ステップ１０３）、音
声合成手段１６が選択された第１声色で音声合成を行い
文書を読上げる（ステップ１０５）。文書の読上げは、
制御コード検出手段１４が改行コードを検出するまで続
けられ（ステップ１０６でＮＯ）、改行コードが検出さ
れ（ステップ１０６でＹＥＳ）、これが文書の末尾でな
ければ（ステップ１０７でＮＯ）、再び符号検出手段１
３による符号“＞”の検出が行われる（ステップ１０
２）。FIG. 2 is a flowchart showing the flow of the operation of the document reading device. When the document reading apparatus 10 starts operating (step 101), a document to be read is input from the document input unit 12. Next, the sign detecting means 13 adds a sign “>” indicating a quotation at the beginning of each line of the input document.
Is detected (step 102).
If the sign “>” is not detected (N in step 102)
O), the voice selecting means 16 selects, for example, a male voice as the first voice to read the text (step 103), and the voice synthesizing means 16 performs voice synthesis using the selected first voice and reads the document (step 103). Step 105). When reading a document,
The control is continued until the control code detecting means 14 detects a line feed code (NO in step 106), a line feed code is detected (YES in step 106), and if this is not the end of the document (NO in step 107), the code is detected again. Means 1
3 is detected (step 10).
2).

【００２５】また、符号検出手段１３が符号“＞”を検
出すると（ステップ１０２でＹＥＳ）、声色選択手段１
６が引用文を読上げる第２声色として、例えば女性の声
色を選択し（ステップ１０４）、音声合成手段１６が選
択された第２声色で音声合成を行い文書を読上げる（ス
テップ１０５）。文書の読上げは、第１の声色で読上げ
る場合と同様に制御コード検出手段１４が改行コードを
検出するまで続けられる（ステップ１０６でＮＯ）。When the sign detecting means 13 detects a sign ">" (YES in step 102), the timbre selecting means 1
6 selects a female voice, for example, as the second voice to read the quotation (step 104), and the voice synthesizing means 16 performs voice synthesis using the selected second voice and reads the document (step 105). The reading of the document is continued until the control code detecting means 14 detects a line feed code in the same manner as in the case of reading in the first voice (NO in step 106).

【００２６】このように、読上げる文書を行毎に符号
“＞”の検出を行って、符号が検出されなければ本文と
して第１声色で読上げ、符号が検出されれば引用文とし
て第２の声色で読上げを行い、文末まで読上げを行うと
（ステップ１０７でＹＥＳ）、文書読上げ装置１０は文
書の読上げを終了する（ステップ１０８）。As described above, the text to be read is detected for each line by detecting the sign ">". If the sign is not detected, the document is read in the first voice as the text. If the sign is detected, the second sentence is referred to as the quotation. When the text-to-speech is read out to the end of the sentence (YES in step 107), the text-to-speech apparatus 10 ends the text-to-speech (step 108).

【００２７】ところで、電子メール等では、受信したメ
ールを引用文として利用することが多いが、引用文が挿
入された文書を引用する、つまり、先に引用された文を
二重に引用するといった場合もある。By the way, in an electronic mail or the like, a received mail is often used as a quote, but a document in which a quote is inserted is quoted, that is, a quoted sentence is quoted twice. In some cases.

【００２８】図３は、二重に引用がなされている文書を
読上げる際の文書読上げ装置の動作の流れを示すフロー
チャートである。文書読上げ装置１０が動作を開始し
（ステップ２０１）、文書入力手段１２から読上げる文
書が入力されると、符号検出手段１３が入力された文書
の行頭に引用文を示す符号“＞”が付加されているか否
かを検出するが（ステップ２０２、２０４）、検出の結
果、符号“＞”が付加されていなければ（ステップ２０
２でＮＯ）、声色選択手段１６が本文を読上げる第１声
色として、例えば男性の声色を選択し（ステップ２０
３）、符号“＞”が１つ付加されていた場合には（ステ
ップ２０２でＹＥＳ、ステップ２０４でＮＯ）、声色選
択手段１６が引用文を読上げる第２声色として、例えば
女性の声色を選択し（ステップ２０５）、符号“＞”が
２つ付加され“＞＞”となっていた場合には（ステップ
２０２でＹＥＳ、ステップ２０２でＹＥＳ）、声色選択
手段１６が二重の引用文を読上げる第３声色として、例
えば子供の声色を選択する（ステップ２０６）。FIG. 3 is a flow chart showing the flow of the operation of the document reading device when reading a double-cited document. When the text-to-speech apparatus 10 starts operating (step 201) and a text to be read is input from the document input means 12, the code detection means 13 adds a symbol ">" indicating a citation to the beginning of the input document. Is detected (steps 202 and 204), but as a result of the detection, if the sign “>” is not added (step 20).
(NO in step 2), the voice selecting means 16 selects, for example, a male voice as the first voice to read the text (step 20).
3) If one sign “>” has been added (YES in step 202, NO in step 204), for example, a female voice is selected as the second voice that the voice color selecting means 16 reads out the quotation. (Step 205), when two symbols “>” are added and the result is “>>” (YES in Step 202, YES in Step 202), the voice selecting means 16 reads the double quote. For example, a child's voice is selected as the third voice to be raised (step 206).

【００２９】文書を読上げる声色が選択されると、音声
合成手段１６が選択された声色で音声合成を行い文書を
読上げ（ステップ２０７）、制御コード検出手段１４が
改行コードを検出するまで続けられる（ステップ２０８
でＮＯ）。When the voice to read the document is selected, the voice synthesizing means 16 synthesizes the voice with the selected voice and reads the document (step 207), and continues until the control code detecting means 14 detects the line feed code. (Step 208
NO).

【００３０】この文書の読上げは、文書の末尾まで繰り
返され（ステップ２０９でＮＯ）、文末まで読上げを行
うと（ステップ２０９でＹＥＳ）、文書読上げ装置１０
は動作を終了する（ステップ２１０）。This reading of the document is repeated until the end of the document (NO in step 209), and when the reading is performed to the end of the sentence (YES in step 209), the document reading apparatus 10
Ends the operation (step 210).

【００３１】次に、図４を参照して、“『”と“』”等
で囲まれた引用文を挿入した文書を読上げる際の文書読
上げ装置１０の動作を説明する。Next, with reference to FIG. 4, the operation of the text-to-speech apparatus 10 when reading a document in which a quote enclosed between "" and "" is inserted will be described.

【００３２】図４は、符号で囲まれた引用文を挿入した
文書を読上げる際の文書読上げ装置の動作の流れを示す
フローチャートである。文書読上げ装置１０が動作を開
始し（ステップ３０１）、文書入力手段１２から読上げ
る文書が入力されると、まず、声色選択手段１６が本文
を読上げる声色として、第１声色を選択する（ステップ
３０２）。FIG. 4 is a flow chart showing the flow of the operation of the document reading aloud device when reading out a document in which a quote enclosed by a code is inserted. When the text-to-speech apparatus 10 starts operation (step 301) and a document to be read is input from the document input means 12, first, the voice selection means 16 selects the first voice as the voice to read the text (step 301). 302).

【００３３】次に、符号検出手段１３で符号“『”
と“』”の検出を行いながら（ステップ３０３でＮ
Ｏ）、音声合成手段１５が第１声色で文書の読上げを行
う（ステップ３０８）。文書の読上げは、制御コード検
出手段１４が文末（ファイル末）を示すＥＯＦを検出す
るまで続けられる（ステップ３０９でＮＯ）。Next, the code ""
And """(N in step 303).
O), the voice synthesizing unit 15 reads out the document in the first voice (step 308). The reading of the document is continued until the control code detecting means 14 detects the EOF indicating the end of the sentence (end of file) (NO in step 309).

【００３４】ここで、符号検出手段１３が符号“『”ま
たは“』”を検出し（ステップ３０３でＹＥＳ）、その
符号が“『”であれば（ステップ３０４でＹＥＳ）、引
用文の開始箇所と判断して声色選択手段１６が引用文を
読上げる声色として第２声色を選択し（ステップ３０
５）、音声合成手段１５が第２声色で文書の読上げを行
う（ステップ３０８）。Here, the sign detecting means 13 detects the sign "" or "" (YES in step 303), and if the sign is "" (YES in step 304), the starting position of the quote And the second voice is selected as the voice to which the voice selecting means 16 reads the quotation (step 30).
5) The voice synthesizing unit 15 reads out the document in the second voice (step 308).

【００３５】第２声色で文書の読上げが続けられ（ステ
ップ３０８、３０９でＮＯ）、符号検出手段１３が再び
符号“『”または“』”を検出し（ステップ３０３でＹ
ＥＳ）、その符号が“』”であれば（ステップ３０４で
ＮＯ）、引用文の終了箇所と判断して声色選択手段１６
が本部を読上げる声色として第１声色を選択し（ステッ
プ３０７）、音声合成手段１５が第１声色で文書の読上
げを行う（ステップ３０８）。The reading of the document in the second voice is continued (NO in steps 308 and 309), and the code detecting means 13 detects the code "" or "" again (Y in step 303).
ES), if the sign is "" (NO in step 304), it is determined that the quotation is at the end and the voice color selecting means 16
Selects the first voice as the voice to read the headquarters (step 307), and the voice synthesizing unit 15 reads the document in the first voice (step 308).

【００３６】このようにして文書の読上げが続けられ、
制御コード検出手段１４が文末（ファイル末）を示すＥ
ＯＦを検出すると（ステップ３０９でＹＥＳ）、文書読
上げ装置１０は動作を終了する（ステップ３１０）。In this manner, the reading of the document is continued,
The control code detecting means 14 indicates the end of the sentence (end of file) E
When OF is detected (YES in step 309), document reading-aloud device 10 ends the operation (step 310).

【００３７】また、ここでの説明は省略するが、行頭に
符号“＞＞”を付加した二重引用と同様に“『”
と“』”で囲まれた引用文内でさらに“『”と“』”で
囲まれた二重引用文を別の声色で読上げることもでき
る。Although the description is omitted here, as in the case of a double quote in which a symbol ">>" is added at the beginning of a line, "["
Within the quotes enclosed by "and"", the double quotes enclosed by""and""can be read out in a different voice.

【００３８】なお、この実施例では引用文を示す符号を
“＞”や“『”、“』”として説明したが、別の符号を
同様の方法で使用した場合にも、声色を変更して文書の
読上げを行うことができる。In this embodiment, reference numerals indicating quotes are described as ">", "", "". However, even when another code is used in the same manner, the voice is changed. Can read aloud a document.

【００３９】また、特定のグループ内等で送受を行って
いるメールを読上げる際には、文書読上げ装置１０に図
示しない声色登録手段を具備し、そのグループ内のユー
ザの声色を登録し、文書の記述を行ったユーザの声色で
文書の読上げを行うようにすることもできる。When reading out an e-mail sent or received within a specific group or the like, the text-to-speech apparatus 10 is provided with a voice registering means (not shown) to register the voices of the users in the group. The document can be read aloud in the voice of the user who made the description.

【００４０】次に、この発明に係わる文書読上げ方法お
よび装置の第２の実施例について説明する。Next, a description will be given of a second embodiment of the document reading method and apparatus according to the present invention.

【００４１】図５は、第２の実施例における文書読上げ
装置の構成を示すブロック図である。図５において、文
書読上げ装置２０は、各部の制御を行う制御手段２１、
読上げる文書を入力する文書入力手段２２、文書を記述
した言語を示す情報を検出する言語情報検出手段２３、
改行コードや文末を示すＥＯＦ（ｅｎｄｏｆｆｉｌ
ｅ）等のコードを検出する制御コード検出手段２４、音
声合成を行って文書を読上げる音声合成手段２５、音声
合成を行う言語を選択する言語選択手段２６を具備して
構成される。FIG. 5 is a block diagram showing the configuration of the document reading device according to the second embodiment. In FIG. 5, a document reading aloud device 20 includes a control unit 21 for controlling each unit,
A document input unit 22 for inputting a document to be read, a language information detecting unit 23 for detecting information indicating a language describing the document,
EOF (end of file) indicating line feed code or end of sentence
e) a control code detecting means 24 for detecting codes, a voice synthesizing means 25 for performing voice synthesis and reading a document, and a language selecting means 26 for selecting a language for performing voice synthesis.

【００４２】この文書読上げ装置２０は、電子ファイル
に記述された文書が、どの言語で記述されたかを言語情
報検出手段２３で検出して、各言語に対応した文書読上
げを行う。言語情報検出手段２３は、例えば、電子メー
ルにおいては、電子メールのフォーマット規約のＭＩＭ
Ｅとよばれる規格で多国語対応がなされており、このＭ
ＩＭＥのタグ情報を判別することで、文書が何語で記述
されているかを判別する。In the document reading device 20, the language information detecting means 23 detects in which language the document described in the electronic file is described, and reads the document corresponding to each language. For example, in the case of an electronic mail, the linguistic information detecting means
Multilingual support is provided by a standard called E.
By determining the tag information of the IME, it is determined in which language the document is described.

【００４３】ここで、図６を参照して文書読上げ装置２
０の動作を説明する。図６は、文書読上げ装置２０の動
作の流れを示すフローチャートである。Here, with reference to FIG.
The operation of 0 will be described. FIG. 6 is a flowchart showing the operation flow of the document reading device 20.

【００４４】文書読上げ装置２０が動作を開始し（ステ
ップ４０１）、文書入力手段２２から読上げる文書が入
力されると、言語選択手段２６が、初期設定されている
言語、例えば日本語を選択する（ステップ４０２）。次
に、言語情報検出手段２３が言語情報を検出すれば（ス
テップ４０３でＹＥＳ）、言語選択手段２６が検出され
た情報に基づいた言語を選択し（ステップ４０４）、言
語情報検出手段２３が言語情報を検出しなければ（ステ
ップ４０３でＮＯ）、そのままの言語で音声合成手段２
５が、言語選択手段２６が選択した言語で音声合成を行
って文書を読上げる（ステップ４０５）。When the text-to-speech apparatus 20 starts operating (step 401) and a text to be read is input from the document input means 22, the language selection means 26 selects an initially set language, for example, Japanese. (Step 402). Next, if the linguistic information detecting means 23 detects linguistic information (YES in step 403), the language selecting means 26 selects a language based on the detected information (step 404). If no information is detected (NO in step 403), the voice synthesizing means 2 uses the language as it is.
5 performs speech synthesis in the language selected by the language selecting means 26 and reads out the document (step 405).

【００４５】このステップ４０３乃至４０５の言語の変
更および文書の読上げは、制御コード検出手段２４が文
末（ファイル末）を示すＥＯＦを検出するまで繰り返さ
れ（ステップ４０６でＮＯ）、ＥＯＦが検出されると
（ステップ４０６でＹＥＳ）、文書読上げ装置２０は動
作を終了する（ステップ４０７）。The change of the language and the reading of the document in steps 403 to 405 are repeated until the control code detecting unit 24 detects the EOF indicating the end of the sentence (end of file) (NO in step 406), and the EOF is detected. (YES in step 406), the document reading apparatus 20 ends the operation (step 407).

【００４６】なお、第１の実施例では本文と引用文を異
なる声色で読上げる文書読上げ装置を、第２の実施例で
は言語毎にその言語に対応した音声合成を行って文書を
読上げる文書読上げ装置の説明をしたが、両者を同一の
装置で実現できるように構成してもよいことを付記して
おく。In the first embodiment, a text-to-speech apparatus which reads out a text and a quotation in different voices is used. In the second embodiment, a text-to-speech document is read out by performing speech synthesis corresponding to each language for each language. Although the text-to-speech device has been described, it should be added that both may be configured to be realized by the same device.

【００４７】また、第１の実施例および第２の実施例に
おける文書読上げ装置は、ともにコンピュータ上のメモ
リやＣＰＵ等で構成し、上述の動作を行わせるためのプ
ログラムを動作させることにより実現させることもでき
る。The document reading apparatus in the first embodiment and the second embodiment are both constituted by a memory or a CPU of a computer, and are realized by operating a program for performing the above operation. You can also.

【００４８】[0048]

【発明の効果】以上説明したように、この発明によれ
ば、読上げる文書から引用文を示す符号を検出し、該検
出結果に基づいて引用文を本文とは異なる声色で音声合
成して読上げるように構成したので、ユーザが本文と引
用文とを意識して聞き分けることができ、文書の認識率
が向上する。As described above, according to the present invention, a code indicating a quotation is detected from a document to be read, and based on the detection result, the quotation is speech-synthesized with a voice tone different from that of the text and read. Since the user is able to distinguish between the text and the quoted text, the recognition rate of the document is improved.

【００４９】また、読上げる文書を記述した言語を判定
して、該言語に対応した音声合成により文書を読上げる
ように構成したため、複数の言語で記述された文書の認
識率を高めることができる。Further, since the language in which the document to be read is described is determined and the document is read out by speech synthesis corresponding to the language, the recognition rate of the document described in a plurality of languages can be increased. .

[Brief description of the drawings]

【図１】文書読上げ装置の構成を示すブロック図。FIG. 1 is a block diagram illustrating a configuration of a document reading device.

【図２】文書読上げ装置の動作の流れを示すフローチャ
ート。FIG. 2 is a flowchart showing the flow of the operation of the document reading device.

【図３】二重に引用がなされている文書を読上げる際の
文書読上げ装置の動作の流れを示すフローチャート。FIG. 3 is a flowchart showing the flow of the operation of the document reading device when reading a double-cited document.

【図４】符号で囲まれた引用文を挿入した文書を読上げ
る際の文書読上げ装置の動作の流れを示すフローチャー
ト。FIG. 4 is a flowchart showing the flow of the operation of the document reading device when reading a document in which a quote enclosed by a code is inserted.

【図５】第２の実施例における文書読上げ装置の構成を
示すブロック図。FIG. 5 is a block diagram illustrating a configuration of a document reading device according to a second embodiment.

【図６】第２の実施例における文書読上げ装置の動作の
流れを示すフローチャート。FIG. 6 is a flowchart showing the operation flow of the document reading apparatus according to the second embodiment.

【図７】電子メールの記述例を示した図。FIG. 7 is a diagram showing a description example of an electronic mail.

【図８】日本文と英文とが混在している文書例を示した
図。FIG. 8 is a diagram showing an example of a document in which Japanese sentences and English sentences are mixed.

[Explanation of symbols]

１０文書読上げ装置１１制御手段１２文書入力手段１３符号検出手段１３１４制御コード検出手段１４１５音声合成手段１６声色選択手段２０文書読上げ装置２１制御手段２２文書入力手段２３言語情報検出手段２４制御コード検出手段２５音声合成手段２６言語選択手段 DESCRIPTION OF SYMBOLS 10 Document reading-out device 11 Control means 12 Document input means 13 Code detection means 13 14 Control code detection means 14 15 Voice synthesis means 16 Voice color selection means 20 Document reading-out apparatus 21 Control means 22 Document input means 23 Language information detection means 24 Control code detection Means 25 Speech synthesis means 26 Language selection means

Claims

[Claims]

1. A document reading method for reading a document described in an electronic file by speech synthesis, wherein a specific partial document is extracted from the document, and the extracted partial document is different from another document portion by the speech synthesis. A text-to-speech method characterized by speech synthesis.

2. The method according to claim 1, further comprising extracting a quote from the text of the document, and performing a speech synthesis on the extracted quote that is different from that of the text.

3. The document is a document described in a received e-mail. A code indicating a quote is detected from the document, the quote is extracted based on the detection result, and the extracted quote is extracted. 3. The method according to claim 2, wherein speech synthesis is performed on the sentence in a voice different from that of the text.

4. The method according to claim 1, wherein the document is extracted according to a language in which the document is described, and the extracted document is read out by performing speech synthesis corresponding to a description language of the document. How to read documents.

5. A text-to-speech apparatus for reading a document described in an electronic file by speech synthesis, comprising: extracting means for extracting a partial document from the document; and extracting the partial document extracted by the extracting means with another partial document. A text-to-speech apparatus comprising: voice synthesis means for synthesizing with different voices.

6. The method according to claim 1, wherein the extracting means extracts a quote from the text of the document, and the speech synthesis means synthesizes the text and the quote divided by the dividing means with different voices. The document reading device according to claim 5, wherein

7. The document according to claim 1, wherein the document is a document described in a received e-mail, the division unit includes a code detection unit that detects a code indicating a quotation from the document, 7. The text-to-speech apparatus according to claim 6, wherein the text and the quoted text divided by the dividing means are synthesized in different voices.

8. The extracting unit extracts the document corresponding to a language in which the document is described, and the speech synthesizing unit performs speech synthesis corresponding to a description language of the document extracted by the extracting unit. 6. The document reading device according to claim 5, wherein:

9. A medium in which a document reading program for reading a document described in an electronic file by voice synthesis is recorded, and a specific partial document is extracted from the document, and the extracted partial document is converted to another document by the voice synthesis. A medium storing a document reading program characterized by being read out by synthesizing with a different voice from the document part.