JP2002269004A

JP2002269004A - Electronic mail reader

Info

Publication number: JP2002269004A
Application number: JP2001072070A
Authority: JP
Inventors: Katsumi Nishitani; 克巳西谷
Original assignee: Sumitomo Wiring Systems Ltd; AutoNetworks Technologies Ltd; Sumitomo Electric Industries Ltd
Current assignee: Sumitomo Wiring Systems Ltd; AutoNetworks Technologies Ltd; Sumitomo Electric Industries Ltd
Priority date: 2001-03-14
Filing date: 2001-03-14
Publication date: 2002-09-20

Abstract

PROBLEM TO BE SOLVED: To provide an electronic mail reader where the listening sense of a user on the reading output of an electronic mail can be improved. SOLUTION: In the electronic mail reader, a plurality of pieces of sound quality information of senders (or the other persons such as entertainers), which are made to correspond to the respective senders of the electronic mails, are registered in a sound quality data base 9b. The sender of the electronic mail is automatically discriminated by header information of the electronic mail being the object of reading and by information in a sender data base 9a. Sound for reading electronic mail is synthesized based on sound quality information in the sound quality data base 9b, which is made to correspond to the sender.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、電子メール読み上
げ装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an electronic mail reading device.

【０００２】[0002]

【従来の技術】電子メールを音声により読み上げて出力
する従来の電子メール読み上げ装置では、テキスト音声
合成技術が用いられている。この一般的なテキスト音声
合成技術では、標準的な声質情報として男女２種類の声
質情報が備えられている。そして、電子メールの差出人
のメールアドレス等に基づいて、その男女２種類のうち
のいずれかの声質情報が選択され、その声質情報に基づ
いて作成された音声によって電子メールが読み上げ出力
されるようになっている。2. Description of the Related Art A conventional text-to-speech apparatus that reads out and outputs an e-mail by voice uses a text-to-speech synthesis technique. In this general text-to-speech synthesis technology, two types of male and female voice quality information are provided as standard voice quality information. Then, based on the e-mail address of the sender of the e-mail or the like, one of the two types of voice quality information is selected, and the e-mail is read aloud by a voice created based on the voice quality information. Has become.

【０００３】なお、このような電子メール読み上げ装置
は、例えば、車載用等に用いられ、運転中の受信電子メ
ールの内容確認等に好適に用いられるようになってい
る。[0003] Such an e-mail reading device is used, for example, for in-vehicle use, and is suitably used for checking the contents of received e-mails during driving.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上述の
従来技術のように、差出人のメールアドレス等によって
男女別の声質を切り替えて電子メールの読み上げを行う
構成では、用意されている声質の種類も少なく、読み上
げの際の声質が差出人のものと大きく異なる場合が多
く、読み上げ出力のユーザに違和感を与えてしまい、聴
取感が損なわれるという問題がある。However, in the configuration in which the voice quality is switched by gender according to the sender's mail address or the like and the e-mail is read out as in the above-mentioned prior art, there are few types of voice quality prepared. In many cases, the voice quality at the time of reading out is significantly different from that of the sender, giving a sense of incongruity to the user of the reading out output, and causing a problem that the listening feeling is impaired.

【０００５】そこで、本発明は前記問題点に鑑み、電子
メールの読み上げ出力のユーザの聴取感の向上が図れる
電子メール読み上げ装置を提供することを目的とする。SUMMARY OF THE INVENTION In view of the above problems, an object of the present invention is to provide an e-mail reading apparatus which can improve the user's perceived readability of e-mail reading output.

【０００６】[0006]

【課題を解決するための手段】前記目的を達成するため
の技術的手段は、電子メールを音声により読み上げて出
力する電子メール読み上げ装置であって、スピーカと、
前記電子メールの前記差出人が登録された差出人データ
ベースを記憶する第１の記憶手段と、前記差出人データ
ベースに含まれる前記各差出人に対応付けられた声質情
報が登録された声質データベースを記憶する第２の記憶
手段と、与えられた前記電子メールのヘッダ情報を解析
し、その電子メールが前記差出人データベースに含まれ
る前記差出人のうちのいずれの差出人によるものかを判
別する差出人判別手段と、前記第２の記憶手段の前記声
質データベースに含まれる前記声質情報のうちの前記差
出人判別手段によって判別された前記差出人に対応する
前記声質情報に基づいて、与えられた前記電子メールを
その声質情報に対応する声質で読み上げるための音声を
合成し、その音声を前記スピーカを介して出力する音声
合成手段と、を備えることを特徴とする。A technical means for achieving the above object is an electronic mail reading device for reading out an electronic mail by voice and outputting the read out electronic mail.
A first storage unit for storing a sender database in which the sender of the electronic mail is registered, and a second storage unit for storing a voice quality database in which voice quality information associated with each of the senders included in the sender database is registered. Storage means, a sender discriminating means for analyzing header information of the given e-mail, and discriminating which of the senders included in the sender database the e-mail is from; Based on the voice quality information corresponding to the sender determined by the sender determination means among the voice quality information included in the voice quality database of the storage means, the given electronic mail is converted into a voice quality corresponding to the voice quality information. Voice synthesizing means for synthesizing voice to be read out and outputting the voice via the speaker. And wherein the Rukoto.

【０００７】好ましくは、前記電子メール読み上げ装置
は、音声情報およびその音声情報に対応する前記差出人
を示す差出人情報を取り込む第１の情報取込手段と、前
記第１の情報取込手段によって取り込まれた前記音声情
報に基づいて、その音声情報の声質の特徴を抽出し、そ
の声質の特徴を示す前記声質情報を作成する特徴抽出手
段と、前記特徴抽出手段が作成した前記声質情報、およ
び、前記差出人情報によって特定され、その声質情報の
作成に用いられた前記音声情報に対応する前記差出人
を、互いに対応付けて前記声質データベースおよび前記
差出人データベースに登録する第１のデータベース作成
手段と、をさらに備えるのがよい。[0007] Preferably, the electronic mail reading device is fetched by first information fetching means for fetching voice information and sender information indicating the sender corresponding to the voice information, and the first information fetching means. Based on the voice information, a voice quality feature of the voice information is extracted, and the voice quality information created by the voice feature information is generated by the voice quality information indicating the voice quality feature. First database creation means for registering the sender specified by the sender information and corresponding to the voice information used to create the voice quality information in the voice quality database and the sender database in association with each other. Is good.

【０００８】また、好ましくは、前記第１の情報取込手
段は、前記音声情報を、持ち運び可能な記録媒体から、
あるいは通信回線を介して取り込むのがよい。[0008] Preferably, the first information fetching means converts the voice information from a portable recording medium.
Alternatively, it is good to take in via a communication line.

【０００９】さらに、好ましくは、前記特徴抽出手段
は、前記声質情報の作成を行う際に、前記第１の情報取
込手段によって取り込まれた前記音声情報に対応する前
記差出人に対応する既存の前記声質情報が既に前記声質
データベース内に存在する場合には、その既存の声質情
報が示す声質の特徴と、前記第１の情報取込手段によっ
て取り込まれた前記音声情報から新たに抽出した特徴と
に基づいて、前記差出人に対応する声質の特徴をより反
映した新たな声質情報を作成するのがよい。[0009] Further, preferably, the feature extracting means, when creating the voice quality information, stores the voice quality information in the existing information corresponding to the sender corresponding to the voice information captured by the first information capturing means. When the voice quality information already exists in the voice quality database, the voice quality characteristics indicated by the existing voice quality information and the characteristics newly extracted from the voice information captured by the first information capturing unit are added. Based on this, it is preferable to create new voice quality information that further reflects the characteristics of voice quality corresponding to the sender.

【００１０】また、好ましくは、前記電子メール読み上
げ装置は、前記声質情報およびその声質情報に対応する
前記差出人を示す差出人情報を取り込む第２の情報取込
手段と、前記第２の情報取込手段が取り込んだ前記声質
情報、および、前記差出人情報によって特定され、その
声質情報に対応する前記差出人を、互いに対応付けて前
記声質データベースおよび前記差出人データベースにそ
れぞれ登録する第２のデータベース作成手段と、をさら
に備えるのがよい。[0010] Preferably, the electronic mail reading device further comprises: second information capturing means for capturing the voice quality information and sender information indicating the sender corresponding to the voice quality information; and second information capturing means. The voice quality information captured, and, the second database creation means, which is specified by the sender information, and registers the sender corresponding to the voice quality information in the voice quality database and the sender database in association with each other. It is better to have more.

【００１１】さらに、好ましくは、前記第２の情報取込
手段は、前記声質情報を、持ち運び可能な記録媒体か
ら、あるいは通信回線を介して取り込むのがよい。Further, preferably, the second information capturing means captures the voice quality information from a portable recording medium or via a communication line.

【００１２】また、好ましくは、前記第２のデータベー
ス作成手段は、取り込まれた前記声質情報を前記声質デ
ータベースに登録する際に、その声質情報に対応する前
記差出人に対応する既存の前記声質情報が既に前記声質
データベース内に存在する場合には、その既存の声質情
報が示す声質の特徴と前記第２の情報取込手段によって
新たに取り込まれた声質情報が示す声質の特徴とに基づ
いて、前記差出人に対応する声質の特徴をより反映した
声質情報を新たに作成して前記声質データベースに登録
する機能をさらに備えるのがよい。[0012] Preferably, the second database creation means, when registering the fetched voice quality information in the voice quality database, stores the existing voice quality information corresponding to the sender corresponding to the voice quality information. If already present in the voice quality database, based on the voice quality characteristics indicated by the voice quality information indicated by the existing voice quality information and the voice quality characteristics indicated by the voice quality information newly captured by the second information capturing means, It is preferable to further include a function of newly creating voice quality information that further reflects characteristics of voice quality corresponding to the sender and registering the information in the voice quality database.

【００１３】[0013]

【発明の実施の形態】＜全体構成の概略説明＞図１は、
本発明の一実施形態に係る電子メール読み上げ装置のブ
ロック図である。この電子メール読み上げ装置は、図１
に示すように、入力手段である操作部１と、通信部３
と、読取部５と、情報処理部７と、記憶部（第１および
第２の記憶手段）９と、表示部１０と、アンプ１１と、
スピーカ１３とを備えて構成されており、ここでは、車
載用を想定している。情報処理部７は、中央制御部７
ａ、メール送受信部７ｂ、情報取込部７ｃ、メール管理
部７ｄ、声質情報管理部（特徴抽出手段、第１および第
２のデータベース作成手段）７ｅ、表示処理部７ｆおよ
び音声合成部（差出人判別部および音声合成手段に対
応）７ｇを備えている。これの構成のうち、本発明に係
る第１および第２の情報取込手段には、操作部１、通信
部３、読取部５、中央制御部７ａおよび情報取込部７ｃ
が対応している。DESCRIPTION OF THE PREFERRED EMBODIMENTS <Schematic Description of Overall Configuration> FIG.
1 is a block diagram of an electronic mail reading device according to an embodiment of the present invention. This e-mail reading device is shown in FIG.
As shown in FIG. 1, an operation unit 1 as an input unit and a communication unit 3
A reading unit 5, an information processing unit 7, a storage unit (first and second storage means) 9, a display unit 10, an amplifier 11,
It is configured to include a speaker 13, and here, it is assumed to be mounted on a vehicle. The information processing unit 7 includes a central control unit 7
a, a mail transmission / reception unit 7b, an information acquisition unit 7c, a mail management unit 7d, a voice quality information management unit (feature extraction unit, first and second database creation units) 7e, a display processing unit 7f, and a speech synthesis unit (sender identification) 7g). In this configuration, the first and second information capturing means according to the present invention include an operation unit 1, a communication unit 3, a reading unit 5, a central control unit 7a, and an information capturing unit 7c.
Is supported.

【００１４】ここで、この読み上げ装置は、大略的に、
電子メールの送受信機能、および受信した電子メールの
音声による読み上げ出力機能を備えている。そして、そ
の読み上げ出力は、電子メールの差出人の声質の特徴に
応じて合成された音声、あるいは差出人に予め対応付け
られた声質の音声によって行われるようになっている。Here, this reading device generally includes
It has a function of sending and receiving e-mails and a function of reading out received e-mails by voice. The text-to-speech output is performed by a voice synthesized according to the characteristics of the voice quality of the sender of the electronic mail, or a voice of voice quality previously associated with the sender.

【００１５】この電子メールの読み上げに関しては、読
み上げを行う読み上げモードと、読み上げを行わない非
読み上げモードとが切り替え可能に用意されている。ま
た、読み上げモードにおいては、第１および第２のサブ
モードが設けられており、第１のサブモードでは、電子
メールがその電子メールの差出人に対応付けらえた声質
の音声で読み上げられるようになっており、第２のサブ
モードでは、電子メールが手動設定された男性または女
性の標準的な声質の音声で読み上げられるようになって
いる。As for the reading of the e-mail, a reading mode in which reading is performed and a non-reading mode in which reading is not performed are prepared so as to be switchable. In the reading mode, first and second sub-modes are provided. In the first sub-mode, an e-mail is read out with a voice having a voice quality associated with the sender of the e-mail. In the second sub-mode, the e-mail is read aloud in a manually set male or female standard voice.

【００１６】この読み上げモードと非読み上げモードと
の間、および第１のサブモードと第２のサブモードとの
間のモード切り替えは、操作部１から入力されるモード
切り替えのための操作入力に基づいて、後述する中央制
御部７ａによって行われる。The mode switching between the reading mode and the non-reading mode and between the first sub-mode and the second sub-mode is based on an operation input from the operating unit 1 for mode switching. This is performed by the central control unit 7a described later.

【００１７】また、読み上げモードの第１のサブモード
では、各差出人にその差出人本人の声質や、特定の芸能
人等の他の声質を対応付けておくと、その対応付けてお
いた声質が自動的に選択され、その選択された声質の音
声で各差出人からの電子メールが読み上げられるように
なっている。In the first sub-mode of the reading mode, if the voice quality of the sender or another voice quality such as a particular entertainer is associated with each sender, the associated voice quality is automatically set. , And the e-mail from each sender is read out using the voice of the selected voice quality.

【００１８】＜声質の特徴抽出および再現の原理説明＞
次に、読み上げ装置の具体的構成について説明する前
に、本実施形態に採用されている声質の特徴の抽出手法
およびその抽出した声質の特徴の再現手法について簡単
に説明する。なお、前記各手法には、例えば「音声・音
情報のディジタル信号処理」（昭晃堂、１９９７、鹿野
清宏等共著）等の文献に示される公知技術（スベクトル
包絡を用いた手法）が用いられる。<Principle explanation of voice quality feature extraction and reproduction>
Next, before describing a specific configuration of the reading device, a brief description will be given of a method of extracting voice characteristic features and a method of reproducing the extracted voice characteristic employed in the present embodiment. For each of the above-mentioned techniques, a known technique (a technique using a vector envelope) shown in a document such as “Digital signal processing of voice / sound information” (co-authored by Shokodo, 1997, Kiyohiro Kano, etc.) is used. Can be

【００１９】音声により文書を読み上げる際の声質の特
徴には、音響的特徴（音色）と韻律的特徴（音の長さ、
高さ、強さ）とがあり、声質の特徴抽出は、音声合成に
用いられる各音素についてのこれら２つの特徴を示す特
徴パラメータ（ピッチ成分（高さ）、強さ、スペクトル
包絡成分）をディジタル音声情報から抽出することによ
り行われる。この特徴抽出処理は、後述する声質情報管
理部７ｅによって行われる。The features of voice quality when reading a document by voice include acoustic features (tone colors) and prosodic features (sound duration,
The feature extraction of voice quality is performed by digitally extracting feature parameters (pitch component (height), intensity, and spectral envelope component) indicating these two features for each phoneme used for speech synthesis. This is performed by extracting from audio information. This feature extraction process is performed by a voice quality information management unit 7e described later.

【００２０】この抽出工程をさらに詳細に説明すると、
以下のようになる。まず、差出人に対応付けるべきその
差出人等（仮にＡさんとする）のディジタル音声情報に
対して音声認識を行うことにより、意味を持ったテキス
ト情報（＝音素列）と、各音素の開始・終了時間を得
る。音声認識では特徴パラメータのうちスペクトル包絡
のみを使用するが、音声情報を分析する際には、各音素
の声質の特徴を示すピッチ成分（高さ）と強さも同時に
求めることができる。よって、音声認識の結果得られた
音素列と、音声情報を分析した結果とを対応付けること
により、各音素の声質の特徴を示す声質情報が得られ
る。なお、テキスト情報の抽出は、後述する音韻データ
ベース９ｄに格納された音韻モデル、および後述する語
彙データベース９ｅに格納された語彙辞書を用いて行わ
れる。The extraction step will be described in more detail.
It looks like this: First, by performing speech recognition on the digital voice information of the sender or the like (tentatively, Mr. A) to be associated with the sender, meaningful text information (= phoneme sequence) and start / end time of each phoneme Get. In the speech recognition, only the spectral envelope among the feature parameters is used. However, when analyzing the speech information, the pitch component (height) and the strength indicating the characteristics of the voice quality of each phoneme can be obtained at the same time. Therefore, by associating the phoneme sequence obtained as a result of the speech recognition with the result of analyzing the speech information, voice quality information indicating the characteristics of the voice quality of each phoneme can be obtained. The extraction of text information is performed using a phoneme model stored in a phoneme database 9d described later and a vocabulary dictionary stored in a vocabulary database 9e described later.

【００２１】ところで、日本語の基本的な音素は４３個
で構成されているが、後に続く音素によって音素の特徴
が変わるため、音声合成では例えば１０００程度に拡張
された音素が使用されるようになっている。例えば、
「赤（ａｋａ）」という発音中の「ｋ」の音素と「秋
（ａｋｉ）」という発音中の「ｋ」の音素とは、後に続
く音素の違いにより特徴が異なり、音声合成では別個の
ものとして取り扱われる。By the way, although the basic phonemes of Japanese are composed of 43, the characteristics of the phonemes change depending on the following phonemes, so that in speech synthesis, for example, phonemes expanded to about 1000 are used. Has become. For example,
The phoneme of "k" in the pronunciation of "red (aka)" and the phoneme of "k" in the pronunciation of "autumn (aki)" have different characteristics due to the difference in the following phonemes, and are different in speech synthesis. Is treated as

【００２２】続いて、その抽出された各音素の声質パタ
ーンと、その各音素の予め用意されている標準的な声質
の特徴を示す標準パターンとの写像関係が計算され、そ
の計算結果に基づいて、音声合成に必要とされる全音素
の声質の標準パターンを変換してＡさんの声質パターン
を再現するために必要な変換用コードブックが作成され
る。本実施形態では、この変換用コードブックがＡさん
の声質を再現するための声質情報として、後述する声質
データベース９ｂに登録される。なお、一般に音声情報
から実際に声質パターンが取得できる音素の数は、音声
合成に必要とされる全音素（ここでは１０００個程度）
の音素のうちのごく一部であり、音声パターンが実際に
取得できなかった他の音素については、実際に声質パタ
ーンが取得できた音素のサンプリング結果に基づいて、
類推的に標準パターンの変換用コードが作成される。Subsequently, a mapping relation between the extracted voice quality pattern of each phoneme and a standard pattern indicating a standard voice quality feature prepared for each phoneme is calculated, and based on the calculation result, Then, a conversion codebook necessary for reproducing the voice pattern of Mr. A by converting a standard pattern of voice quality of all phonemes required for speech synthesis is created. In the present embodiment, this conversion codebook is registered in a voice quality database 9b described later as voice quality information for reproducing the voice quality of Mr. A. In general, the number of phonemes for which a voice quality pattern can be actually obtained from speech information is the total number of phonemes required for speech synthesis (here, about 1000).
For other phonemes that are only a part of the phonemes of which the voice pattern could not be actually obtained, based on the sampling results of the phonemes for which the voice quality pattern was actually obtained,
By analogy, a conversion code for the standard pattern is created.

【００２３】そして、読み上げ用の音声合成の際には、
声質データベース９ｂに保存された変換用コードブック
を用いて各音素の前記標準パターンを変換することによ
り、各差出人に対応付けられた各音素の声質の声質パタ
ーンを再現し、その声質パターンに基づいて読み上げ用
の音声が合成されるようになっている。この音声合成処
理は、後述する音声合成部７ｇによって行われる。な
お、変形例として、各音素の前記声質パターンを声質情
報として声質データベース９ｂに登録し、その声質パタ
ーンに基づいて読み上げ用の声質を再現するようにして
もよい。In the case of speech synthesis for reading aloud,
By converting the standard pattern of each phoneme using the conversion codebook stored in the voice quality database 9b, the voice quality pattern of each phoneme associated with each sender is reproduced, and based on the voice quality pattern. Speech for reading is synthesized. This speech synthesis processing is performed by a speech synthesis unit 7g described later. As a modification, the voice quality pattern of each phoneme may be registered as voice quality information in the voice quality database 9b, and the voice quality for reading aloud may be reproduced based on the voice quality pattern.

【００２４】＜装置構成の詳細説明＞次に、読み上げ装
置の構成について説明する。操作部１は、ユーザからの
操作入力を受け付けるためのものである。<Detailed Description of Apparatus Configuration> Next, the configuration of the reading apparatus will be described. The operation unit 1 is for receiving an operation input from a user.

【００２５】通信部３は、携帯電話機等を備えて構成さ
れており、情報処理部７のメール送受信部７ｂおよび情
報取込部７ｃの制御により、無線電話回線を利用して通
信回線（インターネット等の通信ネットワーク等）に接
続し、通信回線を介して所定のサーバ等との間で情報送
受を行う機能を有している。また、通信部３は、電話回
線に接続して通話を行う機能を有しており、情報取込部
７ｃの制御により、電話回線を介して所定の通話相手と
の間で通話処理を行う電話機能も有している。The communication unit 3 is provided with a portable telephone or the like, and is controlled by a mail transmission / reception unit 7b and an information acquisition unit 7c of the information processing unit 7 to use a communication line (such as the Internet) using a wireless telephone line. And a function of transmitting and receiving information to and from a predetermined server or the like via a communication line. The communication unit 3 has a function of making a call by connecting to a telephone line, and under the control of the information acquisition unit 7c, a telephone that performs a call process with a predetermined communication partner via the telephone line. It also has the ability.

【００２６】読取部５は、情報取込部７ｃの制御によ
り、ＣＤ−ＲＯＭ等の持ち運び可能な記録媒体５ａから
音声情報あるいは声質情報等の情報の取り込みを行う。The reading section 5 fetches information such as voice information or voice quality information from a portable recording medium 5a such as a CD-ROM under the control of the information fetching section 7c.

【００２７】記憶部９には、差出人データベース９ａ、
声質データベース９ｂ、メールデータベース９ｃ、音韻
データベース９ｄおよび語彙データベース９ｅが備えら
れている。差出人データベース９ａには、複数の差出人
のアドレス情報（名前、電話番号、メールアドレス、性
別、声質情報識別用ＩＤ等）が含まれている。メールデ
ータベース９ｃには、受信した電子メールが格納され
る。The storage unit 9 stores a sender database 9a,
A voice quality database 9b, a mail database 9c, a phoneme database 9d, and a vocabulary database 9e are provided. The sender database 9a includes address information of a plurality of senders (name, telephone number, mail address, gender, voice quality information identification ID, etc.). The received e-mail is stored in the mail database 9c.

【００２８】声質データベース９ｂには、音声合成に用
いる各音素の標準的な声質の特徴を示す標準パターン情
報と、差出人データベース９ａ内に登録された各差出人
に対応付けられ、各音素の前記標準パターンを用いて各
差出人に対応づけられた声質を再現するための複数の変
換用コードブック、および標準的な男女の２つの声質パ
ターンが含まれている。The voice quality database 9b has standard pattern information indicating the characteristics of the standard voice quality of each phoneme used for speech synthesis, and is associated with each sender registered in the sender database 9a. , And a plurality of conversion codebooks for reproducing voice quality associated with each sender by using, and two standard male and female voice quality patterns.

【００２９】音韻データベース９ｄおよび語彙データベ
ース９ｅには、音声認識に用いられる音韻モデルおよび
語彙辞書が格納されている。The phoneme database 9d and the vocabulary database 9e store phoneme models and vocabulary dictionaries used for speech recognition.

【００３０】情報処理部７の中央制御部７ａは、操作部
１を介して入力される操作入力に基づいて、情報処理部
７の各機能要素（７ｂ〜７ｇ）の制御を司っているとと
もに、操作部１からの入力等に基づいて電子メールを作
成する機能も有している。The central control unit 7a of the information processing unit 7 controls the functional elements (7b to 7g) of the information processing unit 7 based on an operation input input via the operation unit 1. And a function of creating an e-mail based on an input from the operation unit 1 or the like.

【００３１】メール送受信部７ｂは、中央制御部７ａか
ら与えられるメール受信指令に応答して通信部３および
通信ネットワークを介して所定のサーバから電子メール
を受信する機能と、中央制御部７ａから与えられるメー
ル送信指令に応答して、中央制御部７ａから与えられる
電子メールを、通信部３および通信ネットワークを介し
て所定の送信先に送信する機能とを有している。メール
送受信部７ｂによって受信された電子メールは、一旦メ
ール管理部７ｄに送られる。The mail transmission / reception unit 7b has a function of receiving an electronic mail from a predetermined server via the communication unit 3 and the communication network in response to a mail reception command given from the central control unit 7a, and a function provided by the central control unit 7a. And a function of transmitting an e-mail provided from the central control unit 7a to a predetermined destination via the communication unit 3 and the communication network in response to a given mail transmission command. The e-mail received by the mail transmitting / receiving unit 7b is once sent to the mail management unit 7d.

【００３２】情報取込部７ｃは、中央制御部７ａから与
えられる情報取込指令に応答して、通信部３および通信
ネットワークを介して所定のサーバから、あるいは読取
部５を介して記録媒体５ａから音声情報あるいは声質情
報（ここでは声質パラメータ）の取り込みを行う機能を
有している。情報取得を通信ネットワークを介したサー
バと記録媒体５ａとのいずれからから行うのか、および
取得情報として音声情報と声質情報とのいずれを取得す
るのかは、情報取得指令の示す内容に従って決定され
る。また、音声情報および声質情報には、その音声情報
および声質情報の本人等を示す人物情報（差出人情報）
が付与されている場合があり、その場合には、人物情報
も音声情報および声質情報とともに取り込まれる。The information fetching unit 7c responds to an information fetching instruction given from the central control unit 7a, from a predetermined server via the communication unit 3 and the communication network, or from the recording medium 5a via the reading unit 5. Has a function of taking in voice information or voice quality information (here, voice quality parameters) from the. Whether the information is obtained from the server or the recording medium 5a via the communication network, and whether the voice information or the voice quality information is obtained as the obtained information is determined according to the contents indicated by the information obtaining command. The voice information and voice quality information include personal information (sender information) indicating the identity of the voice information and voice quality information.
May be given, and in that case, the person information is also taken in together with the voice information and the voice quality information.

【００３３】さらに、情報取込部７ｃは、中央制御部７
ａの制御により、通信部３に電話回線を介して通話相手
との間で通話を可能とさせ、その通話相手の音声情報を
通信部３を介して取り込む機能も有している。Further, the information fetching unit 7c includes a central control unit 7
Under the control of a, the communication unit 3 also has a function of enabling communication with a communication partner via a telephone line, and taking in voice information of the communication partner via the communication unit 3.

【００３４】メール管理部７ｄは、メール送受信部７ｂ
から与えられた電子メールをメールデータベース９ｃに
保存する機能と、中央制御部７ａから与えられる読み上
げ指令に応答して、その読み上げ指令によって特定され
る電子メールをメールデータベース９ｃから読み出して
音声合成部７ｇに送り出す機能と、中央制御部７ａから
与えられる表示指令に応答して、その表示指令によって
特定される電子メールをメールデータベース９ｃから読
み出して表示処理部７ｆに送り出す機能とを有してい
る。The mail management section 7d is provided with a mail transmission / reception section 7b.
In response to a read-out command given from the central control unit 7a, the electronic mail specified by the read-out command is read from the mail database 9c, and the voice synthesizing unit 7g is read. And a function of reading an e-mail specified by the display command from the mail database 9c in response to a display command given from the central control unit 7a and sending it to the display processing unit 7f.

【００３５】声質情報管理部７ｅは、大略的に、中央制
御部７ａから与えられる指示に応じて、変換用コードブ
ックの作成処理、差出人データベース９ａおよび声質デ
ータベース９ｂの管理等を行っている。この声質情報管
理部７ｅのより詳細な処理内容については、図２のフロ
ーチャート等に基づいて後述する。The voice quality information management section 7e generally performs conversion codebook creation processing, management of the sender database 9a and voice quality database 9b, etc. in accordance with an instruction given from the central control section 7a. More detailed processing content of the voice quality information management unit 7e will be described later based on the flowchart of FIG.

【００３６】表示処理部７ｆは、中央制御部７ａの制御
に従って表示部１０の表示内容を制御しており、例えば
操作情報や電子メールの内容を表示部１０を介して表示
出力する。The display processing unit 7f controls the display contents of the display unit 10 under the control of the central control unit 7a, and outputs and outputs, for example, operation information and the contents of an e-mail via the display unit 10.

【００３７】音声合成部７ｇは、大略的に、この読み上
げ装置の動作モードが読み上げモードとなっている際
に、中央制御部７ａから与えられる指示に応じて、電子
メールごとに前述の各サブモードに対応する声質の読み
上げ用音声を合成し、メール管理部７ｄから与えられる
電子メールをアンプ１１およびスピーカ１３を介して読
み上げ出力する。この音声合成部７ｇのより詳細な処理
内容については、図５のフローチャート等に基づいて後
述する。In general, when the operation mode of the reading device is the reading mode, the voice synthesizing unit 7g responds to the instruction given from the central control unit 7a to send each of the sub-modes described above for each e-mail. And synthesizes a read-out voice having a voice quality corresponding to the above, and reads out and outputs an e-mail provided from the mail management unit 7d via the amplifier 11 and the speaker 13. More detailed processing content of the voice synthesizing unit 7g will be described later based on the flowchart of FIG.

【００３８】＜動作説明＞図２は、声質データベース等
の作成のための処理内容を示すフローチャートである。
ステップＳ１で、操作部１を介して入力された情報取込
命令が中央制御部７ａによって受け付けられると、ステ
ップＳ２にて、情報の取込処理が行われる。この情報取
込処理では、中央制御部７ａの指示により情報取込部７
ｃが、通信部３あるいは読取部５を介して音声情報ある
いは声質情報（ここでは声質パラメータ）の取り込みを
行うようになっている。取り込んだ情報は、声質情報管
理部７ｅに与えられる。<Description of Operation> FIG. 2 is a flow chart showing the processing contents for creating a voice quality database and the like.
In step S1, when the central control unit 7a receives an information fetch instruction input via the operation unit 1, information fetch processing is performed in step S2. In this information fetching process, the information fetching section 7 is instructed by the central control section 7a.
c captures voice information or voice quality information (here, voice quality parameters) via the communication unit 3 or the reading unit 5. The captured information is provided to the voice quality information management unit 7e.

【００３９】ここで、通信部３を介した音声情報の取込
の態様には、通信ネットワークを介して特定のサーバか
ら音声情報等を取得する態様と、電話回線を介して通話
相手の発話内容からその通話相手の音声情報を取得する
態様とがあり、いずれの態様で音声情報を取得するか
は、ユーザによって指定される。また、取得対象の音声
情報あるいは声質情報にその音声情報あるいは声質情報
の人物情報が付与されている場合には、その人物情報も
一緒に取り込まれる。Here, the manner in which voice information is taken in through the communication unit 3 includes a mode in which voice information and the like are acquired from a specific server via a communication network, and a speech content of a communication partner through a telephone line. , The voice information of the other party is obtained, and in which mode the voice information is obtained is specified by the user. Further, when the voice information or voice quality information to be acquired is given the personal information of the voice information or voice quality information, the personal information is also taken in.

【００４０】続くステップＳ３では、ステップＳ２で音
声情報が取得された場合に、前述のように声質情報管理
部７ｅによって、音声認識によりその音声情報に含まれ
るテキスト情報の抽出が行われ、そのテキスト情報と音
声情報とに基づいてその音声情報に含まれる各音素の声
質の特徴を示す声質パラメータの抽出が行われ、ステッ
プＳ４に進む。なお、ステップＳ２で音声情報ではなく
声質パラメータが取り込まれた場合には、この特徴抽出
処理を行うことなく次のステップＳ４に進む。In the following step S3, when the voice information is obtained in step S2, the voice quality information management unit 7e extracts the text information included in the voice information by voice recognition as described above, and Based on the information and the voice information, voice quality parameters indicating characteristics of voice quality of each phoneme included in the voice information are extracted, and the process proceeds to step S4. When the voice quality parameter is read instead of the voice information in step S2, the process proceeds to the next step S4 without performing the feature extraction process.

【００４１】ステップＳ４では、ステップＳ２で取得し
た取得情報に人物情報が含まれており、かつその人物情
報と差出人データベース９ａの情報とに基づいて、その
人物情報に対応する差出人データベース９ａ内の差出人
を特定することが可能か否かが、声質情報管理部７ｅに
よって判断される。そして、人物情報に含まれるその人
物の名前、メールアドレス、電話番号等の情報と、差出
人データベース９ａ内の各差出人のアドレス情報とに基
づいて、人物情報に対応する差出人データベース９ａ内
の差出人が特定できる場合にはステップＳ５に進む一
方、取得情報に人物情報が含まれていなかったり、人物
情報あるいは差出人データベース９ａの情報が不十分で
あったりして対応する差出人の特定ができない場合には
ステップＳ８に進む。In step S4, person information is included in the acquired information obtained in step S2, and the sender in the sender database 9a corresponding to the person information is determined based on the person information and the information in the sender database 9a. Is determined by the voice quality information management unit 7e. Then, the sender in the sender database 9a corresponding to the person information is specified based on the information such as the name, mail address, telephone number, etc. of the person included in the person information and the address information of each sender in the sender database 9a. If it is possible, the process proceeds to step S5. On the other hand, if the acquired information does not include the person information, or if the person information or the information in the sender database 9a is insufficient, the corresponding sender cannot be specified. Proceed to.

【００４２】ステップＳ５では、声質情報管理部７ｅの
動作モードが、変換用コードブックの作成および登録等
を自動的に行う自動コードブック作成モードに設定され
ているか、あるいはユーザに作成等の要否を確認してか
ら変換用コードブックの作成および登録等を行う手動モ
ードに設定されているかが、声質情報管理部７ｅによっ
て判断され、自動コードブック作成モードに設定されて
いる場合にはステップＳ６に進む一方、設定されていな
い場合にはステップＳ８に進む。ここで、自動コードブ
ック作成モードと手動モードとの間の切り替えは、操作
部１および中央制御部７ａを介したユーザ入力により行
うことができる。In step S5, the operation mode of the voice quality information management unit 7e is set to an automatic codebook creation mode for automatically creating and registering a conversion codebook, or the necessity of creation by the user. Is confirmed, the voice quality information management unit 7e determines whether or not the mode is set to the manual mode for creating and registering the codebook for conversion. If the mode is set to the automatic codebook creation mode, the process proceeds to step S6. On the other hand, if not set, the process proceeds to step S8. Here, switching between the automatic codebook creation mode and the manual mode can be performed by a user input via the operation unit 1 and the central control unit 7a.

【００４３】ステップＳ６では、ステップＳ４で特定さ
れた差出人データベース９ａ内の差出人に対応付けられ
た変換用コードブックが声質データベース９ｂ内に存在
するか否かが、差出人データベース９ａの対応する差出
人のアドレス情報内における声質情報識別用ＩＤの有無
等に基づいて声質情報管理部７ｅによって判断され、対
応する変換用コードブックが存在する場合にはステップ
Ｓ７に進む一方、対応する変換用コードブックが存在し
ない場合にはステップＳ１１に進む。In step S6, it is determined whether or not the conversion codebook associated with the sender in the sender database 9a specified in step S4 exists in the voice quality database 9b by checking the address of the corresponding sender in the sender database 9a. The voice quality information management unit 7e determines based on the presence / absence of the voice quality information identification ID in the information, and if there is a corresponding conversion codebook, the process proceeds to step S7, while the corresponding conversion codebook does not exist. In this case, the process proceeds to step S11.

【００４４】ステップＳ７では、声質情報管理部７ｅに
よって、ステップＳ３で抽出された声質パラメータある
いはステップＳ２で取得された声質パラメータを用い
て、声質情報識別用ＩＤによって特定された対応する既
存の変換用コードブックの内容が最適化処理される。In step S7, the voice quality information management unit 7e uses the voice quality parameter extracted in step S3 or the voice quality parameter obtained in step S2, and uses the voice quality parameter identified by the voice quality information identification ID to store the corresponding existing conversion information. The contents of the codebook are optimized.

【００４５】ところで、一般に、前述のように音声情報
から各音素の声質パラメータを抽出し、その声質パラメ
ータに基づいて変換用コードブックを作成し、その変換
用コードブックに基づいて各音素の元の声質の再現を行
う場合、１回のサンプリングでは、真の声質に近い声質
を再現するのは困難なことが多い。図３中の符号Ａは、
特定の音素の真の声質の特徴を示すスペクトログラムを
模式的に示すものであり、符号Ｂ１は、１回のサンプリ
ング結果に基づいて再現された前記音素の声質の特徴を
示すスペクトログラムを模式的に示すものであり、これ
より１回のサンプリングでは真の声質の特徴の再現が難
しいことが分かる。By the way, in general, voice parameters of each phoneme are extracted from voice information as described above, a conversion codebook is created based on the voice quality parameters, and an original code of each phoneme is generated based on the conversion codebook. When reproducing voice quality, it is often difficult to reproduce voice quality close to true voice quality by one sampling. The symbol A in FIG.
FIG. 4 schematically shows a spectrogram showing characteristics of the true voice quality of a specific phoneme, and reference numeral B1 schematically shows a spectrogram showing characteristics of the voice quality of the phoneme reproduced based on one sampling result. From this, it can be seen that it is difficult to reproduce the characteristics of the true voice quality by one sampling.

【００４６】そこで、本実施形態では、各音素の声質再
現に用いる声質の特徴を示す声質パラメータが追加的に
得られた場合には、ステップＳ７にて、対応する既存の
変換用コードブックから得られる既存の声質パラメータ
と、ステップＳ３で追加的に得られた声質パラメータと
に基づいて、変換用コードブックが再構成されて最適化
され、その最適化された変換用コードブックが声質デー
タベース９ｂに登録されるようになっている。図４中の
符号Ｂ２は、１回目および２回目のサンプリング結果に
基づいて再現された前記音素の声質の特徴を示すスペク
トログラムを模式的に示すものであり、これより真の声
質により近い声質が再現できていることが分かる。Therefore, in the present embodiment, when a voice quality parameter indicating a voice quality characteristic used for voice quality reproduction of each phoneme is additionally obtained, in step S7, the voice quality parameter is obtained from the corresponding existing conversion codebook. Based on the existing voice quality parameters obtained and the voice quality parameters additionally obtained in step S3, the conversion codebook is reconfigured and optimized, and the optimized conversion codebook is stored in the voice quality database 9b. It is to be registered. The symbol B2 in FIG. 4 schematically shows a spectrogram showing characteristics of the voice quality of the phoneme reproduced based on the first and second sampling results, and a voice quality closer to the true voice quality is reproduced. You can see that it is done.

【００４７】ステップＳ１１では、声質情報管理部７ｅ
によりステップＳ３で抽出された声質パラメータ、ある
いはステップＳ２で取得された声質パラメータに基づい
て、前述のようにして変換用コードブックが新規に作成
されて声質データベース９ｂ内に登録されるとともに、
その変換用コードブックを識別するための声質情報識別
用ＩＤが作成されて、差出人データベース９ａ内の対応
する差出人のアドレス情報に登録される。In step S11, voice quality information management section 7e
Based on the voice quality parameters extracted in step S3 or the voice quality parameters acquired in step S2, a conversion codebook is newly created and registered in the voice quality database 9b as described above,
A voice quality information identification ID for identifying the conversion codebook is created and registered in the address information of the corresponding sender in the sender database 9a.

【００４８】なお、このステップＳ１１では、アドレス
情報として差出人データベース９ａ内に登録すべき情報
が、ステップＳ２または後述するステップＳ８で取得さ
れている場合には、その情報が差出人データベース９ａ
内に登録される。In this step S11, if the information to be registered as address information in the sender database 9a has been obtained in step S2 or step S8 described later, the information is stored in the sender database 9a.
Registered within.

【００４９】ステップＳ８では、声質情報管理部７ｅか
ら中央制御部７ａに対し、ユーザに対する声質情報の登
録の要否確認処理およびユーザからの必要情報の入力受
付処理を行うべきことが指示され、その要否確認処理お
よび必要情報の入力受付処理が行われる。In step S8, the voice quality information management unit 7e instructs the central control unit 7a to perform a process of confirming the necessity of registration of voice quality information for the user and a process of accepting input of necessary information from the user. A necessity confirmation process and a necessary information input reception process are performed.

【００５０】要否確認処理では、要否確認のための入力
をユーザに促すための確認出力が中央制御部７ａの制御
により表示部１０あるいは音声合成部７ｇおよびスピー
カ１３を介して行われる。また、必要情報の入力受付処
理では、前記確認出力に応じてユーザから操作部１を介
して登録の要否を示す入力が行われると、その入力内容
が声質情報管理部７ｅに与えられるとともに、その声質
情報を対応付けるべき差出人を特定するための差出人情
報の受け付けが操作部１を通じて行われる。入力された
差出人情報は、中央制御部７ａから声質情報管理部７ｅ
に与えられる。In the necessity confirmation processing, a confirmation output for prompting the user for the necessity confirmation input is performed through the display unit 10 or the voice synthesis unit 7g and the speaker 13 under the control of the central control unit 7a. In the input receiving process of the necessary information, when an input indicating the necessity of registration is performed by the user via the operation unit 1 in response to the confirmation output, the input content is provided to the voice quality information management unit 7e, Acceptance of sender information for specifying a sender to be associated with the voice quality information is performed through the operation unit 1. The input sender information is sent from the central control unit 7a to the voice quality information management unit 7e.
Given to.

【００５１】ステップＳ９では、ステップＳ８にて登録
を指示する登録命令が入力されたか否かが声質情報管理
部７ｅによって判断され、登録命令が入力された場合に
はステップＳ１０に進む一方、登録命令が入力されない
場合には登録を行うことなく処理が終了される。In step S9, the voice quality information management section 7e determines whether or not a registration command for instructing registration has been input in step S8. If the registration command has been input, the process proceeds to step S10, while the registration command has been input. If is not input, the process ends without performing registration.

【００５２】ステップＳ１０では、声質情報の新規登録
か否かの判断が声質情報管理部７ｅによって行われ、新
規登録の場合にはステップＳ１１に進み、変換用コード
ブックの新規作成および登録処理等が行われる一方、新
規登録でない場合にはステップＳ７に進み、既存の変換
用コードブックの最適化処理が行われる。In step S10, the voice quality information management section 7e determines whether or not the voice quality information is newly registered. If the voice quality information is newly registered, the process proceeds to step S11, where the conversion codebook is newly created and registered. On the other hand, if it is not a new registration, the process proceeds to step S7, where the existing codebook for conversion is optimized.

【００５３】なお、このステップＳ１０での新規登録か
否かの判断は、ステップＳ８で入力された差出人情報が
示す差出人が、差出人データベース９ａ内に登録されて
おり、かつ、その差出人に対応する変換用コードブック
が声質データベース９ｂ内に既に登録されているか否か
により判断され、差出人情報が示す差出人が差出人デー
タベース９ａ内に存在しない場合や、存在してもその差
出人に対応する変換用コードブックが声質データベース
９ｂ内に存在しない場合には、新規登録と判断される。It should be noted that whether or not a new registration is made in step S10 is determined by determining that the sender indicated by the sender information input in step S8 is registered in the sender database 9a and that the conversion corresponding to the sender is to be performed. It is determined based on whether or not the sender code book is already registered in the voice quality database 9b. If the sender indicated by the sender information does not exist in the sender database 9a, or if the sender exists, the conversion code book corresponding to the sender is not determined. If it does not exist in the voice quality database 9b, it is determined to be a new registration.

【００５４】これによって、電子メールの各差出人本人
の音声情報（あるいは声質情報）や、芸能人等の特定の
他の人物の音声情報（あるいは声質情報）を取得して、
その取得した音声情報により抽出したその音声情報の声
質の特徴を示す声質情報を、各差出人から送信されてき
た電子メールの読み上げ出力用の声質情報として容易に
保存し、利用することができるようになっている。As a result, the voice information (or voice quality information) of each sender of the electronic mail and the voice information (or voice quality information) of another specific person such as an entertainer are obtained.
Voice quality information indicating the voice characteristics of the voice information extracted from the obtained voice information can be easily stored and used as voice quality information for reading out the e-mail sent from each sender. Has become.

【００５５】図５は、電子メールの読み上げのための声
質の作成のための処理内容を示すフローチャートであ
り、この読み上げ装置の動作モードが、電子メールの読
み上げを行う読み上げモードの第１のサブモードに設定
されている場合の処理内容を示している。なお、第１の
サブモードでは、電子メールがその電子メールの差出人
に対応付けられた声質の音声で読み上げられるようにな
っている。FIG. 5 is a flowchart showing the contents of processing for creating a voice quality for reading out an e-mail. The operation mode of this reading out device is a first sub-mode of a reading out mode for reading out an e-mail. Shows the processing contents when it is set to. In the first sub-mode, the electronic mail is read aloud in a voice having a voice quality associated with the sender of the electronic mail.

【００５６】ステップＳ２１では、操作部１を介して読
み上げ命令および読み上げ対象の電子メールを特定する
メール特定情報の入力が行われると、それに応答して、
中央制御部７ａからメール管理部７ｄおよび音声合成部
７ｇにメール読み上げ指令が与えられる。ここで、メー
ル管理部７ｄに与えられる読み上げ指令には、読み上げ
対象の電子メールを特定するためのメール特定情報も付
与されている。音声合成部７ｇに与えられる読み上げ指
令には、第１のサブモードで読み上げを行うべきことを
示す指示が付与されている。また、このステップＳ２１
では、メール管理部７ｄによって、読み上げ指令に応答
して、その読み上げ指令に付与されたメール特定情報に
よって特定される電子メールがメールデータベース９ｃ
から読み出されて音声合成部７ｇに送り出される。In step S21, when a reading command and mail specifying information for specifying an electronic mail to be read are input via the operation unit 1, in response to the input,
The central control unit 7a issues a mail reading instruction to the mail management unit 7d and the voice synthesis unit 7g. Here, the read-out command given to the mail management unit 7d is also provided with mail specifying information for specifying an electronic mail to be read out. The reading instruction given to the voice synthesizing unit 7g is given an instruction indicating that reading should be performed in the first sub-mode. This step S21
Then, in response to the reading instruction, the e-mail specified by the mail specifying information given to the reading instruction is sent to the mail database 9c by the mail managing unit 7d.
And sent out to the speech synthesizer 7g.

【００５７】なお、以下のステップＳ２２〜Ｓ３０の処
理は音声合成部７ｇが中心となって実行される。The following steps S22 to S30 are executed mainly by the voice synthesizer 7g.

【００５８】続くステップＳ２２では、メール管理部７
ｄから与えられた電子メールのヘッダ情報（例えば、
「ｆｒｏｍ：」の部分等）が解析され、そのヘッダ情報
に含まれるその電子メールの差出人を特定するための情
報（差出人の名前、メールアドレス等）が抽出される。In the following step S22, the mail management unit 7
header information of the e-mail given from d (for example,
The “from:” part is analyzed, and information (sender name, mail address, etc.) for identifying the sender of the electronic mail included in the header information is extracted.

【００５９】続くステップＳ２３では、ステップＳ２２
でヘッダ情報から抽出した情報に対応する差出人のアド
レス情報が差出人データベース９ａ内に存在するか否か
が判断され、存在する場合には差出人データベース９ａ
内の対応する差出人が特定されてステップＳ２４に進む
一方、存在しない場合にはステップＳ３０に進む。In the following step S23, step S22
It is determined whether the address information of the sender corresponding to the information extracted from the header information exists in the sender database 9a, and if so, the sender database 9a
The corresponding sender is specified and the process proceeds to step S24. If the sender does not exist, the process proceeds to step S30.

【００６０】ステップＳ２４では、ステップＳ２３で特
定された差出人に対応付けられた声質情報（ここでは変
換用コードブック）が声質データベース９ｂ内に存在す
るか否かが、その特定された差出人のアドレス情報内に
おける声質情報識別用ＩＤの有無等により判断され、対
応する声質情報が存在する場合にはステップＳ２５に進
む一方、対応する声質情報が存在しない場合にはステッ
プＳ２７に進む。In step S24, it is determined whether or not voice quality information (here, a conversion codebook) associated with the sender specified in step S23 exists in voice quality database 9b, and the address information of the specified sender is determined. The determination is made based on the presence or absence of the voice quality information identification ID, and if the corresponding voice quality information exists, the process proceeds to step S25. If the corresponding voice quality information does not exist, the process proceeds to step S27.

【００６１】ステップＳ２５では、差出人データベース
９ａ内の声質情報識別用ＩＤに基づいて読み上げ対象の
電子メールの差出人に対応付けられている声質の変換用
コードブックが声質データベース９ｂ内から読み出さ
れ、その変換用コードブックと声質データベース９ｂ内
に保存されている各音素の標準パラメータとに基づい
て、前述のようにしてその差出人に対応付けられている
各音素の声質の特徴を示す声質パラメータが作成され、
ステップＳ２６に進む。In step S25, a conversion codebook of voice quality associated with the sender of the electronic mail to be read out is read out from the voice quality database 9b based on the voice quality information identification ID in the sender database 9a. Based on the conversion codebook and the standard parameters of each phoneme stored in the voice quality database 9b, voice quality parameters indicating the voice quality characteristics of each phoneme associated with the sender are created as described above. ,
Proceed to step S26.

【００６２】一方、ステップＳ３０では、予めユーザに
より操作部１および中央制御部７ａを介して指定された
標準的な声質の声質パラメータ（標準的な男性または女
性の音声パラメータ）が声質データベース９９ｂから読
み出され、ステップＳ２６に進む。On the other hand, in step S30, standard voice parameters (standard male or female voice parameters) specified in advance by the user via the operation unit 1 and the central control unit 7a are read from the voice database 99b. Is issued and the process proceeds to step S26.

【００６３】また、ステップＳ２７では、この音声合成
部２７の動作モードが、差出人の性別を自動的に判別す
る性別自動判別モードに設定されているか否か判断さ
れ、設定されている場合にはステップＳ２８に進む一
方、設定されていない場合にはステップＳ３０に進む。
なお、音声合成部２７の動作モードの性別自動判別モー
ドへの設定およびその解除は、操作部１および中央制御
部７ａを介したユーザ入力により行われる。In step S27, it is determined whether or not the operation mode of the voice synthesizing unit 27 is set to a gender automatic discrimination mode for automatically discriminating the gender of the sender. While the process proceeds to S28, if not set, the process proceeds to step S30.
The setting of the operation mode of the voice synthesis unit 27 to the automatic gender discrimination mode and the release thereof are performed by a user input via the operation unit 1 and the central control unit 7a.

【００６４】ステップＳ２８では、ステップＳ２２でヘ
ッダ情報から抽出された情報と、差出人データベース９
ａ内の情報とに基づいて、読み上げ対象の電子メールの
差出人の性別の自動判別が可能か否かが判断され、可能
である場合には性別の自動判別が行われてステップＳ２
９に進む一方、不可能の場合はステップＳ３０に進む。In step S28, the information extracted from the header information in step S22 and the sender database 9
Based on the information in a, it is determined whether or not the gender of the sender of the electronic mail to be read can be automatically determined, and if so, the gender is automatically determined, and step S2 is performed.
On the other hand, if it is impossible, the process proceeds to step S30.

【００６５】ステップＳ２９では、ステップＳ２８の性
別の自動判別結果に対応する男性または女性の標準的な
声質パラメータが、声質データベース９ｂから読み出さ
れてステップＳ２６に進む。In step S29, standard male or female voice parameters corresponding to the result of the automatic gender discrimination in step S28 are read from the voice database 9b, and the flow advances to step S26.

【００６６】ステップＳ２６では、ステップＳ２５，Ｓ
２９，Ｓ３０で作成または読み出された各音素の声質パ
ラメータに基づいて、前述のようにしてその声質パラメ
ータに対応する声質の読み上げ出力用の音声が合成さ
れ、その音声によって読み上げ対象の電子メール（電子
メールの本文や差出人の名前等）がアンプ１１およびス
ピーカ１３を介して読み上げ出力される。At step S26, steps S25, S
29, based on the voice quality parameters of each phoneme created or read out in S30, a voice for voice-to-voice output of voice quality corresponding to the voice quality parameter is synthesized as described above. The text of the e-mail and the name of the sender are read out and output via the amplifier 11 and the speaker 13.

【００６７】なお、この読み上げ装置の動作モードが、
読み上げモードの第２のサブモードに設定されている場
合には、読み上げ命令の入力に応答して、ユーザにより
指定された予め用意された標準的な男性または女性の各
音素の声質パラメータが声質データベース９ｂから音声
合成部７ｇに読み込まれ、この声質パラメータによって
読み上げ用の音声が音声合成部７ｇによって合成され、
その音声によって読み上げ対象の電子メールがアンプ１
１およびスピーカ１３を介して読み上げ出力されるよう
になっている。The operation mode of the reading device is as follows.
When the second sub-mode of the reading mode is set, in response to the input of the reading command, the voice parameters of the standard male or female phonemes specified by the user are stored in the voice quality database. 9b is read by the voice synthesis unit 7g, and the voice for reading out is synthesized by the voice synthesis unit 7g based on the voice quality parameters.
The e-mail to be read aloud by the voice is amplifier 1
1 and a speaker 13.

【００６８】以上のように、本実施形態によれば、電子
メールの各差出人ごとに読み上げ出力用の任意の声質情
報を用意して、その電子メールをその電子メールの差出
人に対応付けられた声質で自動的に読み上げ出力するこ
とができ、ユーザの聴取感の向上が図れるとともに、声
質選択のための煩わしい操作も不要である。As described above, according to the present embodiment, arbitrary voice quality information for reading out and outputting is prepared for each sender of the electronic mail, and the voice is associated with the voice quality associated with the sender of the electronic mail. And can automatically read out and output, improving the user's listening feeling and eliminating the need for cumbersome operation for voice quality selection.

【００６９】また、声質情報として各差出人本人の声質
の特徴を示す声質情報を用意しておくことにより、電子
メールをその差出人の声質で読み上げ出力することがで
き、聴取感をさらに向上させることができる。Also, by preparing voice quality information indicating characteristics of the voice quality of each sender as voice quality information, an e-mail can be read out and output in the voice quality of the sender, and the listening feeling can be further improved. it can.

【００７０】さらに、音声情報を取り込み、その音声情
報に基づいて声質情報を作成する機能、および声質情報
を取り込む機能が備えられているため、ハードウェアや
ソフトウェアの変更を伴わずに差出人本人の声質情報
や、差出人に対応付ける声質情報を取得することができ
る。Further, since a function of taking in voice information and creating voice quality information based on the voice information and a function of taking in voice quality information are provided, the voice quality of the sender can be changed without changing hardware or software. Information and voice quality information associated with the sender can be obtained.

【００７１】また、ＣＤ−ＲＯＭ等の記録媒体５ａに人
の声や声質情報を記録しておくことにより、あるいはイ
ンターネット等の通信ネットワークや電話回線等の通信
回線を介して音声情報あるいは声質情報の取り込みを容
易に行うことができる。Further, by recording human voice and voice quality information on a recording medium 5a such as a CD-ROM, or by transmitting voice information or voice quality information through a communication network such as the Internet or a communication line such as a telephone line. Incorporation can be easily performed.

【００７２】さらに、差出人に対応する音声情報あるい
は声質情報を取得するたびに、その差出人に対応する声
質情報をより最適化させることができ、例えば、差出人
の声質により近い音声を合成して読み上げ出力を行うこ
とができる。Further, every time voice information or voice quality information corresponding to the sender is obtained, the voice quality information corresponding to the sender can be further optimized. For example, a voice closer to the voice quality of the sender is synthesized and read out and output. It can be performed.

【００７３】[0073]

【発明の効果】請求項１に記載の発明によれば、第２の
記憶手段には、予め登録された各差出人の声質の特徴を
示す声質情報の声質データベースが記憶されており、第
２の記憶手段の声質データベースに含まれる声質情報の
うちの差出人判別手段によって判別された差出人に対応
する声質情報に基づいて、与えられた電子メールをその
声質情報に対応する声質で読み上げるための音声が合成
されてスピーカを介して出力されるようになっているた
め、電子メールの各差出人ごとに読み上げ出力用の任意
の声質情報を用意して、その電子メールをその差出人に
対応付けられた声質で自動的に読み上げ出力することが
でき、ユーザの聴取感の向上が図れるとともに、声質選
択のための煩わしい操作も不要である。According to the first aspect of the present invention, the second storage means stores a voice quality database of voice quality information indicating voice characteristics of each sender registered in advance. Based on the voice quality information corresponding to the sender determined by the sender determination means in the voice quality information included in the voice quality database of the storage means, a voice for reading out the given e-mail with the voice quality corresponding to the voice quality information is synthesized. Is output through the speaker, and any voice quality information for reading out and outputting is prepared for each sender of the e-mail, and the e-mail is automatically converted to the voice quality corresponding to the sender. It is possible to read out aloud and to improve the listening feeling of the user, and it is not necessary to perform a troublesome operation for voice quality selection.

【００７４】また、声質情報として各差出人本人の声質
の特徴を示す声質情報を用意しておくことにより、電子
メールをその差出人本人の声質で読み上げ出力すること
ができ、聴取感をさらに向上させることができる。Also, by preparing voice quality information indicating the characteristics of the voice quality of each sender as voice quality information, an e-mail can be read out and output in the voice quality of the sender, thereby further improving the listening feeling. Can be.

【００７５】請求項２に記載の発明によれば、音声情報
を取り込み、その音声情報に基づいて声質情報を作成す
るようになっているため、ハードウェアやソフトウェア
の変更を伴わずに差出人本人の声質情報や、差出人に対
応付ける声質情報を取得することができる。According to the second aspect of the present invention, since voice information is taken in and voice quality information is created based on the voice information, the sender's identity can be changed without changing hardware or software. Voice quality information and voice quality information associated with the sender can be acquired.

【００７６】請求項３に記載の発明によれば、特徴抽出
のための音声情報の取り込みが、持ち運び可能な記録媒
体から、あるいは通信回線を介して行われるため、ＣＤ
−ＲＯＭ等の記録媒体に人の声を記録しておくことによ
り、あるいはインターネット等の通信ネットワークや電
話回線等の通信回線を介して音声情報の取り込みを容易
に行うことができる。According to the third aspect of the present invention, audio information for feature extraction is captured from a portable recording medium or via a communication line.
-Voice information can be easily captured by recording human voice in a recording medium such as a ROM or through a communication network such as the Internet or a telephone line.

【００７７】請求項４に記載の発明によれば、差出人に
対応する音声情報を取得するたびに、その差出人に対応
する声質情報をより最適化させることができ、例えば、
差出人の声質により近い音声を合成して読み上げ出力を
行うことができる。According to the fourth aspect of the present invention, each time voice information corresponding to a sender is acquired, voice quality information corresponding to the sender can be further optimized.
A voice closer to the voice quality of the sender can be synthesized and read out.

【００７８】請求項５に記載の発明によれば、声質情報
を取り込んで音声合成を行うようになっているため、ハ
ードウェアやソフトウェアの変更を伴わずに差出人本人
の声質情報や、差出人に対応付ける声質情報を取得する
ことができる。According to the fifth aspect of the present invention, since voice synthesis is performed by taking in voice quality information, the voice quality information of the sender is associated with the sender without changing hardware or software. Voice quality information can be obtained.

【００７９】請求項６に記載の発明によれば、声質情報
の取り込みが、持ち運び可能な記録媒体から、あるいは
通信通信を介して行われるため、ＣＤ−ＲＯＭ等の記録
媒体に人の声質情報を記録しておくことにより、あるい
はインターネット等の通信ネットワークや電話回線等の
通信回線を介して声質情報の取り込みを容易に行うこと
ができる。According to the sixth aspect of the present invention, since the voice quality information is taken in from a portable recording medium or through communication, the voice quality information of the person is recorded on a recording medium such as a CD-ROM. The recording of the voice quality information can be easily performed by recording or through a communication network such as the Internet or a communication line such as a telephone line.

【００８０】請求項７に記載の発明によれば、差出人に
対応する声質情報を取得するたびに、その差出人に対応
する声質情報をより最適化させることができ、例えば、
差出人の声質により近い音声を合成して読み上げ出力を
行うことができる。According to the seventh aspect of the present invention, each time voice quality information corresponding to a sender is obtained, the voice quality information corresponding to the sender can be further optimized.
A voice closer to the voice quality of the sender can be synthesized and read out.

[Brief description of the drawings]

【図１】本発明の一実施形態に係る電子メール読み上げ
装置のブロック図である。FIG. 1 is a block diagram of an electronic mail reading device according to an embodiment of the present invention.

【図２】声質データベース等の作成のための処理内容を
示すフローチャートである。FIG. 2 is a flowchart showing processing contents for creating a voice quality database and the like.

【図３】実際に再現される声質および再現すべき真の声
質のスペクトログラムを比較した図である。FIG. 3 is a diagram comparing spectrograms of actually reproduced voice quality and true voice quality to be reproduced.

【図４】実際に再現される声質および再現すべき真の声
質のスペクトログラムを比較した図である。FIG. 4 is a diagram comparing spectrograms of actually reproduced voice quality and a true voice quality to be reproduced.

【図５】電子メールの読み上げのための声質の作成のた
めの処理内容を示すフローチャートである。FIG. 5 is a flowchart illustrating processing for creating voice quality for reading out an e-mail.

[Explanation of symbols]

１操作部３通信部５読取部５ａ記録媒体７情報処理部７ａ中央制御部７ｂメール送受信部７ｃ情報取込部７ｄメール管理部７ｅ声質情報管理部７ｆ表示処理部７ｇ音声合成部９記憶部９ａ差出人データベース９ｂ声質データベース９ｃメールデータベース１０表示部１３スピーカ Reference Signs List 1 operation unit 3 communication unit 5 reading unit 5a recording medium 7 information processing unit 7a central control unit 7b mail transmission / reception unit 7c information acquisition unit 7d mail management unit 7e voice quality information management unit 7f display processing unit 7g voice synthesis unit 9 storage unit 9a Sender database 9b Voice quality database 9c Email database 10 Display unit 13 Speaker

───────────────────────────────────────────────────── フロントページの続き (72)発明者西谷克巳愛知県名古屋市南区菊住１丁目７番10号株式会社オートネットワーク技術研究所内Ｆターム(参考） 5D045 AA07 AB01 AB26 5K030 GA18 HA06 HB01 JT01 KA07 KA20 LB16 ────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Katsumi Nishitani 1-7-10 Kikuzumi, Minami-ku, Nagoya-shi, Aichi F-term in Auto Network Engineering Laboratory Co., Ltd. 5D045 AA07 AB01 AB26 5K030 GA18 HA06 HB01 JT01 KA07 KA20 LB16

Claims

[Claims]

1. An electronic mail reading device for reading an electronic mail by voice and outputting the read electronic mail, wherein: a speaker; a first storage unit for storing a sender database in which the sender of the electronic mail is registered; A second storage means for storing a voice quality database in which voice quality information associated with each of the senders included in the e-mail is analyzed, and the header information of the given e-mail is analyzed, and the e-mail is stored in the sender database. A sender discriminating unit for discriminating which of the senders is included, and the sender discriminated by the sender discriminating unit of the voice quality information included in the voice quality database included in the second storage unit. Based on the voice quality information corresponding to
A voice synthesizing means for synthesizing a voice for reading out the given e-mail with a voice quality corresponding to the voice quality information, and outputting the voice via the speaker.

2. The electronic mail reading device according to claim 1, wherein the first information capturing unit captures voice information and sender information indicating the sender corresponding to the voice information, and the first information capturing unit captures the voice information. Based on the voice information, extract voice quality features of the voice information,
A feature extraction unit that creates the voice quality information indicating the feature of the voice quality, the voice quality information created by the feature extraction unit, and the voice information specified by the sender information and used to create the voice quality information. The e-mail reading apparatus according to claim 1, further comprising: first database creating means for registering the corresponding senders in the voice quality database and the sender database in association with each other.

3. The apparatus according to claim 2, wherein the first information capturing unit captures the audio information from a portable recording medium or via a communication line.
The e-mail reading device according to the item.

4. The method according to claim 1, wherein, when the voice quality information is created, the existing voice quality information corresponding to the sender corresponding to the voice information captured by the first information capturing means is generated. If it already exists in the voice quality database, based on the voice quality characteristics indicated by the existing voice quality information and the characteristics newly extracted from the voice information captured by the first information capturing unit, 3. A new voice quality information which further reflects voice characteristics corresponding to the sender.
Or the electronic mail reading device according to 3.

5. The electronic mail reading device according to claim 2, wherein the voice quality information and second information capturing means for capturing sender information indicating the sender corresponding to the voice quality information, and wherein the second information capturing means captures the voice information. The voice quality information, and a second database creation unit that is specified by the sender information and registers the sender corresponding to the voice quality information in the voice quality database and the sender database in association with each other. The electronic mail reading device according to any one of claims 1 to 4, wherein:

6. The apparatus according to claim 5, wherein the second information capturing means captures the voice quality information from a portable recording medium or via a communication line.
An e-mail reading device according to the item.

7. The second database creating means, when registering the fetched voice quality information in the voice quality database, the existing voice quality information corresponding to the sender corresponding to the voice quality information already has the voice quality information. If it exists in the database, it corresponds to the sender based on the voice quality characteristics indicated by the existing voice quality information and the voice quality characteristics indicated by the voice quality information newly captured by the second information capturing means. 7. The electronic mail reading device according to claim 5, further comprising a function of newly creating voice quality information that further reflects characteristics of the voice quality to be performed and registering the voice quality information in the voice quality database.