JPH077335B2

JPH077335B2 - Conversational text-to-speech device

Info

Publication number: JPH077335B2
Application number: JP61304397A
Authority: JP
Inventors: 修新家
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1986-12-20
Filing date: 1986-12-20
Publication date: 1995-01-30
Anticipated expiration: 2010-01-30
Also published as: JPS63157226A

Description

[Detailed description of the invention]

[Overview]

The invention enables a machine to read aloud the lines of, for example, a script in which a plurality of characters appear. To this end, it provides a character discrimination unit that discriminates the characters written in the script, a character information table in which the vocal characteristics of each character are registered, and a plurality of phoneme files. When the script indicates that character A reads the sentence "XXX" aloud, the vocal characteristics of character A are retrieved from the character information table, a phoneme file is selected according to those characteristics, the phoneme parameters corresponding to "XXX" are extracted from the selected phoneme file, and the sentence "XXX" is output as speech in accordance with character A's other characteristics (volume, range, utterance speed, accent, etc.).
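As a rough illustration of how a script could be split into speaker and line pairs before any synthesis takes place, the following Python sketch may help. It assumes a notation in which a character name is immediately followed by the line in Japanese corner brackets (for example 甲「…」). The function name `parse_script` and the regular expression are hypothetical and are not taken from the patent.

```python
import re

# Assumed notation: a character name directly followed by a line enclosed in
# corner brackets. Names may not contain brackets, whitespace, or the
# punctuation marks "、" and "。".
LINE_PATTERN = re.compile(r"([^「」\s、。]+)「([^」]*)」")


def parse_script(text):
    """Return a list of (character_name, line) pairs in order of appearance."""
    return [(m.group(1), m.group(2)) for m in LINE_PATTERN.finditer(text)]


if __name__ == "__main__":
    for name, line in parse_script("甲「こんにちは」、乙「さようなら」"):
        print(name, line)
```

Because the pairs are returned in the order they occur in the text, the reading order follows directly from the script itself.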

[Industrial application field]

The present invention relates to a conversational text-to-speech device that automatically reads aloud, for example, the lines in a script in which a plurality of characters appear. Fields of use for the device include theater and film, where each actor reads a scenario aloud while looking at text printed in advance in Japanese with the characters' names, and installation in facilities such as libraries for blind and low-vision users, where the device reads novels aloud.

[Conventional technology]

FIG. 4 shows the configuration of a conventional Japanese reading-aloud device. In the figure, 1 denotes a Japanese sentence analysis unit, 2 a Japanese dictionary, 3 a voice parameter setting unit, 4 a voice synthesis unit, 7 a phoneme file, 8 a speaker, and 9 a manual operation panel.

The Japanese sentence analysis unit 1 divides a Japanese sentence into a sequence of words by referring to the Japanese dictionary 2, divides the word sequence into utterance units of an appropriate size, and sends the readings of the utterance units in order to the voice parameter setting unit 3. Words, their readings, accent information, and the like are registered in the Japanese dictionary 2. Although only one phoneme file 7 is shown, in practice there are several, such as a male phoneme file and a female phoneme file. The manual operation panel 9 is used to specify which phoneme file 7 to use and to set the volume, tone color, range, utterance speed, and so on. The voice parameter setting unit 3 retrieves the phoneme parameters corresponding to the reading of each utterance unit from the phoneme file 7 specified on the manual operation panel 9, sets the range, tone color, utterance speed, volume, and so on in accordance with the settings from the manual operation panel 9, and sends these to the voice synthesis unit 4. The voice synthesis unit 4 performs voice synthesis based on the data from the voice parameter setting unit 3. The electric signal output from the voice synthesis unit 4 is converted into sound by the speaker 8.
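The dictionary-based analysis step can be pictured with a small Python sketch. The patent does not say how the sentence is divided into words, so greedy longest-match segmentation is used here purely as an assumption. The dictionary entries, the accent values, and the names `DICTIONARY`, `segment`, and `readings` are all illustrative.

```python
# Toy dictionary: surface form -> (reading, accent position).
# Every entry is made up for illustration; the patent lists no dictionary data.
DICTIONARY = {
    "今日": ("キョウ", 1),
    "は": ("ワ", 0),
    "晴れ": ("ハレ", 2),
    "です": ("デス", 1),
}


def segment(sentence, dictionary=DICTIONARY):
    """Split a sentence into words by greedy longest match (an assumed method)."""
    max_len = max(len(word) for word in dictionary)
    words = []
    i = 0
    while i < len(sentence):
        # Try the longest candidate first, then progressively shorter ones.
        for length in range(min(max_len, len(sentence) - i), 0, -1):
            candidate = sentence[i:i + length]
            if candidate in dictionary:
                words.append(candidate)
                i += length
                break
        else:
            # Unknown character: pass it through as a one-character word.
            words.append(sentence[i])
            i += 1
    return words


def readings(words, dictionary=DICTIONARY):
    """Look up (reading, accent) per word; unknown words read as themselves."""
    return [dictionary.get(word, (word, 0)) for word in words]
```

For example, `readings(segment("今日は晴れです"))` yields one reading and accent pair per word, which is the kind of data that would then be handed on to the voice parameter setting stage.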

[Problems to be solved]

Conventional Japanese reading-aloud devices read Japanese text according to the settings on the manual operation panel, or with a fixed tone color, range, and utterance speed. Recently, however, the applications for reading Japanese text aloud have been expanding, and the conventional technology cannot handle fields that require text to be read aloud in the voices of several speakers.

The present invention was created in view of this point. Its object is to provide a conversational text-to-speech device in which the vocal characteristics of the characters are registered in advance as data, so that text spoken by a plurality of characters can be read aloud in distinguishable voices.

[Means for solving problems]

FIG. 1 is a diagram showing the principle of the present invention. In the figure, 1 denotes a Japanese sentence analysis unit, 2 a Japanese dictionary, 3 a voice parameter setting unit, 4 a voice synthesis unit, 5 a character discrimination unit, 6 a character information table, 7 a phoneme file, and 8 a speaker.

In general, a script contains a plurality of characters, written in the form A "XXX", B "XXX". The Japanese sentence analysis unit 1 analyzes the Japanese text while referring to the Japanese dictionary 2 and divides it into a sequence of words. When it detects a character name, it passes the name to the character discrimination unit 5. For the text to be read aloud inside the quotation brackets, it divides the sequence of word readings into utterance units and passes the readings of the utterance units in order to the voice parameter setting unit 3. The accent information of each word is passed along at the same time.

As shown in FIG. 2, character names such as A, B, ... and utterance feature information that identifies each character's vocal characteristics are registered in advance in the character information table 6. Utterance feature information means age, sex, the character's place of origin in the role, phoneme file number, and the like. The contents of the character information table 6 can be rewritten. When the character discrimination unit 5 receives a character name (for example, A) from the Japanese sentence analysis unit 1, it reads the contents of the character information table 6 and checks whether the character name A exists in the table. If it does, the character discrimination unit 5 notifies the character information table 6 that the character is A, and the utterance feature information of character A is read from the character information table 6.

Although only one phoneme file 7 is shown in FIG. 1, there are several phoneme files 7, such as a male phoneme file and a female phoneme file. As shown in FIG. 3, each phoneme file 7 stores phonemes and their corresponding parameters. A phoneme file 7 is selected according to the phoneme file number read from the character information table 6. The age information and place-of-origin information in the utterance feature information that was read are passed to the voice parameter setting unit 3. The voice parameter setting unit 3 retrieves the parameters corresponding to the readings passed from the Japanese sentence analysis unit 1 from the selected phoneme file 7, determines the volume, range, utterance speed, and so on according to the age and place-of-origin information it received, and also corrects the accent. The voice parameter setting unit 3 then passes the phoneme parameters, volume information, range information, utterance speed information, accent information, and so on to the voice synthesis unit 4.

The voice synthesis unit 4 synthesizes speech based on these data and outputs a synthesized voice signal. The voice synthesis unit 4 is, for example, of the PARCOR type. The electrical synthesized voice signal output from the voice synthesis unit 4 is converted into sound by the speaker 8. Note that the utterance order is determined automatically by analyzing which character each "XXX" in the input Japanese text belongs to.
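The lookup in the character information table and the subsequent parameter setting could be sketched in Python as follows. The table fields mirror the utterance feature information named in the text (age, sex, place of origin, phoneme file number), but the concrete values, the phoneme parameter numbers, and the rules that map age and place of origin to speed, volume, and accent style are hypothetical. The patent does not specify them.

```python
from dataclasses import dataclass


@dataclass
class CharacterInfo:
    age: int
    sex: str
    birthplace: str
    phoneme_file_no: int


# Character information table: all entries are illustrative.
CHARACTER_TABLE = {
    "甲": CharacterInfo(age=70, sex="male", birthplace="Osaka", phoneme_file_no=0),
    "乙": CharacterInfo(age=8, sex="female", birthplace="Tokyo", phoneme_file_no=1),
}

# Phoneme files: phoneme -> parameter vector (placeholder numbers).
PHONEME_FILES = [
    {"ア": [0.1, 0.2], "イ": [0.3, 0.4]},  # file 0, e.g. a male voice
    {"ア": [0.5, 0.6], "イ": [0.7, 0.8]},  # file 1, e.g. a female voice
]


def discriminate(name, table=CHARACTER_TABLE):
    """Return the registered info for a character name, or None if absent."""
    return table.get(name)


def set_voice_parameters(info, reading, phoneme_files=PHONEME_FILES):
    """Select the phoneme file and derive prosody settings for one reading."""
    phonemes = phoneme_files[info.phoneme_file_no]
    params = [phonemes[kana] for kana in reading]
    # Hypothetical rules: older speakers slower and quieter, children faster.
    if info.age >= 65:
        speed, volume = 0.8, 0.9
    elif info.age <= 12:
        speed, volume = 1.2, 1.0
    else:
        speed, volume = 1.0, 1.0
    # Hypothetical accent rule keyed on place of origin.
    accent_style = "kansai" if info.birthplace == "Osaka" else "standard"
    return {
        "phoneme_params": params,
        "speed": speed,
        "volume": volume,
        "accent_style": accent_style,
    }
```

A caller would first check that `discriminate(name)` is not `None`, then pass the result together with the reading of each utterance unit to `set_voice_parameters`. The returned dictionary stands in for the data handed to the synthesis stage.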

Changing the overall structure of the text requires editing functions for moving, inserting, changing, and replacing the Japanese text that is input to the reading device. One conceivable configuration for such a system is to connect an ordinary word processor to the text-to-speech device.

[Effect of the invention]

No reading device capable of reading aloud conversational text involving multiple speakers has existed until now. By providing a character discrimination unit, a character information table, and a plurality of phoneme files as in the present invention, such conversational text can be read aloud, enabling, for example, automatic reading of scenarios and automatic reading of novels in voices matched to their characters.

[Brief description of drawings]

FIG. 1 is a diagram showing the principle of the present invention, FIG. 2 is a diagram showing a configuration example of the character information table, FIG. 3 is a diagram showing a configuration example of a phoneme file, and FIG. 4 is a diagram showing the configuration of a conventional example.

1 ... Japanese sentence analysis unit, 2 ... Japanese dictionary, 3 ... Voice parameter setting unit, 4 ... Voice synthesis unit, 5 ... Character discrimination unit, 6 ... Character information table, 7 ... Phoneme file, 8 ... Speaker, 9 ... Manual operation panel.

Claims

[Claims]

1. A conversational text-to-speech device comprising: a voice synthesis unit (4); a plurality of phoneme files (7); a Japanese dictionary (2); a Japanese sentence analysis unit (1); a character discrimination unit (5) that discriminates the character who is the speaker of the Japanese sentence to be read aloud; a character information table (6) in which utterance feature information, such as the character's sex, age, and place of origin, that affects volume, tone color, range, utterance speed, and accent is registered; and a voice parameter setting unit (3) that determines the volume, range, accent, and the like based on the utterance feature information in the character information table (6) corresponding to the character discriminated by the character discrimination unit (5), selects a phoneme file (7), extracts from the selected phoneme file (7) the phoneme parameters corresponding to the reading sent from the Japanese sentence analysis unit (1), and sends the phoneme parameters, volume, range, and accent to the voice synthesis unit (4).