JP2003173197A

JP2003173197A - Text voice synthesizing apparatus

Info

Publication number: JP2003173197A
Application number: JP2001371335A
Authority: JP
Inventors: Hiroshi Sasaki; 佐々木　　寛
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2001-12-05
Filing date: 2001-12-05
Publication date: 2003-06-20

Abstract

<P>PROBLEM TO BE SOLVED: To provide a text voice synthesizing apparatus which reads a word just as desired by a user without caring about the operation mode of a system and without designating the format for agates (KANA, Japanese syllabary) alongside Chinese characters every time to the same notation word. <P>SOLUTION: A text voice conversion part 142 is provided with a text analyzing part 11 which analyzes words in an inputted text by using a main dictionary 144 and a user dictionary 143 to convert them into an intermediate language, a voice synthesizing part 12 for synthesizing a voice waveform from the intermediate language, an agate format processing part 13 which judges whether agate format data in which the words and their reading are written side by side is included or not in the inputted text associating with the text analyzing part and which judges whether a format as an agate format is satisfied or not, and a user word adding part 14 which receives the agate format data recognized to be the agate format from the agate format processing part, adds word information required for user dictionary registration and adds it to the user dictionary. <P>COPYRIGHT: (C)2003,JPO

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、テキストデータを
合成音声で読み上げるテキスト音声合成装置に関し、特
に、ユーザ辞書の自動登録機能を備えたテキスト音声合
成装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a text-to-speech synthesizer for reading text data as synthesized speech, and more particularly to a text-to-speech synthesizer having a user dictionary automatic registration function.

【０００２】[0002]

【従来の技術】近年、インターネットの急速な利用者拡
大に伴い、それにアクセスするためのパソコン、あるい
は、携帯電話、ＰＤＡ、ページャ（ポケットベル（登録
商標））などの携帯型情報端末が、企業、家庭、あるい
は学校などに急激な勢いで普及している。2. Description of the Related Art In recent years, with the rapid expansion of users of the Internet, a personal computer for accessing it, or a portable information terminal such as a mobile phone, a PDA, a pager (pager (registered trademark)), has been It is rapidly gaining popularity in homes and schools.

【０００３】その理由の一つとして、電子メールやイン
ターネットニュース等のメッセージ交換システムがあ
る。さらに、最近では、これらのメッセージ（例えば、
電子メール）を音声に変換して電話に転送したり、ある
いは端末上で音声に変換して読み上げたり、あるいは電
子メールの着信の通知を宛先利用者が所有するページャ
に出力したり、ファクシミリからのイメージ情報をマル
チメディア電子メールとして送信するような、種々のメ
ッセージシステムや情報端末を統合した新たな形のメッ
セージ交換システムが登場し始めており、これら電子メ
ール等のメッセージと音声合成とを中心としたサービス
が、さらなる利用者拡大に拍車をかけている。One of the reasons for this is a message exchange system for electronic mail, Internet news and the like. In addition, these messages (eg,
(E-mail) is converted to voice and transferred to the telephone, or converted to voice and read aloud on the terminal, or notification of incoming e-mail is output to the pager owned by the destination user, or sent from a facsimile. A new type of message exchange system that integrates various message systems and information terminals, such as sending image information as multimedia e-mail, has begun to appear, centering on these e-mail messages and voice synthesis. The service is spurring further user expansion.

【０００４】このメッセージ交換システムの中で、電子
メールやネットワークニュースを読み上げるための必要
な機能の一つに「テキスト解析処理」がある。テキスト
解析処理は、予め用意した文法情報と単語情報を利用し
て、入力された日本語文を解析し、音声合成パラメータ
を出力するための処理である。In this message exchange system, "text analysis processing" is one of the necessary functions for reading aloud electronic mail and network news. The text analysis process is a process for analyzing an input Japanese sentence using grammatical information and word information prepared in advance and outputting a voice synthesis parameter.

【０００５】しかし、これら電子メールやネットワーク
ニュースには、新語や造語が多く使用されているため、
ユーザの意図した通りの読み上げをしない場合がある。
そこで、この問題を解決するため、従来のテキスト音声
合成装置においては、ユーザが自由にカスタマイズでき
るユーザ辞書を、システム辞書とは別に設けることで、
上記問題を解決していた。However, since many new words and coined words are used in these electronic mails and network news,
In some cases, the text is not read exactly as the user intended.
Therefore, in order to solve this problem, in the conventional text-to-speech synthesizer, a user dictionary that the user can freely customize is provided separately from the system dictionary.
The above problem was solved.

【０００６】この方法によれば、ユーザの意図した通り
に読み上げさせるために、ユーザ辞書を前もって編集し
ておき、テキスト音声合成装置に、編集したユーザ辞書
を登録する手段を新たに設けることで、テキスト音声合
成装置が入力テキストデータを解析する際にメイン辞書
だけでなく、ユーザ辞書も利用することが可能となり、
上記問題を解決している。According to this method, the user dictionary is edited in advance so that the user can read it as intended, and a means for registering the edited user dictionary in the text-to-speech synthesizer is newly provided. When the text-to-speech synthesizer analyzes the input text data, not only the main dictionary but also the user dictionary can be used.
It solves the above problem.

【０００７】しかしながら、この従来技術には、以下の
ような問題点（１）がある。問題点（１）：ユーザ辞書を編集するために、ユーザ辞
書編集装置、あるいは、ユーザ辞書編集ソフトウェア等
のテキスト音声合成装置以外の手段が必要である。However, this conventional technique has the following problem (1). Problem (1): In order to edit the user dictionary, means other than the text dictionary synthesizer such as the user dictionary editing device or the user dictionary editing software is required.

【０００８】この問題に対して、テキスト形式の単語デ
ータを装置に直接入力する方法が文献：（テキスト音声
変換装置：発明協会公開技報（公技番号９４−１７８９
８））に示されている。以下、この技術について、簡単
に説明する。To solve this problem, a method for directly inputting word data in a text format into a device has been disclosed in a document: (Text-to-speech converter: JIII Journal of Technical Disclosure (Kogi No. 94-1789).
8)). Hereinafter, this technique will be briefly described.

【０００９】この文献に記載されたテキスト音声変換装
置においては、「テキスト音声変換モード」と「ユーザ
辞書登録モード」の２つの動作モードがあり、モード切
替えコマンドをキーボードから入力することによりモー
ド切替えが行なわれる。The text-to-speech conversion device described in this document has two operation modes, a "text-speech conversion mode" and a "user dictionary registration mode", and mode switching can be performed by inputting a mode switching command from the keyboard. Done.

【００１０】図１４は、メイン辞書とユーザ辞書を備え
た一般的なテキスト音声合成装置の全体構成を示したも
のであり、「テキスト音声変換モード」においては、テ
キストデータが「読み上げデータ」として扱われ、制御
部１４１は、入力されたテキストデータをテキスト音声
変換部１４２へ送る。テキスト音声変換部１４２は、各
単語の読みとアクセントが登録されたメイン辞書１４４
とユーザ辞書１４３を用いてテキストデータを解析し、
最終的に音声波形データへ変換する。生成された音声波
形データはスピーカから出力される。このようにしてテ
キストデータ１４６の読み上げが行われる。FIG. 14 shows the overall structure of a general text-to-speech synthesizer having a main dictionary and a user dictionary. In the "text-speech conversion mode", text data is treated as "readout data". The control unit 141 sends the input text data to the text-to-speech conversion unit 142. The text-to-speech conversion unit 142 uses the main dictionary 144 in which the reading and accent of each word are registered.
And analyze the text data using the user dictionary 143,
Finally converted to voice waveform data. The generated voice waveform data is output from the speaker. In this way, the text data 146 is read aloud.

【００１１】「ユーザ辞書登録モード」においては、テ
キストデータが「登録単語データ」として扱われ、制御
部１４１は、入力されたテキストデータをデータ変換部
１４９へ送る。データ変換部１４９は、テキストデータ
が所定の書式に合っているかどうかを調べる。具体的に
は、単語の表記・読み・アクセントが正しく記述されて
いるかどうかを調べ、誤りがあれば、制御部１４１はデ
ータ変換部１４９から誤りの箇所・内容に関する情報を
受取り、それを表示部１４７に表示してユーザに通知す
る。誤りがなければ、データ変換部１４９はテキストデ
ータを所定のバイナリ形式に変換し、それをユーザ辞書
１４３に格納する。このようにして単語登録が行なわれ
る。In the "user dictionary registration mode", the text data is treated as "registered word data", and the control unit 141 sends the input text data to the data conversion unit 149. The data conversion unit 149 checks whether the text data conforms to a predetermined format. Specifically, it is checked whether or not the word notation / reading / accent is correctly described, and if there is an error, the control unit 141 receives information about the error location / content from the data conversion unit 149 and displays it. It is displayed on 147 to notify the user. If there is no error, the data conversion unit 149 converts the text data into a predetermined binary format and stores it in the user dictionary 143. In this way, word registration is performed.

【００１２】以上のように、この従来技術によれば、ユ
ーザ辞書単語登録の際、ユーザがテキスト形式の単語デ
ータを装置に直接入力できるので、人間にとって分かり
難いバイナリ形式のデータをユーザが作成し、入力する
ということがなくなる。また、テキスト形式のデータを
予めバイナリ形式に変換してから装置に入力するという
手間を省くことが出来る。更に、テキスト形式からバイ
ナリ形式へ変換するための専用プログラム（ツール）を
装置とは別に用意する必要もなくなる。よって、前記問
題点（１）を解決することができる。As described above, according to this conventional technique, since the user can directly input the word data in the text format to the device when registering the words in the user dictionary, the user can create the binary format data which is difficult for humans to understand. , No more typing. Further, it is possible to save the trouble of converting the text format data into the binary format in advance and then inputting the data into the apparatus. Further, it is not necessary to prepare a dedicated program (tool) for converting the text format into the binary format separately from the device. Therefore, the problem (1) can be solved.

【００１３】しかし、この従来技術には、問題点（２）
がある。問題点（２）：ユーザは、テキスト音声変換装置の現在
の動作モードが、「テキスト音声変換モード」と「ユー
ザ辞書登録モード」のどちらであるかを意識して、モー
ドを切替えた上で使用しなければならず、操作上の負担
がかかる。However, this conventional technique has a problem (2).
There is. Problem (2): The user switches the mode in consideration of whether the current operation mode of the text-to-speech converter is the “text-speech conversion mode” or the “user dictionary registration mode”. Must be done, which is a burden on operation.

【００１４】この問題に対して、テキスト音声変換モー
ドで、テキスト中に単語の読みを直接指定することで、
指定の通りに読み上げさせる方法が提案されている。以
下、この従来技術について簡単に説明する。To solve this problem, in the text-to-speech conversion mode, by directly specifying the reading of a word in the text,
A method has been proposed in which the text is read as specified. The conventional technique will be briefly described below.

【００１５】装置の全体構成は、図１４に示した全体構
成と同様である。この場合、テキスト音声変換部１４２
は、外部データ１４６を受け取り、漢字かな混じり文か
ら韻律付発音記号（以下、中間言語と表記）に変換する
ためのテキスト解析部（図示せず）と中間言語から音声
波形を生成するための音声合成部（図示せず）と、テキ
スト解析部からテキストデータを受け取り、このテキス
トデータがルビフォーマットであるか否か判定し、ルビ
フォーマット（図７参照）の場合は単語の表記と読みを
取得し、テキスト解析部へ出力するためのルビフォーマ
ット処理部（図示せず）から構成されている。The overall configuration of the device is similar to that shown in FIG. In this case, the text-to-speech conversion unit 142
Is a text analysis unit (not shown) for receiving the external data 146 and converting a kanji / kana mixed sentence into a prosodic phonetic symbol (hereinafter referred to as an intermediate language) and a voice for generating a voice waveform from the intermediate language. The text data is received from the synthesis unit (not shown) and the text analysis unit, and it is determined whether or not the text data is in the ruby format. In the case of the ruby format (see FIG. 7), the word notation and the reading are acquired. , And a ruby format processing unit (not shown) for outputting to the text analysis unit.

【００１６】ここでルビフォーマットについて、図７を
用いて簡単に説明する。ルビフォーマットとは、表記デ
ータ７１と読みデータ７２から構成されており、テキス
トデータ中に挿入することで、表記データ７１を読みデ
ータ７２の通りに読み上げるための指定方法である。The ruby format will be briefly described with reference to FIG. The ruby format is composed of notation data 71 and reading data 72, and is a designation method for inserting the notation data 71 into the text data and reading the notation data 71 as the reading data 72.

【００１７】次に全体の処理について簡単に説明する。
ユーザは、「テキスト音声変換モード」において、単語
の読みを直接指定したい箇所に、読みを設定して、テキ
スト読上げモードで制御部１４１へテキストデータ１４
６を送信する。制御部１４１は、入力されたテキストデ
ータ１４６をテキスト音声変換部１４２へ送る。Next, the overall processing will be briefly described.
In the "text-to-speech conversion mode", the user sets the reading at a position where he / she wants to directly specify the reading of the word, and sends the text data 14 to the control unit 141 in the text reading mode.
Send 6. The control unit 141 sends the input text data 146 to the text-to-speech conversion unit 142.

【００１８】テキスト音声変換部１４２内のテキスト解
析部は、上記テキストデータを受け取り、ユーザ辞書１
４３、メイン辞書１４４、およびルビフォーマット処理
部を用いて、単語切り出し処理を行い、切出された複数
の単語をその単語に付属している情報を元に、フレーズ
単位にまとめて中間言語を生成する。音声合成部は生成
した中間言語からメイン辞書１４４を用いて音声波形を
生成し、スピーカ１４５へ出力する。The text analysis section in the text-to-speech conversion section 142 receives the above-mentioned text data, and the user dictionary 1
43, the main dictionary 144, and the ruby format processing unit are used to perform word cutout processing, and a plurality of cutout words are collected in phrase units based on information attached to the words to generate an intermediate language. To do. The speech synthesizer generates a speech waveform from the generated intermediate language using the main dictionary 144 and outputs it to the speaker 145.

【００１９】以上のように、この従来技術によると、テ
キスト音声変換部にルビフォーマットを処理する手段を
新たに設けることで、ルビフォーマットを用いて記載さ
れたテキストを使用することが可能となる。これによ
り、テキスト音声変換装置以外の特別な手段を用意する
ことなく、その場で読み上げ方を自由にカスタマイズす
ることができる。さらに、現在の動作モードを意識する
必要はなくなり、前記問題点（２）を解決できる。As described above, according to this conventional technique, by additionally providing a means for processing the ruby format in the text-to-speech conversion unit, it is possible to use the text described by using the ruby format. As a result, the way of reading can be freely customized on the spot without preparing any special means other than the text-to-speech converter. Furthermore, it is not necessary to be aware of the current operation mode, and the problem (2) can be solved.

【００２０】しかしながら、この従来技術には、以下に
示す問題点（３）がある。問題点（３）：指定した単語に対してのみ、読みを指定
する技術であるため、同一表記の単語が、テキスト中に
複数回出現する場合は、出現する度に、ルビフォーマッ
トにより指定する必要がある。However, this conventional technique has the following problem (3). Problem (3): This is a technique to specify the reading only for the specified word, so if the word with the same notation appears multiple times in the text, it must be specified in the ruby format each time it appears. There is.

【００２１】[0021]

【発明が解決しようとする課題】以上説明したように、
ユーザ辞書を用いることで、ネットニュースや電子メー
ル等のメッセージデータに頻出する新語や造語を、ユー
ザの意図した通りに読み上げることが可能になるもの
の、前述の問題点（２）が生じ、これを回避するために
ルビフォーマットを使用すると、問題点（３）が生じ
る。現在のところ、この両者を同時に解決する技術は、
まだ提案されていない。As described above,
By using the user dictionary, new words and coined words frequently appearing in message data such as net news and electronic mail can be read aloud as intended by the user, but the above problem (2) occurs and Using the ruby format to avoid causes problem (3). At present, the technology to solve both of these at the same time is
Not yet proposed.

【００２２】この発明は、ルビフォーマット指定された
単語を自動的にユーザ辞書に追加する手段を設けること
により、システムの動作モードを意識することなく、ま
た、同一表記単語に対してルビ指定を毎回行うことな
く、ユーザの意図した通りに単語を読み上げるテキスト
音声合成装置を提供することを目的とする。According to the present invention, by providing a means for automatically adding a ruby-format-specified word to the user dictionary, the ruby specification can be made every time for the same notation word without being aware of the operation mode of the system. An object of the present invention is to provide a text-to-speech synthesizer that reads out a word as intended by a user without performing the operation.

【００２３】[0023]

【課題を解決するための手段】そのために、この発明の
テキスト音声合成装置においては、入力テキストを音声
波形に変換するためのテキスト音声変換部と、入力テキ
ストの単語を解析するために装置に予め備えられたメイ
ン辞書とユーザによる単語登録が可能なユーザ辞書とを
備えたテキスト音声合成装置において、テキスト音声変
換部は、メイン辞書とユーザ辞書を用いて入力テキスト
における単語を解析して中間言語に変換するテキスト解
析部と、中間言語から音声波形を合成する音声合成部
と、テキスト解析部と連携して入力テキストに単語とそ
の読みを並置して記載したルビフォーマットデータが含
まれるか否かの判定とルビフォーマットとしての書式を
満足しているか否かの判定を行うルビフォーマット処理
部と、ルビフォーマットであると認識されたルビフォー
マットデータを前記ルビフォーマット処理部から受取り
ユーザ辞書登録に必要な単語情報を付加して前記ユーザ
辞書に追加するユーザ単語追加部とを備えている。Therefore, in the text-to-speech synthesizer of the present invention, a text-to-speech conversion unit for converting the input text into a speech waveform and a device for analyzing the words of the input text are provided in advance in the apparatus. In a text-to-speech synthesizer including a main dictionary provided and a user dictionary in which words can be registered by the user, the text-to-speech conversion unit analyzes the words in the input text using the main dictionary and the user dictionary and converts them into an intermediate language. Whether the text analysis unit for conversion, the voice synthesis unit for synthesizing a speech waveform from an intermediate language, and the text analysis unit cooperate with the input text to include ruby format data in which words and their readings are juxtaposed. A ruby format processing unit that makes a judgment and a judgment as to whether or not the ruby format is satisfied, and a ruby former The recognized ruby format data to be collected by by adding the word information necessary to the user dictionary registration receipt from the ruby format processing unit and a user word additional unit to be added to the user dictionary.

【００２４】[0024]

【発明の実施の形態】以下、本発明のテキスト音声合成
装置の実施の形態について、図面を参照しながら詳細に
説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of a text-to-speech synthesizer of the present invention will be described in detail below with reference to the drawings.

【００２５】第１の実施の形態［構成］図１は、図１４のテキスト音声合成装置におけ
るテキスト音声変換部１４２の第１の実施の形態として
の構成を示した図であり、このテキスト音声変換部１４
２は、メイン辞書１４４とユーザ辞書１４３を使用して
入力テキスト（図８参照）の単語を解析し中間言語に変
換するためのテキスト解析部１１と、テキスト解析部１
１から出力された中間言語から音声波形を生成するため
の音声合成部１２と、テキスト解析部と連携してルビフ
ォーマット（図７参照）をチェックするためのルビフォ
ーマット処理部１３と、ルビフォーマットであると認識
されたデータを、ルビフォーマット処理部１３から受取
り、文法情報、発音情報等のユーザ辞書登録に必要な単
語情報を付加してユーザ辞書１４３に追加するユーザ単
語追加部１４から構成される。First Embodiment [Structure] FIG. 1 is a diagram showing the structure of the text-to-speech converter 142 in the text-to-speech synthesizer of FIG. 14 as the first embodiment. Part 14
Reference numeral 2 denotes a text analysis unit 11 for analyzing a word of an input text (see FIG. 8) by using the main dictionary 144 and the user dictionary 143, and a text analysis unit 1.
1. A speech synthesis unit 12 for generating a speech waveform from the intermediate language output from 1, a ruby format processing unit 13 for checking a ruby format (see FIG. 7) in cooperation with a text analysis unit, and a ruby format. The user word adding unit 14 receives data recognized as being present from the ruby format processing unit 13, adds word information necessary for registering the user dictionary such as grammatical information and pronunciation information, and adds it to the user dictionary 143. .

【００２６】［動作］先ず、図１と図１４を参照して装
置全体の動作説明を行う。このテキスト音声合成装置
は、「テキスト音声変換モード」において、テキストデ
ータを「読み上げデータ」として扱う。制御部１４１
は、入力されたテキストデータをテキスト音声変換部１
４２へ送る。テキスト音声変換部１４２は、テキストデ
ータを解析し、中間言語に変換する。このとき、テキス
トデータに、ルビフォーマットデータが含まれていれ
ば、ルビフォーマット処理部１３とユーザ単語追加部１
４を使用して、ユーザ辞書１４３を更新する。生成され
た中間言語は、音声合成部１２に送られ、音声波形デー
タに変換されてスピーカ１４５から出力される。[Operation] First, the operation of the entire apparatus will be described with reference to FIGS. 1 and 14. This text-to-speech synthesizer handles text data as "readout data" in the "text-speech conversion mode". Control unit 141
Is a text-to-speech conversion unit 1 for input text data.
Send to 42. The text-to-speech conversion unit 142 analyzes the text data and converts it into an intermediate language. At this time, if the text data includes ruby format data, the ruby format processing unit 13 and the user word adding unit 1
4 is used to update the user dictionary 143. The generated intermediate language is sent to the voice synthesizer 12, converted into voice waveform data, and output from the speaker 145.

【００２７】次にテキスト解析部１１の動作について詳
細に説明する。テキスト解析部１１は、テキストデータ
を受け取ると、ユーザ辞書１４３、メイン辞書１４４、
およびルビフォーマット処理部１３を用いて、単語切り
出し処理を行い、切出された複数の単語のそれぞれに付
属している情報を元に、フレーズ単位にまとめて中間言
語を出力する（図１２参照）。この時、入力テキスト中
にルビフォーマット指定データが存在すると、該当箇所
に対しては、メイン辞書やユーザ辞書と照合せず、指定
された読みになるように単語切出し処理を行う。さら
に、このとき、該ルビ指定から表記と読みとアクセント
型とモーラ数と品詞を１組の単語データとして取得し、
ユーザ辞書へ追加しておく。Next, the operation of the text analysis unit 11 will be described in detail. When the text analysis unit 11 receives the text data, the user dictionary 143, the main dictionary 144,
Using the ruby format processing unit 13, word cutout processing is performed, and based on the information attached to each of the cutout words, the intermediate language is output in a phrase unit (see FIG. 12). . At this time, if the ruby format designation data is present in the input text, the word cutout process is performed on the corresponding portion so that the designated reading is made without collating with the main dictionary or the user dictionary. Further, at this time, the notation, the pronunciation, the accent type, the number of mora, and the part of speech are acquired from the ruby designation as a set of word data,
Add it to the user dictionary.

【００２８】なお、本実施の形態において、品詞データ
は、ルビフォーマットデータにない情報であるが、シス
テムが自動的に普通名詞として割当てるようにしてい
る。これによりルビフォーマットデータから抽出された
情報と追加情報が図１３に示すような単語データとして
ユーザ辞書に登録されるので、以後、同一表記の単語が
出現するとユーザ辞書に登録された読みで中間言語に変
換することが可能となる。In the present embodiment, the part-of-speech data is information not included in the ruby format data, but the system automatically assigns it as a common noun. As a result, the information extracted from the ruby format data and the additional information are registered in the user dictionary as word data as shown in FIG. 13. Therefore, when a word of the same notation appears thereafter, the reading in the user dictionary will be performed in the intermediate language. Can be converted to.

【００２９】次に、図２を参照してテキスト解析部１１
の単語切出し処理、特にルビフォーマットデータの処理
について、詳細に説明する。Next, referring to FIG. 2, the text analysis unit 11
The word cutout process, particularly the process of ruby format data, will be described in detail.

【００３０】テキストが入力されると、テキストの先頭
文字から始まる単語を切出す処理をする。この処理は再
帰的に実行される処理であり、切出された単語は、作業
バッファに格納され、その初期状態は空である。When the text is input, the word starting from the first character of the text is cut out. This process is a process executed recursively, and the extracted words are stored in the work buffer and the initial state is empty.

【００３１】（Ｓ１）先頭が文末であれば、単語切出し
成功として、Ｓ９に進む。文末でなければ、Ｓ２へ進
む。(S1) If the beginning is the end of a sentence, it is determined that the word extraction has succeeded, and the process proceeds to S9. If it is not the end of the sentence, proceed to S2.

【００３２】（Ｓ２）ルビフォーマット処理部１３は、
入力文先頭にルビフォーマットデータが存在するかどう
かを判定する。ルビフォーマット処理部は、所定のルビ
書式規則と照合して、該書式を満足するかどうかを判定
し、その結果をテキスト解析部１１に返送する。テキス
ト解析部は、ルビフォーマット処理部の判定結果がＹｅ
ｓ（書式と合致している）であれば、Ｓ１１へ処理を進
め、Ｎｏであれば、Ｓ３へ処理を進める。(S2) The ruby format processing unit 13
It is determined whether or not ruby format data exists at the beginning of the input sentence. The ruby format processing section collates with a predetermined ruby format rule to determine whether or not the format is satisfied, and returns the result to the text analysis section 11. In the text analysis unit, the determination result of the ruby format processing unit is Yes.
If s (matches the format), the process proceeds to S11, and if No, the process proceeds to S3.

【００３３】（Ｓ３）メイン辞書１４４、およびユーザ
辞書１４３を検索して、文先頭から始まる単語をすべて
取得し、これらを単語候補群とする。(S3) The main dictionary 144 and the user dictionary 143 are searched to acquire all the words starting from the beginning of the sentence, and these are used as a word candidate group.

【００３４】（Ｓ４）取得した単語候補群の単語候補数
が０ならば、単語切出しが失敗したとして単語切出しを
終了するためＳ１０へ進む。単語候補数が１個以上であ
れば、Ｓ５へ進む。(S4) If the number of word candidates in the acquired word candidate group is 0, it is determined that the word cutting has failed, and the process proceeds to S10 to end the word cutting. If the number of word candidates is one or more, the process proceeds to S5.

【００３５】（Ｓ５）取得した単語候補群の中から単語
候補を１つ選択する。(S5) One word candidate is selected from the acquired word candidate group.

【００３６】（Ｓ６）選択した単語候補と、作業バッフ
ァに格納されている一つ前の単語の文法的接続関係をチ
ェックする。チェックの結果、文法的接続関係が許容さ
れる関係にある場合は、当該単語候補を作業バッファに
格納してＳ７に進み、許されない組み合わせの場合には
Ｓ４に戻って他の単語候補を選択する処理を行う。(S6) Check the grammatical connection relation between the selected word candidate and the previous word stored in the work buffer. As a result of the check, if the grammatical connection relation is allowed, the word candidate is stored in the work buffer and the process proceeds to S7. If the combination is not allowed, the process returns to S4 to select another word candidate. Perform processing.

【００３７】（Ｓ７）当該単語候補（ルビフォーマット
データから抽出された単語ならば当該ルビフォーマット
による単語）の次の文字から始まる文を切出すため、再
帰的に当該単語切出し処理を行う。(S7) In order to cut out a sentence starting from the next character of the word candidate (if the word is extracted from the ruby format data, the word in the ruby format), the word cutting process is recursively performed.

【００３８】（Ｓ８）ステップＳ７の結果が成功の場合
はＳ９へ進み、失敗の場合は選択した単語候補を棄却し
て、Ｓ４へ戻り、次の単語候補について処理を行う。(S8) If the result of step S7 is successful, the process proceeds to S9. If the result is unsuccessful, the selected word candidate is discarded, and the process returns to S4 to process the next word candidate.

【００３９】（Ｓ９）単語の切出しが成功したとしてこ
の処理を終了する。(S9) This process is terminated on the assumption that the words have been cut out successfully.

【００４０】（Ｓ１０）単語の切出しが失敗したとして
この処理を終了する。(S10) This process is terminated assuming that the cutting of the word has failed.

【００４１】（Ｓ１１）ステップＳ２の判定結果、ルビ
フォーマットデータであると判定された場合の処理であ
る。ルビフォーマットデータから、表記、読み、アクセ
ント型、モーラ数、品詞等の単語情報を抽出して１組の
単語データとし、作業バッファへ追加してＳ１２へ進
む。ただし、単語情報にはルビフォーマットデータから
抽出したことを示すためのフラグをセットし、後段で必
要となる文法的接続チェックが必ず成功するようにして
いる。(S11) This process is performed when it is determined that the result is ruby format data as a result of the determination in step S2. Word information such as notation, reading, accent type, number of mora, and part-of-speech is extracted from the ruby format data as one set of word data, which is added to the work buffer and the process proceeds to S12. However, a flag is set in the word information to indicate that it has been extracted from the ruby format data, so that the grammatical connection check required in the latter stage will always succeed.

【００４２】（Ｓ１２）ルビフォーマットデータから抽
出した単語情報を、ユーザ辞書１４３へ追加し、Ｓ７へ
進む。なお、このＳ１２の処理が終了した直後から、ル
ビフォーマットデータによる登録単語が有効になる。(S12) Word information extracted from the ruby format data is added to the user dictionary 143, and the process proceeds to S7. Note that the registered word based on the ruby format data becomes valid immediately after the processing of S12 is completed.

【００４３】上記処理によれば、ルビフォーマットデー
タを含んだ入力テキストが単語に分割され、各単語の読
みとアクセント情報が取得され、最終的に中間言語に変
換でされる。また、ルビフォーマットデータを処理する
のと同時に、該ルビフォーマットで指定された単語をユ
ーザ辞書へ自動的に登録できるので、ユーザは装置の動
作モードを意識する必要なくユーザ単語登録を行うこと
が可能になる。According to the above processing, the input text including the ruby format data is divided into words, the reading and accent information of each word is acquired, and finally converted into an intermediate language. Also, at the same time as processing the ruby format data, the word specified in the ruby format can be automatically registered in the user dictionary, so that the user can register the user word without having to be aware of the operation mode of the device. become.

【００４４】第２の実施の形態［構成］図３に示すように、第２の実施の形態における
テキスト音声変換部１４２は、第１の実施の形態におけ
るテキスト音声変換部１４２とほぼ同様であるがルビ選
別部２４が追加されている点で異なっている。Second Embodiment [Structure] As shown in FIG. 3, the text-to-speech conversion unit 142 in the second embodiment is almost the same as the text-speech conversion unit 142 in the first embodiment. Is different in that a ruby selection unit 24 is added.

【００４５】第２の実施の形態におけるテキスト音声変
換部１４２は、メイン辞書１４４とユーザ辞書１４３を
用いて入力テキストから単語を抽出し中間言語に変換す
るためのテキスト解析部２１と、テキスト解析部２１か
ら出力された中間言語を音声波形に変換するための音声
合成部２２と、テキスト解析部２１と連携して入力テキ
スト中にルビフォーマットデータが含まれるかどうか、
さらに、ルビフォーマットとしての書式を満足している
かをチェックするためのルビフォーマット処理部２３
と、ルビフォーマットであると認識されたルビフォーマ
ットデータを、ルビフォーマット処理部２３から受取っ
て文法情報、発音情報等のユーザ辞書登録に必要な単語
情報を付加して、ユーザ辞書１４３に追加するためのユ
ーザ単語追加部２５と、ルビフォーマットデータからユ
ーザ辞書追加指定情報を取得し、この情報に従って、当
該ルビフォーマットデータから作成した単語情報をユー
ザ辞書に追加するか否かを決定するルビ選別部２４を設
けている。The text-to-speech conversion unit 142 in the second embodiment uses the main dictionary 144 and the user dictionary 143 to extract a word from the input text and converts it into an intermediate language, and a text analysis unit 21. Whether the ruby format data is included in the input text in cooperation with the voice synthesis unit 22 for converting the intermediate language output from 21 into a voice waveform and the text analysis unit 21.
Further, the ruby format processing unit 23 for checking whether or not the format as the ruby format is satisfied.
And to add to the user dictionary 143 the ruby format data recognized as being in the ruby format, from the ruby format processing unit 23, added with word information necessary for user dictionary registration such as grammatical information and pronunciation information. Of the user word adding unit 25 and the user dictionary adding designation information from the ruby format data, and according to this information, the ruby selecting unit 24 that determines whether to add the word information created from the ruby format data to the user dictionary. Is provided.

【００４６】［動作］まず、図３と図１４を参照して装
置全体の動作説明を行う。第１の実施の形態と同様に、
「テキスト音声変換モード」において、テキストデータ
を「読み上げデータ」として扱う。制御部１４１は、入
力されたテキストデータ（図１０参照）をテキスト音声
変換部１４２へ送る。テキスト音声変換部１４２は、テ
キストデータを解析し、中間言語に変換する。このと
き、テキストデータに、ルビフォーマットデータ（図９
参照）が含まれていれば、ルビフォーマット処理部２３
とルビ選別部２４とユーザ単語追加部２５を使用して、
ユーザ辞書１４３を更新する。なお、このとき、ルビ選
別部２４で登録指定のあったルビフォーマットデータだ
けをユーザ辞書に登録する。生成された中間言語は、音
声合成部２２に送られ、音声波形データに変換されてス
ピーカ１４５から出力される。[Operation] First, the operation of the entire apparatus will be described with reference to FIGS. 3 and 14. Similar to the first embodiment,
In the "text-to-speech conversion mode", text data is treated as "readout data". The control unit 141 sends the input text data (see FIG. 10) to the text-to-speech conversion unit 142. The text-to-speech conversion unit 142 analyzes the text data and converts it into an intermediate language. At this time, the ruby format data (see FIG. 9) is added to the text data.
Reference) is included, the ruby format processing unit 23
Using the ruby selection unit 24 and the user word addition unit 25,
The user dictionary 143 is updated. At this time, only the ruby format data designated for registration by the ruby selection unit 24 is registered in the user dictionary. The generated intermediate language is sent to the voice synthesizer 22, converted into voice waveform data, and output from the speaker 145.

【００４７】次に、テキスト解析部２１の動作を詳細に
説明する。テキスト解析部２１は、テキストデータ（図
１０参照）を受け取るとユーザ辞書１４３、メイン辞書
１４４、およびルビフォーマット処理部２３を用いて、
単語切り出し処理を行う。この時、ルビフォーマットデ
ータ（図９参照）が入力テキスト中に存在する場合、該
当箇所に対しては、メイン辞書やユーザ辞書と照合せ
ず、指定された読みになるように単語切出し処理を行
う。このとき、ルビフォーマットデータ内に、ユーザ辞
書登録指定（図９の９３）が１と設定されていれば、こ
のルビフォーマットデータから表記と読みとアクセント
型とモーラ数と品詞を取得し、１組の単語データとして
ユーザ辞書へ追加する。これによりユーザ辞書登録指定
のあったルビフォーマットデータについてだけ、以後ル
ビフォーマット指定による読みが有効になるので、同一
表記の単語が出現するとユーザ辞書に登録された読みで
中間言語に変換することが可能となる。Next, the operation of the text analysis unit 21 will be described in detail. Upon receiving the text data (see FIG. 10), the text analysis unit 21 uses the user dictionary 143, the main dictionary 144, and the ruby format processing unit 23 to
Performs word segmentation processing. At this time, if the ruby format data (see FIG. 9) is present in the input text, the word cut-out process is performed so that the corresponding reading is performed without matching with the main dictionary or the user dictionary. . At this time, if the user dictionary registration designation (93 in FIG. 9) is set to 1 in the ruby format data, the notation, the reading, the accent type, the number of mora, and the part of speech are acquired from the ruby format data, and one set is obtained. Add to the user dictionary as word data of. As a result, only for the ruby format data for which the user dictionary registration has been specified, the reading by specifying the ruby format will be enabled, so that if a word with the same notation appears, it can be converted into an intermediate language with the reading registered in the user dictionary. Becomes

【００４８】次に、図４を参照してテキスト解析部２１
の単語切出し処理、特にルビフォーマットデータの処理
について詳細に説明する。Next, referring to FIG. 4, the text analysis unit 21
The word cutout process, particularly the ruby format data process, will be described in detail.

【００４９】テキストデータが入力されると、テキスト
の先頭文字から始まる単語を切出す処理を開始する。こ
の処理は再帰的に実行される処理であり、切出された単
語は、作業バッファに格納され、その初期状態は空であ
る。When the text data is input, the process of cutting out the word starting from the first character of the text is started. This process is a process executed recursively, and the extracted words are stored in the work buffer and the initial state is empty.

【００５０】（Ｓ２１）先頭が文末であれば、単語切出
し成功として、Ｓ２９に進む。文末でなければＳ２２へ
進む。(S21) If the beginning is the end of a sentence, it is determined that the word extraction has succeeded, and the process proceeds to S29. If it is not the end of the sentence, proceed to S22.

【００５１】（Ｓ２２）ルビフォーマット処理部２３
は、入力文先頭にルビフォーマットデータが存在するか
どうかを判定する。ルビフォーマット処理部は、所定の
ルビ書式規則と照合して、該書式を満足するかどうかを
判定し、その結果をテキスト解析部２１に返送する。テ
キスト解析部は、ルビフォーマット処理部の判定結果が
Ｙｅｓ（書式と合致）であれば、Ｓ３１へ処理を進め、
Ｎｏであれば、Ｓ２３へ処理を進める。(S22) Ruby format processing unit 23
Determines whether ruby format data exists at the beginning of the input sentence. The ruby format processing unit collates with a predetermined ruby format rule, determines whether or not the format is satisfied, and returns the result to the text analysis unit 21. If the determination result of the ruby format processing unit is Yes (matches with the format), the text analysis unit advances the processing to S31,
If No, the process proceeds to S23.

【００５２】（Ｓ２３）メイン辞書１４４、およびユー
ザ辞書１４３を検索して、文先頭から始まる単語をすべ
て取得し、これらを単語候補群とする。(S23) The main dictionary 144 and the user dictionary 143 are searched to acquire all the words starting from the beginning of the sentence, and these are set as a word candidate group.

【００５３】（Ｓ２４）取得した単語候補群の単語候補
数が０ならば、単語切出しが失敗したとして、単語切出
し処理を終了するため、Ｓ３０へ進む。単語候補数が１
個以上であれば、Ｓ２５へ進む。(S24) If the number of word candidates in the acquired word candidate group is 0, it is determined that the word cutting has failed, and the word cutting process is terminated, so that the process proceeds to S30. Number of word candidates is 1
If the number is greater than or equal to the number, the process proceeds to S25.

【００５４】（Ｓ２５）取得した単語候補群の中から単
語候補を１つ選択する。(S25) One word candidate is selected from the acquired word candidate group.

【００５５】（Ｓ２６）選択した単語候補と、作業バッ
ファに格納されている一つ前の単語の文法的接続関係を
チェックする。チェックの結果、文法的接続関係が許容
される関係にある場合は、当該単語候補を作業バッファ
に格納してＳ２７に進み、許されない組み合わせの場合
にはＳ２４に戻り、他の単語候補を選択する。(S26) The grammatical connection relation between the selected word candidate and the previous word stored in the work buffer is checked. As a result of the check, if the grammatical connection relation is allowed, the word candidate is stored in the work buffer and the process proceeds to S27. If the combination is not allowed, the process returns to S24 to select another word candidate. .

【００５６】（Ｓ２７）当該単語候補（ルビフォーマッ
トデータから抽出された単語ならば当該ルビフォーマッ
トによる単語）の次の文字から始まる文を切出すため、
再帰的に単語切出し処理を呼出す。(S27) In order to cut out a sentence starting from the next character of the word candidate (if the word is extracted from the ruby format data, the word is in the ruby format),
Call the word cutout process recursively.

【００５７】（Ｓ２８）Ｓ２７の結果が成功の場合はＳ
２９へ進み、失敗の場合は選択した単語候補を棄却し
て、Ｓ２４へ戻り、次の単語候補について処理を行う。(S28) If the result of S27 is success, S
In step 29, if it fails, the selected word candidate is rejected, the process returns to step S24, and the next word candidate is processed.

【００５８】（Ｓ２９）単語の切出しが成功したとし
て、この処理を終了する。(S29) This process is terminated on the assumption that the words have been cut out successfully.

【００５９】（Ｓ３０）単語の切出しが失敗したとし
て、この処理を終了する。(S30) This process is terminated on the assumption that the extraction of words has failed.

【００６０】（Ｓ３１）ステップＳ２２の判定結果、ル
ビフォーマットデータであると判定された場合の処理で
ある。ルビフォーマットデータから、表記、読み、アク
セント型、モーラ数、品詞を抽出して１組の単語データ
とし、作業バッファへ追加しＳ３２へ進む。ただし、単
語情報にはルビフォーマットから抽出したことを示すた
めのフラグをセットし、後段で必要となる文法的接続チ
ェックが必ず成功するようにしている。(S31) This process is performed when it is determined that the data is ruby format data as a result of the determination in step S22. The notation, reading, accent type, mora number, and part of speech are extracted from the ruby format data to form a set of word data, which is added to the work buffer and the process proceeds to S32. However, a flag is set in the word information to indicate that it has been extracted from the ruby format, so that the grammatical connection check, which is required in the latter stage, will always succeed.

【００６１】（Ｓ３２）ルビフォーマットデータからユ
ーザ辞書追加指定情報（図９の９３参照）を抽出し、こ
の情報が１であるか否かを判定する。この結果がＹｅｓ
であれば、Ｓ２３へ進み、ＮｏであればＳ２７へ進む。
これにより、ユーザの希望したルビフォーマット指定の
単語だけがユーザ辞書に登録されることになる。(S32) User dictionary addition designation information (see 93 in FIG. 9) is extracted from the ruby format data, and it is determined whether or not this information is 1. This result is Yes
If so, the process proceeds to S23, and if No, the process proceeds to S27.
As a result, only the words in the ruby format designated by the user are registered in the user dictionary.

【００６２】（Ｓ３３）ルビフォーマット指定による単
語情報をユーザ辞書１４３へ追加し、Ｓ２７へ進む。な
お、このＳ３３の処理が終了した直後から、ルビフォー
マット指定による登録単語が有効になる。(S33) Word information specified by the ruby format is added to the user dictionary 143, and the process proceeds to S27. Immediately after the processing of S33 ends, the registered word specified by the ruby format becomes valid.

【００６３】上記処理によれば、ルビフォーマットデー
タを含んだ入力テキストが単語に分割され、各単語の読
みとアクセント情報が取得され、最終的に中間言語に変
換される。また、ルビフォーマットデータを処理するの
と同時に、該ルビフォーマットで指定された単語をユー
ザ辞書へ自動的に登録できるので、ユーザは装置の動作
モードを意識する必要なくユーザ単語登録を行うことが
可能になる。According to the above processing, the input text containing the ruby format data is divided into words, the reading and accent information of each word is acquired, and finally converted into an intermediate language. Also, at the same time as processing the ruby format data, the word specified in the ruby format can be automatically registered in the user dictionary, so that the user can register the user word without having to be aware of the operation mode of the device. become.

【００６４】さらに、本実施の形態によれば、ユーザ辞
書追加指定情報をルビフォーマットデータに設け、この
情報を取得し、取得した値に応じてユーザ辞書への追加
／非追加を切替えるようにしたので、ユーザが指定した
ルビフォーマットによる単語だけをユーザ辞書に追加す
ることが可能になる。Further, according to the present embodiment, the user dictionary addition designation information is provided in the ruby format data, this information is acquired, and addition / non-addition to the user dictionary is switched according to the acquired value. Therefore, it is possible to add only the words in the ruby format specified by the user to the user dictionary.

【００６５】第３の実施の形態［構成］図５に示すように、第３の実施の形態における
テキスト音声変換部１４２は、メイン辞書１４４とユー
ザ辞書１４３を用いて入力テキストの単語を解析して、
中間言語に変換するためのテスト解析部３１と、テキス
ト解析部３１から出力された中間言語を音声波形に変換
するための音声合成部３２と、入力テキスト中のルビフ
ォーマットデータをチェックするためのルビフォーマッ
ト処理部３３と、ルビフォーマットデータであると認識
されたデータを、ルビフォーマット処理部３３から受取
り、文法情報、発音情報等のユーザ辞書登録に必要な単
語情報を取得して、ユーザ辞書１４３に追加するための
ユーザ単語追加部３６とルビフォーマットデータからユ
ーザ辞書追加指定情報を取得し、この情報に従って、当
該ルビフォーマットデータをユーザ辞書に追加するか否
かを決定するルビ選別部３４と、当該ルビフォーマット
データが既にメイン辞書１４４に登録済であるか否かを
判定するための重複データ検査部３５を設けている。Third Embodiment [Structure] As shown in FIG. 5, the text-to-speech conversion unit 142 in the third embodiment analyzes the words of the input text using the main dictionary 144 and the user dictionary 143. hand,
A test analysis unit 31 for converting into an intermediate language, a voice synthesis unit 32 for converting the intermediate language output from the text analysis unit 31 into a voice waveform, and a ruby for checking ruby format data in the input text. The format processing unit 33 and the data recognized as the ruby format data are received from the ruby format processing unit 33, and the word information necessary for registering the user dictionary such as grammatical information and pronunciation information is acquired and stored in the user dictionary 143. The user word addition unit 36 for adding and the user dictionary addition designation information from the ruby format data, and according to this information, the ruby selection unit 34 that determines whether or not to add the ruby format data to the user dictionary, and A key for determining whether or not the ruby format data has already been registered in the main dictionary 144. The data check portion 35 is provided.

【００６６】［動作］次に、図５と図１４を参照して、
装置全体の動作説明を行う。まず、「テキスト音声変換
モード」において、テキストデータを「読み上げデー
タ」として扱う。制御部１４１は、入力されたテキスト
データ（図１０参照）をテキスト音声変換部１４２へ送
る。テキスト音声変換部１４２は、入力したテキストデ
ータを解析し、中間言語に変換する。このとき、テキス
トデータに、ルビフォーマットデータ（図９参照）が含
まれていれば、ルビフォーマット処理部３３とルビ選別
部３４とユーザ単語追加部３６と重複データ検査部３５
を用いてユーザ辞書１４３を更新する。なお、このと
き、ルビ選別部３４で登録指定のあったルビフォーマッ
トデータであって、かつ、重複データ検査部３５でメイ
ン辞書１４４に同一単語が登録されていない場合だけ、
当該ルビフォーマット指定された単語データをユーザ辞
書に登録する。その後、取得した中間言語は、音声合成
部３２に送られ、音声波形データに変換され、スピーカ
１４５から出力される。[Operation] Next, referring to FIG. 5 and FIG.
The operation of the entire apparatus will be described. First, in the “text-speech conversion mode”, text data is treated as “read-aloud data”. The control unit 141 sends the input text data (see FIG. 10) to the text-to-speech conversion unit 142. The text-to-speech conversion unit 142 analyzes the input text data and converts it into an intermediate language. At this time, if the text data includes ruby format data (see FIG. 9), the ruby format processing unit 33, the ruby selecting unit 34, the user word adding unit 36, and the duplicate data checking unit 35.
To update the user dictionary 143. At this time, only when the ruby format data has been designated for registration by the ruby selection unit 34 and the same word is not registered in the main dictionary 144 by the duplicate data inspection unit 35,
The word data specified by the ruby format is registered in the user dictionary. After that, the acquired intermediate language is sent to the voice synthesis unit 32, converted into voice waveform data, and output from the speaker 145.

【００６７】次に、テキスト解析部３１の動作を詳細に
説明する。テキスト解析部３１は、テキストデータを受
け取り、ユーザ辞書１４３、メイン辞書１４４、および
ルビフォーマット処理部３３、ルビ選別部３４、重複デ
ータ検索部３５、ユーザ単語追加部３６を用いて、単語
切り出し処理を行い、入力テキストを単語区切りデータ
（図１１参照）に変換する。次いで、切出された複数の
単語を該単語に付属している情報を元に、フレーズ単位
にまとめて中間言語を出力する（図１２参照）。Next, the operation of the text analysis unit 31 will be described in detail. The text analysis unit 31 receives the text data, and uses the user dictionary 143, the main dictionary 144, the ruby format processing unit 33, the ruby selection unit 34, the duplicate data search unit 35, and the user word addition unit 36 to perform a word segmentation process. Then, the input text is converted into word segmentation data (see FIG. 11). Then, based on the information attached to the extracted words, the extracted words are put together in phrase units and an intermediate language is output (see FIG. 12).

【００６８】この時、ルビフォーマットデータが入力テ
キスト中に存在すると、該当箇所に対しては、メイン辞
書やユーザ辞書と照合せず、指定された読みになるよう
に単語切出し処理を行う。さらに、このとき、ルビフォ
ーマットデータ内のユーザ辞書登録指定が１の場合は、
該ルビフォーマットデータから表記と読みとアクセント
型とモーラ数と品詞を取得し（図１３参照）、これらを
１組の単語データとして、ユーザ辞書へ追加する。At this time, if the ruby format data is present in the input text, the word cut-out process is performed on the corresponding portion so as to obtain the designated reading without collating with the main dictionary or the user dictionary. Furthermore, at this time, if the user dictionary registration designation in the ruby format data is 1,
The notation, reading, accent type, mora number, and part of speech are acquired from the ruby format data (see FIG. 13), and these are added to the user dictionary as a set of word data.

【００６９】これによりユーザ辞書登録指定のあった単
語だけについて、当該ルビフォーマットデータが今後有
効になるので、同一表記の単語が出現するとユーザ辞書
に登録された読みで中間言語に変換することが可能とな
る。As a result, the ruby format data will be valid only for the words for which the user dictionary has been designated to be registered, so that when words of the same notation appear, the reading registered in the user dictionary can be converted into an intermediate language. Becomes

【００７０】次に、図６を参照してテキスト解析部３１
の単語切り出し処理、特にルビフォーマットデータの処
理について詳細に説明する。Next, referring to FIG. 6, the text analysis unit 31
The word cutout processing, particularly the processing of ruby format data, will be described in detail.

【００７１】テキストが入力されると、テキストの先頭
文字から始まる単語を切出す処理を開始する。この処理
は再帰的に処理される処理であり、切出された単語は、
作業バッファに格納され、その初期状態は空である。When the text is input, the process of cutting out the word starting from the first character of the text is started. This process is a recursive process, and the extracted words are
It is stored in the working buffer and its initial state is empty.

【００７２】（Ｓ４１）先頭が文末であれば、単語切出
し成功として、Ｓ４９に進む。文末でなければ、Ｓ４２
へ進む。(S41) If the beginning is the end of a sentence, it is determined that the word extraction has succeeded, and the process proceeds to S49. If not the end of sentence, S42
Go to.

【００７３】（Ｓ４２）ルビフォーマット処理部３３
は、入力文先頭にルビフォーマットデータが存在するか
どうかを判定する。ルビフォーマット処理部は、所定の
ルビ書式規則と照合して、該書式を満足するかどうかを
判定し、その結果をテキスト解析部に返送する。テキス
ト解析部は、ルビフォーマット処理部の判定結果がＹｅ
ｓであれば、Ｓ５１へ処理を進め、Ｎｏであれば、Ｓ４
３へ処理を進める。(S42) Ruby format processor 33
Determines whether ruby format data exists at the beginning of the input sentence. The ruby format processing unit collates with a predetermined ruby format rule, determines whether or not the format is satisfied, and returns the result to the text analysis unit. In the text analysis unit, the determination result of the ruby format processing unit is Yes.
If s, the process proceeds to S51, and if No, S4.
The process proceeds to 3.

【００７４】（Ｓ４３）メイン辞書、およびユーザ辞書
を検索して、文先頭から始まる単語をすべて取得し、こ
れらを単語候補群とする。(S43) The main dictionary and the user dictionary are searched to acquire all the words starting from the beginning of the sentence, and these are set as a word candidate group.

【００７５】（Ｓ４４）取得した単語候補群の単語候補
数が０ならば、単語切出しが失敗したとして、単語切出
し処理を終了するため、Ｓ５０へ進む。単語候補数が１
個以上であれば、Ｓ４５へ進む。(S44) If the number of word candidates in the acquired word candidate group is 0, it is determined that the word cutting has failed, and the word cutting process is terminated, so that the process proceeds to S50. Number of word candidates is 1
If the number is greater than or equal to the number, the process proceeds to S45.

【００７６】（Ｓ４５）取得した単語候補群の中から単
語候補を１つ選択する。(S45) One word candidate is selected from the acquired word candidate group.

【００７７】（Ｓ４６）選択した単語候補と、作業バッ
ファに格納されている一つ前の単語の文法的接続関係を
チェックする。チェックの結果、文法的接続関係が許容
される関係にある場合は、当該単語候補を作業バッファ
に格納してＳ４７に進み、許されない組み合わせの場合
にはＳ４４に戻り、他の単語候補を選択する。(S46) Check the grammatical connection relation between the selected word candidate and the previous word stored in the work buffer. As a result of the check, if the grammatical connection relation is allowed, the word candidate is stored in the work buffer and the process proceeds to S47. If the combination is not allowed, the process returns to S44 and another word candidate is selected. .

【００７８】（Ｓ４７）当該単語候補（ルビフォーマッ
トデータから抽出された単語ならば当該ルビフォーマッ
トによる単語）の次の文字から始まる文を切出すため、
再帰的に当該単語切出し処理を呼出す。(S47) To cut out a sentence starting from the next character of the word candidate (if the word is extracted from the ruby format data, the word is in the ruby format),
The word cutout process is recursively called.

【００７９】（Ｓ４８）ステップＳ４７の結果が成功の
場合はＳ４９へ進み、失敗の場合は選択した単語候補を
棄却して、Ｓ４４へ戻り、次の単語候補について処理を
行う。(S48) If the result of step S47 is successful, the process proceeds to S49. If the result is unsuccessful, the selected word candidate is discarded, and the process returns to S44 to process the next word candidate.

【００８０】（Ｓ４９）単語の切出しが成功したとし
て、この処理を終了する。(S49) Assuming that the words have been cut out successfully, this process ends.

【００８１】（Ｓ５０）単語の切出しが失敗したとし
て、この処理を終了する。(S50) This process is terminated because it is determined that the cutting of the word has failed.

【００８２】（Ｓ５１）ステップＳ４２の判定結果、ル
ビフォーマットデータであると判定された場合の処理で
ある。ルビフォーマットデータから、表記、読み、アク
セント型、モーラ数、品詞を取得して１組の単語データ
とし、作業バッファへ追加しＳ５２へ進む。ただし、単
語情報にはルビフォーマットから抽出したことを示すた
めのフラグをセットし、後段で必要となる文法的接続チ
ェックが必ず成功するようにしている。(S51) This process is performed when it is determined that the data is ruby format data as a result of the determination in step S42. The notation, reading, accent type, number of mora, and part-of-speech are obtained from the ruby format data to form a set of word data, which is added to the work buffer and the process proceeds to S52. However, a flag is set in the word information to indicate that it has been extracted from the ruby format, so that the grammatical connection check, which is required in the latter stage, will always succeed.

【００８３】（Ｓ５２）ルビフォーマットデータからユ
ーザ辞書指定情報（図９の９３参照）を抜き出し、この
拡張情報が１であれるか否かを判定する。Ｙｅｓであれ
ば（１であれば）、Ｓ５３へ進み、Ｎｏであれば、Ｓ４
７に進む。(S52) The user dictionary designation information (see 93 in FIG. 9) is extracted from the ruby format data, and it is determined whether or not this extension information is 1. If Yes (if 1), proceed to S53, and if No, S4.
Proceed to 7.

【００８４】（Ｓ５３）取得した単語情報と同一表記、
同一読みの単語が、メイン辞書に既に登録されているか
否か判定する。メイン辞書を検索し、当該単語と同一表
記、同一読みの単語が存在した場合は、重複データあり
としてＳ４７に進み、存在しなかった場合は、Ｓ５４に
進む。(S53) The same notation as the acquired word information,
It is determined whether or not the words having the same reading are already registered in the main dictionary. When the main dictionary is searched and a word having the same notation and the same reading as the word exists, it is determined that there is duplicate data, and the process proceeds to S47. If not, the process proceeds to S54.

【００８５】（Ｓ５４）取得した単語情報を、ユーザ辞
書１４３へ追加し、Ｓ４７へ進む。(S54) The acquired word information is added to the user dictionary 143, and the process proceeds to S47.

【００８６】上記処理に従えば、ルビフォーマットデー
タを含んだ入力テキストが単語に分割され、各単語の読
みとアクセント情報を取得でき、最終的に中間言語に変
換できる。また、ルビフォーマットを処理するのと同時
に、ルビフォーマット指定された単語をユーザ辞書へ自
動的に登録できるので、ユーザは装置の動作モードを意
識する必要なくユーザ単語登録を行うことが可能にな
る。According to the above processing, the input text containing the ruby format data is divided into words, and the reading and accent information of each word can be acquired and finally converted into an intermediate language. In addition, since the word in the ruby format can be automatically registered in the user dictionary at the same time as the ruby format is processed, the user can perform the user word registration without having to be aware of the operation mode of the device.

【００８７】さらに、本実施の形態によれば、メイン辞
書に同一単語データが存在する場合は、ユーザ辞書に追
加しないため、ユーザは単語データの重複を意識するこ
となく、ユーザ辞書への単語登録を行うことができる。Further, according to the present embodiment, when the same word data exists in the main dictionary, it is not added to the user dictionary, so that the user can register the word in the user dictionary without being aware of the duplication of the word data. It can be performed.

【００８８】尚、本発明は前述の各実施の形態に限定さ
れるものでは無く、本発明の趣旨に基づいて種々変形さ
せることが可能である。The present invention is not limited to the above-mentioned embodiments, but can be modified in various ways based on the spirit of the present invention.

【００８９】例えば、前述の実施の形態においては、ル
ビフォーマットにより指定された単語データをユーザ辞
書に登録しているが、ルビフォーマットに代えて表記と
中間言語文字列が組となったフォーマットで指定するよ
うにしてもよい。For example, in the above-described embodiment, the word data designated by the ruby format is registered in the user dictionary. However, instead of the ruby format, it is designated by the format in which the notation and the intermediate language character string are paired. You may do it.

【００９０】また、特に人名や地名をルビフォーマット
により指定することにより、これらの単語はシステム辞
書に登録されていないものが多いため、ユーザの意図し
た通りに読み上げさせるのに特に有効となる。Further, in particular, by designating a person's name or a place name in the ruby format, many of these words are not registered in the system dictionary, so that it is particularly effective for reading them out as intended by the user.

【００９１】[0091]

【発明の効果】以上詳細に説明したように、本発明によ
れば、ルビフォーマットにより指定した単語を、システ
ムが自動的にユーザ辞書に登録するので、ユーザは、同
一表記の単語に対して、ルビ指定を繰返す煩わしさから
開放されるという効果と、ユーザは、読み上げモードで
ユーザ単語の辞書登録が可能となるので、単語登録モー
ドと読み上げモードの区別を意識する必要がないという
効果と、ユーザ辞書編集装置を追加導入することなく、
ユーザ辞書に単語を登録することができるという効果が
ある。As described above in detail, according to the present invention, the system automatically registers the word designated by the ruby format in the user dictionary. The effect of being free from the hassle of repeating ruby designation, and the effect that the user can register the user words in the reading mode dictionary, so that it is not necessary to be aware of the distinction between the word registration mode and the reading mode, and the user Without introducing a dictionary editing device,
The effect is that words can be registered in the user dictionary.

[Brief description of drawings]

【図１】第１の実施の形態におけるテキスト音声変換部
の構成図である。FIG. 1 is a configuration diagram of a text-to-speech conversion unit according to the first embodiment.

【図２】第１の実施の形態におけるテキスト解析部の単
語切出し処理フローである。FIG. 2 is a word cutout processing flow of a text analysis unit according to the first embodiment.

【図３】第２の実施の形態におけるテキスト音声変換部
の構成図である。FIG. 3 is a configuration diagram of a text-to-speech conversion unit according to the second embodiment.

【図４】第２の実施の形態におけるテキスト解析部の単
語切出し処理フローである。FIG. 4 is a word cutout processing flow of a text analysis unit according to the second embodiment.

【図５】第３の実施の形態におけるテキスト音声変換部
の構成図である。FIG. 5 is a configuration diagram of a text-to-speech conversion unit according to the third embodiment.

【図６】第３の実施の形態におけるテキスト解析部の単
語切出し処理フローである。FIG. 6 is a word cutout processing flow of a text analysis unit according to the third embodiment.

【図７】ルビフォーマットの例１である。FIG. 7 is an example 1 of a ruby format.

【図８】入力テキストの例１である。FIG. 8 is an example 1 of input text.

【図９】ルビフォーマットの例２である。FIG. 9 is a second example of a ruby format.

【図１０】入力テキストの例２である。FIG. 10 is an example 2 of input text.

【図１１】単語切出し処理結果の例である。FIG. 11 is an example of a word cutout processing result.

【図１２】テキスト解析部の出力例である。FIG. 12 is an output example of a text analysis unit.

【図１３】追加単語データの一例である。FIG. 13 is an example of additional word data.

【図１４】一般的なテキスト音声合成装置の全体構成図
である。FIG. 14 is an overall configuration diagram of a general text-to-speech synthesizer.

[Explanation of symbols]

１１、２１、３１テキスト解析部１２、２２、３２音声合成部１３、２３、３３ルビフォーマット処理部１４、２５、３６ユーザ単語追加部１４２テキスト音声変換部１４３ユーザ辞書１４４メイン辞書２４、３４ルビ選別部３５複データ検査部 11, 21, 31 Text analysis section 12, 22, 32 Speech synthesizer 13, 23, 33 ruby format processing unit 14, 25, 36 User word addition section 142 Text-to-speech converter 143 user dictionary 144 Main dictionary 24, 34 Ruby sorting section 35 Multiple data inspection section

Claims

[Claims]

1. A text-to-speech conversion unit for converting an input text into a speech waveform, a main dictionary pre-installed in the apparatus for analyzing words in the input text, and a user dictionary in which a user can register words. In the text-to-speech synthesis device, the text-to-speech conversion unit synthesizes a speech waveform from the intermediate language and a text analysis unit that analyzes words in the input text using the main dictionary and the user dictionary and converts the words into an intermediate language. In cooperation with the speech synthesis unit and the text analysis unit, it is determined whether or not the input text includes ruby format data in which words and their readings are juxtaposed, and whether or not the ruby format is satisfied. The ruby format processing unit that determines whether or not the ruby format data recognized as the ruby format is Text-to-speech conversion system which is characterized in that a user word additional portion receives from the formatting processing unit adds the word information necessary to the user dictionary registration add to the user dictionary.

2. The text-to-speech synthesis apparatus according to claim 1, wherein the text-to-speech conversion unit further acquires extension information from the ruby format data, and adds the ruby format data to a user dictionary according to the extension information. A text-to-speech conversion device comprising a ruby selection unit for determining whether or not to perform.

3. The text-to-speech synthesis apparatus according to claim 2, wherein the text-to-speech conversion unit further determines whether or not the word information extracted from the ruby format data is already registered in the main dictionary. A text-to-speech conversion device, which is provided with a duplicate data inspection unit.

4. The text-to-speech synthesis apparatus according to claim 1, wherein the words extracted from the ruby format data are words representing a place name and a person's name. apparatus.

5. The text-to-speech synthesizer according to claim 1, wherein a format represented by a set of notation and intermediate language character string is used instead of the ruby format. Text-to-speech synthesizer.