JP2001014310A

JP2001014310A - Device and method for compressing conversion dictionary used for voice synthesis application

Info

Publication number: JP2001014310A
Application number: JP11187598A
Authority: JP
Inventors: Atsushi Yamamoto; 篤志山本; Akihiro Kimura; 晋太木村
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1999-07-01
Filing date: 1999-07-01
Publication date: 2001-01-19

Abstract

PROBLEM TO BE SOLVED: To generate a new dictionary wherein the priority of data is set according to the needs of users. SOLUTION: This method includes a dictionary size input part for inputting the size of a dictionary and a priority decision part 12 which decides the priority of vocabularies stored in the dictionary. The priority decision part extracts vocabularies of high priority from a dictionary main body 13 in order according to corpus 14 information, and stores extracted vocabularies up to the dictionary size inputted at the dictionary input part 11 to generate a new dictionary.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、テキストデータや
波形データ等に基づいて音声合成を行うアプリケーショ
ンにおける、変換に用いる辞書を圧縮する装置及び方法
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus and a method for compressing a dictionary used for conversion in an application for performing speech synthesis based on text data, waveform data, and the like.

【０００２】[0002]

【従来の技術】昨今のコンピュータ技術の急速な進展に
よって、従来はそのデータ量の膨大さによって、処理時
間が実用的な範囲に収まらないことを理由として敬遠さ
れてきた音声を用いたアプリケーションが多々作成され
るようになってきた。2. Description of the Related Art Due to the rapid progress of computer technology in recent years, there have been many applications using voice, which have been shunned because the processing time is not within a practical range due to the huge amount of data. It is being created.

【０００３】その中でも、テキストデータや波形データ
等に基づいて、音声を人工的に合成して発生させるアプ
リケーションは、利用者にコンピュータを使用している
ことを意識させないユーザインタフェースを実現するた
めに広く用いられるようになってきた。[0003] Above all, applications that artificially synthesize speech based on text data, waveform data, and the like are widely used to realize a user interface that does not make the user aware of using a computer. It is being used.

【０００４】かかるアプリケーションを実現するために
は、変換のための辞書が必須となってくる。すなわち、
音声合成アプリケーションにおいては、単語辞書や波形
辞書が必要となる。In order to realize such an application, a dictionary for conversion is essential. That is,
In a speech synthesis application, a word dictionary and a waveform dictionary are required.

【０００５】しかし、一般にこれらの辞書によって音声
合成の明瞭度を上げるためには、辞書に含まれるべき情
報は膨大なものになるおそれがあり、またその情報量が
多くなればなるほど演算処理時間は長くなることから、
辞書に含まれるべき情報をどの程度にするのかは、実用
上重要な課題となっている。However, in general, in order to increase the clarity of speech synthesis using these dictionaries, the information to be included in the dictionaries may be enormous, and the more the amount of information, the longer the processing time becomes. Because it becomes long,
Determining how much information should be included in a dictionary is an important practical issue.

【０００６】実用的かつコンパクトな辞書を作成する方
法については、多くの方法が開示されている。例えば、
特開平５−１８９４１５号公報においては、優先順位の
固定された辞書から、長期学習情報やニューロテーブル
の活性値に基づいた基本語を取り出すことで、新たな精
選辞書を作成する方法が開示されている。Many methods have been disclosed for creating a practical and compact dictionary. For example,
Japanese Patent Application Laid-Open No. 5-189415 discloses a method of creating a new selected dictionary by extracting basic words based on long-term learning information and the activation value of a neuro table from a dictionary having a fixed priority. I have.

【０００７】[0007]

【発明が解決しようとする課題】しかし、上述したよう
な従来の方法では、元になる辞書の優先順位が固定され
ていることから、新たな精選辞書を作成するための基本
語抽出条件によっては、基本語の抽出が効率よく行われ
ない場合が生じるという問題点があった。すなわち、利
用者のニーズは時々刻々変化するものであり、優先順位
が固定されているということは、たとえ長期学習情報等
によって抽出条件を付加したとしても、辞書内の探索順
序は固定された優先順位に従うことから、付加される抽
出条件によっては抽出されるべき基本語の探索効率が悪
く、抽出に時間がかかってしまうおそれがある。However, in the conventional method as described above, since the priorities of the original dictionaries are fixed, depending on the basic word extraction conditions for creating a new selected dictionary. However, there is a problem that the extraction of the basic word may not be performed efficiently. In other words, the needs of the user change from moment to moment, and the fact that the priorities are fixed means that the search order in the dictionary is fixed even if extraction conditions are added by long-term learning information or the like. Since the order is followed, the search efficiency of the basic word to be extracted is low depending on the added extraction condition, and the extraction may take time.

【０００８】また、特殊事情による抽出条件が必要とさ
れる場合、例えば特定の雑誌に関する語彙を優先的に抽
出するとか、今日はこのカテゴリについて、明日はこの
カテゴリについて、というように、優先基準が短期間で
頻繁に変化する場合においては、上述したような従来の
方法では対応しきれない。[0008] In addition, when extraction conditions due to special circumstances are required, for example, vocabulary relating to a specific magazine is preferentially extracted, or a priority criterion such as this category for today, this category for tomorrow, and the like. In the case of frequently changing in a short period, the conventional method as described above cannot cope with the situation.

【０００９】本発明は、上記問題点を解決すべく、利用
者のニーズに合わせて、データの優先順位を設定した新
たな辞書を作成することのできる音声合成アプリケーシ
ョンに用いる変換辞書圧縮装置及び方法を提供すること
を目的とする。In order to solve the above problems, the present invention provides a conversion dictionary compression apparatus and method for use in a speech synthesis application capable of creating a new dictionary in which data priorities are set according to the needs of the user. The purpose is to provide.

【００１０】[0010]

【課題を解決するための手段】上記目的を達成するため
に本発明にかかる音声合成アプリケーションに用いる変
換辞書圧縮装置は、辞書のサイズを入力する辞書サイズ
入力部と、辞書に格納されている語彙の優先順位を判定
する優先順位判定部とを含み、優先順位判定部におい
て、コーパス情報に基づいて辞書本体から優先順位の高
い語彙を順に抽出し、辞書サイズ入力部で入力された辞
書サイズになるまで抽出した語彙を格納して新たな辞書
を作成することを特徴とする。According to the present invention, there is provided a conversion dictionary compression apparatus used for a speech synthesis application according to the present invention, comprising: a dictionary size input unit for inputting a dictionary size; and a vocabulary stored in the dictionary. A priority determining unit that determines the priority of the dictionary, and in the priority determining unit, extracts words having a high priority from the dictionary body in order based on the corpus information, and obtains the dictionary size input by the dictionary size input unit. A new dictionary is created by storing the extracted vocabulary up to this point.

【００１１】かかる構成により、コーパス情報に従っ
て、辞書に含まれる語彙の優先順位を動的に変化させる
ことができるので、特定の雑誌に関する語彙を優先的に
抽出するとか、優先基準が短期間で頻繁に変化する場合
においても、コーパス情報を変えるだけで抽出すべき基
本語の探索を効率良く行うことが可能となる。With this configuration, the priority of the vocabulary included in the dictionary can be dynamically changed in accordance with the corpus information. Therefore, the vocabulary related to a specific magazine is preferentially extracted, or the priority standard is frequently set in a short period of time. , It is possible to efficiently search for basic words to be extracted simply by changing the corpus information.

【００１２】次に、上記目的を達成するために本発明に
かかる音声合成アプリケーションに用いる変換辞書圧縮
方法は、辞書のサイズを入力する工程と、辞書に格納さ
れている語彙の優先順位を判定する工程とを含み、コー
パス情報に基づいて辞書本体から優先順位の高い語彙を
順に抽出し、入力された辞書サイズになるまで抽出した
語彙を格納して新たな辞書を作成することを特徴とす
る。Next, in order to achieve the above object, a conversion dictionary compression method used for a speech synthesis application according to the present invention comprises the steps of inputting the size of a dictionary and determining the priority of vocabulary stored in the dictionary. And extracting a vocabulary having a high priority from the dictionary body based on the corpus information, and storing the extracted vocabulary until the input dictionary size is reached, to create a new dictionary.

【００１３】かかる構成により、コーパス情報に従っ
て、辞書に含まれる語彙の優先順位を動的に変化させる
ことができるので、特定の雑誌に関する語彙を優先的に
抽出するとか、優先基準が短期間で頻繁に変化する場合
においても、コーパス情報を変えるだけで抽出すべき基
本語の探索を効率良く行うことが可能となる。With this configuration, the priority of the vocabulary included in the dictionary can be dynamically changed according to the corpus information. Therefore, the vocabulary related to a specific magazine can be preferentially extracted, or the priority criterion is frequently set in a short period of time. , It is possible to efficiently search for basic words to be extracted simply by changing the corpus information.

【００１４】また、本発明は、上記のような音声合成ア
プリケーションに用いる変換辞書圧縮の機能をコンピュ
ータの処理ステップとして実行するソフトウェアを特徴
とするものであり、具体的には、辞書のサイズを入力す
る工程と、辞書に格納されている語彙の優先順位を判定
する工程とを含み、コーパス情報に基づいて辞書本体か
ら優先順位の高い語彙を順に抽出し、入力された辞書サ
イズになるまで抽出した語彙を格納して新たな辞書を作
成する音声合成アプリケーションに用いる変換辞書圧縮
方法並びにそのような工程をプログラムとして記録した
コンピュータ読み取り可能な記録媒体であることを特徴
とする。Further, the present invention is characterized by software for executing the function of the conversion dictionary compression used for the above-mentioned speech synthesis application as a processing step of a computer, and more specifically, to input a dictionary size. And deciding the priority order of the vocabulary stored in the dictionary. The vocabulary with the highest priority is sequentially extracted from the dictionary body based on the corpus information, and extracted until the input dictionary size is reached. A conversion dictionary compression method used for a speech synthesis application that stores a vocabulary and creates a new dictionary, and a computer-readable recording medium that records such a process as a program.

【００１５】かかる構成により、コンピュータ上へ当該
プログラムをロードさせ実行することで、コーパス情報
に従って、辞書に含まれる語彙の優先順位を動的に変化
させることができるので、特定の雑誌に関する語彙を優
先的に抽出するとか、優先基準が短期間で頻繁に変化す
る場合においても、コーパス情報を変えるだけで抽出す
べき基本語の探索を効率良く行うことが可能となる音声
合成アプリケーションに用いる変換辞書圧縮装置を実現
することができる。With this configuration, by loading and executing the program on the computer, the priority of the vocabulary included in the dictionary can be dynamically changed in accordance with the corpus information. Conversion dictionary compression for speech synthesis applications that enables efficient search of basic words to be extracted by simply changing corpus information, even when the priority criteria frequently change in a short period of time or when the priority criteria change frequently. The device can be realized.

【００１６】[0016]

【発明の実施の形態】以下、本発明の実施の形態にかか
る音声合成アプリケーションに用いる変換辞書圧縮装置
について、図面を参照しながら説明する。図１は本発明
の実施の形態にかかる音声合成アプリケーションに用い
る変換辞書圧縮装置の構成図である。図１において、１
１は辞書サイズ入力部を、１２は優先順位判定部を、１
３は辞書本体を、１４はコーパスを、１５は新規作成辞
書を、それぞれ示す。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, a conversion dictionary compression device used for a speech synthesis application according to an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a configuration diagram of a conversion dictionary compression device used for a speech synthesis application according to an embodiment of the present invention. In FIG. 1, 1
1 is a dictionary size input unit, 12 is a priority order determination unit, and 1 is a dictionary size input unit.
3 indicates a dictionary body, 14 indicates a corpus, and 15 indicates a newly created dictionary.

【００１７】図１において、まず辞書サイズ入力部１１
において作成する辞書のサイズを入力する。辞書のサイ
ズを入力するのは、例えばＰＤＡ（Personal DigitalAs
sistant ）や携帯端末のように計算機資源の限られてい
る媒体においても、音声を用いたアプリケーションを用
いることができるようにするためである。したがって、
ここで入力すべき辞書サイズは、元々の辞書サイズより
も小さい値を指定する。なお、入力は辞書のサイズのみ
に限定する必要はなく、辞書の登録語彙数等であっても
良い。In FIG. 1, first, a dictionary size input unit 11
Enter the size of the dictionary created in. The dictionary size is entered, for example, by using a PDA (Personal Digital
This is because it is possible to use an application using voice even in a medium having limited computer resources such as a sistant) and a portable terminal. Therefore,
Here, the dictionary size to be input specifies a value smaller than the original dictionary size. The input need not be limited to only the size of the dictionary, but may be the number of words registered in the dictionary.

【００１８】そして、優先順位判定部１２において、辞
書本体１３を参照しながら、辞書サイズ入力部１１から
入力された辞書サイズとなるまで、優先順位の高い順に
基本語を抽出する。ここで、優先順位自体はコーパス１
４の内容に基づいて定められる。コーパス１４の内容と
しては、例えば「コンピュータ」関連の文献であると
か、特定の新聞に掲載されている内容等が考えられる。Then, the priority order judging unit 12 extracts the basic words in descending order of priority until the size of the dictionary input from the dictionary size input unit 11 is reached while referring to the dictionary main body 13. Here, the priority itself is corpus 1
4 is determined based on the content. The contents of the corpus 14 may be, for example, documents related to “computers” or contents published in a specific newspaper.

【００１９】最後に、基本語抽出条件に合致した基本語
を辞書本体１３から順次抽出しながら、新規作成辞書１
５を作成していく。このようにして作成された新規作成
辞書１５は、辞書のサイズとしては辞書本体よりも小さ
くなっているのに対して、例えば音声合成アプリケーシ
ョンにおける合成音声の明瞭度は、特定の条件下におい
ては辞書本体を用いているのと遜色無いものとすること
が可能である。例えば、コーパス１４によって「コンピ
ュータ」というカテゴリに属する基本語が優先的に抽出
された新規作成辞書１５が作成された場合には、「コン
ピュータ」関連の単語の合成音声については通常の辞書
１３を用いた場合と同様の明瞭度を保持していることが
期待できる。Finally, while the basic words that match the basic word extraction conditions are sequentially extracted from the dictionary body 13, the newly created dictionary 1
5 is created. The newly created dictionary 15 created in this way is smaller in dictionary size than the dictionary itself, whereas the clarity of synthesized speech in a speech synthesis application, for example, under specific conditions, It is possible to compare with using the main body. For example, when a newly created dictionary 15 in which the basic words belonging to the category “computer” are preferentially extracted by the corpus 14 is used, the ordinary dictionary 13 is used for the synthesized speech of the words related to “computer”. It can be expected that the same clarity is maintained as in the case where the image is displayed.

【００２０】次に、図２は本発明の実施の形態にかかる
音声合成アプリケーションに用いる変換辞書圧縮装置を
音声合成アプリケーションに適用した一実施例を示す。
図２において、２１は入力装置を、２２は優先順位判定
装置を、２３は辞書本体を、２４はコーパスを、２５は
新規作成辞書を、２６は波形辞書を、２７は音声合成ア
プリケーションを、それぞれ示す。FIG. 2 shows an embodiment in which the conversion dictionary compression device used for the speech synthesis application according to the embodiment of the present invention is applied to a speech synthesis application.
In FIG. 2, 21 is an input device, 22 is a priority determination device, 23 is a dictionary body, 24 is a corpus, 25 is a newly created dictionary, 26 is a waveform dictionary, and 27 is a speech synthesis application. Show.

【００２１】図２において、まず入力装置２１において
利用するべき辞書の圧縮サイズを入力する。なお、入力
は辞書のサイズのみに限定する必要はなく、辞書の登録
語彙数等を入力しても良い。In FIG. 2, first, a compression size of a dictionary to be used in the input device 21 is inputted. The input need not be limited to only the size of the dictionary, and the number of registered words in the dictionary may be input.

【００２２】そして、優先順位判定装置２２において、
辞書本体２３を参照しながら、入力装置２１から入力さ
れた辞書サイズとなるように優先順位の高い順に基本語
を抽出する。ここで、優先順位自体は、コーパス２４を
用いることによって利用者が自由に設定することができ
る。例えば、「インターネット」というカテゴリに属す
る基本語の優先順位を高めるようなものでも良いし、特
定の雑誌・業界紙に掲載されている基本語の優先順位を
高めるものであっても良い。Then, in the priority determining device 22,
While referring to the dictionary main body 23, the basic words are extracted in descending order of priority so as to have the dictionary size input from the input device 21. Here, the priority itself can be freely set by the user by using the corpus 24. For example, the priority of basic words belonging to the category of "Internet" may be increased, or the priority of basic words published in a specific magazine or trade paper may be increased.

【００２３】最後に、基本語抽出条件に合致した基本語
を辞書本体２３から順次抽出しながら、新規作成辞書２
５を作成していく。かかる新規作成辞書２５と、波形辞
書２６に基づいて、音声合成アプリケーション２７が形
成される。Finally, while the basic words that match the basic word extraction conditions are sequentially extracted from the dictionary body 23, the newly created dictionary 2
5 is created. A speech synthesis application 27 is formed based on the newly created dictionary 25 and the waveform dictionary 26.

【００２４】このようにして作成された新規作成辞書２
５は、辞書のサイズとしては辞書本体よりも小さくなっ
ているのに対して、例えば音声合成アプリケーションに
おける合成音声の明瞭度は、特定の条件下においては辞
書本体を用いているのと遜色無いものとすることが可能
である。例えば、コーパス２４によって「インターネッ
ト」というカテゴリに属する基本語が優先的に抽出され
た新規作成辞書２５が作成された場合には、「インター
ネット」関連の単語の合成音声については通常の辞書２
３を用いた場合と同様の明瞭度を保持していることが期
待できる。The newly created dictionary 2 created in this way
5 indicates that the dictionary size is smaller than that of the dictionary itself, whereas the clarity of the synthesized speech in a speech synthesis application, for example, is not inferior to using the dictionary body under specific conditions. It is possible. For example, when a newly created dictionary 25 in which the basic words belonging to the category of “Internet” are preferentially extracted by the corpus 24, the synthesized dictionary of the words related to “Internet” is stored in the normal dictionary 2.
It can be expected that the same clarity as in the case of using No. 3 is maintained.

【００２５】また、優先順位自体を直接変更入力できる
構成であっても良い。例えば図３は本発明の実施の形態
にかかる音声合成アプリケーションに用いる変換辞書圧
縮装置に優先順位変更装置３１を追加した一実施例を示
す。Further, the configuration may be such that the priority itself can be directly changed and input. For example, FIG. 3 shows an embodiment in which a priority changing device 31 is added to the conversion dictionary compression device used for the speech synthesis application according to the embodiment of the present invention.

【００２６】図３において、優先順位変更装置３１は、
出現回数等のパラメータを変更することができ、一方で
は辞書に保管されるべきデータ自体を変更することも可
能である。すなわち、出現回数等のパラメータを変更す
ることで、辞書自体の優先度分布を変更することができ
ると共に、辞書に保管されるべきデータ自体を変更（追
加・削除等）することで出現回数等が変わり、辞書の優
先度が変更される。In FIG. 3, the priority changing device 31
Parameters such as the number of appearances can be changed, while data itself to be stored in the dictionary can be changed. That is, the priority distribution of the dictionary itself can be changed by changing parameters such as the number of appearances, and the number of appearances can be changed by changing (adding, deleting, etc.) the data itself to be stored in the dictionary. Changes, and the priority of the dictionary is changed.

【００２７】また、図４は波形辞書について、同様の辞
書サイズの圧縮処理を行う一実施例を示す。ここで波形
辞書とは、音声自体の出力波形を意味する。したがっ
て、同じ「あ」という言葉の表示であっても、その前後
の文字、あるいは文脈によってその出力波形は変化す
る。FIG. 4 shows an embodiment in which the same dictionary size compression processing is performed on the waveform dictionary. Here, the waveform dictionary means an output waveform of the voice itself. Therefore, even if the same word "a" is displayed, the output waveform changes depending on the characters before and after the word or the context.

【００２８】図４において、まず入力装置２１において
利用するべき波形辞書２６の圧縮サイズを入力する。な
お、特に波形辞書のサイズのみに限定する必要はなく、
波形辞書の登録波形数の上限値を入力しても良いし、新
規に作成すべき波形辞書のカテゴリを入力するものであ
っても良い。In FIG. 4, first, the compression size of the waveform dictionary 26 to be used in the input device 21 is input. It is not necessary to limit only to the size of the waveform dictionary.
The upper limit of the number of registered waveforms in the waveform dictionary may be input, or a category of a waveform dictionary to be newly created may be input.

【００２９】そして、優先順位判定装置２２において、
波形辞書本体２６を参照しながら、入力装置２１から入
力された辞書サイズとなるように優先順位の高い順に波
形を抽出する。ここで、優先順位自体は、コーパス２４
を用いることによって利用者が自由に設定することがで
きる。例えば、「インターネット」というカテゴリに属
する波形の優先順位を高めるようなものでも良いし、特
定のインタビュー等に表出されている音声波形の優先順
位を高めるものであっても良い。Then, in the priority determining device 22,
While referring to the waveform dictionary main body 26, waveforms are extracted in descending order of priority so as to have the dictionary size input from the input device 21. Here, the priority itself is the corpus 24
Can be set freely by the user. For example, the priority of a waveform belonging to the category "Internet" may be increased, or the priority of a voice waveform expressed in a specific interview or the like may be increased.

【００３０】最後に、波形抽出条件に合致した波形を波
形辞書本体２６から順次抽出しながら、新規波形辞書４
１を作成していく。かかる新規波形辞書４１と辞書２３
に基づいて、音声合成アプリケーション２７が形成され
る。Finally, while sequentially extracting the waveforms meeting the waveform extraction conditions from the waveform dictionary main body 26, the new waveform dictionary 4
Create one. The new waveform dictionary 41 and the dictionary 23
, A speech synthesis application 27 is formed.

【００３１】このようにして作成された新規波形辞書４
１は、辞書のサイズとしては波形辞書本体２６よりも小
さくなっているのに対して、例えば音声合成アプリケー
ションにおける合成音声の明瞭度は、特定の条件下にお
いては波形辞書本体２６を用いているのと遜色無いもの
とすることが可能である。The new waveform dictionary 4 thus created
1 indicates that the size of the dictionary is smaller than that of the waveform dictionary main body 26, whereas the clarity of synthesized speech in a speech synthesis application, for example, is that the waveform dictionary main body 26 is used under specific conditions. It is possible to make it comparable.

【００３２】また、図５に示すように、新たな辞書を作
成する際に、カバー率を確認しながら作成することもで
きる。ここでカバー率とは、文章を作成した場合に、新
規作成辞書に登録された語によって当該文章で使用され
る語の何％をカバーできているのかを表す指標である。
図５において、５１はカバー率計算装置を、５２はカバ
ー率表示装置を、それぞれ示す。As shown in FIG. 5, when a new dictionary is created, it can be created while checking the coverage. Here, the cover rate is an index indicating what percentage of the words used in the sentence are covered by the words registered in the newly created dictionary when the sentence is created.
In FIG. 5, reference numeral 51 denotes a cover ratio calculating device, and 52 denotes a cover ratio display device.

【００３３】新規作成辞書２５が作成されたら、元の辞
書本体２３とともにカバー率計算装置５１が参照して、
新規作成辞書２５がどの程度までカバーできているのか
を示す客観的な指標を計算する。そして、かかる計算値
をカバー率表示装置５２において常時表示する。When the newly created dictionary 25 is created, the coverage calculator 51 refers to the dictionary with the original dictionary body 23, and
An objective index indicating how much the newly created dictionary 25 can cover is calculated. Then, the calculated value is always displayed on the coverage ratio display device 52.

【００３４】このように、新規作成辞書２５におけるカ
バー率を監視しながら入力装置２１から入力する条件や
コーパス２４の内容等を変更することができるので、利
用者の使用に耐えうる範囲内で、可能な限り辞書サイズ
を小さくすることが可能となる。As described above, the conditions input from the input device 21 and the contents of the corpus 24 can be changed while monitoring the coverage in the newly created dictionary 25. It is possible to reduce the dictionary size as much as possible.

【００３５】また、図６はサイズ圧縮の対象となるのが
辞書ではなく波形辞書が対象である場合に、明瞭度を確
認しながら新たな波形辞書を作成する実施例を示す。す
なわち、波形データを削減して新たな波形辞書を作成し
た場合の、最終的に出力される合成音声の明瞭度を明確
にすることで、利用者が利用できる限度まで波形辞書サ
イズを圧縮しようとするものである。図６において、６
１は明瞭度推定装置を、６２は明瞭度表示装置を、それ
ぞれ示す。FIG. 6 shows an embodiment for creating a new waveform dictionary while confirming the clarity when the size compression target is not a dictionary but a waveform dictionary. In other words, when a new waveform dictionary is created by reducing the waveform data, the clarity of the finally output synthesized speech is clarified, so that the size of the waveform dictionary is reduced to the limit that can be used by the user. Is what you do. In FIG. 6, 6
Reference numeral 1 denotes a clarity estimation device, and 62 denotes a clarity display device.

【００３６】図６では、新規波形辞書４１が作成された
ら、元の波形辞書本体２６とともに明瞭度推定装置６１
が参照して、新規波形辞書４１によってどの程度まで合
成音声が明瞭であるのかを示す客観的な指標を計算す
る。そして、かかる計算値を明瞭度表示装置６２におい
て常時表示する。In FIG. 6, when a new waveform dictionary 41 is created, the clarity estimating device 61 is created together with the original waveform dictionary main body 26.
Calculates an objective index indicating to what extent the synthesized speech is clear by the new waveform dictionary 41. Then, the calculated value is always displayed on the clarity display device 62.

【００３７】このように、新規波形辞書４１における明
瞭度を監視しながら入力装置２１から入力する条件やコ
ーパス２４の内容等を変更することができるので、利用
者の使用に耐えうる範囲内で、可能な限り波形辞書のサ
イズを小さくすることが可能となる。As described above, the conditions input from the input device 21 and the contents of the corpus 24 can be changed while monitoring the clarity of the new waveform dictionary 41. The size of the waveform dictionary can be reduced as much as possible.

【００３８】以上のように本実施の形態によれば、音声
合成アプリケーションにおいて、変換のための辞書サイ
ズを、変換効率を落とすことなく圧縮することが可能と
なる。As described above, according to the present embodiment, in a speech synthesis application, the dictionary size for conversion can be compressed without lowering the conversion efficiency.

【００３９】次に、本発明の実施の形態にかかる音声合
成アプリケーションに用いる変換辞書圧縮装置を実現す
るプログラムの処理の流れについて説明する。図８に本
発明の実施の形態にかかる音声合成アプリケーションに
用いる変換辞書圧縮装置を実現するプログラムの処理の
流れ図を示す。Next, a description will be given of a processing flow of a program for realizing the conversion dictionary compression apparatus used for the speech synthesis application according to the embodiment of the present invention. FIG. 8 is a flowchart showing the processing of a program for realizing the conversion dictionary compression device used for the speech synthesis application according to the embodiment of the present invention.

【００４０】図７において、利用者が辞書を圧縮する条
件の一つとして辞書サイズを入力する（ステップＳ７０
１）。そして、優先順位の高い基本語から順に辞書本体
から抽出する（ステップＳ７０２）。In FIG. 7, the user inputs a dictionary size as one of the conditions for compressing the dictionary (step S70).
1). Then, the basic words are extracted from the dictionary body in descending order of priority (step S702).

【００４１】辞書本体から抽出した基本語は、新たな辞
書を構成するデータとして、新規作成辞書に順次格納さ
れていく（ステップＳ７０３）。そして、新規作成辞書
が当所利用者が要求していた辞書サイズになったら（ス
テップＳ７０４：Ｙｅｓ）、当該新規作成辞書をアプリ
ケーション用の辞書を作成することが可能となる。The basic words extracted from the dictionary body are sequentially stored in the newly created dictionary as data constituting a new dictionary (step S703). When the newly created dictionary has the dictionary size requested by the user in our office (step S704: Yes), the newly created dictionary can be created as a dictionary for an application.

【００４２】本発明の実施の形態にかかる音声合成アプ
リケーションに用いる変換辞書圧縮装置を実現するプロ
グラムを記憶した記録媒体は、図８に示す記録媒体の例
に示すように、ＣＤ−ＲＯＭ８２−１やフロッピーディ
スク８２−２等の可搬型記録媒体８２だけでなく、通信
回線の先に備えられた他の記憶装置８１や、コンピュー
タ８３のハードディスクやＲＡＭ等の記録媒体８４のい
ずれでも良く、プログラム実行時には、プログラムはロ
ーディングされ、主メモリ上で実行される。As shown in the example of the recording medium shown in FIG. 8, the recording medium storing the program for realizing the conversion dictionary compression apparatus used for the speech synthesizing application according to the embodiment of the present invention is a CD-ROM 82-1. Not only a portable recording medium 82 such as a floppy disk 82-2, but also any other storage device 81 provided at the end of a communication line or a recording medium 84 such as a hard disk or a RAM of a computer 83 may be used. , The program is loaded and executed on the main memory.

【００４３】また、本発明の実施の形態にかかる音声合
成アプリケーションに用いる変換辞書圧縮装置により生
成された新規作成辞書等を記録した記録媒体も、図８に
示す記録媒体の例に示すように、ＣＤ−ＲＯＭ８２−１
やフロッピーディスク８２−２等の可搬型記録媒体８２
だけでなく、通信回線の先に備えられた他の記憶装置８
１や、コンピュータ８３のハードディスクやＲＡＭ等の
記録媒体８４のいずれでも良く、例えば本発明にかかる
音声合成アプリケーションに用いる変換辞書圧縮装置を
利用する際にコンピュータ８３により読み取られる。Also, as shown in the example of the recording medium shown in FIG. 8, a recording medium that records a newly created dictionary and the like generated by the conversion dictionary compression device used for the speech synthesis application according to the embodiment of the present invention is CD-ROM 82-1
Recording medium 82 such as a disk or a floppy disk 82-2
Not only other storage devices 8 provided at the end of the communication line
1 or a recording medium 84 such as a hard disk or a RAM of the computer 83, which is read by the computer 83 when using the conversion dictionary compression apparatus used for the speech synthesis application according to the present invention.

【００４４】[0044]

【発明の効果】以上のように本発明にかかる音声合成ア
プリケーションに用いる変換辞書圧縮装置によれば、コ
ーパス情報に従って、辞書に含まれる語彙の優先順位を
動的に変化させることができるので、特定の雑誌に関す
る語彙を優先的に抽出するとか、優先基準が短期間で頻
繁に変化する場合においても、抽出すべき基本語の探索
を効率良く行うことが可能となる。As described above, according to the conversion dictionary compression apparatus used in the speech synthesis application according to the present invention, the priority of words included in the dictionary can be dynamically changed according to the corpus information. Even if the vocabulary relating to the magazine is preferentially extracted or the priority standard changes frequently in a short period of time, it is possible to efficiently search for the basic words to be extracted.

[Brief description of the drawings]

【図１】本発明の実施の形態にかかる音声合成アプリ
ケーションに用いる変換辞書圧縮装置の構成図FIG. 1 is a configuration diagram of a conversion dictionary compression device used for a speech synthesis application according to an embodiment of the present invention.

【図２】本発明の一実施例にかかる音声合成アプリケ
ーションに用いる変換辞書圧縮装置の構成図FIG. 2 is a configuration diagram of a conversion dictionary compression device used for a speech synthesis application according to an embodiment of the present invention;

【図３】本発明の他の実施例にかかる音声合成アプリ
ケーションに用いる変換辞書圧縮装置の構成図FIG. 3 is a configuration diagram of a conversion dictionary compression device used for a speech synthesis application according to another embodiment of the present invention.

【図４】本発明の他の実施例にかかる音声合成アプリ
ケーションに用いる変換辞書圧縮装置の構成図FIG. 4 is a configuration diagram of a conversion dictionary compression device used for a speech synthesis application according to another embodiment of the present invention.

【図５】本発明の他の実施例にかかる音声合成アプリ
ケーションに用いる変換辞書圧縮装置の構成図FIG. 5 is a configuration diagram of a conversion dictionary compression apparatus used for a speech synthesis application according to another embodiment of the present invention.

【図６】本発明の他の実施例にかかる音声合成アプリ
ケーションに用いる変換辞書圧縮装置の構成図FIG. 6 is a configuration diagram of a conversion dictionary compression device used for a speech synthesis application according to another embodiment of the present invention.

【図７】本発明の実施の形態にかかる音声合成アプリ
ケーションに用いる変換辞書圧縮装置における処理の流
れ図FIG. 7 is a flowchart of processing in the conversion dictionary compression device used for the speech synthesis application according to the embodiment of the present invention;

【図８】記録媒体の例示図FIG. 8 is an exemplary diagram of a recording medium.

[Explanation of symbols]

１１辞書サイズ入力部１２優先順位判定部１３、２３辞書本体１４、２４コーパス１５、２５新規作成辞書２１入力装置２２優先順位判定装置２６波形辞書２７音声合成アプリケーション３１優先度変更装置４１新規波形辞書５１カバー率計算装置５２カバー率表示装置６１新規波形辞書６１明瞭度推定装置６２明瞭度表示装置８１回線先の記憶装置８２ＣＤ−ＲＯＭやフロッピーディスク等の可搬型記
録媒体８２−１ＣＤ−ＲＯＭ８２−２フロッピーディスク８３コンピュータ８４コンピュータ上のＲＡＭ／ハードディスク等の記
録媒体DESCRIPTION OF SYMBOLS 11 Dictionary size input part 12 Priority judgment part 13, 23 Dictionary main body 14, 24 Corpus 15, 25 Newly created dictionary 21 Input device 22 Priority judgment device 26 Waveform dictionary 27 Voice synthesis application 31 Priority change device 41 New waveform dictionary 51 Coverage ratio calculation device 52 Coverage ratio display device 61 New waveform dictionary 61 Clarity estimation device 62 Clarity display device 81 Storage device of line destination 82 Portable recording medium such as CD-ROM or floppy disk 82-1 CD-ROM 82- 2 Floppy disk 83 Computer 84 Recording media such as RAM / hard disk on computer

Claims

[Claims]

1. A dictionary size input unit for inputting the size of a dictionary, and a priority determination unit for determining a priority of a vocabulary stored in the dictionary, wherein the priority determination unit determines a priority of a vocabulary based on corpus information. A speech synthesis application characterized by sequentially extracting vocabularies of high priority from the dictionary body, storing the extracted vocabulary until the dictionary size input by the dictionary size input unit and creating a new dictionary. Conversion dictionary compression device to be used.

2. A step of inputting a size of the dictionary, and a step of determining a priority of a vocabulary stored in the dictionary, extracting words having a higher priority from the dictionary body in order based on the corpus information, A conversion dictionary compression method for use in a speech synthesis application, wherein a new dictionary is created by storing the vocabulary extracted until the input dictionary size is reached.

3. A step of inputting a size of the dictionary, and a step of determining a priority of a vocabulary stored in the dictionary, extracting words having a higher priority from the dictionary body in order based on the corpus information, A computer-readable recording medium storing a program to be executed by a computer, storing the vocabulary extracted until the input dictionary size is reached, and creating a new dictionary.