JPS62208125A

JPS62208125A - Sentence reading device

Info

Publication number: JPS62208125A
Application number: JP61051943A
Authority: JP
Inventors: Fukami Kamiyama; 神山　ふかみ; Makoto Sueda; 末田　信; Tetsuo Tamura; 田村　鉄夫
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1986-03-10
Filing date: 1986-03-10
Publication date: 1987-09-12
Also published as: JPS6349244B2

Abstract

PURPOSE:To attain reading to be easily listened by inserting silent codes into a sentence reading string in accordance with previously set conditions. CONSTITUTION:When a sentence consisting of a character string is inputted from a document input part 11, a sentence analyzing part 12 analyzes the sentence while referring a reading dictionary 13 to execute word identification processing 121. Then, silent insertion condition deciding processing 122 is executed in accordance with previously set condition such as the continuation of the same vowel on a border of words or the continuation of specified word groups such as numerals and reading string forming processing 123 for a reading string into which a necessary silent part is inserted is executed. The reading string processed by said processing is outputted as voice through a reading string storing part 14, a voice output part 15, and so on. Thus, sentence reading to be easily listened can be attained by inserting required silent parts.

Description

【発明の詳細な説明】〔概要〕文章読上げ装置から出力される文章の読上げ音声中で、
単語間が母音同士で接続されたりすると。[Detailed Description of the Invention] [Summary] In the text reading voice output from the text reading device,
When words are connected by vowels.

癒着して聞き分けにくくなる場合があるため、必要に応
じて、単語間に無音区間を挿入できるようにする。Since words may coalesce and become difficult to distinguish, it is possible to insert silent intervals between words as necessary.

[Industrial application field]

本発明は２表記された文章を入力して読上げ出力を行う
文章読上げ装置に関するものであり、特に単語区分を明
瞭にするため無音区間を挿入する読み列生成方式に関す
る。The present invention relates to a text reading device that inputs a sentence written in binary notation and outputs it aloud, and particularly relates to a reading string generation method that inserts silent intervals to clarify word divisions.

[Conventional technology]

一般の日本語文の形で表記された文章を文字入力し、そ
の正しい読みを音声出力する文章読上げ装置は１文書の
確認や校正などの多くの用途において、有用なものとし
て、最近注目されている。Text-to-speech devices that input sentences written in the form of ordinary Japanese sentences and output the correct pronunciation aloud have recently been attracting attention as useful for many purposes such as checking and proofreading a single document. .

日本語文では、１つの漢字に音訓等の複数の“読み”が
存在している場合が多く、意味や連続する単語と単語と
の関係などで用法が異なり、さらには、濁音変化や、音
便変化等が生じうるため。In Japanese sentences, a single kanji often has multiple readings, such as onkun, and the usage differs depending on the meaning and the relationship between consecutive words. Because changes may occur.

文章に対応する正しい“読み”を確定するためには１文
章についての多面的な解析が必要とされる。In order to determine the correct "reading" for a sentence, a multifaceted analysis of a single sentence is required.

第３図は、従来の文章読上げ装置の基本的な構成を示し
たものである。FIG. 3 shows the basic configuration of a conventional text reading device.

図において、３１は文章入力部、３２は文章解析部。In the figure, 31 is a text input section, and 32 is a text analysis section.

３３は読み辞書、３４は読み列格納部、３５は音声出力
部である。33 is a reading dictionary, 34 is a reading sequence storage section, and 35 is an audio output section.

文章入力部３１は、キーボード等を用いて日本語文章を
入力する。The text input unit 31 inputs Japanese text using a keyboard or the like.

文章解析部３２は、読み辞書３３を用いて入力された文
章データを解析し、各単語を同定してその読み列を作成
し、読み列格納部３４に格納する。文章中の単語の同定
は、読み辞書３３から候補単語を取り出し１文章データ
との間でＤＰマツチングを行って、最適な単語の組合わ
せを選択する方法で行われる。The text analysis unit 32 analyzes the input text data using the reading dictionary 33, identifies each word, creates a pronunciation sequence thereof, and stores the pronunciation sequence in the pronunciation sequence storage unit 34. Identification of words in a sentence is performed by extracting candidate words from the reading dictionary 33 and performing DP matching with one sentence data to select an optimal combination of words.

読み辞書３３には、ｉ語の表記とその読み、用法等の文
法が登録されている。単語には、漢字語。In the reading dictionary 33, the notation of the i-word, its pronunciation, usage, and other grammar are registered. The words are kanji words.

カタカナ語、漢字かなまじり語などが含まれる。Includes katakana, kanji, kana, and other words.

音声出力部３５は、読み列格納部３４から文章の読み列
を取り出し、音声合成を行って、音声出力する。The audio output unit 35 takes out the pronunciation of the sentence from the pronunciation storage unit 34, performs speech synthesis, and outputs the result as a voice.

このようにして、任意の表記された文章を読上げ装置に
入力すれば、その適切な読み列が自動的に作成され、読
上げが行われる。In this way, if any written text is input into the reading device, an appropriate reading sequence will be automatically created and the text will be read aloud.

[Problem that the invention seeks to solve]

従来の文章読上げ装置では１作成した読み列の中に母音
が連続していると、読上げの際に、音声合成上の理由か
ら、それらは母音の長音として発声され、たとえば「ア
ア」という表記と「アー」という表記とに対応する発声
は、いずれも“アー”となって９表記の違いを区別して
聞き取ることができなかった。具体例をあげると、「砂
糖売り」は“サトーリ”と発声される。また、別の例と
して、数字が連続する表記、たとえばｒ２２２２」は、
読上げ時に“ニーニーニーニー”と発声され、聞き分け
にくいという問題があった。With conventional sentence reading devices, if there are consecutive vowels in a created reading sequence, for reasons of speech synthesis, they are uttered as long vowels during reading, such as the notation ``aa''. The utterances corresponding to the notation "ah" were all "ah", and it was not possible to distinguish and hear the difference between the nine notations. To give a specific example, "sugar seller" is pronounced as "satori." In addition, as another example, a notation with consecutive numbers, such as "r2222", is
There was a problem with the words being uttered as "nee nee nee nee" when being read aloud, making them difficult to distinguish.

[Means for solving problems]

本発明は２文章解析の際、読み列中の必要部分に無音コ
ードを挿入できるようにして、読上げ発声中に無音区間
を設定し、単語の聞き分けを容易にするものである。The present invention makes it possible to insert silence codes into necessary parts of the reading sequence when analyzing two sentences, and sets silent intervals during reading aloud, thereby making it easier to distinguish between words.

第１図に１本発明の原理的構成を示す。FIG. 1 shows the basic configuration of the present invention.

図において、　１１は文章入力部、１２は文章解析部。In the figure, 11 is a text input section, and 12 is a text analysis section.

１３は読み辞書、　１４は読み列格納部、１５は音声出
力部、１２１は単語同定処理、１２２は無音挿入条件判
定処理、１２３は読み列作成処理を示す。13 is a reading dictionary; 14 is a reading sequence storage unit; 15 is an audio output unit; 121 is a word identification process; 122 is a silence insertion condition determination process; and 123 is a reading sequence creation process.

文章入力部１１は、読上げるべき文章の表記を入力し、
漢字コードの文字列データとして文章解析部１２に供給
する。The text input unit 11 inputs the notation of the text to be read out,
It is supplied to the text analysis section 12 as character string data of Kanji code.

文章解析部１２は、入力された文字列データを種々に区
分し、読み辞書１３を検索して、単語同定処理１２１を
行い１次に同定された単語列を解析し。The text analysis unit 12 divides the input character string data into various types, searches the reading dictionary 13, performs word identification processing 121, and analyzes the primary identified word string.

単語間母音接・続や連続数字などの所定の無音コード挿
入条件に合致する部分を識別し、無音コード挿入位置を
指示する無音挿入条件判定処理１２２を行う。このよう
にして、同定された単語列および無音コード挿入指示に
したがって、読み列作成処理１２３を行う。Silence insertion condition determination processing 122 is performed to identify a portion that matches a predetermined silence code insertion condition such as inter-word vowel connection/continuation or continuous digits, and to specify a silence code insertion position. In this manner, the reading sequence creation process 123 is performed according to the identified word sequence and the silent code insertion instruction.

読み列格納部１４は文章解析部１２により作成された読
み列データを一旦蓄積する。The reading sequence storage unit 14 temporarily stores the reading sequence data created by the text analysis unit 12.

音声出力部１５は、読み列格納部１４に蓄積されている
読み列データを読み出し、音声合成して音声出力する。The audio output unit 15 reads out the reading sequence data stored in the reading sequence storage unit 14, synthesizes the data, and outputs the voice.

その際、読み列データ中に無音コードが存在すれば、そ
の位置に適当な一定の無音区間を挿入して音声出力を区
切る。At this time, if a silence code exists in the pronunciation sequence data, an appropriate fixed silence interval is inserted at that position to delimit the audio output.

[Effect]

本発明によれば、読上げ時に無音区間を挿入する条件を
予め設定しておくことにより、任意の表記された文章に
ついて、読み列中に無音コードを挿入し、その読上げ発
声中に無音区間による区切りを入れることができる。According to the present invention, by setting in advance the conditions for inserting a silent section when reading aloud, a silent code can be inserted in the reading sequence for any written text, and the silent section can be used as a delimiter during the reading. can be entered.

無音コードを挿入する条件としては、同定された単語列
中の順次の単語の境界で同じ母音と母音とが接している
場合、数字などの予め指定されている単語のグループが
連続している場合、その他任意の条件が使用できる。こ
のような条件に合致する単語列が検出された場合、該当
する単語間に無音コードが挿入される。The conditions for inserting a silent code are when the same vowel is in contact with the same vowel at the boundary between successive words in the identified word string, or when pre-specified word groups such as numbers are consecutive. , any other conditions can be used. When a word string matching these conditions is detected, a silence code is inserted between the corresponding words.

特定の単語のグループを指定する方法としては。As a way to specify a specific group of words.

読み辞書中でフラグ等により識別可能にする方法。A method of making it identifiable using flags, etc. in the reading dictionary.

あるいは別にテーブルでもつ方法などがある。Alternatively, there is another way to hold it on a table.

また無音コードの挿入は、無音コードを挿入する可能性
のある各単語に、無音コードを付加したものと付加しな
いものとの２種類を読み辞書中に登録しておき、無音コ
ード挿入の条件判定結果にしたがっていずれか一方を選
択することにより挿入する方法が簡単であるが、読み辞
書内の単語を無音コードなしのもののみとし９条件判定
結果にしたがってプログラムにより無音コードを付加す
る方法も可能である。In addition, to insert a silence code, read two types of words for which a silence code may be inserted, one with and without a silence code, and register them in the dictionary, and then judge the conditions for inserting a silence code. The simplest method is to select one of the words according to the results, but it is also possible to insert only words without silence codes in the reading dictionary and add silence codes by program according to the results of the 9-condition judgment. be.

〔Example〕

第２図に本発明の１実施例の構成を示す。 FIG. 2 shows the configuration of one embodiment of the present invention.

第２図に示されている構成は、第１図の構成を基礎とし
ており、参照番号も同じものが使用されている。ただし
、説明を具体的なレベルで行う必要から１文章解析部１
２中に、細部の手順が追加して示されている。The configuration shown in FIG. 2 is based on the configuration in FIG. 1, and the same reference numerals are used. However, because it is necessary to explain at a concrete level, 1 sentence analysis section 1
2, additional detailed steps are shown.

読み辞書１３には、砂、砂糖、砂漠・・・や１図示を省
略されているが数字１．２．・・・、９等の区切（無音
）を入れて発音することが望まれる単語には。Reading Dictionary 13 includes sand, sugar, desert... and numbers 1, 2, etc., although illustrations are omitted. ..., for words that should be pronounced with a 9th grade (silence).

予め無音コードを付けないものと付けたものとの２種類
を用意して置き、いずれか一方を選択可能にする。勿論
、必ず区切りを入れて発声することが要求されている特
定の単語については、無音コード付きのもののみとする
ことができる。Two types are prepared in advance, one without a silent code and one with one, and one of them can be selected. Of course, for specific words that must be uttered with breaks, only those with silence codes can be used.

文章入力部１１から入力された文章の表記に基づく文字
列データは２文章解析部１２の単語同定処理１２１にお
いて、まず種々に区分され、読み辞書１３を検索する。The character string data based on the notation of the sentence inputted from the sentence input section 11 is first classified into various types in the word identification process 121 of the two-sentence analysis section 12, and then the reading dictionary 13 is searched.

検索の結果、複数の候補単語が得られ、これらの検索結
果の単語は、単語ラティステーブルと呼ばれるテーブル
に順次的に格納される。As a result of the search, a plurality of candidate words are obtained, and these search result words are sequentially stored in a table called a word lattice table.

続いて、単語ラティステーブル内の単語について照合を
行って、入力文章に対応する最適な単語列を確定する。Next, the words in the word lattice table are compared to determine the optimal word string corresponding to the input sentence.

次の無音挿入条件判定処理１２２においては、確定され
た単語列を対象に、無音コード挿入のための条件をチェ
ックし１条件を充たす単語間には。In the next silence insertion condition determination process 122, conditions for inserting a silence code are checked for the determined word string, and between words that satisfy one condition.

無音コードを挿入する指示を行う。Instructs to insert a silence code.

次の読み列作成処理においては、無音コード挿入指示を
伴う単語列に基づいて読み列データを作成する。ここで
無音コード挿入指示があった読み列中の位置には、無音
コードが挿入される。たとえば、「砂糖売り場」の表記
に対応する読み列は。In the next pronunciation sequence creation process, pronunciation data is created based on the word string accompanied by a silent code insertion instruction. Here, the silence code is inserted at the position in the reading sequence where the silence code insertion instruction was given. For example, the reading sequence that corresponds to the notation "sugar counter" is:

サトウ・無音コード・ウリバとなる。Sato, silent code, and Uriba.

作成された読み列データは、読み列格納部１４に一旦格
納された後、音声出力部１５において音声合成され、無
音コードには無音区間（ポーズ）が設定されて、音声出
力される。たとえば、上記の「砂糖売り場」の例では“
サトウ”と“ウリバの間に無音区間が置かれ、単語間の
区切が行われる。The created pronunciation data is once stored in the pronunciation storage unit 14, and then is synthesized into speech in the audio output unit 15, and a silent section (pause) is set in the silent code and output as audio. For example, in the “sugar counter” example above, “
A silent section is placed between "Sato" and "Uriba" to separate the words.

無音コードに対応する無音区間の長さは、予め適当な長
さに設定されている。しかし、複数の無音コードを連続
挿入することにより、無音区間を任意に延ばすことが可
能である。また無音コードを複数種設けて、無音区間の
長さを任意に指定できるようにすることも可能である。The length of the silent section corresponding to the silent code is set to an appropriate length in advance. However, by continuously inserting a plurality of silence codes, it is possible to extend the silence interval arbitrarily. It is also possible to provide a plurality of types of silence codes so that the length of the silence section can be specified arbitrarily.

〔Effect of the invention〕

本発明により１文章読上げ装置から発声出力される読み
列について、単語の聞き分けが容易となり、読上げ校正
に利用される場合など９作業能率と信頼性との向上が可
能となる。According to the present invention, it becomes easy to distinguish between words in a reading sequence outputted from a single-sentence reading device, and work efficiency and reliability can be improved when the reading sequence is used for reading proofreading.

[Brief explanation of drawings]

第１図は本発明の原理的構成図、第２図は本発明の１実
施例の構成図、第３図は従来例装置の基本構成図である
。第１図中。１１：文章入力部１２：文章解析部１３：読み辞書１４：読み列格納部１５：音声出力部１２１：単語同定処理１２２：無音挿入条件判定処理１２３：読み列作成処理FIG. 1 is a basic configuration diagram of the present invention, FIG. 2 is a configuration diagram of one embodiment of the present invention, and FIG. 3 is a basic configuration diagram of a conventional device. In Figure 1. 11: Text input unit 12: Text analysis unit 13: Reading dictionary 14: Reading sequence storage unit 15: Audio output unit 121: Word identification process 122: Silence insertion condition determination process 123: Reading sequence creation process

Claims

[Scope of Claims] A pronunciation dictionary that provides the pronunciation of a written word and a sentence analysis unit are provided, and the text analysis unit receives the written text as input and analyzes the text using the reading dictionary, and generates a pronunciation sequence of the sentence. In the text reading device that generates and outputs audio, the text analysis unit includes means for inserting a silence code into the pronunciation of the word determined as a result of the text analysis, and inserts silence into the pronunciation sequence according to the determination result of the silence code insertion condition. A text reading device that is characterized by inserting a code.