JPH0229796A - Numeral reading adding means for rule voice synthesizing device - Google Patents

Numeral reading adding means for rule voice synthesizing device

Info

Publication number
JPH0229796A
Authority
JP
Japan
Prior art keywords
words
reading
readings
numeral
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP63180587A
Other languages
Japanese (ja)
Other versions
JP3002202B2 (en)
Inventor
Junko Komatsu
小松 順子
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd
Priority to JP63180587A
Publication of JPH0229796A
Application granted
Publication of JP3002202B2
Anticipated expiration
Expired - Lifetime

Abstract

PURPOSE: To assign correct readings during morphological analysis and to handle general phonological changes, by registering in a dictionary in advance, as exceptional readings, all combinations of a numeral and a counter (classifier) whose phonological change cannot be handled by rule. CONSTITUTION: The device has a means that converts the various numeric notations, such as decimals, fractions, and round numbers, and the notational variation between Arabic and kanji numerals, into a standard form based on kanji numerals; a means that morphologically analyzes the standardized text, divides it into words, and gives each word its part of speech, reading, and so on; and a means that handles the phonological change caused by combining a numeral with a counter. Every numeral-counter combination whose phonological change cannot be handled by the rules based on the phoneme sequence at the junction between numeral and counter is treated as an exceptional reading, and the one-digit numeral and the counter are registered together in the dictionary as a single word. The correct reading is then assigned during morphological analysis, combinations not registered as exceptional readings are given the standard reading, and the junction rules are applied last to handle general phonological changes.

Description

Detailed Description of the Invention

Field of Industrial Application

The present invention relates to a method for assigning readings to numerals in a rule-based speech synthesizer that reads aloud, as speech output, arbitrary Japanese text containing numeric notation.

Prior Art

Natural language processing equipment has advanced remarkably in recent years; one example is the rule-based speech synthesizer, which outputs arbitrary Japanese text as speech. Such a device must also handle numerals, but numeric notation in Japanese text mixes kanji numerals, Arabic numerals, and other forms. Furthermore, depending on the combination of a numeral and a counter (classifier), some expressions take an idiomatic reading, such as 「二日」 (ふつか, futsuka), and in others the reading of the numeral or the counter changes, as in 「9本」 (きゅうほん, kyūhon) versus 「10本」 (じゅっぽん, juppon).

To cope with the changes in reading caused by combining a numeral with a counter, the following method has conventionally been used. First, the patterns of phonological change in the numeral are classified into several types, information indicating which of these patterns a given counter triggers is attached to that counter, and the phonemes of the numeral are changed according to the numeral-counter combination.

Next, the patterns of phonological change in the counter itself are likewise classified into several types, information indicating which pattern a given counter takes is attached to that counter, and the phonemes of the counter are changed according to the numeral it combines with.

Problems to Be Solved by the Invention

With this method, however, the numeral phonological-change pattern and the counter phonological-change pattern that each counter takes must be registered in advance. Consequently, when the input text contains an unknown counter (Japanese text uses a wide variety of numeric expressions), the phonological change cannot be handled.

Means for Solving the Problems

The invention provides: a means for converting the various numeric notations, such as decimals, fractions, and round numbers, together with the notational variation between Arabic and kanji numerals, into a standard form based on kanji numerals; a means for morphologically analyzing the standardized text, dividing it into words, and giving each word its part of speech, reading, and so on; and a means for handling the phonological change caused by combining a numeral with a counter. Every combination whose phonological change cannot be handled by the rules based on the phoneme sequence at the junction between numeral and counter is treated as an exceptional reading, and the one-digit numeral and the counter are registered together in the dictionary as a single word. The correct reading is assigned during morphological analysis, combinations not registered as exceptional readings are given the standard reading, and finally the junction rules are applied to handle the phonological change between numeral and counter.

Among the exceptional readings, those read with a native Japanese (wago) reading are given a distinct part of speech, and during morphological analysis they are not allowed to connect to an ordinary numeral. As a result, the wago reading is used when the number is one digit, and the standard reading is used otherwise.

Operation

Some patterns of phonological change can be handled by the rules based on the phoneme sequence at the junction between numeral and counter, and some cannot. However, the numeral-counter combinations whose changes the rules cannot handle are few, so all of them are registered in the dictionary in advance as exceptional readings.

The correct reading is then assigned during morphological analysis. The large majority of combinations, which are not registered in the dictionary as exceptional readings, are given the standard reading, and finally the rules based on the phoneme sequence at the numeral-counter junction are applied to handle the phonological change.

In addition, the connection check between adjacent words that morphological analysis already performs is used to distinguish the cases that take a wago reading from those that do not.

In short, the numeral reading assignment process is simplified, and even if the input text contains an unknown counter, general changes in reading can be handled as long as the counter's standard reading is given.

Embodiment

An embodiment of the present invention will be described with reference to the drawing.

As shown in the drawing, the numeral reading assignment process of this embodiment reads in one sentence and then proceeds in three stages: standardization of numeric notation, morphological analysis, and assignment of numeral readings.
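The three stages described above can be sketched as a simple pipeline. This is a minimal illustration rather than the patented implementation: the function name `assign_numeral_readings`, the `Word` type, and the three toy stage functions are all hypothetical and cover only the single input 「10本」.

```python
from typing import Callable, List, Tuple

Word = Tuple[str, str]  # (surface form, reading in hiragana); hypothetical type


def assign_numeral_readings(
    sentence: str,
    standardize: Callable[[str], str],
    analyze: Callable[[str], List[Word]],
    apply_junction_rules: Callable[[List[Word]], List[Word]],
) -> List[Word]:
    """Run the three stages in the order the embodiment describes."""
    standardized = standardize(sentence)   # stage 1: notation -> standard form
    words = analyze(standardized)          # stage 2: words with dictionary readings
    return apply_junction_rules(words)     # stage 3: rule-based sound changes


# Toy stand-ins for each stage, just enough to trace one input (assumptions).
def toy_standardize(s: str) -> str:
    return s.replace("10", "十")


def toy_analyze(s: str) -> List[Word]:
    readings = {"十": "じゅう", "本": "ほん"}
    return [(ch, readings[ch]) for ch in s]


def toy_rules(words: List[Word]) -> List[Word]:
    out = list(words)
    for i in range(len(out) - 1):
        # じゅう + ほ... becomes じゅっ + ぽ... (gemination plus semi-voicing)
        if out[i][1] == "じゅう" and out[i + 1][1].startswith("ほ"):
            out[i] = (out[i][0], "じゅっ")
            out[i + 1] = (out[i + 1][0], "ぽ" + out[i + 1][1][1:])
    return out


print(assign_numeral_readings("10本", toy_standardize, toy_analyze, toy_rules))
```

Passing the stages in as callables keeps the ordering explicit: the sound-change rules only ever see words that already carry a dictionary reading.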

First, the standardization of numeric notation converts the various numeric notations, such as decimals, fractions, and round numbers, and the notational variation between Arabic and kanji numerals, into a standard form based on kanji numerals, as shown in Table 1. As a result, only the standard kanji-numeral form needs to be registered in the dictionary.

Table 1. Numeric notations and their standard forms

In the dictionary, meanwhile, ordinary numerals are given standard readings such as those shown in Table 2.

Table 2. Standard readings of numerals
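As a rough illustration of the standardization stage, the sketch below rewrites Arabic integers and decimals as kanji numerals. The contents of Table 1 are not available in this text, so the target form used here (positional kanji with 十, 百, 千, 万, and decimals written digit by digit after 点) is an assumption, as are the function names `int_to_kanji` and `standardize`. Fractions and round numbers are not covered.

```python
import re

# Assumed standard form: positional kanji numerals (not taken from Table 1).
DIGITS = "〇一二三四五六七八九"
SMALL_UNITS = ["", "十", "百", "千"]
FULLWIDTH_TO_ASCII = str.maketrans("0123456789", "0123456789")


def four_digits_to_kanji(n):
    """Convert 0..9999 to kanji, omitting a leading 一 before 十, 百, 千."""
    out = []
    for power in range(3, -1, -1):
        digit = (n // 10 ** power) % 10
        if digit == 0:
            continue
        if digit == 1 and power > 0:
            out.append(SMALL_UNITS[power])
        else:
            out.append(DIGITS[digit] + SMALL_UNITS[power])
    return "".join(out)


def int_to_kanji(n):
    """Convert 0..99,999,999 to kanji numerals."""
    if n == 0:
        return DIGITS[0]
    high, low = divmod(n, 10000)
    result = four_digits_to_kanji(high) + "万" if high else ""
    return result + four_digits_to_kanji(low)


def standardize(text):
    """Rewrite full-width or ASCII integers and decimals as kanji numerals."""
    text = text.translate(FULLWIDTH_TO_ASCII)

    def repl(match):
        whole, frac = match.group(1), match.group(2)
        kanji = int_to_kanji(int(whole))
        if frac:  # decimal part written digit by digit after 点
            kanji += "点" + "".join(DIGITS[int(d)] for d in frac)
        return kanji

    return re.sub(r"([0-9]{1,8})(?:\.([0-9]+))?", repl, text)


print(standardize("10本"), standardize("13日"), standardize("3.14"))
```

With notation normalized this way, the dictionary in the next stage only needs entries keyed on kanji numerals, which is the benefit the text attributes to standardization.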

Table 3 shows examples of combinations whose phonological change cannot be handled by the rules based on the phoneme sequence at the numeral-counter junction, and for which the one-digit numeral and the counter are therefore registered together in the dictionary as a single word, as an exceptional reading.

Table 3. Exceptional and idiomatic numeral readings

As Table 3 shows, the exceptional-reading words of this embodiment are divided into two groups. Words in both groups have changes in reading that the rules cannot handle, but those in Group 2 in particular take a native Japanese (wago, idiomatic) reading. Wago readings such as those in Group 2 are used only for one-digit numbers (1 to 9). Consider the following examples of distinguishing wago readings during morphological analysis:

Example 1: 「九月三日」 → 「九月」 (くがつ, kugatsu) 「三日」 (みっか, mikka)
Example 2: 「十三日」 → 「十」 (じゅー, jū) 「三」 (さん, san) 「日」 (にち, nichi)

In a case like Example 1, the reading follows Group 2. When the number is no longer one digit, as with 「十三」 in Example 2, 「十」 is given its reading as an ordinary numeral. That is, words belonging to Group 2 are not allowed to connect to an ordinary numeral, which separates the cases read with a wago reading, as in Example 1, from those that are not, as in Example 2. If a combination is not registered in the dictionary as an exceptional reading, the numeral and the counter are analyzed as separate words and each is given its standard reading.
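The dictionary lookup and connection check described above can be sketched as follows. The entries are hypothetical, since Table 3 is not available in this text, and the greedy scan in `analyze` is a simplified stand-in for a real morphological analyzer. The group assigned to each entry is likewise an assumption made for illustration.

```python
# Hypothetical exception entries: surface -> (reading, group).
# Group 1: rule-resistant readings; Group 2: native Japanese (wago) readings.
EXCEPTIONS = {
    "九月": ("くがつ", 1),
    "二日": ("ふつか", 2),
    "三日": ("みっか", 2),
}
# Standard readings for ordinary numerals and counters (small assumed subset).
STANDARD_NUMERALS = {"一": "いち", "二": "に", "三": "さん", "九": "きゅう", "十": "じゅう"}
STANDARD_COUNTERS = {"日": "にち", "月": "がつ", "本": "ほん"}


def analyze(expr):
    """Split a standardized numeral expression into (surface, reading) words."""
    words = []
    i = 0
    while i < len(expr):
        pair = expr[i:i + 2]
        prev_is_numeral = i > 0 and expr[i - 1] in STANDARD_NUMERALS
        entry = EXCEPTIONS.get(pair)
        # Connection check: a Group 2 (wago) word may not follow an ordinary numeral.
        if entry and not (entry[1] == 2 and prev_is_numeral):
            words.append((pair, entry[0]))
            i += 2
            continue
        # Otherwise numeral and counter are separate words with standard readings.
        ch = expr[i]
        reading = STANDARD_NUMERALS.get(ch) or STANDARD_COUNTERS.get(ch)
        if reading is None:
            raise ValueError(f"no dictionary entry for {ch!r}")
        words.append((ch, reading))
        i += 1
    return words


print(analyze("九月三日"))  # wago reading みっか is allowed
print(analyze("十三日"))    # 三日 is blocked after 十, so standard readings are used
```

Encoding the restriction as a connection rule means the one-digit versus multi-digit distinction needs no separate digit counting; it falls out of which words are allowed to be adjacent.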

Finally, among the numerals and counters that have been given standard readings, those that undergo a phonological change such as gemination (sokuon), voicing (dakuon), or semi-voicing (handakuon) have their readings changed by the phonological change rules.

The rules used here are defined by the phoneme sequence at the junction between numeral and counter, as in Tables 4 and 5.

Table 4. Phonological change rules for numerals

Table 5. Phonological change rules for counters

Thus, the numeral reading assignment process of this embodiment simplifies the processing without lowering the accuracy of numeral reading assignment, and even if the input text contains an unknown counter, a reading that accommodates general phonological changes can be assigned as long as the counter's standard reading is given.
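To make the final rule-application stage concrete, here is a small sketch of junction rules keyed only on the numeral's standard reading and the first kana of the counter's standard reading. The contents of Tables 4 and 5 are not available in this text, so the rules below are illustrative assumptions covering a few common gemination, semi-voicing, and voicing cases; the names `GEMINATING` and `apply_junction_rules` are hypothetical.

```python
# Illustrative rules only; they stand in for Tables 4 and 5 (an assumption).
K_ROW, S_ROW, T_ROW, H_ROW = "かきくけこ", "さしすせそ", "たちつてと", "はひふへほ"
H_TO_P = str.maketrans("はひふへほ", "ぱぴぷぺぽ")  # semi-voicing
H_TO_B = str.maketrans("はひふへほ", "ばびぶべぼ")  # voicing

# Numeral reading -> (geminated form, counter-initial kana that trigger it)
GEMINATING = {
    "いち": ("いっ", K_ROW + S_ROW + T_ROW + H_ROW),
    "はち": ("はっ", K_ROW + S_ROW + T_ROW + H_ROW),
    "じゅう": ("じゅっ", K_ROW + S_ROW + T_ROW + H_ROW),
    "ろく": ("ろっ", K_ROW + H_ROW),
}


def apply_junction_rules(numeral, counter):
    """Adjust standard readings based on the kana at the numeral-counter junction."""
    first = counter[0]
    rule = GEMINATING.get(numeral)
    if rule and first in rule[1]:
        numeral = rule[0]  # numeral side: final mora becomes っ
        if first in H_ROW:
            counter = first.translate(H_TO_P) + counter[1:]  # counter side: h -> p
    elif numeral == "さん" and first in H_ROW:
        counter = first.translate(H_TO_B) + counter[1:]      # counter side: h -> b
    return numeral, counter


print(apply_junction_rules("じゅう", "ほん"))  # ('じゅっ', 'ぽん')
print(apply_junction_rules("きゅう", "ほん"))  # ('きゅう', 'ほん')
```

Because the rules never consult a per-counter table, a counter that is new to the system is handled the same way as a known one once its standard reading is supplied. Combinations these rules get wrong would belong in the exception dictionary instead.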

Effects of the Invention

As described above, patterns of phonological change fall into two kinds: those that can be handled by the rules based on the phoneme sequence at the junction between numeral and counter, and those that cannot. Because the numeral-counter combinations of the second kind are few, the present invention registers all of them in the dictionary in advance as exceptional readings and assigns the correct reading during morphological analysis; combinations not registered as exceptional readings are given the standard reading, and the junction rules are applied last to handle the phonological change. In addition, the connection check between adjacent words performed during morphological analysis is used to distinguish wago readings from other readings. The numeral reading assignment process can therefore be simplified without lowering its accuracy, and even if the input text contains an unknown counter, a reading that accommodates general phonological changes can be assigned as long as the counter's standard reading is given.

Brief Description of the Drawing

The drawing is a flowchart showing the processing procedure of the present invention.

Claims (1)

Scope of Claims

1. A method for assigning numeral readings in a rule-based speech synthesizer that outputs arbitrary Japanese text as speech, the synthesizer comprising: a means for converting the various numeric notations, such as decimals, fractions, and round numbers, and the notational variation between Arabic and kanji numerals, into a standard form based on kanji numerals; a means for morphologically analyzing the standardized text, dividing it into words, and giving each word its part of speech, reading, and so on; and a means for handling the phonological change caused by combining a numeral with a counter; wherein every combination whose phonological change cannot be handled by the rules based on the phoneme sequence at the junction between numeral and counter is registered in the dictionary as an exceptional reading, with the one-digit numeral and the counter together as a single word; the correct reading is assigned during morphological analysis; combinations not registered in the dictionary as exceptional readings are given the standard reading; and finally the rules based on the phoneme sequence at the numeral-counter junction are applied to handle the phonological change between numeral and counter.

2. The method for assigning numeral readings in a rule-based speech synthesizer according to claim 1, wherein, among the exceptional readings, those read with a native Japanese (wago) reading are given a distinct part of speech and, during morphological analysis, are not allowed to connect to an ordinary numeral, so that the wago reading is used for a one-digit number and the standard reading is used otherwise.
JP63180587A 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer Expired - Lifetime JP3002202B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63180587A JP3002202B2 (en) 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63180587A JP3002202B2 (en) 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer

Publications (2)

Publication Number Publication Date
JPH0229796A true JPH0229796A (en) 1990-01-31
JP3002202B2 JP3002202B2 (en) 2000-01-24

Family

ID=16085871

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63180587A Expired - Lifetime JP3002202B2 (en) 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer

Country Status (1)

Country Link
JP (1) JP3002202B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101732667B1 (en) * 2016-04-27 2017-05-08 주식회사 필룩스 Lighting apparatus

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5171171A (en) * 1988-12-28 1992-12-15 Yamaha Hatsudoki Kabushiki Kaisha Kill switch assembly for small watercraft
JPH07129619A (en) * 1993-10-29 1995-05-19 Hiuka Sangyo Kk Voice electronic book
JP2005331775A (en) * 2004-05-20 2005-12-02 Nippon Hoso Kyokai <Nhk> Voice synthesizer and voice synthesis program
JP4603290B2 (en) * 2004-05-20 2010-12-22 日本放送協会 Speech synthesis apparatus and speech synthesis program

Also Published As

Publication number Publication date
JP3002202B2 (en) 2000-01-24

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20071112

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081112

Year of fee payment: 9

EXPY Cancellation because of completion of term