JP3002202B2 - Numeral reading device in rule speech synthesizer - Google Patents

Numeral reading device in rule speech synthesizer

Info

Publication number
JP3002202B2
Authority
JP
Japan
Prior art keywords
reading
word
numeral
classifier
numerical
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP63180587A
Other languages
Japanese (ja)
Other versions
JPH0229796A (en)
Inventor
順子 小松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd
Priority to JP63180587A
Publication of JPH0229796A
Application granted
Publication of JP3002202B2
Anticipated expiration
Expired - Lifetime

Description

[Detailed Description of the Invention]
Field of Industrial Application
The present invention relates to a numeral reading assignment device in a rule-based speech synthesizer that reads aloud arbitrary Japanese text containing numeric notation and outputs it as speech.

Prior Art
In recent years, natural language processing equipment has advanced remarkably. One example is the rule-based speech synthesizer, which outputs arbitrary Japanese text as speech. Such a device must also handle numerals, but Japanese text mixes kanji numerals, Arabic numerals, and other numeric notations. Furthermore, depending on the combination of numeral and classifier (counter word), some expressions take an idiomatic reading, such as 「二日(ふつか)」 (futsuka), and in others the reading of the numeral or the classifier changes, as in 「10本(じゅっぽん)」 (juppon) compared with 「9本(きゅうほん)」 (kyuuhon).

To handle reading changes caused by the combination of a numeral and a classifier, the following method has conventionally been used. First, the phonological change patterns of numerals are classified into several types, and each classifier is annotated with information indicating which pattern it takes; the phonemes of the numeral are then changed according to the combination. Next, the phonological change patterns of the classifiers themselves are classified into several types, and each classifier is annotated with information indicating which pattern it takes; the phonemes of the classifier are then changed according to the numeral it combines with.

Problems to Be Solved by the Invention
With this method, however, the numeral phonological change pattern and the classifier phonological change pattern that each classifier takes must be registered in advance. Consequently, when the input text contains an unknown classifier (and Japanese text uses many kinds of numeric notation), the phonological change cannot be handled.
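The limitation can be illustrated with a minimal Python sketch of the conventional scheme. The pattern IDs (`A`, `P`), the table contents, and the function name `conventional_reading` are hypothetical and are not taken from the patent; readings are keyed by kana for brevity.

```python
# Hypothetical sketch of the conventional approach: every classifier must be
# registered together with the IDs of the change patterns it takes.

H_TO_P = dict(zip("はひふへほ", "ぱぴぷぺぽ"))

# Numeral change patterns (assumed): standard reading -> changed reading.
NUMERAL_PATTERNS = {
    "A": {"いち": "いっ", "ろく": "ろっ", "はち": "はっ", "じゅー": "じゅっ"},
}

# Classifier change patterns (assumed): numerals after which the classifier's
# initial は-row kana becomes the matching ぱ-row kana.
CLASSIFIER_PATTERNS = {
    "P": {"いち", "ろく", "はち", "じゅー"},
}

# Only classifiers registered here can change. 本 (ほん) is registered.
REGISTERED_CLASSIFIERS = {"ほん": ("A", "P")}


def conventional_reading(numeral: str, classifier: str) -> str:
    """Join readings, applying pattern tables only for registered classifiers."""
    patterns = REGISTERED_CLASSIFIERS.get(classifier)
    if patterns is None:
        # Unknown classifier: no pattern information, so nothing changes.
        return numeral + classifier
    numeral_pattern, classifier_pattern = patterns
    new_numeral = NUMERAL_PATTERNS[numeral_pattern].get(numeral, numeral)
    if numeral in CLASSIFIER_PATTERNS[classifier_pattern]:
        classifier = H_TO_P.get(classifier[0], classifier[0]) + classifier[1:]
    return new_numeral + classifier
```

In this sketch the registered classifier ほん yields いっぽん for 一本, while the unregistered classifier はく (泊) is left as いちはく instead of the expected いっぱく, because no pattern information exists for it.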

Means for Solving the Problems
In a numeral reading assignment device for a speech synthesizer that has a word dictionary and a rule table and performs a morphological analysis process and a numeral reading assignment process: the word dictionary stores the part of speech and reading of each word, and also stores a numeral-plus-classifier sequence as a single word with a native Japanese (wago) reading; the rule table stores the phonological change corresponding to each phoneme sequence at the junction between a numeral and a classifier; the morphological analysis process divides the input Japanese text into words using the word dictionary and assigns each word a part of speech and a reading; and the numeral reading assignment process, when the part of speech and reading assigned by morphological analysis match a phoneme sequence in the rule table, changes the reading according to the corresponding phonological change.

In addition, when the sequence of divided words is a numeral followed by a wago-reading word, the morphological analysis process splits the wago-reading word into a numeral and a classifier and assigns readings to them.

Operation
Phonological change patterns are of two kinds: those that can be handled by rules based on the phoneme sequence at the junction between numeral and classifier, and those that such rules cannot handle. Because only a small number of numeral-classifier combinations undergo changes that the rules cannot handle, all of them are registered in the dictionary in advance as exception readings, and the correct reading is assigned during morphological analysis. The majority of combinations, which are not registered as exception readings, are given standard readings, and the phonological change is applied last by the rules based on the phoneme sequence at the junction.
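The order of operations described here (exception lookup first, then standard readings, then the junction rule) can be sketched in Python as follows. The dictionary contents, the names `EXCEPTIONS`, `STANDARD`, `assign_reading`, and the placeholder rule `no_change_rule` are illustrative assumptions, not the patent's tables.

```python
from typing import Callable

# Assumed exception readings: numeral + classifier registered as one word.
EXCEPTIONS = {("二", "日"): "ふつか"}

# Assumed standard readings for individual numerals and classifiers.
STANDARD = {"二": "に", "九": "きゅう", "日": "にち", "本": "ほん"}


def no_change_rule(numeral_reading: str, classifier_reading: str) -> str:
    """Placeholder junction rule that simply concatenates the readings."""
    return numeral_reading + classifier_reading


def assign_reading(
    numeral: str,
    classifier: str,
    rule: Callable[[str, str], str] = no_change_rule,
) -> str:
    """Return the reading of numeral + classifier.

    1. If the pair is registered as an exception, use that reading as is.
    2. Otherwise give both parts their standard readings.
    3. Finally let the junction rule adjust the standard readings.
    """
    exception = EXCEPTIONS.get((numeral, classifier))
    if exception is not None:
        return exception
    return rule(STANDARD[numeral], STANDARD[classifier])
```

A real junction rule would be passed in through the `rule` parameter; the placeholder is enough to show that exception entries bypass the rule stage entirely.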

Furthermore, by using the connection check between adjacent words that morphological analysis performs, cases that take a wago reading are distinguished from those that do not.

In short, numeral reading assignment is simplified, and even if the input text contains an unknown classifier, general reading changes can be handled as long as the classifier's standard reading is supplied.

Embodiment
An embodiment of the present invention is described with reference to the drawing. As the drawing shows, numeral reading assignment in this embodiment is carried out, after one sentence has been read in, in three stages: standardization of numeric notation, morphological analysis, and assignment of numeral readings.

First, standardization of numeric notation converts, as shown in Table 1, the various numeric notations such as decimals, fractions, and approximate numbers, as well as the notational variation between Arabic and kanji numerals, into a standard form based on kanji numerals. As a result, only the standard kanji-numeral notation needs to be registered in the dictionary.
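A minimal Python sketch of this standardization step is given below. Table 1 is not reproduced in this text, so the exact standard form is an assumption: the sketch rewrites runs of Arabic digits (half-width or full-width, values 0 to 9999) as kanji numerals with 十, 百, and 千, and does not cover decimals, fractions, or approximate numbers. The function names are hypothetical.

```python
import re

KANJI_DIGITS = "〇一二三四五六七八九"
UNITS = ["", "十", "百", "千"]

# Map full-width digits to ASCII digits so one regex handles both notations.
FULLWIDTH_TO_ASCII = str.maketrans("０１２３４５６７８９", "0123456789")


def int_to_kanji(n: int) -> str:
    """Convert 0..9999 to kanji numerals, e.g. 13 -> 十三 (assumed standard form)."""
    if not 0 <= n < 10000:
        raise ValueError("this sketch only supports 0..9999")
    if n == 0:
        return KANJI_DIGITS[0]
    digits = str(n)
    parts = []
    for i, ch in enumerate(digits):
        d = int(ch)
        power = len(digits) - 1 - i
        if d == 0:
            continue
        if d == 1 and power > 0:
            parts.append(UNITS[power])  # write 十, not 一十
        else:
            parts.append(KANJI_DIGITS[d] + UNITS[power])
    return "".join(parts)


def standardize(text: str) -> str:
    """Rewrite every run of Arabic digits in text as kanji numerals."""
    def replace(match: re.Match) -> str:
        value = int(match.group())
        # Leave numbers outside the supported range untouched.
        return int_to_kanji(value) if value < 10000 else match.group()

    return re.sub(r"[0-9]+", replace, text.translate(FULLWIDTH_TO_ASCII))
```

With this normalization, 「10本」, 「１０本」, and 「十本」 all reach the dictionary lookup as 「十本」, which is why only the kanji form needs to be registered.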

Meanwhile, in the dictionary, ordinary numerals are given standard readings such as those shown in Table 2.

Some phonological changes caused by the combination of a numeral and a classifier cannot be handled by the rules based on the phoneme sequence at the junction. For these, a one-digit numeral and its classifier are registered together in the dictionary as a single word with an exception reading; Table 3 shows examples. As Table 3 shows, the exception-reading words in this embodiment are divided into two groups. The words in both groups have reading changes that the rules cannot handle, but those in Group 2 in particular take a wago reading (an idiomatic reading). A wago reading such as those in Group 2 is used only with one-digit numbers (1 to 9). Consider the following examples of how morphological analysis distinguishes wago readings:

Example 1: 「九月三日」 → 「九月(くがつ)」「三日(みっか)」
Example 2: 「十三日」 → 「十(じゅー)」「三(さん)」「日(にち)」

In a case like Example 1, the reading according to Group 2 is assigned, but when the number is no longer a single digit, as with 「十三」 in Example 2, 「十」 is read as an ordinary numeral. In other words, words belonging to Group 2 are defined so that they do not connect to an ordinary numeral, which distinguishes the case that takes a wago reading (Example 1) from the case that does not (Example 2). When a combination is not registered in the dictionary as an exception reading, the numeral and the classifier are analyzed as separate words, and each is given its standard reading.
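The connection check can be sketched in Python with a toy longest-match tokenizer. Table 3 is not reproduced in this text, so the lexicon below is an assumption: 「九月」 is placed in Group 1 and 「三日」 in Group 2 purely for illustration, and the names `Entry`, `LEXICON`, and `tokenize` are hypothetical.

```python
from typing import NamedTuple


class Entry(NamedTuple):
    reading: str
    pos: str  # "numeral", "classifier", "exception1", or "exception2"


# Assumed toy lexicon; "exception2" marks Group 2 (wago-reading) words.
LEXICON = {
    "三": Entry("さん", "numeral"),
    "九": Entry("きゅう", "numeral"),
    "十": Entry("じゅー", "numeral"),
    "日": Entry("にち", "classifier"),
    "月": Entry("がつ", "classifier"),
    "九月": Entry("くがつ", "exception1"),
    "三日": Entry("みっか", "exception2"),
}
MAX_LEN = max(len(surface) for surface in LEXICON)


def tokenize(text: str) -> list[tuple[str, str]]:
    """Greedy longest-match segmentation with one connection constraint:
    a Group 2 word may not directly follow an ordinary numeral."""
    tokens = []
    prev_pos = None
    i = 0
    while i < len(text):
        for length in range(min(MAX_LEN, len(text) - i), 0, -1):
            surface = text[i:i + length]
            entry = LEXICON.get(surface)
            if entry is None:
                continue
            if entry.pos == "exception2" and prev_pos == "numeral":
                continue  # connection check fails; try a shorter match
            tokens.append((surface, entry.reading))
            prev_pos = entry.pos
            i += length
            break
        else:
            raise ValueError(f"no dictionary entry at position {i}: {text[i]!r}")
    return tokens
```

Under these assumptions, 「九月三日」 is segmented as 九月 (くがつ) and 三日 (みっか), while in 「十三日」 the candidate 三日 is rejected after the numeral 十, so the analysis falls back to 十, 三, and 日 with their standard readings.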

Finally, among the numerals and classifiers that have been given standard readings, those that undergo phonological changes such as gemination (sokuon), voicing (dakuon), or semi-voicing (handakuon) have their readings changed by the phonological change rules. The rules used here, such as those in Tables 4 and 5, are defined by the phoneme sequence at the junction between the numeral and the classifier.
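A rule table keyed only on the phoneme sequence at the junction might look like the following Python sketch. Tables 4 and 5 are not reproduced in this text, so the three rules below are illustrative assumptions covering only gemination and the は-row to ぱ-row change; voicing cases such as さんぼん are deliberately omitted. The sketch also assumes 七 has the standard reading なな, so that the 「ち」 rule only meets いち and はち.

```python
K_ROW = "かきくけこ"
S_ROW = "さしすせそ"
T_ROW = "たちつてと"
H_ROW = "はひふへほ"
H_TO_P = dict(zip(H_ROW, "ぱぴぷぺぽ"))

# Assumed rules: (numeral ending, triggering classifier initials, new ending).
RULES = [
    ("ち", K_ROW + S_ROW + T_ROW + H_ROW, "っ"),          # いち, はち
    ("く", K_ROW + H_ROW, "っ"),                          # ろく, ひゃく
    ("じゅー", K_ROW + S_ROW + T_ROW + H_ROW, "じゅっ"),  # 十 and its multiples
]


def apply_junction_rules(numeral: str, classifier: str) -> str:
    """Join two standard readings, changing phonemes at the junction.

    Only the last kana of the numeral reading and the first kana of the
    classifier reading are inspected, so no per-classifier registration
    is needed.
    """
    for ending, initials, new_ending in RULES:
        if numeral.endswith(ending) and classifier and classifier[0] in initials:
            numeral = numeral[: -len(ending)] + new_ending
            # After the geminate, a は-row initial becomes the ぱ-row kana.
            classifier = H_TO_P.get(classifier[0], classifier[0]) + classifier[1:]
            break
    return numeral + classifier
```

Because the rules look only at the junction, a classifier that appears nowhere in the table still changes correctly once its standard reading is known: given けん for 件, the sketch produces いっけん for 一件.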

Thus, the numeral reading assignment process of this embodiment can be simplified without lowering the accuracy of numeral reading assignment. Moreover, even if the input text contains an unknown classifier, a reading that reflects general phonological changes can be assigned as long as the classifier's standard reading is supplied.

Effects of the Invention
As described above, phonological change patterns are of two kinds: those that can be handled by rules based on the phoneme sequence at the junction between numeral and classifier, and those that such rules cannot handle. Because only a small number of numeral-classifier combinations undergo changes that the rules cannot handle, the present invention registers all of them in the dictionary in advance as exception readings and assigns the correct reading during morphological analysis. Combinations not registered as exception readings are given standard readings, and the phonological change is applied last by the rules based on the phoneme sequence at the junction. In addition, the connection check between adjacent words performed during morphological analysis is used to distinguish cases that take a wago reading from those that do not. As a result, numeral reading assignment can be simplified without lowering its accuracy, and even if the input text contains an unknown classifier, a reading that reflects general phonological changes can be assigned as long as the classifier's standard reading is supplied.

[Brief Description of the Drawing]

The drawing is a flowchart showing the processing procedure of the present invention.

Claims (2)

(57) [Claims]
[Claim 1] A numeral reading assignment device in a rule-based speech synthesizer that has a word dictionary and a rule table and performs a morphological analysis process and a numeral reading assignment process, wherein:
the word dictionary stores the part of speech and reading of each word, and stores a numeral-plus-classifier sequence as a single word with a wago reading;
the rule table stores the phonological change corresponding to each phoneme sequence at the junction between a numeral and a classifier;
the morphological analysis process divides the input Japanese text into words using the word dictionary and assigns each word a part of speech and a reading; and
the numeral reading assignment process, when the part of speech and reading of a word assigned by the morphological analysis process match a phoneme sequence in the rule table, changes the reading according to the corresponding phonological change.
[Claim 2] The numeral reading assignment device in a rule-based speech synthesizer according to claim 1, wherein, when the sequence of divided words is a numeral followed by a wago-reading word, the morphological analysis process splits the wago-reading word into a numeral and a classifier and assigns readings to them.
JP63180587A 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer Expired - Lifetime JP3002202B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP63180587A JP3002202B2 (en) 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP63180587A JP3002202B2 (en) 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer

Publications (2)

Publication Number Publication Date
JPH0229796A JPH0229796A (en) 1990-01-31
JP3002202B2 true JP3002202B2 (en) 2000-01-24

Family

ID=16085871

Family Applications (1)

Application Number Title Priority Date Filing Date
JP63180587A Expired - Lifetime JP3002202B2 (en) 1988-07-20 1988-07-20 Numeral reading device in rule speech synthesizer

Country Status (1)

Country Link
JP (1) JP3002202B2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101732667B1 (en) * 2016-04-27 2017-05-08 주식회사 필룩스 Lighting apparatus

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5171171A (en) * 1988-12-28 1992-12-15 Yamaha Hatsudoki Kabushiki Kaisha Kill switch assembly for small watercraft
JPH07129619A (en) * 1993-10-29 1995-05-19 Hiuka Sangyo Kk Voice electronic book
JP4603290B2 (en) * 2004-05-20 2010-12-22 日本放送協会 Speech synthesis apparatus and speech synthesis program

Also Published As

Publication number Publication date
JPH0229796A (en) 1990-01-31

Similar Documents

Publication Publication Date Title
US4773009A (en) Method and apparatus for text analysis
CN100568225C (en) The Words symbolization processing method and the system of numeral and special symbol string in the text
JP5231698B2 (en) How to predict how to read Japanese ideograms
US5208863A (en) Encoding method for syllables
JP3002202B2 (en) Numeral reading device in rule speech synthesizer
Kamran Malik et al. Transliterating urdu for a broad-coverage urdu/hindi lfg grammar
JPH06282290A (en) Natural language processing device and method thereof
JPH0244080B2 (en)
JPS635792B2 (en)
JP3029403B2 (en) Sentence data speech conversion system
JPH0210957B2 (en)
JPH01233550A (en) Display system for chinese language
JP3407293B2 (en) Character replacement device
JP2614912B2 (en) Text-to-speech device
JPH0375898B2 (en)
JPH0778155A (en) Document recognizing device
JPS60150169A (en) Electronic word dictionary
JPH09281993A (en) Phonetic symbol forming device
JPH0460754A (en) Kana/kanji (chinese character) conversion system
JPH0218661A (en) Japanese word input device
JPS59127146A (en) Sentence reading-out device
JPH02144660A (en) Kana/kanji converter
JPS62210578A (en) Translation system from japanese to chinese
JPS61177573A (en) Forming device of japanese document
JPS63316161A (en) Document preparing device

Legal Events

Date Code Title Description
FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20071112

Year of fee payment: 8

FPAY Renewal fee payment (event date is renewal date of database)

Free format text: PAYMENT UNTIL: 20081112

Year of fee payment: 9

EXPY Cancellation because of completion of term