JPS63140329A - Sentence reading system - Google Patents

Sentence reading system

Info

Publication number
JPS63140329A
JPS63140329A JP61286025A JP28602586A JPS63140329A JP S63140329 A JPS63140329 A JP S63140329A JP 61286025 A JP61286025 A JP 61286025A JP 28602586 A JP28602586 A JP 28602586A JP S63140329 A JPS63140329 A JP S63140329A
Authority
JP
Japan
Prior art keywords
word
naturalized
sentence
dictionary
standard form
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP61286025A
Other languages
Japanese (ja)
Inventor
Fukami Kamiyama
神山 ふかみ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP61286025A priority Critical patent/JPS63140329A/en
Publication of JPS63140329A publication Critical patent/JPS63140329A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To read aloud a sentence smoothly without being conscious of fluctuation in a naturalized word, by providing a conversion part which convert the naturalized word having the fluctuation to a standard for. CONSTITUTION:When the naturalized word exists in a text inputted from a sentence input part 11, and it is not the standard form, it is converted to the standard form at the conversion part 121, and a word dictionary 16 is retrieved. In other words, when the text inputted from the input part 11 is the naturalized word having the fluctuation, a dictionary retrieval part 12 sends it to the conversion part 121. The conversion part 121 converts it to the reference form by a prescribed conversion rule, and retrieves the word dictionary. A word identification part 13 selects an optimum word string based on a retrieved result, and generates a reading string by a reading string storage part 14, and performs voice synthesis at a voice output part, then, outputs voice. In such way, it is possible to read aloud the sentence without being conscious of the fluctuation of the naturalized word.

Description

【発明の詳細な説明】 〔概 要〕 本発明は日本語情報処理装置における文章読み上げ方式
において、表記された文章中のゆらぎのある外来語によ
る単語辞書検索、単語同定を容易にするため、ゆらぎの
ある外来語を標準形に変換する変換部を設け、標準形に
変換された外来語について単語辞書検索を行い、単語同
定するようにしたものである。
[Detailed Description of the Invention] [Summary] The present invention uses fluctuations in a text reading method in a Japanese information processing device to facilitate word dictionary searches and word identification using foreign words with fluctuations in written sentences. A conversion unit is provided to convert a certain foreign word into a standard form, and a word dictionary search is performed on the foreign word converted to the standard form to identify the word.

〔産業上の利用分野〕[Industrial application field]

本発明は日本語情報処理装置に関し、特に、表記された
文章を入力し、その文字列中の単語を識別するための文
章解析を行い音声出力する文章読み上げ方式に関する。
The present invention relates to a Japanese language information processing device, and more particularly to a text reading method that inputs written text, analyzes the text to identify words in the character string, and outputs the text aloud.

〔従来の技術及び発明が解決しようとする問題点〕日本
語情報処理において、単語、文節、句等の文字列からな
る単語等の表記部分とその読み方、文法等を格納した単
語辞書を有し、表記された文章をこの単語辞書を参照し
て単語等を固定し、読み列を生成する文章読み上げ方式
は既に知られている。
[Prior art and problems to be solved by the invention] In Japanese information processing, there is a word dictionary that stores the notation parts of words, etc. consisting of character strings such as words, phrases, phrases, etc., their pronunciation, grammar, etc. , a sentence reading method is already known in which a written sentence is referred to in this word dictionary, words, etc. are fixed, and a reading sequence is generated.

この場合、表記された文章中に、いわゆる“ゆらぎ”の
あるカタカナ外来語、例えば、バイオリンに対してヴァ
イオリン、ウィスキーに対してウィスキー等が含まれる
ことがある。即ち、外来語は主として欧米語から国語に
取り入れたことばであり、日本語の音にない原音をどの
ように書き表わすかが問題となり、書き表わし方は語に
よって必ずしも一定しておらず、又、人によっても様々
なためである。
In this case, the written sentence may include katakana loanwords with so-called "fluctuations", such as "violin" for violin, "whiskey" for whiskey, etc. In other words, foreign words are mainly words that have been introduced into the Japanese language from Western languages, and the problem is how to write the original sounds that do not exist in Japanese, and the way they are written is not necessarily the same depending on the word. This is because it varies depending on the person.

このように書き表わし方がまちまちな外来語を許す日本
語にあって、機械翻訳、文書メールの読み上げ、データ
ベース検索の出力、読書機等の文章解析として外来語の
単語処理技術の早期開発が重要な課題となっている。
As the Japanese language allows foreign words to be written in different ways, it is important to develop word processing technology for foreign words as early as possible for machine translation, text-mail reading, database search output, and text analysis for reading devices. This has become a major issue.

第32図は従来の文章読み上げ方式の構成図である。第
3図において、31は文章入力部、32は辞書検索部、
33は単語同定部、34は読み列格納部、35は音声出
力部、そして36は単語辞書である。辞書検索部32と
単語同定部33により文章解析部37を構成する。
FIG. 32 is a block diagram of a conventional text reading system. In FIG. 3, 31 is a text input section, 32 is a dictionary search section,
33 is a word identification section, 34 is a reading sequence storage section, 35 is an audio output section, and 36 is a word dictionary. The dictionary search section 32 and the word identification section 33 constitute a text analysis section 37.

文章解析部37は例えばキーボード等の文章入力部31
から入力された文章を、単語辞書36がら参照して候補
単語をピックアップし単語同定部33で最適な単語の組
合せを選択する。単語辞書36には単語の表記とその読
み方、文法、アクセント等が格納されており、それらか
ら読み列格納部34により読み列をつくり、音声出力部
35で音声合成を行って音声を出力する。
The text analysis unit 37 includes a text input unit 31 such as a keyboard, for example.
The word dictionary 36 is used to refer to the input sentence, and candidate words are picked up, and the word identification section 33 selects the optimal combination of words. The word dictionary 36 stores word notations, their pronunciations, grammar, accents, etc. From these, a pronunciation sequence is created by a pronunciation storage section 34, and a speech output section 35 performs speech synthesis and outputs speech.

上記の説明から明らかな如く、書き方が一定でない外来
語のために、すべてのバリエーションが単語辞書に登録
されていないと、辞書中に登録されていない外来語が文
章中に存在した場合、単語同定に失敗し文章をなめらか
に読み上げることができなくなる。例えば、辞書中には
「データー」、「ウィスキー」、「バリエーション」と
登録されており文章の方には「データ」、「ウィスキー
」、「ヴアリエーション」という形で表記されていた場
合、前記の如く単語同定は失敗する。
As is clear from the above explanation, if all variations of a loan word are not registered in the word dictionary because the writing style is not fixed, if a loan word that is not registered in the dictionary exists in a sentence, word identification will be difficult. This makes it difficult to read sentences smoothly. For example, if the words ``data,''``whiskey,'' and ``variation'' are registered in the dictionary, but the sentences are written as ``data,''``whiskey,'' and ``variation,'' then Like this, word identification fails.

〔問題点を解決するための手段〕[Means for solving problems]

本発明は上述した問題点を解消した文章読み上げ方式を
提供することにあり、その手段は、カタカナ外来語の書
き方の標準形を予め設定しておき、表記した文章中のゆ
らぎのあるカタカナ外来語につい、て所定の変換規則に
より前記標準形に変換する変換部を備え、変換された標
準形について単語辞書検索を行い、単語同定するように
したことを特徴とする。
The object of the present invention is to provide a text reading method that solves the above-mentioned problems, and the means thereof is to set a standard form for writing katakana foreign words in advance, and read out katakana foreign words that have fluctuations in written sentences. The present invention is characterized in that it includes a conversion unit that converts the converted standard form into the standard form according to a predetermined conversion rule, and performs a word dictionary search on the converted standard form to identify words.

〔実施例〕〔Example〕

第1図は本発明に係る文章読み上げ方式の構成図である
。第1図において、11は文章入力部、12は辞書検索
部、13は単語同定部、121は外来語標準形変換部、
14は読み列格納部、15は音声出力部、そして16は
単語辞書部である。辞書検索部12、単語同定部13お
よび外来語標準形変換部121により文章解析部17を
構成する。
FIG. 1 is a block diagram of a text reading method according to the present invention. In FIG. 1, 11 is a text input section, 12 is a dictionary search section, 13 is a word identification section, 121 is a foreign word standard form conversion section,
14 is a reading sequence storage section, 15 is an audio output section, and 16 is a word dictionary section. The dictionary search section 12, the word identification section 13, and the foreign word standard form conversion section 121 constitute the sentence analysis section 17.

このような構成において、キーボード等により文章入力
部11から入力された文の中に外来語がありこれが定め
られた標準形でなかった場合、変換部121で標準形に
変換して単語辞書16を検索する。即ち、文章入力部1
1に入力された文が、例えば「ヴァイオリンの音をコン
ピューターで」とすると、辞書検索部12では上記文章
中のカタカナ表記の部分を外来語標準形変換部121に
送り、「ヴァイオリン」の「ヴア」を「ハ」に、「コン
ピューター」の長音部分をとって「コンピュタ」とし単
語辞書16を検索し候補単語をピンクアンプする。単語
辞書16には第2図に示すように、標準形への変換結果
を示す“表記”、“読み”、“文法”、“アクセント”
などが格納されている。
In such a configuration, if there is a foreign word in a sentence inputted from the text input section 11 using a keyboard or the like and it is not in the prescribed standard form, the conversion section 121 converts it into the standard form and converts it into the word dictionary 16. search for. That is, text input section 1
For example, if the sentence input in step 1 is ``the sound of a violin on a computer,'' the dictionary search unit 12 sends the katakana notation in the sentence to the loan word standard form conversion unit 121, and converts it to the ``voice'' of ``violin.'''' as ``ha'' and the long part of ``computer'' as ``computer'', search the word dictionary 16, and pink-amplify candidate words. As shown in Figure 2, the word dictionary 16 includes "notation", "reading", "grammar", and "accent" that indicate the conversion result to the standard form.
etc. are stored.

外来語標準形変換部121は、上記の例の「ヴア、ヴイ
、ヴ、ヴエ、ヴオ」を[バ、ビ、ブ、べ、ボ]、に、「
長音はすべて除く」に、さらに、「ライ、ウニ、ウォ」
を「ライ、ウニ、ウォ」にするなど表記文章中のゆらぎ
部分の変換規則を持っている。
The loanword standard form conversion unit 121 converts "Vua, Vui, Vu, Vue, Vuo" in the above example into [Ba, Bi, Bu, Be, Bo], "
"Excluding all long sounds" and "rai, uni, wo"
It has rules for converting fluctuations in written sentences, such as changing ``rai, uni, wo''.

この変換は例えばソフト的に行われ、IF文、GOTO
文等により変換される。
This conversion is performed by software, for example, using IF statements, GOTO
Converted by sentences, etc.

候補単語のピックアップ後は単語同定部13で最適な単
語列を選択し、読み列格納部14では確定した単語列に
従って読み列「バイオリンノオトオコンピューターデ」
を生成し格納し、音声出力部15は読み列に基づき音声
合成を行い音声出力をする。
After picking up the candidate words, the word identification section 13 selects the most suitable word string, and the reading string storage section 14 stores the reading string "Violin no Otoo Computer De" according to the determined word string.
The speech output unit 15 performs speech synthesis based on the reading sequence and outputs speech.

〔発明の効果〕〔Effect of the invention〕

本発明によれば、日本語情報処理装置における、文章か
らその読みを生成する文章読み上げ方式において、外来
語のゆらぎを気にすることなく日本語の円滑な単語同定
処理と文章解析が可能となる。
According to the present invention, in a text-to-speech method for generating pronunciations from sentences in a Japanese information processing device, smooth word identification processing and text analysis of Japanese are possible without worrying about fluctuations in foreign words. .

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明に係る文章読み上げ方式の構成図、 第2図は第1図構成における単語辞書の詳細図、および 第3図は従来の構成図である。 (符号の説明) 11 、31・・・文章入力部、 12 、32・・・辞書検索部、 13 、33・・・単語同定部、 14.34・・・読み列格納部、 15.35・・・音声出力部、 16 、36・・・単語辞書、 121・・・外来語標準形変換部。 従来の文章読み上げ方式の構成図 第3図 FIG. 1 is a configuration diagram of a text reading method according to the present invention, Figure 2 is a detailed diagram of the word dictionary in the configuration shown in Figure 1, and FIG. 3 is a conventional configuration diagram. (Explanation of symbols) 11, 31... text input section, 12, 32...dictionary search section, 13, 33...word identification section, 14.34... reading sequence storage section, 15.35...Audio output section, 16, 36...word dictionary, 121...Loan word standard form conversion unit. Configuration diagram of conventional text reading method Figure 3

Claims (1)

【特許請求の範囲】[Claims] 1、表記した文章を入力しその文字列中の単語を識別す
るため文章解析を行ない音声出力する日本語情報処理装
置における文章読み上げ方式において、カタカナ外来語
の書き方の標準形を予め設定しておき、前記表記した文
章中のゆらぎのあるカタカナ外来語について所定の変換
規則により前記標準形に変換する変換部を備え、変換さ
れた標準形について単語辞書検索を行い、単語同定する
ようにしたことを特徴とする文章読み上げ方式。
1. In the text reading method of a Japanese information processing device that inputs a written sentence, analyzes the text to identify the words in the string, and outputs the sound, a standard form for writing katakana foreign words is set in advance. , comprising a converting unit that converts a katakana foreign word with fluctuations in the written sentence into the standard form according to a predetermined conversion rule, and performs a word dictionary search on the converted standard form to identify the word. Characteristic text reading method.
JP61286025A 1986-12-02 1986-12-02 Sentence reading system Pending JPS63140329A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61286025A JPS63140329A (en) 1986-12-02 1986-12-02 Sentence reading system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61286025A JPS63140329A (en) 1986-12-02 1986-12-02 Sentence reading system

Publications (1)

Publication Number Publication Date
JPS63140329A true JPS63140329A (en) 1988-06-11

Family

ID=17699001

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61286025A Pending JPS63140329A (en) 1986-12-02 1986-12-02 Sentence reading system

Country Status (1)

Country Link
JP (1) JPS63140329A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0267684A (en) * 1988-09-02 1990-03-07 Hitachi Ltd Calibration supporting system and dictionary retrieving system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60122429A (en) * 1983-12-06 1985-06-29 Fujitsu Ltd Monosyllable voice input system
JPS613266A (en) * 1984-06-15 1986-01-09 Nec Corp Kanji conversion system for katakana described name

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60122429A (en) * 1983-12-06 1985-06-29 Fujitsu Ltd Monosyllable voice input system
JPS613266A (en) * 1984-06-15 1986-01-09 Nec Corp Kanji conversion system for katakana described name

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0267684A (en) * 1988-09-02 1990-03-07 Hitachi Ltd Calibration supporting system and dictionary retrieving system

Similar Documents

Publication Publication Date Title
US6477495B1 (en) Speech synthesis system and prosodic control method in the speech synthesis system
JP4811557B2 (en) Voice reproduction device and speech support device
Fordyce et al. Prosody prediction for speech synthesis using transformational rule-based learning.
Demenko et al. JURISDIC: Polish Speech Database for Taking Dictation of Legal Texts.
Greibus et al. The phoneme set influence for Lithuanian speech commands recognition accuracy
Labied et al. Moroccan dialect “Darija” automatic speech recognition: a survey
JP3576066B2 (en) Speech synthesis system and speech synthesis method
KR100369507B1 (en) Voice recognition/synthesis systems based on standard pronunciation analysis methodology and methods therefor
JP2000172289A (en) Method and record medium for processing natural language, and speech synthesis device
Meng et al. CU VOCAL: corpus-based syllable concatenation for Chinese speech synthesis across domains and dialects.
JPS63140329A (en) Sentence reading system
JP3981619B2 (en) Recording list acquisition device, speech segment database creation device, and device program thereof
JP2001117922A (en) Device and method for translation and recording medium
JPH0760378B2 (en) Text-to-speech device
JPS62119591A (en) Sentence reciting apparatus
Kaur et al. BUILDING AText-TO-SPEECH SYSTEM FOR PUNJABI LANGUAGE
JPH02308194A (en) Foreign language learning device
JPH0229797A (en) Text voice converting device
Tavfiqovna Linguistic Features of Speech in Russian and Uzbek
JPS63163925A (en) Device for reading sentence aloud
Fujimura The role of linguistics for future speech technology
KR0136423B1 (en) Phonetic change processing method by validity check of sound control symbol
Rajeswari et al. Developing Intonation Pattern for Tamil Text To Speech Synthesis System
JPH07140999A (en) Device and method for voice synthesis
Tian et al. Modular design for Mandarin text-to-speech synthesis