JPS60107171A - Voice kana (japanese syllabary) kanji (chinese character) converting system - Google Patents

Voice kana (japanese syllabary) kanji (chinese character) converting system

Info

Publication number
JPS60107171A
JPS60107171A JP58214537A JP21453783A JPS60107171A JP S60107171 A JPS60107171 A JP S60107171A JP 58214537 A JP58214537 A JP 58214537A JP 21453783 A JP21453783 A JP 21453783A JP S60107171 A JPS60107171 A JP S60107171A
Authority
JP
Japan
Prior art keywords
kana
kana character
character string
storage means
replaced
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58214537A
Other languages
Japanese (ja)
Inventor
Yutaka Ooyama
裕 大山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp, Nippon Electric Co Ltd filed Critical NEC Corp
Priority to JP58214537A priority Critical patent/JPS60107171A/en
Publication of JPS60107171A publication Critical patent/JPS60107171A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/53Processing of non-Latin text

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

PURPOSE:To realize a voice Kana/Kanji converting system possible for sounding a sentence as a normal reading by referencing a replaced Kana character storage means and an unreplaced Kana character storage means. CONSTITUTION:A voice input means 1 transmits a Kana character string to a dictionary retrieval means 3. The dictionary retrieval means 3 uses the Kana character string as a key to retrieve the dictionary in the dictionary storage means 2. Thus, partial Kana characters unmatched as the result are left. A Kana character replacing means 6 has a function to replace required Kana character strings left as the result of dictionary retrieval into other Kana characters by referencing the replaced Kana character storage means 4 and the unreplaced Kana character storage means 5. To the replaced Kana character storage means 4, ''Kana characters to be replaced'' and ''Kana character after replacement'' are stored. Moreover, Kana characters not to be replaced among Kana character strings including Kana characters being object of replacement are stored in the unreplaced Kana character string storage means 5.

Description

【発明の詳細な説明】 本発明は、発音された日本語がら漢字かな混じシ文字列
を得る音声カナ漢字変換方式に関するものであシ、さら
に詳細には、「は」 「を」 「へ」など、読みと表記
が異なる単結に関しても、読みの通シの発音から正しい
表記を得る方式に関するものである。
DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a phonetic kana-kanji conversion method for obtaining a character string mixed with kanji, kana, and kanji from pronounced Japanese. This article concerns a method for obtaining the correct notation from the pronunciation of the reading, even for single-junctions that have different readings and notations.

音声カナ漢字変換方式は、数千字種もの漢字を含む日本
語の文章を単にその読みの通シに発声することによシ計
算機等情報処理機器へ入力できる方式であシ、これによ
れは、利用者はほとんど訓練なしに日本語全入力するこ
とが可能である。しかし、日本語の文章中には、その表
記である文字と読みとで異なるものがいくつが存在する
。例えば、「私は」の「は」は「−ゎ」と発音される。
The phonetic kana-kanji conversion method is a method that allows Japanese sentences containing thousands of types of kanji to be input into information processing equipment such as computers by simply saying their readings aloud. , users can input all Japanese characters with almost no training. However, there are many Japanese sentences that have different characters and readings. For example, the ``ha'' in ``Washiwa'' is pronounced as ``-ゎ''.

従って、従来は、「わ」 「たJ rLJ rわ」でな
く、「わ」 [たJ rLJ rはjの如く、表記通υ
の読みで発音しなければならなかった。また、この不便
さを防ぐために自動的に「わ」を「は」に変換する方式
を用いると、「わ」 「た」「シ」「わ」は、「は」 
「た」「シ」「は」と変換されてしまうことになる。
Therefore, conventionally, instead of "wa""taJ rLJ rwa", "wa" [taJ rLJ r is like j, commonly written υ
I had to pronounce it using the reading. Also, in order to avoid this inconvenience, if you use a method that automatically converts "wa" to "ha", "wa", "ta", "shi", "wa" will become "ha".
It ends up being converted into ``ta'', ``shi'', and ``ha''.

本発明の目的は、上記の欠点を排除し、利用者が、通常
の読みの通りに文章全発音することが可能な音声カナ漢
字変換方式を提供することにある。
SUMMARY OF THE INVENTION An object of the present invention is to provide a phonetic kana-kanji conversion method that eliminates the above-mentioned drawbacks and allows a user to pronounce entire sentences as they are normally read.

本発明によれば、発音された日本語から漢字かな混じり
文字列を得る音声カナ漢字変換方式において、発音され
た日本語文章を音声認識してカナ文字列を出力する音声
入力手段と、複数個の単語の少なくとも読みと表記を備
えた辞書を記憶するための辞書記憶手段と、該カナ文字
列をキーに該辞書を検索し該表記を得るための辞書検索
手段と、マツチングしなかった該カナ文字列の残余部分
を構成するカナ文字のうちで、別の文字に置き換えるべ
きものを記憶するための置換カナ文字記憶手段および、
置き候え対象とならないカナ文字列パターンを記憶する
ための非置換カナ文字列記憶手段と、前記置換カナ文字
記憶手段および前記非置換カナ文字H行憶手段を参照す
ることにより、前記残余部分の全部あるいは一部分を別
のカナ文字列に置き換えるカナ文字置換手段と、前記表
記および前記残余部分から漢字かな混じり文字列を形成
し出力する漢字かな混じシ文字列形成手段によって、通
常の読み通シの発音から漢字かな混じシ文字列を得るこ
とができる。
According to the present invention, in a phonetic kana-kanji conversion method for obtaining a character string containing kanji and kana from pronounced Japanese, a plurality of voice input means that perform voice recognition on a pronounced Japanese sentence and output a kana character string; a dictionary storage means for storing a dictionary having at least the pronunciation and notation of the word; a dictionary search means for searching the dictionary using the kana character string as a key to obtain the notation; Replacement kana character storage means for storing kana characters that should be replaced with other characters among the kana characters forming the remaining part of the character string;
By referring to the non-replacement kana character string storage means for storing kana character string patterns that are not subject to replacement, the replacement kana character storage means, and the non-replacement kana character H line storage means, all of the remaining portions are stored. Alternatively, by using a kana character replacement means that replaces a part with another kana character string, and a kanji-kana mixed character string forming means that forms and outputs a kanji-kana mixed character string from the above-mentioned notation and the above-mentioned remaining part, the pronunciation of the normal yomitsushi can be achieved. You can get a string of characters mixed with Kanji and Kana.

以下に図面を用いて具体的な実施例を掲けて本発明を説
明する。
The present invention will be described below with reference to the drawings and specific examples.

第1図は、本発明の一実施例を示すブロック図である。FIG. 1 is a block diagram showing one embodiment of the present invention.

1は翼背された日本語文章を音声認識してカナ文字列を
出力する音声入力手段であう、2は複数個の単語の少な
くとも読みと表記金偏えた辞書記憶手段でオシ、3は該
カナ文字列をキーに該辞書を検索し該表記を得るだめの
辞書検索手段であシ、4はマツチングしなかった該カナ
文字列の残余部分を構成するカナ文字のうちで、別の文
字に置き換えるべきものを記憶するだめの置換カナ文字
記憶手段であシ、5は置き換え対象となららないカナ文
字列パターンを記憶するための非置換カナ文字列記憶手
段であシ、6は前記置換カナ文字記憶手段および前記非
置換カナ文字列記憶手段を参照することによ)、前記残
余部分の全部あるいは一部分を別のカナ文字列に置き換
えるカナ文字置換手段であシ、7は前記表記および前記
残余部分から漢字かな混じシ文字列を形成し出力す−る
漢字かな混じり文字列形成手段である。ここでは、「沿
岸においては」を得るために、「え」「ん」 「が」 
「ん」 「に」 「お」 「いj 「て」「わ」と発音
する場合を例にとって説明する。
1 is a voice input means for voice recognition of a Japanese sentence and outputs a kana character string; 2 is a dictionary storage means that includes at least the pronunciation and notation of a plurality of words; and 3 is a voice input means for outputting the kana character string. This is a dictionary search method that searches the dictionary using the column as a key to obtain the notation, and 4 should be replaced with another character among the kana characters that make up the remaining part of the kana character string that was not matched. 5 is a replacement kana character string storage means for storing kana character string patterns that are not to be replaced; 6 is the replacement kana character storage means for storing kana character string patterns that are not to be replaced; 7 is a kana character replacement means for replacing all or part of the remaining part with another kana character string (by referring to the means and the non-replaced kana character string storage means); This is a kanji/kana mixed character string forming means for forming and outputting a kanji/kana mixed character string. Here, to get "on the coast," we use "e", "n", and "ga".
Let's take the example of pronouncing ``n'', ``ni'', ``o'', ``ij'', ``te'', and ``wa''.

「え」 「ん」 「がj 「ん」 「に」 「お」 「
い」「て」 [わjの発音によシ、音声入力手段1は、
「えんがんにおいてわ」というカナ文字列を辞書検索手
段3に送る。辞書検索手段3は、このカナ文字列をキー
に辞書記憶手段2内の辞書を検索する。ここで「えんが
ん」という読みを持つ「沿岸」という表記が得られたと
仮定する。その結果、マツチングしなかった「において
わ」という部分カナ文字が残される。
"E""N""Gaj"N""Ni""O""
"i""te" [Depending on the pronunciation of waj, the voice input means 1 is
The kana character string "Engan tewa" is sent to the dictionary search means 3. The dictionary search means 3 searches the dictionary in the dictionary storage means 2 using this kana character string as a key. Let us assume that we have obtained the notation ``coastal'' which has the reading ``engan''. As a result, the unmatched partial kana character ``deiwa'' is left.

カナ文字置換手段6は、置換カナ文字記憶手段4と非置
換カナ文字記憶手段5を参照するととによシ、辞書検索
の結果残されたカナ文字列のうちで必要なものについて
別のカナ文字に置き換える機能を有する。置換カナ文字
記憶手段4には、「置換されるカナ文字」と「置換後の
カナ文字」が記憶される。また、非置換カナ文字列記憶
手段5には、置換対象とがるカナ文字(すなわち置換カ
ナ文字記憶手段4内の「置換されるカナ文字」)を含む
カナ文字列のうち、そのカナ文字が置換されるべきでな
いものが記憶される。
Referring to the replaced kana character storage means 4 and the non-replaced kana character storage means 5, the kana character replacement means 6 replaces necessary kana characters with other kana characters among the kana character strings left as a result of the dictionary search. It has the function of replacing it with The substituted kana character storage means 4 stores "kana characters to be replaced" and "kana characters after replacement". In addition, the non-replacement kana character string storage means 5 stores the kana character among the kana character strings that include the sharp kana character to be replaced (that is, the "kana character to be replaced" in the replacement kana character storage means 4). Those that should not be replaced are remembered.

第2図は置換カナ文字記憶手段4における置換対応文字
の一実施例を示す図、第3図は非置換カナ文字記憶手段
5における文字の一実施例を示す図である。第4図はカ
ナ文字変更手段6の処理手順の一例を示す図である。
FIG. 2 is a diagram showing an example of characters corresponding to replacement in the replacement kana character storage means 4, and FIG. 3 is a diagram showing an example of characters in the non-replacement kana character storage means 5. FIG. 4 is a diagram showing an example of the processing procedure of the kana character changing means 6.

第4図の処理手順を、前記の残余カナ文字列「において
わ」に適用すると以下のようになる。
When the processing procedure shown in FIG. 4 is applied to the remaining kana character string "Nidewa", the result is as follows.

まず、非置換カナ文字列記憶手段5内のカナ文字列を、
「においてわ」で検索する。非置換カナ文字列記憶手段
5として第3図の例を用いたとすると、「おいて」が検
索さhる。次に、マツチングしなかったカナ文字が、置
換カナ文字記憶手段4内の「置換されるカナ文字」であ
る場合には、これを「置換後のカナ文字」に置き換える
。置換カナ文字記憶手段4として第2図の例を用いたと
すると、「においてわ」から「おいて」を除いた「に」
 「わ」のうち、「わ」が「は」に置き換えられる。そ
の結果、「においてわ」は、「においては」と置換され
る。漢字かな混じ9文字列形成手段7は、辞書検索手段
3で得た表記(沿岸)とカナ文字置換手段6で置換を済
ませた残余カナ文字列(においては)から漢字かな混じ
多文字列を形成し、出力する。その結果「沿岸において
は」を得ることができろ。
First, the kana character string in the non-replaced kana character string storage means 5 is
Search for "Nidewa". If the example shown in FIG. 3 is used as the non-substituted kana character string storage means 5, "Ote" will be searched. Next, if the unmatched kana character is the "kana character to be replaced" in the replacement kana character storage means 4, it is replaced with the "replaced kana character". If we use the example shown in Figure 2 as the substituted kana character storage means 4, then ``ni'' is obtained by removing ``te'' from ``deiwa''.
In "wa", "wa" is replaced with "ha". As a result, "dewa" is replaced with "dewa". The kanji-kana-mixed 9-character string forming means 7 forms a kanji-kana-mixed multi-character string from the notation (coastal) obtained by the dictionary search means 3 and the remaining kana character string (inha) that has been replaced by the kana character replacement means 6. and output. As a result, you can get ``on the coast''.

このように、本発明を用いることにより、発音と表記に
ずれのある日本語に関しても、そのずれを意識すること
なく、通常の発音の通シに日本語文章を入力し、漢字か
な混じり文字列を得ることができる。なお本実施例では
、辞書検索を1つのカナ文字列について1回しか行なっ
ていないが、例えば、「じようはうしよシわ」といった
入力について「じようほう」と「しよシ」に対応する「
情報」 「処理」の2単語を検索し、残余部である「わ
」を「は」に置換して、「情報処理は」を得るといった
方法をとることも可能である。また、本発明を説明する
ために使用した処理手順・図面などは、あくまでも一実
施例であり、特許請求の範囲を限定するものではない。
In this way, by using the present invention, even when there is a discrepancy in Japanese pronunciation and writing, you can input Japanese sentences into the normal pronunciation system without being aware of the discrepancy, and create character strings containing kanji and kana. can be obtained. In this example, the dictionary search is performed only once for one kana character string, but for example, for an input such as "Jiyou wa Ushiyo Shiwa", it corresponds to "Jiyouhou" and "Shiyoshi". do"
It is also possible to search for the two words "information" and "processing" and replace the remaining part "wa" with "ha" to obtain "information processing." Furthermore, the processing procedures, drawings, etc. used to explain the present invention are merely examples, and do not limit the scope of the claims.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明を実現するための一実施例を示したブロ
ック図であり、第2図は置換カナ文字記憶手段における
置換対応文字の一実施例を示す図、第3図は非置換カナ
文字列記憶手段における文字の一実施例を示す図、第4
図はカナ文字置換手段の処理手順の一例合示したフロー
チャート図である。 図において、1は音声入力手段、2は辞書記憶手段、3
t、、c;’f書検索手段、4はfmカナ文字記憶手段
、5は非置換カナ文字記憶手段、6はカナ文字置換手段
、7は漢字かな混じυ文字列形成手段でおる。 第1図 乙 5 72図 第3図 1
FIG. 1 is a block diagram showing an embodiment for realizing the present invention, FIG. 2 is a block diagram showing an embodiment of replacement compatible characters in a replacement kana character storage means, and FIG. A fourth diagram showing an example of characters in a character string storage means.
The figure is a flowchart showing an example of the processing procedure of the kana character replacement means. In the figure, 1 is a voice input means, 2 is a dictionary storage means, and 3 is a voice input means.
t,,c; 'f-book search means, 4 is fm kana character storage means, 5 is non-replacement kana character storage means, 6 is kana character replacement means, and 7 is kanji-kana mixed υ character string forming means. Figure 1 Otsu 5 72 Figure 3 Figure 1

Claims (1)

【特許請求の範囲】[Claims] ■1重合金れた日本語から漢字かな混じシ文字列を得る
音声カナ漢字変換方式において、発音された日本語文章
を音声認識してカナ文字列を出力する音声入力手段と、
複数個の単語の少なくとも読みと表記を備えた辞書を記
憶するための辞書記憶手段と、該カナ文字列をキーに該
辞書を検索し該表記を得るための辞書検索手段と、マツ
チングし々かった該カナ文字列の残余部分を構成するカ
ナ文字のうちで、別の文字に置き換えるべきものを記憶
するための置換カナ文字記憶手段および、置き換え対象
と々らないカナ文字列パターンを記憶するための非置換
カナ文字列記憶手段と、前記置換カナ文字記憶手段およ
び前記非置換カナ文字列記憶手段を参照することによシ
、前記残余部分の全部あるいは一部分を別のカナ文字列
に置き換えるカナ文字置換手段と、前記表記および前記
残余部分から漢字かな混じシ文字列を形成し出方する漢
字かな混じシ文字列形成手段を備えたことを特徴とする
音声カナ漢字変換方式。
■In the phonetic kana-kanji conversion method for obtaining a kanji-kana-mixed character string from Japanese words that have been combined, a voice input means that performs voice recognition on a pronounced Japanese sentence and outputs a kana character string;
A dictionary storage means for storing a dictionary including at least readings and spellings of a plurality of words; and a dictionary search means for searching the dictionary using the kana character string as a key to obtain the spelling. Replacement kana character storage means for storing kana characters that should be replaced with other characters among the kana characters constituting the remaining part of the kana character string, and for storing unusual kana character string patterns to be replaced. Kana characters that replace all or part of the remaining part with another kana character string by referring to the non-replaced kana character string storage means, the replaced kana character storage means, and the unreplaced kana character string storage means. A phonetic kana-kanji conversion method comprising a replacement means and a means for forming a character string containing kanji and kana from the notation and the remaining portion.
JP58214537A 1983-11-15 1983-11-15 Voice kana (japanese syllabary) kanji (chinese character) converting system Pending JPS60107171A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58214537A JPS60107171A (en) 1983-11-15 1983-11-15 Voice kana (japanese syllabary) kanji (chinese character) converting system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58214537A JPS60107171A (en) 1983-11-15 1983-11-15 Voice kana (japanese syllabary) kanji (chinese character) converting system

Publications (1)

Publication Number Publication Date
JPS60107171A true JPS60107171A (en) 1985-06-12

Family

ID=16657372

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58214537A Pending JPS60107171A (en) 1983-11-15 1983-11-15 Voice kana (japanese syllabary) kanji (chinese character) converting system

Country Status (1)

Country Link
JP (1) JPS60107171A (en)

Similar Documents

Publication Publication Date Title
JP2013117978A (en) Generating method for typing candidate for improvement in typing efficiency
US4811400A (en) Method for transforming symbolic data
JPH0447440A (en) Converting system for word
JPS60107171A (en) Voice kana (japanese syllabary) kanji (chinese character) converting system
JP3531222B2 (en) Similar character string search device
JPS58123129A (en) Converting device of japanese syllabary to chinese character
JP2003178087A (en) Retrieval device and method for electronic foreign language dictionary
JPH07129572A (en) Kanji converter
JP3285149B2 (en) Foreign language electronic dictionary search method and apparatus
JPS6054041A (en) Pronounced kana-kanji conversion system
JPS62144269A (en) Information retrieving device
JPS58123126A (en) Dictionary retrieving device
JP3048793B2 (en) Character converter
JP2744241B2 (en) Character processor
JPS6175467A (en) Kana and kanji converting device
JPH0656609B2 (en) Chinese input device
JPS5998236A (en) Input device of japanese sentence
JPH07210571A (en) Device and method for word retrieval processing
JP2004355496A (en) Bilingual dictionary of english and another language in phoneme index multi-element matrix structure
JPS6024630A (en) Forming system of "kana" character string provided with control information
JPH0457148A (en) Document preparing device
JPH03204698A (en) Text voice synthesizing device
Wonderly et al. Linguistics and Christian Missions—Iii
JPH0345423B2 (en)
JPH10154141A (en) Kana-to-kanji (japanese syllabary-to-chinese character) conversion device