JPS59176835A - Method and device for producing sound input sentence - Google Patents

Method and device for producing sound input sentence

Info

Publication number
JPS59176835A
JPS59176835A JP58050648A JP5064883A JPS59176835A JP S59176835 A JPS59176835 A JP S59176835A JP 58050648 A JP58050648 A JP 58050648A JP 5064883 A JP5064883 A JP 5064883A JP S59176835 A JPS59176835 A JP S59176835A
Authority
JP
Japan
Prior art keywords
kana
character string
kanji
sentence
voice input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP58050648A
Other languages
Japanese (ja)
Other versions
JPH0376492B2 (en
Inventor
Yutaka Kamiyanagi
上柳 裕
Takahiko Ogita
荻田 隆彦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP58050648A priority Critical patent/JPS59176835A/en
Publication of JPS59176835A publication Critical patent/JPS59176835A/en
Publication of JPH0376492B2 publication Critical patent/JPH0376492B2/ja
Granted legal-status Critical Current

Links

Abstract

PURPOSE:To facilitate easy operation for the production of a sentence and at the same time to improve the processing efficiency by producing a sound input sentence without confirming successively the KANA (Japanses syllabary) character string corresponding to the sound input. CONSTITUTION:The KANA character string confirmed by a sound confirming part 3 is collated with the confirmation correspondence career list contained in a file device 1. The presence or absence is checked for the next candidate KANA character string different from said confirmed KANA character string. The result of this check is added to an analysis route. A KANA/KANJI (Chinese character) converting part 6 analyzes the form elements for all KANA character strings corresponding to each analysis route and produces a key KANJI-KANA character string for each route. The KANJI-KANA character train having the minimum number of form elements is selected and displayed on a display 4. When the presence of an error is informed from a console 5, the next candidate KANJI-KANA sentence is displayed. When this sentence is correct, it is sent to a sentence editing part 8. Then a series of sentences are edited.

Description

【発明の詳細な説明】 (a)発明の技術分野 本発明は文章作成装置に係り、とくに音声入力された文
を電子回路的に漢字“かな混り文に変換する装置に関す
る。
DETAILED DESCRIPTION OF THE INVENTION (a) Technical Field of the Invention The present invention relates to a text creation device, and more particularly to a device for converting a voice-input sentence into a kanji (kanji) or kana-mixed sentence using an electronic circuit.

(b)技術の背景 近時、日本語ワードプロセッサと称される文章作成装置
が普及しつつあるが、その入力方式には大別して、漢字
タブレット方式と“かな”漢字変換方式とがある。
(b) Technical Background Recently, text creation devices called Japanese word processors have become popular, and their input methods can be broadly divided into two types: the kanji tablet method and the "kana" kanji conversion method.

前者はタブレット上に表示されている漢字・かな”等の
中から必要な文字を選択して、辞書中の文字を検索する
ものであり、後者は文章を単語単位で“かな”文字で入
力し、この入力ごとに必要な単語を漢字に変換するもの
であって、同音異義語の場合には再変換を繰り返して必
要な漢字を得るものである。
The former allows you to search for characters in a dictionary by selecting the required character from the kanji/kana characters displayed on the tablet, while the latter allows you to enter sentences word by word using kana characters. , the necessary word is converted into Kanji for each input, and in the case of homophones, the reconversion is repeated to obtain the necessary Kanji.

(C)従来技術と問題点 従来の文書作成装置においては、上記のような文字タブ
レットを用いたり、あるいはキーボードから“かな”・
数字等の文字を入力するものが主体であった。
(C) Prior art and problems In conventional document creation devices, character tablets such as those mentioned above are used, or "kana" and "kana" characters are input from the keyboard.
It was mainly used to input characters such as numbers.

一方、マンマシン対話方式として最も自然な音声入出力
方式の開発が進められており、上記のような文章作成装
置に対しても音声入力方式を適用する試みがなされては
いるが、従来の方式は音声入力された“かな”をディス
プレイ画面上に逐次表示して“かな”文字列を作り、該
“かな”文字列を“かな”漢字変換するものであって、
常にディスプレイ画面上に現れる“かな”文字を確認し
ている必要があり、キーボード入力方式に比較して使い
易いものではなく、その改良が要望されていた。
On the other hand, the development of the most natural voice input/output method as a man-machine dialogue method is progressing, and attempts have been made to apply the voice input method to the above-mentioned text creation devices. is a system that sequentially displays ``kana'' input by voice on a display screen to create a ``kana'' character string, and converts the ``kana'' character string into ``kana'' into kanji.
Since it is necessary to constantly check the "kana" characters appearing on the display screen, it is not as easy to use as the keyboard input method, and improvements have been desired.

(d)発明の目的 本発明の目的は、音声入力文章作成において音声入力に
対応する“かな”文字列を逐次確認することなく文章作
成を実行可能とすることである。
(d) Object of the Invention An object of the present invention is to enable text creation by voice input without sequentially checking the "kana" character string corresponding to the voice input.

(e)発明の構成 本発明は、“かな”漢字変換を用いる音声入力文章作成
において、認識対応履歴表(例えば、ある“かな”文字
“力”が音声入力された時に、これが“か”として認識
された場合、“は”として認識された場合、その他の“
かな”文字として認識された場合等のそれぞれの場合の
認識度数の統計データから成るテーブル)を音声パター
ンと同じファイルにあらかじめ格納しておき、音声入力
文章作成処理開始時ごとに該認識対応履歴表を文章作成
部に転送し、認識“かな”文字列と該認識対応履歴表と
を照合して該認識“かな”文字列とは別に可能な“かな
”文字列の有無を検証し、別に可能な“かな”文字列が
存在する場合にはこれに対応する解析ルートを開設し、
該解析ルートごとに前記別に可能な“かな”文字列のそ
れぞれを順次解析して漢字“かな”混り文を出力するこ
とを特徴とする。
(e) Structure of the Invention The present invention provides a recognition correspondence history table (for example, when a certain "kana" character "chi" is input by voice, when creating a voice input text using "kana" kanji conversion, If it is recognized as “wa”, other “
A table consisting of statistical data of the recognition frequency in each case, such as when the character is recognized as a kana character, is stored in advance in the same file as the voice pattern, and the recognition correspondence history table is stored each time the voice input sentence creation process starts. is transferred to the text creation department, and the recognized "kana" character string is compared with the recognition correspondence history table to verify whether there is a possible "kana" character string other than the recognized "kana" character string, and to determine whether there is a possible "kana" character string. If a “kana” character string exists, an analysis route corresponding to this is established,
It is characterized in that each of the separately possible "kana" character strings is sequentially analyzed for each analysis route and a sentence containing kanji "kana" is output.

Tf1発明の実施例 以下に本発明の実施例を図面を参照して説明する。Examples of Tf1 invention Embodiments of the present invention will be described below with reference to the drawings.

第1図および第2図はそれぞれ従来および本発明に係る
音声入力文章作成装置の機能ブロック図である。
FIG. 1 and FIG. 2 are functional block diagrams of a conventional voice input sentence creation device and a voice input sentence creation device according to the present invention, respectively.

第1図において、ファイル装置1にはあらかじめ音声入
力操作者ごとの標準音声パターンが登録されている。マ
イク2から入力された該操作者の音声は音声認識部3に
おいて音節(一般に“かな”文字)1!位にパターン分
解されたのちファイル装置1の標準音声パターンと比較
され、該標準音声パターンの内の最も類似している音節
、すなわち“かな”文字として認識される。
In FIG. 1, a standard voice pattern for each voice input operator is registered in advance in a file device 1. The operator's voice input from the microphone 2 is processed by the voice recognition unit 3 into 1 syllable (generally "kana" character)! After the pattern is decomposed into digits, it is compared with the standard speech pattern of the file device 1 and recognized as the most similar syllable in the standard speech pattern, that is, the "kana" character.

このようにして認識された、入力音声に対応する1かな
”文字列がディスプレイ4上に表示され、音声入力“か
な”文字が正しく認識されたと判定した場合には操作卓
5から“かな”漢字変換の実行を指示し、あるいは句読
点等を検知することにより実行の開始が提起され、これ
により該認識“かな”文字列は、“かな”漢字変換部6
において解析され(該“かな”文字列を単語辞書を参照
しながら形態素と呼ぶ単位に種々に分割し、該形態素に
含まれている各形態素間の接続情報、最長形態素等を調
べるもので、一般には該“かな”文字列を最長一致法に
基づき品詞分解し、該品詞間の接続関係を調べる)、最
も確からしい単語の配列、すなわち漢字仮名混り列が選
び出され、これが漢字“かな”混り文としてディスプレ
イ4上に表示される。
The character string ``1 Kana'' that corresponds to the input voice recognized in this way is displayed on the display 4, and if it is determined that the voice input ``Kana'' character has been correctly recognized, the character string ``Kana'' is displayed on the console 5. The execution is started by instructing execution of conversion or by detecting a punctuation mark, etc., and the recognized “kana” character string is converted into “kana” by the “kana” kanji conversion unit 6.
(The method is to divide the "kana" character string into various units called morphemes while referring to a word dictionary, and check the connection information between each morpheme included in the morpheme, the longest morpheme, etc. decomposes the "kana" character string into parts of speech based on the longest match method and examines the connections between the parts of speech), and selects the most probable word arrangement, that is, a combination of kanji and kana characters, and this is the kanji "kana". It is displayed on the display 4 as a mixed sentence.

ディスプレイ4上に表示された“かな”文字列に誤りが
ある場合には操作卓5からその旨を6知し、マイク2か
ら当該“かな”文字を再度音声入力し、正しく認識され
たかどうかをディスプレイ4上で確認する。また、ディ
スプレイ4上に表示された漢字“かな”混り文に誤りが
ある場合には操作卓5からその旨入力すると、音声認識
部3に対して次候補の“かな”文字列の出力が要求され
(音声認識部3では入力音声とファイル装置1の音声パ
ターンとの近偵度に基づき複数の候補認識“かな”文字
列を設定しており、各候補間の距離をスレッショルド値
と比較し一近似度情報を得る一該スレソショルド値以下
の距離の候補間にノードを形成し該各候補を順次選択す
る)、該次候補の“かな”文字列につき“かな”漢字変
換が行われて、ディスプレイ4上に対応する漢字“かな
”混り文が出力される。
If there is an error in the "kana" character string displayed on the display 4, the operation console 5 notifies you of this, and the user inputs the "kana" character again aloud from the microphone 2 to check whether it has been correctly recognized. Check on Display 4. In addition, if there is an error in the kanji character string "kana" displayed on the display 4, input that fact from the console 5, and the next candidate "kana" character string will be output to the speech recognition unit 3. (The voice recognition unit 3 sets multiple candidate recognition "kana" character strings based on the degree of closeness between the input voice and the voice pattern of the file device 1, and compares the distance between each candidate with a threshold value.) (1) Obtaining similarity information (1) Forming nodes between candidates whose distance is less than the threshold value and selecting each candidate in turn), "Kana" Kanji conversion is performed for the "Kana" character string of the next candidate, The corresponding kanji "kana" mixed sentence is output on the display 4.

本発明においては、音声入力操作者ごとの標準音声パタ
ーンを格納しているファイル装置1に前記のような認識
対応履歴表をあらかじめ格納しYおく。また、第2図に
示すように“かな”前処理部9および“かな”後処理部
10を“かな”漢字変換部6の前後に設けている。
In the present invention, a recognition correspondence history table as described above is stored in advance in the file device 1 that stores standard voice patterns for each voice input operator. Further, as shown in FIG. 2, a "kana" preprocessing section 9 and a "kana" postprocessing section 10 are provided before and after the "kana" kanji conversion section 6.

第2図において、音声入力文章作成処理の開始により、
ファイル装置1に格納されている前記認識対応履歴表が
“かな”前処理部9のファイル装置11に転送される。
In FIG. 2, with the start of the voice input sentence creation process,
The recognition correspondence history table stored in the file device 1 is transferred to the file device 11 of the “kana” preprocessing section 9.

マイク2からの音声入力“かな”文字はファイル装置1
の音声パターンと特徴点の照合をされて音声認識部3に
より順次認識され、認識“かな”文字列としてファイル
装置11に格納される。この場合、入力“かな”文字が
誤認識された可能性の有無を検証するために、認識“か
な”文字列と前記認識対応履歴表とが照合され、認識“
かな”文字列とは別の“かな”文字列が有り得る場合に
は、前記音声認識部3における次候補の“かな”文字列
の解析ルートに追加して、該別の“かな”文字列のそれ
ぞれを解析するためのルートが開設される。
The voice input “kana” characters from microphone 2 are sent to file device 1.
The voice pattern and the feature points are compared, sequentially recognized by the voice recognition section 3, and stored in the file device 11 as a recognized "kana" character string. In this case, in order to verify whether there is a possibility that the input "kana" character was misrecognized, the recognized "kana" character string is compared with the recognition correspondence history table, and the recognized "kana" character string is compared with the recognition correspondence history table.
If there is a possibility that there is a different “kana” character string from the “kana” character string, it is added to the analysis route of the next candidate “kana” character string in the speech recognition unit 3, and the other “kana” character string is analyzed. A route will be established to analyze each.

以下、従来と同様にして“かな”漢字変換部6において
、ファイル装置7に格納されている単語辞書を参照しな
がら、上記のルートに対応する“かな”文字列すべてに
ついて順次形態素解析が行われ、各ルートごとに最も確
からしい代表漢字“かな”混り列が作成され、これらの
代表漢字“かな”混り列のうちで、該漢字“かな”混り
列を構成する形態素の数が最小となるものが選択され。
Thereafter, in the same manner as before, in the "kana" kanji conversion unit 6, morphological analysis is sequentially performed on all "kana" character strings corresponding to the above root while referring to the word dictionary stored in the file device 7. , the most probable representative kanji "kana" mixed string is created for each route, and among these representative kanji "kana" mixed strings, the number of morphemes that make up the kanji "kana" mixed string is the smallest. is selected.

これがディスプレイ4上に漢字“かな”混り文として表
示れ、音声入力操作者によりその正当性が判定される。
This is displayed on the display 4 as a sentence containing the kanji "kana", and its validity is determined by the voice input operator.

ディスプレイ4上に表示された該漢字“かな”混り文に
誤りがある等判定された場合には、操作卓5からその旨
を入力すると、“がな”漢字変換部6に対して次候補の
漢字“かな”混り列の送出が要求され、これに対応する
漢字“がな”混り文がディスプレイ4表示される。
If it is determined that there is an error in the kanji ``kana'' character displayed on the display 4, input that information from the console 5, and the ``gana'' kanji converter 6 will select the next candidate. The transmission of the kanji "kana" mixed string is requested, and the corresponding kanji "gana" mixed string is displayed on the display 4.

正しい漢字“かな”混り文が出力された旨が操作卓5か
ら入力されると、これが文章編集部8に一時保存され、
個々の漢字“かな”混り文が一連の文章として編集され
る。
When a message indicating that a sentence containing the correct kanji "kana" has been output is input from the console 5, this is temporarily stored in the sentence editing section 8, and
Sentences containing individual kanji "kana" are edited as a series of sentences.

本発明においては、最終的に選択された漢字“かな”混
り列に対応する“かな”文字列と音声認識部3によって
認識された“かな”文字列とが“かな”後処理部10に
おいて比較され、その対応関係に基づきファイル装置1
1上に格納されている認識対応履歴表のデータ、すなわ
ち認識対応度数値が更新される。
In the present invention, the “kana” character string corresponding to the finally selected kanji “kana” mixed string and the “kana” character string recognized by the speech recognition unit 3 are processed in the “kana” post-processing unit 10. File device 1 is compared based on the correspondence relationship.
The data in the recognition correspondence history table stored on 1, that is, the recognition correspondence degree value is updated.

更新された認識対応履歴表は音声入力操作が終了時にフ
ァイル装置1に転送される。
The updated recognition correspondence history table is transferred to the file device 1 when the voice input operation is completed.

上記のようにして作成された漢字“かな”混り文はプリ
ンタ12により印字出力される。
The sentence containing the kanji "kana" created as described above is printed out by the printer 12.

なお、第1図および第2図において13はディスプレイ
4およびプリンタ12の出力制御装置である。
1 and 2, reference numeral 13 represents an output control device for the display 4 and printer 12. In FIG.

(8)発明の効果 本発明によれば、音声入力される“かな”文字を逐次確
認した上で“かな”漢字変換する必要がなく、任意の一
定長さの漢字“かな”混り文の状態で認識・変換の正し
さを判定すればよく、音声入力文章作成操作を容易にす
るとともに処理能率を向上する効果が大である。
(8) Effects of the Invention According to the present invention, there is no need to sequentially check the "kana" characters that are input by voice and convert them into "kana", and it is not necessary to convert them into "kana" or "kana" characters of any given length. It is sufficient to judge the correctness of recognition/conversion based on the state, which has a great effect of facilitating the voice input sentence creation operation and improving processing efficiency.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図および第2図はそれぞれ従来および本発明に係る
音声入力文章作成装置の機能ブロック図である。 図において、1はファイル装置、2はマイク、3は音声
認識部、4はディスプレイ、5は操作卓、6は“かな”
漢字変換部、7および11はファイル装置、8は文章編
集部、9は“かな”前処理部、10は“かな”後処理部
、12はプリンタ、13は出方制御装置である。 見 1 図
FIG. 1 and FIG. 2 are functional block diagrams of a conventional voice input sentence creation device and a voice input sentence creation device according to the present invention, respectively. In the figure, 1 is a file device, 2 is a microphone, 3 is a voice recognition unit, 4 is a display, 5 is a console, and 6 is a “kana”
7 and 11 are file devices, 8 is a text editing section, 9 is a "kana" pre-processing section, 10 is a "kana" post-processing section, 12 is a printer, and 13 is an output control device. See 1 figure

Claims (4)

【特許請求の範囲】[Claims] (1)“かな”漢字変換を用いる音声入力文章作成刃−
式において、認識“かな”文字列と認識対応履歴表とを
照合して認識“かな”文字列とは別に可能な“かな”文
字列の有無を検証し、別に可能な“かな”文字列が存在
する場合にはこれに対応する解析ルートを開設し、該解
析ルートごとに前記別に可能な“かな”文字列のそれぞ
れを順次解析して漢字“かな”混り文を出力することを
特徴とする音声入力文章作成方式。
(1) Voice input sentence creation blade using “kana” kanji conversion
In the formula, the recognized "kana" character string is compared with the recognition correspondence history table to verify the presence or absence of a possible "kana" character string other than the recognized "kana" character string, and to determine the presence or absence of a possible "kana" character string separately from the recognized "kana" character string. If it exists, a corresponding analysis route is established, and for each analysis route, each of the separately possible "kana" character strings is sequentially analyzed and a sentence containing kanji "kana" is output. Voice input text creation method.
(2)音声入力操作者ごとに認識対応履歴表を音声パタ
ーンと同じファイルに格納しておき、音声人力−操作時
に文章作成部に転送することを特徴とする特許請求の範
囲第1項記載の音声入力文章作成方式。
(2) The recognition correspondence history table is stored in the same file as the voice pattern for each voice input operator, and is transferred to the text creation section at the time of voice input operation. Voice input text creation method.
(3)認識“かな”文字列におよび各“がな”文字と出
力漢字”かな”混り文に対応する“がな”文字列におけ
る各“かな”文字の対応するものどうしを比較し、その
対応関係に基づき認識対応履歴表の内容を更新すること
を特徴とする特許請求の範囲第1項記載の音声入力文章
作成方式。
(3) Compare the correspondence of each “kana” character in the recognized “kana” character string and the “gana” character string corresponding to each “kana” character and the output kanji “kana” mixed sentence, The voice input sentence creation method according to claim 1, wherein the content of the recognition correspondence history table is updated based on the correspondence relationship.
(4)“かな”漢字変換を用いる音声入力文章作成装置
において、認識対応履歴表を格納した音声パターンファ
イルと、該認識対応履歴表および音声入力情報に基づき
複数の解析ルートを開設ならびに −制御するための“
かな”漢字前処理部と、認識“かな”文字列における各
“かな”文字と出力漢字“かな”混り文に対応する“か
な”文字列における各“かな”文字の対応するものどう
しを比較し、その対応関係に基づき認識対応履歴表の内
容を更新するための“かな”後処理部を設けたことを特
徴とする音声入力文章作成装置。
(4) In a voice input text creation device that uses “kana” Kanji conversion, create and control multiple analysis routes based on a voice pattern file storing a recognition correspondence history table and the recognition correspondence history table and voice input information. for"
The Kana/Kanji preprocessing unit compares each “Kana” character in the recognized “Kana” character string with the corresponding “Kana” character in the “Kana” character string corresponding to the output Kanji “Kana” mixed sentence. and a "kana" post-processing unit for updating the contents of the recognition correspondence history table based on the correspondence relationship.
JP58050648A 1983-03-26 1983-03-26 Method and device for producing sound input sentence Granted JPS59176835A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58050648A JPS59176835A (en) 1983-03-26 1983-03-26 Method and device for producing sound input sentence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58050648A JPS59176835A (en) 1983-03-26 1983-03-26 Method and device for producing sound input sentence

Publications (2)

Publication Number Publication Date
JPS59176835A true JPS59176835A (en) 1984-10-06
JPH0376492B2 JPH0376492B2 (en) 1991-12-05

Family

ID=12864754

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58050648A Granted JPS59176835A (en) 1983-03-26 1983-03-26 Method and device for producing sound input sentence

Country Status (1)

Country Link
JP (1) JPS59176835A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6381586A (en) * 1986-09-25 1988-04-12 Toshiba Corp Information processor

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59132038A (en) * 1983-01-17 1984-07-30 Nec Corp Evaluating method of kana character string

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS59132038A (en) * 1983-01-17 1984-07-30 Nec Corp Evaluating method of kana character string

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6381586A (en) * 1986-09-25 1988-04-12 Toshiba Corp Information processor

Also Published As

Publication number Publication date
JPH0376492B2 (en) 1991-12-05

Similar Documents

Publication Publication Date Title
US4468756A (en) Method and apparatus for processing languages
US6188977B1 (en) Natural language processing apparatus and method for converting word notation grammar description data
JP5231698B2 (en) How to predict how to read Japanese ideograms
JPH0352058A (en) Document processor for voice input
JPS59176835A (en) Method and device for producing sound input sentence
JPS62165267A (en) Voice word processor device
CN111429886A (en) Voice recognition method and system
JPH11338498A (en) Voice synthesizer
JP2004206659A (en) Reading information determination method, device, and program
JPH0561905A (en) Sentence analyzing device
JPS59121425A (en) Chinese phonetic alphabet of kanji converter
JP3001334B2 (en) Language processor for recognition
JPS62224859A (en) Japanese language processing system
JPH08272780A (en) Processor and method for chinese input processing, and processor and method for language processing
JPH0916575A (en) Pronunciation dictionary device
JPS61223971A (en) Sentence generating device
JPS6315633B2 (en)
JPH0778155A (en) Document recognizing device
JPS61292775A (en) Numeral-kanji converting system
JPH11161651A (en) Phonetic symbol generator
JPS58214931A (en) Word separating device
JPH05298364A (en) Phonetic symbol forming system
JPH0414168A (en) Word processor
JPH11232268A (en) Document processor, agate arranging method and storage medium
JPS63316160A (en) Document preparing device