JPS59176835A

JPS59176835A - Method and device for producing sound input sentence

Info

Publication number: JPS59176835A
Application number: JP58050648A
Authority: JP
Inventors: Yutaka Kamiyanagi; 上柳　裕; Takahiko Ogita; 荻田　隆彦
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1983-03-26
Filing date: 1983-03-26
Publication date: 1984-10-06
Also published as: JPH0376492B2

Abstract

PURPOSE:To facilitate easy operation for the production of a sentence and at the same time to improve the processing efficiency by producing a sound input sentence without confirming successively the KANA (Japanses syllabary) character string corresponding to the sound input. CONSTITUTION:The KANA character string confirmed by a sound confirming part 3 is collated with the confirmation correspondence career list contained in a file device 1. The presence or absence is checked for the next candidate KANA character string different from said confirmed KANA character string. The result of this check is added to an analysis route. A KANA/KANJI (Chinese character) converting part 6 analyzes the form elements for all KANA character strings corresponding to each analysis route and produces a key KANJI-KANA character string for each route. The KANJI-KANA character train having the minimum number of form elements is selected and displayed on a display 4. When the presence of an error is informed from a console 5, the next candidate KANJI-KANA sentence is displayed. When this sentence is correct, it is sent to a sentence editing part 8. Then a series of sentences are edited.

Description

【発明の詳細な説明】（ａ）発明の技術分野本発明は文章作成装置に係り、とくに音声入力された文
を電子回路的に漢字“かな混り文に変換する装置に関す
る。DETAILED DESCRIPTION OF THE INVENTION (a) Technical Field of the Invention The present invention relates to a text creation device, and more particularly to a device for converting a voice-input sentence into a kanji (kanji) or kana-mixed sentence using an electronic circuit.

（ｂ）技術の背景近時、日本語ワードプロセッサと称される文章作成装置
が普及しつつあるが、その入力方式には大別して、漢字
タブレット方式と“かな”漢字変換方式とがある。(b) Technical Background Recently, text creation devices called Japanese word processors have become popular, and their input methods can be broadly divided into two types: the kanji tablet method and the "kana" kanji conversion method.

前者はタブレット上に表示されている漢字・かな”等の
中から必要な文字を選択して、辞書中の文字を検索する
ものであり、後者は文章を単語単位で“かな”文字で入
力し、この入力ごとに必要な単語を漢字に変換するもの
であって、同音異義語の場合には再変換を繰り返して必
要な漢字を得るものである。The former allows you to search for characters in a dictionary by selecting the required character from the kanji/kana characters displayed on the tablet, while the latter allows you to enter sentences word by word using kana characters. , the necessary word is converted into Kanji for each input, and in the case of homophones, the reconversion is repeated to obtain the necessary Kanji.

（Ｃ）従来技術と問題点従来の文書作成装置においては、上記のような文字タブ
レットを用いたり、あるいはキーボードから“かな”・
数字等の文字を入力するものが主体であった。(C) Prior art and problems In conventional document creation devices, character tablets such as those mentioned above are used, or "kana" and "kana" characters are input from the keyboard.
It was mainly used to input characters such as numbers.

一方、マンマシン対話方式として最も自然な音声入出力
方式の開発が進められており、上記のような文章作成装
置に対しても音声入力方式を適用する試みがなされては
いるが、従来の方式は音声入力された“かな”をディス
プレイ画面上に逐次表示して“かな”文字列を作り、該
“かな”文字列を“かな”漢字変換するものであって、
常にディスプレイ画面上に現れる“かな”文字を確認し
ている必要があり、キーボード入力方式に比較して使い
易いものではなく、その改良が要望されていた。On the other hand, the development of the most natural voice input/output method as a man-machine dialogue method is progressing, and attempts have been made to apply the voice input method to the above-mentioned text creation devices. is a system that sequentially displays ``kana'' input by voice on a display screen to create a ``kana'' character string, and converts the ``kana'' character string into ``kana'' into kanji.
Since it is necessary to constantly check the "kana" characters appearing on the display screen, it is not as easy to use as the keyboard input method, and improvements have been desired.

（ｄ）発明の目的本発明の目的は、音声入力文章作成において音声入力に
対応する“かな”文字列を逐次確認することなく文章作
成を実行可能とすることである。(d) Object of the Invention An object of the present invention is to enable text creation by voice input without sequentially checking the "kana" character string corresponding to the voice input.

（ｅ）発明の構成本発明は、“かな”漢字変換を用いる音声入力文章作成
において、認識対応履歴表（例えば、ある“かな”文字
“力”が音声入力された時に、これが“か”として認識
された場合、“は”として認識された場合、その他の“
かな”文字として認識された場合等のそれぞれの場合の
認識度数の統計データから成るテーブル）を音声パター
ンと同じファイルにあらかじめ格納しておき、音声入力
文章作成処理開始時ごとに該認識対応履歴表を文章作成
部に転送し、認識“かな”文字列と該認識対応履歴表と
を照合して該認識“かな”文字列とは別に可能な“かな
”文字列の有無を検証し、別に可能な“かな”文字列が
存在する場合にはこれに対応する解析ルートを開設し、
該解析ルートごとに前記別に可能な“かな”文字列のそ
れぞれを順次解析して漢字“かな”混り文を出力するこ
とを特徴とする。(e) Structure of the Invention The present invention provides a recognition correspondence history table (for example, when a certain "kana" character "chi" is input by voice, when creating a voice input text using "kana" kanji conversion, If it is recognized as “wa”, other “
A table consisting of statistical data of the recognition frequency in each case, such as when the character is recognized as a kana character, is stored in advance in the same file as the voice pattern, and the recognition correspondence history table is stored each time the voice input sentence creation process starts. is transferred to the text creation department, and the recognized "kana" character string is compared with the recognition correspondence history table to verify whether there is a possible "kana" character string other than the recognized "kana" character string, and to determine whether there is a possible "kana" character string. If a “kana” character string exists, an analysis route corresponding to this is established,
It is characterized in that each of the separately possible "kana" character strings is sequentially analyzed for each analysis route and a sentence containing kanji "kana" is output.

Ｔｆ１発明の実施例以下に本発明の実施例を図面を参照して説明する。Examples of Tf1 invention Embodiments of the present invention will be described below with reference to the drawings.

第１図および第２図はそれぞれ従来および本発明に係る
音声入力文章作成装置の機能ブロック図である。FIG. 1 and FIG. 2 are functional block diagrams of a conventional voice input sentence creation device and a voice input sentence creation device according to the present invention, respectively.

第１図において、ファイル装置１にはあらかじめ音声入
力操作者ごとの標準音声パターンが登録されている。マ
イク２から入力された該操作者の音声は音声認識部３に
おいて音節（一般に“かな”文字）１！位にパターン分
解されたのちファイル装置１の標準音声パターンと比較
され、該標準音声パターンの内の最も類似している音節
、すなわち“かな”文字として認識される。In FIG. 1, a standard voice pattern for each voice input operator is registered in advance in a file device 1. The operator's voice input from the microphone 2 is processed by the voice recognition unit 3 into 1 syllable (generally "kana" character)! After the pattern is decomposed into digits, it is compared with the standard speech pattern of the file device 1 and recognized as the most similar syllable in the standard speech pattern, that is, the "kana" character.

このようにして認識された、入力音声に対応する１かな
”文字列がディスプレイ４上に表示され、音声入力“か
な”文字が正しく認識されたと判定した場合には操作卓
５から“かな”漢字変換の実行を指示し、あるいは句読
点等を検知することにより実行の開始が提起され、これ
により該認識“かな”文字列は、“かな”漢字変換部６
において解析され（該“かな”文字列を単語辞書を参照
しながら形態素と呼ぶ単位に種々に分割し、該形態素に
含まれている各形態素間の接続情報、最長形態素等を調
べるもので、一般には該“かな”文字列を最長一致法に
基づき品詞分解し、該品詞間の接続関係を調べる）、最
も確からしい単語の配列、すなわち漢字仮名混り列が選
び出され、これが漢字“かな”混り文としてディスプレ
イ４上に表示される。The character string ``1 Kana'' that corresponds to the input voice recognized in this way is displayed on the display 4, and if it is determined that the voice input ``Kana'' character has been correctly recognized, the character string ``Kana'' is displayed on the console 5. The execution is started by instructing execution of conversion or by detecting a punctuation mark, etc., and the recognized “kana” character string is converted into “kana” by the “kana” kanji conversion unit 6.
(The method is to divide the "kana" character string into various units called morphemes while referring to a word dictionary, and check the connection information between each morpheme included in the morpheme, the longest morpheme, etc. decomposes the "kana" character string into parts of speech based on the longest match method and examines the connections between the parts of speech), and selects the most probable word arrangement, that is, a combination of kanji and kana characters, and this is the kanji "kana". It is displayed on the display 4 as a mixed sentence.

ディスプレイ４上に表示された“かな”文字列に誤りが
ある場合には操作卓５からその旨を６知し、マイク２か
ら当該“かな”文字を再度音声入力し、正しく認識され
たかどうかをディスプレイ４上で確認する。また、ディ
スプレイ４上に表示された漢字“かな”混り文に誤りが
ある場合には操作卓５からその旨入力すると、音声認識
部３に対して次候補の“かな”文字列の出力が要求され
（音声認識部３では入力音声とファイル装置１の音声パ
ターンとの近偵度に基づき複数の候補認識“かな”文字
列を設定しており、各候補間の距離をスレッショルド値
と比較し一近似度情報を得る一該スレソショルド値以下
の距離の候補間にノードを形成し該各候補を順次選択す
る）、該次候補の“かな”文字列につき“かな”漢字変
換が行われて、ディスプレイ４上に対応する漢字“かな
”混り文が出力される。If there is an error in the "kana" character string displayed on the display 4, the operation console 5 notifies you of this, and the user inputs the "kana" character again aloud from the microphone 2 to check whether it has been correctly recognized. Check on Display 4. In addition, if there is an error in the kanji character string "kana" displayed on the display 4, input that fact from the console 5, and the next candidate "kana" character string will be output to the speech recognition unit 3. (The voice recognition unit 3 sets multiple candidate recognition "kana" character strings based on the degree of closeness between the input voice and the voice pattern of the file device 1, and compares the distance between each candidate with a threshold value.) (1) Obtaining similarity information (1) Forming nodes between candidates whose distance is less than the threshold value and selecting each candidate in turn), "Kana" Kanji conversion is performed for the "Kana" character string of the next candidate, The corresponding kanji "kana" mixed sentence is output on the display 4.

本発明においては、音声入力操作者ごとの標準音声パタ
ーンを格納しているファイル装置１に前記のような認識
対応履歴表をあらかじめ格納しＹおく。また、第２図に
示すように“かな”前処理部９および“かな”後処理部
１０を“かな”漢字変換部６の前後に設けている。In the present invention, a recognition correspondence history table as described above is stored in advance in the file device 1 that stores standard voice patterns for each voice input operator. Further, as shown in FIG. 2, a "kana" preprocessing section 9 and a "kana" postprocessing section 10 are provided before and after the "kana" kanji conversion section 6.

第２図において、音声入力文章作成処理の開始により、
ファイル装置１に格納されている前記認識対応履歴表が
“かな”前処理部９のファイル装置１１に転送される。In FIG. 2, with the start of the voice input sentence creation process,
The recognition correspondence history table stored in the file device 1 is transferred to the file device 11 of the “kana” preprocessing section 9.

マイク２からの音声入力“かな”文字はファイル装置１
の音声パターンと特徴点の照合をされて音声認識部３に
より順次認識され、認識“かな”文字列としてファイル
装置１１に格納される。この場合、入力“かな”文字が
誤認識された可能性の有無を検証するために、認識“か
な”文字列と前記認識対応履歴表とが照合され、認識“
かな”文字列とは別の“かな”文字列が有り得る場合に
は、前記音声認識部３における次候補の“かな”文字列
の解析ルートに追加して、該別の“かな”文字列のそれ
ぞれを解析するためのルートが開設される。The voice input “kana” characters from microphone 2 are sent to file device 1.
The voice pattern and the feature points are compared, sequentially recognized by the voice recognition section 3, and stored in the file device 11 as a recognized "kana" character string. In this case, in order to verify whether there is a possibility that the input "kana" character was misrecognized, the recognized "kana" character string is compared with the recognition correspondence history table, and the recognized "kana" character string is compared with the recognition correspondence history table.
If there is a possibility that there is a different “kana” character string from the “kana” character string, it is added to the analysis route of the next candidate “kana” character string in the speech recognition unit 3, and the other “kana” character string is analyzed. A route will be established to analyze each.

以下、従来と同様にして“かな”漢字変換部６において
、ファイル装置７に格納されている単語辞書を参照しな
がら、上記のルートに対応する“かな”文字列すべてに
ついて順次形態素解析が行われ、各ルートごとに最も確
からしい代表漢字“かな”混り列が作成され、これらの
代表漢字“かな”混り列のうちで、該漢字“かな”混り
列を構成する形態素の数が最小となるものが選択され。Thereafter, in the same manner as before, in the "kana" kanji conversion unit 6, morphological analysis is sequentially performed on all "kana" character strings corresponding to the above root while referring to the word dictionary stored in the file device 7. , the most probable representative kanji "kana" mixed string is created for each route, and among these representative kanji "kana" mixed strings, the number of morphemes that make up the kanji "kana" mixed string is the smallest. is selected.

これがディスプレイ４上に漢字“かな”混り文として表
示れ、音声入力操作者によりその正当性が判定される。This is displayed on the display 4 as a sentence containing the kanji "kana", and its validity is determined by the voice input operator.

ディスプレイ４上に表示された該漢字“かな”混り文に
誤りがある等判定された場合には、操作卓５からその旨
を入力すると、“がな”漢字変換部６に対して次候補の
漢字“かな”混り列の送出が要求され、これに対応する
漢字“がな”混り文がディスプレイ４表示される。If it is determined that there is an error in the kanji ``kana'' character displayed on the display 4, input that information from the console 5, and the ``gana'' kanji converter 6 will select the next candidate. The transmission of the kanji "kana" mixed string is requested, and the corresponding kanji "gana" mixed string is displayed on the display 4.

正しい漢字“かな”混り文が出力された旨が操作卓５か
ら入力されると、これが文章編集部８に一時保存され、
個々の漢字“かな”混り文が一連の文章として編集され
る。When a message indicating that a sentence containing the correct kanji "kana" has been output is input from the console 5, this is temporarily stored in the sentence editing section 8, and
Sentences containing individual kanji "kana" are edited as a series of sentences.

本発明においては、最終的に選択された漢字“かな”混
り列に対応する“かな”文字列と音声認識部３によって
認識された“かな”文字列とが“かな”後処理部１０に
おいて比較され、その対応関係に基づきファイル装置１
１上に格納されている認識対応履歴表のデータ、すなわ
ち認識対応度数値が更新される。In the present invention, the “kana” character string corresponding to the finally selected kanji “kana” mixed string and the “kana” character string recognized by the speech recognition unit 3 are processed in the “kana” post-processing unit 10. File device 1 is compared based on the correspondence relationship.
The data in the recognition correspondence history table stored on 1, that is, the recognition correspondence degree value is updated.

更新された認識対応履歴表は音声入力操作が終了時にフ
ァイル装置１に転送される。The updated recognition correspondence history table is transferred to the file device 1 when the voice input operation is completed.

上記のようにして作成された漢字“かな”混り文はプリ
ンタ１２により印字出力される。The sentence containing the kanji "kana" created as described above is printed out by the printer 12.

なお、第１図および第２図において１３はディスプレイ
４およびプリンタ１２の出力制御装置である。1 and 2, reference numeral 13 represents an output control device for the display 4 and printer 12. In FIG.

（８）発明の効果本発明によれば、音声入力される“かな”文字を逐次確
認した上で“かな”漢字変換する必要がなく、任意の一
定長さの漢字“かな”混り文の状態で認識・変換の正し
さを判定すればよく、音声入力文章作成操作を容易にす
るとともに処理能率を向上する効果が大である。(8) Effects of the Invention According to the present invention, there is no need to sequentially check the "kana" characters that are input by voice and convert them into "kana", and it is not necessary to convert them into "kana" or "kana" characters of any given length. It is sufficient to judge the correctness of recognition/conversion based on the state, which has a great effect of facilitating the voice input sentence creation operation and improving processing efficiency.

[Brief explanation of the drawing]

第１図および第２図はそれぞれ従来および本発明に係る
音声入力文章作成装置の機能ブロック図である。図において、１はファイル装置、２はマイク、３は音声
認識部、４はディスプレイ、５は操作卓、６は“かな”
漢字変換部、７および１１はファイル装置、８は文章編
集部、９は“かな”前処理部、１０は“かな”後処理部
、１２はプリンタ、１３は出方制御装置である。見　１　図FIG. 1 and FIG. 2 are functional block diagrams of a conventional voice input sentence creation device and a voice input sentence creation device according to the present invention, respectively. In the figure, 1 is a file device, 2 is a microphone, 3 is a voice recognition unit, 4 is a display, 5 is a console, and 6 is a “kana”
7 and 11 are file devices, 8 is a text editing section, 9 is a "kana" pre-processing section, 10 is a "kana" post-processing section, 12 is a printer, and 13 is an output control device. See 1 figure

Claims

[Claims]

(1) Voice input sentence creation blade using “kana” kanji conversion
In the formula, the recognized "kana" character string is compared with the recognition correspondence history table to verify the presence or absence of a possible "kana" character string other than the recognized "kana" character string, and to determine the presence or absence of a possible "kana" character string separately from the recognized "kana" character string. If it exists, a corresponding analysis route is established, and for each analysis route, each of the separately possible "kana" character strings is sequentially analyzed and a sentence containing kanji "kana" is output. Voice input text creation method.

(2) The recognition correspondence history table is stored in the same file as the voice pattern for each voice input operator, and is transferred to the text creation section at the time of voice input operation. Voice input text creation method.

(3) Compare the correspondence of each “kana” character in the recognized “kana” character string and the “gana” character string corresponding to each “kana” character and the output kanji “kana” mixed sentence, The voice input sentence creation method according to claim 1, wherein the content of the recognition correspondence history table is updated based on the correspondence relationship.

(4) In a voice input text creation device that uses “kana” Kanji conversion, create and control multiple analysis routes based on a voice pattern file storing a recognition correspondence history table and the recognition correspondence history table and voice input information. for"
The Kana/Kanji preprocessing unit compares each “Kana” character in the recognized “Kana” character string with the corresponding “Kana” character in the “Kana” character string corresponding to the output Kanji “Kana” mixed sentence. and a "kana" post-processing unit for updating the contents of the recognition correspondence history table based on the correspondence relationship.