JPS60105037A

JPS60105037A - Production system of voice input sentence

Info

Publication number: JPS60105037A
Application number: JP58212019A
Authority: JP
Inventors: Yutaka Kamiyanagi; 上柳　裕; Takahiko Ogita; 荻田　隆彦
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1983-11-11
Filing date: 1983-11-11
Publication date: 1985-06-10

Abstract

PURPOSE:To reduce the load of a processor and to improve the processing speed for production of a voice input sentence, by applying a statistic method for selection of candidates to delete undesired candidates and also to avoid the wrong setting of candidates. CONSTITUTION:The input voice is applied to an analysis/collation means 11 for calculation of resemblance degree, and a selecting/holding means 12 selects and holds a single or plural candidates according to the resemblance degrees. Then a candidate re-evaluation means 13 refers to the 1st statistic table 34 to reduce the number of candidates and to arrange the orders. A connectability evaluation means 14 has collation between the preceding and following candidates for each candidate by the 2nd and 3rd statistic tables 35 and 36 and then evaluates and reduces the appearance rate to send those candidates to a KANA (Japanese syllabary)/KANJI (Chinese character) converting part 20. A separating means 21 separates the processing unit by means of inter-syllable marks, etc. and sets an array route by the recognizing information to give the priority. Then the means 21 gives access to a word dictionary 37 for each route to check the grammatical connection for each form element and combination together with other route evaluation. Thus the input characters all undergo the KANA/KANJI conversion, and access is given to a KANJI dictionary 38 to deliver a KANJI-KANA sentence.

Description

【発明の詳細な説明】（ａ）　発明の技術分野本発明はデータ処理システムにおける音声入力による文
章作成方式に関する。DETAILED DESCRIPTION OF THE INVENTION (a) Technical Field of the Invention The present invention relates to a text creation method using voice input in a data processing system.

（ｂｌ　技術の背景近年データ処理技術の発展と普及に伴いデータ処理シス
テムにおける人出カ手段の一端として、半導体特に集積
化技術の進展に支えられ音声認識アルゴリズムのための
複雑局度な論理回路および高速大容量金製する手段がよ
り小形且低コストの高集積回路素子（ＬＳＩ）によって
実現さｎるに従い、日本語音声によるデータ処理が対話
形式に適し、操作者に特別の習熟訓練音節すことなく容
易に利用出来る特徴ケ生かしてマンマシンインタフェー
スの理想目標とする音声タイプライタが実用化されるよ
うになった〇（ｃ）　従来技術と問題点第１図は従来および本発明の一実施例における音声入力
文章作成方式の概念図を示す。図においてｌＯは音声認
識部、２０はカナ漢字変換部、４０は文書編集部、５０
は出方制御部＋　６０８はフ゛リンタ、６０ｂはディス
プレイ＋　７０ａはマイクロフォンおよび７０．ｂは操
作具である。(bl Technology background) In recent years, with the development and spread of data processing technology, complex logic circuits for speech recognition algorithms and As high-speed, large-capacity devices become available through smaller, lower-cost highly integrated circuit devices (LSIs), data processing using Japanese audio is suitable for interactive formats and provides operators with special familiarization training syllables. A voice typewriter, which is an ideal target for a man-machine interface, has been put into practical use by taking advantage of the features that make it easy to use. A conceptual diagram of the voice input sentence creation method is shown. In the figure, lO is a voice recognition unit, 20 is a kana-kanji conversion unit, 40 is a document editing unit, and 50 is a voice recognition unit.
is an output control unit, 608 is a printer, 60b is a display, 70a is a microphone, and 70. b is an operating tool.

従来においては操作者の発声による音声入力をマイクロ
ホン６０ａより音声認識部１ｏに入力せしめその分析結
果による音声パターン會音声辞書との照合によりその類
仰１度に従い該音声入力に対応する単数または複敬の候
補からなるカナ金ディスプレイ６０ｂの画面に表示し、
操作者は該候補の中力１ら逐次正解のカナを確定しっ＼
カナ列葡作成し日本人に最も理解し易い漢字カナ混り文
を作成する方式によっている。このため音声入力による
文章作成方式は特別の習熟を必要としないが操作者は常
時カナを確認する作業が必要になり、操作者の発声が疲
労等によりて状態が震ったりするとディスプレイ６０ｂ
に表示される候補の選択が適正に得ら扛なかったり、候
補数が多過ぎたりする事態が発生し、他の一意的な打鍵
入力によるワードプロセッサの処理に比較して却ってカ
ナ列を確定する工程が増加し処理が煩わしくなり処理速
度が上らない欠点１有している。Conventionally, the voice input by the operator's utterance is inputted to the voice recognition unit 1o from the microphone 60a, and the result of the analysis is compared with the voice pattern conference voice dictionary to determine the singular or compound form corresponding to the voice input according to the degree of similarity. displayed on the screen of the Kanakane display 60b consisting of candidates,
The operator confirms the correct kana one by one starting from the middle power 1 of the candidate.
The method is to create a kana series and create a sentence containing kanji and kana that is easiest for Japanese people to understand. For this reason, although the method of creating sentences using voice input does not require special skill, the operator must constantly check the kana, and if the operator's voice becomes shaky due to fatigue etc., the display 60b
In some cases, the selection of candidates displayed on the screen may not be properly selected, or the number of candidates may be too large. This method has the disadvantage that the processing speed is not increased due to the increase in processing time.

ｆｄｌ　本発明の目的本発明の目的は上記の欠点全除去するため、音声入力を
音声辞書に比較して得る類似度に従って抽出した各候補
について、次工程において複数候補の組合せからなるカ
ナ列における接続可能性全前方接続、後方接続の統計テ
ーブルと照合する以前に各候補自身がその認識における
類似度に対応して他のカナと誤る可能性の出現率順位を
配列する統計テーブルと照合して音声パターンを統計的
な面からも吟味再評価する手段を挿入し、その照合結果
によってはより候補数を削減したり、あるいは優先順位
の見直し全行いより正解が得ら牡以後のカナ列ルートの
切出し、カナ漢字変換における試行繰返しの負担全削減
すると共に操作の従来におけるカナの逐次選択における
煩わしさ全除去して処理速度全向上する手段を提供しよ
うとするものである。fdl Purpose of the Present Invention The purpose of the present invention is to eliminate all of the above-mentioned drawbacks, and in the next step, for each candidate extracted according to the similarity obtained by comparing the voice input with a voice dictionary, the connection in a kana string consisting of a combination of multiple candidates is determined. Before checking with a statistical table of all possible forward connections and backward connections, each candidate is compared with a statistical table that ranks the probability of mistaking it for another kana according to the degree of similarity in its recognition. Inserting a means to re-evaluate the pattern by examining it from a statistical perspective, and depending on the matching results, the number of candidates can be further reduced, or if the correct answer is obtained after all attempts to review the priority order, cut out the kana string route after the first one. The present invention aims to provide a means for completely reducing the burden of trial repetition in kana-kanji conversion, as well as eliminating all the troublesome operations associated with sequential selection of kana characters in the past, thereby improving processing speed.

（ｅｌ　発明の構成未知入力音声を予め音声辞書に登録した特徴量標準バタ
ンと照合する音声認識部に得られる音声バタンデータを
カナ漢字変換部により漢字カナ混り文章として出力せし
める音声入力文章作成システムにおいて、入力音声をス
ペクトラム時系列により分析してその特徴量全標準バタ
ンと逐一比較して類似度ｋｎｉ出する分析照合手段、分
析照合手段に得られる類似度に従いその上位より単数ま
たは複数の候補を選択保持する手段、各候補句：にその
類似度に対応して該候補が他のカナに誤る可能ｆトの出
現率順位？配列する第１Ｍ、計テーブルと照合し候補の
順位配列または重み付けを再評価する手段、候補毎に先
行接続および後続する他のカナ文字との接続についてそ
れぞｆ′Ｌ第２．第３統計テーブルと比較し“Ｃその接
続性を評価する手段、接続性評価手段により抽出選択さ
Ｂ７’（上位候補の組合せによるカナ列候補を形態素抜
りは／およびその組合せ毎に切出す手段、文法ケ参照し
単語、漢字辞＠ｉ照合しつ＼切出し長欠評価してカナ列
全漢字に変換する手段および切出し巣位の組合せルート
を評価する手段を具備しでなり、制御部は音声認識部の
分析照合手段に入力音声音印加して音声データの候補毎
に類似度ケ算出せしめ、−選択保持手段をして選択配列
せしめた候補について再評価手段による臥補数、順位の
削減または／および補正を実行せしめると共に該候補に
ついて接続評価手段をして候補の配列組合せ力）らなる
カナ列におけるその接続妥当性から上位候補を選択して
第１手順全英行せしめ、更にカナ漢字変換部において該
上位候補の組合せからなるカナ列ルートについて切出し
手段、カナ漢字変換手段およびルート評価手段またはそ
の相互繰返しにより最優先カナ列ルートに得る第２手段
を実行せしめて、漢字カナ混り文章音出力せしめること
全特徴とする音声入力文章作成方式？提供することによ
って達成することができる。(el) Structure of the Invention A voice input sentence creation system in which the voice recognition unit compares unknown input voice with the feature value standard bang registered in advance in a voice dictionary, and outputs the voice bang data obtained by the voice recognition unit as a sentence containing kanji and kana by the kana-kanji conversion unit , an analysis and matching means analyzes the input speech in spectrum time series and compares the feature values point by point with all standard batons to obtain a degree of similarity, and one or more candidates are selected from the top according to the similarity obtained by the analysis and comparison means Means for selecting and holding each candidate phrase: Arranges the probability of occurrence of the candidate to be mistaken for another kana according to its degree of similarity. A means for re-evaluating, a means for evaluating the connectivity of each candidate by comparing it with the second and third statistical tables for preceding connections and subsequent connections with other kana characters. Extract and select by B7' (Morphological extraction of kana string candidates based on combinations of top candidates/and means to cut out each combination, grammar ke reference, words, kanji dictionary @i collation, \cut out length and shortness evaluation, and kana The controller is equipped with a means for converting the entire string into kanji and a means for evaluating the combination route of the extraction nest position, and the control section applies the input speech sound to the analysis and matching means of the speech recognition section and calculates the similarity score for each candidate of speech data. - causing the re-evaluation means to perform reduction and/or correction of the complement number, rank, and/or correction for the candidates selected and arranged by the selection holding means; In the first step, the top candidates are selected based on their connection validity in the kana string, and are made to perform all English in the first step.Furthermore, in the kana-kanji conversion section, a cutting means, a kana-kanji conversion means, and a route evaluation are performed for the kana string route consisting of the combination of the top candidates. This can be achieved by providing a voice input sentence creation method which has all the characteristics of executing the second means of obtaining the highest priority kana string route by means or mutual repetition thereof, and outputting the sounds of sentences containing kanji and kana.

げ）発明の実施例以下図面を参照しつ＼本発明の一実施例について説明す
る。G) Embodiment of the Invention An embodiment of the invention will be described below with reference to the drawings.

第２図は本発明の一実施例における音声入力文章作成方
式のブロック図、第３図はその処理手順例図および第４
図は音声入力文章作成方式における第一統計テーブルの
データ例図を示す。図において０は制御部、１０は音声
認識部、１１は分析照合手段、１２は候補選択保持手段
、１３は候補再評価手段、１４は接続性評価手段、２０
はカナ漢字変換部、２１はカナ列切出手段、２２はカナ
漢字変換手段、２３はルート評価手段、３０は記憶部、
３１は制御プログラム、３２は制御データ。FIG. 2 is a block diagram of a voice input sentence creation method in an embodiment of the present invention, FIG. 3 is an example of its processing procedure, and FIG.
The figure shows an example of data of the first statistical table in the voice input text creation method. In the figure, 0 is a control unit, 10 is a speech recognition unit, 11 is an analysis matching unit, 12 is a candidate selection holding unit, 13 is a candidate re-evaluation unit, 14 is a connectivity evaluation unit, and 20
21 is a kana-kanji conversion unit, 21 is a kana string extraction unit, 22 is a kana-kanji conversion unit, 23 is a route evaluation unit, 30 is a storage unit,
31 is a control program, and 32 is control data.

３３は音声辞書、３３ａ＝ｎ標準バタン、３４は第１統
計テーブル、３５は第２統計テーブル、３６は第３統計
テーブル、３７は単語辞書および３８は漢字辞書である
。33 is a voice dictionary, 33a=n standard batan, 34 is a first statistical table, 35 is a second statistical table, 36 is a third statistical table, 37 is a word dictionary, and 38 is a kanji dictionary.

本発明の一実施例では第２図における制御部Ｏは記憶部
３０の制御プログラム３１．制御データ３２に従って構
成各部を制御し、第３図に示す手順に従って入力音声を
音声認識し且その内容ヶカナ混り漢字文に変換して出力
する。記憶部３０は制御プログラム３１．制御データ３
２の他、不特定話者に対応する単音節標準パターン″ｌ
Ｊ：たは／および特定話者の学習モードにおいて得られ
るカナ１文字毎に〜１０個程０訓練サンプルから平均的
な標準バタンｔｌ−登録蓄積する音声辞書、ある入力音
声が分析照合手段の分析機能によりスペクトラムは系列
による特微量分析出力會得て照合機能に保持され、音声
辞書３３の標準バタン３３ａ＝ｎと比較照合さし最も類
似度が高いものから順に複数のカナ候補が得られる。勿
論類似度が０．９９あるいは０．９５と高い場合は単一
の候補として問題ないが他にも大差ない類似度例えば０
．６１　、０．５Ｂ　。In one embodiment of the present invention, the control unit O in FIG. Each component is controlled according to the control data 32, and the input voice is recognized according to the procedure shown in FIG. 3, and the contents are converted into kanji and kakana characters and output. The storage unit 30 stores a control program 31. Control data 3
In addition to 2, there is a monosyllabic standard pattern ``l'' that corresponds to unspecified speakers.
J: About 10 for each kana character obtained in the learning mode of a specific speaker.A speech dictionary that registers and accumulates an average standard batan tl from 0 training samples, and a certain input speech is analyzed by the analysis verification means. According to the function, the spectrum is obtained as a series-based feature quantity analysis output and held in the collation function, and compared and collated with the standard button 33a=n of the speech dictionary 33, and a plurality of kana candidates are obtained in order from the one with the highest degree of similarity. Of course, if the similarity is as high as 0.99 or 0.95, there is no problem as a single candidate, but if the similarity is not much different, for example 0.
．． 61, 0.5B.

０．５７．０．５３　、０．５１　、０．４８のような
候補が得ら扛た場合従来は上位４候補場合によりでは全
候補について接続性評価手段１４に送出し、これ等のす
べての組合せについて逐一評価作業を施していたが、本
実施例では次の候補再評価手段により類似度に対応して
該候補が他のカナと誤る割合を統計的に配列した第１統
計テーブル３４と照合して吟味する。尚第１〜３統計テ
ーブルは第２図に単一ブロックとして表示したがその内
容は標準バタンと同様各カナに対して第４図の第１統計
テーブルにおけるデータ例図のようにそｎぞれ用意され
ているものとする。Conventionally, when candidates such as 0.57, 0.53, 0.51, and 0.48 are obtained, all of the top four candidates are sent to the connectivity evaluation means 14, and all of these candidates are sent to the connectivity evaluation means 14. The combinations were evaluated one by one, but in this embodiment, the following candidate re-evaluation means is used to compare the candidates with a first statistical table 34 that statistically arranges the percentage of candidates mistaken for other kana according to the degree of similarity. and examine it carefully. Although the 1st to 3rd statistical tables are shown as a single block in Fig. 2, the contents are similar to the standard batan, and for each kana, as shown in the example data in the 1st statistical table in Fig. 4. It shall be prepared.

第４図の例では候補Ａの類似度が５０のときはＢはＡに
対し１７／　、Ｄは１５／　、Ｃは１０７．。In the example of FIG. 4, when candidate A's similarity is 50, B is 17/, D is 15/, and C is 107. .

５０　５０の割合で誤る可能性のあることを示す。従ってテーブル
にも表示さルないＡ５０に対し工と誤る可能性はＯであ
る。勿論有意差が少なければ類似度対応をある範囲の類
似度について共通データ１使っても良い。Indicates that there is a probability of error at a rate of 50 50. Therefore, there is a zero possibility that A50, which is not displayed on the table, will be mistaken for A50. Of course, if the significant difference is small, the common data 1 may be used for similarity correspondence within a certain range.

第１例として候補選択保持手段に得られた候補順位では
第５順位でも第１統計テーブル３４における第１候補例
えばＡ５０に対する誤り易い可能性の出現率が第３ｊｗ
−の例えばＣよりも高い第■順位にあり、同様に図示省
略したが第２候補例えばＢ４８に対する誤り率が第３順
位のＣよりも高い第■順位に共通して存在する場合は第
３順位のＣ會下げ第５順位のＩＩＪ上げて順位を補整す
る。第２例として第１候補例えばＥ８０があって第１統
計テーブル３４におけるＥ８０に第２候補Ｆ７５だけが
第■順位に存在し、その他の可能性がＯ′ｆ、Ｘは操作
者の設定したしきい値例えば５以下であり、またＦ７５
に第１候補Ｅ８０だけが例えばＥ７０として第■順位に
存在し補完状態にあるときは第３候補以降を削除する。As a first example, in the candidate ranking obtained by the candidate selection holding means, even in the fifth ranking, the appearance rate of the possibility of error for the first candidate, for example A50, in the first statistical table 34 is the 3rd jw.
-, for example, is in rank ■, which is higher than C, and similarly exists in rank ■, which is also omitted from the illustration, but the error rate for the second candidate, for example, B48, is higher than C, which is the third rank, the third rank. The ranking will be corrected by raising IIJ, which was 5th place, by lowering C-kai. As a second example, there is a first candidate, e.g. E80, and only the second candidate F75 exists in the second rank for E80 in the first statistical table 34, and other possibilities are O'f and X is set by the operator. Threshold value, for example, 5 or less, and F75
If only the first candidate E80, for example E70, exists in the second rank and is in a complementary state, the third and subsequent candidates are deleted.

また第３例として前述の大差ない類似度例えば０．６１
〜０．４８と多数の候補が得られたときでも第１統計テ
ーブルに得られる情報で第２順位および第３順位のみに
補完が得ら扛その他の候補相互組合せでしきい値以下に
あるような場合は第２順位の０．５８と第３順位の０．
５７だけ４４して第１順位仙全削除する。このように従
来は分析照合手段１１に得らｎる類似度だけに従って候
補を選択していたが候補再評価手段１３において第１統
計テーブルケ照合し、候補数の削減またＧＶおよび順位
調整會実行して接続性評価手段１４に候補情報を送出す
る。以上は複数の候補より削除する方向で説明したが場
合によっては候補が少なくて候補再評価手段１３によっ
て類似度が低〈従来なら対象外となるものを逆に追加す
ることもある。次に接続性評価手段１４は各候補の前後
２文字のカナ組合せについて、第２統計テーブルにより
先行接続する候補ｘｌ　ｌ　’ｊ２　＋　ｘｓと照合し
て例えば該候補が「あ」であればＸ、ア、ｘ２ア＋Ｘ３
ア・・・・・の出現率を評価して先行候補を削減し、第
３統計テーブルにより後続接続する候補）’＋＋Ｙｔ＋
ｙ３　・・と照合してアン１．アＹｔｒアｙ、・・・・
の出現率全評価して後続候補を削減する。更に前後から
の接続評価におけるアンド条件または／およびオア条件
により候補は追加削減される。このようにして得られた
候補を組合せて出来るカナ例の候補をカナ漢字変換部２
０に送出する。In addition, as a third example, the above-mentioned similarity with no significant difference, for example, 0.61
Even when a large number of candidates (~0.48) are obtained, the information obtained in the first statistical table shows that only the second and third ranks are complemented. In this case, the second rank is 0.58 and the third rank is 0.58.
44 by 57 and delete the first rank Senzen. In this way, in the past, candidates were selected only according to the degree of similarity obtained by the analysis and matching means 11, but the candidate re-evaluation means 13 performs the first statistical table comparison to reduce the number of candidates and perform GV and ranking adjustment meetings. Then, candidate information is sent to the connectivity evaluation means 14. The above description has been made in the direction of deleting candidates from a plurality of candidates, but in some cases, there are few candidates and the candidate re-evaluation means 13 may add candidates with low similarity (which would otherwise be excluded). Next, the connectivity evaluation means 14 compares the kana combination of the two characters before and after each candidate with the preceding connected candidate xl l 'j2 + xs using the second statistical table, and for example, if the candidate is "a", then A, x2 a+X3
Evaluate the appearance rate of A..., reduce the preceding candidates, and use the third statistical table to connect the candidates to succeed)'++Yt+
Check with y3... and find Anne 1. A Ytr ay...
Evaluate all appearance rates of and reduce subsequent candidates. Furthermore, candidates are additionally reduced by AND conditions and/or OR conditions in the connection evaluation from before and after. The kana-kanji converter 2 converts the kana example candidates created by combining the candidates obtained in this way.
Send to 0.

カナ漢字変換部２０におけるカナ列切出手段２１は音節
スペース間または数字、記号等による大きい切出しケ施
して変換処理単位會得た後、カナ列全構成する各候補間
の近似度情報の評価に従い各候補の組合せによる配列ル
ートとその優先順位を与え複数のルーｉｆ設定する。The kana string cutting means 21 in the kana-kanji conversion unit 20 performs large cutting between syllable spaces or numbers, symbols, etc. to form a conversion processing unit, and then performs a conversion process according to the evaluation of the similarity information between each candidate constituting the entire kana string. A plurality of routes are set by giving an array route and its priority based on each combination of candidates.

各ルートについて逐一カナ列候補を単語辞書３７にアク
セスしつ＼形態素または／およびその組合せ毎に文法的
な接続チェック會単語辞書３７の接続情報と照合評価し
つ＼、形態素とその組合せ長の確からしさ會チェックす
る。For each route, the kana string candidates are accessed one by one to the word dictionary 37, and each morpheme and/or combination thereof is checked for grammatical connections.The length of the morphemes and their combinations is confirmed by comparing and evaluating them with the connection information in the group word dictionary 37. Check out the personality.

他のカナ列ルートについても同様に処理上繰返して評価
し、例えば最小形態素数等により最も確からしい優先カ
ナル−１決定してカナ漢字変換手段２２に送出する。カ
ナ漢字変換手段２２は該優先カナルート候補全受信しカ
ナ文字列ケ漢字辞誓３８にアクセスし漢字カナ混り文章
に変換して出力する、ルート評価手段２３はカナ列切換
手段２１、カナ漢字変換手段２２に有効形態素が得らｎ
ないとき別の配列ルートによるフォーマルバック処理を
行う機能である。The other kana sequence roots are evaluated repeatedly in the same way, and the most probable priority canal-1 is determined based on, for example, the minimum number of morphemes, and sent to the kana-kanji conversion means 22. The kana-kanji conversion means 22 receives all the priority kana route candidates, accesses the kana character string to kanji dictionary 38, converts it into a sentence containing kanji and kana, and outputs it. Means 22 obtains a valid morpheme n
This function performs formal back processing using another array route when there is no such array.

本実施例では以上のように構成さ牡カナ接続性評価手段
１４に入力するカナ候補の状態で候補再評価手段１３に
より他と誤る可能性についての統計的なデータにより評
価を施し、候補数の削減や順位補正を行うので、従来分
析照合手段に得らｎる音声パタンの類似度比較だけによ
る候補の決定に対し合理的な見直しによって適切な候補
の選択が得ら几、接続性評価手段１４食含めた後続する
各機能の処理全軽減することが出来る。In this embodiment, the candidate re-evaluation means 13 evaluates the kana candidates inputted to the male-kana connectivity evaluation means 14 using statistical data regarding the possibility of mistaking them for other kana candidates configured as described above, and reduces the number of candidates. Since the reduction and rank correction are performed, the selection of appropriate candidates can be obtained through rational review, whereas the selection of candidates is only based on the similarity comparison of the speech patterns obtained by conventional analysis and matching means. It is possible to completely reduce the processing of each subsequent function, including food processing.

＜ｇ）　発明の詳細な説明したように本発明によ几ば従央における分析出力
による類似度比較だけでは見落しとなっていた候補選択
について統計的手法を加味したことにより不必要な候補
を効果的に削減したり、過った候補の設定を除去出来る
ので以後の処理機能における負担上軽減し処理速度を向
上せしめ誤変換率の少い音声入力文章作成方式が得られ
る。<g) As described in detail, according to the present invention, unnecessary candidates can be eliminated by adding statistical methods to the selection of candidates that would have been overlooked only by comparing the similarity based on the analysis output in Juo. Since it is possible to effectively reduce or remove incorrect candidate settings, it is possible to reduce the burden on subsequent processing functions, improve processing speed, and obtain a voice input text creation method with a low rate of erroneous conversion.

[Brief explanation of drawings]

第１図は従来および本発明の一実施例における音声入力
文章作成方式の概念図、第２図は本発明の一実施例にお
ける音声入力文章作成方式のブロック図、第３図はその
処理手順例図および第４図は音声入力文章作成方式にお
ける第１統計テーブルのデータ例図金示す。図において
Ｏは制御部。１０は音声認識部、１１は分析照合手段、１２は候補選
択保持手段、１３は候補再評価手段、１４は接続性評価
手段、２０はカナ漢字変換部、３０は記憶部、３１は制
御ブロクラム、３２は制御データ、３３は音声辞書、３
３ａ”−ｎは標準バタン。３４は第１統計テーブル、３５は第２統絹テーブル、３
６は第３統計テーブル、４０は文書編集部。５０は出力開側１部、６０ａはプリンタおよび６０ｂは
ディスプレイである。寮１ｌｆＩ寮？呵Ｉ− Ｙ−３唄許４酊FIG. 1 is a conceptual diagram of a voice input sentence creation method in a conventional method and an embodiment of the present invention, FIG. 2 is a block diagram of a voice input sentence creation method in an embodiment of the present invention, and FIG. 3 is an example of its processing procedure. 4 and 4 show data examples of the first statistical table in the voice input text creation method. In the figure, O is the control unit. 10 is a speech recognition unit, 11 is an analysis verification unit, 12 is a candidate selection and holding unit, 13 is a candidate re-evaluation unit, 14 is a connectivity evaluation unit, 20 is a kana-kanji conversion unit, 30 is a storage unit, 31 is a control block, 32 is control data, 33 is a voice dictionary, 3
3a”-n is the standard baton. 34 is the first statistical table, 35 is the second silk table, 3
6 is a third statistical table, and 40 is a document editing department. Reference numeral 50 represents an output open side, 60a represents a printer, and 60b represents a display. Dormitory 1lfI Dormitory?呵I- Y-3 Utaho 4 Drunkenness

Claims

[Claims]

In a voice input text creation system, the voice recognition unit compares the unknown input voice with the standard feature value registered in the voice dictionary.In the voice input sentence creation system, the voice recognition unit outputs the input voice as a sentence containing kanji and kana by the all-kana-kanji conversion unit. An analysis matching means that analyzes the voice in spectrum time series and compares the feature values point by point with a standard bang to calculate the similarity, and selects and retains all candidates from the top one or more according to the 36a similarity obtained by the analysis matching means. means for each candidate,
Means for re-evaluating the ranking arrangement or weighting of the candidates by comparing them with a first statistical table that arranges the occurrence rate rankings of the probability of the candidate being mistaken for another kana corresponding to the low degree, and for each candidate, prior connection and Regarding connections with other subsequent kana characters, the second. Means for evaluating connectivity by comparing with the third statistical table, means for cutting out kana sequence candidates by morphemes and/or combinations thereof, referring to grammar, matching words in kanji dictionaries, and \cutting length. The control unit is equipped with means for evaluating and converting kana strings into kanji and means for determining the combination route of cutting units. The degree of similarity is calculated for each candidate, and the number of candidates and the ranking are reduced by the re-evaluation means for the candidates selected and arranged by the selection holding means.
and correction is performed, and the connection evaluation means selects the top candidate based on its connection validity in the kana string consisting of the arrangement combination of the candidates and executes the first step, and further the kana-kanji conversion unit The cutting means, the kana-kanji conversion means and the route evaluation means, or the second means which obtains the highest priority kana string route t-V by repeating these means, are executed for the kana string route consisting of a combination of candidates, and the sound of the sentence containing kanji and kana is executed. A voice input text creation method characterized by output.