JPS60105037A - Production system of voice input sentence - Google Patents

Production system of voice input sentence

Info

Publication number
JPS60105037A
JPS60105037A JP58212019A JP21201983A JPS60105037A JP S60105037 A JPS60105037 A JP S60105037A JP 58212019 A JP58212019 A JP 58212019A JP 21201983 A JP21201983 A JP 21201983A JP S60105037 A JPS60105037 A JP S60105037A
Authority
JP
Japan
Prior art keywords
kana
candidates
kanji
candidate
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58212019A
Other languages
Japanese (ja)
Inventor
Yutaka Kamiyanagi
上柳 裕
Takahiko Ogita
荻田 隆彦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP58212019A priority Critical patent/JPS60105037A/en
Publication of JPS60105037A publication Critical patent/JPS60105037A/en
Pending legal-status Critical Current

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

PURPOSE:To reduce the load of a processor and to improve the processing speed for production of a voice input sentence, by applying a statistic method for selection of candidates to delete undesired candidates and also to avoid the wrong setting of candidates. CONSTITUTION:The input voice is applied to an analysis/collation means 11 for calculation of resemblance degree, and a selecting/holding means 12 selects and holds a single or plural candidates according to the resemblance degrees. Then a candidate re-evaluation means 13 refers to the 1st statistic table 34 to reduce the number of candidates and to arrange the orders. A connectability evaluation means 14 has collation between the preceding and following candidates for each candidate by the 2nd and 3rd statistic tables 35 and 36 and then evaluates and reduces the appearance rate to send those candidates to a KANA (Japanese syllabary)/KANJI (Chinese character) converting part 20. A separating means 21 separates the processing unit by means of inter-syllable marks, etc. and sets an array route by the recognizing information to give the priority. Then the means 21 gives access to a word dictionary 37 for each route to check the grammatical connection for each form element and combination together with other route evaluation. Thus the input characters all undergo the KANA/KANJI conversion, and access is given to a KANJI dictionary 38 to deliver a KANJI-KANA sentence.

Description

【発明の詳細な説明】 (a) 発明の技術分野 本発明はデータ処理システムにおける音声入力による文
章作成方式に関する。
DETAILED DESCRIPTION OF THE INVENTION (a) Technical Field of the Invention The present invention relates to a text creation method using voice input in a data processing system.

(bl 技術の背景 近年データ処理技術の発展と普及に伴いデータ処理シス
テムにおける人出カ手段の一端として、半導体特に集積
化技術の進展に支えられ音声認識アルゴリズムのための
複雑局度な論理回路および高速大容量金製する手段がよ
り小形且低コストの高集積回路素子(LSI)によって
実現さnるに従い、日本語音声によるデータ処理が対話
形式に適し、操作者に特別の習熟訓練音節すことなく容
易に利用出来る特徴ケ生かしてマンマシンインタフェー
スの理想目標とする音声タイプライタが実用化されるよ
うになった〇 (c) 従来技術と問題点 第1図は従来および本発明の一実施例における音声入力
文章作成方式の概念図を示す。図においてlOは音声認
識部、20はカナ漢字変換部、40は文書編集部、50
は出方制御部+ 608はフ゛リンタ、60bはディス
プレイ+ 70aはマイクロフォンおよび70.bは操
作具である。
(bl Technology background) In recent years, with the development and spread of data processing technology, complex logic circuits for speech recognition algorithms and As high-speed, large-capacity devices become available through smaller, lower-cost highly integrated circuit devices (LSIs), data processing using Japanese audio is suitable for interactive formats and provides operators with special familiarization training syllables. A voice typewriter, which is an ideal target for a man-machine interface, has been put into practical use by taking advantage of the features that make it easy to use. A conceptual diagram of the voice input sentence creation method is shown. In the figure, lO is a voice recognition unit, 20 is a kana-kanji conversion unit, 40 is a document editing unit, and 50 is a voice recognition unit.
is an output control unit, 608 is a printer, 60b is a display, 70a is a microphone, and 70. b is an operating tool.

従来においては操作者の発声による音声入力をマイクロ
ホン60aより音声認識部1oに入力せしめその分析結
果による音声パターン會音声辞書との照合によりその類
仰1度に従い該音声入力に対応する単数または複敬の候
補からなるカナ金ディスプレイ60bの画面に表示し、
操作者は該候補の中力1ら逐次正解のカナを確定しっ\
カナ列葡作成し日本人に最も理解し易い漢字カナ混り文
を作成する方式によっている。このため音声入力による
文章作成方式は特別の習熟を必要としないが操作者は常
時カナを確認する作業が必要になり、操作者の発声が疲
労等によりて状態が震ったりするとディスプレイ60b
に表示される候補の選択が適正に得ら扛なかったり、候
補数が多過ぎたりする事態が発生し、他の一意的な打鍵
入力によるワードプロセッサの処理に比較して却ってカ
ナ列を確定する工程が増加し処理が煩わしくなり処理速
度が上らない欠点1有している。
Conventionally, the voice input by the operator's utterance is inputted to the voice recognition unit 1o from the microphone 60a, and the result of the analysis is compared with the voice pattern conference voice dictionary to determine the singular or compound form corresponding to the voice input according to the degree of similarity. displayed on the screen of the Kanakane display 60b consisting of candidates,
The operator confirms the correct kana one by one starting from the middle power 1 of the candidate.
The method is to create a kana series and create a sentence containing kanji and kana that is easiest for Japanese people to understand. For this reason, although the method of creating sentences using voice input does not require special skill, the operator must constantly check the kana, and if the operator's voice becomes shaky due to fatigue etc., the display 60b
In some cases, the selection of candidates displayed on the screen may not be properly selected, or the number of candidates may be too large. This method has the disadvantage that the processing speed is not increased due to the increase in processing time.

fdl 本発明の目的 本発明の目的は上記の欠点全除去するため、音声入力を
音声辞書に比較して得る類似度に従って抽出した各候補
について、次工程において複数候補の組合せからなるカ
ナ列における接続可能性全前方接続、後方接続の統計テ
ーブルと照合する以前に各候補自身がその認識における
類似度に対応して他のカナと誤る可能性の出現率順位を
配列する統計テーブルと照合して音声パターンを統計的
な面からも吟味再評価する手段を挿入し、その照合結果
によってはより候補数を削減したり、あるいは優先順位
の見直し全行いより正解が得ら牡以後のカナ列ルートの
切出し、カナ漢字変換における試行繰返しの負担全削減
すると共に操作の従来におけるカナの逐次選択における
煩わしさ全除去して処理速度全向上する手段を提供しよ
うとするものである。
fdl Purpose of the Present Invention The purpose of the present invention is to eliminate all of the above-mentioned drawbacks, and in the next step, for each candidate extracted according to the similarity obtained by comparing the voice input with a voice dictionary, the connection in a kana string consisting of a combination of multiple candidates is determined. Before checking with a statistical table of all possible forward connections and backward connections, each candidate is compared with a statistical table that ranks the probability of mistaking it for another kana according to the degree of similarity in its recognition. Inserting a means to re-evaluate the pattern by examining it from a statistical perspective, and depending on the matching results, the number of candidates can be further reduced, or if the correct answer is obtained after all attempts to review the priority order, cut out the kana string route after the first one. The present invention aims to provide a means for completely reducing the burden of trial repetition in kana-kanji conversion, as well as eliminating all the troublesome operations associated with sequential selection of kana characters in the past, thereby improving processing speed.

(el 発明の構成 未知入力音声を予め音声辞書に登録した特徴量標準バタ
ンと照合する音声認識部に得られる音声バタンデータを
カナ漢字変換部により漢字カナ混り文章として出力せし
める音声入力文章作成システムにおいて、入力音声をス
ペクトラム時系列により分析してその特徴量全標準バタ
ンと逐一比較して類似度kni出する分析照合手段、分
析照合手段に得られる類似度に従いその上位より単数ま
たは複数の候補を選択保持する手段、各候補句:にその
類似度に対応して該候補が他のカナに誤る可能fトの出
現率順位?配列する第1M、計テーブルと照合し候補の
順位配列または重み付けを再評価する手段、候補毎に先
行接続および後続する他のカナ文字との接続についてそ
れぞf′L第2.第3統計テーブルと比較し“Cその接
続性を評価する手段、接続性評価手段により抽出選択さ
B7’(上位候補の組合せによるカナ列候補を形態素抜
りは/およびその組合せ毎に切出す手段、文法ケ参照し
単語、漢字辞@i照合しつ\切出し長欠評価してカナ列
全漢字に変換する手段および切出し巣位の組合せルート
を評価する手段を具備しでなり、制御部は音声認識部の
分析照合手段に入力音声音印加して音声データの候補毎
に類似度ケ算出せしめ、−選択保持手段をして選択配列
せしめた候補について再評価手段による臥補数、順位の
削減または/および補正を実行せしめると共に該候補に
ついて接続評価手段をして候補の配列組合せ力)らなる
カナ列におけるその接続妥当性から上位候補を選択して
第1手順全英行せしめ、更にカナ漢字変換部において該
上位候補の組合せからなるカナ列ルートについて切出し
手段、カナ漢字変換手段およびルート評価手段またはそ
の相互繰返しにより最優先カナ列ルートに得る第2手段
を実行せしめて、漢字カナ混り文章音出力せしめること
全特徴とする音声入力文章作成方式?提供することによ
って達成することができる。
(el) Structure of the Invention A voice input sentence creation system in which the voice recognition unit compares unknown input voice with the feature value standard bang registered in advance in a voice dictionary, and outputs the voice bang data obtained by the voice recognition unit as a sentence containing kanji and kana by the kana-kanji conversion unit , an analysis and matching means analyzes the input speech in spectrum time series and compares the feature values point by point with all standard batons to obtain a degree of similarity, and one or more candidates are selected from the top according to the similarity obtained by the analysis and comparison means Means for selecting and holding each candidate phrase: Arranges the probability of occurrence of the candidate to be mistaken for another kana according to its degree of similarity. A means for re-evaluating, a means for evaluating the connectivity of each candidate by comparing it with the second and third statistical tables for preceding connections and subsequent connections with other kana characters. Extract and select by B7' (Morphological extraction of kana string candidates based on combinations of top candidates/and means to cut out each combination, grammar ke reference, words, kanji dictionary @i collation, \cut out length and shortness evaluation, and kana The controller is equipped with a means for converting the entire string into kanji and a means for evaluating the combination route of the extraction nest position, and the control section applies the input speech sound to the analysis and matching means of the speech recognition section and calculates the similarity score for each candidate of speech data. - causing the re-evaluation means to perform reduction and/or correction of the complement number, rank, and/or correction for the candidates selected and arranged by the selection holding means; In the first step, the top candidates are selected based on their connection validity in the kana string, and are made to perform all English in the first step.Furthermore, in the kana-kanji conversion section, a cutting means, a kana-kanji conversion means, and a route evaluation are performed for the kana string route consisting of the combination of the top candidates. This can be achieved by providing a voice input sentence creation method which has all the characteristics of executing the second means of obtaining the highest priority kana string route by means or mutual repetition thereof, and outputting the sounds of sentences containing kanji and kana.

げ)発明の実施例 以下図面を参照しつ\本発明の一実施例について説明す
る。
G) Embodiment of the Invention An embodiment of the invention will be described below with reference to the drawings.

第2図は本発明の一実施例における音声入力文章作成方
式のブロック図、第3図はその処理手順例図および第4
図は音声入力文章作成方式における第一統計テーブルの
データ例図を示す。図において0は制御部、10は音声
認識部、11は分析照合手段、12は候補選択保持手段
、13は候補再評価手段、14は接続性評価手段、20
はカナ漢字変換部、21はカナ列切出手段、22はカナ
漢字変換手段、23はルート評価手段、30は記憶部、
31は制御プログラム、32は制御データ。
FIG. 2 is a block diagram of a voice input sentence creation method in an embodiment of the present invention, FIG. 3 is an example of its processing procedure, and FIG.
The figure shows an example of data of the first statistical table in the voice input text creation method. In the figure, 0 is a control unit, 10 is a speech recognition unit, 11 is an analysis matching unit, 12 is a candidate selection holding unit, 13 is a candidate re-evaluation unit, 14 is a connectivity evaluation unit, and 20
21 is a kana-kanji conversion unit, 21 is a kana string extraction unit, 22 is a kana-kanji conversion unit, 23 is a route evaluation unit, 30 is a storage unit,
31 is a control program, and 32 is control data.

33は音声辞書、33a=n標準バタン、34は第1統
計テーブル、35は第2統計テーブル、36は第3統計
テーブル、37は単語辞書および38は漢字辞書である
33 is a voice dictionary, 33a=n standard batan, 34 is a first statistical table, 35 is a second statistical table, 36 is a third statistical table, 37 is a word dictionary, and 38 is a kanji dictionary.

本発明の一実施例では第2図における制御部Oは記憶部
30の制御プログラム31.制御データ32に従って構
成各部を制御し、第3図に示す手順に従って入力音声を
音声認識し且その内容ヶカナ混り漢字文に変換して出力
する。記憶部30は制御プログラム31.制御データ3
2の他、不特定話者に対応する単音節標準パターン″l
J:たは/および特定話者の学習モードにおいて得られ
るカナ1文字毎に〜10個程0訓練サンプルから平均的
な標準バタンtl−登録蓄積する音声辞書、ある入力音
声が分析照合手段の分析機能によりスペクトラムは系列
による特微量分析出力會得て照合機能に保持され、音声
辞書33の標準バタン33a=nと比較照合さし最も類
似度が高いものから順に複数のカナ候補が得られる。勿
論類似度が0.99あるいは0.95と高い場合は単一
の候補として問題ないが他にも大差ない類似度例えば0
.61 、0.5B 。
In one embodiment of the present invention, the control unit O in FIG. Each component is controlled according to the control data 32, and the input voice is recognized according to the procedure shown in FIG. 3, and the contents are converted into kanji and kakana characters and output. The storage unit 30 stores a control program 31. Control data 3
In addition to 2, there is a monosyllabic standard pattern ``l'' that corresponds to unspecified speakers.
J: About 10 for each kana character obtained in the learning mode of a specific speaker.A speech dictionary that registers and accumulates an average standard batan tl from 0 training samples, and a certain input speech is analyzed by the analysis verification means. According to the function, the spectrum is obtained as a series-based feature quantity analysis output and held in the collation function, and compared and collated with the standard button 33a=n of the speech dictionary 33, and a plurality of kana candidates are obtained in order from the one with the highest degree of similarity. Of course, if the similarity is as high as 0.99 or 0.95, there is no problem as a single candidate, but if the similarity is not much different, for example 0.
.. 61, 0.5B.

0.57.0.53 、0.51 、0.48のような
候補が得ら扛た場合従来は上位4候補場合によりでは全
候補について接続性評価手段14に送出し、これ等のす
べての組合せについて逐一評価作業を施していたが、本
実施例では次の候補再評価手段により類似度に対応して
該候補が他のカナと誤る割合を統計的に配列した第1統
計テーブル34と照合して吟味する。尚第1〜3統計テ
ーブルは第2図に単一ブロックとして表示したがその内
容は標準バタンと同様各カナに対して第4図の第1統計
テーブルにおけるデータ例図のようにそnぞれ用意され
ているものとする。
Conventionally, when candidates such as 0.57, 0.53, 0.51, and 0.48 are obtained, all of the top four candidates are sent to the connectivity evaluation means 14, and all of these candidates are sent to the connectivity evaluation means 14. The combinations were evaluated one by one, but in this embodiment, the following candidate re-evaluation means is used to compare the candidates with a first statistical table 34 that statistically arranges the percentage of candidates mistaken for other kana according to the degree of similarity. and examine it carefully. Although the 1st to 3rd statistical tables are shown as a single block in Fig. 2, the contents are similar to the standard batan, and for each kana, as shown in the example data in the 1st statistical table in Fig. 4. It shall be prepared.

第4図の例では候補Aの類似度が50のときはBはAに
対し17/ 、Dは15/ 、Cは107.。
In the example of FIG. 4, when candidate A's similarity is 50, B is 17/, D is 15/, and C is 107. .

50 50 の割合で誤る可能性のあることを示す。従ってテーブル
にも表示さルないA50に対し工と誤る可能性はOであ
る。勿論有意差が少なければ類似度対応をある範囲の類
似度について共通データ1使っても良い。
Indicates that there is a probability of error at a rate of 50 50. Therefore, there is a zero possibility that A50, which is not displayed on the table, will be mistaken for A50. Of course, if the significant difference is small, the common data 1 may be used for similarity correspondence within a certain range.

第1例として候補選択保持手段に得られた候補順位では
第5順位でも第1統計テーブル34における第1候補例
えばA50に対する誤り易い可能性の出現率が第3jw
−の例えばCよりも高い第■順位にあり、同様に図示省
略したが第2候補例えばB48に対する誤り率が第3順
位のCよりも高い第■順位に共通して存在する場合は第
3順位のC會下げ第5順位のIIJ上げて順位を補整す
る。第2例として第1候補例えばE80があって第1統
計テーブル34におけるE80に第2候補F75だけが
第■順位に存在し、その他の可能性がO′f、Xは操作
者の設定したしきい値例えば5以下であり、またF75
に第1候補E80だけが例えばE70として第■順位に
存在し補完状態にあるときは第3候補以降を削除する。
As a first example, in the candidate ranking obtained by the candidate selection holding means, even in the fifth ranking, the appearance rate of the possibility of error for the first candidate, for example A50, in the first statistical table 34 is the 3rd jw.
-, for example, is in rank ■, which is higher than C, and similarly exists in rank ■, which is also omitted from the illustration, but the error rate for the second candidate, for example, B48, is higher than C, which is the third rank, the third rank. The ranking will be corrected by raising IIJ, which was 5th place, by lowering C-kai. As a second example, there is a first candidate, e.g. E80, and only the second candidate F75 exists in the second rank for E80 in the first statistical table 34, and other possibilities are O'f and X is set by the operator. Threshold value, for example, 5 or less, and F75
If only the first candidate E80, for example E70, exists in the second rank and is in a complementary state, the third and subsequent candidates are deleted.

また第3例として前述の大差ない類似度例えば0.61
〜0.48と多数の候補が得られたときでも第1統計テ
ーブルに得られる情報で第2順位および第3順位のみに
補完が得ら扛その他の候補相互組合せでしきい値以下に
あるような場合は第2順位の0.58と第3順位の0.
57だけ44して第1順位仙全削除する。このように従
来は分析照合手段11に得らnる類似度だけに従って候
補を選択していたが候補再評価手段13において第1統
計テーブルケ照合し、候補数の削減またGVおよび順位
調整會実行して接続性評価手段14に候補情報を送出す
る。以上は複数の候補より削除する方向で説明したが場
合によっては候補が少なくて候補再評価手段13によっ
て類似度が低〈従来なら対象外となるものを逆に追加す
ることもある。次に接続性評価手段14は各候補の前後
2文字のカナ組合せについて、第2統計テーブルにより
先行接続する候補xl l ’j2 + xsと照合し
て例えば該候補が「あ」であればX、ア、x2ア+X3
ア・・・・・の出現率を評価して先行候補を削減し、第
3統計テーブルにより後続接続する候補)’++Yt+
y3 ・・と照合してアン1.アYtrアy、・・・・
の出現率全評価して後続候補を削減する。更に前後から
の接続評価におけるアンド条件または/およびオア条件
により候補は追加削減される。このようにして得られた
候補を組合せて出来るカナ例の候補をカナ漢字変換部2
0に送出する。
In addition, as a third example, the above-mentioned similarity with no significant difference, for example, 0.61
Even when a large number of candidates (~0.48) are obtained, the information obtained in the first statistical table shows that only the second and third ranks are complemented. In this case, the second rank is 0.58 and the third rank is 0.58.
44 by 57 and delete the first rank Senzen. In this way, in the past, candidates were selected only according to the degree of similarity obtained by the analysis and matching means 11, but the candidate re-evaluation means 13 performs the first statistical table comparison to reduce the number of candidates and perform GV and ranking adjustment meetings. Then, candidate information is sent to the connectivity evaluation means 14. The above description has been made in the direction of deleting candidates from a plurality of candidates, but in some cases, there are few candidates and the candidate re-evaluation means 13 may add candidates with low similarity (which would otherwise be excluded). Next, the connectivity evaluation means 14 compares the kana combination of the two characters before and after each candidate with the preceding connected candidate xl l 'j2 + xs using the second statistical table, and for example, if the candidate is "a", then A, x2 a+X3
Evaluate the appearance rate of A..., reduce the preceding candidates, and use the third statistical table to connect the candidates to succeed)'++Yt+
Check with y3... and find Anne 1. A Ytr ay...
Evaluate all appearance rates of and reduce subsequent candidates. Furthermore, candidates are additionally reduced by AND conditions and/or OR conditions in the connection evaluation from before and after. The kana-kanji converter 2 converts the kana example candidates created by combining the candidates obtained in this way.
Send to 0.

カナ漢字変換部20におけるカナ列切出手段21は音節
スペース間または数字、記号等による大きい切出しケ施
して変換処理単位會得た後、カナ列全構成する各候補間
の近似度情報の評価に従い各候補の組合せによる配列ル
ートとその優先順位を与え複数のルーif設定する。
The kana string cutting means 21 in the kana-kanji conversion unit 20 performs large cutting between syllable spaces or numbers, symbols, etc. to form a conversion processing unit, and then performs a conversion process according to the evaluation of the similarity information between each candidate constituting the entire kana string. A plurality of routes are set by giving an array route and its priority based on each combination of candidates.

各ルートについて逐一カナ列候補を単語辞書37にアク
セスしつ\形態素または/およびその組合せ毎に文法的
な接続チェック會単語辞書37の接続情報と照合評価し
つ\、形態素とその組合せ長の確からしさ會チェックす
る。
For each route, the kana string candidates are accessed one by one to the word dictionary 37, and each morpheme and/or combination thereof is checked for grammatical connections.The length of the morphemes and their combinations is confirmed by comparing and evaluating them with the connection information in the group word dictionary 37. Check out the personality.

他のカナ列ルートについても同様に処理上繰返して評価
し、例えば最小形態素数等により最も確からしい優先カ
ナル−1決定してカナ漢字変換手段22に送出する。カ
ナ漢字変換手段22は該優先カナルート候補全受信しカ
ナ文字列ケ漢字辞誓38にアクセスし漢字カナ混り文章
に変換して出力する、ルート評価手段23はカナ列切換
手段21、カナ漢字変換手段22に有効形態素が得らn
ないとき別の配列ルートによるフォーマルバック処理を
行う機能である。
The other kana sequence roots are evaluated repeatedly in the same way, and the most probable priority canal-1 is determined based on, for example, the minimum number of morphemes, and sent to the kana-kanji conversion means 22. The kana-kanji conversion means 22 receives all the priority kana route candidates, accesses the kana character string to kanji dictionary 38, converts it into a sentence containing kanji and kana, and outputs it. Means 22 obtains a valid morpheme n
This function performs formal back processing using another array route when there is no such array.

本実施例では以上のように構成さ牡カナ接続性評価手段
14に入力するカナ候補の状態で候補再評価手段13に
より他と誤る可能性についての統計的なデータにより評
価を施し、候補数の削減や順位補正を行うので、従来分
析照合手段に得らnる音声パタンの類似度比較だけによ
る候補の決定に対し合理的な見直しによって適切な候補
の選択が得ら几、接続性評価手段14食含めた後続する
各機能の処理全軽減することが出来る。
In this embodiment, the candidate re-evaluation means 13 evaluates the kana candidates inputted to the male-kana connectivity evaluation means 14 using statistical data regarding the possibility of mistaking them for other kana candidates configured as described above, and reduces the number of candidates. Since the reduction and rank correction are performed, the selection of appropriate candidates can be obtained through rational review, whereas the selection of candidates is only based on the similarity comparison of the speech patterns obtained by conventional analysis and matching means. It is possible to completely reduce the processing of each subsequent function, including food processing.

<g) 発明の詳細 な説明したように本発明によ几ば従央における分析出力
による類似度比較だけでは見落しとなっていた候補選択
について統計的手法を加味したことにより不必要な候補
を効果的に削減したり、過った候補の設定を除去出来る
ので以後の処理機能における負担上軽減し処理速度を向
上せしめ誤変換率の少い音声入力文章作成方式が得られ
る。
<g) As described in detail, according to the present invention, unnecessary candidates can be eliminated by adding statistical methods to the selection of candidates that would have been overlooked only by comparing the similarity based on the analysis output in Juo. Since it is possible to effectively reduce or remove incorrect candidate settings, it is possible to reduce the burden on subsequent processing functions, improve processing speed, and obtain a voice input text creation method with a low rate of erroneous conversion.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は従来および本発明の一実施例における音声入力
文章作成方式の概念図、第2図は本発明の一実施例にお
ける音声入力文章作成方式のブロック図、第3図はその
処理手順例図および第4図は音声入力文章作成方式にお
ける第1統計テーブルのデータ例図金示す。図において
Oは制御部。 10は音声認識部、11は分析照合手段、12は候補選
択保持手段、13は候補再評価手段、14は接続性評価
手段、20はカナ漢字変換部、30は記憶部、31は制
御ブロクラム、32は制御データ、33は音声辞書、3
3a”−nは標準バタン。 34は第1統計テーブル、35は第2統絹テーブル、3
6は第3統計テーブル、40は文書編集部。 50は出力開側1部、60aはプリンタおよび60bは
ディスプレイである。 寮1lfI 寮?呵 I− Y−3唄 許4酊
FIG. 1 is a conceptual diagram of a voice input sentence creation method in a conventional method and an embodiment of the present invention, FIG. 2 is a block diagram of a voice input sentence creation method in an embodiment of the present invention, and FIG. 3 is an example of its processing procedure. 4 and 4 show data examples of the first statistical table in the voice input text creation method. In the figure, O is the control unit. 10 is a speech recognition unit, 11 is an analysis verification unit, 12 is a candidate selection and holding unit, 13 is a candidate re-evaluation unit, 14 is a connectivity evaluation unit, 20 is a kana-kanji conversion unit, 30 is a storage unit, 31 is a control block, 32 is control data, 33 is a voice dictionary, 3
3a”-n is the standard baton. 34 is the first statistical table, 35 is the second silk table, 3
6 is a third statistical table, and 40 is a document editing department. Reference numeral 50 represents an output open side, 60a represents a printer, and 60b represents a display. Dormitory 1lfI Dormitory?呵I- Y-3 Utaho 4 Drunkenness

Claims (1)

【特許請求の範囲】[Claims] 未知入力音声を予め音声辞書に登録した特徴量標準バタ
ンと照合する音声認識部に得らnる音声バタンデータ全
カナ漢字変換部により漢字カナ混り文章として出力せし
める音声入力文章作成システムにおいて、入力音声をス
ペクトラム時系列により分析してその特徴量を標準バタ
ンと逐一比較して類似度會算出する分析照合手段、分析
照合手段に得ら36a似度に従いその上位より単数また
は複数の候補全選択保持する手段、各候補毎にその類、
低度に対応して該候補が他のカナに誤る可能性の出現率
順位音配列する第1統計テーブルと照合し候補の順位配
列または重み付はケ再評価する手段、候補毎に先行接続
および後続する他のカナ文字との接続についてそれぞれ
第2.第3統計テーブルと比較してその接続性を評価す
る手段、接合せによるカナ列候補を形態素または/およ
びその組合せ毎に切出す手段、文法を参照し単語、漢字
辞書上照合しつ\切出し長を評価してカナ列を漢字に変
換する手段および切出し単位の組合せル−トを針側する
手段全具備してなり、制御部は音声認識部の分析照合手
段に入力音声を印加して音声データの候補毎に類似度を
算出せしめ、選択保持手段上して選択配列せしめた候補
について再評価手段による候補数、順位の削減まkは/
および補正を実行せしめると共に該候補について接続評
価手段上して候補の配列組合せからなるカナ列における
その接続妥当性から上位候補を選択して第1手順を実行
せしめ、更にカナ漢字変換部において該上位候補の組合
せからなるカナ列ルートについて切出し手段、カナ漢字
変換手段およびル−ト評価手段またはその相互繰返しに
より最優先カナ列ルーt−V得る第2手段に実行せしめ
て、漢字カナ混り文章音出力せしめることを特徴とする
音声入力文章作成方式。
In a voice input text creation system, the voice recognition unit compares the unknown input voice with the standard feature value registered in the voice dictionary.In the voice input sentence creation system, the voice recognition unit outputs the input voice as a sentence containing kanji and kana by the all-kana-kanji conversion unit. An analysis matching means that analyzes the voice in spectrum time series and compares the feature values point by point with a standard bang to calculate the similarity, and selects and retains all candidates from the top one or more according to the 36a similarity obtained by the analysis matching means. means for each candidate,
Means for re-evaluating the ranking arrangement or weighting of the candidates by comparing them with a first statistical table that arranges the occurrence rate rankings of the probability of the candidate being mistaken for another kana corresponding to the low degree, and for each candidate, prior connection and Regarding connections with other subsequent kana characters, the second. Means for evaluating connectivity by comparing with the third statistical table, means for cutting out kana sequence candidates by morphemes and/or combinations thereof, referring to grammar, matching words in kanji dictionaries, and \cutting length. The control unit is equipped with means for evaluating and converting kana strings into kanji and means for determining the combination route of cutting units. The degree of similarity is calculated for each candidate, and the number of candidates and the ranking are reduced by the re-evaluation means for the candidates selected and arranged by the selection holding means.
and correction is performed, and the connection evaluation means selects the top candidate based on its connection validity in the kana string consisting of the arrangement combination of the candidates and executes the first step, and further the kana-kanji conversion unit The cutting means, the kana-kanji conversion means and the route evaluation means, or the second means which obtains the highest priority kana string route t-V by repeating these means, are executed for the kana string route consisting of a combination of candidates, and the sound of the sentence containing kanji and kana is executed. A voice input text creation method characterized by output.
JP58212019A 1983-11-11 1983-11-11 Production system of voice input sentence Pending JPS60105037A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58212019A JPS60105037A (en) 1983-11-11 1983-11-11 Production system of voice input sentence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58212019A JPS60105037A (en) 1983-11-11 1983-11-11 Production system of voice input sentence

Publications (1)

Publication Number Publication Date
JPS60105037A true JPS60105037A (en) 1985-06-10

Family

ID=16615527

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58212019A Pending JPS60105037A (en) 1983-11-11 1983-11-11 Production system of voice input sentence

Country Status (1)

Country Link
JP (1) JPS60105037A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61294591A (en) * 1985-06-21 1986-12-25 Toshiba Corp Information input device
GB2403370B (en) * 2003-06-17 2007-04-25 Ash Technologies Res Ltd A viewing device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61294591A (en) * 1985-06-21 1986-12-25 Toshiba Corp Information input device
JPH0469960B2 (en) * 1985-06-21 1992-11-09 Tokyo Shibaura Electric Co
GB2403370B (en) * 2003-06-17 2007-04-25 Ash Technologies Res Ltd A viewing device

Similar Documents

Publication Publication Date Title
US8352266B2 (en) System and methods for improving accuracy of speech recognition utilizing concept to keyword mapping
US5878390A (en) Speech recognition apparatus equipped with means for removing erroneous candidate of speech recognition
EP1800294B1 (en) System and methods for improving accuracy of speech recognition
KR100853173B1 (en) Automatic speech interpretation system based on statistical automatic translation mode, translation processing method and training method thereof
KR100669241B1 (en) System and method of synthesizing dialog-style speech using speech-act information
CN111898342A (en) Chinese pronunciation verification method based on edit distance
JPS60105037A (en) Production system of voice input sentence
CA2880554C (en) System and methods for improving accuracy of speech recognition
JPS62165267A (en) Voice word processor device
KR20210020294A (en) Method And Apparatus for Providing Speech Recognition of Word Unit or Sentence Unit
JPS6229796B2 (en)
JP2001013992A (en) Voice understanding device
JPH08185197A (en) Japanese analyzing device and japanese text speech synthesizing device
JPS6039238A (en) Voice input sentence forming system
JP4140248B2 (en) Database search device
CN114333788A (en) Voice recognition correction method and system based on user intention analysis
Chen et al. A New Prosodic Phrasing Model for Chinese TTS Systems.
KR20000026814A (en) Method for separating word clause for successive voice recognition and voice recognition method using the method
JP2000214881A (en) Apparatus and method for sound recognition linguistic model generation
JPS6132167A (en) Kana-kanji conversion processor
JPH02255941A (en) Kana/kanji conversion system
JPH1185749A (en) Japanese sentence abbreviated element interpolation device
Duncan System combination for improved automatic generation of N-best proper nouns pronunciation
JPH11327871A (en) Voice synthesizing device
JPH0262659A (en) Extracting device for correction candidate character of japanese sentence