JPH03257499A

JPH03257499A - Character data input device

Info

Publication number: JPH03257499A
Application number: JP2057136A
Authority: JP
Inventors: Mikio Ogisu; 荻須　幹雄
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1990-03-08
Filing date: 1990-03-08
Publication date: 1991-11-15

Abstract

PURPOSE:To reduce deterioration of phonetic features of a synthesized sound due to the influence of time-directional distortion by performing shifting, compression, and expansion in finite N kinds of time directions which are given previously and generating N deformed phoneme matrixes, and inputting them to a range finder calculating device. CONSTITUTION:When a phoneme vector which is a variable constant representing spectrum envelope information on an input voice is inputted to an input terminal 1, a phoneme matrix generating device 2 stores the phoneme vector by constant (L+2P) frames and outputs the phoneme matrix consisting of (L+2P) phoneme vectors at intervals of L frames. The matrix is inputted from the device 2 to a restrained time-direction deforming device 8. The device 8 performs the shifting, compression, and expansion of the input phoneme matrix in N finite kinds of time directions to generate N deformed phoneme matrixes. Consequently, transmission in a fixed frame period becomes possible and the deterioration of phonetic features of the synthesized sound due to the influence of the time-directional distortion is reducible.

Description

【発明の詳細な説明】産業上の利用分野本発明は、音声データから文字データに自動的に変換す
る文字データ入力装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to a character data input device that automatically converts voice data into character data.

従来の技術従来、文字データ装置の入力形態の一つとして音声入力
方法が考案されている。音声データに対し、音声分析を
行ない、標準的な単語パターンや単語辞書を用いて、音
声データに対応する文字データを検索し、音声認識を行
なうという手段がとられている。2. Description of the Related Art Conventionally, a voice input method has been devised as one of the input forms for character data devices. A method is used to perform voice analysis on voice data, search for character data corresponding to the voice data using standard word patterns and word dictionaries, and perform voice recognition.

発明が解決しようとする課題しかしながら、音声入力速度が高速であるため、リアル
タイムで音声データを文字データに変換するのは困難で
あった。また、音声データは音声情報が定量的てないた
め、音声分析が正しく行なわれず、誤認識することもあ
った。Problems to be Solved by the Invention However, since the voice input speed is high, it has been difficult to convert voice data into character data in real time. Furthermore, since voice data does not contain quantitative voice information, voice analysis may not be performed correctly, resulting in erroneous recognition.

これらの理由により、音声入力速度がキーボード入力に
比べ２数倍、高速であるにもかかわらず市販の文字デー
タ装置はキーボーＩ・入力方法がほとんどであった。For these reasons, most commercially available character data devices use the keyboard I input method, even though the voice input speed is several times faster than keyboard input.

本発明は上記従来の課題を解決するもので、音声データ
から文字データへの変換を正確に行なう、音声入力可能
な文字データ入力装置を提供することを目的としている
。The present invention has been made to solve the above-mentioned conventional problems, and an object of the present invention is to provide a character data input device capable of voice input, which accurately converts voice data into character data.

課題を解決するための手段この課題を解決するために本発明の文字データ入力装置
は、音声入力装置の手段として、音声記録装置を持ち、
音声データを一度、音声記録し、音声記録データに対し
文字データへの変換を行ない文字データへの変換が間に
合わない場合、音声記録データからの入力を停止するこ
とにより、入力データを制限するように構成されている
。Means for Solving the Problem In order to solve this problem, the character data input device of the present invention has a voice recording device as a voice input device,
The voice data is recorded once, and the voice recorded data is converted to character data. If the conversion to character data cannot be completed in time, the input data is limited by stopping input from the voice recorded data. It is configured.

作用この構成により、音声データを文字データに自動的に変
換することができ、キーボード入力のように文字データ
入力装置に則する知識がなくても、文章を作成できる装
置を実現できる。Effect: With this configuration, it is possible to automatically convert voice data into character data, and it is possible to realize a device that can create sentences even without knowledge of character data input devices such as keyboard input.

実施例以下本発明の実施例について説明する。Example Examples of the present invention will be described below.

第１図は本発明の文字データ入力装置の構成を示した図
である。音声記録装置■は音声入力されたデータを音声
データ２として記録する。記録された音声データ２は音
声記録装置１で再生され音声認識装置３て音声解析され
単語パターン或いは単語辞書（図示せず）デジタルデー
タである２値データとしてテンポラリメモリ４に格納さ
れる。FIG. 1 is a diagram showing the configuration of a character data input device of the present invention. The audio recording device (2) records the audio input data as audio data 2. The recorded voice data 2 is reproduced by the voice recording device 1, voice analyzed by the voice recognition device 3, and stored in the temporary memory 4 as binary data, which is word pattern or word dictionary (not shown) digital data.

２値データ（テンポラリメモリ〉４は単語・文章変換装
置５により、単語文章データ６に変換される。音声記録
装置１で記録された音声データ２はアナログデータであ
り、音声認識袋Ｎ３で音声のスペクトル解析がされた後
、標準的な単語パターンや、単語辞書を用いて次々とデ
ジタルデータに変換しテンポラリメモリ４に格納してい
く。テンポラリメモリ４には音声記録装置１から音声認
識装置３へのデータロードの開始情報を格納するロード
ポインタ７と単語・文章変換装置５がテンポラリメモリ
４のどこまでを変換したかを示すノルリントポインタ８
があり、変換スタート時のまだ変換が行なわれていない
状態に発生ずるロードポインタ７の値とカレントポイン
タ８の値が一致する場合を除いて、ロードポインタ７と
カレントポインタ８の値が一致するまで音声認識装置３
はテンポラリメモリ４の２値データのロードを続ける。The binary data (temporary memory) 4 is converted into word/sentence data 6 by the word/sentence conversion device 5.The audio data 2 recorded by the audio recording device 1 is analog data, and the audio recognition bag N3 converts the audio data into word/sentence data 6. After the spectrum analysis, standard word patterns and word dictionaries are used to convert the data into digital data one after another and store it in the temporary memory 4. a load pointer 7 that stores data load start information; and a norlint pointer 8 that indicates how much of the temporary memory 4 has been converted by the word/sentence conversion device 5.
, and the value of load pointer 7 and current pointer 8 match, which occurs when the conversion starts and the value of current pointer 8 matches, until the value of load pointer 7 and current pointer 8 match. Voice recognition device 3
continues loading the binary data in the temporary memory 4.

従って、２値データのローディング中、音声記録装置１
は音声データ２を再生を続けると共にロードポインタ７
の更新を行なう。ロードポインタ７とカレントポインタ
８の値を入力とする比較器９がロードポインタ７の更新
によりロードポインタ７とカレントポインタ８の値が一
致したことを検出すると、ロード停止信号１０を音声記
録装置１と音声認識装置３に伝送し、音声記録装置１に
対しては音声データ２の再生の停止を、音声認識装置３
に対しては音声データの変換を停止する。単語・文章変
換装置５はテンポラリメモリ４内の２値データを文脈解
析し、かな漢字変換をし、単語・文章データ６を作成す
ると共に、カレントポインタ８に対し、ポインタ値の更
新をする。カレントポインタ８の更新によりロードポイ
ンタ７とカレントポインタ８の値が比較器９により一致
したと判定されると単語・文章変換装置５に対し、変換
停止信号１１が出力され、テンポラリメモリ４内の２値
データから単語文章データ６への変換が停止される。比
較器９からのロード停止信号１０はカレントポインタ８
が更新されることにより、また、変換停止信号１１はロ
ードポインタ７が更新されることにより解除され、動作
が再開される。音声認識スピードが間に合わない場合、
音声記録装置からの再生データ量を、一定量再生すれば
、音声記録装置を停止し、音声認識終了信号（図示せず
）により再生を再開することもでき、かつ音声認識と単
語・文章変換の間にテンポラリメモリ４を介することで
、単語・文章変換スピードの問題を解決することができ
る。Therefore, while loading binary data, the audio recording device 1
continues playing audio data 2 and moves the load pointer 7
Update. When the comparator 9 which inputs the values of the load pointer 7 and the current pointer 8 detects that the values of the load pointer 7 and the current pointer 8 match due to updating of the load pointer 7, a load stop signal 10 is sent to the audio recording device 1. The voice recognition device 3 instructs the voice recording device 1 to stop playing the voice data 2.
, the conversion of audio data is stopped. The word/sentence conversion device 5 analyzes the context of the binary data in the temporary memory 4, performs kana-kanji conversion, creates word/sentence data 6, and updates the pointer value of the current pointer 8. When the comparator 9 determines that the values of the load pointer 7 and the current pointer 8 match by updating the current pointer 8, a conversion stop signal 11 is output to the word/sentence conversion device 5, and the Conversion from value data to word sentence data 6 is stopped. The load stop signal 10 from the comparator 9 is the current pointer 8
By updating , the conversion stop signal 11 is canceled by updating the load pointer 7, and the operation is restarted. If the voice recognition speed is not fast enough,
Once a certain amount of data has been reproduced from the voice recording device, the voice recording device can be stopped and playback can be resumed using a voice recognition end signal (not shown). By interposing the temporary memory 4 in between, the problem of word/sentence conversion speed can be solved.

第２図は本発明の校正機能の構成について示した図であ
る。かなデータ（かな漢字変換される前のデータ）や−
旦かな漢字変換等処理されたデータであるが音声の誤認
識により、文章が整っていないデータである単語文章デ
ータ６に対し、文脈解析及び、校正機能を有する単語・
文章変換装置５を介して、単純な２値データに変換され
テンポラリメモリ４に格納される。単語・文章変換装置
５ては文脈上、音声誤認識のために文章が整っていない
と判断すると２値データ変換の際に単語辞書（図示せず
）から最適な単語を取り出し置き換えを実行する。２値
データが格納されたテンポラリメモリ４は単語・文章変
換装置５により再度、単語文章データ６に変換され、か
な漢字変換等の処理がされる。ロードポインタ７は単語
文章データ６のどこまてが２値データに再変換され、テ
ンポラリメモリ４に格納されたかを示すポインタで、ま
た、カレントポインタ８は２値データが格納されたテン
ポラリメモリ４のどこまでが単語・文章データに変換さ
れたかを示すポインタである。校正開始時を除いて、ロ
ードポインタ７とカレントポインタ８の値が一致するま
で単語文章データ６の２値データへの変換は実行され、
ロードポインタ７の更新を行ない、ロードポインタ７の
値の更新によりロードポインタ７とカレントポインタ８
の値が一致すると比較器９によりロード停止信号１２が
単語文章データ６に出力され、データのロードが停止さ
れる。一方、単語文章変換装置５は２値データの格納さ
れたテンポラリメモリ４内のデータを単語・文章データ
に変換を行ない。カレントポインタ８の更新を行なう。FIG. 2 is a diagram showing the configuration of the calibration function of the present invention. Kana data (data before kana-kanji conversion) and -
Word text data 6, which is data that has been processed such as simple kanji conversion, but the text is not well-organized due to misrecognition of the voice, is processed using word/sentence data that has context analysis and proofreading functions.
It is converted into simple binary data via the text conversion device 5 and stored in the temporary memory 4. If the word/sentence conversion device 5 determines that the sentence is not in order due to speech misrecognition due to the context, it extracts the most suitable word from a word dictionary (not shown) and executes replacement during binary data conversion. The temporary memory 4 in which the binary data is stored is again converted into word/sentence data 6 by the word/sentence conversion device 5, and subjected to processing such as kana/kanji conversion. The load pointer 7 is a pointer indicating which part of the word sentence data 6 has been reconverted to binary data and stored in the temporary memory 4, and the current pointer 8 is a pointer indicating which part of the word sentence data 6 has been reconverted to binary data and stored in the temporary memory 4. This is a pointer indicating how far the data has been converted into word/sentence data. Except at the start of proofreading, conversion of word sentence data 6 to binary data is executed until the values of load pointer 7 and current pointer 8 match,
Load pointer 7 is updated, and by updating the value of load pointer 7, load pointer 7 and current pointer 8 are updated.
When the values match, the comparator 9 outputs a load stop signal 12 to the word sentence data 6, and the data loading is stopped. On the other hand, the word/sentence conversion device 5 converts the data in the temporary memory 4 in which binary data is stored into word/sentence data. The current pointer 8 is updated.

カレントポインタ８の値の更新によりカレントポインタ
８とロードポインタ７の値が一致すると比較器９より変
換停止信号１１が単語・文章変換装置５に入力され変換
が停止される。ロード停止信号１２の停止解除はカレン
トポインタ８の値の更新によって、変換停止信号１１の
停止解除はロードポインタ７の値の更新によって解除さ
れる。音声入力データを誤認識したデータ等に対して校
正することができ、より正確に文章を作成することがで
きる。When the values of the current pointer 8 and the load pointer 7 match by updating the value of the current pointer 8, a conversion stop signal 11 is input from the comparator 9 to the word/sentence conversion device 5, and the conversion is stopped. The stop of the load stop signal 12 is canceled by updating the value of the current pointer 8, and the stop of the conversion stop signal 11 is canceled by updating the value of the load pointer 7. It is possible to proofread voice input data for misrecognized data, etc., and it is possible to create sentences more accurately.

発明の効果以上のように本発明によれば、音声データを文字データ
に自動的に変更するシステムを校正することができ、音
声誤認識に対する構成機能により、完成度の高い文章を
文字データ入力装置に対する知識がなくても作成するこ
とがてきる。Effects of the Invention As described above, according to the present invention, it is possible to calibrate a system that automatically changes voice data to text data, and by using the configuration function to prevent speech misrecognition, highly complete sentences can be input into a text data input device. It can be created without any knowledge of.

[Brief explanation of drawings]

第１図は本発明の文字データ入力装置の構成を示した図
、第２図は本発明の校正機能の構成について示した図で
ある。１・・・・・・音声記録装置、２・・・・・・音声デー
タ、３・・・・・・音声記録装置、４・・・・・・２値
データ（テンポラリメモリ〉、５・・・・・・単語・文
章変換装置、６・・・・・・単語・文章データ、７・・
・・・・ロードポインタ、８・・・・・・カレントポイ
ンタ、９・・・・・・比較器、１０．１２・・・・・・
ロード停止信号、１１・・・・・・変換停止信号。FIG. 1 is a diagram showing the configuration of a character data input device of the present invention, and FIG. 2 is a diagram showing the configuration of a proofreading function of the present invention. 1...Audio recording device, 2...Audio data, 3...Audio recording device, 4...Binary data (temporary memory), 5... ...Word/sentence conversion device, 6...Word/sentence data, 7...
...Load pointer, 8...Current pointer, 9...Comparator, 10.12...
Load stop signal, 11... Conversion stop signal.

Claims

[Claims]

(1) Have a voice recording device as a voice input method, and
A character data input device equipped with a voice recognition device, which converts voice data from a voice recording device into binary data through a band-pass filter, and has a temporary memory that stores the binary data as temporary working data. In a system for converting into word or sentence data, a data load permission signal from the voice recording device to the temporary memory that has been converted from binary data to word or sentence data is output from the word/sentence conversion device to the voice recording device. The audio data is binarized through a bandpass filter, and
The converted data is loaded into the temporary memory, and when the temporary memory area is full, the word/sentence conversion device outputs a signal to stop loading the audio data into the audio recording device. A character data input device characterized by creating words and sentences.

(2) The character data input device according to claim 1, wherein the data converted into words and sentences can be inputted again into the word and sentence conversion device to proofread the words and sentences.