JPS636355B2

Info

Publication number: JPS636355B2
Application number: JP55029803A
Authority: JP
Inventors: Iwao Yamabe; Akira Toda
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 1980-03-11
Filing date: 1980-03-11
Publication date: 1988-02-09
Also published as: JPS56126160A

Description

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a kanji input device that converts speech into kanji codes and inputs them into a computerized phototypesetting system.

Conventionally, when kanji are processed by computer, the kanji data is entered by an operator working a kanji keyboard or kanji tablet by hand. Entering kanji data therefore takes a great deal of labor, requires technical proficiency, and causes physical fatigue. The object of the present invention is to provide a kanji input device for printing that is free of these drawbacks.

The invention is described below.

The present invention relates to a kanji input device for printing that converts speech into kanji codes and inputs them into a computerized phototypesetting system. As shown in FIG. 1, it is provided with: a parameter extraction device 10 that extracts feature parameters CP of the speech input from a microphone 1; a phoneme data storage device 20 that stores phoneme parameters of predetermined phonemes together with phoneme codes; a speech determination device 30 that compares the feature parameters CP from the parameter extraction device 10 with the phoneme parameters SP from the phoneme data storage device 20 and outputs the phoneme code SC corresponding to the phoneme parameter whose similarity is at or above a predetermined level and is the highest; a lip open/close monitoring device 200 that monitors the opening and closing of the lips 101 of a speaker 100; a gate device 40 that, when the speech determination device 30 outputs a phoneme code SC that is difficult to discriminate, outputs the correct phoneme code as a conversion phoneme code TSC on the basis of an open/close signal NS from the lip open/close monitoring device 200; a Japanese dictionary storage device 50 that stores, as a Japanese dictionary, descriptive-word codes corresponding to various headwords; a kana-kanji conversion device 60 that compares the conversion phoneme code TSC output from the gate device 40 with the headwords in the Japanese dictionary storage device 50, retrieves the descriptive-word data DW of the matching headword, and outputs it to a display device 3 in accordance with font information FI from a font memory 2; a selection designation device 70 for selecting and designating one of the homophones (same reading, different meaning) or same-reading variant forms shown on the display device 3, and for entering functions, special symbols (yakumono), and the desired constituent characters when no applicable word exists; and a kanji code storage device 80 that stores the kanji code CH output from the kana-kanji conversion device 60 in accordance with the selection made on the selection designation device, and transmits it together with layout information to the computerized phototypesetting system 4.

As shown in FIG. 2, the parameter extraction device 10 consists of a sampling circuit 11 that samples the speech signal AS from the microphone 1 at predetermined intervals (for example, 10 ms) in accordance with a sampling signal SM; a filter 12 that filters the sampling data SD from the sampling circuit 11 through eight channels of bandpass filters 12F1 to 12F8; and a quantization circuit 13 that quantizes each output of the filter 12 (12F1 to 12F8) into speech quantization signals (CH1S to CH8S; 2 bits each) serving as the feature parameters CP. The passbands of the bandpass filters 12F1 to 12F8 that make up the filter 12 are as shown in FIG. 3 and divide the speech frequency range of 200 to 5000 Hz into equal parts on a logarithmic scale.
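The logarithmic band division and the 2-bit quantization can be sketched as follows. This is only an illustration: the actual band edges are those of FIG. 3, which is not reproduced here, so the edges computed below are an assumption derived from the stated 200 to 5000 Hz range and eight channels. The function names `log_band_edges` and `quantize_2bit`, the `full_scale` parameter, and the uniform quantization thresholds are likewise hypothetical.

```python
def log_band_edges(f_low=200.0, f_high=5000.0, channels=8):
    """Return channels + 1 band edges spaced equally on a log-frequency axis.

    Assumption: "equal logarithmic division" means every band has the same
    upper/lower frequency ratio. The patent's real edges are in FIG. 3.
    """
    ratio = (f_high / f_low) ** (1.0 / channels)
    return [f_low * ratio ** k for k in range(channels + 1)]


def quantize_2bit(amplitude, full_scale=1.0):
    """Map an amplitude in [0, full_scale] to one of four levels (0..3).

    Assumption: uniform thresholds; the patent only states that each
    channel is quantized to four levels (00 to 11).
    """
    clipped = min(max(amplitude, 0.0), full_scale)
    return min(int(4 * clipped / full_scale), 3)


edges = log_band_edges()
bands = list(zip(edges[:-1], edges[1:]))  # (lower, upper) for 12F1..12F8
for number, (lower, upper) in enumerate(bands, start=1):
    print(f"12F{number}: {lower:7.1f} Hz to {upper:7.1f} Hz")
```

With these assumptions each band spans the same frequency ratio (about 1.5), so the low bands are narrow and the high bands wide.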

The lip open/close monitoring device 200 consists of a TV camera 201 that captures the area around the lips 101 of the speaker 100, an AD converter 202 that converts the video signal from the TV camera 201 into a digital signal, a frame memory 203 that stores the digital signal from the AD converter 202 frame by frame, and an open/close determination circuit 204 that determines from the stored information whether the lips 101 are open or closed.

As shown in FIG. 4, the phoneme data storage device 20 stores the phoneme parameters (channels CH1 to CH8) of five vowels (A to O) and the phoneme parameters (channels CH1 to CH8) of 15 consonants (B to Z), each together with a phoneme code expressed as a four-digit hexadecimal number in JIS code. As shown in FIG. 5, the Japanese dictionary storage device 50 stores headwords arranged in Japanese syllabary (a-i-u-e-o) order together with the corresponding descriptive-word data (four-digit hexadecimal JIS codes) representing hiragana data, katakana data, kanji data, and so on. The Japanese dictionary storage device 50 thus serves as an everyday Japanese-language dictionary.

In this configuration, the voice operator (for example, the operator who converts speech into kanji codes CHA and inputs them into the computerized phototypesetting system 4) stands in front of the TV camera 201 and reads a given text into the microphone 1 in so-called wakachi reading, that is, with the text divided into word units. The speech signal AS from the microphone 1 is then input to the sampling circuit 11 in the parameter extraction device 10 and sampled. For example, the sentence 「桜は、花です。」 ("Cherry blossoms are flowers.") is read as "SAKURA HA TEN HANA DESU MARU".

The speech signal AS for "E" (え), for example, is as shown in FIG. 6. It is input to the sampling circuit 11 and sampled every 10 ms in accordance with the sampling signal SM, and the sampling data SD is input to each of the eight bandpass filters 12F1 to 12F8 of the filter 12. Each of the bandpass filters 12F1 to 12F8 passes the part of the sampling data SD that falls within its band shown in FIG. 3. The passed data is input to the eight-channel quantization circuits 13Q1 to 13Q8, which convert it, according to the amplitude in each band, into the 2-bit, four-level (00 to 11) speech quantization signals CH1S to CH8S that serve as the feature parameters CP. The feature parameters CP, made up of the eight 2-bit channel signals CH1S to CH8S, are output at sampling times SAMP1, SAMP2, ... as shown for example in FIG. 7, and are input to the speech determination device 30.

The speech determination device 30 holds the phoneme parameters of the operator who uses the kanji input device; for example, as shown in FIG. 4, the phoneme parameters and phoneme codes for the vowels (A to O) and consonants (B to Z) are stored for that operator. At every sample, the stored phoneme parameters CH1 to CH8 are compared with the feature parameters CP (speech quantization signals CH1S to CH8S) from the parameter extraction device 10, and their similarity is determined. For each channel, a difference of up to ±1 on the four-level scale is treated as a match. The number of matching channels is counted, and when the count is, for example, 6 (six of the eight channels match), the phoneme code corresponding to that phoneme parameter is output.

Suppose, for example, that phoneme parameter Q is "00", "10", "11", "01", "00", "10", "01", "00" from channel 1 onward. If the feature parameters CP are "01", "10", "11", "10", "00", "10", "01", "00" from channel 1 onward, channels 1 and 4 differ by ±1 and all the others are identical (the count is 8), so the phoneme code corresponding to phoneme parameter Q is output. If instead the feature parameters CP are "10", "01", "11", "11", "11", "00", "01", "11" from channel 1 onward, the number of matching channels is 2, so no phoneme code is output. This comparison is performed for every phoneme parameter, and the phoneme code corresponding to the phoneme parameter with the highest match count is output.
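The channel-matching rule can be sketched as below. The names `match_count` and `classify_frame` are hypothetical, the 2-bit levels are held as integers 0 to 3, and the threshold is read as "at least 6 of 8 channels". Template Q is taken here as eight values, chosen to be consistent with the statement that only channels 1 and 4 differ by one level from the first feature frame; that reading is an assumption.

```python
def match_count(template, frame, tolerance=1):
    """Count channels whose 2-bit levels differ by at most `tolerance`."""
    return sum(1 for t, f in zip(template, frame) if abs(t - f) <= tolerance)


def classify_frame(frame, templates, threshold=6):
    """Return the code of the best-matching template, or None if the best
    match count is below the threshold (assumed: at least 6 of 8 channels)."""
    best_code, best_count = None, -1
    for code, template in templates.items():
        count = match_count(template, frame)
        if count > best_count:
            best_code, best_count = code, count
    return best_code if best_count >= threshold else None


# Template for phoneme parameter Q (assumed eight-channel reading).
Q = [0b00, 0b10, 0b11, 0b01, 0b00, 0b10, 0b01, 0b00]
# First feature frame from the text: channels 1 and 4 are off by one level.
frame_a = [0b01, 0b10, 0b11, 0b10, 0b00, 0b10, 0b01, 0b00]
# Second feature frame from the text: too few channels agree.
frame_b = [0b10, 0b01, 0b11, 0b11, 0b11, 0b00, 0b01, 0b11]

print(match_count(Q, frame_a))             # 8
print(classify_frame(frame_a, {"Q": Q}))   # Q
print(classify_frame(frame_b, {"Q": Q}))   # None
```

In a full table, `templates` would hold one entry per stored vowel and consonant, and ties between equally good templates would need a rule of their own; this sketch simply keeps the first one found.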

Only when such a phoneme code persists for a predetermined time (for example, 30 ms), that is, only when the same phoneme code occurs three or more times in succession over sampling times SAM1, SAM2, ..., is it output, merged into one, as the discriminated phoneme code SC. Thus, if comparing the feature parameters CP with the phoneme parameters SP gives a result such as that in FIG. 8A, the candidate phoneme codes SC form a sequence such as that in FIG. 8B, and only codes that persist three or more times are output, each merged into a single code, as in FIG. 8C. The phoneme codes SC output in this way are input to the gate device 40.
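The persistence rule (three or more identical codes in a row, merged into one) can be sketched as follows. The function name `collapse_runs` and the sample sequence are hypothetical; the sequence is not the data of FIG. 8, and `None` stands for a frame in which no phoneme code was output.

```python
from itertools import groupby


def collapse_runs(frame_codes, min_run=3):
    """Keep one code for each run of identical per-frame codes that lasts
    at least `min_run` frames; shorter runs and None frames are dropped."""
    merged = []
    for code, run in groupby(frame_codes):
        if code is not None and len(list(run)) >= min_run:
            merged.append(code)
    return merged


# Hypothetical per-frame results at 10 ms intervals.
frames = ["S", "S", "S", "A", "A", "A", "A", None, "K", "K", "U", "U", "U"]
print(collapse_runs(frames))  # ['S', 'A', 'U']
```

The two-frame run of "K" lasts only 20 ms, so it is discarded as too short.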

Meanwhile, while the operator is speaking, the movement of the lips 101 is captured by the TV camera 201 and their open or closed state is monitored. This operation is described with reference to the flowchart of FIG. 9.

First, the TV camera 201 is set up so that the lips 101 of the speaker 100 (the operator) appear in its picture. The picture is converted into a digital signal by the AD converter 202, and the digital video signal is stored frame by frame in the frame memory 203. The open/close determination circuit 204 reads and processes the information stored in the frame memory 203, determines whether the lips 101 are open or closed, and outputs the result as the open/close signal NS.

In processing the speech from the microphone 1, it is hard to extract feature parameters that separate the nasals "M" and "N" or the voiceless plosives "P" and "T", so these sounds are difficult to discriminate. It is known, however, that the nasal "M" and the plosive "P" are produced with the lips 101 closed, whereas the nasal "N" and the plosive "T" are produced with the lips 101 open, so the "M" and "N" sounds are discriminated using the open/close signal NS from the lip open/close monitoring device 200. That is, for phonemes other than "M", "N", "P", and "T", the gate device 40 passes on the output of the speech determination device 30 as described above. Only when a nasal "M" or "N" or a plosive "P" or "T" is detected does it refer to the open/close signal NS from the lip open/close monitoring device 200 and output the correct phoneme as the conversion phoneme code TSC.
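The gate logic can be sketched as below. For readability the phonemes are written as letters rather than four-digit JIS codes, and the open/close signal NS is modeled as a boolean `lips_closed`; both are assumptions, as are the names `AMBIGUOUS_PAIRS` and `gate`.

```python
# Each confusable phoneme maps to its (closed-lip, open-lip) pair.
AMBIGUOUS_PAIRS = {
    "M": ("M", "N"),
    "N": ("M", "N"),
    "P": ("P", "T"),
    "T": ("P", "T"),
}


def gate(phoneme, lips_closed):
    """Return the conversion phoneme for one recognized phoneme.

    Phonemes outside the confusable pairs pass through unchanged.
    For M/N and P/T the lip state decides: closed lips give M or P,
    open lips give N or T.
    """
    pair = AMBIGUOUS_PAIRS.get(phoneme)
    if pair is None:
        return phoneme
    closed_form, open_form = pair
    return closed_form if lips_closed else open_form


print(gate("N", lips_closed=True))   # M
print(gate("P", lips_closed=False))  # T
print(gate("A", lips_closed=True))   # A
```

The lip signal is consulted only for the four confusable phonemes, so an error in the camera path cannot disturb any other phoneme.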

The conversion phoneme codes TSC obtained from the gate device 40 in this way are input to the kana-kanji conversion device 60. For example, when the sentence 「桜は、花です。」 is input, the content of the conversion phoneme codes TSC is "SAKURA HA TEN HANA DESU MARU", with each phoneme represented as a four-digit hexadecimal JIS code. The kana-kanji conversion device 60 passes the incoming conversion phoneme codes TSC to the Japanese dictionary storage device 50 one after another, checks whether a headword matching the phoneme string exists, and, whenever one does, retrieves the descriptive-word data DW for that headword. At the same time it sends the descriptive-word data DW to the font memory 2, retrieves the corresponding font information FI, and displays all of the descriptive-word data DW on the display device 3 in accordance with that font information FI. If the descriptive-word data DW contains no homophones with different meanings (for example 「橋」 (bridge), 「端」 (edge), and 「箸」 (chopsticks) in FIG. 5) and no same-reading variant forms (for example 「さくら」, 「サクラ」, and 「桜」 in FIG. 5), the descriptive-word data DW is output as is to the kanji code storage device 80 as the kanji code CH.

The display device 3 thus shows all of the descriptive-word data DW for the headword. For the headword "SAKURA", for example, the descriptive-word data DW 「さくら (2435, 242F, 2469)」, 「サクラ (2535, 252F, 2569)」, and 「桜 (6115)」 are shown with serial numbers 1 to 3, and the operator selects the desired word by entering its number on the selection designation device 70. Once a word has been selected on the selection designation device 70, only the selected word is redisplayed on the display device 3, and its kanji code CH is output and stored in the kanji code storage device 80. Selection on the selection designation device 70 proceeds in this way, one item after another, as the conversion phoneme codes TSC arrive, and the selected kanji codes CH are stored in the kanji code storage device 80.

Besides selecting descriptive-word data DW, the selection designation device 70 issues function commands such as quad (a one-character space), carriage return and line feed (a forced wrap to the start of the next line), ruby (furigana), side marks (rules or emphasis dots beside characters), column break (move to the next column), and page break (move to the next page), and it also designates special symbols (yakumono), both signs (for example 「、」, (, ), ;) and pictorial marks (for example ■, 〓, 〓, 〓). The selection designation device 70 is also used to enter the desired word when no matching headword is registered in the Japanese dictionary storage device 50, or when a headword is registered but under a different reading.
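The dictionary lookup and selection flow can be sketched as follows. The "SAKURA" entry uses the codes quoted in the text; the "DESU" entry is a hypothetical single-candidate example that is not taken from FIG. 5. The names `DICTIONARY` and `convert`, and the `choose` callback standing in for the operator's input on the selection designation device 70, are assumptions.

```python
# Headword (phoneme string) -> candidate words, each a list of JIS codes.
DICTIONARY = {
    "SAKURA": [
        [0x2435, 0x242F, 0x2469],  # hiragana form, codes as quoted in the text
        [0x2535, 0x252F, 0x2569],  # katakana form, codes as quoted in the text
        [0x6115],                  # kanji form, code as quoted in the text
    ],
    "DESU": [[0x2447, 0x2439]],    # hypothetical entry with a single candidate
}


def convert(reading, dictionary, choose):
    """Return the kanji codes for one headword.

    - No entry: return None (the operator enters the characters by hand).
    - One candidate: return it directly, without asking the operator.
    - Several candidates: `choose` receives the list and returns the
      1-based serial number picked by the operator.
    """
    candidates = dictionary.get(reading)
    if not candidates:
        return None
    if len(candidates) == 1:
        return candidates[0]
    return candidates[choose(candidates) - 1]


def pick_third(candidates):
    """Stand-in for the operator entering serial number 3."""
    return 3


print([hex(c) for c in convert("SAKURA", DICTIONARY, pick_third)])  # ['0x6115']
print([hex(c) for c in convert("DESU", DICTIONARY, pick_third)])    # ['0x2447', '0x2439']
print(convert("XYZ", DICTIONARY, pick_third))                       # None
```

Passing the choice in as a callback keeps the lookup separate from the display and input hardware, which the description treats as separate devices.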

The kanji codes CH output from the kana-kanji conversion device 60 in this way are first stored in the kanji code storage device 80, and the stored data is output as the kanji codes CHA for the computerized phototypesetting system 4. Layout information LY, which specifies the position and size of headings, the arrangement of columns and sections on a page, the number of characters per line, the size and style of the type, and so on, is input to the computerized phototypesetting system 4 at the same time.

As described above, according to the present invention, speech recognition is performed not only on the feature parameters of the speech signal from the microphone but also on the open/close signal from the device that monitors the speaker's lips, so that highly accurate speech-to-kanji-code conversion can be achieved.

In the description above, text is input in so-called wakachi reading, but the text may instead be uttered one syllable at a time, with a wakachi symbol (corresponding to the gaps in wakachi reading) entered separately on the selection designation device 70. Also, in the embodiment above the speech feature parameters are obtained with an eight-channel bandpass filter, but 16 or 20 channels may be used instead, and the number of bits per channel of the speech quantization signal may be chosen freely (for example, 8 bits).

[Brief explanation of the drawing]

FIG. 1 is a block diagram showing one embodiment of the invention; FIG. 2 is a block diagram showing an example configuration of the parameter extraction device; FIG. 3 shows an example of the bandwidths of the (eight-channel) filter used in the invention; FIG. 4 shows the data stored in the phoneme data storage device; FIG. 5 shows an example of the contents of the Japanese dictionary storage device; FIG. 6 shows an example of a speech signal; FIG. 7 shows an example of the feature parameters output by the parameter extraction device; FIGS. 8A to 8C show how phoneme codes are output from the speech determination device; and FIG. 9 is a flowchart showing the operation of the invention up to the output of the phoneme code.

1: microphone; 2: font memory; 3: display device; 4: computerized phototypesetting system; 10: parameter extraction device; 11: sampling circuit; 12: filter; 13: quantization circuit; 20: phoneme data storage device; 30: speech determination device; 40: gate device; 50: Japanese dictionary storage device; 60: kana-kanji conversion device; 70: selection designation device; 80: kanji code storage device; 100: speaker (operator, etc.); 101: lips; 200: lip open/close monitoring device; 201: TV camera; 202: AD converter; 203: frame memory; 204: open/close determination circuit; CP: feature parameters; SP: phoneme parameters; PS: palatograph signal; SC: phoneme code; TSC: conversion phoneme code; DW: descriptive-word data; FI: font information; AS: speech signal; SM: sampling signal; SD: sampling data; PD: palatograph data; LY: layout information.

Claims

[Scope of Claims]

1. A kanji input device for printing, comprising:
(a) a parameter extraction device that extracts feature parameters of input speech;
(b) a phoneme data storage device that stores phoneme parameters of predetermined phonemes together with phoneme codes;
(c) a speech determination device that compares the feature parameters from the parameter extraction device with the phoneme parameters from the phoneme data storage device and outputs the phoneme code corresponding to the phoneme parameter whose similarity is at or above a predetermined level and is the highest;
(d) a lip open/close monitoring device that monitors the opening and closing of the speaker's lips;
(e) a gate device that, when a phoneme code that is difficult to discriminate is output from the speech determination device, outputs the correct phoneme code on the basis of an open/close signal from the lip open/close monitoring device;
(f) a Japanese dictionary storage device that stores, as a Japanese dictionary, descriptive-word codes corresponding to various headwords;
(g) a kana-kanji conversion device that compares the conversion phoneme code output from the gate device with the headwords of the Japanese dictionary storage device, retrieves the descriptive-word data of the matching headword, and outputs it to a display device in accordance with font information from a font memory;
(h) a selection designation device for selecting and designating one of the homophones or same-reading variant forms shown on the display device, and for entering functions, special symbols, and the desired constituent characters when no applicable word exists; and
(i) a kanji code storage device that stores the kanji codes output from the kana-kanji conversion device in accordance with the selection made on the selection designation device and transmits them together with layout information to a computerized phototypesetting system.