JPH07113837B2

JPH07113837B2 - Character generator

Info

Publication number: JPH07113837B2
Application number: JP61143746A
Authority: JP
Inventors: 勝信伏木田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1986-06-18
Filing date: 1986-06-18
Publication date: 1995-12-06
Anticipated expiration: 2010-12-06
Also published as: JPS62299898A

Description

【発明の詳細な説明】（産業上の利用分野）本発明は、ワードプロセツサ等に使用され、文字系列を
出力する文字生成装置に関する。Description: TECHNICAL FIELD The present invention relates to a character generation device which is used in a word processor or the like and outputs a character sequence.

（従来の技術）従来キーボード等より文字データを入力した後に予め定
められたフオントの文字列を前記文字データに従つてプ
リンタにより出力することにより文章を作成する装置が
知られており、ワードプロセツサと呼ばれている。(Prior Art) Conventionally, there is known a device for creating a sentence by inputting character data from a keyboard or the like and then outputting a predetermined font character string by a printer according to the character data, and a word processor. It is called.

（発明が解決しようとする問題点）しかしながら、従来のワードプロセツサにおいては出力
された文字列が画一的であり手書文字に見られるような
親しみ易さがなく非言語的な情報が伝達されない欠点が
あつた。(Problems to be solved by the invention) However, in the conventional word processor, the output character string is uniform and non-verbal information is transmitted without the familiarity as seen in handwritten characters. There was a flaw that was not done.

本発明の目的は文書作成者の非言語的な情報の伝達を可
能とし親しみ易い文字列を生成することを可能とする音
声認識を用いた文字生成装置を提供することにある。It is an object of the present invention to provide a character generation device using voice recognition that enables non-verbal information transmission by a document creator and generates a familiar character string.

（問題点を解決するための手段）本願の第１の発明は、音声認識を行なうことにより入力
音声を文字列に変換する手段と、前記入力音声からピッ
チ周波数変化幅または調音結合度を非言語データとして
抽出する手段と、前記文字列および前記非言語データか
ら文字を生成する手段とから構成されている。(Means for Solving Problems) A first invention of the present application is a means for converting an input voice into a character string by performing voice recognition, and a pitch frequency change width or an articulatory coupling degree from the input voice in a non-language. It is composed of means for extracting as data and means for generating characters from the character string and the non-language data.

また、本願の第２の発明は、キーボードから文字列を入
力する手段と、前記文字列に対応する音声から非言語デ
ータを抽出する手段と、前記文字列および前記非言語デ
ータから文字を生成する手段とから構成されている。A second invention of the present application is a means for inputting a character string from a keyboard, a means for extracting non-language data from a voice corresponding to the character string, and a character generated from the character string and the non-language data. And means.

（発明の原理）一般に音声には言語情報の他に非言語情報が含まれてい
ることが知られている。例えば、ピツチ周波数の変化は
感情的な情報を含んでおりピツチ周波数の変化幅の大き
い場合は、小さい場合に比べて感情が高揚していること
を示す。また、強意を表わす単語はピツチ周波数が高く
なる傾向がある。さらに、音声には調音結合と呼ばれる
性質があり、該文字に対応する音声のパタンが前後の文
字の影響を受ける。この影響の度合（調音結合係数と呼
ぶ）は例えば音声のホルマント分析を行なうことにより
単独に発声された文字に対応するホルマントパラメータ
との距離を計ることにより求められることが知られてい
る。本願発明は、以上述べた如き音声の諸性質を利用し
て、音声の非言語的なパラメータをも用いて文字の生成
を行なうものである。(Principle of the Invention) It is generally known that speech contains non-language information in addition to linguistic information. For example, a change in pitch frequency includes emotional information, and a large change in pitch frequency indicates that the emotion is higher than that in a small change. In addition, the pitch frequency of a word indicating strong intention tends to be high. Further, the voice has a property called articulation coupling, and the pattern of the voice corresponding to the character is influenced by the preceding and succeeding characters. It is known that the degree of this effect (referred to as an articulation coupling coefficient) is obtained by measuring the distance from a formant parameter corresponding to a character uttered alone by performing a formant analysis of the voice. The present invention utilizes the various characteristics of speech as described above to generate characters using non-verbal parameters of speech.

前記、非言語的な音声パラメータを文字に反映させる方
法としては、例えば、意味の強さを表わす各単語のピツ
チ周波数の平均値により文字の大きさを制御することに
より実現できる。また、調音結合係数は発声の「ていね
いさ」に表わしているから、文字生成の際に書き順デー
タと共に文字データを予め用意しておき、前記書き順デ
ータに従つて生成される線分データに対して低域通過フ
イルタを通し、前記低域通過フイルタの遮断周波数を前
記調音結合係数により制御して、生成される文字の「て
いねいさ」を変えることによりかわり易く表現される。The method of reflecting the non-verbal voice parameter on the character can be realized, for example, by controlling the character size by the average value of the pitch frequencies of the words representing the strength of meaning. Further, since the articulatory coupling coefficient is expressed in the "gentleness" of the utterance, character data is prepared in advance along with the writing order data at the time of character generation, and the line segment data is generated in accordance with the writing order data. On the other hand, it can be expressed easily by passing through a low-pass filter, controlling the cut-off frequency of the low-pass filter by the articulation coupling coefficient, and changing the "gentleness" of the generated character.

一方、言語データの入力方法としては音声を文字に変換
する方式が知られているが、現在の技術では変換の正解
率が必ずしも高くない。そこで、言語データの入力はキ
ーボードを用いて行ない、前記非言語データは音声から
抽出する方式も有効である。On the other hand, as a method of inputting linguistic data, a method of converting speech into characters is known, but in the present technology, the accuracy rate of conversion is not necessarily high. Therefore, it is also effective to input language data using a keyboard and extract the non-language data from voice.

（実施例）本願発明の実施例を図面を参照して詳細に説明する。(Example) The Example of this invention is described in detail with reference to drawings.

第１図は本願発明の実施例を示すブロック図である。本
図の装置では、スイツチ５の切換えによつて本願の第１
の発明および第２の発明の実施例が選択して実現でき
る。最初に、言語データおよび非言語データをともに音
声から抽出して行なう本願の第１の発明の実施例につい
て説明する。初期設定のために、キーボード２から初期
設定データがスイツチ５に送られスイツチ５の設定が行
なわれる。まず、音声波形が音声波形入力端子１を介し
て特徴抽出回路２に入力される。特徴抽出回路２は前記
音声波形からホルマントデータ、振幅データ、ピツチデ
ータ等の特徴パラメータ値列を抽出し音声認識部３内の
言語データ抽出回路3aおよび非言語データ抽出回路3bに
それぞれ出力する。言語データ抽出回路3aは前記特徴パ
ラメータ値列を文字列に変換しスイツチ５を介して標準
文字データメモリ６に出力する。前記文字列への変換方
式は例えば新美著「音声認識」（共立出版）に詳しいの
でここでは説明を省略する。一方、非言語データ抽出回
路3bは、前記特徴パラメータ値列から前記調音結合係数
等の非言語データを抽出し、この非言語データをスイツ
チ５を介して文字変形処理回路７に出力する。文字変形
処理回路７は、前記文字列に従つて標準文字データメモ
リ６から出力される文字データを前記非言語データに従
つて変形し、変形文字データとしてプリント装置８に出
力する。プリント装置８は前記変形文字データに従つて
文字をプリントする。FIG. 1 is a block diagram showing an embodiment of the present invention. In the apparatus of this figure, the first switch of the present application is realized by switching the switch 5.
The inventions of 1 and 2 can be selectively implemented. First, an embodiment of the first invention of the present application, in which both linguistic data and non-linguistic data are extracted from speech, will be described. For initial setting, the initial setting data is sent from the keyboard 2 to the switch 5, and the setting of the switch 5 is performed. First, a voice waveform is input to the feature extraction circuit 2 via the voice waveform input terminal 1. The feature extraction circuit 2 extracts feature parameter value strings such as formant data, amplitude data, pitch data, etc. from the voice waveform and outputs them to the language data extraction circuit 3a and the non-language data extraction circuit 3b in the voice recognition unit 3, respectively. The language data extraction circuit 3a converts the characteristic parameter value string into a character string and outputs it to the standard character data memory 6 via the switch 5. The method of converting to the character string is detailed in, for example, "Voice Recognition" by Niimi (Kyoritsu Shuppan), so the description thereof is omitted here. On the other hand, the non-verbal data extraction circuit 3b extracts non-verbal data such as the articulation coupling coefficient from the characteristic parameter value sequence and outputs the non-verbal data to the character transformation processing circuit 7 via the switch 5. The character modification processing circuit 7 modifies the character data output from the standard character data memory 6 according to the character string according to the non-language data, and outputs the modified character data to the printing device 8 as modified character data. The printing device 8 prints characters according to the modified character data.

以上、述べた実施例においては、言語データは音声から
抽出するものとしたが、言語データをキーボード４から
入力する本願の第２の発明の実施例もスイツチ５の切換
えにより可能となる。この場合は、キーボード４から予
め初期設定データをスイツチ制御データ伝送路43経由で
スイツチ５に入力してスイツチ５を制御し、キーボード
４の出力を文字データ伝送路41を介して標準文字データ
メモリ６に接続することにより行なうことができる。In the embodiment described above, the language data is extracted from the voice, but the embodiment of the second invention of the present application in which the language data is input from the keyboard 4 is also possible by switching the switch 5. In this case, the initialization data is input from the keyboard 4 in advance to the switch 5 via the switch control data transmission path 43 to control the switch 5, and the output of the keyboard 4 is sent via the character data transmission path 41 to the standard character data memory 6 Can be done by connecting to.

（発明の効果）以上述べたように本発明によれば、非言語的な情報を生
成される文字列に含ませることができ、より親しみ易い
文字列の生成が可能となる。(Effect of the Invention) As described above, according to the present invention, it is possible to include non-linguistic information in a generated character string, and it is possible to generate a more familiar character string.

[Brief description of drawings]

第１図は本願発明の一実施例を示すブロック図である。１……音声波形入力端子、２……特徴抽出回路、３……
音声認識部、3a……言語データ抽出回路、3b……非言語
データ抽出回路、４……キーボード入力装置、５……ス
イツチ、６……標準文字データメモリ、７……文字変形
処理回路、８……プリント装置、31,41……文字データ
伝送路、32,42……非言語データ伝送路、43……スイツ
チ制御データ伝送路。FIG. 1 is a block diagram showing an embodiment of the present invention. 1 ... Voice waveform input terminal, 2 ... Feature extraction circuit, 3 ...
Speech recognition unit, 3a ... language data extraction circuit, 3b ... non-language data extraction circuit, 4 ... keyboard input device, 5 ... switch, 6 ... standard character data memory, 7 ... character transformation processing circuit, 8 ...... Printer, 31,41 …… Character data transmission line, 32,42 …… Non-language data transmission line, 43 …… Switch control data transmission line.

Claims

[Claims]

1. A means for converting an input voice into a character string, a means for extracting a pitch frequency variation width or a degree of articulation as non-language data from the input voice, and a character generated according to the character string and the non-language data. And a character generation device.

2. A means for inputting a character string from a keyboard,
A character generation device comprising: a unit for extracting non-language data from a voice corresponding to the character string; and a unit for generating a character according to the character string and the non-language data.