JPH0199093A

JPH0199093A - Reference pattern generator for voice recognition

Info

Publication number: JPH0199093A
Application number: JP62254731A
Authority: JP
Inventors: Jun Hoyano; 保屋野　純
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1987-10-12
Filing date: 1987-10-12
Publication date: 1989-04-17

Abstract

PURPOSE: To prepare a reference pattern with characteristics corresponding to the features of an input means by arranging a filter capable of controlling frequency characteristics on the input side of a reference pattern preparing circuit. CONSTITUTION: A data base 1 for voice samples recorded by the use of a storage device such as a magnetic tape is connected to the reference pattern preparing circuit 3 through the filter 2 such as a digital filter capable of controlling frequency characteristics and the circuit 3 prepares a voice recognition reference pattern 4. Since the pattern 4 is made equivalent to the pattern of an input voice inputted through a microphone or equivalent to an input voice pattern inputted through a telephone set in accordance with the setting of characteristics, the pattern 4 corresponding to the change of an input means can easily be prepared while using the same data base 1.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、不特定話者の音声認識等に用いられる標準パ
ターンの作成装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a standard pattern creation device used for speech recognition of unspecified speakers.

[Conventional technology]

不特定話者の発声した単語音声を認識するには、多数話
者の各単語音声毎に認識用の標準パターンを作成してお
き、不特定入力音声のパターンと各標準パターンとを比
較し、最も近似した標準パターンの単語を認識結果とす
る手段が用いられておυ、この際、多数話者の各単語音
声から代表的なパターンを選択するには、クラスタリン
グの手法が用いられている。In order to recognize word sounds uttered by unspecified speakers, standard patterns for recognition are created for each word sound of multiple speakers, and the pattern of the unspecified input sound is compared with each standard pattern. A method is used in which the word of the standard pattern that is most similar is used as the recognition result, and in this case, a clustering method is used to select a representative pattern from each word voice of many speakers.

[Problem that the invention seeks to solve]

しかし、標準パターンを作成するには、話者の単語音声
から求めた音声サンプルのデータベースを用い、これに
基づいて標準パターンの作成を行なっておシ、同一話者
による同一単語音声であっても、音声入力手段が例えば
比較的忠実度の高いマイクロホンの場合と忠実度の低い
電話機の場合との如く異れば、変換周波数特性の差異に
より標準パターンも異なるため、入力手段の変更に応じ
その都度音声サンプルを収集し、かつ、データベースを
更新しなければならず、この操作が面倒となる問題を生
じている。However, in order to create a standard pattern, a database of speech samples obtained from the word speech of the speaker is used, and the standard pattern is created based on this. If the audio input means is different, for example, a microphone with relatively high fidelity and a telephone with low fidelity, the standard pattern will also differ due to the difference in conversion frequency characteristics. Voice samples must be collected and the database must be updated, creating a problem in which this operation becomes cumbersome.

[Means for solving problems]

前述の問題を解決するため、本発明はつぎの手段によシ
構成するものとなっている。In order to solve the above-mentioned problem, the present invention is constructed by the following means.

すなわち、上述の標準パターンを作成する装置において
、データベースの与えられる標準パターン作成回路の入
力側へ設けられた周波数特性の制御可能なｐ波器を備え
たものである。That is, the apparatus for creating the standard pattern described above is provided with a p-wave generator whose frequency characteristics can be controlled, which is provided on the input side of the standard pattern creating circuit provided with the database.

[Effect]

したがって、−旦音声サンプルを収集し、データベース
を作成しておけば、入力手段の周波数特性に応じてＰ波
器の周波数特性を制御することにより、同一のデータベ
ースに基づき、入力手段の特性と対応した特性の標準パ
ターンを作成することができる。Therefore, once voice samples are collected and a database is created, by controlling the frequency characteristics of the P-wave device according to the frequency characteristics of the input means, it is possible to correspond to the characteristics of the input means based on the same database. It is possible to create standard patterns with specific characteristics.

〔Example〕

以下、実施例を示す図によって本発明の詳細な説明する
。Hereinafter, the present invention will be explained in detail with reference to figures showing examples.

第１図はブロック図であシ、磁気テープ等の記憶装置に
より記録された音声サンプルのデータベース１は、ディ
ジタルフィルタ等の周波数特性を制御することの可能な
ろ波器２を介し、標準パターン作成回路３へ与えられ、
同回路３の出力として音声認識用の標準パターン４が作
成され、メモリ等へ格納されるものとなっている。FIG. 1 is a block diagram. A database 1 of audio samples recorded on a storage device such as a magnetic tape is processed through a standard pattern creation circuit through a filter 2 that can control frequency characteristics such as a digital filter. given to 3,
A standard pattern 4 for voice recognition is created as an output of the circuit 3 and is stored in a memory or the like.

ここにおいて、データベース１は、比較的高忠実度のマ
イクロホン等に工９収集された音声サンプルから作成さ
れておシ、音声認識を行なう音声が同等の入力手段によ
り与えられる場合は、第２図に周波数ｆ対振幅大の特性
を示すとおり、平坦特性Ｆがｐ波器２の周波数特性とし
て制御により設定され、入力手段が電話機の場合には、
これの周波数特性に応じ同等の特性Ｔが同様に設定され
る。Here, the database 1 is created from voice samples collected using a relatively high-fidelity microphone, etc. If the voice for voice recognition is given by an equivalent input means, the database 1 is shown in FIG. As shown in the characteristic of frequency f vs. large amplitude, when the flat characteristic F is set by control as the frequency characteristic of the p-wave device 2, and the input means is a telephone,
An equivalent characteristic T is similarly set according to this frequency characteristic.

したがって、特性Ｆの設定により標準パターン４は、マ
イクロホンを介する入力音声のパターンと等価なものと
なシ、特性Ｔの設定によっては、標準パターン４が電話
機を介する入力音声のパターンと等価なものとなるため
、同一のデータベース１を用いながら、入力手段の変更
に応じた標準パターン４を容易に作成することができる
。Therefore, depending on the setting of characteristic F, standard pattern 4 is equivalent to the pattern of input audio through a microphone, and depending on the setting of characteristic T, standard pattern 4 is equivalent to the pattern of input audio through a telephone. Therefore, while using the same database 1, it is possible to easily create a standard pattern 4 that corresponds to a change in the input means.

なお、ｐ波器２は、標準パターン作成回路３の入力側へ
設ければよく、中間に他の回路が介在しても同様であり
、Ｐ波器２の特性は第２図のもののみならず、入力手段
の特性に応じて設定すればよい。Note that the P-wave device 2 may be provided on the input side of the standard pattern creation circuit 3, and the same applies even if another circuit is interposed in between.The characteristics of the P-wave device 2 are those shown in FIG. First, it may be set according to the characteristics of the input means.

〔Effect of the invention〕

以上の説明により明らかなとお９本発明によれば、標準
パターン作成回路の入力側へ周波数特性の制御可能なｐ
波器を設け、これを介して音声サンプルのデータベース
を与えるものとしたため、同一のデータベースから入力
手段に応じた標準パターンを作成することが自在となり
、音声サンプル収集の手間が省略できるため、音声認識
用の標準パターン作成において顕著な効果が得られる。As is clear from the above explanation, according to the present invention, a controllable frequency characteristic pixel is provided on the input side of the standard pattern creation circuit.
By providing a voice sample database through which a voice sample database is provided, it is possible to create a standard pattern according to the input method from the same database, and the trouble of voice sample collection can be omitted, making it possible to improve voice recognition. A remarkable effect can be obtained in the creation of standard patterns for applications.

[Brief explanation of the drawing]

図は本発明の実施例を示し、第１図はブロック図、第２
図はｐ波器の周波数特性を示す図である。１・・・・データベース、２・・・・Ｐ波器、３・・・
・標準パターン作成回路、４・・・・標準パターン。The figures show embodiments of the present invention, with Figure 1 being a block diagram and Figure 2 being a block diagram.
The figure shows the frequency characteristics of a p-wave device. 1...Database, 2...P-wave device, 3...
・Standard pattern creation circuit, 4...Standard pattern.

Claims

[Claims]

A device for creating a standard pattern for speech recognition based on a database of speech samples, characterized by comprising a filter whose frequency characteristics can be controlled and provided on the input side of a standard pattern creation circuit provided with the database. Standard pattern creation device for speech recognition.