JPS602997A

JPS602997A - Speaker identifying voice inputting method

Info

Publication number: JPS602997A
Application number: JP58110636A
Authority: JP
Inventors: 達也山口
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1983-06-20
Filing date: 1983-06-20
Publication date: 1985-01-09

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

Detailed Description of the Invention

(1) Technical Field of the Invention

The present invention relates to a voice input method for a device that stores standard patterns for multiple speakers, speaker by speaker, in a voice input device and recognizes the voice of an input speaker once that speaker has been designated. The method simplifies speaker designation and makes remote operation possible.

(2) Prior Art and Problems

Conventional voice input devices commonly hold standard patterns for multiple speakers, one set per speaker, and load the input speaker's standard pattern into the recognition device when the speaker is designated before recognition begins.

In such devices, when standard patterns for multiple speakers are registered, the speaker is designated either by a hardware selector switch or from the keyboard. An even more common approach is for each speaker to keep a floppy disk on which his or her own standard patterns are registered and to insert it at the start of work. With all of these methods, however, switching standard patterns always requires going to the voice input device and operating it there, so remote operation is not possible.

(3) Object of the Invention

The object of the present invention is to provide a speaker identification voice input method for a device that stores standard patterns for multiple speakers and loads the input speaker's standard pattern into the recognition device when the speaker is designated, such that speaker designation is simplified and remote operation becomes possible.

(4) Configuration of the Invention

To achieve the above object, the speaker identification voice input method of the present invention applies to a speech recognition device that holds standard patterns for multiple speakers, one set per speaker, and recognizes the voice of an input speaker by storing the standard pattern corresponding to that speaker in the recognition device. The method is characterized in that each speaker's name is registered by voice in a part of each speaker's standard pattern area, and in that the input speaker can have his or her own standard pattern stored in the recognition device by entering his or her own name by voice.

(5) Embodiments of the Invention

The basic idea of the present invention is to switch between the standard pattern areas of the individual speakers by voice, which is what makes remote operation possible.

So that the speaker can be designated by voice, a part of each speaker's standard pattern area is given an index section in which the standard patterns of all the names, each uttered by the speaker it belongs to, are arranged.

This arrangement is intended to make the area easy to access immediately when the voice of a name designating a speaker is input.

FIGS. 1(a) to 1(c) and the sub-steps of FIG. 2 are explanatory diagrams of the method of registering each speaker's standard patterns according to the present invention.

The sub-steps of FIG. 1(a) show the standard pattern area 1 of speaker A. An index area 2 is provided at the top of the area, and the standard patterns of the names of all speakers A, B, and C, each pronounced by the speaker in question, are arranged in it. The lower part stores the various standard patterns 3 that speaker A needs. To register this area, in the first step of FIG. 1(a) speaker A registers the A portion of index area 2 and A's voice patterns 3; in the next step speaker B registers the B portion of the index area; in the step after that speaker C registers the C portion of the index area. The result is the standard pattern shown in the final step of FIG. 1(a), which becomes speaker A's standard pattern.

FIGS. 1(b) and 1(c) show the standard patterns of speakers B and C, respectively.
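The layout of FIG. 1 can be sketched as a small data structure. This is only an illustration under stated assumptions: the names `SpeakerArea` and `build_area`, the use of a list of floats as a stand-in for a stored template, and the dictionary layout are all hypothetical and do not come from the patent, which does not specify a storage format.

```python
from dataclasses import dataclass, field

# Hypothetical stand-in for a stored voice template (the patent gives no format).
Pattern = list[float]


@dataclass
class SpeakerArea:
    """Standard pattern area 1 of one speaker (illustrative layout only)."""
    owner: str
    # Index area 2: one name template per speaker, each uttered by that speaker.
    index: dict[str, Pattern] = field(default_factory=dict)
    # Pattern section 3: the owner's own word templates.
    words: dict[str, Pattern] = field(default_factory=dict)


def build_area(owner, own_words, name_templates):
    """Fill one area in the order described for FIG. 1(a)."""
    area = SpeakerArea(owner=owner)
    # First step: the owner registers their own name slot and word patterns.
    area.index[owner] = list(name_templates[owner])
    area.words.update(own_words)
    # Later steps: every other speaker registers their own name slot.
    for speaker, template in name_templates.items():
        if speaker != owner:
            area.index[speaker] = list(template)
    return area
```

Built this way, the areas of A, B, and C share the same index contents and differ only in their word patterns, which matches the description of FIGS. 1(a) to 1(c).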

The sub-steps of FIG. 2 are explanatory diagrams of an alternative registration method.

In the first steps of FIG. 2, speakers A, B, and C each register, in their own voice and in the same way as the first step of FIG. 1(a), the standard pattern of their own name in index area 2 and the various standard patterns they need in area 3. In the final step of FIG. 2, software (a program) then transfers these name patterns between the areas on the memory 10 and writes them into the name slots that are still blank.
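The cross-copy step of FIG. 2 might look like the following minimal sketch. The function name `fill_blank_name_slots`, the nested-dictionary layout, and the use of `None` to mark a blank name slot are assumptions made for illustration; the patent only states that a program transfers the name patterns into the blank slots.

```python
def fill_blank_name_slots(areas):
    """Copy each speaker's own name template into the blank slots of the others.

    `areas` maps a speaker to {"index": {name: template or None}, "words": {...}}.
    A value of None marks a name slot that has not been filled yet (assumption).
    """
    for target in areas.values():
        for name in list(target["index"]):
            if target["index"][name] is None:
                # The authoritative copy is the one the named speaker recorded
                # in their own area.
                target["index"][name] = list(areas[name]["index"][name])
    return areas
```

Compared with the FIG. 1 procedure, each speaker utters their name once and the software does the rest, so no speaker has to record into another speaker's area.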

FIG. 3 is an explanatory diagram of the configuration of an embodiment of the present invention.

In FIG. 3, the voice input device consists of a file 10 controlled by a controller 11, software 12, an operating device 13, and a display unit 14, and is connected to a recognition device 16 through the controller 11. The file 10 stores the standard pattern area of each speaker, registered by the method of FIG. 1 or FIG. 2, with an index area holding the standard patterns of all speakers' names attached at its head.

The controller 11 controls the writing and reading of each speaker's standard patterns in the file 10 during registration and recognition. Control by a predetermined program is carried out by the software 12, while manual control and monitoring by an operator are carried out with the operating device 13 and the display unit 14. For voice input during registration and recognition, the speaker's voice is fed from the microphone 15 into the recognition device 16, which creates the standard pattern of the input voice. To replace the standard patterns in the recognition device, the speaker enters his or her name by voice, and the recognition code corresponding to the name is sent to the controller 11. From the per-speaker standard patterns in the file 10, the contents of the standard pattern area of the speaker corresponding to that recognition code, for example speaker B, are transferred as a block through the controller 11 to the recognition device 16 and buffered there, ready for speaker B's subsequent voice input. The recognition device 16 may be connected directly in the vicinity of the voice input device, or it may be installed at a remote location and connected by a communication line, so that mode designations such as registration, recognition, and speaker designation can be controlled remotely.

FIG. 4 is a flowchart showing the operation of the embodiment of FIG. 3 during recognition.

In FIG. 4, the mode designation control of the recognition device 16 puts the controller 11 into speaker designation mode, and the speaker inputs his or her name to the recognition device 16 through the microphone 15. The recognition device creates a voice pattern of the speaker name and sends it through the controller 11 to the file 10, where the index lookup described above checks whether corresponding registered data exists. If it does, the contents of that speaker's voice pattern area are sent as a block to the recognition device 16, and subsequent voice input is recognized by matching against those contents.
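The recognition-time flow of FIG. 4 can be sketched as follows. The patent does not say how a spoken name is matched against the index, so the nearest-template search with Euclidean distance and the `threshold` parameter are assumptions chosen for illustration, as are the names `designate_speaker`, `index`, and `file_areas`.

```python
import math


def designate_speaker(name_features, index, file_areas, threshold=1.0):
    """Return the pattern area to load into the recognizer, or None.

    name_features: feature vector of the spoken name (hypothetical format).
    index:         {speaker: name template}, as held in the index area.
    file_areas:    {speaker: pattern area}, as held in file 10.
    threshold:     maximum accepted distance (an assumed rejection rule).
    """
    best_name, best_distance = None, math.inf
    for name, template in index.items():
        distance = math.dist(name_features, template)  # Euclidean distance
        if distance < best_distance:
            best_name, best_distance = name, distance
    # No registered name is close enough: nothing is transferred.
    if best_name is None or best_distance > threshold:
        return None
    # Otherwise the whole area of the matched speaker is handed over as a block.
    return file_areas[best_name]
```

A caller would keep the returned area as the active template set and match all following utterances against it until another name is spoken in speaker designation mode.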

(6) Effects of the Invention

As described above, according to the present invention a speaker first enters his or her own name by voice, whereupon the contents of the corresponding standard pattern area in the voice input device are selected, sent to the recognition device, and used for the subsequent voice input. The speaker therefore does not need to go to the voice input device, operation is simplified, and remote operation becomes readily possible. The invention is accordingly expected to help broaden the range of uses of voice input devices.

[Brief explanation of drawings]

FIGS. 1 and 2 are explanatory diagrams of the voice dictionary registration method of the present invention, FIG. 3 is an explanatory diagram of the configuration of an embodiment of the present invention, and FIG. 4 is a flowchart showing the operation of the present invention during recognition. In the figures, 1 denotes the per-speaker voice pattern area, 2 the index section, 3 the voice pattern section, 10 the voice file, 11 the controller, 12 the software, 13 the operating device, 14 the display unit, 15 the microphone, and 16 the recognition device.

Patent applicant: Fujitsu Ltd. Sub-agent: Patent Attorney 坂 善 (name only partly legible)

[Drawings: FIG. 1 (b), (c); FIG. 2; FIG. 3; FIG. 4]

Claims

[Claims]

A speaker identification voice input method for a speech recognition device that holds standard patterns for multiple speakers, one set per speaker, and can recognize the voice of an input speaker by storing the standard pattern corresponding to that speaker in the recognition device, characterized in that each speaker's name is registered by voice in a part of each speaker's standard pattern area, and in that, when the input speaker enters his or her own name by voice, that speaker's standard pattern can be stored in the recognition device.