JP2003131696A

JP2003131696A - Voice registration authentication system

Info

Publication number: JP2003131696A
Application number: JP2001327683A
Authority: JP
Inventors: Hiromi Bessho; ひろみ別所
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2001-10-25
Filing date: 2001-10-25
Publication date: 2003-05-09

Abstract

PROBLEM TO BE SOLVED: To increase the authentication speed and authentication rate of a registration type voice authentication system by classifying a voice by its feature when the voice is registered and retrieving approximate data from data in the closest classification group according to the feature of the voice when authentication is performed. SOLUTION: The system in which an examinee registers a voice and authenticates a speaker has a means which extracts individual differences of an input voice and classifies the voice by the feature when the voice is registered, a means which registers (stores) data by classifications, a means which extracts the feature of a voice when voice authentication is performed as well as the registration, a means which authenticates which classification frame of a voice database matches the input voice for authentication according to the feature of the voice, a means which retrieves approximate data from an estimated classification frame, a means which finds an authentication result, a means which makes the examinee confirm the authentication result, and a means which performs re-retrieval by judging whether the authentication result is correct or incorrect.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、話者認証を目的と
する登録型音声認識システムにおいて、認証率の精度を
上げたり、認証結果をできるだけ早く得ることが必要と
されるような場合に利用できる音声登録認証システムに
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is used in a registration type voice recognition system for the purpose of speaker authentication, when it is necessary to improve the accuracy of an authentication rate or obtain an authentication result as soon as possible. A voice registration authentication system that can be used.

【０００２】[0002]

【従来の技術】従来の技術における登録型の音声認識の
方法では、登録時に入力する音声データにＩＤなどのラ
ベルを付けてデータベースなどに保存し、認証時にはデ
ータベースから類似したデータを検索して認証を図るも
のや、音声をテキスト化して登録し認証時には類似した
発音のテキストを検索して結果とするものがほとんどで
あった。2. Description of the Related Art In the conventional registration-type voice recognition method, voice data input at the time of registration is attached with a label such as an ID and stored in a database or the like, and at the time of authentication, similar data is searched for and authenticated. In most cases, the voice was converted to text and registered, and at the time of authentication, a text with similar pronunciation was searched and the result was obtained.

【０００３】[0003]

【発明が解決しようとする課題】本発明では、音声登録
時に音声の特徴を分類してグループ別にデータベースに
登録（保存）する。次に認証時には、入力音声を分類し
て、データベースの中の最適な分類枠（グループ）から
のみ、類似データを検索して認証結果を得る。これらの
処理によって、被験者には特別な操作を要求することな
く処理を進めることが可能となる。また、認証時に最適
な分類枠から検索を行なうことによって認証率を上げる
と同時に、検索するデータ量を減らすことにより認証結
果を得るまでのスピードを上げることができる。In the present invention, at the time of voice registration, voice characteristics are classified and registered (saved) in a database for each group. Next, at the time of authentication, the input voice is classified and similar data is searched for only from the optimum classification frame (group) in the database to obtain the authentication result. By these processes, it becomes possible to proceed with the process without requiring the subject to perform a special operation. In addition, it is possible to increase the authentication rate by performing a search from the optimum classification frame at the time of authentication, and at the same time to increase the speed of obtaining an authentication result by reducing the amount of data to be searched.

【０００４】本発明の解決しようとしている第一の課題
について以下に説明する。The first problem to be solved by the present invention will be described below.

【０００５】音声の登録時に、音声の特徴（例えば、周
波数分布の違いやピッチの違い、含まれる音や単語の長
さ）によって話者をグループ（男女・大人子供、入力キ
ーワードの長さ、含まれる音による違い、など）別に分
類しデータベースに保存する。認証時に入力する音声の
特徴から最適なグループの中から近似データを検索し、
認証結果を得ることによって認証率を上げると共に、認
証のスピードを上げることができるシステムを提供する
ことである。At the time of voice registration, a group of speakers (men and women / adult children, input keyword length, included) is included according to voice characteristics (for example, difference in frequency distribution or pitch, length of included sound or word). Different according to the sound that is played, etc.) Search for approximate data from the optimal group based on the characteristics of the voice input at the time of authentication,
It is to provide a system capable of increasing the certification rate and the certification speed by obtaining the certification result.

【０００６】本発明の解決しようとしている第二の課題
について以下に説明する。The second problem to be solved by the present invention will be described below.

【０００７】音声の登録時に、話者をグループ別に分類
しデータベースに保存する。例えば、フレックスタイム
制や深夜昼夜の三交代勤務制度等が導入されていて、か
つ入出時に音声によるセキュリティチェックを行なう病
院、工場、オフィスや研究所等を想定する。これらの施
設に出入り（出勤・退社）する人々をメンバーと呼ぶ。
メンバー各自の出退勤時刻に傾向がある場合や、あらか
じめスケジュール管理されている場合、または出勤時刻
から退社時刻が予想できる場合、音声登録時にメンバー
毎の時間を元にしたグループ分けが可能になる。よって
音声登録データを音声データベース内のグループに分類
して保存する。At the time of voice registration, speakers are classified into groups and stored in a database. For example, assume a hospital, a factory, an office, a laboratory, etc., which has introduced a flextime system or a three-shift work system for midnight and nighttime, and which performs a security check by voice when entering and exiting. People who go in and out of these facilities (go to work and leave) are called members.
If the members have a tendency to go to and from the office, the schedule is managed in advance, or the time to leave the office can be predicted from the office time, the grouping based on the time for each member becomes possible at the time of voice registration. Therefore, the voice registration data is classified into groups in the voice database and stored.

【０００８】これによりメンバー毎に出勤時刻と退社時
刻が分類できるので、それぞれ認証時には分類されたグ
ループからメンバーのデータの検索を行なう。Since the attendance time and the leaving time can be classified for each member in this way, the member data is retrieved from the classified groups at the time of authentication.

【０００９】従って、本発明の解決しようとしている第
二の課題は、認証率を上げると共に、認証のスピードを
上げることができるシステムを提供することである。Therefore, a second problem to be solved by the present invention is to provide a system capable of increasing the authentication rate and the authentication speed.

【００１０】[0010]

【課題を解決するための手段】本発明は、下記の構成を
備えることにより上記課題を解決できた。The present invention has solved the above-mentioned problems by having the following constitution.

【００１１】（１）登録型の話者認証システムにおい
て、音声登録時に音声の周波数領域や入力するキーワー
ドの長さや含まれる音の要素などにより声の特徴（個人
差）を引き出す手段と、声の特徴によって分類分けをす
る手段と、分類分けされたグループにデータを保存する
手段と、音声認証時に声の特徴を引き出す手段と、声の
特徴によってどの分類枠から登録データを検索するか決
定する手段と、決定した分類グループから近似した値を
検索する手段と、認証結果を出す手段と、認証結果を被
験者に確認する手段と、認証結果の正誤から再検索を行
なう手段、を有することを特徴とする音声登録認証シス
テム。(1) In the registration-type speaker authentication system, a means for deriving a voice feature (individual difference) based on the frequency domain of voice, the length of a keyword to be input, the included sound element, etc. at the time of voice registration; Means for classifying by characteristics, means for storing data in the classified groups, means for extracting voice characteristics during voice authentication, and means for deciding from which classification frame the registered data should be retrieved by voice characteristics And a means for retrieving an approximate value from the determined classification group, a means for outputting an authentication result, a means for confirming the authentication result with a subject, and a means for re-searching from the correctness of the authentication result. Voice registration authentication system to do.

【００１２】（２）登録型の話者認証システムにおい
て、登録時間や認証時間がスケジュール管理できる場
合、音声登録時に入力者の音声をスケジュールによって
分類する手段と、分類分けされたグループにデータを保
存する手段と、音声認証時の時間によって音声を分類す
る手段と、どの分類枠から登録データを検索するか決定
する手段と、決定した分類グループから近似した値を検
索する手段と、認証結果を出す手段と、認証結果を被験
者に確認する手段と、認証結果の正誤から再検索を行な
う手段、を有することを特徴とする音声登録認証システ
ム。(2) In the registration-type speaker authentication system, when the registration time and the authentication time can be managed by a schedule, a means for classifying the voice of the input person according to the schedule at the time of voice registration, and storing the data in the classified groups. Means, a means for classifying voices according to the time at the time of voice authentication, a means for deciding from which classification frame the registered data is searched, a means for searching an approximate value from the decided classification group, and an authentication result. A voice registration / authentication system comprising: a means, a means for confirming an authentication result with a subject, and a means for performing a re-search based on whether the authentication result is correct or incorrect.

【００１３】[0013]

【発明の実施の形態】以下に、本発明の実施の形態を図
面を用いて説明する。BEST MODE FOR CARRYING OUT THE INVENTION Embodiments of the present invention will be described below with reference to the drawings.

【００１４】＜図９のブロック図の説明＞図９は、本発
明のシステム構成を説明するブロック図である。<Description of Block Diagram of FIG. 9> FIG. 9 is a block diagram for explaining the system configuration of the present invention.

【００１５】図中の１００１は、本システムを制御する
ＣＰＵ、１００２は音声を入力する入力部であり、例え
ばマイクロフオンがこれに相当する。ここから生の音声
１００７が入力しコンピュータに取りこまれる、１００
３は記憶装置である。入力部１００２より入力された生
音声１００７はＣＰＵ１００１の制御により適当な音声
モデル１００８に変換され記憶装置１００３に記憶され
る。音声データベースなどもここに記憶される。１００
４はＣＰＵ１００１に作業領域を提供したり、アプリケ
ーションをロードしたりするＲＡＭである。１００５は
会話的に進行する処理の中で被験者の意志を入力する入
力部である。例えばマウスなどがこれに相当する。１０
０６は会話的処理の流れを表示する出力部である。例え
ばモニターなどがこれに相当する。Reference numeral 1001 in the figure is a CPU for controlling the present system, and 1002 is an input unit for inputting voice, for example, a microphone is equivalent thereto. From here, the raw voice 1007 is input and taken into the computer, 100
Reference numeral 3 is a storage device. The raw voice 1007 input from the input unit 1002 is converted into an appropriate voice model 1008 by the control of the CPU 1001 and stored in the storage device 1003. The voice database etc. are also stored here. 100
A RAM 4 provides a work area to the CPU 1001 and loads an application. Reference numeral 1005 is an input unit for inputting the subject's will in the process of proceeding interactively. For example, a mouse corresponds to this. 10
An output unit 06 displays the flow of conversational processing. For example, a monitor corresponds to this.

【００１６】（実施例１）＜図１実施例１の登録プロセスの概念図について＞図
１は、本発明実施例１の登録時のプロセスを説明する概
念図である。Example 1 <FIG. 1 Regarding Conceptual Diagram of Registration Process of Example 1> FIG. 1 is a conceptual diagram illustrating a process at the time of registration of Example 1 of the present invention.

【００１７】図中、四角はデータを表し、太線の四角は
処理を示す。白抜きの矢印（１０４等）はデータの流れ
を示し、黒い矢印（１０７等）は処理の流れを示す。一
連の処理は図９のＣＰＵ１００１によって制御される。
１０１は登録時に入力するキーワードの処理前の音声デ
ータである。１０２のマイクを通してコンピュータ１０
３に取りこまれる。生音声はコンピュータ１０３によっ
て音声モデル化される（１０５）。入力音声は特徴抽出
され（１０６）、分類されてから（１０８）、音声デー
タベース１１０に登録され（１０９）、図９の記憶装置
１００３に保存される。この模式図では、例えば周波数
分析によって女性のグループ（１１１）などに分類され
ている。In the figure, squares represent data, and bold squares represent processing. White arrows (104, etc.) show the flow of data, and black arrows (107, etc.) show the flow of processing. A series of processes is controlled by the CPU 1001 in FIG.
Reference numeral 101 is voice data before processing of the keyword input at the time of registration. Computer 10 through microphone 102
Get stuck in 3. The raw voice is voice modeled by the computer 103 (105). The input voice is feature-extracted (106), classified (108), registered in the voice database 110 (109), and stored in the storage device 1003 of FIG. In this schematic diagram, for example, the female group (111) is classified by frequency analysis.

【００１８】＜図２実施例１の認証プロセスの概念図
について＞図２は、本発明実施例１の認証時のプロセス
を説明する概念図である。<FIG. 2 Regarding Conceptual Diagram of Authentication Process of First Embodiment> FIG. 2 is a conceptual diagram illustrating a process at the time of authentication according to the first embodiment of the present invention.

【００１９】図中、四角はデータを表し、太線の四角は
処理を示す。白抜きや点線の矢印（２０４等）はデータ
の流れを示し、黒い矢印（２０７等）は処理の流れを示
す。一連の処理は図９のＣＰＵ１００１によって制御さ
れる。２０１は登録時に入力するキーワードの処理前の
音声データである。２０２のマイクを通してコンピュー
タ２０３に取りこまれる。生音声はコンピュータ２０３
によって音声モデル化される（２０５）。入力音声は特
徴抽出され（２０６）、既に音声データベース（２１
０）に分類分けされているグループの中から最適なグル
ープを推定し（２０８）、その中から認証結果を検索す
る（２０９）。図の例では、入力音声は特徴抽出によっ
て女性のものと推定され音声データベース（２１１）内
の女性のグループ（２１１）から近似したデータを検索
し、検索結果を出力する（２１２）。In the figure, squares represent data, and bold squares represent processing. White and dotted arrows (204, etc.) show the flow of data, and black arrows (207, etc.) show the flow of processing. A series of processes is controlled by the CPU 1001 in FIG. Reference numeral 201 is voice data before processing of the keyword input at the time of registration. It is taken in by the computer 203 through the microphone of 202. Raw voice is computer 203
Voice modeled by (205). The input voice is feature-extracted (206), and the voice database (21
The optimum group is estimated from the groups classified into 0) (208), and the authentication result is searched from them (209). In the example of the figure, the input voice is estimated to be female by feature extraction, and approximated data is retrieved from the female group (211) in the voice database (211) and the retrieval result is output (212).

【００２０】＜図５、図６実施例１のフローチャート
の説明＞図５、６において、本発明を実施するためのフ
ローを説明する。<Description of Flowchart of Embodiment 1 in FIGS. 5 and 6> A flow for carrying out the present invention will be described with reference to FIGS.

【００２１】＜図５実施例１の音声登録部における処
理フローの説明＞図５は音声登録処理の制御部であるコ
ンピュータ部（図１の１０３）で行なわれる処理を示
す。<Description of Process Flow in Voice Registration Unit of First Embodiment> FIG. 5 shows a process performed by a computer unit (103 in FIG. 1) which is a control unit of the voice registration process.

【００２２】ステップ（５０１）登録のために被験者は
任意のキーワードを入力する。Step (501) The subject inputs an arbitrary keyword for registration.

【００２３】ステップ（５０２）入力された生の音声は
デジタルデータにされ、音声データベースに保存するた
めに音声モデル化される。Step (502) The input raw voice is digitized and voice modeled for storage in a voice database.

【００２４】ステップ（５０３）音声の特徴を抽出す
る。例えば、図中の例のように音声の振幅レベルの累積
分布モデルや、基本周波数の統計分布モデルや、子音や
母音の含有率によるモデルや、無声音・有声音の含有率
によるモデルや、キーワードの時間的長さによるモデ
ル、等を作成する。Step (503) The feature of the voice is extracted. For example, as shown in the figure, the cumulative distribution model of the amplitude level of the voice, the statistical distribution model of the fundamental frequency, the model based on the content ratio of consonants and vowels, the model based on the content ratio of unvoiced and voiced sounds, and the keyword Create a model based on time length, etc.

【００２５】ステップ（５０４）音声データベースの内
で最適なグループに現在の音声データを分類する。例え
ば、前ステップで作成したモデルに基づいて、キーワー
ドが日本語であるかその他の言語によるものかという分
類や、キーワードが男声か女声か大人か子供による発生
によるものなのか分類する。Step (504) Classify the current voice data into the optimum group in the voice database. For example, based on the model created in the previous step, it is classified whether the keyword is Japanese or another language, and whether the keyword is male voice, female voice, adult or child.

【００２６】ステップ（５０５）分類毎に音声データベ
ースに保存する。Step (505): Save in the voice database for each classification.

【００２７】上記フローチャートにおいては、ステップ
（５０３）、（５０４）における音声の特徴抽出と分類
手段が本実施登録部の特徴的な処理となる。In the above flow chart, the voice feature extraction and classification means in steps (503) and (504) are the characteristic processes of the present registration unit.

【００２８】＜図６実施例１の音声認証部における処
理フローの説明＞図６は音声認証処理の制御部であるコ
ンピュータ部（図２の２０３）で行なわれる処理を示
す。<Description of Process Flow in Voice Authentication Unit of First Embodiment> FIG. 6 shows a process performed by the computer unit (203 in FIG. 2) which is the control unit of the voice authentication process.

【００２９】ステップ（６０１）認証のために被験者は
登録時と同じキーワードを入力する。Step (601) The subject inputs the same keyword as at the time of registration for authentication.

【００３０】ステップ（６０２）入力された生の音声は
デジタルデータにされ、音声データベースに保存するた
めに音声モデル化される。Step (602) The input raw voice is digitized and voice modeled for storage in a voice database.

【００３１】ステップ（６０３）音声の特徴を抽出す
る。Step (603) The feature of the voice is extracted.

【００３２】ステップ（６０４）現在の音声データに一
致するデータが登録されていると思われるグループを特
定する。Step (604) The group in which the data corresponding to the current voice data is considered to be registered is specified.

【００３３】ステップ（６０５）特定したグループの中
から類似した音声データを検索する。Step (605) Similar voice data is searched from the specified group.

【００３４】ステップ（６０６）最も類似したデータの
ＩＤを返り値として返す。Step (606) The ID of the most similar data is returned as a return value.

【００３５】ステップ（６０７）認証結果の正誤確認を
取る。正しければ認証処理は終了する。間違っていれ
ば、再びステップ（６０４）に戻ってグループの特定処
理を行なう。Step (607) Confirm whether the authentication result is correct. If it is correct, the authentication process ends. If it is wrong, the process returns to step (604) to perform the group identification process.

【００３６】上記フローチャートにおいては、ステップ
（６０３）、（６０４）における音声の特徴抽出と分類
グループの特定手段が本実施認証部の特徴的な処理とな
る。In the above flow chart, the voice feature extraction and classification group identification means in steps (603) and (604) are the characteristic processes of the authentication unit of the present embodiment.

【００３７】（実施例２）＜図３実施例２の登録プロセスの概念図について＞図
３は、本発明の実施例２の登録プロセスを説明する概念
図である。ここでは、ある病院の出退時刻管理を行なう
システムを例に取る。この病院では午前８時から午後４
時の「朝〜夕勤務」、午後４時から深夜午前零時の「昼
〜夜勤務」と深夜午前零時から朝午前８時までの「深夜
勤務、当直勤務」の三交代制がとられている。各勤務者
の勤務スケジュールは月毎に決められる。予め勤務者の
音声はコンピュータに登録されていて、月毎のスケジュ
ールと共に管理される。このプロセスを図３と図９で説
明する。Second Embodiment <FIG. 3 Regarding Conceptual Diagram of Registration Process of Second Embodiment> FIG. 3 is a conceptual diagram illustrating the registration process of the second embodiment of the present invention. Here, an example of a system that manages the departure time of a hospital will be described. At this hospital from 8 am to 4 pm
There are three shifts: "morning-evening work" at night, "day-night work" between 4:00 pm and midnight, and "midnight work, shift duty" between midnight and 8:00 am ing. The work schedule of each worker is determined monthly. The voice of the worker is registered in the computer in advance and is managed together with the monthly schedule. This process will be described with reference to FIGS.

【００３８】一連の処理はコンピュータ３０３内のＣＰ
Ｕ１００１によって制御され、ＲＡＭ１００４を作業領
域として使い、記憶装置１００３において勤務スケジュ
ールデータ（３１２）をロードしたり音声モデルを記憶
したりして行なう。またプロセスの進行状態を知らせる
表示や確認作業は、図９のマウスやタッチセンサーなど
の入力部１００５やモニターなどの出力部１００６を媒
介にして行なう。A series of processing is performed by the CP in the computer 303.
Controlled by the U1001, the RAM 1004 is used as a work area, the work schedule data (312) is loaded into the storage device 1003, and the voice model is stored. Further, the display or confirmation work for notifying the progress of the process is performed through an input unit 1005 such as a mouse and a touch sensor and an output unit 1006 such as a monitor in FIG.

【００３９】図中、白抜きの矢印（３０４等）はデータ
の流れを示し、黒い太線の矢印（３０７等）は処理の流
れを示す。各勤務者は３０２の音声入力装置−マイクを
通してキーワードとなる音声３０１を入力する。音声は
コンピュータ３０３に取りこまれ、ＣＰＵの制御により
音声モデル３０５に変換される。音声モデルは、予めデ
ータとして入力されている月ごとの勤務スケジュール３
１２によって特徴抽出（３０６）され（どの勤務時間帯
に属しているか判断し、時間帯毎のグループ（３１３）
に分類（３０８）される。この音声登録処理は最終的に
図９の記憶装置１００３に音声データベース（３１０）
として分類された状態（３１１）で保存される。In the figure, white arrows (304 etc.) show the flow of data, and black thick arrows (307 etc.) show the flow of processing. Each worker inputs the keyword voice 301 through the voice input device 302-microphone 302. The voice is taken into the computer 303 and converted into the voice model 305 under the control of the CPU. The voice model is a monthly work schedule 3 that has been entered in advance as data.
Feature extraction (306) by 12 (determines which working time zone it belongs to, and groups for each time period (313)
(308). This voice registration process is finally performed in the voice database (310) in the storage device 1003 of FIG.
It is saved in the state (311) classified as.

【００４０】＜図４実施例２の認証プロセスの概念図
について＞図４は、本発明の実施例２の音声認証プロセ
スを説明する概念図である。<FIG. 4 Regarding Conceptual Diagram of Authentication Process of Second Embodiment> FIG. 4 is a conceptual diagram illustrating the voice authentication process of the second embodiment of the present invention.

【００４１】一連の処理はコンピュータ４０３内のＣＰ
Ｕ１００１によって制御され、ＲＡＭ１００４を作業領
域として使い、記憶装置１００３において勤務スケジュ
ールデータ（４１２）をロードしたり音声モデルを記憶
したりして行なう。またプロセスの進行状態を知らせる
表示や確認作業は、図９のマウスやタッチセンサーなど
の入力部１００５やモニターなどの出力部１００６を媒
介にして行なう。A series of processing is performed by the CP in the computer 403.
It is controlled by the U1001 and uses the RAM 1004 as a work area to load the work schedule data (412) in the storage device 1003 and store the voice model. Further, the display or confirmation work for notifying the progress of the process is performed through an input unit 1005 such as a mouse and a touch sensor and an output unit 1006 such as a monitor in FIG.

【００４２】図中、白抜きの矢印（４０４等）はデータ
の流れを示し、黒い太線の矢印（４０７等）は処理の流
れを示す。先ず現在の時刻は深夜午前零時前後で出動す
る勤務者の時間帯は深夜／当直勤務帯であるとする。Ｂ
看護婦が病院に出勤してきた場合を例にして説明する。In the figure, a white arrow (404 or the like) indicates a data flow, and a black thick line arrow (407 or the like) indicates a processing flow. First, it is assumed that the present time is a midnight / shift duty zone for workers who are dispatched around midnight. B
A case where a nurse comes to the hospital will be described as an example.

【００４３】Ｂ看護婦は登録時に入力したのと同じキー
ワードを４０２の入力装置−マイクを通してコンピュー
タ４０３に入力する。音声４０１は、コンピュータ４０
３の処理によって音声モデル４０５に変換される。現在
の時刻を午後４時前後、すなわち深夜／当直勤務帯の出
動時刻であるとする（４１２、４０６、４０８）。ここ
で任意の人物が出勤時の音声入力を行なった場合、認証
処理は、音声データベース（４１０）の中の深夜／当直
勤務帯グループに登録されているデータの中から類似デ
ータを検索する（４０９）。つまりＢ看護婦が予定通り
出勤したとすると深夜／当直勤務帯グループ内（４１
３）のＢ看護婦の登録データ（４１１）を探し当てるこ
とになる。The nurse B inputs the same keyword as that input at the time of registration to the computer 403 through the input device-microphone of 402. The voice 401 is the computer 40
It is converted into the voice model 405 by the process of 3. It is assumed that the current time is around 4:00 pm, that is, the dispatch time of the midnight / shift duty zone (412, 406, 408). If an arbitrary person inputs a voice at work here, the authentication process searches the voice database (410) for similar data from the data registered in the late night / shift duty zone group (409). ). In other words, if nurse B goes to work as scheduled, it will be in the midnight / shift duty group (41
The registration data (411) of nurse B in 3) will be found.

【００４４】＜図７、図８実施例２のフローチャート
の説明＞図７、８において、本発明を実施するためのフ
ローを説明する。<Explanation of Flowchart of Embodiment 2 in FIGS. 7 and 8> With reference to FIGS. 7 and 8, a flow for implementing the present invention will be described.

【００４５】＜図７実施例２の音声登録部における処
理の流れの説明＞図７は音声登録処理の制御部であるコ
ンピュータ部（図３の３０３）で行なわれる処理を示
す。<FIG. 7: Description of Process Flow in Voice Registration Unit of Embodiment 2> FIG. 7 shows the process performed by the computer unit (303 in FIG. 3) which is the control unit of the voice registration process.

【００４６】ステップ（７０１）月ごとの勤務者の勤務
時間帯スケジュールを入力する。Step (701) The work hour schedule of the worker for each month is input.

【００４７】ステップ（７０２）登録のために勤務者は
任意のキーワードを発声して入力する。Step (702) For registration, the employee speaks and inputs an arbitrary keyword.

【００４８】ステップ（７０３）入力された生の音声は
デジタルデータにされ、音声データベースに保存するた
めに音声モデル化される。Step (703) The input raw voice is digitized and voice modeled for storage in a voice database.

【００４９】ステップ（７０４）勤務時間帯スケジュー
ルを参照して、音声入力者の分類グループを特定する。Step (704) The classification group of the voice input person is specified by referring to the work hours schedule.

【００５０】ステップ（７０５）音声モデルを特定され
たグループに分類する。Step (705) classify the voice models into the specified groups.

【００５１】ステップ（７０６）音声データベースに保
存する。Step (706) Save to voice database.

【００５２】上記フローチャートにおいては、ステップ
（７０１）、（７０４）が本実施例−音声登録部の特徴
的な処理となる。In the above flow chart, steps (701) and (704) are the characteristic processing of this embodiment-voice registration section.

【００５３】＜図８実施例２の音声認証部における処
理の流れの説明＞図８は音声認証処理の制御部であるコ
ンピュータ部（図４の４０３）で行なわれる処理を示
す。<Description of Process Flow in Voice Authentication Unit of Second Embodiment> FIG. 8 shows a process performed by the computer unit (403 in FIG. 4) which is the control unit of the voice authentication process.

【００５４】ステップ（８０１）病院に出勤する勤務者
は、認証のために登録時と同じキーワードを音声入力す
る。Step (801) The worker who goes to work at the hospital inputs the same keyword as at the time of registration by voice input for authentication.

【００５５】ステップ（８０２）入力された生音声はデ
ジタルデータにされ、音声データベースに保存するため
に音声モデル化される。Step (802) The input raw voice is digitized and voice modeled for storage in a voice database.

【００５６】ステップ（８０３）出勤した勤務者の音声
入力時刻をゲットする。Step (803) Get the voice input time of the worker who went to work.

【００５７】ステップ（８０４）前ステップで得た時刻
よりどの勤務時間帯の出動時刻に該当するか推定する。Step (804) From the time obtained in the previous step, it is estimated which working time zone the dispatch time corresponds to.

【００５８】ステップ（８０５）前ステップで推定した
勤務時間帯のグループから音声モデルと類似したデータ
を検索する。Step (805) Data similar to the voice model is retrieved from the group of working hours estimated in the previous step.

【００５９】ステップ（８０６）最も類似した検索結果
を得る。Step (806) Obtain the most similar search result.

【００６０】ステップ（８０７）被験者に対して認証結
果の正誤確認を取る。正しければ認証処理は終了する。
間違っていれば、次のステップ（８０８）に進む。Step (807) The subject is checked for correctness of the authentication result. If it is correct, the authentication process ends.
If wrong, go to the next step (808).

【００６１】ステップ（８０８）ひとつの勤務時間帯の
分類グループ内ですべて検索されたか確認する。グルー
プ内のすべてのデータが検索されたのであれば、ステッ
プ（８０９）へ進む。グループ内にデータが残っていれ
ば、次の類似候補の検索処理を続けるためにステップ
（８０５）へ戻る。Step (808) It is confirmed whether or not all items have been searched in the classification group of one working hours. If all the data in the group have been retrieved, the process proceeds to step (809). If data remains in the group, the process returns to step (805) to continue the search process for the next similar candidate.

【００６２】ステップ（８０９）ひとつのグループ内で
の検索をすべて終了した場合、このステップでは次のグ
ループの候補を決定したり例外についての処理を行な
う。Step (809) When all the searches within one group have been completed, in this step the candidates for the next group are determined and exceptions are processed.

【００６３】上記フローチャートにおいては、ステップ
（８０３）、（８０４）に本実施例の認証処理の特徴が
ある。In the above flowchart, steps (803) and (804) are characteristic of the authentication process of this embodiment.

【００６４】[0064]

【発明の効果】本発明では、任意のキーワードを発声し
て登録を行なうときに、音声の特徴（個人差・性格・属
性）を抽出して特徴別に分類しデータベースに登録す
る。次に登録時と同じキーワードを音声入力して認証を
行なう場合、再び音声の特徴（個人差・性格・属性）を
抽出して分類し、データベースのどの分類枠に適合して
いるか検証する。最適な分類グループからのみ類似デー
タを検索し、認証結果を得る。これらのプロセスを行な
うことによる利点は、データの入力速度が速くなる、目
や手を使う比率が下がるので作業の操作性が増す、デー
タ入力に熟練を必要としない、などの音声認証特有の利
点に加え、膨大なデータベース内のデータを絞り込んで
検索するために、全体検索よりも類似したデータの数が
減るので信頼性があがる、検索スピードが増す、ことな
どである。According to the present invention, when an arbitrary keyword is uttered and registered, voice characteristics (individual difference / personality / attribute) are extracted, classified according to the characteristics, and registered in the database. Next, when performing the authentication by voice inputting the same keyword as at the time of registration, the voice feature (individual difference / personality / attribute) is extracted and classified again to verify which classification frame of the database is suitable. Similar data is searched only from the optimum classification group to obtain the authentication result. The advantages of performing these processes are the advantages specific to voice authentication, such as faster data entry, less occupancy of eyes and hands, increased workability, and less skill required for data entry. In addition, since the data in a huge database is narrowed down and searched, the number of similar data is reduced as compared with the whole search, so that the reliability is increased and the search speed is increased.

[Brief description of drawings]

【図１】実施例１の登録プロセスの概念図FIG. 1 is a conceptual diagram of a registration process according to a first embodiment.

【図２】実施例１の認証プロセスの概念図FIG. 2 is a conceptual diagram of an authentication process according to the first embodiment.

【図３】実施例２の登録プロセスの概念図FIG. 3 is a conceptual diagram of a registration process according to the second embodiment.

【図４】実施例２の認証プロセスの概念図FIG. 4 is a conceptual diagram of an authentication process according to the second embodiment.

【図５】実施例１の登録プロセスを説明するフローチ
ャートFIG. 5 is a flowchart illustrating a registration process according to the first embodiment.

【図６】実施例１の認証プロセスを説明するフローチ
ャートFIG. 6 is a flowchart illustrating an authentication process according to the first embodiment.

【図７】実施例２の登録プロセスを説明するフローチ
ャートFIG. 7 is a flowchart illustrating a registration process according to the second embodiment.

【図８】実施例２の認証プロセスを説明するフローチ
ャートFIG. 8 is a flowchart illustrating an authentication process according to the second embodiment.

【図９】本発明のシステム構成を説明するブロック図FIG. 9 is a block diagram illustrating the system configuration of the present invention.

[Explanation of symbols]

１０１任意のキーワードの生音声を示すブロック図１０２マイクなど登録用の音声入力装置の概念図１０３登録用コンピュータ部を示すブロック図１０４データの流れ１０５音声モデル化された音声データを示すブロック
図１０６処理フローの特徴抽出処理のステップ１０７処理フローの流れの方向１０８処理フローの分類処理のステップ１０９処理フローの登録処理のステップ１１０音声データベース１１１特徴別に分類されたグループ２０１登録時と同じキーワードの生音声を示すブロッ
ク図２０２マイクなど認証用の音声入力装置の概念図２０３認証用コンピュータ部を示すブロック図２０４データの流れ２０５音声モデル化された音声データを示すブロック
図２０６処理フローの特徴抽出処理のステップ２０７処理フローの流れの方向２０８処理フローの分類処理のステップ２０９処理フローの登録処理のステップ２１０音声データベース２１１特徴別に分類されたグループ３０１任意のキーワードの生音声を示すブロック図３０２マイクなど登録用の音声入力装置の概念図３０３登録用コンピュータ部を示すブロック図３０４データの流れ３０５音声モデル化された音声データを示すブロック
図３０６処理フローの特徴抽出処理のステップ３０７処理フローの流れの方向３０８処理フローの分類処理のステップ３０９処理フローの登録処理のステップ３１０音声データベース３１１特徴別に分類されたグループ３１２病院のある月の勤務スケジュールの概念図３１３各勤務時間帯毎に分類した場合の概念図４０１Ｂ看護婦の登録時と同じキーワードの生音声を
示すブロック図４０２マイクなど認証用の音声入力装置の概念図４０３認証用コンピュータ部を示すブロック図４０４データの流れ４０５音声モデル化された音声データを示すブロック
図４０６処理フローの特徴抽出処理のステップ４０７処理フローの流れの方向４０８処理フローの分類処理のステップ４０９処理フローの登録処理のステップ４１０音声データベース４１１特徴別に分類されたグループ４１２現在の勤務帯４１３分類されたグループ１００１ＣＰＵ１００２音声入力部１００３記憶装置１００４ＲＡＭ１００５入力部１００６出力部１００７生音声データ１００８音声モデル101 Block diagram showing raw voice of arbitrary keyword 102 Conceptual diagram of voice input device for registration such as microphone 103 Block diagram showing registration computer section 104 Data flow 105 Block diagram showing voice modeled voice data 106 Processing Step 107 of flow characteristic extraction processing 108 Direction of flow of processing flow 108 Step 109 of processing flow classification processing Step 110 of processing flow registration processing Voice database 111 Group 201 classified by characteristics Raw voice of the same keyword as at the time of registration Block diagram shown 202 Conceptual diagram of voice input device for authentication such as microphone 203 Block diagram showing computer part for authentication 204 Data flow 205 Block diagram showing voice modeled voice data 206 Step 207 of feature extraction process of process flow Processing flow Flow direction 208 Step 209 of processing flow classification processing 210 of processing flow registration processing 210 Voice database 211 Group 301 classified by feature Block diagram showing raw voice of arbitrary keyword 302 Voice input device for registration such as microphone Conceptual diagram 303 Block diagram showing computer unit for registration 304 Flow of data 305 Block diagram showing voice data modeled as voice 306 Step 307 of feature extraction processing of processing flow Direction of flow 308 of processing flow Processing classification of processing flow Step 309 of processing flow Registration step 310 Voice database 311 Group 312 classified by characteristics Conceptual diagram of work schedule for a month with a hospital 313 Conceptual diagram when classified by each working hours 401 B Registration of nurse Kiwa same as time Block diagram showing a raw voice of a voice 402 Conceptual diagram of a voice input device for authentication such as a microphone 403 Block diagram showing an authentication computer section 404 Data flow 405 Block diagram showing voice data modeled as voice 406 Features of processing flow Step 407 of extraction processing Direction of flow 408 of processing flow Step 409 of processing flow classification processing Step 410 of processing flow registration processing 410 Voice database 411 Group 412 classified by characteristics Current duty zone 413 Classified group 1001 CPU 1002 Voice input unit 1003 Storage device 1004 RAM 1005 Input unit 1006 Output unit 1007 Raw voice data 1008 Voice model

Claims

[Claims]

1. In a registration-type speaker authentication system, a voice characteristic (individual difference) depending on a frequency range of a voice, a length of a keyword to be input, a contained sound element, etc. at the time of voice registration.
To extract data, to classify by voice characteristics, to store data in the classified groups, to extract voice characteristics during voice authentication, and from which classification frame to register data based on voice characteristics. To determine whether or not to search, a method to search for an approximate value from the determined classification group, a method to output the authentication result, a method to confirm the authentication result to the subject, and a method to perform a re-search based on the correctness of the authentication result. A voice registration / authentication system, comprising:

2. In a registration-type speaker authentication system, when registration time or authentication time can be managed by a schedule, a means for classifying a voice of an input person according to a schedule at the time of voice registration, and storing data in the classified group. Means, a means for classifying voices according to the time at the time of voice authentication, a means for deciding from which classification frame the registered data is to be searched, a means for searching an approximate value from the decided classification group, and a means for issuing an authentication result. And a means for confirming the authentication result with the subject and a means for re-searching based on whether the authentication result is correct or incorrect, the voice registration authentication system.