JP2002251199A

JP2002251199A - Voice input information processor

Info

Publication number: JP2002251199A
Application number: JP2001051564A
Authority: JP
Inventors: Akira Suzuki; 明鈴木
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2001-02-27
Filing date: 2001-02-27
Publication date: 2002-09-06

Abstract

PROBLEM TO BE SOLVED: To actualize a voice input information processor which prevents input ted voice information itself or its voiceprint from leaking out. SOLUTION: This voice input information processor has a voice input means 11gh equipped with a voice detecting means 11g for detecting an input voice, a sound signal generating means which generates a sound signal actively muting a voice to the outside by generating an opposite-phase sound from the detected voice or a sound signal for actively changing the voiceprint by varying the frequency spectrum of the input voice on the basis of a voice mode set in advance, and a sound output means 11ij which outputs the sound signal; and a signal regarding the detected input voice is transmitted to a CPU 11a to carry out an indicated process.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、情報処理装置の音
声入力装置による認証、操作指示を行う音声入力情報処
理装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice input information processing device for performing authentication and operation instructions by a voice input device of an information processing device.

【０００２】[0002]

【従来の技術】現在、情報処理装置（コンピュータ）に
各種処理を指示するのに音声入力を使う方式が提案され
ている。また、人間の声紋の特徴の個人差を暗証情報と
して認証に使う情報処理装置がある。特開平９−１４６
５６３号公報等はこの声紋を使用した例である。ところ
が、外部の人間が、認証時に入力された人間の声紋を他
の音声記録装置により録音して暗証情報を盗み取ること
が可能である。2. Description of the Related Art At present, there has been proposed a system using voice input to instruct an information processing apparatus (computer) to perform various processes. There is also an information processing apparatus that uses individual differences in the characteristics of human voice prints as authentication information for authentication. JP-A-9-146
No. 563 is an example of using this voiceprint. However, it is possible for an external person to record a human voiceprint input at the time of authentication with another voice recording device and steal the password information.

【０００３】また、音声または騒音を減少させる能動消
音装置も従来提案されている。しかしながら、これらを
組み合わせた方式は提案されていない。ところが、音声
による認証はその音声を盗み取ることが容易で何らかの
防御策が必要である。これに対して、特開平０８−０８
４１９０号公報は、電話による認証の場合、認証時にプ
ッシュダイヤル回線にダミーのプッシュフォン信号を重
畳することにより単純な録音による盗聴漏洩を防ぐ方式
が提案されているが、これは、決まったプッシュフォン
信号を重畳するものであり、簡単に除去可能である。ま
た、電話回線のみに適用できるものである。[0003] Active silencers for reducing voice or noise have also been proposed. However, a method combining these has not been proposed. However, voice authentication is easy to steal the voice and requires some protection. In contrast, Japanese Patent Application Laid-Open No. 08-08
Japanese Patent No. 4190 proposes a method of preventing eavesdropping leakage by simple recording by superimposing a dummy push-phone signal on a push-dial line at the time of authentication in the case of telephone authentication. The signal is superimposed and can be easily removed. Also, it can be applied only to telephone lines.

【０００４】また、能動消音手段を用いて入力音声以外
の音（雑音）を低減する特開平１０−０９２１２１号公
報のような方式も提案されているが、発音者の音声を外
部に漏れない様に能動消音手段を用いた例は開示されて
いない。また、特開平７−１７０５８７号公報のよう
に、発音者の発声をマイクロフォンで検出し、検出した
信号を基に能動的に消音する音声入力装置が開示されて
いるが、これは情報処理装置の認証、処理指示に関する
ものではなく、単に、マイクロフォンに入力された音声
＋音を能動的に打ち消して消音する能動消音装置であ
る。また、従来は、情報処理装置の音声入力による認
証、及び処理指示は外部にその音声が漏洩するものであ
る。[0004] A system such as Japanese Unexamined Patent Application Publication No. 10-092121 has also been proposed in which sound (noise) other than the input sound is reduced by using active muffling means. Does not disclose an example using an active silencer. Further, as disclosed in Japanese Patent Application Laid-Open No. 7-170587, there is disclosed a voice input device which detects the utterance of a sounder with a microphone and actively mutes the sound based on the detected signal. This is an active silencer that does not relate to authentication and processing instructions, but simply cancels sound by actively canceling voice + sound input to a microphone. Conventionally, the authentication and the processing instruction by the voice input of the information processing apparatus leak the voice to the outside.

【０００５】[0005]

【発明が解決しようとする課題】現在、情報処理装置に
各種処理を指示するために、音声入力を使う方式が提案
されている。また、人間の声紋の特徴の個人差を暗証情
報として認証に使う情報処理装置がある。ところが、外
部の人間が、認証時に入力された人間の声紋を他の音声
記録装置により録音することで、暗証情報を盗み取るこ
とが可能である。本発明は、かかる実情に鑑みてなされ
たものであり、本発明に係る音声入力情報処理装置の第
一の目的は、能動的に消音する機能を備えた音声入力手
段により、外部に音声を聞こえなくする方式や、能動的
に声紋を変更する機能を備えた音声入力手段により、音
声による認証時における暗証情報の漏洩を防ぐことにあ
る。また、第二の目的は、通常のオフィスのように多数
の操作者がそれぞれ音声入力による操作を行う場合、隣
の操作者の音声による誤動作を防止すると共にオフィス
の静穏化を図ることにある。即ち、本発明に係る各請求
項における目的は、次の通りである。At present, a system using voice input has been proposed to instruct an information processing apparatus to perform various processes. There is also an information processing apparatus that uses individual differences in the characteristics of human voice prints as authentication information for authentication. However, it is possible for an external person to steal personal identification information by recording the human voiceprint input at the time of authentication by another voice recording device. The present invention has been made in view of such circumstances, and a first object of a voice input information processing apparatus according to the present invention is to provide a voice input unit having a function of actively silencing a voice so that a voice can be externally heard. An object of the present invention is to prevent leakage of personal identification information at the time of authentication by voice by means of a method of eliminating voice information or voice input means having a function of actively changing a voiceprint. A second object of the present invention is to prevent a malfunction caused by a voice of an adjacent operator and quiet the office when a large number of operators perform operations by voice input as in a normal office. That is, the objects in the claims according to the present invention are as follows.

【０００６】（請求項１における目的）操作者の音声を
能動的に打ち消して外部への音声の漏洩を防ぐ。また、
周辺に音声を聞こえなくして、オフィスの静穏化を図
る。[0006] The object of the present invention is to actively cancel the voice of the operator to prevent leakage of the voice to the outside. Also,
Make the office quieter by not hearing any sounds around.

【０００７】（請求項２における目的）操作者の音声の
声紋を能動的に変更して、音声情報が外部に聞こえるよ
うな場合であっても、外部に声紋が漏洩することを防
ぐ。[0007] The voiceprint of the operator's voice is actively changed to prevent the voiceprint from leaking to the outside even if the voice information can be heard outside.

【０００８】（請求項３における目的）外部に音声また
は声紋が漏洩せずに認証できる音声認証を実現する。A third object of the present invention is to realize voice authentication which can perform authentication without leaking a voice or a voiceprint to the outside.

【０００９】（請求項４における目的）認証された音声
と同じ特徴の声紋で情報処理装置の操作指示を行うの
で、複数の発音者がいても、特定の操作者のみの指示に
より操作される音声入力手段を備えた情報処理装置を実
現する。[0009] (Object of Claim 4) Since the operation instruction of the information processing apparatus is performed by a voiceprint having the same characteristics as the authenticated voice, even if there are a plurality of sounders, the voice operated by the instruction of only a specific operator is provided. An information processing device having input means is realized.

【００１０】（請求項５における目的）認証された音声
と同じ特徴の声紋で情報処理装置の操作指示を行うの
で、複数の発音者がいても、特定の操作者のみの指示に
より操作される音声入力手段を備えた情報処理装置を実
現し、かつ、認証時における暗証情報の漏洩を防ぐ。[0010] (Object of claim 5) Since the operation instruction of the information processing apparatus is performed by using a voiceprint having the same characteristics as the authenticated voice, even if there are a plurality of sounders, the voice operated by the instruction of only a specific operator is provided. An information processing apparatus including an input unit is realized, and leakage of personal identification information at the time of authentication is prevented.

【００１１】（請求項６における目的）認証するための
発音情報を情報処理装置が表示または発音し、それを復
唱した音声の声紋情報を基に認証することにより、認証
の為の暗証情報を認証者が記憶する必要が無く、また、
外部に漏洩せずに、声紋により確実に認証を行う。(Purpose of Claim 6) The information processing device displays or sounds pronunciation information for authentication, and authenticates the password information for the authentication by authenticating based on the voiceprint information of the voice reproduced therefrom. Need not be remembered,
Authentication is assured by voiceprint without leaking to the outside.

【００１２】[0012]

【課題を解決するための手段】請求項１に記載の発明
は、音声を検出する音声検出手段と、検出された前記音
声から逆位相の音を発生する音響発生手段により、能動
的に前記音声を消音する能動消音機能を持つ音声入力手
段を有することにより、前記音声入力手段が、検出され
た前記音声に関する信号を情報処理装置に伝達すると共
に、入力される前記音声が外部に漏れることを防止する
音声入力情報処理装置とすることを特徴とするものであ
る。According to a first aspect of the present invention, there is provided an audio processing apparatus comprising: a voice detecting means for detecting a voice; and a sound generating means for generating a sound having an opposite phase from the detected voice. The voice input means has a voice input means having an active muffling function to mute the sound, and the voice input means transmits a signal related to the detected voice to the information processing device and prevents the input voice from leaking outside. The voice input information processing device performs the following.

【００１３】請求項２に記載の発明は、音声を検出する
音声検出手段と、検出された前記音声に基づいて音を発
生する音響発生手段により、能動的に声紋を変更する声
紋変更機能を持つ音声入力手段を有することにより、前
記音声入力手段が、検出された前記音声に関する信号を
情報処理装置に伝達すると共に、外部には異なる声紋の
音声が聞こえるようにする音声入力情報処理装置とする
ことを特徴とするものである。According to a second aspect of the present invention, there is provided a voiceprint changing function of actively changing a voiceprint by voice detecting means for detecting voice and sound generating means for generating a sound based on the detected voice. By providing a voice input unit, the voice input unit transmits a signal related to the detected voice to an information processing device, and also allows a voice of a different voiceprint to be heard outside. It is characterized by the following.

【００１４】請求項３に記載の発明は、請求項１または
２に記載の音声入力情報処理装置において、前記音声入
力手段が、音声の声紋の特徴を抽出することにより認証
を行う音声入力情報処理装置とすることを特徴とするも
のである。According to a third aspect of the present invention, in the voice input information processing apparatus according to the first or second aspect, the voice input means performs authentication by extracting a voiceprint feature of the voice. It is characterized by being a device.

【００１５】請求項４に記載の発明は、音声を検出する
音声検出手段と、検出された前記音声を基に声紋を検出
する声紋検出手段とを備えた音声入力情報処理装置にお
いて、認証された前記声紋の特徴と同様の特徴を持つ声
紋の音声による操作命令のみを音声認識し、操作を受け
付ける音声入力情報処理装置とすることを特徴とするも
のである。According to a fourth aspect of the present invention, in the voice input information processing apparatus provided with voice detection means for detecting voice and voiceprint detection means for detecting voiceprint based on the detected voice, the authentication is performed. A voice input information processing apparatus that recognizes only an operation command by voice of a voiceprint having the same characteristics as the voiceprint and receives an operation.

【００１６】請求項５に記載の発明は、請求項３に記載
の音声入力情報処理装置において、音声を検出する音声
検出手段と、検出された前記音声を基に声紋を検出する
声紋検出手段とを備えた音声入力情報処理装置におい
て、認証された前記声紋の特徴と同様の特徴を持つ声紋
の音声による操作命令のみを音声認識し、操作を受け付
ける音声入力情報処理装置とすることを特徴とするもの
である。According to a fifth aspect of the present invention, in the voice input information processing apparatus according to the third aspect, a voice detecting means for detecting a voice, and a voiceprint detecting means for detecting a voiceprint based on the detected voice. Wherein the voice input information processing apparatus recognizes only an operation command by voice of a voice print having the same characteristics as the authenticated voice print, and receives the operation. Things.

【００１７】請求項６に記載の発明は、請求項３に記載
の音声入力情報処理装置において、認証時に、前記音声
入力情報処理装置が認証用の発音情報を表示または発音
し、認証者は前記発音情報を発音し、発音された音声の
声紋を基に認証を行う音声入力情報処理装置とすること
を特徴とするものである。According to a sixth aspect of the present invention, in the voice input information processing apparatus according to the third aspect, at the time of authentication, the voice input information processing apparatus displays or sounds pronunciation information for authentication, and A speech input information processing device that pronounces pronunciation information and performs authentication based on a voiceprint of the pronounced speech.

【００１８】[0018]

【発明の実施の形態】本発明は、大きく分けて、能動消
音機能または能動変更機能の２種類を備えた音声入力手
段と、これを認証し、情報処理装置の処理操作指示とす
る手段に関するものである。この能動消音機能または能
動変更機能の２種類を備えた音声入力手段と、情報処理
装置における認証、または操作指示手段とを組み合わせ
て使用する構成を採用することにより適切な認証、また
は操作指示が実現する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention broadly relates to a voice input means having two types of an active mute function and an active change function, and a means for authenticating the voice input means and giving a processing operation instruction of an information processing apparatus. It is. Appropriate authentication or operation instruction is realized by adopting a configuration in which a voice input unit having two types of active silence function or active change function and authentication or operation instruction unit in the information processing apparatus are used in combination. I do.

【００１９】まず、２種類の音声入力手段について説明
すると、双方とも能動的に発音者の音声を変更するもの
であり、図１のような構成により実現できる。ここに、
図１は、本発明に係る音声入力情報処理装置における音
声入力手段の一実施形態を示す概念図である。まず、音
声検出手段１で発音者１０の音声を検出し、検出された
音声信号を音響信号作成手段３と、音声伝達手段４とに
入力する。音響信号作成手段３は、入力された音声信号
を打ち消すような又は音声信号を変更するような干渉音
信号を作成し、作成された干渉音信号を音響発生手段２
に伝達し、音響発生手段２は干渉音を発生する。一般的
に、音声を含む音は空気の振動による波であり、図３に
示すように、音声に対する逆位相の音波を干渉させる
と、干渉された合成音波（合成音）は干渉音により打ち
消される。基本的に、能動消音装置は、この原理に基づ
いて構成されている。First, two types of voice input means will be described. Both of them actively change the voice of the speaker and can be realized by the configuration as shown in FIG. here,
FIG. 1 is a conceptual diagram showing one embodiment of a voice input unit in a voice input information processing device according to the present invention. First, the sound of the sounder 10 is detected by the sound detecting means 1, and the detected sound signal is input to the sound signal creating means 3 and the sound transmitting means 4. The sound signal creating means 3 creates an interference sound signal for canceling the input audio signal or changing the audio signal, and outputs the created interference sound signal to the sound generating means 2.
And the sound generating means 2 generates an interference sound. Generally, sound including sound is a wave due to the vibration of air, and as shown in FIG. 3, when a sound wave having an opposite phase to the sound is caused to interfere, the interfering synthetic sound wave (synthetic sound) is canceled by the interfering sound. . Basically, the active silencer is configured based on this principle.

【００２０】図１に示す構成例は、１対の音声検出手段
１と音響発生手段２とを備えた場合であるが、図２に示
すように、多数対の音声検出手段１，１′，１″と音響
発生手段２，２′，２″とを備えた方式を用いても良
い。この例では、音声検出手段１，１′，１″は、発音
者１０側に指向性を持たせ、一方、音響発生手段２，
２′，２″は、発音者１０とは反対側の外側に指向性を
持たせることにより、多数の音響発生手段２，２′，
２″の内側即ち発音者１０側では、発音者１０の音声が
聞き取れるが、多数の音響発生手段２，２′，２″の外
側即ち発音者１０とは反対側では、音声を打ち消して、
発音者１０の音声の漏洩を防ぐことが出来る。The configuration example shown in FIG. 1 is a case where a pair of voice detection means 1 and sound generation means 2 are provided. As shown in FIG. 2, a large number of pairs of voice detection means 1, 1 ', 1 "and sound generating means 2, 2 ', 2" may be used. In this example, the sound detection means 1, 1 ', 1 "give the sounder 10 a directivity, while the sound generation means 2,
2 ′, 2 ″ are provided with directivity on the outside on the side opposite to the sounder 10 so that many sound generating means 2, 2 ′,
On the inside of 2 ", that is, on the side of the sounder 10, the sound of the sounder 10 can be heard, but on the outside of the many sound generating means 2, 2 ', 2", that is, on the opposite side of the sounder 10, the sound is canceled.
Leakage of the sound of the sounder 10 can be prevented.

【００２１】前述の説明は音声を打ち消す方式を説明し
たが、次に、声紋を変更する方式について述べる。人間
の音声の声紋とは、一般的に、発生音の周波数スペクト
ルの分布が個人によって異なることや、母音，子音の強
弱や間隔に着目して個人を認識するものである。この発
生音の周波数スペクトルの分布は、人間の声帯、口腔の
大きさ、口の開閉動作などの個人差に起因するものであ
る。そこで、特に、個人差があり、且つ、人間が意識的
に変更しにくい声帯、口腔の大きさに起因する周波数ス
ペクトル部分を打ち消す、または変更することにより、
音声としての情報を残したままで声紋を変更することが
できる。つまり、たとえば、発音者の原音声の周波数ス
ペクトルが図４（Ａ）であるような場合、該原音声の一
部の周波数スペクトルを変更させるようにして、干渉後
の変更された合成音声の周波数スペクトルが図４（Ｂ）
になるようにすれば、声紋を変更することができる。In the above description, the method of canceling the voice has been described. Next, the method of changing the voiceprint will be described. Generally, a voiceprint of a human voice recognizes an individual by focusing on the fact that the distribution of the frequency spectrum of a generated sound differs between individuals, and on the strength and interval of vowels and consonants. The distribution of the frequency spectrum of the generated sound is caused by individual differences such as the vocal cords of the human, the size of the oral cavity, and the opening and closing operations of the mouth. Therefore, in particular, there are individual differences, and vocal cords that are difficult for humans to consciously change, by canceling or changing the frequency spectrum portion caused by the size of the oral cavity,
The voiceprint can be changed while leaving the information as voice. That is, for example, when the frequency spectrum of the original voice of the sounder is as shown in FIG. 4A, the frequency spectrum of a part of the original voice is changed so that the frequency of the changed synthesized voice after the interference is changed. The spectrum is as shown in FIG.
, The voiceprint can be changed.

【００２２】図４（Ｂ）に示す例では、一部の周波数ス
ペクトル帯を打ち消すような例を挙げたが、複数の周波
数スペクトル帯を変更するようにしても良いし、あるい
は、時間的に変更するようにしても良い。この場合、変
更する周波数スペクトル帯を分散させることにすれば、
声紋変更による音声の変化が人間の耳には分かりにくく
なるので、かかる声紋変更を行う装置を用いているとの
違和感が少なく、また、変更する周波数が複数の周波数
スペクトル帯に及んでいる為、外部に聞こえる合成音声
からは声紋を分析しにくくすることができる。In the example shown in FIG. 4B, an example is shown in which some frequency spectrum bands are canceled. However, a plurality of frequency spectrum bands may be changed, or the frequency spectrum band may be changed over time. You may do it. In this case, if the frequency spectrum band to be changed is dispersed,
Since the voice change due to the voiceprint change becomes difficult for the human ear to understand, there is little discomfort with using such a voiceprint change device, and since the frequency to be changed extends over a plurality of frequency spectrum bands, It is possible to make it difficult to analyze a voiceprint from a synthesized voice that can be heard outside.

【００２３】次に、前述のごとき能動的な消音機能また
は能動的な声紋変更機能を、音声入力情報処理装置の音
声入力手段として適用する基本的な構成を、図５に示
す。ここで、能動消音（変更）機能付き音声入力手段５
は、前述の図１で示した音声検出手段１，音響発生手段
２，音響信号作成手段３，音声伝達手段４を有するもの
であり、音響発生手段２は、前述のごとく、能動的な消
音機能または能動的な声紋変更機能を有しているもので
ある。Next, FIG. 5 shows a basic configuration in which the active mute function or the active voiceprint changing function as described above is applied as a voice input means of a voice input information processing apparatus. Here, voice input means 5 with an active mute (change) function
Has the sound detecting means 1, the sound generating means 2, the sound signal creating means 3, and the sound transmitting means 4 shown in FIG. 1, and the sound generating means 2 has an active silencing function as described above. Alternatively, it has an active voiceprint changing function.

【００２４】まず、能動消音機能付きの音声入力手段５
の場合、発音者が発音した音声は、外部に漏れることも
なく、音声入力手段５が、作成した音声信号を、情報処
理装置６に伝達する。情報処理装置６は、伝達された音
声信号を基に、各種の動作を行う。一方、能動声紋変更
機能付きの音声入力手段５の場合、発音者が発音した原
音声から変更された声紋の音声は外部に漏れるが、発音
者が発音した原音声は外部には漏れることもなく、音声
入力手段５が、作成した音声信号を、情報処理装置６に
伝達する。情報処理装置６は、伝達された音声信号を基
に、各種の動作を行う。First, voice input means 5 having an active silence function
In the case of, the sound produced by the sounder does not leak to the outside, and the speech input means 5 transmits the created speech signal to the information processing device 6. The information processing device 6 performs various operations based on the transmitted audio signal. On the other hand, in the case of the voice input means 5 with the active voiceprint changing function, the voice of the voiceprint changed from the original voice pronounced by the speaker leaks to the outside, but the original voice pronounced by the speaker does not leak outside. The voice input means 5 transmits the generated voice signal to the information processing device 6. The information processing device 6 performs various operations based on the transmitted audio signal.

【００２５】現在の情報処理装置は、セキュリティ確保
の為に、該情報処理装置の使用者を明確にして、使用者
が該情報処理装置に対して命令できる範囲を制限してい
る。一般的には、図６の認証動作フローチャートに示す
ように、情報処理装置に対して、使用者が各種動作をさ
せる前に、該情報処理装置に対して使用者の身分を明ら
かにする認証動作を行って、認証が得られた後、初め
て、該使用者は、該使用者の身分の権限範囲内で各種動
作を行うことが出来る。In order to ensure security, the current information processing device clarifies the user of the information processing device and limits the range in which the user can issue an instruction to the information processing device. In general, as shown in the authentication operation flowchart of FIG. 6, an authentication operation for clarifying a user's identity to the information processing device before the user performs various operations on the information processing device. Only after the authentication is obtained, the user can perform various operations within the authority of the user.

【００２６】即ち、図６のフローチャートに示すよう
に、通常、認証動作は、ユーザ名と暗証番号との入力を
情報処理装置が使用者に求め、これに対して、該使用者
が正確にユーザ名と暗証番号を入力することで認証され
る。認証された後は、該使用者は、該使用者の身分の権
限範囲内で各種動作を行うことが出来る。該使用者が、
該情報処理装置に対する操作の必要が無くなった場合
は、操作の終了を該情報処理装置に入力し、操作を終了
することにより、認証動作は終了し、他の使用者が不正
に該情報処理装置を使用することを防げる。That is, as shown in the flowchart of FIG. 6, usually, in the authentication operation, the information processing apparatus requests the user to input a user name and a password, and in response to this, the user You are authenticated by entering your name and PIN. After being authenticated, the user can perform various operations within the authority of the user. The user,
When the operation on the information processing apparatus is no longer necessary, the end of the operation is input to the information processing apparatus, and the operation is terminated. Can be used.

【００２７】しかしながら、従来、図６のような認証動
作においては、次の２つの問題がある。第一の問題は、
情報処理装置の使用者が、前記ユーザ名，前記暗証番号
を絶えず記憶している必要があり、しかも、何らかの方
法で、他の使用者が前記ユーザ名，前記暗証番号を不正
に取得した場合は、該使用者になりすまして、該情報処
理装置を使用することができてしまう。第二の問題は、
認証動作後、該情報処理装置を操作中、あるいは、操作
の終了を該情報処理装置に入力しないまま、該情報処理
装置から離れた場合、他の使用者が、該情報処理装置を
操作しても、該情報処理装置は認証された使用者か、他
の使用者かの見分けが付かないため、他の使用者でも、
該情報処理装置を使用することができてしまう。本発明
に係る音声入力情報処理装置の構成は、かかる問題を解
決することができるものである。However, conventionally, in the authentication operation as shown in FIG. 6, there are the following two problems. The first problem is
If the user of the information processing apparatus needs to constantly memorize the user name and the password, and if another user illegally obtains the user name and the password by some method, In other words, the information processing apparatus can be used by impersonating the user. The second problem is
After the authentication operation, if the user is operating the information processing apparatus, or leaves the information processing apparatus without inputting the end of the operation to the information processing apparatus, another user may operate the information processing apparatus. Also, since the information processing device is indistinguishable from an authenticated user or another user, even other users,
The information processing device can be used. The configuration of the voice input information processing apparatus according to the present invention can solve such a problem.

【００２８】まず、本発明の機能を実現する為の能動消
音（変更）機能音声入力手段を備えた音声入力情報処理
装置の具体的な構成例を図７に示す。図７において、中
央演算部（ＣＰＵ）１１ａは、音声入力情報処理装置１
１の中心となり、各種処理を行うものである。主記憶部
（ＲＡＭ）１１ｂは、各種処理を行う為のデータやプロ
グラムを記憶しておく部分である。補助記憶部（ＨＤ）
１１ｃも、データやプログラムを記憶しておく部分であ
るが、主記憶部１１ｂは、音声入力情報処理装置１１の
電源がＯＦＦになっている場合、記憶情報が喪失するの
に対して、補助記憶部１１ｃは、ハードディスク装置等
を用いており、音声入力情報処理装置１１の電源がＯＦ
Ｆになっている場合であっても、記憶情報を保持するこ
とが可能である。表示部（ＣＲＴ）１１ｄは、音声入力
情報処理装置１１から使用者に情報を表示することによ
り、情報を使用者に伝達する為の手段である。キーボー
ド１１ｅは、使用者が音声入力情報処理装置１１に情報
を伝達する手段である。通信部１１ｆは、他の情報処理
装置とケーブル、無線等を用いて情報を相互に伝達する
ものである。First, FIG. 7 shows a specific configuration example of a voice input information processing apparatus provided with an active mute (change) function voice input means for realizing the function of the present invention. In FIG. 7, a central processing unit (CPU) 11a includes a voice input information processing device 1
1 and performs various processes. The main storage (RAM) 11b is a part for storing data and programs for performing various processes. Auxiliary storage (HD)
11c is also a part for storing data and programs, but the main storage unit 11b stores information when the power of the voice input information processing device 11 is turned off, while the stored information is lost. The unit 11c uses a hard disk device or the like, and the power of the voice input information processing device 11 is turned off.
Even if it is set to F, it is possible to hold the stored information. The display unit (CRT) 11d is a means for transmitting information to the user by displaying the information from the voice input information processing device 11 to the user. The keyboard 11e is a means by which the user transmits information to the voice input information processing device 11. The communication unit 11f transmits information to and from another information processing device using a cable, wireless communication, or the like.

【００２９】以上は、一般的な情報処理装置の構成例で
あるが、本発明に係る音声入力情報処理装置１１におい
ては、音声を音声信号に変換するマイクロフォン１１ｇ
と該音声信号を増幅後、Ａ／Ｄ変換してデジタル信号化
するアンプ付きＡ／Ｄ変換器１１ｈとからなる音声入力
手段１１ｇｈと、ＣＰＵからのデジタル音響情報をアナ
ログ音響信号に変換するアンプ付きＤ／Ａ変換器１１ｉ
と該アナログ音響信号を音に変換するスピーカ１１ｊと
からなる音響出力手段１１ｉｊとを、新たに備えてい
る。ここで、音声入力手段１１ｇｈは、音声を入力する
為にあり、音響出力手段１１ｉｊは、前述した通り、外
部に対して、能動的に消音または声紋を変更する為にあ
るものである。The above is an example of the configuration of a general information processing apparatus. In the voice input information processing apparatus 11 according to the present invention, a microphone 11g for converting a voice into a voice signal.
And an A / D converter 11h with an amplifier for amplifying and then A / D converting the audio signal to a digital signal, and an amplifier for converting digital audio information from the CPU into an analog audio signal. D / A converter 11i
And a speaker 11j for converting the analog sound signal into sound. Here, the sound input means 11gh is for inputting sound, and the sound output means 11ij is for actively silencing or changing the voiceprint to the outside, as described above.

【００３０】音声入力手段１１ｇｈ，音響出力手段１１
ｉｊは、一対のみではなく、図２に示したように、発音
者の音声が外部に漏洩しないように、複数の音声入力手
段１１ｇｈ，１１′ｇｈと音響出力手段１１ｉｊ，１
１′ｉｊとを効率的に配置しても良い。また、音響出力
手段１１ｉｊは、音声を能動的に消音や声紋変更する以
外に、使用者に対して情報処理装置からの情報を音によ
り伝達する目的に使用しても良い。Voice input means 11gh, sound output means 11
ij is not limited to a pair, and as shown in FIG. 2, a plurality of voice input means 11gh, 11'gh and sound output means 11ij, 1 are provided so that the voice of the sounder does not leak outside.
1′ij may be arranged efficiently. The sound output means 11ij may be used for the purpose of transmitting information from the information processing device to the user by sound, in addition to actively silencing the sound or changing the voice print.

【００３１】次に、本発明に係る音声入力情報処理装置
における全体の処理の流れを、図８のフローチャートに
示す。まず、音声入力情報処理装置１１の電源をＯＮに
すると、各種回路ブロックの動作チェック後、音声入力
情報処理装置１１は、補助記憶部１１ｃから起動プログ
ラムを主記憶部１１ｂにロードして、音声入力情報処理
装置１１は、オペレーションシステムＯＳを起動後、使
用者からの使用待ちを示す「開始」状態となる。以降
は、前述したごとく、本発明における消音動作や、声紋
変更動作、あるいは、声紋判定動作、音声認識動作等の
各種処理ルーチンが、図８に示すメインルーチンと並行
して同時に動作することになる。Next, the flow of the entire processing in the voice input information processing apparatus according to the present invention is shown in the flowchart of FIG. First, when the power of the voice input information processing apparatus 11 is turned on, after checking the operation of various circuit blocks, the voice input information processing apparatus 11 loads a startup program from the auxiliary storage unit 11c into the main storage unit 11b, and performs voice input processing. After activating the operation system OS, the information processing apparatus 11 enters a “start” state indicating a wait for use by a user. Thereafter, as described above, various processing routines such as a mute operation, a voiceprint change operation, a voiceprint determination operation, and a voice recognition operation according to the present invention operate concurrently with the main routine shown in FIG. .

【００３２】かかる同時並行動作を実現するための一例
として、タイマ割り込みを使用した場合の例を図１２で
示す。音声入力情報処理装置１１は、起動後において、
メインルーチンＳ１００を実行しても、一定の時間間隔
１００ｂでタイマの割り込み信号１００ａを発生し、メ
インルーチンＳ１００の処理を中断後、タイマによる割
り込みルーチンＳ２００にジャンプする。割り込みルー
チンＳ２００の終了後、元のメインルーチンＳ１００の
中断個所に復帰し、中断個所から引き続き、メインルー
チンＳ１００の処理を続行する。このような方式をとる
ことにより、メインルーチンＳ１００と並行して、他の
処理を実行することができる。FIG. 12 shows an example in which a timer interrupt is used as an example for realizing such simultaneous and parallel operations. After activation, the voice input information processing device 11
Even when the main routine S100 is executed, the timer interrupt signal 100a is generated at a fixed time interval 100b, and after the processing of the main routine S100 is interrupted, the process jumps to the timer interrupt routine S200. After the end of the interruption routine S200, the process returns to the original interruption point of the main routine S100, and the processing of the main routine S100 is continued from the interruption point. By adopting such a method, other processing can be executed in parallel with the main routine S100.

【００３３】そこで、図１１に示すような各種の動作を
行う処理ルーチンを、タイマによる割り込みルーチンに
組み込むことにより、メインルーチンの処理と並行して
前述の消音（変更）動作や声紋検出、あるいは、音声認
識の動作を実行させることができる。ここに、図１１
は、消音（変更）動作，声紋検出動作，音声認識の動作
を行う割り込みルーチンの流れを示すフローチャートで
ある。まず、音声入力手段１１ｇｈから、入力された音
声情報を取得する（ステップＳ２０１）。Therefore, by incorporating a processing routine for performing various operations as shown in FIG. 11 into an interrupt routine by a timer, the above-described mute (change) operation, voiceprint detection, or voiceprint detection can be performed in parallel with the processing of the main routine. The operation of voice recognition can be executed. Here, FIG.
9 is a flowchart showing the flow of an interrupt routine for performing a mute (change) operation, a voiceprint detection operation, and a voice recognition operation. First, the input voice information is obtained from the voice input unit 11gh (step S201).

【００３４】次に、予めセットされている音響信号作成
モードが、消音モードか、あるいは、声紋変更モードが
指定されているかを判定して（ステップＳ２０２）、消
音モード又は声紋変更モードが設定されている場合は
（ステップＳ２０２のＹＥＳ）、指定されている音響信
号を作成する（ステップＳ２０３）。作成する音響信号
は、前述のように、入力音声を打ち消すような消音信号
を生成したり、あるいは、周波数スペクトルを変更させ
るような、干渉信号である。かかる干渉信号を、音響出
力手段１１ｉｊを介して、出力する（ステップＳ２０
４）ことにより、消音モードの場合は、入力された音声
が外部に漏洩しないように打ち消すことが出来る。ま
た、声紋変更モードの場合は、前述のように、一部の周
波数スペクトルを打ち消すような干渉信号が出力され、
入力された原音声の声紋が外部に漏洩することを防止で
きる。Next, it is determined whether the preset sound signal creation mode is the mute mode or the voiceprint change mode is specified (step S202), and the mute mode or the voiceprint change mode is set. If there is (YES in step S202), the designated sound signal is created (step S203). The acoustic signal to be created is, as described above, an interference signal that generates a muffling signal that cancels the input voice or changes the frequency spectrum. The interference signal is output via the sound output unit 11ij (step S20).
4) Thus, in the case of the mute mode, the input voice can be canceled so as not to leak outside. In the case of the voiceprint change mode, as described above, an interference signal that cancels a part of the frequency spectrum is output,
It is possible to prevent the voiceprint of the input original voice from leaking outside.

【００３５】次に、声紋判定モードがセットされている
かチェックし（ステップＳ２０５）、セットされている
場合は（ステップＳ２０５のＹＥＳ）、声紋を検出する
（ステップＳ２０６）。なお、声紋検出については、特
開平５−２１０６４８号公報において開示されているよ
うに、各個人の声紋の特徴量を記録しておき、パターン
マッチングにより入力された音声情報から抽出した特徴
量と比較すれば良い。Next, it is checked whether the voiceprint determination mode is set (step S205). If the voiceprint determination mode is set (YES in step S205), a voiceprint is detected (step S206). As for voiceprint detection, as disclosed in Japanese Patent Application Laid-Open No. Hei 5-210648, the feature amount of each individual voiceprint is recorded and compared with the feature amount extracted from the voice information input by pattern matching. Just do it.

【００３６】次に、音声認識モードがセットされている
かチェックし（ステップＳ２０７）、セットされている
場合は（ステップＳ２０７のＹＥＳ）、特開平５−３１
３６８７号公報に開示されているように、音声を認識す
れば良い（ステップＳ２０８）。Next, it is checked whether or not the voice recognition mode is set (step S207). If the voice recognition mode is set (YES in step S207), the operation is started.
As disclosed in Japanese Patent No. 3687, the voice may be recognized (step S208).

【００３７】以上の処理が終了し、割り込みルーチンと
しての処理終了後、メインルーチンの処理を中断個所か
ら再開させて、継続させることにより、割り込みルーチ
ンで行われる消音動作、声紋変更動作、声紋認識動作、
音声認識動作が、メインルーチンの処理と並行して実行
されることになる。なお、声紋認識及び音声認識の方式
については、従来より多くの方式が公開されており、そ
の中から適当な方式を使用することにすれば良い。After the above processing is completed and the processing of the interrupt routine is completed, the processing of the main routine is resumed from the interrupted point and continued, whereby the mute operation, voiceprint change operation, voiceprint recognition operation performed in the interrupt routine are performed. ,
The voice recognition operation is executed in parallel with the processing of the main routine. It should be noted that as to the voiceprint recognition and voice recognition methods, more methods have been disclosed than ever, and an appropriate method may be used among them.

【００３８】また、以上の各動作の処理を一つのＣＰＵ
のみで実行する例を示したが、ＣＰＵ１１ａの負荷を軽
減する為には、図１３に示すように、専用の信号処理手
段（ＤＳＰ）１２を別途設ける構成として、前記割り込
みルーチンとして実現されていた各種の処理を、信号処
理手段（ＤＳＰ）１２において実行させる構成としても
良い。即ち、この場合は、ＣＰＵ１１ａからの一定の割
り込み信号を基にして、ＤＳＰ１２が、図１１に示す割
り込みルーチンの各処理を行った結果を、ＤＳＰ１２か
らＣＰＵ１１ａに転送するように構成しても良い。The processing of each of the above operations is performed by one CPU.
Although an example in which the processing is executed only by the CPU 11a has been described, in order to reduce the load on the CPU 11a, as shown in FIG. 13, a dedicated signal processing means (DSP) 12 is separately provided, which is realized as the interrupt routine. Various processes may be performed by the signal processing unit (DSP) 12. That is, in this case, the DSP 12 may be configured to transfer the result of performing each processing of the interrupt routine shown in FIG. 11 from the DSP 12 to the CPU 11a based on a certain interrupt signal from the CPU 11a.

【００３９】次に、本発明に係る音声入力情報処理装置
における全体の処理（即ち、メインルーチン）の流れ
を、図８のフローチャートに戻って説明する。使用者の
認証に当たって、まず、どのような認証を行うか判断す
る（ステップＳ１０１）。本実施例においては、外部に
使用者の音声が漏れない消音認証モードと、外部に声紋
を漏洩させない声紋変更モードのいずれかが選択でき
る。Next, the flow of the entire processing (that is, the main routine) in the voice input information processing apparatus according to the present invention will be described with reference to the flowchart of FIG. In authenticating the user, first, what kind of authentication is to be performed is determined (step S101). In the present embodiment, either a mute authentication mode in which the user's voice is not leaked to the outside or a voiceprint change mode in which the voiceprint is not leaked to the outside can be selected.

【００４０】消音認証モードの場合は（ステップＳ１０
１の消音モードの場合）、消音認証モードの設定動作を
実行し（ステップＳ１０２）、図９に示すように、消音
モード（ステップＳ１０２ａ）と声紋判定モード（ステ
ップ１０２ｂ）と音声認識モード（ステップＳ１０２
ｃ）の各フラグをセットする。In the case of the mute authentication mode (step S10
1), a mute authentication mode setting operation is performed (step S102), and as shown in FIG. 9, a mute mode (step S102a), a voiceprint determination mode (step 102b), and a voice recognition mode (step S102).
Set each flag of c).

【００４１】一方、声紋変更モードの場合は（ステップ
Ｓ１０１の変更モードの場合）、声紋変更認証モードの
設定動作を実行し（ステップＳ１０３）、図１０に示す
ように、声紋変更モード（ステップＳ１０３ａ）と声紋
判定モード（ステップＳ１０３ｂ）と音声認識モード
（ステップＳ１０３ｃ）の各フラグをセットする。On the other hand, in the case of the voiceprint change mode (in the case of the change mode in step S101), the setting operation of the voiceprint change authentication mode is executed (step S103), and as shown in FIG. 10, the voiceprint change mode (step S103a). And the voice print determination mode (step S103b) and the voice recognition mode (step S103c).

【００４２】図９に示す消音認証モードの設定動作、ま
たは、図１０に示す声紋変更認証モードの設定動作にお
いて設定された各フラグを基にして、前述した図１１に
示すように、音声入力情報処理装置１１は、割り込みル
ーチンＳ２００において、フラグが設定されているモー
ドに関する動作を行うことになる。Based on the flags set in the mute authentication mode setting operation shown in FIG. 9 or the voiceprint change authentication mode setting operation shown in FIG. 10, as shown in FIG. The processing device 11 performs an operation related to the mode for which the flag is set in the interrupt routine S200.

【００４３】なお、図９に示す消音認証モードの設定動
作の場合、一定間隔で実行される割り込みルーチンＳ２
００は、外部に、入力音声が漏洩しないように、入力さ
れた音声信号を打ち消すように干渉音を作成し（ステッ
プＳ２０３）、更に、入力信号の音声から声紋検出によ
り声紋の特徴量を抽出し（ステップＳ２０６）、音声認
識により音声を認識しユーザ名等を検出している（ステ
ップＳ２０８）。これにより、検出された声紋やユーザ
名で認証可能かどうか判断し、認証フラグをセットする
（ステップＳ１０２ｄ）。In the case of the setting operation of the mute authentication mode shown in FIG. 9, an interruption routine S2 executed at regular intervals
In step S203, an interference sound is created so as to cancel the input voice signal so as not to leak the input voice to the outside (step S203), and the voiceprint feature amount is extracted from the voice of the input signal by voiceprint detection. (Step S206), voice is recognized by voice recognition to detect a user name and the like (Step S208). Thus, it is determined whether or not authentication is possible with the detected voiceprint or user name, and an authentication flag is set (step S102d).

【００４４】一方、図１０に示す声紋変更認証モードの
設定動作の場合も、ほぼ同様であり、干渉信号は音声の
特定の周波数スペクトルのみ打ち消すようにすれば良い
（ステップＳ２０３）。または、他の声紋の特徴的な周
波数スペクトル部分を打ち消すようにしても良い。この
ように、消音認証動作または声紋変更認証動作により得
られた認証情報に基づいて、認証し、認証フラグを設定
後（ステップＳ１０２ｄ又は１０３ｄ）、図８に戻っ
て、認証判定をして（ステップＳ１０４）、認証ＯＫの
時は（ステップＳ１０４のＯＫ）、使用者が音声入力情
報処理装置を使用可能とし、ログインする。On the other hand, the operation of setting the voiceprint change authentication mode shown in FIG. 10 is almost the same, and only the interference signal needs to cancel out a specific frequency spectrum of the voice (step S203). Alternatively, a characteristic frequency spectrum portion of another voiceprint may be canceled. As described above, after performing authentication based on the authentication information obtained by the mute authentication operation or the voiceprint change authentication operation and setting the authentication flag (step S102d or 103d), returning to FIG. (S104) When the authentication is OK (OK in step S104), the user enables the voice input information processing apparatus and logs in.

【００４５】ログイン後、音声モードを判定し（ステッ
プＳ１０５）、消音モードの場合は（ステップＳ１０５
の消音モードの場合）、消音モードフラグをセットし
（ステップＳ１０６）、一方、声紋変更モードの場合は
（ステップＳ１０５の変更モードの場合）、声紋変更モ
ードフラグをセットする（ステップＳ１０７）。消音モ
ードフラグがセットされている場合は、タイマによる定
期的な割り込みにより起動される割り込みルーチンにお
いて消音動作が実行される（ステップＳ２０３，Ｓ２０
４）。また、声紋変更モードフラグがセットされている
場合は、同様に、声紋が外部に漏れない様に声紋が変更
される（ステップＳ２０３，Ｓ２０４）。After logging in, the voice mode is determined (step S105). If the mode is the mute mode (step S105)
In the case of the mute mode, the mute mode flag is set (step S106), while in the case of the voiceprint change mode (the change mode of step S105), the voiceprint change mode flag is set (step S107). If the mute mode flag is set, a mute operation is performed in an interrupt routine started by a periodic interrupt by a timer (steps S203 and S20).
4). If the voiceprint change mode flag is set, similarly, the voiceprint is changed so that the voiceprint does not leak outside (steps S203 and S204).

【００４６】次に、操作モードの判定を行い（ステップ
Ｓ１０８）、声紋判定モードの場合、割り込みルーチン
で抽出された声紋が使用者と認証された場合は（ステッ
プＳ１０９，Ｓ１１０のＯＫ）、声紋判定がＯＫとな
り、音声認識動作により、認証された使用者の操作命令
を音声より認識し（ステップＳ１１１）、操作命令に従
い、音声入力情報処理装置が処理を行い（ステップＳ１
１２）、処理終了後、再び音声モード判定（ステップＳ
１０５）に戻る。これらの音声モード、操作モードの設
定は、音声による操作命令により、音声入力情報処理装
置が、任意に切り替えることができる。Next, the operation mode is determined (step S108). In the case of the voiceprint determination mode, if the voiceprint extracted in the interrupt routine is authenticated as a user (OK in steps S109 and S110), the voiceprint determination is performed. Is OK, the operation command of the authenticated user is recognized from the voice by the voice recognition operation (step S111), and the voice input information processing device performs processing according to the operation command (step S1).
12) After the processing is completed, the audio mode is determined again (step S
Return to 105). The setting of the voice mode and the operation mode can be arbitrarily switched by the voice input information processing apparatus by a voice operation command.

【００４７】つまり、例として、認証後の待機状態にお
いては、消音モード及び声紋判定モードが初期状態であ
ると設定することにより、使用者は外部に自分の音声が
漏洩しない状態で、情報処理装置を操作でき、認証後、
他の人が不正に操作しようとしても声紋判定でＮＧにな
るので、他の人は操作することができない。これにより
機密が保持でき、かつ、オフィスの静穏化が図ることが
できる。また、音声入力情報処理装置を構成しているＰ
Ｃを使用中に、他の人と会話が必要になった場合は、音
声モードを声紋変更モードに設定することにより、消音
することなく、他の人と会話が出来る。また、現在、使
用中の音声入力情報処理装置を本人の了解を得て、他人
が使用する場合は、操作モードを非判定モードに設定す
れば良い。これらのモード変更は、認証者のみが行える
ようにすることによりセキュリティの保護が図れる。That is, as an example, in the standby state after the authentication, the mute mode and the voiceprint determination mode are set to the initial state, so that the user does not leak his / her own voice to the information processing apparatus. Can be operated and after authentication,
Even if another person tries to operate illegally, the voiceprint determination becomes NG, and the other person cannot operate. Thereby, confidentiality can be maintained, and the office can be quieted. Also, P which constitutes the voice input information processing device
If it becomes necessary to have a conversation with another person while using C, by setting the voice mode to the voiceprint change mode, the conversation can be made with another person without silencing. When the voice input information processing device currently in use is used by another person with the consent of the person concerned, the operation mode may be set to the non-judgment mode. These mode changes can be performed only by the authenticator, thereby protecting the security.

【００４８】また、他の入力手段であるキーボード、ポ
インティングデバイスからの入力情報については、操作
モードが声紋判定モードの状態に設定している場合にあ
っては、一定期間以上認証者の声紋が検出できない場合
は、入力を受け付けないようにする。これにより、使用
者が席を離れたりした場合は、かかる他の入力手段から
の入力をロックすることが出来る。また、使用者が席に
いるかどうかを人感センサ等を使用して検出しても良
い。For input information from other input means such as a keyboard and a pointing device, when the operation mode is set to the voiceprint determination mode, the voiceprint of the authenticator is detected for a certain period or more. If you cannot, do not accept input. Thus, when the user leaves the seat, the input from the other input means can be locked. Further, whether or not the user is in the seat may be detected using a human sensor or the like.

【００４９】このように、能動消音機能を備えた音声入
力手段による音声入力情報処理装置は認証及び操作時の
セキュリティが向上する。また、認証時に使用する声紋
検出としてユーザ名等を使用した場合、音声入力情報処
理装置の入力時には、前述の声紋変更手段でセキュリテ
ィが守られるが、そのユーザ名を他の場所で会話してい
るところを録音される場合があるので、認証時に、発音
情報、例えば、ランダムに英数字を発音するように使用
者に指示し、この発音された音声の声紋を基に認証して
も良い。また、図１４に示すごとく、ユーザ名に対する
声紋の特徴量を記録した声紋データベース１３を、ユー
ザごとに用意して色々な文字列に関する声紋の特徴量を
記録しておき、音声による操作を繰り返すごとに学習す
るようにしておいても良い。また、認証のセキュリティ
をより増す為に、他の指紋等の認証を併用して実施して
も良い。As described above, the voice input information processing apparatus using the voice input means having the active noise reduction function improves the security at the time of authentication and operation. Further, when a user name or the like is used as a voiceprint detection used at the time of authentication, the security is protected by the above-mentioned voiceprint changing means at the time of input of the voice input information processing apparatus, but the user name is being conversed in another place. However, in some cases, the user may be instructed to pronounce pronunciation information, for example, alphanumeric characters at random, and perform authentication based on the voiceprint of the pronounced sound. Also, as shown in FIG. 14, a voiceprint database 13 in which voiceprint feature amounts for user names are recorded is prepared for each user, and voiceprint feature amounts for various character strings are recorded. You may also learn in advance. Further, in order to further increase the security of the authentication, the authentication may be performed in combination with another authentication such as a fingerprint.

【００５０】以上に説明したごとく、音声入力手段とし
て、消音機能又は声紋変更機能を有しているので、外部
に入力音声又は該入力音声の声紋が漏洩することもな
く、音声入力情報処理装置のセキュリティが保持でき
る。また、音声入力情報処理装置からの表示または発音
に対応して、それを復唱した音声の声紋情報を基に認証
することが可能であるので、認証の為の暗証情報を認証
者が記憶する必要が無く、また、外部に漏洩せずに、声
紋により認証しているので、確実に認証できる。As described above, since the voice input means has a mute function or a voiceprint changing function, the input voice or the voiceprint of the input voice does not leak to the outside, and the voice input information processing device has Security can be maintained. In addition, since it is possible to authenticate based on the voiceprint information of the voice repetition of the voice reproduced in response to the display or pronunciation from the voice input information processing device, it is necessary for the authenticator to store the password information for the authentication. Since the authentication is performed by using a voiceprint without leaking to the outside, the authentication can be surely performed.

【００５１】更に、認証された音声と同じ特徴の声紋に
よってのみ、音声入力情報処理装置の操作指示を受け付
けるので、複数の発音者がいても、特定の操作者のみの
指示で操作可能である。また、該音声入力情報処理装置
を操作中、あるいは、操作の終了を該音声入力情報処理
装置に入力しないまま席を離れた場合でも、他の使用者
が、なりすまして操作を行うこともできない。更に、認
証情報も外部に漏洩しない。Further, since the operation instruction of the voice input information processing device is received only by a voiceprint having the same characteristic as the authenticated voice, even if there are a plurality of sounders, the operation can be performed by the instruction of only a specific operator. Further, even when the user is operating the voice input information processing apparatus or leaves the seat without inputting the end of the operation to the voice input information processing apparatus, another user cannot perform the operation by impersonating. Further, the authentication information does not leak outside.

【００５２】[0052]

【発明の効果】（請求項１での作用効果）消音機能を備
えた音声入力手段を備えているので、操作者の周囲に音
声が漏洩しない。従って、外部に機密の漏洩がなく、オ
フィスの静穏化が実現できる。According to the first aspect of the present invention, since the voice input means having the mute function is provided, the voice does not leak around the operator. Therefore, there is no leakage of confidential information outside, and the office can be quietened.

【００５３】（請求項２での作用効果）声紋変更手段を
備えているので、外部に声紋の漏洩がなく、しかも、他
の人への音声は伝達でき、会話を行うことができる。(Effect of Claim 2) Since the voiceprint changing means is provided, the voiceprint is not leaked to the outside, and the voice can be transmitted to another person and conversation can be performed.

【００５４】（請求項３での作用効果）外部に音声また
は声紋が漏洩せずに認証できるので、暗証情報の漏洩無
しに音声認証できる。(Effect of Claim 3) Since authentication can be performed without leaking voice or voiceprint to the outside, voice authentication can be performed without leaking password information.

【００５５】（請求項４での作用効果）認証された音声
と同じ特徴の声紋により、音声入力情報処理装置の操作
指示を行うので、複数の発音者がいても特定の操作者の
みの指示で操作可能である。(Operation and Effect of Claim 4) Since the operation instruction of the voice input information processing apparatus is performed by the voiceprint having the same characteristic as the authenticated voice, even if there are a plurality of sounders, the instruction is performed only by the specific operator. Operable.

【００５６】（請求項５での作用効果）音声認証された
音声と同じ特徴の声紋により、音声入力情報処理装置の
操作指示を行うので、複数の発音者がいても特定の操作
者のみの指示で操作可能であり、しかも、認証情報が外
部に漏洩しない。(Operation and Effect of Claim 5) Since the operation instruction of the voice input information processing apparatus is performed by the voice print having the same characteristics as the voice that has been authenticated, even if there are a plurality of sounders, the instruction of only the specific operator is given. And the authentication information does not leak outside.

【００５７】（請求項６での作用効果）認証するための
発音情報を音声入力情報処理装置が表示または発音し、
それを復唱した音声の声紋情報を基に認証するので、認
証の為の暗証情報を認証者が記憶する必要が無く、ま
た、外部に漏洩せずに声紋により認証しているので、確
実に認証を行うことができる。(Operation and Effect of Claim 6) The voice input information processing device displays or sounds pronunciation information for authentication.
Authentication is performed based on the voiceprint information of the voice that is read back, so there is no need for the authenticator to memorize the password information for authentication, and since the authentication is performed using the voiceprint without leaking to the outside, authentication is ensured. It can be performed.

[Brief description of the drawings]

【図１】本発明に係る音声入力情報処理装置における
音声入力手段の一実施形態を示す概念図である。FIG. 1 is a conceptual diagram showing one embodiment of a voice input unit in a voice input information processing device according to the present invention.

【図２】多数対の音声検出手段と音響発生手段とを備
えた音声入力手段の他の実施形態を示す概念図である。FIG. 2 is a conceptual diagram showing another embodiment of a voice input unit provided with a large number of pairs of voice detection units and sound generation units.

【図３】入力音声信号と逆位相の干渉音信号との合成
音信号の関係を示す模式図である。FIG. 3 is a schematic diagram showing a relationship between a synthesized sound signal of an input sound signal and an interference sound signal of an opposite phase.

【図４】発音者の原音声の周波数スペクトルを変更さ
せる場合の周波数スペクトルの一例を示す模式図であ
る。FIG. 4 is a schematic diagram showing an example of a frequency spectrum when a frequency spectrum of an original voice of a sounder is changed.

【図５】能動消音機能または能動声紋変更機能を適用
した音声入力手段を有する音声入力情報処理装置の基本
的な構成を示す構成図である。FIG. 5 is a configuration diagram showing a basic configuration of a voice input information processing apparatus having voice input means to which an active mute function or an active voiceprint changing function is applied.

【図６】情報処理装置の認証動作の概念を示すフロー
チャートである。FIG. 6 is a flowchart illustrating a concept of an authentication operation of the information processing apparatus.

【図７】本発明の機能を実現する為の能動消音（変
更）機能音声入力手段を備えた情報処理装置の具体的な
構成例に示すものである。FIG. 7 shows a specific configuration example of an information processing apparatus provided with an active mute (change) function voice input unit for realizing the function of the present invention.

【図８】本発明に係る音声入力情報処理装置における
全体の処理の流れを示すフローチャートである。FIG. 8 is a flowchart showing the overall processing flow in the voice input information processing apparatus according to the present invention.

【図９】本発明に係る音声入力情報処理装置における
消音認証モードの設定動作の流れを示すフローチャート
である。FIG. 9 is a flowchart showing a flow of a setting operation of a mute authentication mode in the voice input information processing apparatus according to the present invention.

【図１０】本発明に係る音声入力情報処理装置におけ
る声紋変更認証モードの設定動作の流れを示すフローチ
ャートである。FIG. 10 is a flowchart showing a flow of a setting operation of a voiceprint change authentication mode in the voice input information processing apparatus according to the present invention.

【図１１】消音（変更）動作，声紋検出動作，音声認
識の動作を行う割り込みルーチンの流れを示すフローチ
ャートである。FIG. 11 is a flowchart showing a flow of an interrupt routine for performing a mute (change) operation, a voiceprint detection operation, and a voice recognition operation.

【図１２】タイマ割り込みを使用した場合の同時並行
動作の実施例を示す概念図である。FIG. 12 is a conceptual diagram showing an embodiment of a concurrent operation when a timer interrupt is used.

【図１３】中央演算部（ＣＰＵ）と別に専用の信号処
理手段（ＤＳＰ）を別途設ける場合の構成を示す概念図
である。FIG. 13 is a conceptual diagram showing a configuration in a case where a dedicated signal processing means (DSP) is provided separately from a central processing unit (CPU).

【図１４】ユーザ名に対する声紋の特徴量を記録した
データベースの例を示す概念図である。FIG. 14 is a conceptual diagram showing an example of a database in which voiceprint feature amounts for user names are recorded.

[Explanation of symbols]

１，１′，１″…音声検出手段、２，２′，２″…音響
発生手段、３…音響信号作成手段、４…音声伝達手段、
５…音声入力手段、６…情報処理装置、１０…発音者、
１１…音声入力情報処理装置、１１ａ…中央演算部（Ｃ
ＰＵ）、１１ｂ…主記憶部（ＲＡＭ）、１１ｃ…補助記
憶部（ＨＤ）、１１ｄ…表示部（ＣＲＴ）、１１ｅ…キ
ーボード、１１ｆ…通信部、１１ｇ…マイクロフォン、
１１ｈ…アンプ付きＡ／Ｄ変換器、１１ｇｈ，１１′ｇ
ｈ…音声入力手段、１１ｉ…アンプ付きＤ／Ａ変換器、
１１ｊ…スピーカ、１１ｉｊ，１１′ｉｊ…音響出力手
段、１２…信号処理手段（ＤＳＰ）、１３…声紋データ
ベース。1, 1 ', 1 "voice detecting means, 2, 2', 2" voice generating means, 3 voice signal generating means, 4 voice transmitting means,
5 voice input means, 6 information processing device, 10 sounder,
11 voice input information processing device, 11a central processing unit (C
PU), 11b main storage unit (RAM), 11c auxiliary storage unit (HD), 11d display unit (CRT), 11e keyboard, 11f communication unit, 11g microphone,
11h: A / D converter with amplifier, 11gh, 11'g
h: voice input means, 11i: D / A converter with amplifier,
11j: Speaker, 11ij, 11'ij: Sound output means, 12: Signal processing means (DSP), 13: Voiceprint database.

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 15/28 ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 15/28

Claims

[Claims]

An audio input means having an active mute function for actively silencing the sound by means of a sound detecting means for detecting a sound and a sound generating means for generating a sound having an opposite phase from the detected sound. Thus, the voice input means transmits a signal related to the detected voice to the information processing device and prevents the input voice from leaking to the outside.

2. A voice input means having a voiceprint changing function for actively changing a voiceprint by means of voice detecting means for detecting voice and sound generating means for generating a sound based on the detected voice. The voice input means transmits a signal related to the detected voice to an information processing apparatus, and allows a voice of a different voiceprint to be heard outside.

3. The voice input information processing apparatus according to claim 1, wherein said voice input means performs authentication by extracting voiceprint features of voice.

4. A voice input information processing apparatus comprising: voice detection means for detecting voice; and voiceprint detection means for detecting a voiceprint based on the detected voice. A voice input information processing device, which recognizes only an operation command by a voice of a voiceprint having a characteristic and accepts an operation.

5. The voice input information processing apparatus according to claim 3, further comprising: voice detection means for detecting a voice; and voiceprint detection means for detecting a voiceprint based on the detected voice. A voice input information processing device, wherein the device recognizes only an operation command by voice of a voiceprint having characteristics similar to the characteristics of the authenticated voiceprint, and accepts the operation.

6. The voice input information processing device according to claim 3, wherein at the time of authentication, the voice input information processing device displays or pronounces pronunciation information for authentication, and the authenticator pronounces the pronunciation information, A voice input information processing apparatus for performing authentication on the basis of a voiceprint of a received voice.