JP5054477B2

JP5054477B2 - Hearing aid

Info

Publication number: JP5054477B2
Application number: JP2007249480A
Authority: JP
Inventors: 篤今井; 彰男安藤
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2007-09-26
Filing date: 2007-09-26
Publication date: 2012-10-24
Anticipated expiration: 2027-09-26
Also published as: JP2009080298A

Abstract

PROBLEM TO BE SOLVED: To enable a listener to further comfortably and easily hear voice. SOLUTION: The hearing aid device for outputting voice to be heard by a listener while emphasizing a predetermined section of the voice includes a voice input part which inputs the voice to be heard by the listener; a voice section detection part which detects the voice section to be emphasized and a voice start point from the voice input by the voice input part; a hearing aid voice generation part which controls the speech speed and formant based on the voice start point detected by the voice section detection part to generate a hearing aid voice; and a voice output part which output the hearing aid-controlled voice generated by the hearing aid voice generation part. COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、補聴装置に係り、特に受聴者に対してより快適に音声を聞き取り易くするための補聴装置に関する。 The present invention relates to a hearing aid device, and more particularly, to a hearing aid device that makes it easier for listeners to hear sound.

従来、高齢者や聴力が弱い人（難聴者）等の受聴者の聴力を補償するために、聞き取りの手がかりを与える手法が用いられている。例えば、発話の開始タイミングを意識させるために、発話の開始部分をゆっくり再生させることで聞き取りの手がかりを与える話速変換に関する手法が開示されている（例えば、特許文献１参照。）。 Conventionally, in order to compensate the hearing ability of a listener such as an elderly person or a person with weak hearing (a hearing-impaired person), a technique for giving a clue to listening has been used. For example, in order to make the utterance start timing conscious, a technique related to speech speed conversion that gives a clue for listening by slowly reproducing the start part of the utterance has been disclosed (for example, see Patent Document 1).

特許文献１に示されている手法は、時間的に変化する任意の比率で、入力データを伸張合成して得られた出力データについて、ある無音区間が出現し、この無音区間の継続時間が所定の閾値を越えているときに、この入力データに対する出力データの伸張時間を、この伸張時間内の任意の時間だけ削除することで、実際に発話された時間枠の中で、話速変換を行うことができる。
特許第３２２００４３号公報 In the technique disclosed in Patent Document 1, a certain silent section appears in output data obtained by decompressing and synthesizing input data at an arbitrary ratio that changes with time, and the duration of this silent section is predetermined. When the threshold value of the input data is exceeded, the decompression time of the output data with respect to this input data is deleted by an arbitrary time within the decompression time, so that the speech speed is converted within the actually spoken time frame. be able to.
Japanese Patent No. 3220043

しかしながら、上述に示すような従来手法には、音声信号の受聴タイミングで増幅率や音の特徴を変化させるものはなく、マイクロホンで収音した音を予め調節された増幅率や音質で再生するのみであった。したがって、音声信号の受聴タイミングで増幅率や音の特徴を変化させ、最小限必要な補聴のみを行い効率的に補聴効果を向上させる手法がなかった。 However, none of the conventional methods as described above change the amplification factor or the sound characteristics at the timing of listening to the audio signal, and only reproduce the sound collected by the microphone with the amplification factor and sound quality adjusted in advance. Met. Therefore, there has been no method for efficiently improving the hearing aid effect by changing the amplification factor and the sound characteristics at the listening timing of the audio signal and performing only the minimum necessary hearing aid.

本発明は、上述した問題点に鑑みなされたものであり、受聴者に対してより快適に音声を聞き取り易くするための補聴装置を提供することを目的とする。 The present invention has been made in view of the above-described problems, and an object of the present invention is to provide a hearing aid device that makes it easier for listeners to hear sound more comfortably.

上記課題を解決するために、本件発明は、以下の特徴を有する課題を解決するための手段を採用している。 In order to solve the above problems, the present invention employs means for solving the problems having the following characteristics.

請求項１に記載された発明は、受聴者に受聴させる音声の所定の区間を強調して音声を出力する補聴装置において、受聴者に受聴させる音声を入力する音声入力部と、前記音声入力部により入力された音声から強調させる音声区間及び音声開始点を検出する音声区間検出部と、前記音声区間検出部により検出された前記音声開始点を基準として、話速及びホルマントを制御して補聴音声を生成する補聴音声生成部と、前記補聴音声生成部により生成された補聴制御された音声を出力する音声出力部とを有し、前記音声区間検出部は、予め上限値及び下限値の２つの音声のパワーの閾値を設定し、前記音声入力部により入力された音声のパワーが前記上限値を越えたときの時間を基準として時間軸を逆方向に移動し、最初に前記下限値を下回ったときの時間を音声開始点として検出し、前記下限値を下回ったときの時間から前記上限値を越えたときの時間までを前記音声区間として検出することを特徴とする。 The invention described in claim 1 is a hearing aid device that outputs a sound by emphasizing a predetermined section of the sound to be heard by the listener, a sound input unit for inputting the sound to be heard by the listener, and the sound input unit. Hearing aid speech by controlling speech speed and formant with reference to the speech start point detected by the speech segment detection unit and the speech segment detection unit for detecting the speech segment and speech start point to be emphasized from the speech input by and hearing the sound generator for generating a, the hearing aid have a sound output unit for outputting a sound that is hearing controlled generated by the sound generating unit, the voice section detection unit, the two previously upper limit value and the lower limit value A voice power threshold is set, the time axis is moved in the reverse direction based on the time when the voice power input by the voice input unit exceeds the upper limit value, and first falls below the lower limit value. Detecting a time as a voice starting point of time, and detecting from time when below the lower limit to the time when it exceeds the upper limit value as the speech segment.

請求項１記載の発明によれば、受聴者に対してより快適に音声を聞き取り易くすることができる。具体的には、話が始まるタイミングを受聴者に知らせることで、音を聞くための心構えを喚起し、その結果として補聴効果を向上させることができる。また、音声のパワーを基準として音声区間及び音声開始点を効率的に取得することができる。 According to the first aspect of the present invention, it is possible to make it easier for listeners to hear sound. Specifically, by informing the listener of the timing when the talk starts, it is possible to arouse the attitude to hear the sound, and as a result, the hearing aid effect can be improved. Further, it is possible to efficiently acquire the voice section and the voice start point with the voice power as a reference.

請求項２に記載された発明は、前記補聴音声生成部は、前記音声開始点で出力される音声を所定の増幅率まで増幅し、その後は前記音声区間の中で予め設定された制御関数に対応させて増幅率を下げることを特徴とする。 According to a second aspect of the present invention, the hearing aid sound generation unit amplifies the sound output at the sound start point to a predetermined amplification factor, and then sets a control function set in advance in the sound section. The amplification factor is lowered correspondingly.

請求項２記載の発明によれば、音声区間検出部により検出される音声区間全体の音声を増幅するのではなく、音声開始点（話し始めの部分）を基準に音の大きさ（増幅率）を変化させることで最小限必要な補聴のみを行うことができ、補聴効果を向上させることができる。 According to the second aspect of the present invention, rather than amplifying the voice of the entire voice section detected by the voice section detector, the volume (amplification factor) of the sound is based on the voice start point (the part at the beginning of the speech). By changing, only the minimum necessary hearing aid can be performed, and the hearing aid effect can be improved.

請求項３に記載された発明は、前記補聴音声生成部は、前記音声開始点で出力される話速を所定の速度まで遅くし、その後は前記音声区間の中で予め設定された制御関数に対応させて話速を通常速度に近付けることを特徴とする。 According to a third aspect of the present invention, the hearing aid sound generation unit slows the speech speed output at the sound start point to a predetermined speed, and then sets a control function set in advance in the sound section. Correspondingly, the speech speed is brought close to the normal speed.

請求項３記載の発明によれば、音声区間検出部により検出される音声区間全体の速度を遅くするのではなく、音声開始点を基準に音の速度を変化させることで最小限必要な補聴のみを行うことができ、補聴効果を向上させることができる。 According to the third aspect of the present invention, only the minimum necessary hearing aid is obtained by changing the speed of the sound based on the voice start point, rather than slowing down the speed of the whole voice section detected by the voice section detector. And the hearing aid effect can be improved.

請求項４に記載された発明は、前記補聴音声生成部は、前記音声開始点で出力される音声のホルマント周波数の変化を強調し、その後は前記音声区間の中で予め設定された制御関数に対応させて強調を減少させることを特徴とする。 In the invention described in claim 4 , the hearing aid sound generation unit emphasizes a change in formant frequency of the sound output at the sound start point, and then sets a control function set in advance in the sound section. Correspondingly, the emphasis is reduced.

請求項４記載の発明によれば、音声区間検出部により検出される音声区間全体の速度を遅くするのではなく、音声開始点を基準にホルマント周波数の変化を強調することで、話し始めの“はっきり感”を増すことができ、補聴効果を向上させることができる。 According to the fourth aspect of the present invention, instead of slowing down the speed of the entire speech section detected by the speech section detecting unit, the change of the formant frequency is emphasized with reference to the speech start point. A clear feeling can be increased and the hearing aid effect can be improved.

請求項５に記載された発明は、前記音声開始点で前記受聴者に対して触覚による刺激を与える触覚刺激部を有することを特徴とする。 The invention described in claim 5 includes a tactile stimulation unit that applies tactile stimulation to the listener at the voice start point.

請求項５記載の発明によれば、効果的に受聴者に音声の開始を伝えることができる。 According to the invention described in claim 5 , it is possible to effectively notify the listener of the start of sound.

本発明によれば、受聴者に対してより快適に音声を聞き取り易くすることができる。具体的には、話が始まるタイミングを受聴者に知らせることで、音を聞くための心構えを喚起し、その結果として補聴効果を向上させることができる。 According to the present invention, it is possible to make it easier for listeners to hear sound more comfortably. Specifically, by informing the listener of the timing when the talk starts, it is possible to arouse the attitude to hear the sound, and as a result, the hearing aid effect can be improved.

＜本発明の概要＞
本発明は、音声データ全体から音声区間を抽出し、更に話し始めの部分（音声開始点）を基準として、音の大きさ（増幅率）や速度を変化させたり、ホルマント周波数の変化を強調させることで、受聴者（使用者）に対して話し始めの聞き取り易さを向上させる。 <Outline of the present invention>
The present invention extracts a speech section from the entire speech data, and further changes the volume (amplification factor) and speed of the speech starting point (speech start point) or emphasizes the change in formant frequency. This improves the ease of listening to the listener (user) at the beginning of the conversation.

また、本発明では、話し始めのタイミングを触覚刺激で与える。これは、例えば、お年寄りや聴力が低下した人でも、肩を叩かれる等して、話しかけられるタイミングを教えられると、話の内容を聞き取れる場合が多いことが知られているため、本発明ではこの現象を利用し話し始めの部分を強調することにより、受聴者に注意を喚起する機能を補聴装置に設ける。 In the present invention, the timing of the start of speaking is given by tactile stimulation. This is because, for example, it is known that even elderly people and people with reduced hearing can often hear the content of the story if they are taught the timing of being spoken, such as being struck by their shoulders. Using this phenomenon, the hearing aid device is provided with a function of calling attention to the listener by emphasizing the beginning of the conversation.

したがって、本発明では、補聴器においても、話しかけられたタイミングを受聴者に察知させる機能を備えることで、増幅率をあまり上げなくても、利用者が良好に聞き取ることができる。また、その結果として、音の増幅率を必要以上に上げずに済むので、大きい音をうるさいと感じることも少なくなる。 Therefore, in the present invention, the hearing aid also has a function of allowing the listener to detect the timing of the talk, so that the user can listen well without increasing the amplification factor. As a result, since it is not necessary to increase the amplification factor of sound more than necessary, it is less likely that loud sounds are picked up.

以下に、上述したような特徴を有する本発明における補聴装置を好適に実施した形態について、図面を用いて詳細に説明する。 Hereinafter, a preferred embodiment of a hearing aid device according to the present invention having the above-described features will be described in detail with reference to the drawings.

＜補聴装置（補聴器）：機能構成＞
図１は、本発明における補聴装置の一構成例を示す図である。図１に示す補聴装置１０は、音声入力部１１と、前置増幅部１２と、音声開始点検出部１３と、増幅率設定部１４と、適応増幅部１５と、話速・ホルマント制御関数設定部１６と、補聴音声生成部１７と、音声出力部１８とを有するよう構成されている。 <Hearing Aid Device (Hearing Aid): Functional Configuration>
FIG. 1 is a diagram illustrating a configuration example of a hearing aid apparatus according to the present invention. A hearing aid device 10 shown in FIG. 1 includes a voice input unit 11, a preamplifier 12, a voice start point detector 13, an amplification factor setting unit 14, an adaptive amplifying unit 15, and a speech speed / formant control function setting. Unit 16, hearing aid sound generation unit 17, and sound output unit 18.

音声入力部１１は、外部から入力される音声を入力する。なお、音声入力部１１は、基本的には例えばマイクロホン等の音声入力装置により音響信号が収音される。また、音声入力部１１は、入力した音声を前置増幅部１２に出力する。 The voice input unit 11 inputs voice input from the outside. The voice input unit 11 basically collects an acoustic signal by a voice input device such as a microphone. Further, the voice input unit 11 outputs the input voice to the preamplifier unit 12.

前置増幅部１２は、音声入力部１１で入力された音声信号を所定の大きさに増幅する。また、前置増幅部１２は、増幅した音声信号を音声区間検出部１３及び適応増幅部１５に出力する。 The preamplifier 12 amplifies the audio signal input from the audio input unit 11 to a predetermined size. Further, the preamplifier 12 outputs the amplified audio signal to the audio section detector 13 and the adaptive amplifier 15.

音声区間検出部１３では、入力された音声に対して、音声に該当する音声区間と、話し始めの部分（音声開始点）を検出する。図２は、本実施形態における音声区間の検出方法の一例を示す図である。なお、図２において横軸は時間を示し、縦軸は音響パワーを示している。 The voice section detection unit 13 detects a voice section corresponding to the voice and a speech start part (voice start point) from the input voice. FIG. 2 is a diagram illustrating an example of a speech section detection method according to the present embodiment. In FIG. 2, the horizontal axis indicates time, and the vertical axis indicates sound power.

例えば、音声区間検出部１３は、図２に示すように入力される音声信号に対して音声区間とするパワーの上限値（Ｔｈ１）と下限値（Ｔｈ２）の２種類の閾値を設定し、音声信号に対する音響分析により得られる音声波形２１に対して設定された上限値と下限値の間に含まれる時間Ｔ１〜Ｔ２を音声区間とする。 For example, as shown in FIG. 2, the voice section detection unit 13 sets two types of threshold values, that is, an upper limit value (Th1) and a lower limit value (Th2) of the power to be used as a voice section for an input voice signal. Times T1 to T2 included between the upper limit value and the lower limit value set for the speech waveform 21 obtained by the acoustic analysis on the signal are defined as a speech section.

このように、音響パワーに対して予め２種類の閾値を設定する。つまり、雑音の影響を回避するため、予め大きめの音響パワーを設定した閾値Ｔｈ１を越えたときに、その時間Ｔ２を含む音声区間を検出することができる。具体的には、時間Ｔ２から時間軸を逆方向に移動し、初めて閾値Ｔｈ２を下回った時間Ｔ１を音声開始点（話し始めの部分）として検出する。これにより、音声区間検出部１３は、音声区間Ｔ１〜Ｔ２及び音声開始点Ｔ１を求めることができる。 Thus, two types of threshold values are set in advance for the sound power. That is, in order to avoid the influence of noise, it is possible to detect a speech section including the time T2 when a threshold value Th1 set in advance with a large acoustic power is exceeded. Specifically, the time axis is moved in the reverse direction from the time T2, and the time T1 that is below the threshold value Th2 for the first time is detected as the voice start point (speech start portion). Thereby, the audio | voice area detection part 13 can obtain | require audio | voice area T1-T2 and the audio | voice start point T1.

ここで、一般的に、補聴装置は、雑音が存在する環境で利用されるため、この閾値の設定が性能を左右することになる。そのため、閾値Ｔｈ１と閾値Ｔｈ２は、入力信号を観測しながら適応的に設定するのが好ましい。したがって、音声区間検出部１３は、閾値Ｔｈ１と閾値Ｔｈ２の設定を任意に変更可能な機能を有する。この場合、例えば補強装置１０に対して外部目盛等を設けて閾値を設定してもよく、また、装置内部に組み込まれるソフトウェアを用いて設定することもできる。 Here, since the hearing aid device is generally used in an environment where noise exists, the setting of this threshold value affects the performance. Therefore, it is preferable to adaptively set the threshold value Th1 and the threshold value Th2 while observing the input signal. Therefore, the speech section detection unit 13 has a function that can arbitrarily change the settings of the threshold Th1 and the threshold Th2. In this case, for example, an external scale or the like may be provided for the reinforcing device 10 to set a threshold value, or it may be set using software incorporated in the device.

また、音声区間検出部１３は、３以上の閾値の設定しておき、環境雑音等の収音する周囲の状況に応じてその中から２つの閾値を随時選択するようにしてもよい。なお、音声区間検出部１３は、上述した音声区間検出処理を音響信号のパワーが閾値Ｔｈ１を越えた場合に必ず行い、その都度得られた音声開始点情報を出力する。 Further, the voice section detection unit 13 may set three or more threshold values, and may select two threshold values from time to time according to the surrounding situation in which sound is collected such as environmental noise. Note that the speech segment detection unit 13 always performs the above-described speech segment detection process when the power of the acoustic signal exceeds the threshold value Th1, and outputs the obtained speech start point information each time.

また、音声区間検出部１３における音声区間検出手法は、上述の手法に限定されることはなく、例えばピッチ抽出によって音声の有無を観測して音声区間を検出することもできる。音声区間検出部１３は、検出された区間に関する情報（音声開始点及び音声区間）を増幅率設定部１４に出力する。 Moreover, the speech section detection method in the speech section detection unit 13 is not limited to the above-described method, and the speech section can be detected by observing the presence or absence of speech by, for example, pitch extraction. The voice segment detection unit 13 outputs information (speech start point and voice segment) regarding the detected segment to the amplification factor setting unit 14.

増幅率設定部１４は、音声区間検出部１３により音声区間及び音声開始点が検出される毎に、予め設定された増幅率等の条件に基づいて適応増幅ブロックの増幅率を制御する。ここで、図３は、増幅率を制御するための条件の一例を説明するための図である。なお、図３は、増幅率制御を行う基準として予め設定された制御関数を用いて波形を設定した例を示している。また、図３において、横軸は時間を示し、縦軸は増幅率を示している。 The amplification factor setting unit 14 controls the amplification factor of the adaptive amplification block based on conditions such as a preset amplification factor each time a speech segment and a voice start point are detected by the speech segment detection unit 13. Here, FIG. 3 is a diagram for explaining an example of conditions for controlling the amplification factor. FIG. 3 shows an example in which a waveform is set using a control function set in advance as a reference for performing gain control. In FIG. 3, the horizontal axis represents time, and the vertical axis represents the amplification factor.

ここで、図３に示す時間Ｔ３は、音声区間検出部１３で検出された音声開始点を示している。増幅率設定部１４は、図３に示す所定の関数により得られる曲線波形３１に示すように、時刻Ｔ０〜Ｔ３までの区間は、例えばシグモイド関数ｆ（ｘ）＝１／（１＋ｅ^−ｘ）により得られる曲線波形で設定し、時間Ｔ３以降については、余弦関数の１／４周期を組み合わせた曲線や適当な時定数を有するｅｘｐ減衰関数等を用いた曲線を設定する。 Here, time T 3 shown in FIG. 3 indicates the voice start point detected by the voice section detection unit 13. As shown in the curve waveform 31 obtained by the predetermined function shown in FIG. 3, the amplification factor setting unit 14 uses the sigmoid function f (x) = 1 / (1 + e ^−x ) for the section from time T0 to T3. The curve waveform is obtained, and after time T3, a curve using a quarter cycle of the cosine function or an exp decay function having an appropriate time constant is set.

また、増幅率設定部１４は、上述したような関数に基づいて、例えば強調増幅率Ａ１及び通常増幅率Ａ２を設定する。なお、図３に示す曲線の例では、音声開始点の時間Ｔ３において強調増幅率Ａ１に到達し、その後徐々に単調減少して通常増幅率Ａ２に至る制御関数であれば、どのような関数を用いてもよい。 Further, the amplification factor setting unit 14 sets, for example, the enhancement amplification factor A1 and the normal amplification factor A2 based on the function as described above. In the example of the curve shown in FIG. 3, any function can be used as long as the control function reaches the emphasis amplification factor A1 at the time T3 of the voice start point, and then gradually decreases monotonically and reaches the normal amplification factor A2. It may be used.

増幅率設定部１４は、上述した制御関数を設定し、設定した関数の情報を適応増幅部１５及び話速・ホルマント制御関数設定部１６に出力する。 The amplification factor setting unit 14 sets the control function described above, and outputs information on the set function to the adaptive amplification unit 15 and the speech speed / formant control function setting unit 16.

適応増幅部１５は、増幅率設定部１４で得られた増幅率及び制御関数により得られる曲線に適応させて前置増幅部１２から入力された音声信号の所定の音声区間を増幅する。つまり、適応増幅部１５は、音声区間検出部１３により検出された音声区間に対して、増幅率設定部１４で設定された増幅率及び制御関数により所定の音声区間の増幅を行う。 The adaptive amplifier 15 amplifies a predetermined voice section of the voice signal input from the preamplifier 12 by adapting to the curve obtained from the gain and control function obtained by the gain setting unit 14. That is, the adaptive amplifying unit 15 amplifies a predetermined voice section with respect to the voice section detected by the voice section detecting unit 13 using the amplification factor and the control function set by the amplification factor setting unit 14.

なお、上述した音声区間検出部１３と、増幅率設定部１４とにおける増幅率制御処理により処理の遅延が生じるが、この処理は人間の知覚できない遅延範囲内で行うため無視することができる。 Note that a delay in processing occurs due to the amplification factor control processing in the voice section detection unit 13 and the amplification factor setting unit 14 described above, but since this processing is performed within a delay range that cannot be perceived by humans, it can be ignored.

話速・ホルマント制御関数設定部１６では、話速制御用の関数及びホルマント制御用の関数を設定する。ここで、図４は、話速制御用の関数の一例を示す図である。また、図５は、ホルマント制御用の関数の一例を示す図である。なお、図４において、横軸は時間を示し、縦軸は波形伸縮率を示している。また、図５において、横軸は時間を示し、縦軸はホルマント制御量を示している。なお、上述の関数は一例であり、同様な単調減少性を有する関数であれば他の関数を用いた曲線波形であってもよい。 The speech speed / formant control function setting unit 16 sets a speech speed control function and a formant control function. Here, FIG. 4 is a diagram illustrating an example of a speech speed control function. FIG. 5 is a diagram illustrating an example of a function for formant control. In FIG. 4, the horizontal axis represents time, and the vertical axis represents the waveform expansion / contraction rate. In FIG. 5, the horizontal axis indicates time, and the vertical axis indicates formant control amount. Note that the above-described function is an example, and a curved waveform using another function may be used as long as the function has a similar monotonous decrease.

ここで、図４に示す時間Ｔ４は、波形区間検出部１３により検出された音声開始点を示している。話速・ホルマント制御関数設定部１６は、話速制御においては図４に示すように、音声開始点となる時間Ｔ４までの区間（時間Ｔ０〜Ｔ４）までは、波形の伸縮を行わず（伸縮率１．０）、音声開始点となる時間Ｔ４を基準にして、例えば音声の波形伸縮率を１．０よりも大きいＥ１とし、また時間の経過により次第に伸縮率が１．０に近づくように所定の曲線波形４１を設定する。なお、波形伸縮率Ｅ１については、例えば、１．５や２．０等の所定の値を環境雑音や受聴者（使用者）個人の聞き取り易さ等の各種条件に基づいて任意に設定することができる。 Here, time T 4 shown in FIG. 4 indicates the voice start point detected by the waveform section detector 13. As shown in FIG. 4, the speech speed / formant control function setting unit 16 does not perform waveform expansion / contraction until the interval (time T 0 to T 4) up to the time T 4 that is the voice start point, as shown in FIG. 4. Rate 1.0), with reference to time T4 as the voice start point, for example, the voice waveform expansion / contraction rate is set to E1 larger than 1.0, and the expansion / contraction rate gradually approaches 1.0 over time. A predetermined curve waveform 41 is set. For the waveform expansion / contraction rate E1, for example, a predetermined value such as 1.5 or 2.0 is arbitrarily set based on various conditions such as environmental noise and ease of hearing of the listener (user). Can do.

また、図４においては、波形を大きく（伸縮率が大きく）するほど、話速が遅くなる。なお、時間Ｔ４以降の曲線波形４１の設定方法としては、例えば余弦関数の１／４周期や適当な時定数を有するｅｘｐ減衰関数等を用いることができる。 Also, in FIG. 4, the larger the waveform (the greater the expansion / contraction rate), the slower the speech speed. In addition, as a setting method of the curve waveform 41 after time T4, for example, an exp decay function having a quarter period of a cosine function or an appropriate time constant can be used.

また、図５に示す時間Ｔ５は、波形区間検出部１３により検出された音声開始点を示している。話速・ホルマント制御関数設定部１６は、ホルマント制御においては図５に示すように、音声開始点となる時間Ｔ５までの区間（時間Ｔ０〜Ｔ５）までは、制御関数の値が０であるため制御は行わず、音声開始点となる時間Ｔ５以降については、制御関数の値に対応させて設定される制御量をＳ１とし、所定の曲線波形５１を設定することで、ホルマント周波数の変化を強調する。 Further, a time T5 shown in FIG. 5 indicates a voice start point detected by the waveform section detector 13. As shown in FIG. 5, the speech speed / formant control function setting unit 16 has a control function value of 0 until the time period T5 (time T0 to T5), which is the voice start point, as shown in FIG. No control is performed, and after time T5, which is the voice start point, the control amount set corresponding to the value of the control function is S1, and a predetermined curve waveform 51 is set to emphasize the change in formant frequency. To do.

なお、制御量Ｓ１については、環境雑音や使用者（受聴者）個人の聞き取り易さ等の各種条件に基づいて任意に設定することができる。また、曲線波形５１の設定方法としては、例えば余弦関数の１／４周期や適当な時定数を有するｅｘｐ減衰関数等を用いることができる。 The control amount S1 can be arbitrarily set based on various conditions such as environmental noise and ease of hearing of the user (listener). As a method for setting the curved waveform 51, for example, an 1/4 decay period of an cosine function, an exp decay function having an appropriate time constant, or the like can be used.

更に、上述した図３に示すような増幅率制御用の関数、図４に示すような話速制御用の関数、及び、図５に示すようなホルマント制御用の関数は、例えば、以下に示すような同一の関数Ｒ（ｔ）を用いることができる。 Further, the function for controlling the amplification factor as shown in FIG. 3, the function for controlling the speech speed as shown in FIG. 4, and the function for formant control as shown in FIG. The same function R (t) can be used.

Ｒ（ｔ）＝ｒ_ｅ＋（ｒ_ｓ−ｒ_ｅ）×１／２×［ｃｏｓ｛π（ｔ−ｔ_０）／Ｔ｝＋１．０］
このとき、上述した図３に示す関数の場合には、ｒ_ｅ＝Ａ２、ｒ_ｓ＝Ａ１とし、図４に示す関数の場合には、ｒ_ｅ＝１．０、ｒ_ｓ＝Ｅ１とし、図５に示す関数の場合には、ｒ_ｅ＝０、ｒ_ｓ＝Ｓ１とすることにより、１つの関数で効率的に各関数に対応させることができる。 R (t) = r _e + (r _s −r _e ) × ½ × [cos {π (t−t ₀ ) / T} +1.0]
At this time, in the case of the function shown in FIG. 3 described above, r _e = A2 and r _s = A1, and in the case of the function shown in FIG. 4, r _e = 1.0 and r _s = E1 are set. In the case of the function shown in FIG. 5, by setting r _e = 0 and r _s = S1, one function can efficiently correspond to each function.

話速・ホルマント制御関数設定部１６は、設定された話速制御用の関数及びホルマント制御用の関数を補聴音声生成部１７に出力する。 The speech speed / formant control function setting unit 16 outputs the set speech speed control function and formant control function to the hearing aid sound generation unit 17.

補聴音声生成部１７は、話速・ホルマント制御関数設定部１６で設定された各制御関数に基づいて、話速・ホルマント周波数の制御を行う。 The hearing aid sound generation unit 17 controls the speech speed / formant frequency based on the control functions set by the speech speed / formant control function setting unit 16.

ここで、補聴音声生成部１７における話速の制御に関しては、例えば、上述した特許文献１で開示された方法を利用することができる。具体的には、受聴音声の発声する速さ（話速）を遅くする際に、入力音声のデータ長と、事前に与えられた伸縮倍率に関する変換関数によって予め計算された出力データ長と、実際に出力されている音声のデータ長とを一定の処理単位で常に監視しながら、情報の欠落を生じることなく、一連の処理を行なう。 Here, regarding the control of the speech speed in the hearing aid sound generation unit 17, for example, the method disclosed in Patent Document 1 described above can be used. Specifically, when the speed at which the listening voice is uttered (speaking speed) is slowed down, the data length of the input voice, the output data length calculated in advance by a conversion function relating to a scaling factor given in advance, A series of processing is performed without causing loss of information, while constantly monitoring the data length of the sound output to the unit in a fixed processing unit.

更に、補聴音声生成部１７は、音声を伸張することによる映像と音声との時間差を最小限にすることを目的として、話速変換に期待される遅さの度合い（変換倍率）に応じて設定される可変の閾値以上の長さを有する無音区間を適宜短縮し、かつ入力データ長に対する出力データ長の時間差の程度によって適応的に変換倍率を変化させることにより、変換音声の発話時間を原音声の発話時間にほぼ保ちつつ、決められた時間枠の中で実現し得る最大のゆっくり感を自動的に生成する。 Further, the hearing aid sound generation unit 17 is set according to the degree of delay (conversion magnification) expected for the speech speed conversion for the purpose of minimizing the time difference between the video and the sound due to the expansion of the sound. The speech duration of the converted speech is changed to the original speech by appropriately shortening the silent section having a length equal to or greater than the variable threshold and changing the conversion magnification adaptively according to the degree of time difference between the output data length and the input data length. The maximum slow feeling that can be realized within a predetermined time frame is automatically generated while maintaining the utterance time of the voice.

つまり、補聴音声生成部１７は、入力データ長と、これに任意の伸縮倍率を乗じて算出される目標データ長と、実際の出力音声データ長とを比較しながら制御を行うため、伸張・伸縮倍率の変化に対しても、音声情報の欠落が生じないようにすることができる。 That is, the hearing aid sound generation unit 17 performs control while comparing the input data length, the target data length calculated by multiplying the input data length by an arbitrary expansion / contraction ratio, and the actual output sound data length. It is possible to prevent audio information from being lost even with a change in magnification.

また、時々刻々変化する原音声と、変換音声との時間差を監視し、時間差が少ない場合には、話速変換倍率を一時的に上昇させ、また逆に多い場合には、話速変換倍率を一時的に下降させる等、適応的に倍率を変化させ、更に話速変換倍率や伸張量等に基づいて、無音区間の残存割合を適応的に変化させて、話速変換に伴う原音声からの時間差を適応的に解消する。そのため、使用者等は、数段階の目安となる変換倍率を一度だけ設定操作するだけで、設定された条件に応じて話速変換倍率や無音区間を適応的に制御し、実際に発話された時間枠の中で、話速変換に期待される効果を安定して得ることができる
一方、ホルマント周波数の制御については、例えば、「都木徹、桑原尚夫、「ホルマント変化の強調・抑圧による声質制御」、日本音響学会講演論文集、昭和６１年１０月で開示された方法を利用することができる。 Also, the time difference between the original voice that changes from moment to moment and the converted voice is monitored, and if the time difference is small, the speech speed conversion magnification is temporarily increased. Change the magnification adaptively, such as temporarily lowering it, and then adaptively change the remaining ratio of the silent interval based on the speech speed conversion magnification and expansion amount, etc. The time difference is eliminated adaptively. For this reason, users can set the conversion magnification, which is a standard for several steps, only once, and adaptively control the speech speed conversion magnification and the silent interval according to the set conditions, and the speech is actually spoken. In the time frame, the effect expected for speech speed conversion can be obtained stably. On the other hand, for formant frequency control, for example, “Toru Tsuki, Nao Kuwahara,” “Voice quality by emphasizing / suppressing formant changes” The method disclosed in "Control", Proceedings of the Acoustical Society of Japan, October 1986 can be used.

具体的には、連続音声中の音韻知覚には、隣接した前後の音韻情報が重要な役割を果たしている。そこで、ある音声のホルマント周波数の時間変化をＦ（ｔ）とし、以下に示す（１）式を仮定する。 Specifically, adjacent phoneme information plays an important role in phoneme perception in continuous speech. Therefore, assuming that the time change of the formant frequency of a certain voice is F (t), the following equation (1) is assumed.

ここで、上述の（１）式において、Ｆ＾（ｔ_０）は、現実の物理量Ｆ（ｔ_０）に前後の情報を加算して得られる仮想的な物理量で、知覚の効果を考慮した特徴量ともいえる。また、上述の（１）式において、ｗ（ｔ）は重み関数であり、Ｔは考慮すべき前後の時間の範囲である。

Here, in the above equation (1), F ^ (t ₀ ) is a virtual physical quantity obtained by adding the preceding and succeeding information to the actual physical quantity F (t ₀ ), and is a feature that takes into account the effect of perception. It can be said that it is a quantity. In the above equation (1), w (t) is a weight function, and T is a range of time before and after consideration.

上述した（１）式は、Ｆ＾（ｔ_０）には現時刻ｔ_０の物理量Ｆ（ｔ_０）に加えて、時間ｔだけ前後の物理量Ｆ（ｔ_０＋ｔ）のＦ（ｔ_０）からの差が、ｔに応じた重みで貢献していることを示している。したがって、この重みＷ（ｔ）を増減することにより、知覚的な特徴量Ｆ＾（ｔ）を制御することができる。重み関数Ｗ（ｔ）には、以下に示す（２）式を用いる。 Described above (1) is in F ^ _{(t 0)} in addition to the physical quantity F _{(t 0)} of the current time _{t 0,} the F _{(t 0)} of time t only before and after the physical quantity F _(t 0 + t) It is shown that the difference of is contributing with the weight according to t. Therefore, the perceptual feature quantity F ^ (t) can be controlled by increasing or decreasing the weight W (t). The following equation (2) is used for the weight function W (t).

ここで、上述した（２）式において、αは重みを増減する係数である。なお、自然音声の第１及び第２ホルマント周波数の軌跡に、α＝７．３、Ｔ＝０．１５（ｓｅｃ）として、上述した（１）式を代入した場合に、個々の母音のクラス内のまとまりがよくなると共にクラス間の距離が大きくなり、物理特性上では母音の中性化がある程度回復されることが観察されている。

Here, in the above-described equation (2), α is a coefficient for increasing or decreasing the weight. In addition, when α = 7.3 and T = 0.15 (sec) are substituted into the trajectories of the first and second formant frequencies of natural speech and the above-described equation (1) is substituted, It has been observed that the neutralization of vowels is restored to some extent in terms of physical characteristics, as the unity improves and the distance between classes increases.

したがって、補聴音声生成部１７は、上述の（２）式のαとして、ホルマント周波数制御関数から得られる値を用いる。これにより、特徴量を制御することができ、声質制御を行うことができる。 Therefore, the hearing aid sound generation unit 17 uses a value obtained from the formant frequency control function as α in the above equation (2). Thereby, the feature amount can be controlled and voice quality control can be performed.

また、補聴音声生成部１７は、上述により設定を行った音声を音声出力部１８に出力する。音声出力部１８は、補聴音声生成部１７より得られる音声全体を所定の増幅率で増幅した後、補聴制御された音声を出力する。なお、音声出力部１８としては、例えばイヤホン等を適用することができる。 In addition, the hearing aid sound generation unit 17 outputs the sound set as described above to the sound output unit 18. The sound output unit 18 amplifies the entire sound obtained from the hearing aid sound generation unit 17 with a predetermined amplification factor, and then outputs the sound subjected to hearing aid control. In addition, as the audio | voice output part 18, an earphone etc. are applicable, for example.

これにより、受聴者に対してより快適に音声を聞き取り易くすることができる。具体的には、話が始まるタイミングを音声の大きさや速度の制御により受聴者に知らせることで、音を聞くための心構えを喚起し、その結果として補聴効果を向上させることができる。 Thereby, it is possible to make it easier for the listener to hear the sound more comfortably. Specifically, by informing the listener of the start timing of the talk by controlling the volume and speed of the voice, it is possible to arouse the preparedness for listening to the sound, and as a result, the hearing aid effect can be improved.

＜他の実施形態＞
なお、受聴者に音を聞くための心構えを喚起させる手法としては、上述以外にも、例えば触覚を刺激する手段を設けることで、その結果として補聴効果を向上させることができる。ここで、上述の処理内容を他の実施形態として図を用いて説明する。 <Other embodiments>
In addition to the method described above, for example, a means for stimulating a tactile sensation is provided as a method for encouraging the listener to listen to sound. As a result, the hearing aid effect can be improved. Here, the above processing contents will be described as another embodiment with reference to the drawings.

図６は、他の実施形態を説明するための一構成例を示す図である。なお、図６において、上述した図１に示す補聴装置１０と略同様の処理を行う構成については、同一の名称と同一の番号を付するものとし、ここでの説明は省略する。 FIG. 6 is a diagram illustrating a configuration example for explaining another embodiment. In FIG. 6, the same name and the same number are assigned to a configuration that performs processing substantially similar to that of the hearing aid device 10 illustrated in FIG. 1 described above, and description thereof is omitted here.

図６に示す補聴装置６０は、音声入力部１１と、前置増幅部１２と、音声開始点検出部６１と、増幅率設定部１４と、適応増幅部１５と、話速・ホルマント制御関数設定部１６と、補聴音声生成部１７と、音声出力部１８と、触覚刺激部６２とを有するよう構成されている。 The hearing aid device 60 shown in FIG. 6 includes a voice input unit 11, a preamplifier 12, a voice start point detector 61, an amplification factor setting unit 14, an adaptive amplifier 15, and a speech speed / formant control function setting. Unit 16, hearing aid sound generation unit 17, sound output unit 18, and tactile stimulation unit 62.

ここで、上述の図１に示す補聴装置１０との主な相違部分である音声区間検出部６１及び触覚刺激部６２について説明すると、音声区間検出部６１は、上述の手法で検出した音声開始点及び音声区間の情報を増幅率設定部１４に出力すると共に、受聴者の触覚に刺激を与えるよう指示する指示制御信号を触覚刺激部６２に出力する。 Here, the audio section detection unit 61 and the tactile stimulation unit 62, which are the main differences from the hearing aid device 10 shown in FIG. 1 described above, will be described. The audio section detection unit 61 uses the voice start point detected by the above method. And the voice section information are output to the amplification factor setting unit 14, and an instruction control signal for instructing the listener to touch the tactile sense is output to the tactile stimulation unit 62.

なお、音声区間検出部６１は、音声区間の長さ等に応じて刺激の強さや時間を調整した制御信号を生成して触覚刺激部６２に出力する。例えば、音声区間の短い場合には刺激の強さを強く、また時間を短くし、音声区間の長い場合には、時間を長くするよう指示する制御信号を生成する。 The voice section detection unit 61 generates a control signal in which the intensity and time of the stimulus are adjusted according to the length of the voice section and the like, and outputs the control signal to the tactile stimulation unit 62. For example, when the voice interval is short, the strength of the stimulus is increased and the time is shortened, and when the voice interval is long, a control signal instructing to increase the time is generated.

触覚刺激部６２は、例えば特開２００２−４５７９０号公報に開示されている既存の手段によって振動を与えることができる。具体的には、コイルとコイル支持台にダンパを介して取り付けられる永久磁石により磁気回路を構成し、磁石を含む磁気回路自体を直接振動子として用いることにより、小型で幅広い周波数帯に対応した振動伝達装置を設けることで、これを使用者の身体の所要部位に押し当て、骨振動等を用いることで、振動呼び出し装置として用いることができる。したがって、触覚刺激部６２は、音声区間検出部６１から指示制御信号を取得すると、上述した制御に基づいて受聴者に信号を与える。 The tactile stimulation unit 62 can give vibration by existing means disclosed in, for example, Japanese Patent Application Laid-Open No. 2002-45790. Specifically, a magnetic circuit is configured by a permanent magnet attached to a coil and a coil support base via a damper, and the magnetic circuit including the magnet itself is directly used as a vibrator, so that the vibration corresponding to a small and wide frequency band can be obtained. By providing the transmission device, it can be used as a vibration calling device by pressing it against a required part of the user's body and using bone vibration or the like. Therefore, when the tactile stimulation unit 62 acquires the instruction control signal from the voice section detection unit 61, the tactile stimulation unit 62 gives a signal to the listener based on the control described above.

なお、触覚刺激部６２は、上述した手法に限定されず、例えばイヤホンの筐体に微弱電流を流す等の手法を用いることができる。これにより、触覚刺激部６２は、指示制御信号の指示内容に基づいて電流を流すことで、受聴者に対して刺激を与え、より快適に音声を聞き取り易くすることができる。具体的には、話が始まるタイミングを触覚への刺激により受聴者に知らせることで、音を聞くための心構えを喚起し、その結果として補聴効果を向上させることができる。 The tactile stimulation unit 62 is not limited to the above-described method, and for example, a method such as passing a weak current through the housing of the earphone can be used. As a result, the tactile stimulation unit 62 applies a current based on the instruction content of the instruction control signal, thereby stimulating the listener and making it easier to hear the sound more comfortably. Specifically, by informing the listener of the timing at which the talk starts by stimulating tactile sensation, it is possible to arouse the attitude to hear the sound, and as a result, the hearing aid effect can be improved.

また、触覚刺激部６２は、サイレンや踏切の信号音等の注意喚起音が、予め設定された音量以上で受聴された場合には、音声区間検出部６１により検出された音声開始点のタイミングで音量の増幅や触覚刺激の手段を用いて注意喚起することができる。 In addition, the tactile stimulation unit 62, at the timing of the voice start point detected by the voice section detection unit 61, when a warning sound such as a siren or a crossing signal is received at a preset volume or higher. Attention can be made by means of volume amplification or tactile stimulation.

上述の実施形態により、受聴者に対してより明確に警告することができ安全性を向上することができる。これにより、音量や音声速度を必要以上に調整しなくても、刺激を与えることで受聴者が良好に聞き取ることができる。 According to the above-described embodiment, a warning can be clearly given to the listener, and safety can be improved. Thereby, even if it does not adjust a sound volume and a sound speed more than necessary, a listener can hear well by giving a stimulus.

なお、その他の実施形態として、例えば、話速の制御を行い、更に触覚への刺激も行う等、２つの実施形態を組み合わせて適用することができる。また、上述の実施形態は、本発明の一例に過ぎず、各ブロックでの処理は、同様な目的を達成できるものであれば、いかなる手法を用いてもよい。 As other embodiments, for example, the two embodiments can be applied in combination, such as controlling the speech speed and further stimulating the sense of touch. Further, the above-described embodiment is merely an example of the present invention, and any method may be used for the processing in each block as long as the same object can be achieved.

また、上述した補聴装置は、耳に設置する補聴器の他にもテレビやパソコン等にも適用でき、番組等のコンテンツを視聴者に提供する際にも補聴手法を適用することができる。 In addition, the above-described hearing aid device can be applied to a TV, a personal computer, and the like in addition to a hearing aid installed in the ear, and a hearing aid technique can be applied when providing content such as a program to a viewer.

また、上述の補聴装置や、テレビやパソコン等により提供される場合には、上述した補聴手法を実行可能なプログラムを生成し、補聴装置自体やパソコン（補聴装置内の各ソフトウェアを制御するパソコンも含む）等にアプリケーションとしてインストールすることにより補聴プログラム等を実施することができる。 In addition, when provided by the above-described hearing aid device, a television, a personal computer, or the like, a program capable of executing the above-described hearing aid method is generated, and the hearing aid device itself or a personal computer (a personal computer that controls each software in the hearing aid device is also available). Etc.) can be implemented as an application to implement a hearing aid program or the like.

また、テレビやパソコン等により上述の各機能が提供される場合には、触覚刺激部６２に対応する処理として、ランプや映像中のフラッシュ画像等、視覚への刺激であってもよい。 Further, when the above-described functions are provided by a television, a personal computer, or the like, the process corresponding to the tactile stimulation unit 62 may be a visual stimulus such as a lamp or a flash image in a video.

＜補聴プログラム＞
ここで、上述した補聴装置１０，６０は、ＣＰＵ、ＲＡＭ等の揮発性の記憶媒体、ＲＯＭ等の不揮発性の記録媒体、マウスやキーボード、ポインティングデバイス等の入力装置、コンテンツを表示する表示手段、並びに外部と通信するためのインタフェースを備えたコンピュータによって構成される。 <Hearing aid program>
Here, the hearing aids 10 and 60 described above include a volatile storage medium such as a CPU and a RAM, a non-volatile recording medium such as a ROM, an input device such as a mouse, a keyboard, and a pointing device, a display means for displaying content, And a computer having an interface for communicating with the outside.

また、補聴装置１０，６０に備えた音声入力部１１、前置増幅部１２、音声開始点検出部１３，６１と、増幅率設定部１４、適応増幅部１５、話速・ホルマント制御関数設定部１６、補聴音声生成部１７、音声出力部１８、及び触覚刺激部６２における各機能は、これらの機能を記述したプログラムをＣＰＵに実行させることにより、それぞれ実現される。また、これらのプログラムは、磁気ディスク（フロッピィーディスク、ハードディスク等）、光ディスク（ＣＤ−ＲＯＭ、ＤＶＤ等）、半導体メモリ等の記録媒体に格納して頒布することもできる。 In addition, the voice input unit 11, the preamplifier 12, the voice start point detectors 13 and 61, the amplification factor setting unit 14, the adaptive amplification unit 15, and the speech speed / formant control function setting unit included in the hearing aid devices 10 and 60. 16, each function in the hearing aid sound generation unit 17, the sound output unit 18, and the tactile stimulation unit 62 is realized by causing the CPU to execute a program describing these functions. These programs can also be stored and distributed in a recording medium such as a magnetic disk (floppy disk, hard disk, etc.), optical disk (CD-ROM, DVD, etc.), semiconductor memory, or the like.

つまり、上述した各構成における処理をコンピュータに実行させるための実行プログラム（補聴プログラム）を生成し、例えば、汎用のパーソナルコンピュータやサーバ等にそのプログラムをインストールすることにより、補聴処理を実現することができる。 That is, it is possible to realize the hearing aid processing by generating an execution program (hearing aid program) for causing the computer to execute the processing in each configuration described above and installing the program in, for example, a general-purpose personal computer or server. it can.

＜ハードウェア構成＞
ここで、本発明における実行可能なコンピュータのハードウェア構成例について図を用いて説明する。図７は、本発明における補聴処理が実現可能なハードウェア構成の一例を示す図である。 <Hardware configuration>
Here, an example of a hardware configuration of an executable computer in the present invention will be described with reference to the drawings. FIG. 7 is a diagram showing an example of a hardware configuration capable of realizing the hearing aid processing in the present invention.

図７におけるコンピュータ本体には、入力装置７１と、出力装置７２と、ドライブ装置７３と、補助記憶装置７４と、メモリ装置７５と、各種制御を行うＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）７６と、ネットワーク接続装置７７とを有するよう構成されており、これらはシステムバスＢで相互に接続されている。 7 includes an input device 71, an output device 72, a drive device 73, an auxiliary storage device 74, a memory device 75, a CPU (Central Processing Unit) 76 for performing various controls, and a network connection device. 77, which are connected to each other by a system bus B.

入力装置７１は、使用者（受聴者、視聴者等）が操作するキーボード及びマウス等のポインティングデバイスやマイク等の音声入力デバイス等を有しており、使用者等からのプログラムの実行等、各種操作信号を入力する。 The input device 71 includes a keyboard and a pointing device such as a mouse operated by a user (listener, viewer, etc.), a voice input device such as a microphone, and the like. Input an operation signal.

出力装置７２は、本発明における処理を行うためのコンピュータ本体を操作するのに必要な各種ウィンドウやデータ等を表示するディスプレイや、音声を出力するスピーカ等を有し、ＣＰＵ７６が有する制御プログラムによりプログラムの実行経過や結果等を表示又は音声出力することができる。 The output device 72 has a display for displaying various windows and data necessary for operating the computer main body for performing processing in the present invention, a speaker for outputting sound, and the like, and is programmed by a control program of the CPU 76. It is possible to display or voice output the execution progress and results of the.

ここで、本発明において、コンピュータ本体にインストールされる実行プログラムは、例えばＣＤ−ＲＯＭ等の記録媒体７８等により提供される。プログラムを記録した記録媒体７８は、ドライブ装置７３にセット可能であり、記録媒体７８に含まれる実行プログラムが、記録媒体７８からドライブ装置７３を介して補助記憶装置７４にインストールされる。 In the present invention, the execution program installed in the computer main body is provided by a recording medium 78 such as a CD-ROM. The recording medium 78 on which the program is recorded can be set in the drive device 73, and the execution program included in the recording medium 78 is installed from the recording medium 78 to the auxiliary storage device 74 via the drive device 73.

補助記憶装置７４は、ハードディスク等のストレージ手段であり、本発明における実行プログラムや、コンピュータに設けられた制御プログラム等を蓄積し必要に応じて入出力を行うことができる。 The auxiliary storage device 74 is a storage means such as a hard disk, and can store an execution program according to the present invention, a control program provided in a computer, etc., and perform input / output as necessary.

メモリ装置７５は、ＣＰＵ７６により補助記憶装置７４から読み出された実行プログラム等を格納する。なお、メモリ装置７５は、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）やＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）等からなる。 The memory device 75 stores an execution program or the like read from the auxiliary storage device 74 by the CPU 76. The memory device 75 includes a ROM (Read Only Memory), a RAM (Random Access Memory), and the like.

ＣＰＵ７６は、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）等の制御プログラム、メモリ装置７５に格納されている実行プログラムに基づいて、各種演算や各ハードウェア構成部とのデータの入出力等、コンピュータ全体の処理を制御して各処理を実現することができる。また、ＣＰＵ７６は、プログラムの実行中に必要な各種情報を補助記憶装置７４から取得することができ、またＣＰＵ７６は、処理結果等を格納することもできる。 The CPU 76 controls processing of the entire computer, such as various operations and input / output of data with each hardware component, based on a control program such as OS (Operating System) and an execution program stored in the memory device 75. Each processing can be realized. Further, the CPU 76 can acquire various types of information necessary during execution of the program from the auxiliary storage device 74, and the CPU 76 can also store processing results and the like.

ネットワーク接続装置７７は、通信ネットワーク等と接続することにより、実行プログラムを通信ネットワークに接続されている他の端末等から取得したり、プログラムを実行することで得られた実行結果又は本発明における実行プログラム自体を他の端末等に提供することができる。 The network connection device 77 obtains an execution program from another terminal connected to the communication network by connecting to a communication network or the like, or an execution result obtained by executing the program or an execution in the present invention The program itself can be provided to other terminals.

上述したようなハードウェア構成により、特別な装置構成を必要とせず、低コストで効率的に補聴処理を実現することができる。また、プログラムをインストールすることにより、補聴処理を容易に実現することができる。 With the hardware configuration as described above, it is possible to efficiently realize hearing aid processing at low cost without requiring a special device configuration. In addition, by installing the program, hearing aid processing can be easily realized.

＜補聴プログラム＞
次に、本発明における実行プログラム（補聴プログラム）による補聴処理手順についてフローチャートを用いて説明する。 <Hearing aid program>
Next, a hearing aid processing procedure by the execution program (hearing aid program) in the present invention will be described with reference to a flowchart.

図８は、本発明における補聴処理手順の一例を示すフローチャートである。なお、図８では、触覚への刺激処理も行う例を示している。図８において、まず音声を入力し（Ｓ０１）、予め設定された増幅率で前置増幅を行う（Ｓ０２）。次に、上述した手法により音声開始点・音声区間を検出し（Ｓ０３）、検出された音声開始点・音声区間における増幅率を設定する関数を設定し（Ｓ０４）、設定された制御関数の曲線波形に適応させて音声区間の音声を増幅させる（Ｓ０５）。 FIG. 8 is a flowchart showing an example of a hearing aid processing procedure in the present invention. FIG. 8 shows an example in which tactile stimulation processing is also performed. In FIG. 8, first, voice is input (S01), and preamplification is performed at a preset amplification factor (S02). Next, the voice start point / voice section is detected by the above-described method (S03), a function for setting the amplification factor at the detected voice start point / voice section is set (S04), and the curve of the set control function is set. The voice in the voice section is amplified in accordance with the waveform (S05).

次に、話速・ホルマント制御関数を設定し（Ｓ０６）、設定した関数に適応させて話速・ホルマント制御を行う（Ｓ０７）。更に、使用者の触覚へ刺激を行い（Ｓ０８）、その後、Ｓ０７の処理にて制御された話速とホルマントに基づいて補聴制御された音声を出力する（Ｓ０９）。なお、Ｓ０８の処理は、省略することもできる。 Next, the speech speed / formant control function is set (S06), and the speech speed / formant control is performed according to the set function (S07). Further, the user's sense of touch is stimulated (S08), and then the hearing-controlled sound is output based on the speech speed and formant controlled in the process of S07 (S09). Note that the process of S08 can be omitted.

これにより、受聴者に対してより快適に音声を聞き取り易くすることができる。また、実行プログラムをコンピュータにインストールすることにより、容易に補聴処理を実現することができる。 Thereby, it is possible to make it easier for the listener to hear the sound more comfortably. Further, by installing the execution program in the computer, it is possible to easily realize the hearing aid process.

上述したように、本発明によれば、受聴者に対してより快適に音声を聞き取り易くすることができる。具体的には、話が始まるタイミングを受聴者に知らせることで、音を聞くための心構えを喚起し、その結果として補聴効果を向上させることができる。 As described above, according to the present invention, it is possible to make it easier for listeners to hear sound. Specifically, by informing the listener of the timing when the talk starts, it is possible to arouse the attitude to hear the sound, and as a result, the hearing aid effect can be improved.

つまり、音声区間を検出することにより、話し始めの部分を特定し、その部分の増幅率を上げた後、その後は徐々に増幅率を下げることにより、話し始めを受聴者に気付かせることができる。 In other words, it is possible to identify the beginning of the speech by detecting the voice section, increase the amplification factor of that portion, and then gradually lower the amplification factor to make the listener aware of the beginning of the speech. .

また、触覚への刺激も利用することで、より確実に、話し始めを気付かせることが可能となる。更に、話し始めの部分の話速を遅くしたり、ホルマント周波数の変化を強調したりすることにより、話し始めの聞き取りを向上させることができる。 In addition, by using a tactile stimulus, it becomes possible to make the user start to speak more reliably. Furthermore, listening at the beginning of speaking can be improved by slowing down the speaking speed at the beginning of speaking or by emphasizing changes in formant frequency.

したがって、従来の補聴器の問題である、大きい音はうるさく感じ、小さい音は聞き取りにくいという問題が改善される。その結果、補聴器利用者の聞き取り能力が向上し、お年寄りや難聴者等のコミュニケーションが促進される。 Therefore, the problem of the conventional hearing aid, that is, the loud sound feels loud and the small sound is difficult to hear is improved. As a result, hearing ability of hearing aid users is improved, and communication between the elderly and the hearing impaired is promoted.

以上本発明の好ましい実施形態について詳述したが、本発明は係る特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形、変更が可能である。 Although the preferred embodiment of the present invention has been described in detail above, the present invention is not limited to the specific embodiment, and various modifications, within the scope of the gist of the present invention described in the claims, It can be changed.

本発明における補聴装置の一構成例を示す図である。It is a figure which shows the example of 1 structure of the hearing aid apparatus in this invention. 本実施形態における音声区間の検出方法の一例を示す図である。It is a figure which shows an example of the detection method of the audio | voice area in this embodiment. 増幅率を制御するための条件の一例を説明するための図である。It is a figure for demonstrating an example of the conditions for controlling an amplification factor. 話速制御用の関数の一例を示す図である。It is a figure which shows an example of the function for speech speed control. ホルマント制御用の関数の一例を示す図である。It is a figure which shows an example of the function for formant control. 他の実施形態を説明するための一構成例を示す図である。It is a figure which shows the example of 1 structure for demonstrating other embodiment. 本発明における補聴処理が実現可能なハードウェア構成の一例を示す図である。It is a figure which shows an example of the hardware constitutions which can implement | achieve the hearing aid process in this invention. 本発明における補聴処理手順の一例を示すフローチャートである。It is a flowchart which shows an example of the hearing aid process sequence in this invention.

Explanation of symbols

１０，６０補聴装置
１１音声入力部
１２前置増幅部
１３，６１音声開始点検出部
１４増幅率設定部
１５適応増幅部
１６話速・ホルマント制御関数設定部
１７補聴音声生成部
１８音声出力部
２１音声波形
３１，４１，５１曲線波形
６２触覚刺激部
７１入力装置
７２出力装置
７３ドライブ装置
７４補助記憶装置
７５メモリ装置
７６ＣＰＵ
７７ネットワーク接続装置
７８記録媒体 DESCRIPTION OF SYMBOLS 10,60 Hearing aid 11 Audio | voice input part 12 Preamplifier 13, 61 Voice start point detection part 14 Amplification factor setting part 15 Adaptive amplification part 16 Speech speed / formant control function setting part 17 Hearing aid voice generation part 18 Voice output part 21 Audio waveform 31, 41, 51 Curve waveform 62 Tactile stimulation unit 71 Input device 72 Output device 73 Drive device 74 Auxiliary storage device 75 Memory device 76 CPU
77 Network connection device 78 Recording medium

Claims

In a hearing aid device that outputs a sound by emphasizing a predetermined section of the sound to be heard by the listener,
An audio input unit for inputting audio to be heard by the listener;
A voice section detecting section for detecting a voice section and a voice start point to be emphasized from the voice input by the voice input section;
With reference to the voice start point detected by the voice section detection unit, a hearing aid sound generation unit that generates a hearing aid sound by controlling speech speed and formant;
Have a sound output unit for outputting a sound that is hearing controlled generated by the hearing sound generating unit,
The voice section detection unit sets in advance a threshold value of power of two voices, an upper limit value and a lower limit value, and uses the time when the voice power input by the voice input unit exceeds the upper limit value as a reference time. The axis is moved in the reverse direction, and the time when it first falls below the lower limit value is detected as the voice start point, and the time from when the lower limit value is exceeded to the time when the upper limit value is exceeded is used as the voice. A hearing aid characterized by detecting as a section .

The hearing aid sound generation unit
The sound output by the speech start point and amplified to a predetermined amplification factor, then according to claim 1, characterized in that reducing the gain in correspondence with the preset control functions in the speech segment Hearing aids.

The hearing aid sound generation unit
The speech speed output at the voice start point is lowered to a predetermined speed, and thereafter, the speech speed is made close to the normal speed in correspondence with a control function set in advance in the voice section. The hearing aid device according to 1 or 2 .

The hearing aid sound generation unit
Emphasizing the change in the formant frequency of the speech to be output by the speech start point, claims 1 to 3 thereafter is characterized by reducing the emphasis in association with the preset control functions in the speech segment The hearing aid device according to any one of the above.

A hearing instrument according to any one of claims 1 to 4, characterized in that it has a tactile stimulation unit to stimulate tactile to the listener by the speech start point.