JP2012169781A

JP2012169781A - Speech processing device and method, and program

Info

Publication number: JP2012169781A
Application number: JP2011027781A
Authority: JP
Inventors: Masayoshi Noguchi; 雅義野口
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2011-02-10
Filing date: 2011-02-10
Publication date: 2012-09-06

Abstract

PROBLEM TO BE SOLVED: To provide a speech processing device and method, and a program, capable of balancing between a centralized localization signal such as voice dominant signal and a presence sound before outputting the sound.SOLUTION: In a speech processing device, a center balance correction unit 32 and a gain control signal generation unit 33 produce a gain control signal Gv based on the average level of voice component signals and the average level of presence sound component signals. A presence sound balance correction unit 35 and a gain control signal generation unit 36 produce a gain control signal Gy based on the average level of voice component signals and the average level of presence sound component signals. A presence sound level correction gain generation unit 37 corrects the gain control signal Gy to produce a gain control signal Gs. A variable gain amplifier 24 performs a gain control over a voice dominant signal Sv using the gain control signal Gs. Variable amplifiers 25 and 26 perform a gain control over a presence sound component signal using the gain control signal Gv. The present invention can be applied for television receivers.

Description

本発明は、音声処理装置および方法、並びにプログラムに関し、特に、入力音声信号のうち、集中定位信号と臨場音とのバランスを調整して、集中定位信号と臨場音とをバランスよく出力できるようにした音声処理装置および方法、並びにプログラムに関する。 The present invention relates to an audio processing apparatus and method, and a program, and more particularly, to adjust the balance between a centralized localization signal and a real sound in an input audio signal so that the centralized localization signal and the real sound can be output in a balanced manner. The present invention relates to a voice processing apparatus and method, and a program.

テレビジョン受像機で受信する放送チャンネルを切り替えたときや、ＡＶ（Audio−Visual）システムにおいて、ＡＶセンタで複数の入力機器の切り替えがなされたとき、コンテンツ間のレベル差により、出力音量に大きなが変化が生じてしてしまうことがある。 When a broadcast channel received by a television receiver is switched or when a plurality of input devices are switched at an AV center in an AV (Audio-Visual) system, the output volume may be large due to a level difference between contents. Changes can occur.

このような場合、ユーザは、自分が好みの音量にするためには、ボリューム操作をして音量調節する必要があり、わずらわしさを感じる場合がある。 In such a case, the user needs to adjust the volume by performing a volume operation in order to obtain his preferred volume, and may feel annoying.

また、同一コンテンツ内（例えば、同一の放送チャンネル内や同一の放送番組内）においても、コマーシャル（ＣＭ）部分やシーンの変化によって、出力音量が変化し、不快に思うことがある。 Also, even within the same content (for example, within the same broadcast channel or within the same broadcast program), the output volume may change due to changes in the commercial (CM) part or scene, which may be uncomfortable.

この問題を解決する音量補正方式が従来から種々提案されている。その一例のＡＧＣ（Auto Gain Control；自動利得制御）による音量制御方式が広く知られている。 Various sound volume correction methods for solving this problem have been proposed. A volume control method using AGC (Auto Gain Control) as an example is widely known.

しかしながら、入力音声信号のレベルが大きくレベル変動した場合に、そのレベル変化点での音声信号ゲインの急激な変化を完全に抑えることは困難であり、前記レベル変化点で出力音声音量レベルが揺れるなど、聴取者に、聴感上、違和感を与える場合がある。 However, when the level of the input audio signal greatly fluctuates, it is difficult to completely suppress a sudden change in the audio signal gain at the level change point, and the output audio volume level fluctuates at the level change point. , The listener may feel uncomfortable in terms of hearing.

特に、従来の音量補正方式では、音声信号全体を一律に同様にゲイン制御する方式であるため、前記の急激な変化点での音量レベルの揺れに対する違和感が目立つと言う問題があった。 In particular, the conventional sound volume correction method is a method in which the entire audio signal is equally controlled in the same manner, and thus there is a problem that the uncomfortable feeling with respect to the fluctuation of the sound volume level at the sudden change point is conspicuous.

そこで、音声信号の成分のうち、例えば、センター集中定位信号などの平均レベルを求めて、出力信号が平均レベルとなるようにゲイン制御する技術が提案されている（引用文献１参照）。 In view of this, a technique has been proposed in which, for example, an average level of a centralized localization signal or the like is obtained from the audio signal components, and gain control is performed so that the output signal becomes the average level (see cited document 1).

特開２０１０−１３６１７３号公報JP 2010-136173 A

しかしながら、特許文献１の技術においては、センター集中定位信号が、人物の声などの場合、いかなる状態でも平均レベルとされることで、声は一定のレベルに保たれるが、臨場音とセンター集中定位信号とのゲイン制御のレベルのバランスを補正させることができなかった。 However, in the technique of Patent Document 1, when the center concentration localization signal is a person's voice or the like, the average level is maintained in any state, so that the voice is maintained at a constant level. The balance of the gain control level with the localization signal could not be corrected.

本発明はこのような状況に鑑みてなされたものであり、特に、声主体信号のような集中定位信号と臨場音とのバランスを調整して出力できるようにするものである。 The present invention has been made in view of such a situation, and in particular, can adjust and output a balance between a concentrated localization signal such as a voice main signal and a real sound.

本技術の一側面の音声処理装置は、複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、ゲイン制御して出力する第１成分主体信号ゲイン制御部と、前記入力音声信号のうち、前記第１成分以外の、その他成分信号を、ゲイン制御して出力するその他成分信号ゲイン制御部と、前記第１成分信号の平均レベルを検出する第１成分信号平均レベル検出部と、前記その他成分信号の平均レベルを検出するその他成分信号平均レベル検出部と、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記第１成分主体信号をゲイン制御する第１成分ゲイン制御信号を出力する第１成分主体信号ゲイン制御信号出力部と、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記その他成分信号をゲイン制御するその他成分ゲイン制御信号を出力するその他成分信号ゲイン制御信号出力部とを含み、前記第１成分主体信号ゲイン制御部は、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、前記第１成分主体信号ゲイン制御信号に基づいて、ゲイン制御して第１成分出力信号として出力し、前記その他成分信号ゲイン制御部は、前記第１成分以外の、その他成分信号を、その他成分信号ゲイン制御信号に基づいて、ゲイン制御してその他成分出力信号として出力する。 An audio processing device according to an aspect of the present technology performs gain control to output a first component main signal having a part of the plurality of audio components as a main component among input audio signals including a plurality of audio components. A one-component main signal gain control unit, another component signal gain control unit that performs gain control on other component signals other than the first component in the input audio signal, and an average level of the first component signal A first component signal average level detection unit for detecting the other component signal average level detection unit for detecting an average level of the other component signal, an average level of the first component signal, and the other component signal average level. Based on a first component main signal gain control signal output unit for outputting a first component gain control signal for gain control of the first component main signal, an average level of the first component signal, and The other component signal gain control signal output unit for outputting the other component gain control signal for gain control of the other component signal based on the other component signal average level, and the first component main signal gain control unit includes: Of the input audio signals composed of the plurality of audio components, gain control is performed on a first component main signal whose main component is a part of the plurality of audio components based on the first component main signal gain control signal. The other component signal gain control unit outputs the other component signal other than the first component as the other component output signal by performing gain control based on the other component signal gain control signal. To do.

前記第１成分主体信号ゲイン制御信号出力部には、前記第１成分信号の平均レベルに、前記その他成分信号平均レベルを第１の所定の増幅率で増倍した信号を加算して補正する第１信号成分平均レベル補正部と、前記第１成分出力信号が、前記第１信号成分平均レベル補正部により補正された前記第１成分信号の平均レベルとなるように、前記第１成分信号ゲイン制御信号を生成する第１成分信号ゲイン制御信号生成部とを含ませるようにすることができ、前記その他成分信号ゲイン制御信号出力部には、前記第１成分信号の平均レベルに、前記その他成分信号平均レベルを第２の所定の増幅率で増倍した信号を加算して補正する第１信号成分平均レベル補正部と、前記その他成分出力信号が、前記第１信号成分平均レベル補正部により補正された前記第１成分信号の平均レベルとなるように、前記その他成分信号ゲイン制御信号を生成するその他成分信号ゲイン制御信号生成部とを含ませるようにすることができる。 The first component main signal gain control signal output unit corrects by adding a signal obtained by multiplying the average level of the other component signal by a first predetermined amplification factor to the average level of the first component signal. The first component signal gain control so that the one signal component average level correction unit and the first component output signal have the average level of the first component signal corrected by the first signal component average level correction unit. A first component signal gain control signal generating unit that generates a signal, and the other component signal gain control signal output unit includes the other component signal at an average level of the first component signal. A first signal component average level correction unit that corrects by adding a signal obtained by multiplying an average level by a second predetermined amplification factor, and the other component output signal is corrected by the first signal component average level correction unit. The to an average level of the first component signal, can be made to include the other component signal gain control signal generator for generating said other component signal gain control signal.

前記第１成分主体信号ゲイン制御信号出力部には、前記第１成分出力信号が、前記第１信号成分平均レベルとなるように、第１成分信号ゲイン制御信号を生成する第１成分ゲイン制御信号生成部と、前記その他成分信号平均レベルに対応付けて設定される第１の増幅率で増倍して、前記第１成分信号ゲイン制御信号を補正し、補正した前記第１成分信号ゲイン制御信号を出力する第１信号成分平均レベル補正部とを含ませるようにすることができ、前記その他成分信号ゲイン制御信号出力部には、前記その他成分出力信号が、前記第１信号成分平均レベルとなるように、その他成分信号ゲイン制御信号を生成するその他成分ゲイン制御信号生成部と、前記その他成分信号平均レベルに対応付けて設定される第２の増幅率で増倍して、前記その他成分信号ゲイン制御信号を補正し、補正した前記その他成分信号ゲイン制御信号を出力するその他信号成分平均レベル補正部とを含ませるようにすることができる。 The first component main signal gain control signal output section generates a first component gain control signal for generating a first component signal gain control signal so that the first component output signal is at the first signal component average level. The first component signal gain control signal is corrected by correcting the first component signal gain control signal by multiplying by a generation unit and a first amplification factor set in association with the average level of the other component signals The other component signal gain control signal output unit outputs the other component output signal at the first signal component average level. As described above, the other component gain control signal generation unit for generating the other component signal gain control signal, the second amplification factor set in association with the other component signal average level, It can be corrected component signal gain control signal, so as to include the other signal components mean level correction unit which outputs the other component signal gain control signal corrected.

第１成分ゲイン制御信号生成部と、前記その他成分信号ゲイン制御信号出力部とは、共通とすることができ、第１成分ゲイン制御信号生成部により生成される第１成分ゲイン制御信号と、前記その他成分信号ゲイン制御信号出力部により生成されるその他成分信号ゲイン制御信号とは、同一とすることができる。 The first component gain control signal generation unit and the other component signal gain control signal output unit can be common, the first component gain control signal generated by the first component gain control signal generation unit, The other component signal gain control signal generated by the other component signal gain control signal output unit can be the same.

前記その他成分信号ゲイン制御信号出力部より出力される、前記その他成分信号ゲイン制御信号を補正するその他信号成分信号ゲイン制御信号補正部を含ませるようにすることができる。 An other signal component signal gain control signal correction unit that corrects the other component signal gain control signal output from the other component signal gain control signal output unit can be included.

前記第１成分信号は、センター集中定位信号とすることができる。 The first component signal may be a center concentrated localization signal.

前記第１成分出力信号と、前記その他成分出力信号とを加算した加算出力信号を、音量補正後の音声出力信号とする加算手段をさらに含ませるようにすることができる。 It is possible to further include addition means for adding the output signal obtained by adding the first component output signal and the other component output signal to the sound output signal after the volume correction.

本技術の一側面の音声処理方法は、複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、ゲイン制御して出力する第１成分主体信号ゲイン制御部と、前記入力音声信号のうち、前記第１成分以外の、その他成分信号を、ゲイン制御して出力するその他成分信号ゲイン制御部と、前記第１成分信号の平均レベルを検出する第１成分信号平均レベル検出部と、前記その他成分信号の平均レベルを検出するその他成分信号平均レベル検出部と、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記第１成分主体信号をゲイン制御する第１成分ゲイン制御信号を出力する第１成分主体信号ゲイン制御信号出力部と、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記その他成分信号をゲイン制御するその他成分ゲイン制御信号を出力するその他成分信号ゲイン制御信号出力部とを含み、前記第１成分主体信号ゲイン制御部は、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、前記第１成分主体信号ゲイン制御信号に基づいて、ゲイン制御して第１成分出力信号として出力し、前記その他成分信号ゲイン制御部は、前記第１成分以外の、その他成分信号を、その他成分信号ゲイン制御信号に基づいて、ゲイン制御してその他成分出力信号として出力する音声処理装置における音声処理方法であって、前記第１成分主体信号ゲイン制御部における、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、ゲイン制御して出力する第１成分主体信号ゲイン制御ステップと、前記その他成分信号ゲイン制御部における、前記入力音声信号のうち、前記第１成分以外の、その他成分信号を、ゲイン制御して出力するその他成分信号ゲイン制御ステップと、前記第１成分信号平均レベル検出部における、前記第１成分信号の平均レベルを検出する第１成分信号平均レベル検出ステップと、前記その他成分信号平均レベル検出部における、前記その他成分信号の平均レベルを検出するその他成分信号平均レベル検出ステップと、前記第１成分主体信号ゲイン制御信号出力部における、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記第１成分主体信号をゲイン制御する第１成分ゲイン制御信号を出力する第１成分主体信号ゲイン制御信号出力ステップと、前記その他成分信号ゲイン制御信号出力部における、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記その他成分信号をゲイン制御するその他成分ゲイン制御信号を出力するその他成分信号ゲイン制御信号出力ステップとを含み、前記第１成分主体信号ゲイン制御ステップの処理は、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、前記第１成分主体信号ゲイン制御信号に基づいて、ゲイン制御して第１成分出力信号として出力し、前記その他成分信号ゲイン制御ステップの処理は、前記第１成分以外の、その他成分信号を、その他成分信号ゲイン制御信号に基づいて、ゲイン制御してその他成分出力信号として出力する。 According to an audio processing method of one aspect of the present technology, a first component main signal mainly including a part of the plurality of audio components among input audio signals including a plurality of audio components is output by performing gain control. A one-component main signal gain control unit, another component signal gain control unit that performs gain control on other component signals other than the first component in the input audio signal, and an average level of the first component signal A first component signal average level detection unit for detecting the other component signal average level detection unit for detecting an average level of the other component signal, an average level of the first component signal, and the other component signal average level. Based on a first component main signal gain control signal output unit for outputting a first component gain control signal for gain control of the first component main signal, an average level of the first component signal, and The other component signal gain control signal output unit for outputting the other component gain control signal for gain control of the other component signal based on the other component signal average level, and the first component main signal gain control unit includes: Of the input audio signals composed of the plurality of audio components, gain control is performed on a first component main signal whose main component is a part of the plurality of audio components based on the first component main signal gain control signal. The other component signal gain control unit outputs the other component signal other than the first component as the other component output signal by performing gain control based on the other component signal gain control signal. An audio processing method in the audio processing device for performing the input audio signal comprising the plurality of audio components in the first component main signal gain control unit A first component main signal gain control step of gain-controlling and outputting a first component main signal whose main component is a part of the plurality of audio components; and the input audio in the other component signal gain control unit Of the signals, the other component signal gain control step for gain-controlling and outputting other component signals other than the first component, and the average level of the first component signal in the first component signal average level detection unit A first component signal average level detecting step for detecting; an other component signal average level detecting step for detecting an average level of the other component signal in the other component signal average level detecting unit; and the first component main signal gain control signal. Based on the average level of the first component signal and the average level of the other component signal in the output unit, the first component main A first component main signal gain control signal output step for outputting a first component gain control signal for gain control of the body signal, an average level of the first component signal in the other component signal gain control signal output unit, and the other A component signal gain control signal output step of outputting another component gain control signal for gain control of the other component signal based on the component signal average level, and the processing of the first component main signal gain control step includes: Of the input audio signals composed of the plurality of audio components, gain control is performed on a first component main signal whose main component is a part of the plurality of audio components based on the first component main signal gain control signal. The first component output signal is output, and the processing of the other component signal gain control step is the other component signal other than the first component. , Based on other component signal gain control signal, and outputs it as the other component output signal and gain control.

本技術の一側面のプログラムは、複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、ゲイン制御して出力する第１成分主体信号ゲイン制御部と、前記入力音声信号のうち、前記第１成分以外の、その他成分信号を、ゲイン制御して出力するその他成分信号ゲイン制御部と、前記第１成分信号の平均レベルを検出する第１成分信号平均レベル検出部と、前記その他成分信号の平均レベルを検出するその他成分信号平均レベル検出部と、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記第１成分主体信号をゲイン制御する第１成分ゲイン制御信号を出力する第１成分主体信号ゲイン制御信号出力部と、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記その他成分信号をゲイン制御するその他成分ゲイン制御信号を出力するその他成分信号ゲイン制御信号出力部とを含み、前記第１成分主体信号ゲイン制御部は、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、前記第１成分主体信号ゲイン制御信号に基づいて、ゲイン制御して第１成分出力信号として出力し、前記その他成分信号ゲイン制御部は、前記第１成分以外の、その他成分信号を、その他成分信号ゲイン制御信号に基づいて、ゲイン制御してその他成分出力信号として出力する音声処理装置を制御するコンピュータに、前記第１成分主体信号ゲイン制御部における、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、ゲイン制御して出力する第１成分主体信号ゲイン制御ステップと、前記その他成分信号ゲイン制御部における、前記入力音声信号のうち、前記第１成分以外の、その他成分信号を、ゲイン制御して出力するその他成分信号ゲイン制御ステップと、前記第１成分信号平均レベル検出部における、前記第１成分信号の平均レベルを検出する第１成分信号平均レベル検出ステップと、前記その他成分信号平均レベル検出部における、前記その他成分信号の平均レベルを検出するその他成分信号平均レベル検出ステップと、前記第１成分主体信号ゲイン制御信号出力部における、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記第１成分主体信号をゲイン制御する第１成分ゲイン制御信号を出力する第１成分主体信号ゲイン制御信号出力ステップと、前記その他成分信号ゲイン制御信号出力部における、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記その他成分信号をゲイン制御するその他成分ゲイン制御信号を出力するその他成分信号ゲイン制御信号出力ステップとを含む処理を実行させ、前記第１成分主体信号ゲイン制御ステップの処理は、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号を、前記第１成分主体信号ゲイン制御信号に基づいて、ゲイン制御して第１成分出力信号として出力し、前記その他成分信号ゲイン制御ステップの処理は、前記第１成分以外の、その他成分信号を、その他成分信号ゲイン制御信号に基づいて、ゲイン制御してその他成分出力信号として出力する。 A program according to an aspect of the present technology is a first component that performs gain control and outputs a first component main signal that mainly includes a part of the plurality of audio components, among input audio signals including a plurality of audio components. A main signal gain control unit, an other component signal gain control unit that outputs the other component signals other than the first component in the input audio signal by performing gain control, and an average level of the first component signal is detected. Based on the first component signal average level detecting unit, the other component signal average level detecting unit for detecting the average level of the other component signal, the average level of the first component signal, and the other component signal average level. A first component main signal gain control signal output unit for outputting a first component gain control signal for gain control of the first component main signal; an average level of the first component signal; The other component signal gain control signal output unit for outputting the other component gain control signal for gain control of the other component signal based on the other component signal average level, the first component main signal gain control unit, A first component main signal having a part of the plurality of audio components as a main component among input audio signals composed of a plurality of audio components is gain-controlled based on the first component main signal gain control signal. The other component signal gain control unit outputs the other component signal other than the first component as the other component output signal by performing gain control based on the other component signal gain control signal. Among the input audio signals composed of the plurality of audio components in the first component main signal gain controller in the computer that controls the audio processing device, A first component main signal gain control step for gain-controlling and outputting a first component main signal whose main component is a part of the plurality of audio components; and the other component signal gain control unit Among them, the other component signal gain control step for gain-controlling and outputting other component signals other than the first component, and detecting the average level of the first component signal in the first component signal average level detecting unit. A first component signal average level detection step; an other component signal average level detection step for detecting an average level of the other component signal in the other component signal average level detection unit; and the first component main signal gain control signal output unit. , The first component main signal based on the average level of the first component signal and the average level of the other component signal A first component main signal gain control signal output step for outputting a first component gain control signal for gain control, an average level of the first component signal in the other component signal gain control signal output unit, and the other component signal Processing of the first component main signal gain control step is executed by executing a process including an other component gain control signal output step of outputting an other component gain control signal for gain control of the other component signal based on the average level. Is a first component main signal having a part of the plurality of audio components as a main component of the input audio signal composed of the plurality of audio components based on the first component main signal gain control signal. And output as a first component output signal, and the processing of the other component signal gain control step is other than the first component. Minute signal, based on other component signal gain control signal, and outputs it as the other component output signal and gain control.

本技術の電子機器は、請求項１乃至７のいずれかの音声処理装置を含む。 An electronic device of the present technology includes the audio processing device according to any one of claims 1 to 7.

本技術の一側面においては、複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号が、ゲイン制御されて出力され、前記入力音声信号のうち、前記第１成分以外の、その他成分信号が、ゲイン制御されて出力され、前記第１成分信号の平均レベルが検出され、前記その他成分信号の平均レベルが検出され、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記第１成分主体信号をゲイン制御する第１成分ゲイン制御信号が出力され、前記第１成分信号の平均レベルと、前記その他成分信号平均レベルとに基づいて、前記その他成分信号をゲイン制御するその他成分ゲイン制御信号が出力され、前記複数の音声成分からなる入力音声信号のうち、前記複数の音声成分の一部を主たる成分とする第１成分主体信号が、前記第１成分主体信号ゲイン制御信号に基づいて、ゲイン制御されて第１成分出力信号として出力され、前記第１成分以外の、その他成分信号が、その他成分信号ゲイン制御信号に基づいて、ゲイン制御されてその他成分出力信号として出力される。 In one aspect of the present technology, a first component main signal including a part of the plurality of sound components as a main component among input sound signals including a plurality of sound components is output with gain control, and the input sound Among the signals, other component signals other than the first component are output after gain control, the average level of the first component signal is detected, the average level of the other component signal is detected, and the first component is detected. A first component gain control signal for gain control of the first component main signal is output based on an average level of the signal and the other component signal average level, and the average level of the first component signal and the other component Based on the average signal level, an other component gain control signal for gain control of the other component signal is output, and among the input audio signals composed of the plurality of audio components, A first component main signal whose main component is a part of the audio component is gain-controlled based on the first component main signal gain control signal and is output as a first component output signal. The other component signal is gain-controlled based on the other component signal gain control signal and output as the other component output signal.

本発明の音声処理装置は、独立した装置であっても良いし、音声処理を行うブロックであっても良い。 The voice processing apparatus of the present invention may be an independent apparatus or a block that performs voice processing.

本技術の一側面によれば、入力音声信号のうち、センター集中定位信号を含む成分と、それ以外の成分とのバランスを調整して音声補正することが可能となる。 According to one aspect of the present technology, it is possible to perform sound correction by adjusting a balance between a component including the center concentration localization signal and other components in the input sound signal.

この技術による音量補正装置が適用される電子機器の例を説明するためのブロック図である。It is a block diagram for demonstrating the example of the electronic device to which the volume correction apparatus by this technique is applied. この技術による音量補正装置の第１の実施形態を説明するためのブロック図である。It is a block diagram for demonstrating 1st Embodiment of the volume correction apparatus by this technique. 図１の実施形態におけるセンター集中定位信号生成部の構成例を示すブロック図である。It is a block diagram which shows the structural example of the center concentration localization signal generation part in embodiment of FIG. 図１の実施形態におけるセンター集中定位信号生成部の他の構成例を示すブロック図である。It is a block diagram which shows the other structural example of the center concentration localization signal generation part in embodiment of FIG. 図４の例のセンター集中定位信号生成部の一部の構成例を説明するためのブロック図である。FIG. 5 is a block diagram for explaining a configuration example of a part of a center concentration localization signal generation unit in the example of FIG. 4. 図５の構成例の各部を説明するために用いる図である。FIG. 6 is a diagram used for explaining each part of the configuration example of FIG. 5. 図５の構成例の各部を説明するために用いる図である。FIG. 6 is a diagram used for explaining each part of the configuration example of FIG. 5. 図５の構成例の各部を説明するために用いる図である。FIG. 6 is a diagram used for explaining each part of the configuration example of FIG. 5. 図５の構成例の各部を説明するために用いる図である。FIG. 6 is a diagram used for explaining each part of the configuration example of FIG. 5. 図５の構成例の各部を説明するために用いる図である。FIG. 6 is a diagram used for explaining each part of the configuration example of FIG. 5. 図５の構成例の各部を説明するために用いる図である。FIG. 6 is a diagram used for explaining each part of the configuration example of FIG. 5. 図５の構成例の各部を説明するために用いる図である。FIG. 6 is a diagram used for explaining each part of the configuration example of FIG. 5. 図２の臨場音平均レベル検出部の構成例を説明するために用いる図である。It is a figure used in order to demonstrate the example of a structure of the realistic sound average level detection part of FIG. 図２のセンターバランス補正部の構成例を説明するために用いる図である。It is a figure used in order to demonstrate the example of a structure of the center balance correction | amendment part of FIG. 図２の臨場音バランス補正部の構成例を説明するために用いる図である。It is a figure used in order to demonstrate the example of a structure of the realistic sound balance correction | amendment part of FIG. 図２の臨場音レベル補正ゲイン生成部の構成例を説明するために用いる図である。It is a figure used in order to demonstrate the example of a structure of the realistic sound level correction gain production | generation part of FIG. 図２の臨場音レベル補正ゲイン生成部の構成例を説明するために用いる図である。It is a figure used in order to demonstrate the example of a structure of the realistic sound level correction gain production | generation part of FIG. 図２の音量補正装置による音量補正処理を説明するために用いるフローチャートである。3 is a flowchart used for explaining volume correction processing by the volume correction apparatus of FIG. 2. 図２の音量補正装置による臨場音レベル補正ゲイン制御信号生成処理を説明するために用いるフローチャートである。FIG. 3 is a flowchart used to explain a live sound level correction gain control signal generation process by the volume correction apparatus of FIG. 2. FIG. この技術による音量補正装置の第２の実施形態を説明するためのブロック図である。It is a block diagram for demonstrating 2nd Embodiment of the volume correction apparatus by this technique. 図２０のバランス補正ゲイン生成部の動作を説明するために用いる図である。It is a figure used in order to demonstrate operation | movement of the balance correction gain production | generation part of FIG. 図２０の臨場音平均レベル検出部の構成例を説明するために用いる図である。It is a figure used in order to demonstrate the example of a structure of the realistic sound average level detection part of FIG. 図２０のセンターバランス補正部の構成例を説明するために用いる図である。It is a figure used in order to demonstrate the example of a structure of the center balance correction | amendment part of FIG. 図２０の音量補正装置による音量補正処理を説明するために用いるフローチャートである。FIG. 21 is a flowchart used for explaining volume correction processing by the volume correction apparatus of FIG. 20. FIG. この技術による音量補正装置の第２の実施形態の変形例を説明するためのブロック図である。It is a block diagram for demonstrating the modification of 2nd Embodiment of the volume correction apparatus by this technique. この技術による音量補正装置の第３の実施形態を説明するためのブロック図である。It is a block diagram for demonstrating 3rd Embodiment of the volume correction apparatus by this technique. 図２６の音量補正装置による音量補正処理を説明するために用いるフローチャートである。It is a flowchart used in order to demonstrate the volume correction process by the volume correction apparatus of FIG. この技術による音量補正装置の第３の実施形態を説明するためのブロック図である。It is a block diagram for demonstrating 3rd Embodiment of the volume correction apparatus by this technique. 図２８の音量補正装置による音量補正処理を説明するために用いるフローチャートである。It is a flowchart used in order to demonstrate the volume correction process by the volume correction apparatus of FIG. 汎用のパーソナルコンピュータの構成例を説明する図である。And FIG. 11 is a diagram illustrating a configuration example of a general-purpose personal computer.

以下、発明を実施するための形態（以下、実施の形態という）について説明する。なお、説明は以下の順序で行う。
１．第１の実施の形態（ゲイン制御信号を生成する前にバランス補正する場合の一例）
２．第２の実施の形態（ゲイン制御信号を生成した後でバランス補正する場合の一例）
３．変形例
４．第３の実施の形態（第１の実施の形態で臨場音レベル補正ゲイン生成部を削除する場合の一例）
５．第４の実施の形態（第２の実施の形態で臨場音レベル補正ゲイン生成部を削除する場合の一例） Hereinafter, modes for carrying out the invention (hereinafter referred to as embodiments) will be described. The description will be given in the following order.
1. First embodiment (an example in the case of performing balance correction before generating a gain control signal)
2. Second Embodiment (an example of balance correction after generating a gain control signal)
3. Modification 4 Third embodiment (an example in the case where the live sound level correction gain generation unit is deleted in the first embodiment)
5. Fourth embodiment (an example in the case where the live sound level correction gain generation unit is deleted in the second embodiment)

＜１．第１の実施の形態＞
［テレビジョン受像機の構成例］
以下、本技術による音声処理装置である音量補正装置の実施形態を、図面を参照しながら説明する。尚、以下に説明する音量補正装置の実施形態は、テレビジョン受像機の音声出力部に用いる場合を示している。ただし、音量補正装置の実施形態は、これに限るものではなく、音声を出力する装置であれば、その他の装置であっても適用することができるものである。 <1. First Embodiment>
[Example configuration of a television receiver]
Hereinafter, an embodiment of a sound volume correction device which is a sound processing device according to the present technology will be described with reference to the drawings. The embodiment of the volume correction device described below shows a case where the volume correction device is used for an audio output unit of a television receiver. However, the embodiment of the sound volume correction device is not limited to this, and can be applied to other devices as long as they output sound.

すなわち、図１は、テレビジョン受像機の構成例を示すブロック図である。この図１の例のテレビジョン受像機は、マイクロコンピュータを具備して構成される制御部１０を備える。この制御部１０には、リモコン受信部１１が接続され、このリモコン受信部１１でリモコン送信機１２からのリモコン信号を受けて、制御部１０に伝達する。制御部１０は、受信したリモコン信号に応じた処理制御を実行する。 In other words, FIG. 1 is a block diagram illustrating a configuration example of a television receiver. The television receiver in the example of FIG. 1 includes a control unit 10 that includes a microcomputer. A remote control receiving unit 11 is connected to the control unit 10, and the remote control receiving unit 11 receives a remote control signal from the remote control transmitter 12 and transmits it to the control unit 10. The control unit 10 executes processing control according to the received remote control signal.

制御部１０は、テレビジョン受像機の各部に対して制御信号を供給して、テレビ放送信号の受信およびその映像再生および音声再生の処理を実行する。 The control unit 10 supplies a control signal to each unit of the television receiver, and executes a process of receiving a television broadcast signal and reproducing its video and audio.

チューナ部１３は、制御部１０からのユーザのリモコン操作に応じたチャンネル選択制御信号により指定される放送チャンネルの信号を、テレビ放送波信号から選択抽出する。そして、チューナ部１３は、選択抽出した放送チャンネルの信号から、映像信号と、音声信号とを復調デコードし、映像信号は映像信号処理部１４に供給し、音声信号は、音声信号処理部１５に供給する。 The tuner unit 13 selectively extracts a broadcast channel signal designated by a channel selection control signal according to a user's remote control operation from the control unit 10 from a television broadcast wave signal. The tuner unit 13 demodulates and decodes the video signal and the audio signal from the selected broadcast channel signal, supplies the video signal to the video signal processing unit 14, and the audio signal to the audio signal processing unit 15. Supply.

映像信号処理部１４では、制御部１０からの制御を受けて、映像信号についての所定の処理をし、その処理後の映像信号を表示制御部１６を通じて、例えばＬＣＤ（Liquid Crystal Display）からなるディスプレイ１７に供給する。これにより、選択された放送チャンネルの放送番組の画像がディスプレイ１７に表示される。 In the video signal processing unit 14, under the control of the control unit 10, a predetermined process is performed on the video signal, and the processed video signal is displayed on the display control unit 16 through a display controller 16, for example, an LCD (Liquid Crystal Display). 17 is supplied. As a result, an image of the broadcast program of the selected broadcast channel is displayed on the display 17.

また、音声信号処理部１５では、制御部１０からの制御を受けて、音声信号についての所定の処理をする。この実施形態では、音声信号処理部１５では、チューナ部１３からの音声信号から、左右２チャンネルの音声信号ＳｉＬ，ＳｉＲを生成し、その処理後の音声信号ＳｉＬ，ＳｉＲを音量補正部１８に供給する。 In addition, the audio signal processing unit 15 performs predetermined processing on the audio signal under the control of the control unit 10. In this embodiment, the audio signal processing unit 15 generates left and right channel audio signals SiL and SiR from the audio signal from the tuner unit 13 and supplies the processed audio signals SiL and SiR to the volume correction unit 18. To do.

音量補正部１８は、この実施形態の音量補正装置が適用される部分であり、その入力音声信号ＳｉＬ，ＳｉＲは、後述するようにして、音量補正され、出力音声信号ＳｏＬおよびＳｏＲとして出力される。そして、この音量補正部１８からの出力音声信号ＳｏＬおよびＳｏＲが、スピーカ１９Ｌおよび１９Ｒに供給されて、音響再生される。これにより、選択された放送チャンネルの放送番組の音声がスピーカ１９Ｌおよび１９Ｒから放音される。 The volume correction unit 18 is a part to which the volume correction apparatus of this embodiment is applied, and the input audio signals SiL and SiR are volume-corrected and output as output audio signals SoL and SoR as described later. . Then, the output audio signals SoL and SoR from the volume correction unit 18 are supplied to the speakers 19L and 19R to be reproduced acoustically. Thereby, the sound of the broadcast program of the selected broadcast channel is emitted from the speakers 19L and 19R.

以下、この音量補正部１８の場合として、この実施形態の音量補正装置について説明する。 Hereinafter, the volume correction apparatus of this embodiment will be described as a case of the volume correction unit 18.

［音量補正装置の第１の実施形態］
図２は、この技術の音量補正部１８の第１の実施形態としての音量補正装置の全体の構成例を示すブロック図である。 [First Embodiment of Volume Correction Device]
FIG. 2 is a block diagram showing an example of the overall configuration of the volume correction apparatus as the first embodiment of the volume correction unit 18 of this technique.

この第１の実施形態では、入力音声信号は、左右２チャンネルの音声信号とされる。そして、第１成分主体信号は、左右２チャンネルの音声信号中の主として人声成分を主体とする信号（以下、声主体信号という）とされる。また、第１成分以外の他の音声成分は、左右２チャンネルの音声信号のうちの、この声主体信号以外の、いわゆる臨場音とされる。この臨場音を主体とする信号を、以下、臨場音主体信号（または臨場音信号）という。 In the first embodiment, the input audio signal is an audio signal of two left and right channels. The first component main signal is a signal mainly including a human voice component in the left and right two-channel audio signals (hereinafter referred to as a voice main signal). The other audio components other than the first component are so-called real sounds other than the main voice signal in the two left and right channel audio signals. Hereinafter, the signal mainly composed of the live sound is referred to as a live sound main signal (or live sound signal).

この図２に示すように、この第１の実施形態においては、左右２チャンネルの入力音声信号ＳｉＬ，ＳｉＲは、声主体信号と臨場音主体信号との分離部２０に供給される。この例の分離部２０は、センター集中定位信号検出部２１と、２個の減算部２２，２３とからなる。 As shown in FIG. 2, in this first embodiment, the left and right channel input audio signals SiL and SiR are supplied to the separation unit 20 for the main voice signal and the main live sound signal. The separation unit 20 in this example includes a center concentration localization signal detection unit 21 and two subtraction units 22 and 23.

センター集中定位信号検出部２１には、左右２チャンネルの入力音声信号ＳｉＬ，ＳｉＲが共に供給され、左右チャンネルの中央（センター）に定位するセンター集中定位信号として、声主体信号Ｓｖを検出する。センター集中定位信号検出部２１で検出された声主体信号Ｓｖは、減算部２２，２３、並びに、可変ゲインアンプ２４に供給される。 The center centralized localization signal detector 21 is supplied with both left and right channel input audio signals SiL and SiR, and detects the voice main signal Sv as a center concentrated localization signal localized at the center (center) of the left and right channels. The voice main signal Sv detected by the center concentration localization signal detection unit 21 is supplied to the subtraction units 22 and 23 and the variable gain amplifier 24.

減算部２２では、左チャンネルの音声信号ＳｉＬから、声主体信号Ｓｖが減算されて、左チャンネルの臨場音主体信号ＳｓＬが得られる。また、減算部２３では、右チャンネルの音声信号ＳｉＲから、声主体信号Ｓｖが減算されて、右チャンネルの臨場音主体信号ＳｓＲが得られる。 The subtracting unit 22 subtracts the voice main signal Sv from the left channel audio signal SiL to obtain the left channel live sound main signal SsL. Further, the subtracting unit 23 subtracts the voice main signal Sv from the right channel audio signal SiR to obtain the right channel live sound main signal SsR.

こうして、分離部２０では、２チャンネル音声信号ＳｉＬ，ＳｉＲから、声主体信号Ｓｖと、左右チャンネルの臨場音主体信号ＳｓＬ，ＳｓＲとが分離されて得られる。 Thus, the separation unit 20 obtains the main voice signal Sv and the left and right channel main sound signals SsL and SsR from the two-channel audio signals SiL and SiR.

そして、この分離部２０からの声主体信号Ｓｖは、可変ゲインアンプ２４を通じて加算部２７，２８に供給されると共に、補正ゲイン生成部３０に供給される。 The main voice signal Sv from the separation unit 20 is supplied to the addition units 27 and 28 through the variable gain amplifier 24 and also to the correction gain generation unit 30.

この例では、補正ゲイン生成部３０は、平均レベル検出部３１、センターバランス補正部３２、ゲイン制御信号生成部３３、臨場音平均レベル検出部３４、臨場音バランス補正部３５、およびゲイン制御信号生成部３６とからなる。平均レベル検出部３１は、声主体信号Ｓｖの平均レベルを検出して、その検出した平均レベルをセンターバランス補正部３２および臨場音バランス補正部３５に供給する。臨場音平均レベル検出部３４は、２チャンネル音声信号ＳｉＬ，ＳｉＲから、臨場音主体信号の平均レベルを検出して、その検出した平均レベルをセンターバランス補正部３２および臨場音バランス補正部３５に供給する。 In this example, the correction gain generation unit 30 includes an average level detection unit 31, a center balance correction unit 32, a gain control signal generation unit 33, an actual sound average level detection unit 34, an actual sound balance correction unit 35, and a gain control signal generation. Part 36. The average level detection unit 31 detects the average level of the voice main signal Sv and supplies the detected average level to the center balance correction unit 32 and the live sound balance correction unit 35. The live sound average level detection unit 34 detects the average level of the live sound main signal from the two-channel audio signals SiL and SiR, and supplies the detected average level to the center balance correction unit 32 and the live sound balance correction unit 35. To do.

センターバランス補正部３２は、声主体信号Ｓｖの平均レベルを、臨場音主体信号の平均レベルに基づいて、所定の処理により補正して、ゲイン制御信号生成部３３に供給する。臨場音バランス補正部３５は、声主体信号Ｓｖの平均レベルを、臨場音主体信号の平均レベルに基づいて、所定の処理により補正して、ゲイン制御信号生成部３６に供給する。 The center balance correction unit 32 corrects the average level of the voice main signal Sv by a predetermined process based on the average level of the live sound main signal, and supplies the corrected signal to the gain control signal generation unit 33. The live sound balance correction unit 35 corrects the average level of the voice main signal Sv by a predetermined process based on the average level of the live sound main signal, and supplies the corrected signal to the gain control signal generation unit 36.

ゲイン制御信号生成部３３は、センターバランス補正部３２により、臨場音主体信号の平均レベルに基づいて補正された、声主体信号Ｓｖの平均レベルが、予め定めた基準レベルとなるようにするためのゲイン制御信号Ｇｖを生成する。そして、ゲイン制御信号生成部３３は、生成したゲイン制御信号Ｇｖを可変ゲインアンプ２４に供給する。 The gain control signal generation unit 33 is configured so that the average level of the voice main signal Sv corrected by the center balance correction unit 32 based on the average level of the live sound main signal becomes a predetermined reference level. A gain control signal Gv is generated. Then, the gain control signal generation unit 33 supplies the generated gain control signal Gv to the variable gain amplifier 24.

ゲイン制御信号生成部３６は、臨場音バランス補正部３５により、臨場音主体信号の平均レベルに基づいて補正された、声主体信号Ｓｖの平均レベルが、予め定めた基準レベルとなるようにするためのゲイン制御信号Ｇｙを生成する。そして、ゲイン制御信号生成部３６は、生成したゲイン制御信号Ｇｙを臨場音レベル補正ゲイン生成部３７に供給する。 The gain control signal generation unit 36 is configured so that the average level of the voice main signal Sv corrected by the real sound balance correction unit 35 based on the average level of the main sound main signal becomes a predetermined reference level. The gain control signal Gy is generated. Then, the gain control signal generation unit 36 supplies the generated gain control signal Gy to the live sound level correction gain generation unit 37.

したがって、可変ゲインアンプ２４においては、ゲイン制御信号Ｇｖにより、声主体信号Ｓｖのレベルが大きく変動しても、声主体信号の平均レベルが一定レベル（基準レベル）になるようにゲイン制御される。また、この際、ゲイン制御信号Ｇｖは、臨場音平均レベルに基づいて、声主体信号と臨場音とのバランスが調整されたものであるので、可変ゲインアンプ２４から出力される補正後声主体信号Ｓｖｃの出力レベルは、臨場音と声主体信号とのバランスが調整されたうえで、一定レベルとされる。そして、この一定レベルとされた補正後声主体信号Ｓｖｃが、加算部２７，２８に供給される。 Therefore, in the variable gain amplifier 24, the gain control signal Gv controls the gain so that the average level of the voice main signal becomes a constant level (reference level) even if the level of the voice main signal Sv varies greatly. At this time, since the gain control signal Gv is obtained by adjusting the balance between the voice main signal and the real sound based on the real sound average level, the corrected voice main signal output from the variable gain amplifier 24 is used. The output level of Svc is set to a constant level after the balance between the actual sound and the voice main signal is adjusted. Then, the corrected main voice signal Svc having a constant level is supplied to the adding units 27 and 28.

臨場音レベル補正ゲイン生成部３７は、ゲイン制御信号生成部３６からのゲイン制御信号Ｇｙを受けて、このゲイン制御信号Ｇｙに基づく処理を行って、臨場音主体信号をゲイン補正するゲイン制御信号Ｇｓを生成し、可変ゲインアンプ２５，２６に供給する。 The live sound level correction gain generation unit 37 receives the gain control signal Gy from the gain control signal generation unit 36, performs a process based on the gain control signal Gy, and performs gain correction on the real sound main signal. Is supplied to the variable gain amplifiers 25 and 26.

すなわち、臨場音レベル補正ゲイン生成部３７は、ゲイン制御信号Ｇｙに対して、何らかの処理を加えるので、ゲイン制御信号Ｇｖによる声主体信号Ｓｖに対するゲイン制御態様と、ゲイン制御信号Ｇｓによる臨場音主体信号ＳｓＬ，ＳｓＲに対するゲイン制御態様とは、異なるものとなる。 That is, the presence sound level correction gain generation unit 37 performs some processing on the gain control signal Gy, so that the gain control mode for the voice main signal Sv by the gain control signal Gv and the real sound main signal by the gain control signal Gs The gain control mode for SsL and SsR is different.

可変ゲインアンプ２５，２６は、臨場音レベル補正ゲイン生成部３７からゲイン制御信号Ｇｓの供給を受けて、このゲイン制御信号Ｇｓに基づいて、左右チャンネルの臨場音主体信号ＳｓＬ，ＳｓＲに対して、声主体信号Ｓｖとは異なるゲイン制御を行う。 The variable gain amplifiers 25 and 26 receive the supply of the gain control signal Gs from the live sound level correction gain generation unit 37, and based on the gain control signal Gs, the live sound main signals SsL and SsR are Gain control different from that of the voice main signal Sv is performed.

加算部２７は、補正後の左チャンネルの臨場音主体信号ＳｓＬｃと補正後声主体信号Ｓｖｃとを加算し、その加算出力として、音量補正された左チャンネルの出力音声信号ＳｏＬを出力する。 The adder 27 adds the corrected left channel main sound signal SsLc and the corrected voice main signal Svc, and outputs a volume-corrected left channel output audio signal SoL as the addition output.

また、加算部２８は、補正後の右チャンネルの臨場音主体信号ＳｓＲｃと補正後声主体信号Ｓｖｃとを加算し、その加算出力として、音量補正された右チャンネルの出力音声信号ＳｏＲを出力する。 Further, the adder 28 adds the corrected right channel live sound main signal SsRc and the corrected voice main signal Svc, and outputs the output sound signal SoR of the right channel whose volume is corrected as the addition output.

ところで、ゲイン制御信号Ｇｙは、臨場音平均レベルに基づいて、臨場音と声主体信号とのバランスが調整されたものである。このため、ゲイン制御信号Ｇｙに基づいて求められる、ゲイン制御信号Ｇｓによる臨場音主体信号ＳｓＬ，ＳｓＲに対するゲイン制御態様は、入力音声信号の大きなレベル変動に対して即座に追従しないようなものにされると共に、臨場音と声主体信号とのバランスが調整されたものにされる。すなわち、声主体信号Ｓｖに対するゲイン制御態様は、入力音声信号のレベル変動に即座に追従して、臨場音と声主体信号とのバランスを調整しつつ、出力レベルをある程度一定にする。しかしながら、臨場音主体信号ＳｓＬ，ＳｓＲに対するゲイン制御態様は、それとは異なり、臨場音と声主体信号とのバランスを調整しつつ、入力音声信号の大きなレベル変動に対して即座に追従しない特性のものとされる。 By the way, the gain control signal Gy is obtained by adjusting the balance between the live sound and the voice main signal based on the live sound average level. For this reason, the gain control mode for the presence sound main signals SsL and SsR determined by the gain control signal Gs based on the gain control signal Gy is set so as not to immediately follow a large level fluctuation of the input sound signal. At the same time, the balance between the actual sound and the main voice signal is adjusted. That is, the gain control mode for the voice main signal Sv immediately follows the level fluctuation of the input voice signal, and adjusts the balance between the actual sound and the voice main signal, and makes the output level constant. However, the gain control mode for the presence sound main signals SsL and SsR is different from the gain control mode in that the balance between the presence sound and the voice main signal is adjusted and the input sound signal does not immediately follow a large level fluctuation. It is said.

したがって、左右チャンネルの臨場音主体信号ＳｓＬ，ＳｓＲは、可変ゲインアンプ２５，２６で、ゲイン制御されて加算部２７，２８に供給される。そして、臨場音主体信号ＳｓＬ，ＳｓＲに対するゲイン制御態様は、臨場音と声主体信号とのバランスを調整しつつ、入力音声信号の大きなレベル変動に対して即座に追従しない特性のものとされている。 Therefore, the main sound signals SsL and SsR of the left and right channels are supplied to the adders 27 and 28 after gain control by the variable gain amplifiers 25 and 26. Then, the gain control mode for the presence sound main signals SsL and SsR has a characteristic that does not immediately follow a large level fluctuation of the input sound signal while adjusting the balance between the presence sound and the sound main signal. .

したがって、臨場音主体信号、および声主体信号は、それぞれのバランスを調整しつつ、声主体信号について生じる大きなレベル変化点での音量レベルの揺れが抑制される。 Therefore, fluctuations in the volume level at a large level change point that occur with respect to the voice main signal are suppressed while adjusting the balance of the live sound main signal and the voice main signal.

このため、加算部２７，２８からの左右チャンネルの出力音声信号ＳｏＬ，ＳｏＲは、補正後声主体信号Ｓｖｃの音量レベルの揺れが、左チャンネルおよび右チャンネルの臨場音主体信号ＳｓＬ，ＳｓＲによりマスキングされるようになる。この結果、臨場音と声主体信号とのバランスを調整しつつ、声主体信号Ｓｖｃの音量レベルの揺れが、目立たなくなり、聴取者に対する違和感が軽減される。 For this reason, the output sound signals SoL and SoR of the left and right channels from the adders 27 and 28 are masked by the main sound signals SsL and SsR of the left channel and the right channel of the volume level fluctuation of the corrected voice main signal Svc. Become so. As a result, while adjusting the balance between the actual sound and the voice main signal, the fluctuation of the volume level of the voice main signal Svc becomes inconspicuous, and the uncomfortable feeling to the listener is reduced.

以上の構成により、声主体信号は、素早く適正レベルに遷移されることになるので、人声のレベルの一定感を保ち、台詞などの人声を聞き易くすることができる。さらに、臨場音主体信号は、臨場感がある程度一定に保たれるため、レベルを変えることによる違和感が軽減され、これによってより自然なレベル遷移を実現することができるようになる。 With the above configuration, the voice main signal is quickly shifted to an appropriate level, so that a constant feeling of the voice level can be maintained and a voice such as a dialogue can be easily heard. Furthermore, since the presence of the presence-sound-main signal is kept constant to some extent, a sense of incongruity caused by changing the level is reduced, and thereby a more natural level transition can be realized.

なお、以上の例では、左右２チャンネル用のスピーカにより音声信号を音響再生する場合であるので、加算部２７，２８を設けるようにした。しかし、左右２チャンネル用のスピーカに加えて、センターチャンネル用のスピーカを設けた場合には、補正後声主体信号をセンターチャンネル用スピーカに供給し、可変ゲインアンプ２５，２６の出力音声信号を左右２チャンネル用のスピーカに供給するようにしても良い。この場合には、センターチャンネル用のスピーカの放音音声と、左右２チャンネル用のスピーカの放音音声とが、音響的に合成されることにより、ゲイン制御による音量レベルの揺れがマスキングされ、目立たなくなる。 In the above example, since the audio signal is acoustically reproduced by the left and right channel speakers, the adders 27 and 28 are provided. However, when a center channel speaker is provided in addition to the left and right channel speakers, the corrected main voice signal is supplied to the center channel speaker, and the output audio signals of the variable gain amplifiers 25 and 26 are changed to the left and right channels. You may make it supply to the speaker for 2 channels. In this case, the sound output from the center channel speaker and the sound output from the left and right channel speakers are acoustically synthesized, so that the fluctuation of the volume level due to the gain control is masked and becomes conspicuous. Disappear.

［センター集中定位信号検出部２１の構成例］
＜第１の例＞
図３は、センター集中定位信号検出部２１の第１の構成例を示すものである。この例においては、センター集中定位信号検出部２１は、加算部２１１と、固定ゲイン「０．５」のアンプ２１２とからなる。 [Configuration Example of Center Concentration Localization Signal Detection Unit 21]
<First example>
FIG. 3 shows a first configuration example of the center concentration localization signal detection unit 21. In this example, the center concentration localization signal detection unit 21 includes an addition unit 211 and an amplifier 212 having a fixed gain “0.5”.

そして、この例のセンター集中定位信号検出部２１においては、左右チャンネルの入力音声信号ＳｉＬ，ＳｉＲが加算部２１１で加算され、その加算出力信号がアンプ２１２を通じて出力される。このアンプ２１２の出力信号が声主体信号Ｓｖとされる。 In the centralized localization signal detection unit 21 of this example, the left and right channel input audio signals SiL and SiR are added by the addition unit 211, and the addition output signal is output through the amplifier 212. An output signal of the amplifier 212 is a voice main signal Sv.

なお、この第１の例の場合には、声主体信号Ｓｖの平均値は、左右チャンネルの入力音声信号ＳｉＬ，ＳｉＲの加算信号の平均値に等しくなる。そして、補正ゲイン生成部３０は、声主体信号Ｓｖの平均値が一定レベルとなるようにゲイン制御信号Ｇｖを生成する。よって、この第１の例の場合には、補正ゲイン生成部３０は、左右チャンネルの入力音声信号ＳｉＬ，ＳｉＲの加算信号、つまり、入力音声信号全体のレベルが一定レベルとなるようにゲイン制御信号Ｇｖを生成していることにもなる。 In the case of the first example, the average value of the voice main signal Sv is equal to the average value of the sum signal of the input audio signals SiL and SiR of the left and right channels. Then, the correction gain generation unit 30 generates the gain control signal Gv so that the average value of the voice main signal Sv becomes a constant level. Therefore, in the case of the first example, the correction gain generation unit 30 adds the gain control signal so that the addition signal of the input audio signals SiL and SiR of the left and right channels, that is, the level of the entire input audio signal becomes a constant level. Gv is also generated.

＜第２の例＞
図４は、センター集中定位信号検出部２１の第２の構成例を示すものである。この第２の例は、第１の例の出力をそのまま出力するのではなく、第１の例の出力よりも、さらにセンター定位成分のみの成分に応じた信号を得るようにする例である。 <Second example>
FIG. 4 shows a second configuration example of the center concentration localization signal detection unit 21. In the second example, the output of the first example is not output as it is, but a signal corresponding to the component of only the center localization component is obtained more than the output of the first example.

この例においては、センター集中定位信号検出部２１は、第１の例の加算部２１１および固定ゲイン「０．５」のアンプ２１２に加えて、ゲイン調整アンプ２１３と、センター集中定位率検出部２１４とを備える。 In this example, the center concentration localization signal detection unit 21 includes a gain adjustment amplifier 213 and a center concentration localization rate detection unit 214 in addition to the addition unit 211 and the fixed gain “0.5” amplifier 212 in the first example. With.

この例のセンター集中定位信号検出部２１においては、アンプ２１２の出力信号はゲイン調整アンプ２１３に供給され、このゲイン調整アンプ２１３の出力信号が声主体信号Ｓｖとされる。 In the center concentrated localization signal detector 21 of this example, the output signal of the amplifier 212 is supplied to the gain adjustment amplifier 213, and the output signal of the gain adjustment amplifier 213 is used as the voice main signal Sv.

そして、この例のセンター集中定位信号検出部２１において、左右チャンネルの入力音声信号ＳｉＬ，ＳｉＲは、センター集中定位率検出部２１４にも供給される。このセンター集中定位率検出部２１４においては、入力音声全体に対するセンターに集中的に定位する信号の割合に応じて、ゲイン調整アンプ２１３のゲインを制御するゲイン制御信号Ｇａｔが生成される。 In the center concentration localization signal detector 21 of this example, the left and right channel input audio signals SiL and SiR are also supplied to the center concentration localization rate detector 214. In the center concentration localization rate detection unit 214, a gain control signal Gat for controlling the gain of the gain adjustment amplifier 213 is generated according to the ratio of the signal concentrated in the center with respect to the entire input sound.

そして、センター集中定位率検出部２１４からのゲイン制御信号Ｇａｔにより、ゲイン調整アンプ２１３のゲインが制御されることで、声主体信号Ｓｖは、アンプ２１２の出力のうち、センターに集中的に定位する率に応じた信号成分からなるものとなる。つまり、この第２の例の声主体信号Ｓｖは、第１の例よりも、さらにセンターに集中的に定位する信号成分からなる信号となる。 Then, the gain of the gain adjustment amplifier 213 is controlled by the gain control signal Gat from the center concentration localization rate detection unit 214, so that the voice main signal Sv is localized at the center among the outputs of the amplifier 212. It consists of signal components according to the rate. That is, the voice main signal Sv of the second example is a signal composed of signal components that are localized more centrally than in the first example.

センター集中定位率検出部２１４は、例えば図５に示すような構成を有するものとすることができる。 The center concentration localization rate detection unit 214 may have a configuration as shown in FIG. 5, for example.

すなわち、センター集中定位率検出部２１４は、帯域制限フィルタ２１４１，２１４２と、定位方向検出部２１４３と、定位方向分布計測部２１４４と、センターゲイン制御信号生成部２１４５とを備えて構成される。 That is, the center concentration localization rate detection unit 214 includes band limiting filters 2141 and 2142, a localization direction detection unit 2143, a localization direction distribution measurement unit 2144, and a center gain control signal generation unit 2145.

センター集中定位率検出部２１４に入力された左右２チャンネルの入力音声信号ＳｉＬ，ＳｉＲは、それぞれ帯域制限フィルタ２１４１，２１４２において、例えば低域成分等、定位方向をあまり感じない周波数帯域の成分が除去される。 The left and right two-channel input audio signals SiL and SiR input to the center concentration localization rate detection unit 214 remove, in the band limiting filters 2141 and 2142, for example, frequency band components that do not feel the localization direction much, such as low frequency components. Is done.

そして、帯域制限フィルタ２１４１，２１４２により帯域制限された２チャンネルの入力音声信号ＳｉＬ，ＳｉＲは、定位方向検出部２１４３に供給される。定位方向検出部２１４３は、帯域制限された２チャンネルの入力音声信号ＳｉＬ，ＳｉＲのそれぞれのレベルの大きさにより、所定の周期毎の定位方向の検出時点における２チャンネルの入力音声信号ＳｉＬ，ＳｉＲが持つ定位方向を検出する。 The two-channel input audio signals SiL and SiR band-limited by the band-limiting filters 2141 and 2142 are supplied to the localization direction detection unit 2143. The localization direction detection unit 2143 receives the two-channel input audio signals SiL and SiR at the time of detection in the localization direction every predetermined period according to the level of each of the band-limited two-channel input audio signals SiL and SiR. Detect the localization direction you have.

すなわち、定位方向検出部２１４３においては、所定のサンプリング周期で、帯域制限された２チャンネルの入力オーディオ信号ＳｉＬ，ＳｉＲのそれぞれのレベル（振幅）をサンプリングする。そして、定位方向検出部２１４３においては、この例では、最新サンプリング時点における定位方向を現時点における定位方向として検出するようにする。 That is, the localization direction detection unit 2143 samples each level (amplitude) of the band-limited input audio signals SiL and SiR of two channels at a predetermined sampling period. In this example, the localization direction detection unit 2143 detects the localization direction at the latest sampling time as the current localization direction.

この場合、定位方向検出部２１４３は、当該最新サンプリング時点における定位方向を、入力音声信号ＳｉＬ，ＳｉＲのそれぞれについての、当該最新サンプリング時点のレベルと、それよりも過去のサンプリング時点のレベルとを用いて検出する。 In this case, the localization direction detecting unit 2143 uses the localization direction at the latest sampling time point as the latest sampling time level and the past sampling time level for each of the input audio signals SiL and SiR. To detect.

２チャンネルの入力音声信号ＳｉＬ，ＳｉＲが、デジタルオーディオ信号であれば、前記サンプリング周期は、デジタルオーディオ信号のサンプル周期に等しくすることができる。もっとも、前記サンプリング周期を、デジタルオーディオ信号の１サンプル周期と等しくするのではなく、複数サンプル周期とするようにしてもよい。定位方向検出部２１４３の入力音声信号がアナログ信号である場合には、この定位方向検出部２１４３の入力段において、デジタルオーディオ信号に変換するようにしても良い。 If the two-channel input audio signals SiL and SiR are digital audio signals, the sampling period can be made equal to the sampling period of the digital audio signal. However, the sampling period may not be equal to one sample period of the digital audio signal, but may be a plurality of sample periods. When the input audio signal of the localization direction detection unit 2143 is an analog signal, it may be converted into a digital audio signal at the input stage of the localization direction detection unit 2143.

この定位方向検出部２１４３における定位方向の検出方法を、図６を参照しながら説明する。図６の分布図（Ａ），（Ｂ）は、左チャンネルの入力音声信号ＳｉＬの振幅をＸ軸にとり、右チャンネルの入力音声信号ＳｉＲの振幅をＹ軸にとった場合の座標空間を示している。 A method of detecting the localization direction in the localization direction detection unit 2143 will be described with reference to FIG. The distribution diagrams (A) and (B) of FIG. 6 show the coordinate space when the amplitude of the left channel input audio signal SiL is taken on the X axis and the amplitude of the right channel input audio signal SiR is taken on the Y axis. Yes.

定位方向検出部２１４３では、まず、各サンプリング周期毎の定位方向の検出時点において２チャンネルの入力音声信号ＳｉＬ，ＳｉＲのそれぞれのレベルを取得して、それに対応する座標点を、図６の分布図（Ａ），（Ｂ）の座標空間に、例えばＰ１，Ｐ２，Ｐ３，Ｐ４のように、プロットしてゆく。この例では、Ｐ４が最新の検出時点の座標点であるとする。 In the localization direction detection unit 2143, first, the respective levels of the input audio signals SiL and SiR of the two channels are acquired at the time of detection in the localization direction for each sampling period, and the corresponding coordinate points are shown in the distribution diagram of FIG. For example, P1, P2, P3, and P4 are plotted in the coordinate space of (A) and (B). In this example, it is assumed that P4 is a coordinate point at the latest detection time.

そして、定位方向検出部２１４３では、ｙ＝ｋ・ｘ（ｋは定数）で表される直線（Ｘ軸とＹ軸との交点Ｚを通る直線）を、交点Ｚを中心として±９０°回転させたときに、つまり、定数ｋを変化させたときに、プロットした座標点Ｐ１，Ｐ２，Ｐ３，Ｐ４が、どの定数ｋの直線（どの傾き角度の直線）の一番近くを移動してゆくかを算出する。つまり、定数ｋを変えた各直線からの各座標点Ｐ１，Ｐ２，Ｐ３，Ｐ４までの距離Ｄａ１，Ｄａ２，Ｄａ３，Ｄａ４あるいは距離Ｄｂ１，Ｄｂ２，Ｄｂ３，Ｄｂ４の総和が最も小さい直線の定数ｋを算出する。 The localization direction detection unit 2143 rotates a straight line represented by y = k · x (k is a constant) (a straight line passing through the intersection point Z between the X axis and the Y axis) by ± 90 ° about the intersection point Z. In other words, when the constant k is changed, the plotted coordinate points P1, P2, P3, and P4 move closest to which constant k straight line (which inclination angle straight line). Is calculated. That is, the constant k of the straight line having the smallest sum of the distances Da1, Da2, Da3, Da4 or the distances Db1, Db2, Db3, Db4 from the straight lines with the constant k changed to the coordinate points P1, P2, P3, P4. calculate.

そして、定位方向検出部２１４３は、算出した直線の定数ｋに対応する傾き角度を、検出したい現時点における定位方向とする。図７の例では、Ｘ軸、つまり、左チャンネルの定位方向（左方向）の角度を０°として、このＸ軸に対する角度（以下、定位角度という）θを定位方向として検出することとする。 Then, the localization direction detection unit 2143 sets the inclination angle corresponding to the calculated straight line constant k as the localization direction at the current time point to be detected. In the example of FIG. 7, the angle of the X axis, that is, the localization direction (left direction) of the left channel is set to 0 °, and the angle θ (hereinafter referred to as the localization angle) θ with respect to the X axis is detected as the localization direction.

図６の分布図（Ａ）の場合の座標点Ｐ１，Ｐ２，Ｐ３，Ｐ４の例では、定位角度はθａとして検出され、図６の分布図（Ｂ）の場合の座標点Ｐ１，Ｐ２，Ｐ３，Ｐ４の例では、定位角度はθｂとして検出されるものである。 In the example of the coordinate points P1, P2, P3, and P4 in the case of the distribution diagram (A) in FIG. 6, the localization angle is detected as θa, and the coordinate points P1, P2, and P3 in the case of the distribution diagram (B) in FIG. , P4, the localization angle is detected as θb.

なお、この実施形態では、定位方向検出部２１４３においては、現時点（最新サンプリング時点）の２チャンネル入力音声信号のレベルと、過去のサンプリング時点における２チャンネル入力音声信号のレベルとは等しい重みで用いてはいない。この実施形態では、定位方向検出部２１４３においては、現時点に近いサンプリング時点の２チャンネル入力音声信号のレベルほど重みが大きいものとするようにしている。 In this embodiment, in the localization direction detection unit 2143, the level of the 2-channel input audio signal at the present time (latest sampling time) and the level of the 2-channel input audio signal at the past sampling time are used with equal weights. No. In this embodiment, the localization direction detection unit 2143 is configured to increase the weight as the level of the 2-channel input audio signal at the sampling time close to the current time.

このため、定位方向検出部２１４３では、２チャンネル入力音声信号のレベルのサンプリング値に対して、図７に示すように、現時点（この例では最新サンプリング時点ｔｎ）に近いほど、重みが大きくなるように、指数関数曲線の特性を有する時間ウインドーＷＤ１が用いられている。 For this reason, in the localization direction detection unit 2143, as shown in FIG. 7, with respect to the sampling value of the level of the 2-channel input audio signal, the closer to the current time (in this example, the latest sampling time tn), the greater the weight. In addition, a time window WD1 having the characteristic of an exponential function curve is used.

なお、上述の説明では、処理対象信号時点となる現時点を最新サンプリング時点（最新サンプル時点）とした。しかし、可変ゲインアンプ２４乃至２６の入力側に所定時間τだけ遅延させる遅延回路を設けて、処理対象となる現時点を、入力音声信号ＳｉＬ，ＳｉＲよりも前記τだけ遅延した時点とすることができる。 In the above description, the current time that is the signal to be processed is the latest sampling time (latest sampling time). However, a delay circuit that delays by a predetermined time τ is provided on the input side of the variable gain amplifiers 24 to 26, and the current time to be processed can be set as the time delayed by τ from the input audio signals SiL and SiR. .

その場合には、定位方向検出部２１４３では、処理対象信号時点となる現時点よりも後（未来）の２チャンネル入力音声信号ＳｉＬ，ＳｉＲをも用いて、定位方向を検出するようにすることができる。例えば、図６の例で、処理対象信号時点となる現時点がＰ２やＰ３の場合とすることができる。 In that case, the localization direction detection unit 2143 can detect the localization direction also using the two-channel input audio signals SiL and SiR after (future) the current time as the signal to be processed. . For example, in the example of FIG. 6, the current time that is the processing target signal time point may be P2 or P3.

そして、その場合には、前述した時間ウインドーＷＤ１の代わりに、図８に示すような指数関数曲線の特性の時間ウインドーＷＤ２が用いられる。この時間ウインドーＷＤ２は、処理対象信号時点となる現時点ｔｐで最も重みが大きく、現時点ｔｐから離れるにつれ、過去および未来の方向に重みが小さくなるような指数関数曲線の特性を有するものである。 In that case, a time window WD2 having the characteristic of an exponential function curve as shown in FIG. 8 is used instead of the above-described time window WD1. The time window WD2 has a characteristic of an exponential function curve in which the weight is the largest at the current time tp that is the signal to be processed, and the weight becomes smaller in the past and future directions as the distance from the current time tp is increased.

なお、現時点の２チャンネル入力オーディオ信号のレベルを、過去および／または未来のサンプリング時点における２チャンネル入力音声信号ＳｉＬ，ＳｉＲのレベルを重み付けせずに、そのままの値で用いても良い。 Note that the level of the current 2-channel input audio signal may be used as it is without weighting the levels of the 2-channel input audio signals SiL, SiR at the past and / or future sampling points.

以上のようにして、定位方向検出部２１４３では、現時点においては、２チャンネル入力音声信号ＳｉＬ，ＳｉＲが、どの方向からの信号であるかを、定位角度θとして検出することができる。 As described above, the localization direction detection unit 2143 can detect from which direction the two-channel input audio signals SiL and SiR are signals as the localization angle θ.

しかしながら、検出した現時点における定位角度θは、１時点における入力オーディオ信号の定位方向を一方向に限定したもので、各方向ごとの信号の強さが反映されていない。そこで、この実施形態では、この点にかんがみ、定位方向検出部２１４３で検出された現時点における２チャンネル入力音声信号ＳｉＬ，ＳｉＲの定位方向の検出結果（定位角度θ）は、定位方向分布計測部２１４４に供給される。 However, the detected localization angle θ at the present time limits the localization direction of the input audio signal at one time point to one direction, and does not reflect the strength of the signal in each direction. Therefore, in this embodiment, in view of this point, the localization direction detection result (localization angle θ) of the two-channel input audio signals SiL and SiR detected by the localization direction detection unit 2143 at the present time is the localization direction distribution measurement unit 2144. To be supplied.

定位方向分布計測部２１４４では、予め定められた所定時間区間ｄに渡って定位方向検出部２１４３で検出された定位角度θの、全方位についての分布を求め、２チャンネル入力音声信号の定位方向が、どの角度方向にどのくらいの割合を持っているかを計測する。 The localization direction distribution measurement unit 2144 obtains a distribution in all directions of the localization angle θ detected by the localization direction detection unit 2143 over a predetermined time interval d, and the localization direction of the two-channel input audio signal is determined. Measure how much angle you have in which angle direction.

この場合、所定時間区間ｄは、例えば数ミリ秒〜数百ミリ秒、この例では数１０ミリ秒に選定されている。そして、この実施形態では、定位方向分布計測部２１４４では、この所定時間区間ｄにおける定位方向検出部２１４３で検出された定位角度θに対して、定位方向検出部２１４３における重み係数の特性と同様に重み付けをするようにする。 In this case, the predetermined time interval d is selected from several milliseconds to several hundred milliseconds, for example, several tens of milliseconds in this example. In this embodiment, in the localization direction distribution measurement unit 2144, the localization angle θ detected by the localization direction detection unit 2143 in the predetermined time interval d is similar to the characteristic of the weighting factor in the localization direction detection unit 2143. Make weights.

すなわち、定位方向分布計測部２１４４では、現時点ｔｐ（この例では、ｔｐ＝ｔｎ（最新サンプリング時点））に近づくほど指数関数的に大きくなるような重み付けをする時間ウインドーＷＤ３（図９参照）をかけて重み付けをするようにする。 In other words, the localization direction distribution measuring unit 2144 applies a time window WD3 (see FIG. 9) that is weighted so as to increase exponentially as it approaches the current time tp (in this example, tp = tn (latest sampling time)). To weight.

なお、前述したように、入力オーディオ信号に対して遅延時間τを設けるようにして、定位方向検出部２１４３での重み付けのための時間ウインドーを、図８のようにする場合には、定位方向分布計測部２１４４における時間ウインドーも、図８と同様なものとなる。その場合の時間区間ｄは、現時点ｔｐより未来と過去の両方を含む時間区間となるものである。なお、重み付けをせずに、そのままの値で用いてもよい。 As described above, when the time window for weighting by the localization direction detection unit 2143 is set as shown in FIG. 8 by providing the delay time τ for the input audio signal, the localization direction distribution is set. The time window in the measurement unit 2144 is the same as that in FIG. The time interval d in that case is a time interval including both the future and the past from the current time tp. Note that the value may be used as it is without weighting.

図１０は、この定位方向分布計測部２１４４で求められた定位角度θの分布である定位方向分布Ｐ（θ）の一例を示すもので、横軸にはＸ軸（左チャンネル定位方向）を基準にした定位角度θをとり、縦軸には各定位角度の出現度（＜１）をとったものである。ここで、この実施形態では、定位方向分布Ｐ（θ）をすべての定位角度θについて総和を求めたときに１、すなわち、
ΣＰ（θ）＝１
となるように分布が生成される。 FIG. 10 shows an example of the localization direction distribution P (θ), which is the distribution of the localization angle θ obtained by the localization direction distribution measuring unit 2144. The horizontal axis is based on the X axis (left channel localization direction). And the vertical axis represents the appearance degree (<1) of each localization angle. Here, in this embodiment, the localization direction distribution P (θ) is 1 when the sum is obtained for all localization angles θ, that is,
ΣP (θ) = 1
A distribution is generated so that

また、定位角度θと、音声信号の定位方向との関係は、図１１に示すようなものとなる。なお、図１１に示されている正面方向、左方向、右方向などは、リスナを基準にした方向名である。 Further, the relationship between the localization angle θ and the localization direction of the audio signal is as shown in FIG. Note that the front direction, the left direction, the right direction, and the like shown in FIG. 11 are direction names based on the listener.

以上のようにして、定位方向分布計測部２１４４からは、現時点（現サンプリング時点あるいは現サンプル時点；処理対象信号時点）ごとに、図１０に示すような定位方向分布Ｐ（θ）の情報が得られる。 As described above, the localization direction distribution measurement unit 2144 obtains information on the localization direction distribution P (θ) as shown in FIG. 10 for each current time (current sampling time or current sampling time; processing target signal time). It is done.

この定位方向分布Ｐ（θ）の情報は、センターゲイン制御信号生成部２１４５に供給される。センターゲイン制御信号生成部２１４５では、定位方向分布計測部２１４４によって算出された定位方向分布Ｐ（θ）から、センターに集中的に定位する信号ほど、ゲインが大きく、その他では、ゲインが小さくなるセンターゲイン制御信号を生成する。 Information on the localization direction distribution P (θ) is supplied to the center gain control signal generation unit 2145. In the center gain control signal generation unit 2145, from the localization direction distribution P (θ) calculated by the localization direction distribution measurement unit 2144, a signal that is localized in the center has a larger gain, and in others, the center has a smaller gain. Generate a gain control signal.

センターゲイン制御信号生成部２１４５は、図示を省略するゲインテーブルメモリを備えている。このゲインテーブルメモリには、ゲイン調整アンプ２１３に供給するゲイン制御信号を生成するためのゲインテーブル情報Ｋ（θ）が予め記憶されている。 The center gain control signal generation unit 2145 includes a gain table memory (not shown). In this gain table memory, gain table information K (θ) for generating a gain control signal to be supplied to the gain adjustment amplifier 213 is stored in advance.

このゲインテーブル情報Ｋ（θ）は、定位角度のすべて（−４５°〜１３５°）に対して、センター定位方向に重み付けが施されたゲイン特性とされている。図１２に、このゲインテーブル情報Ｋ（θ）の例を示す。 The gain table information K (θ) is a gain characteristic in which weighting is performed in the center localization direction for all localization angles (−45 ° to 135 °). FIG. 12 shows an example of the gain table information K (θ).

すなわち、ゲインテーブル情報Ｋ（θ）は、この例では、図１２に示すように、正面方向（センター方向；θ＝４５°）のときにゲインが最大の「１」となる。そして、センター方向よりも左方向の定位角度範囲（０°〜４５°）およびセンター方向よりも左方向の定位角度範囲（４５°〜９０°）では、センター方向から遠ざかるにしたがってゲインが小さくなるようなゲイン特性とされる。 That is, in this example, the gain table information K (θ) has a maximum gain of “1” in the front direction (center direction; θ = 45 °) as shown in FIG. In the localization angle range (0 ° to 45 °) to the left of the center direction and the localization angle range to the left of the center direction (45 ° to 90 °), the gain decreases as the distance from the center direction increases. Gain characteristics.

センターゲイン制御信号生成部２１４５では、定位方向分布計測部２１４４で求められた定位方向分布Ｐ（θ）の情報と、ゲインテーブル情報Ｋ（θ）のゲイン値との、すべての定位角度についての積の総和を算出する。 In the center gain control signal generation unit 2145, the product of the localization direction distribution P (θ) obtained by the localization direction distribution measurement unit 2144 and the gain value of the gain table information K (θ) for all localization angles. Calculate the sum of.

すなわち、センターゲイン制御信号生成部２１４５は、
Ｇａｔ＝Σ（Ｋ（θ）×Ｐ（θ））として、ゲイン制御信号Ｇａｔを生成する。 That is, the center gain control signal generation unit 2145
The gain control signal Gat is generated as Gat = Σ (K (θ) × P (θ)).

こうしてセンターゲイン制御信号生成部２１４５で生成されたゲイン制御信号Ｇａｔは、センター集中定位率検出部２１４の出力として、ゲイン調整アンプ２１３に供給される。 The gain control signal Gat generated by the center gain control signal generation unit 2145 in this way is supplied to the gain adjustment amplifier 213 as an output of the center concentration localization rate detection unit 214.

したがって、ゲイン調整アンプ２１３からは、第１の例よりも、さらにセンターに集中的に定位する信号成分からなる声主体信号Ｓｖが得られる。 Therefore, from the gain adjustment amplifier 213, a voice main signal Sv composed of signal components that are localized more centrally than in the first example is obtained.

なお、センター集中定位信号検出部２１としては、上述した第１の例および第２の例に限られるものではないことは勿論である。 Of course, the center concentration localization signal detection unit 21 is not limited to the first and second examples described above.

［臨場音平均レベル検出部３４の構成例］
次に、図１３を参照して、臨場音平均レベル検出部３４の構成例について説明する。臨場音平均レベル検出部３４は、減算部４１、および平均レベル検出部４２より構成されている。減算部４１は、入力音声信号ＳｉＬ，ＳｉＲの差分を求めて平均レベル検出部４２に供給する。平均レベル検出部４２は、入力音声信号ＳｉＬ，ＳｉＲの差分の平均レベルを臨場音平均レベルとして検出して、その検出した臨場音平均レベルをセンターバランス補正部３２および臨場音バランス補正部３５に供給する。尚、臨場音平均レベル検出部３４の構成については、臨場音平均レベルとして定義できる値が求められるものであれば他の構成であってもよく、例えば、左右チャンネルの臨場音主体信号ＳｓＬ，ＳｓＲとの差分を用いたものであってもよい。 [Configuration Example of Realistic Sound Average Level Detection Unit 34]
Next, a configuration example of the live sound average level detection unit 34 will be described with reference to FIG. The live sound average level detection unit 34 includes a subtraction unit 41 and an average level detection unit 42. The subtractor 41 obtains the difference between the input audio signals SiL and SiR and supplies the difference to the average level detector 42. The average level detection unit 42 detects the average level of the difference between the input audio signals SiL and SiR as the real sound average level, and supplies the detected real sound average level to the center balance correction unit 32 and the real sound balance correction unit 35. To do. The configuration of the live sound average level detection unit 34 may be other configurations as long as a value that can be defined as the live sound average level is required. For example, the live sound main signals SsL and SsR of the left and right channels are used. It is also possible to use the difference between and.

［センターバランス補正部３２の構成例］
次に、図１４を参照して、センターバランス補正部３２の構成例について説明する。センターバランス補正部３２は、固定ゲインアンプ５１、および加算部５２より構成されている。固定ゲインアンプ５１は、臨場音平均レベルを固定のゲインＫ１でゲイン調整し加算部５２に供給する。加算部５２は、固定ゲインアンプ５１より供給されてくるゲインＫ１でゲイン制御された臨場音平均レベルと、平均レベル検出部３１より供給されてくる声主体信号Ｓｖの平均レベルとを加算してセンターバランス補正平均レベルとしてゲイン制御信号生成部３３に供給する。尚、センターバランス補正部３２の構成については、センターバランス補正平均レベルとして定義できる値が求められるものであれば他の構成であってもよい。 [Configuration Example of Center Balance Correction Unit 32]
Next, a configuration example of the center balance correction unit 32 will be described with reference to FIG. The center balance correction unit 32 includes a fixed gain amplifier 51 and an addition unit 52. The fixed gain amplifier 51 adjusts the gain of the live sound average level with a fixed gain K1 and supplies it to the adder 52. The adder 52 adds the live sound average level gain-controlled by the gain K1 supplied from the fixed gain amplifier 51 and the average level of the voice main signal Sv supplied from the average level detector 31 to the center. This is supplied to the gain control signal generator 33 as the balance correction average level. The configuration of the center balance correction unit 32 may be another configuration as long as a value that can be defined as the center balance correction average level is obtained.

［臨場音バランス補正部３５の構成例］
次に、図１５を参照して、臨場音バランス補正部３５の構成例について説明する。臨場音バランス補正部３５は、固定ゲインアンプ６１、および加算部６２より構成されている。固定ゲインアンプ６１は、臨場音平均レベルを固定のゲインＫ２でゲイン制御し加算部６２に供給する。加算部６２は、固定ゲインアンプ６１より供給されてくるゲインＫ２でゲイン制御された臨場音平均レベルと、平均レベル検出部３１より供給されてくる声主体信号Ｓｖの平均レベルとを加算して臨場音バランス補正平均レベルとしてゲイン制御信号生成部３６に供給する。尚、臨場音バランス補正部３５の構成については、臨場音バランス補正平均レベルとして定義できる値が求められるものであれば他の構成であってもよい。 [Configuration example of the realistic sound balance correction unit 35]
Next, a configuration example of the live sound balance correction unit 35 will be described with reference to FIG. The live sound balance correction unit 35 includes a fixed gain amplifier 61 and an addition unit 62. The fixed gain amplifier 61 controls the gain of the live sound average level with a fixed gain K 2 and supplies it to the adder 62. The adding unit 62 adds the actual sound average level gain-controlled by the gain K2 supplied from the fixed gain amplifier 61 and the average level of the voice main signal Sv supplied from the average level detecting unit 31 to the presence. This is supplied to the gain control signal generator 36 as the sound balance correction average level. In addition, about the structure of the realistic sound balance correction | amendment part 35, if the value which can be defined as a realistic sound balance correction average level is calculated | required, another structure may be sufficient.

すなわち、センターバランス補正部３２、および臨場音バランス補正部３５は、それぞれ異なるゲインＫ１，Ｋ２により臨場音平均レベルを、声主体信号Ｓｖの平均レベルに加算することで、それぞれセンターバランス補正平均レベルおよび臨場音バランス補正平均レベルを求めている。したがって、このゲインＫ１，Ｋ２を様々に変化させることにより、声主体信号Ｓｖと臨場音とを様々な割合で変化させることができ、声主体信号Ｓｖと臨場音とのバランスを調整することが可能となる。 That is, the center balance correction unit 32 and the live sound balance correction unit 35 add the live sound average level to the average level of the voice main signal Sv with different gains K1 and K2, respectively. The average level of realistic sound balance correction is obtained. Therefore, by changing the gains K1 and K2 in various ways, the voice main signal Sv and the real sound can be changed at various ratios, and the balance between the voice main signal Sv and the real sound can be adjusted. It becomes.

［臨場音レベル補正ゲイン生成部３７の構成例］
次に、図１６を参照して、臨場音レベル補正ゲイン生成部３７の構成例について説明する。 [Configuration Example of Realistic Sound Level Correction Gain Generation Unit 37]
Next, a configuration example of the live sound level correction gain generation unit 37 will be described with reference to FIG.

入力音声信号のレベル変動あるいは声主体信号Ｓｖのレベル変動が大きく、声主体信号Ｓｖの出力レベルのみを一定レベルにゲイン制御したときには、元の入力音声信号に対するバランスが悪化して、違和感を覚える場合がある。 When the input voice signal level fluctuation or the voice main signal Sv level fluctuation is large and only the output level of the voice main signal Sv is gain-controlled to a constant level, the balance with respect to the original input voice signal deteriorates and the user feels uncomfortable There is.

この臨場音レベル補正ゲイン生成部３７は、この問題を改善するものである。臨場音レベル補正ゲイン生成部３７は、ゲイン値変換テーブル部７１からなる。 The realistic sound level correction gain generation unit 37 improves this problem. The live sound level correction gain generation unit 37 includes a gain value conversion table unit 71.

ゲイン値変換テーブル部７１は、声主体信号Ｓｖに対するゲイン制御信号Ｇｙを入力信号として受けて、臨場音主体信号ＳｓＬ，ＳｓＲに対するゲイン制御信号Ｇｓを出力するものであり、ゲイン値変換テーブルメモリ（図示は省略）を有する。 The gain value conversion table unit 71 receives the gain control signal Gy for the voice main signal Sv as an input signal and outputs the gain control signal Gs for the real sound main signals SsL and SsR. Is omitted).

ゲイン値変換テーブル部７１が備えるゲイン値変換テーブルメモリに記憶されるゲイン値変換テーブル情報の例を説明するための図を図１７に示す。 FIG. 17 illustrates an example of gain value conversion table information stored in the gain value conversion table memory included in the gain value conversion table unit 71.

声主体信号Ｓｖのレベル変動が小さい場合あるいは入力音声信号全体のレベル変動が小さいときには、ゲイン制御信号Ｇｙによる声レベル補正ゲイン値は、Ｇｙ＝１を中心として大きく変化しない。 When the level fluctuation of the voice main signal Sv is small or the level fluctuation of the entire input voice signal is small, the voice level correction gain value by the gain control signal Gy does not change largely around Gy = 1.

このような場合には、前述した臨場音主体信号と、声主体信号とのバランスは、元の入力音声信号から大きく外れてはいないので、違和感を覚えることはない。このため、このようなレベル変動が小さい範囲では、臨場音主体信号ＳｓＬ，ＳｓＲは、固定ゲイン「１」のアンプを通じて出力してもよい。 In such a case, the above-described balance between the real sound main signal and the voice main signal is not greatly deviated from the original input audio signal, so that there is no sense of incongruity. Therefore, in the range where the level fluctuation is small, the live sound main signals SsL and SsR may be output through an amplifier having a fixed gain “1”.

そこで、この図１７の例においては、０．７５≦Ｇｙ≦１．２５の範囲では、臨場音主体信号に対するゲイン制御信号Ｇｓは、常にＧｓ＝１のゲイン値にする。 Therefore, in the example of FIG. 17, the gain control signal Gs for the live sound main signal is always set to a gain value of Gs = 1 in the range of 0.75 ≦ Gy ≦ 1.25.

そして、入力音声信号のレベル変動あるいは声主体信号Ｓｖのレベル変動が、このような小さいレベル変動範囲から逸脱する場合には、この例では、ゲイン制御信号Ｇｙに対して所定の比を持って、追従して、臨場音主体信号ＳｓＬ，ＳｓＲをゲイン制御する。 When the level fluctuation of the input voice signal or the level fluctuation of the voice main signal Sv deviates from such a small level fluctuation range, in this example, a predetermined ratio with respect to the gain control signal Gy is obtained. Following this, the real sound main signals SsL and SsR are gain-controlled.

すなわち、図１７の例においては、Ｇｙ＜０．７５となる入力レベルが大きくなる範囲では、Ｇｓ／Ｇｙ＝Ｋ１１（＝１／０．７５）の関係を持って、ゲイン制御信号Ｇｙから、臨場音主体信号ＳｓＬ，ＳｓＲに対するゲイン制御信号Ｇｓを出力するようにする。 That is, in the example of FIG. 17, in the range where the input level where Gy <0.75 becomes large, the relationship from Gs / Gy = K11 (= 1 / 0.75) is obtained from the gain control signal Gy. A gain control signal Gs for the main sound signals SsL and SsR is output.

また、Ｇｙ＞１．２５となる入力レベルが小さくなる範囲では、Ｇｓ／Ｇｙ＝Ｋ１２（＝２／２．５）の関係を持って、ゲイン制御信号Ｇｙから、臨場音主体信号ＳｓＬ，ＳｓＲに対するゲイン制御信号Ｇｓを出力するようにする。 Further, in the range where the input level where Gy> 1.25 is small, the relationship between Gs / Gy = K12 (= 2 / 2.5) and the gain control signal Gy to the main sound signals SsL, SsR. The gain control signal Gs is output.

このようにすることにより、声レベル補正ゲインが大きく変動した場合でも、臨場音レベル補正ゲインが、声レベル補正ゲインと一定の比を持って追従するので、声主体信号レベルに対する臨場音主体信号レベルのバランスが大きく開いてしまうことを防止できる。したがって、レベル変動が大きい場合においても自然なレベル遷移を実現することができるようになる。 By doing this, even if the voice level correction gain fluctuates greatly, the realistic sound level correction gain follows the voice level correction gain with a certain ratio, so the realistic sound main signal level relative to the main voice signal level. Can be prevented from being greatly opened. Therefore, natural level transition can be realized even when the level fluctuation is large.

ゲイン値変換テーブル部７１は、声主体信号Ｓｖに対するゲイン制御信号Ｇｙの値をゲイン値変換テーブルメモリの読み出しアドレス入力として、対応する臨場音主体信号に対するゲイン制御信号Ｇｓを読み出して、出力するように構成できる。 The gain value conversion table unit 71 uses the value of the gain control signal Gy for the voice main signal Sv as the read address input of the gain value conversion table memory, and reads and outputs the gain control signal Gs for the corresponding real sound main signal. Can be configured.

［図２の音量補正装置による音量補正処理］
音量補正装置は、ソフトウェア処理演算により各構成の機能を実現するようにすることができる。そこで、図１８のフローチャートを参照して、図２の音量補正装置による音量補正処理について説明する。 [Volume correction processing by the volume correction apparatus of FIG. 2]
The sound volume correction device can realize the functions of each component by software processing calculation. A volume correction process performed by the volume correction apparatus shown in FIG. 2 will be described with reference to the flowchart shown in FIG.

ステップＳ１において、センター集中定位信号検出部２１は、左右２チャンネルの入力音声信号ＳｉＬ，ＳｉＲより、左右チャンネルの中央（センター）に定位するセンター集中定位信号を抽出し、声主体信号Ｓｖとして検出して減算部２２，２３に出力する。 In step S1, the center concentration localization signal detector 21 extracts a center concentration localization signal localized at the center (center) of the left and right channels from the input audio signals SiL and SiR of the left and right channels, and detects them as a voice main signal Sv. To the subtracting units 22 and 23.

ステップＳ２において、減算部２２は、左チャンネルの音声信号ＳｉＬから、声主体信号Ｓｖを減算して、左チャンネルの臨場音主体信号ＳｓＬを生成し、可変ゲインアンプ２５に供給する。また、減算部２３は、右チャンネルの音声信号ＳｉＲから、声主体信号Ｓｖを減算して、右チャンネルの臨場音主体信号ＳｓＲを生成し、可変ゲインアンプ２６に供給する。 In step S 2, the subtracting unit 22 subtracts the voice main signal Sv from the left channel audio signal SiL to generate a left channel live sound main signal SsL and supplies it to the variable gain amplifier 25. Further, the subtracting unit 23 subtracts the voice main signal Sv from the right channel audio signal SiR to generate a right channel live sound main signal SsR and supplies it to the variable gain amplifier 26.

ステップＳ３において、平均レベル検出部３１は、声主体信号Ｓｖの平均レベルを検出して、その検出した平均レベルをセンターバランス補正部３２および臨場音バランス補正部３５に供給する。 In step S3, the average level detector 31 detects the average level of the voice main signal Sv and supplies the detected average level to the center balance corrector 32 and the live sound balance corrector 35.

ステップＳ４において、臨場音平均レベル検出部３４は、２チャンネル音声信号ＳｉＬ，ＳｉＲから、臨場音主体信号の平均レベルを検出して、その検出した臨場音主体信号の平均レベルをセンターバランス補正部３２および臨場音バランス補正部３５に供給する。 In step S4, the live sound average level detection unit 34 detects the average level of the live sound main signal from the two-channel audio signals SiL and SiR, and the detected average level of the live sound main signal is the center balance correction unit 32. And supplied to the live sound balance correction unit 35.

ステップＳ５において、センターバランス補正部３２は、声主体信号Ｓｖの平均レベルに、臨場音主体信号の平均レベルを所定のゲインＫ１によりゲイン制御した値を加算することにより補正して、ゲイン制御信号生成部３３に供給する。 In step S5, the center balance correction unit 32 corrects the average level of the live sound main signal by adding a value obtained by gain control with the predetermined gain K1 to the average level of the voice main signal Sv, thereby generating a gain control signal. To the unit 33.

ステップＳ６において、臨場音バランス補正部３５は、声主体信号Ｓｖの平均レベルに、臨場音主体信号の平均レベルを所定のゲインＫ２によりゲイン制御した値を加算することにより補正して、ゲイン制御信号生成部３６に供給する。 In step S6, the live sound balance correction unit 35 corrects the average level of the live sound main signal by adding a value obtained by gain control using the predetermined gain K2 to the average level of the voice main signal Sv, thereby obtaining a gain control signal. It supplies to the production | generation part 36.

ステップＳ７において、ゲイン制御信号生成部３３は、センターバランス補正部３２により、臨場音主体信号の平均レベルに基づいて補正された、声主体信号Ｓｖの平均レベルが、予め定めた基準レベルとなるようにするためのゲイン制御信号Ｇｖを生成する。そして、ゲイン制御信号生成部３３は、生成したゲイン制御信号Ｇｖを可変ゲインアンプ２４に供給する。 In step S7, the gain control signal generation unit 33 causes the center balance correction unit 32 to correct the average level of the voice main signal Sv, which is corrected based on the average level of the live sound main signal, to a predetermined reference level. A gain control signal Gv for generating Then, the gain control signal generation unit 33 supplies the generated gain control signal Gv to the variable gain amplifier 24.

ステップＳ８において、ゲイン制御信号生成部３６は、臨場音バランス補正部３５により、臨場音主体信号の平均レベルに基づいて補正された、声主体信号Ｓｖの平均レベルが、予め定めた基準レベルとなるようにするためのゲイン制御信号Ｇｙを生成する。そして、ゲイン制御信号生成部３６は、生成したゲイン制御信号Ｇｙを臨場音レベル補正ゲイン生成部３７に供給する。 In step S 8, the gain control signal generation unit 36 has the average level of the voice main signal Sv corrected by the real sound balance correction unit 35 based on the average level of the real sound main signal becomes a predetermined reference level. A gain control signal Gy is generated for this purpose. Then, the gain control signal generation unit 36 supplies the generated gain control signal Gy to the live sound level correction gain generation unit 37.

ステップＳ９において、臨場音レベル補正ゲイン生成部３７は、ゲイン制御信号生成部３６からのゲイン制御信号Ｇｙを受けて、このゲイン制御信号Ｇｙに基づく臨場音レベル補正ゲイン制御信号生成処理を行って、臨場音主体信号をゲイン補正する、臨場音レベル補正ゲインであるゲイン制御信号Ｇｓを生成し、可変ゲインアンプ２５，２６に供給する。 In step S9, the live sound level correction gain generation unit 37 receives the gain control signal Gy from the gain control signal generation unit 36, performs live sound level correction gain control signal generation processing based on the gain control signal Gy, A gain control signal Gs, which is a live sound level correction gain, for gain correction of the live sound main signal is generated and supplied to the variable gain amplifiers 25 and 26.

［臨場音レベル補正ゲイン制御信号生成処理］
ここで、図１９のフローチャートを参照して、臨場音レベル補正ゲイン制御信号生成処理について説明する。 [Realistic sound level correction gain control signal generation processing]
Here, the realistic sound level correction gain control signal generation processing will be described with reference to the flowchart of FIG.

ステップＳ２１において、臨場音レベル補正ゲイン生成部３７は、入力された声主体信号についてのゲイン制御信号Ｇｙのゲイン値を検知する In step S21, the live sound level correction gain generating unit 37 detects the gain value of the gain control signal Gy for the input voice main signal.

ステップＳ２１において、臨場音レベル補正ゲイン生成部３７は、ゲイン値変換テーブル部７１に基づいて、そのゲイン値Ｇｙが、Ｇｙ＜０．７５であるか否か判定する。ステップＳ２１において、例えば、ゲイン値Ｇｙ＜０．７５である場合、処理は、ステップＳ２３に進む。 In step S 21, the live sound level correction gain generation unit 37 determines whether the gain value Gy is Gy <0.75 based on the gain value conversion table unit 71. In step S21, for example, when the gain value Gy <0.75, the process proceeds to step S23.

ステップＳ２３において、臨場音レベル補正ゲイン生成部３７は、ゲイン値変換テーブル部７１に基づいて、Ｇｓ＝Ｋ１１×Ｇｙなる演算により臨場音主体信号ＳｓＬ，ＳｓＲに対する、臨場音レベル補正ゲインであるゲイン制御信号Ｇｓを算出する。 In step S23, the live sound level correction gain generation unit 37 performs gain control that is the live sound level correction gain for the live sound main signals SsL and SsR based on the gain value conversion table unit 71 by the calculation of Gs = K11 × Gy. The signal Gs is calculated.

一方、ステップＳ２２において、Ｇｙ＜０．７５ではないと判定された場合、ステップＳ２４において、臨場音レベル補正ゲイン生成部３７は、ゲイン値変換テーブル部７１に基づいて、Ｇｙ＞１．２５であるか否かを判定する。ステップＳ２４において、例えば、Ｇｙ＞１．２５である場合、処理は、ステップＳ２５に進む。ステップＳ２５において、臨場音レベル補正ゲイン生成部３７は、ゲイン値変換テーブル部７１に基づいて、Ｇｓ＝Ｋ１２×Ｇｙなる演算により臨場音主体信号ＳｓＬ，ＳｓＲに対する臨場音レベル補正ゲインであるゲイン制御信号Ｇｓを算出する。 On the other hand, when it is determined in step S22 that Gy <0.75 is not satisfied, the live sound level correction gain generation unit 37 satisfies Gy> 1.25 based on the gain value conversion table unit 71 in step S24. It is determined whether or not. In step S24, for example, if Gy> 1.25, the process proceeds to step S25. In step S25, the live sound level correction gain generation unit 37 is a gain control signal that is a live sound level correction gain for the live sound main signals SsL and SsR based on the gain value conversion table unit 71 by a calculation of Gs = K12 × Gy. Gs is calculated.

ステップＳ２４において、例えば、Ｇｙ＞１．２５ではない場合、ステップＳ２６において、臨場音レベル補正ゲイン生成部３７は、ゲイン値変換テーブル部７１に基づいて、０．７５≦Ｇｙ≦１．２５であることを確認して、Ｇｓ＝１とする。 In step S24, for example, when Gy> 1.25 is not satisfied, in step S26, the live sound level correction gain generation unit 37 satisfies 0.75 ≦ Gy ≦ 1.25 based on the gain value conversion table unit 71. Confirm that Gs = 1.

そして、ステップＳ２３，Ｓ２４、およびＳ２６の処理が終了した後、処理は、ステップＳ２１に戻り、それ以降の処理が繰り返される。 And after the process of step S23, S24, and S26 is complete | finished, a process returns to step S21 and the process after it is repeated.

なお、上述の説明のゲイン値の数値は、一例であり、これに限られるものではないことはいうまでもない。そして、ゲイン値Ｇｙ＝１を中心とするレベル変動が小さい範囲、すなわち、α≦Ｇｙ≦βの範囲においては、上述の例では、１−α＝β−１としたが、１−α≠β−１であっても勿論良い。 Needless to say, the numerical value of the gain value described above is merely an example, and the present invention is not limited to this. In the range where the level fluctuation around the gain value Gy = 1 is small, that is, in the range of α ≦ Gy ≦ β, 1−α = β-1 in the above example, but 1−α ≠ β Of course, it may be -1.

また、Ｇｙ＜αの範囲での比の値Ｋ１１と、Ｇｙ＞βの範囲での比の値Ｋ１２も、一例であり、また、Ｋ１１＝Ｋ１２であってもよい。 Also, the ratio value K11 in the range of Gy <α and the ratio value K12 in the range of Gy> β are examples, and K11 = K12 may be used.

ここで、図１８のフローチャートの説明に戻る。 Now, the description returns to the flowchart of FIG.

ステップＳ１０において、可変ゲインアンプ２５，２６は、臨場音レベル補正ゲイン生成部３７から供給されてくる臨場音レベル補正ゲインであるゲイン制御信号Ｇｓに基づいて、左右チャンネルの臨場音主体信号ＳｓＬ，ＳｓＲに対して、ゲイン制御を行い、それぞれ加算部２７，２８に供給する。 In step S10, the variable gain amplifiers 25 and 26, based on the gain control signal Gs, which is the live sound level correction gain supplied from the live sound level correction gain generation unit 37, the live sound main signals SsL and SsR for the left and right channels. The gain is controlled and supplied to the adders 27 and 28, respectively.

ステップＳ１１において、可変ゲインアンプ２４は、ゲイン制御信号生成部３３は、生成したゲイン制御信号Ｇｖに基づいて、声主体信号Ｓｖに対して、ゲイン制御を行い、補正後の声主体信号Ｓｖｃを加算部２７，２８に供給する。 In step S11, the variable gain amplifier 24, the gain control signal generation unit 33 performs gain control on the voice main signal Sv based on the generated gain control signal Gv, and adds the corrected voice main signal Svc. Parts 27 and 28.

ステップＳ１２において、加算部２７は、左チャンネルの臨場音主体信号ＳｓＬと補正後声主体信号Ｓｖｃとを加算し、その加算出力として、音量補正された左チャンネルの出力音声信号ＳｏＬを出力する。また、加算部２８は、右チャンネルの臨場音主体信号ＳｓＲと補正後声主体信号Ｓｖｃとを加算し、その加算出力として、音量補正された右チャンネルの出力音声信号ＳｏＲを出力する。 In step S12, the adding unit 27 adds the left channel presence sound main signal SsL and the corrected voice main signal Svc, and outputs the output sound signal SoL of the left channel whose volume has been corrected as the addition output. Further, the adder 28 adds the right-channel live sound main signal SsR and the corrected voice main signal Svc, and outputs the volume-corrected right-channel output audio signal SoR as the addition output.

以上の処理により、臨場音主体信号、および声主体信号が、ゲインＫ１，Ｋ２の調整により、様々な割合でバランスを取ることが可能になると共に、声主体信号について生じる大きなレベル変化点での音量レベルの揺れを抑制することが可能となる。結果として、臨場音と声主体信号とのバランスを調整しつつ、声主体信号Ｓｖｃの音量レベルの揺れが、目立たなくなり、聴取者に対する違和感が軽減される。 With the above processing, the live sound main signal and the voice main signal can be balanced at various ratios by adjusting the gains K1 and K2, and the volume at the large level change point generated for the voice main signal. It is possible to suppress level fluctuation. As a result, while adjusting the balance between the actual sound and the voice main signal, the fluctuation of the volume level of the voice main signal Svc becomes inconspicuous, and the uncomfortable feeling for the listener is reduced.

＜２．第２の実施の形態＞
［音量補正装置の第２の実施形態］
以上においては、声主体信号Ｓｖの平均レベルに対して、臨場音平均レベルに基づいたバランス補正を行うことにより、補正された声主体信号Ｓｖの平均レベルを用いて、ゲイン制御信号Ｇｖ，Ｇｓを生成する例について説明してきたが、バランス補正ができれば、補正する信号は別の信号であってもよい。 <2. Second Embodiment>
[Second Embodiment of Volume Correction Device]
In the above, the gain control signals Gv and Gs are obtained using the average level of the corrected voice main signal Sv by performing the balance correction based on the average level of the actual sound with respect to the average level of the voice main signal Sv. Although an example of generation has been described, the signal to be corrected may be another signal as long as balance correction can be performed.

例えば、声主体信号Ｓｖの平均レベルに基づいて求められるゲイン制御信号を、声主体信号、および臨場音主体信号のそれぞれについて、臨場音平均レベルに基づいたバランス補正を行うことによりゲイン制御信号Ｇｖ，Ｇｙを生成するようにしてもよい。 For example, the gain control signal Gv, which is obtained by performing the balance correction based on the actual sound average level for each of the voice main signal and the real sound main signal is obtained from the gain control signal obtained based on the average level of the main voice signal Sv. Gy may be generated.

図２０は、声主体信号Ｓｖの平均レベルに基づいて求められるゲイン制御信号を、声主体信号、および臨場音主体信号のそれぞれについて、臨場音平均レベルに基づいてバランスを補正することによりゲイン制御信号Ｇｖ，Ｇｙを生成する音量補正装置の構成を示している。尚、図２０の音量補正装置において、図２の音量補正装置における構成と同一の機能を備えた構成については、同一の符号を付しており、その説明は適宜省略するものとする。図２０の音量補正装置において、図２の音量補正装置と異なる構成は、センターバランス補正部３２、ゲイン制御信号生成部３３、臨場音バランス補正部３５、およびゲイン制御信号生成部３６に代えて、ゲイン制御信号生成部１０１−１、センターバランス補正部１０３、ゲイン制御信号生成部１０１−２、および臨場音バランス補正部１０４を設け、さらに、バランス補正ゲイン生成部１０２を備えていることである。 FIG. 20 shows a gain control signal obtained based on the average level of the voice main signal Sv by correcting the balance of each of the voice main signal and the real sound main signal based on the real sound average level. The structure of the sound volume correction | amendment apparatus which produces | generates Gv and Gy is shown. In the sound volume correction apparatus in FIG. 20, the same reference numerals are given to the components having the same functions as those in the sound volume correction apparatus in FIG. 2, and the description thereof will be omitted as appropriate. 20 differs from the volume correction device of FIG. 2 in that the center balance correction unit 32, the gain control signal generation unit 33, the live sound balance correction unit 35, and the gain control signal generation unit 36 are replaced with each other. A gain control signal generation unit 101-1, a center balance correction unit 103, a gain control signal generation unit 101-2, and a live sound balance correction unit 104 are provided, and a balance correction gain generation unit 102 is further provided.

ゲイン制御信号生成部１０１−１，１０１−２は、いずれも平均レベル検出部３１より供給されてくる声主体信号Ｓｖの平均レベルが、予め定めた基準レベルとなるためのゲイン制御信号を生成し、それぞれセンターバランス補正部１０３、および臨場音バランス補正部１０４に供給する。 The gain control signal generation units 101-1 and 101-2 each generate a gain control signal for the average level of the main voice signal Sv supplied from the average level detection unit 31 to be a predetermined reference level. Are supplied to the center balance correction unit 103 and the live sound balance correction unit 104, respectively.

バランス補正ゲイン生成部１０２は、臨場音平均レベル検出部３４より供給されてくる臨場音平均レベルに基づいて、声主体信号Ｓｖの平均レベルを予め定めた基準レベルとなるためのゲイン制御信号を、声主体信号Ｓｖと臨場音主体信号とのバランスを補正するためのバランス補正ゲインＫ３，Ｋ４を生成し、それぞれセンターバランス補正部１０３、および臨場音バランス補正部１０４に供給する。より詳細には、バランス補正ゲイン生成部１０２は、図２１で示されるような、臨場音平均レベルに応じて設定されるバランス補正ゲインのテーブルを記憶しており、臨場音平均レベルに応じたバランス補正ゲインＫ３，Ｋ４を読み出して出力している。図２１で示されるように、声主体信号Ｓｖを補正するゲイン制御信号を補正するためのバランス補正ゲインＫ３は、臨場音平均レベルに比例して大きくなるように設定されている。これに対して、臨場音信号ＳｉＬ，ＳｉＲを補正するゲイン制御信号を補正するためのバランス補正ゲインＫ４は、臨場音平均レベルに比例して小さくなるように設定されている。尚、バランス補正ゲインＫ３，Ｋ４については、臨場音平均レベルに対して、図２１で示されるような関係とは異なる関係となるものであってもよく、例えば、バランス補正ゲインＫ３，Ｋ４が、それぞれ臨場音平均レベルに対して、それぞれ図２１の場合と逆の関係となるようなものであってもよい。 Based on the live sound average level supplied from the live sound average level detector 34, the balance correction gain generator 102 generates a gain control signal for setting the average level of the voice main signal Sv to a predetermined reference level. Balance correction gains K3 and K4 for correcting the balance between the voice main signal Sv and the real sound main signal are generated and supplied to the center balance correction unit 103 and the real sound balance correction unit 104, respectively. More specifically, the balance correction gain generation unit 102 stores a table of balance correction gains set according to the realistic sound average level as shown in FIG. 21, and the balance according to the realistic sound average level. Correction gains K3 and K4 are read and output. As shown in FIG. 21, the balance correction gain K3 for correcting the gain control signal for correcting the voice main signal Sv is set to increase in proportion to the actual sound average level. On the other hand, the balance correction gain K4 for correcting the gain control signal for correcting the realistic sound signals SiL and SiR is set to be smaller in proportion to the realistic sound average level. The balance correction gains K3 and K4 may be different from the relationship shown in FIG. 21 with respect to the actual sound average level. For example, the balance correction gains K3 and K4 may be Each of the actual sound average levels may have a reverse relationship to that in FIG.

センターバランス補正部１０３は、ゲイン制御信号生成部１０１−１より供給されてくるゲイン制御信号をバランス補正ゲイン生成部１０２より供給されてくるバランス補正ゲインＫ３により補正してゲイン制御信号Ｇｖを生成し、可変ゲインアンプ２４に供給する。 The center balance correction unit 103 corrects the gain control signal supplied from the gain control signal generation unit 101-1 with the balance correction gain K3 supplied from the balance correction gain generation unit 102 to generate the gain control signal Gv. To the variable gain amplifier 24.

臨場音バランス補正部１０４は、ゲイン制御信号生成部１０１−２より供給されてくるゲイン制御信号を、バランス補正ゲイン生成部１０２より供給されてくるバランス補正ゲインＫ４により補正してゲイン制御信号Ｇｙを生成し、臨場音レベル補正ゲイン生成部３７に供給する。 The live sound balance correction unit 104 corrects the gain control signal supplied from the gain control signal generation unit 101-2 with the balance correction gain K4 supplied from the balance correction gain generation unit 102, thereby obtaining the gain control signal Gy. Generated and supplied to the live sound level correction gain generation unit 37.

すなわち、図２の音量調整部１８においては、声主体信号Ｓｖの平均レベルが、臨場音平均レベルに基づいてバランス補正がなされ、補正後の声主体信号Ｓｖの平均レベルからゲイン制御信号が生成されていた。これに対して、図２０の音量調整部１８においては、声主体信号Ｓｖの平均レベルに基づいてゲイン制御信号が生成されて、臨場音平均レベルに基づいて、声主体信号と臨場音とのバランスを調整するために設定されるバランス補正ゲインでバランス補正されている。 That is, in the volume adjusting unit 18 in FIG. 2, the average level of the voice main signal Sv is balance-corrected based on the actual sound average level, and a gain control signal is generated from the corrected average level of the voice main signal Sv. It was. On the other hand, in the volume adjusting unit 18 in FIG. 20, a gain control signal is generated based on the average level of the voice main signal Sv, and the balance between the voice main signal and the real sound is based on the real sound average level. The balance is corrected with the balance correction gain set to adjust the frequency.

［センターバランス補正部１０３の構成例］
次に、図２２を参照して、センターバランス補正部１０３の構成例について説明する。センターバランス補正部１０３は、可変ゲインアンプ１２１により構成されている。可変ゲインアンプ１２１は、バランス補正ゲイン生成部１０２より供給されてくるバランス補正ゲインＫ３により、声主体信号Ｓｖの平均レベルに基づいて生成されるゲイン制御信号をゲイン制御して、ゲイン制御信号Ｇｖを生成して可変ゲインアンプ２４に供給する。 [Configuration Example of Center Balance Correction Unit 103]
Next, a configuration example of the center balance correction unit 103 will be described with reference to FIG. The center balance correction unit 103 includes a variable gain amplifier 121. The variable gain amplifier 121 gain-controls the gain control signal generated based on the average level of the voice main signal Sv by the balance correction gain K3 supplied from the balance correction gain generation unit 102, and generates the gain control signal Gv. Generated and supplied to the variable gain amplifier 24.

［臨場音バランス補正部１０４の構成例］
次に、図２３を参照して、臨場音バランス補正部１０４の構成例について説明する。臨場音バランス補正部１０４は、可変ゲインアンプ１３１により構成されている。可変ゲインアンプ１３１は、バランス補正ゲイン生成部１０２より供給されてくるバランス補正ゲインＫ４により、声主体信号Ｓｖの平均レベルに基づいて生成されるゲイン制御信号をゲイン制御して、ゲイン制御信号Ｇｙを生成して臨場音レベル補正ゲイン生成部３７に供給する。 [Configuration example of the realistic sound balance correction unit 104]
Next, a configuration example of the realistic sound balance correction unit 104 will be described with reference to FIG. The live sound balance correction unit 104 includes a variable gain amplifier 131. The variable gain amplifier 131 gain-controls the gain control signal generated based on the average level of the voice main signal Sv by the balance correction gain K4 supplied from the balance correction gain generation unit 102, and generates the gain control signal Gy. Generated and supplied to the live sound level correction gain generator 37.

［図２０の音量補正装置による音量補正処理］
図２０音量補正装置についても、ソフトウェア処理演算により各構成の機能を実現するようにすることができる。そこで、図２４のフローチャートを参照して、図２０の音量補正装置による音量補正処理について説明する。尚、図２４のフローチャートにおけるステップＳ３１乃至Ｓ３４、およびＳ３９乃至Ｓ４２の処理については、図１８のフローチャートを参照して説明した処理と同一の処理であるので、その説明は省略する。 [Volume Correction Processing by Volume Correction Device in FIG. 20]
Also in the sound volume correction apparatus in FIG. 20, the function of each component can be realized by software processing calculation. Therefore, the volume correction processing by the volume correction apparatus of FIG. 20 will be described with reference to the flowchart of FIG. Note that the processing of steps S31 to S34 and S39 to S42 in the flowchart of FIG. 24 is the same as the processing described with reference to the flowchart of FIG.

すなわち、ステップＳ３１乃至Ｓ３４の処理により、平均レベル検出部３１により声主体信号Ｓｖの平均レベルが求められ、臨場音平均レベル検出部３４により臨場音平均レベルが求められると処理は、ステップＳ３５に進む。 That is, when the average level detection unit 31 obtains the average level of the voice main signal Sv by the processing of steps S31 to S34 and the presence sound average level detection unit 34 obtains the presence sound average level, the processing proceeds to step S35. .

ステップＳ３５において、ゲイン制御信号生成部１０１−１，１０１−２は、声主体信号Ｓｖの平均レベルが、予め定めた基準レベルとなるためのゲイン制御信号を生成し、それぞれセンターバランス補正部１０３、および臨場音バランス補正部１０４に供給する。 In step S35, the gain control signal generation units 101-1 and 101-2 generate gain control signals for the average level of the voice main signal Sv to be a predetermined reference level, and the center balance correction unit 103, And supplied to the live sound balance correction unit 104.

ステップＳ３６において、バランス補正ゲイン生成部１０２は、臨場音平均レベル検出部３４より供給されてくる臨場音平均レベルに基づいて、声主体信号Ｓｖの平均レベルを予め定めた基準レベルとなるためのゲイン制御信号を、声主体信号Ｓｖと臨場音とのバランスを補正するためのバランス補正ゲインＫ３，Ｋ４を生成し、それぞれセンターバランス補正部１０３、および臨場音バランス補正部１０４に供給する。 In step S36, the balance correction gain generation unit 102 gains the average level of the voice main signal Sv to a predetermined reference level based on the actual sound average level supplied from the actual sound average level detection unit 34. The control signals are generated as balance correction gains K3 and K4 for correcting the balance between the voice main signal Sv and the real sound, and are supplied to the center balance correction unit 103 and the real sound balance correction unit 104, respectively.

ステップＳ３７において、センターバランス補正部１０３は、ゲイン制御信号生成部１０１−１より供給されてくるゲイン制御信号をバランス補正ゲイン生成部１０２より供給されてくるバランス補正ゲインＫ３によりゲイン制御することで、バランス補正してゲイン制御信号Ｇｖを生成し、可変ゲインアンプ２４に供給する。 In step S 37, the center balance correction unit 103 performs gain control on the gain control signal supplied from the gain control signal generation unit 101-1 using the balance correction gain K 3 supplied from the balance correction gain generation unit 102. The balance correction is performed to generate a gain control signal Gv, which is supplied to the variable gain amplifier 24.

ステップＳ３８において、臨場音バランス補正部１０４は、ゲイン制御信号生成部１０１−２より供給されてくるゲイン制御信号をバランス補正ゲイン生成部１０２より供給されてくるバランス補正ゲインＫ４によりゲイン制御することで、バランス補正してゲイン制御信号Ｇｙを生成し、臨場音レベル補正ゲイン生成部３７に供給する。 In step S 38, the live sound balance correction unit 104 performs gain control on the gain control signal supplied from the gain control signal generation unit 101-2 using the balance correction gain K 4 supplied from the balance correction gain generation unit 102. Then, the balance correction is performed to generate the gain control signal Gy, which is supplied to the live sound level correction gain generation unit 37.

そして、ステップＳ３９においては、生成されたゲイン制御信号Ｇｙが、臨場音レベル補正ゲイン生成部３７により臨場音レベル補正ゲイン生成処理されることにより、ゲイン制御信号Ｇｓが生成され、可変ゲインアンプ２５，２６に供給される。これにより、ステップＳ４０において、可変ゲインアンプ２５，２６が、臨場音主体信号ＳｓＬ，ＳｓＲをそれぞれゲイン制御することで補正し加算部２７，２８に供給する。ステップＳ４１において、可変ゲインアンプ２４は、ゲイン制御信号Ｇｖに基づいて、声主体信号Ｓｖをゲイン制御して、加算部２７，２８に供給する。そして、ステップＳ４２において、加算部２７，２８は、補正後の左右チャンネルの臨場音主体信号ＳｓＬｃ，ＳｓＲｃと補正後声主体信号Ｓｖｃとをそれぞれ加算し、その加算出力として、音量補正された左右チャンネルの出力音声信号ＳｏＬ，ＳｏＲを出力する。 In step S39, the generated gain control signal Gy is subjected to the live sound level correction gain generation process by the live sound level correction gain generating unit 37, whereby the gain control signal Gs is generated, and the variable gain amplifier 25, 26. Thereby, in step S40, the variable gain amplifiers 25 and 26 correct the actual sound main signals SsL and SsR by performing gain control, respectively, and supply them to the adders 27 and 28. In step S41, the variable gain amplifier 24 performs gain control on the voice main signal Sv based on the gain control signal Gv, and supplies it to the adders 27 and 28. In step S42, the adding units 27 and 28 add the corrected main sound signals SsLc and SsRc of the left and right channels and the corrected voice main signal Svc, respectively, and the volume-corrected left and right channels as the added outputs. Output audio signals SoL and SoR.

以上の処理により、臨場音平均レベルに応じたバランス補正ゲインＫ３，Ｋ４を様々に設定することで、声主体信号Ｓｖと臨場音主体信号とのバランスを様々に変化させて調整することが可能になると共に、声主体信号Ｓｖの急激な変化においても、自然なレベル遷移を実現し、聴取者に対する違和感を軽減することが可能となる。 Through the above processing, it is possible to adjust the balance between the voice main signal Sv and the real sound main signal in various ways by variously setting the balance correction gains K3 and K4 according to the real sound average level. At the same time, a natural level transition can be realized even when the voice main signal Sv is suddenly changed, and the uncomfortable feeling to the listener can be reduced.

＜３．変形例＞
［音量補正装置の第２の実施の形態の変形例］
図２０の音量補正装置においては、センターバランス補正部１０３、および臨場音バランス補正部１０４のそれぞれの前段にゲイン制御信号生成部１０１−１，１０１−２をそれぞれ設けた例について説明してきたが、ゲイン制御信号生成部１０１−１，１０１−２はいずれも同一の構成であるので、共通となる１個のゲイン制御信号生成部１０１とするようにしてもよい。 <3. Modification>
[Modification of Second Embodiment of Volume Correction Device]
In the volume correction apparatus of FIG. 20, the example in which the gain control signal generation units 101-1 and 101-2 are provided in the previous stage of the center balance correction unit 103 and the live sound balance correction unit 104 has been described. Since the gain control signal generation units 101-1 and 101-2 have the same configuration, the gain control signal generation unit 101 may be a common one.

図２５は、ゲイン制御信号生成部１０１−１，１０１−２を、共通となる１個のゲイン制御信号生成部１０１とした音量補正装置の構成例を示している。 FIG. 25 shows a configuration example of a volume correction apparatus in which the gain control signal generation units 101-1 and 101-2 are made one common gain control signal generation unit 101.

すなわち、図２５で示されるように、平均レベル検出部３１の後段であって、センターバランス補正部１０３、および臨場音バランス補正部１０４のそれぞれに分岐する前の位置に共通のゲイン制御信号生成部１０１を１個にして設けるようにしてもよい。 That is, as shown in FIG. 25, a gain control signal generation unit common to positions after the average level detection unit 31 and before branching to the center balance correction unit 103 and the live sound balance correction unit 104. One 101 may be provided.

このような構成により、図２５の音量補正装置においても、図２０の音量補正装置における場合と同様の効果を奏することが可能となる。 With such a configuration, the sound volume correction apparatus in FIG. 25 can achieve the same effects as those in the sound volume correction apparatus in FIG.

＜４．第３の実施の形態＞
［音量補正装置の第３の実施形態］
以上においては、可変ゲインアンプ２５，２６のゲイン制御信号Ｇｙを、補正して、臨場音レベル補正ゲインであるゲイン制御信号Ｇｓを生成する臨場音レベル補正ゲイン生成部３７を設ける例について説明してきたが、ゲイン制御信号Ｇｙ＝ゲイン制御信号Ｇｓであっても、声主体信号Ｓｖの変化に対して自然にレベルを遷移させて聴取者に対する違和感を軽減しつつ、声主体信号と臨場音とのバランスを補正することが可能である。したがって、臨場音レベル補正ゲイン生成部３７を削除しても、第１の実施形態である図２の音量補正装置と同様の効果は得ることができる。 <4. Third Embodiment>
[Third embodiment of volume correction apparatus]
In the above, an example has been described in which the live sound level correction gain generation unit 37 that corrects the gain control signal Gy of the variable gain amplifiers 25 and 26 and generates the gain control signal Gs that is the live sound level correction gain is provided. However, even if the gain control signal Gy = the gain control signal Gs, the balance between the voice main signal and the real sound is reduced while the level is naturally shifted with respect to the change of the voice main signal Sv to reduce the sense of discomfort to the listener. Can be corrected. Therefore, even if the realistic sound level correction gain generation unit 37 is deleted, the same effect as that of the volume correction device of FIG. 2 according to the first embodiment can be obtained.

図２６は、第１の実施形態の音量補正装置より臨場音レベル補正ゲイン生成部３７を削除した音量補正装置の構成例を示している。 FIG. 26 shows a configuration example of a volume correction apparatus in which the live sound level correction gain generation unit 37 is deleted from the volume correction apparatus of the first embodiment.

すなわち、図２６の音量補正装置においては、ゲイン制御信号生成部３６により生成されるゲイン制御信号Ｇｙそのものが、ゲイン制御信号Ｇｓとして可変ゲインアンプ２５，２６に供給される。 26, the gain control signal Gy itself generated by the gain control signal generator 36 is supplied to the variable gain amplifiers 25 and 26 as the gain control signal Gs.

［図２６の音量補正装置による音量補正処理］
次に、図２７のフローチャートを参照して、図２６の音量補正装置による音量補正処理について説明する。ただし、図２７のフローチャートにおけるステップＳ５１乃至Ｓ６１の処理は、図１８のフローチャートにおけるステップＳ１乃至Ｓ８，Ｓ１０乃至Ｓ１２の処理と同様であるので、その説明は省略する。 [Volume Correction Processing by Volume Correction Device in FIG. 26]
Next, the volume correction processing by the volume correction apparatus of FIG. 26 will be described with reference to the flowchart of FIG. However, steps S51 to S61 in the flowchart of FIG. 27 are the same as steps S1 to S8 and S10 to S12 in the flowchart of FIG.

すなわち、以上の処理においても、声主体信号Ｓｖの変化に対して自然にレベルを遷移させて聴取者に対する違和感を軽減しつつ、声主体信号と臨場音とのバランスを調整することが可能である。 That is, even in the above processing, it is possible to adjust the balance between the voice main signal and the actual sound while reducing the sense of discomfort to the listener by naturally changing the level with respect to the change of the voice main signal Sv. .

＜５．第４の実施の形態＞
［音量補正装置の第４の実施形態］
また、第３の実施形態と同様に、第２の実施形態における音量補正装置の臨場音レベル補正ゲイン生成部３７を削除しても、その効果は得ることができる。 <5. Fourth Embodiment>
[Fourth Embodiment of Volume Correction Device]
Similarly to the third embodiment, the effect can be obtained even if the live sound level correction gain generation unit 37 of the volume correction device in the second embodiment is deleted.

図２８は、第２の実施形態の音量補正装置より臨場音レベル補正ゲイン生成部３７を削除した音量補正装置の構成例を示している。 FIG. 28 shows a configuration example of a volume correction apparatus in which the live sound level correction gain generation unit 37 is deleted from the volume correction apparatus of the second embodiment.

すなわち、図２８の音量補正装置においても、ゲイン制御信号生成部３６により生成されるゲイン制御信号Ｇｙそのものが、ゲイン制御信号Ｇｓとして可変ゲインアンプ２５，２６に供給される。 That is, also in the sound volume correction apparatus of FIG. 28, the gain control signal Gy itself generated by the gain control signal generator 36 is supplied to the variable gain amplifiers 25 and 26 as the gain control signal Gs.

［図２６の音量補正装置による音量補正処理］
次に、図２９のフローチャートを参照して、図２８の音量補正装置による音量補正処理について説明する。ただし、図２９のフローチャートにおけるステップＳ８１乃至Ｓ９１の処理は、図２４のフローチャートにおけるステップＳ３１乃至Ｓ３８，Ｓ４０乃至Ｓ４２の処理と同様であるので、その説明は省略する。 [Volume Correction Processing by Volume Correction Device in FIG. 26]
Next, the volume correction processing by the volume correction apparatus of FIG. 28 will be described with reference to the flowchart of FIG. However, steps S81 to S91 in the flowchart of FIG. 29 are the same as steps S31 to S38 and S40 to S42 in the flowchart of FIG.

すなわち、以上の処理においても、声主体信号Ｓｖの変化に対して自然にレベルを遷移させて聴取者に対する違和感を軽減しつつ、臨場音声主体信号と臨場音とのバランスを調整することが可能である。 That is, even in the above processing, it is possible to adjust the balance between the real sound main signal and the real sound while reducing the sense of discomfort to the listener by naturally changing the level with respect to the change of the voice main signal Sv. is there.

尚、以上においては、音声信号が２チャンネルの場合を例とした場合について説明してきたが、それよりも多くのチャンネル、すなわち、マルチチャンネルを扱う場合においても、本技術を適用することができる。すなわち、マルチチャンネルの音声信号に基づいて、センター集中定位信号を検出し、センター集中定位信号について、上述した声主体信号に対する処理と同様に処理し、その他の音声信号については、上述した臨場音と同様に処理することで対応することができる。この際、必要に応じて上述したバランス補正に必要なゲインを調整することで、様々な比率でバランスを補正することが可能である。また、マルチチャンネル毎に臨場音に対応した処理をすることで、バランス補正のバリエーションを増やすことも可能である。さらに、マルチチャンネルをダウンミックスして２チャンネルとし、上述した手法でバランスを補正することも可能である。 In the above description, the case where the audio signal has two channels has been described as an example. However, the present technology can also be applied to a case where more channels, that is, multi-channels are handled. That is, the center concentration localization signal is detected based on the multi-channel audio signal, the center concentration localization signal is processed in the same manner as the processing for the voice main signal described above, and the other sound signals are It can respond by processing similarly. At this time, the balance can be corrected at various ratios by adjusting the gain necessary for the balance correction described above as necessary. Also, it is possible to increase the variation of balance correction by performing processing corresponding to the realistic sound for each multi-channel. Furthermore, the multi-channel can be downmixed into two channels, and the balance can be corrected by the method described above.

本技術によれば、複数の成分を含む入力音声信号のうち、声主体信号などのセンター集中低位信号からなる第１成分主体信号の変化に対して自然にレベルを遷移させて聴取者に対する違和感を軽減しつつ、臨場音声主体信号などのセンター集中低位信号以外の信号とのバランスを補正することが可能である。 According to the present technology, among the input audio signals including a plurality of components, the level is naturally shifted with respect to the change of the first component main signal including the center concentrated low-level signal such as the voice main signal, so that the listener feels uncomfortable. While mitigating, it is possible to correct the balance with signals other than the center-concentrated low-level signal, such as the presence voice main signal.

ところで、上述した一連の処理は、ハードウェアにより実行させることもできるが、ソフトウェアにより実行させることもできる。一連の処理をソフトウェアにより実行させる場合には、そのソフトウェアを構成するプログラムが、専用のハードウェアに組み込まれているコンピュータ、または、各種のプログラムをインストールすることで、各種の機能を実行することが可能な、例えば汎用のパーソナルコンピュータなどに、記録媒体からインストールされる。 By the way, the series of processes described above can be executed by hardware, but can also be executed by software. When a series of processing is executed by software, a program constituting the software may execute various functions by installing a computer incorporated in dedicated hardware or various programs. For example, it is installed from a recording medium in a general-purpose personal computer or the like.

図３０は、汎用のパーソナルコンピュータの構成例を示している。このパーソナルコンピュータは、CPU(Central Processing Unit)１００１を内蔵している。CPU１００１にはバス１００４を介して、入出力インタ-フェイス１００５が接続されている。バス１００４には、ROM(Read Only Memory)１００２およびRAM(Random Access Memory)１００３が接続されている。 FIG. 30 shows a configuration example of a general-purpose personal computer. This personal computer incorporates a CPU (Central Processing Unit) 1001. An input / output interface 1005 is connected to the CPU 1001 via a bus 1004. A ROM (Read Only Memory) 1002 and a RAM (Random Access Memory) 1003 are connected to the bus 1004.

入出力インタ-フェイス１００５には、ユーザが操作コマンドを入力するキーボード、マウスなどの入力デバイスよりなる入力部１００６、処理操作画面や処理結果の画像を表示デバイスに出力する出力部１００７、プログラムや各種データを格納するハードディスクドライブなどよりなる記憶部１００８、LAN（Local Area Network）アダプタなどよりなり、インターネットに代表されるネットワークを介した通信処理を実行する通信部１００９が接続されている。また、磁気ディスク（フレキシブルディスクを含む）、光ディスク（CD-ROM(Compact Disc-Read Only Memory)、DVD(Digital Versatile Disc)を含む）、光磁気ディスク（ＭＤ(Mini Disc)を含む）、もしくは半導体メモリなどのリムーバブルメディア１０１１に対してデータを読み書きするドライブ１０１０が接続されている。 The input / output interface 1005 includes an input unit 1006 including an input device such as a keyboard and a mouse for a user to input an operation command, an output unit 1007 for outputting a processing operation screen and an image of the processing result to a display device, a program and various types A storage unit 1008 including a hard disk drive for storing data, a LAN (Local Area Network) adapter, and the like, and a communication unit 1009 for performing communication processing via a network represented by the Internet are connected. Also, a magnetic disk (including a flexible disk), an optical disk (including a CD-ROM (Compact Disc-Read Only Memory), a DVD (Digital Versatile Disc)), a magneto-optical disk (including an MD (Mini Disc)), or a semiconductor A drive 1010 for reading / writing data from / to a removable medium 1011 such as a memory is connected.

CPU１００１は、ROM１００２に記憶されているプログラム、または磁気ディスク、光ディスク、光磁気ディスク、もしくは半導体メモリ等のリムーバブルメディア１０１１から読み出されて記憶部１００８にインストールされ、記憶部１００８からRAM１００３にロードされたプログラムに従って各種の処理を実行する。RAM１００３にはまた、CPU１００１が各種の処理を実行する上において必要なデータなども適宜記憶される。 The CPU 1001 is read from a program stored in the ROM 1002 or a removable medium 1011 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, installed in the storage unit 1008, and loaded from the storage unit 1008 to the RAM 1003. Various processes are executed according to the program. The RAM 1003 also appropriately stores data necessary for the CPU 1001 to execute various processes.

尚、本明細書において、記録媒体に記録されるプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理は、もちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理を含むものである。 In this specification, the step of describing the program recorded on the recording medium is not limited to the processing performed in time series in the order described, but of course, it is not necessarily performed in time series. Or the process performed separately is included.

２０分離部，２１センター集中定位信号検出部，２４乃至２６可変ゲインアンプ，２７，２８加算部，３０補正ゲイン生成部，３７臨場音レベル補正ゲイン生成部 20 separation unit, 21 center centralized localization signal detection unit, 24-26 variable gain amplifier, 27, 28 addition unit, 30 correction gain generation unit, 37 realistic sound level correction gain generation unit

Claims

A first component main signal gain control unit that performs gain control and outputs a first component main signal having a part of the plurality of audio components as a main component among input audio signals composed of a plurality of audio components;
An other component signal gain control unit that performs gain control on the other component signals other than the first component in the input audio signal, and outputs them;
A first component signal average level detector for detecting an average level of the first component signal;
Other component signal average level detection unit for detecting an average level of the other component signal;
A first component main signal gain control signal output unit that outputs a first component gain control signal for gain control of the first component main signal based on the average level of the first component signal and the average level of the other component signals When,
An other component signal gain control signal output unit that outputs an other component gain control signal for gain control of the other component signal based on an average level of the first component signal and the other component signal average level;
The first component main signal gain control unit converts a first component main signal having a main component of a part of the plurality of audio components, out of the input audio signals including the plurality of audio components, to the first component main signal. Based on the gain control signal, gain control and output as a first component output signal,
The other component signal gain control unit performs gain control on the other component signal other than the first component based on the other component signal gain control signal, and outputs the other component signal as an other component output signal.

The first component main signal gain control signal output unit is
A first signal component average level correction unit that corrects by adding a signal obtained by multiplying the other component signal average level by a first predetermined amplification factor to the average level of the first component signal;
A first component signal gain that generates the first component signal gain control signal so that the first component output signal becomes an average level of the first component signal corrected by the first signal component average level correction unit. A control signal generator,
The other component signal gain control signal output unit is
A first signal component average level correcting unit that corrects the average level of the first component signal by adding a signal obtained by multiplying the other component signal average level by a second predetermined amplification factor;
Other component signal gain control signal generation for generating the other component signal gain control signal so that the other component output signal becomes an average level of the first component signal corrected by the first signal component average level correction unit. The voice processing device according to claim 1, comprising: a unit.

The first component main signal gain control signal output unit is
A first component gain control signal generation unit that generates a first component signal gain control signal so that the first component output signal has the first signal component average level;
The first component signal gain control signal is corrected by multiplying by a first amplification factor set in association with the other component signal average level, and the corrected first component signal gain control signal is output. One signal component average level correction unit,
The other component signal gain control signal output unit is
An other component gain control signal generating unit that generates an other component signal gain control signal so that the other component output signal has the first signal component average level;
Other signal component that corrects the other component signal gain control signal by multiplying by a second amplification factor set in association with the other component signal average level and outputs the corrected other component signal gain control signal The sound processing apparatus according to claim 1, further comprising an average level correction unit.

The first component gain control signal generation unit and the other component signal gain control signal output unit are common,
The first component gain control signal generated by the first component gain control signal generation unit and the other component signal gain control signal generated by the other component signal gain control signal output unit are the same. Voice processing device.

The audio processing apparatus according to claim 1, further comprising: an other signal component signal gain control signal correction unit that corrects the other component signal gain control signal output from the other component signal gain control signal output unit.

The sound processing apparatus according to claim 1, wherein the first component signal is a center concentrated localization signal.

The sound processing apparatus according to any one of claims 1 to 6, further comprising an adding unit that uses a summed output signal obtained by adding the first component output signal and the other component output signal as a sound output signal after volume correction. .

A first component main signal gain control unit that performs gain control and outputs a first component main signal having a part of the plurality of audio components as a main component among input audio signals composed of a plurality of audio components;
An other component signal gain control unit that performs gain control on the other component signals other than the first component in the input audio signal, and outputs them;
A first component signal average level detector for detecting an average level of the first component signal;
Other component signal average level detection unit for detecting an average level of the other component signal;
A first component main signal gain control signal output unit that outputs a first component gain control signal for gain control of the first component main signal based on the average level of the first component signal and the average level of the other component signals When,
An other component signal gain control signal output unit that outputs an other component gain control signal for gain control of the other component signal based on an average level of the first component signal and the other component signal average level;
The first component main signal gain control unit converts a first component main signal having a main component of a part of the plurality of audio components, out of the input audio signals including the plurality of audio components, to the first component main signal. Based on the gain control signal, gain control and output as a first component output signal,
The other component signal gain control unit is a sound processing method in the sound processing apparatus that performs gain control on the other component signal other than the first component based on the other component signal gain control signal and outputs the other component signal as the other component output signal. There,
In the first component main signal gain control unit, the first component main signal mainly including a part of the plurality of audio components among the input audio signals composed of the plurality of audio components is output after gain control. A first component main signal gain control step;
In the other component signal gain control unit, other component signal gain control step of performing gain control and outputting other component signals other than the first component in the input audio signal;
A first component signal average level detection step of detecting an average level of the first component signal in the first component signal average level detector;
In the other component signal average level detection unit, other component signal average level detection step for detecting an average level of the other component signal;
A first component gain control signal for gain control of the first component main signal based on an average level of the first component signal and an average level of the other component signal in the first component main signal gain control signal output unit. A first component main signal gain control signal output step for outputting
The other component that outputs the other component gain control signal for gain control of the other component signal based on the average level of the first component signal and the average level of the other component signal in the other component signal gain control signal output unit. A signal gain control signal output step,
In the processing of the first component main signal gain control step, a first component main signal mainly including a part of the plurality of audio components among the input audio signals composed of the plurality of audio components is converted to the first component. Based on the main signal gain control signal, gain control and output as a first component output signal,
The processing of the other component signal gain control step is to perform gain control on the other component signals other than the first component based on the other component signal gain control signal and output the other component signals as other component output signals.

A first component main signal gain control unit that performs gain control and outputs a first component main signal having a part of the plurality of audio components as a main component among input audio signals composed of a plurality of audio components;
An other component signal gain control unit that performs gain control on the other component signals other than the first component in the input audio signal, and outputs them;
A first component signal average level detector for detecting an average level of the first component signal;
Other component signal average level detection unit for detecting an average level of the other component signal;
A first component main signal gain control signal output unit that outputs a first component gain control signal for gain control of the first component main signal based on the average level of the first component signal and the average level of the other component signals When,
An other component signal gain control signal output unit that outputs an other component gain control signal for gain control of the other component signal based on an average level of the first component signal and the other component signal average level;
The first component main signal gain control unit converts a first component main signal having a main component of a part of the plurality of audio components, out of the input audio signals including the plurality of audio components, to the first component main signal. Based on the gain control signal, gain control and output as a first component output signal,
The other component signal gain control unit performs gain control on the other component signal other than the first component based on the other component signal gain control signal, and outputs the other component signal as the other component output signal. ,
In the first component main signal gain control unit, the first component main signal mainly including a part of the plurality of audio components among the input audio signals composed of the plurality of audio components is output after gain control. A first component main signal gain control step;
In the other component signal gain control unit, other component signal gain control step of performing gain control and outputting other component signals other than the first component in the input audio signal;
A first component signal average level detection step of detecting an average level of the first component signal in the first component signal average level detector;
In the other component signal average level detection unit, other component signal average level detection step for detecting an average level of the other component signal;
A first component gain control signal for gain control of the first component main signal based on an average level of the first component signal and an average level of the other component signal in the first component main signal gain control signal output unit. A first component main signal gain control signal output step for outputting
The other component that outputs the other component gain control signal for gain control of the other component signal based on the average level of the first component signal and the average level of the other component signal in the other component signal gain control signal output unit. A process including a signal gain control signal output step,
In the processing of the first component main signal gain control step, a first component main signal mainly including a part of the plurality of audio components among the input audio signals composed of the plurality of audio components is converted to the first component. Based on the main signal gain control signal, gain control and output as a first component output signal,
The processing of the other component signal gain control step is a program for performing gain control on the other component signal other than the first component based on the other component signal gain control signal and outputting it as an other component output signal.

An electronic device comprising the audio processing device according to claim 1.