JP2006145818A

JP2006145818A - Audio signal adjusting device

Info

Publication number: JP2006145818A
Application number: JP2004335497A
Authority: JP
Inventors: Tetsuya Hayashi; 林　　哲也
Original assignee: Teac Corp
Current assignee: Teac Corp
Priority date: 2004-11-19
Filing date: 2004-11-19
Publication date: 2006-06-08

Abstract

PROBLEM TO BE SOLVED: To obtain output sound with characteristics that are more comfortable for the user.

SOLUTION: An audio signal adjusting device 10, which adjusts and outputs an input audio signal, comprises: a level determination section 12 that determines the input level, i.e. the level of the input audio signal; an input/output characteristic storage section 16 that stores, as input/output characteristics, the level input/output ratio set for each input level; and a level adjusting section 14 that adjusts the level of the input audio signal based on its input level and the input/output characteristics. The input/output characteristics include a low-voice characteristic, in which the level input/output ratio is raised for low input levels, and a nighttime characteristic, in which the level input/output ratio is lowered for high input levels.

COPYRIGHT: (C)2006, JPO&NCIPI

Description

The present invention relates to an audio signal adjusting device that applies predetermined signal processing to an input audio signal and outputs the result.

Devices that reproduce and output sound, such as DVD players, television receivers, and AV amplifiers, are equipped with an audio signal adjusting device that applies predetermined signal processing to the audio signal and outputs it. Such devices usually do not adjust the level of the audio signal: low-level (quiet) signals are output at a low level and high-level (loud) signals at a high level. This yields natural audio output with a wide range.

However, depending on the listening environment and the user's hearing, outputting sound at the same level as the input can cause problems. For example, when listening in a quiet environment such as late at night, high-level sound output at its original level may disturb people nearby. Conversely, in an environment with high ambient noise, such as during the daytime, low-level sound cannot be heard clearly. In addition, users with reduced hearing, such as elderly people, have difficulty hearing low-level sound.

Patent Document 1 therefore discloses an audio processing device that applies dynamic range compression to the level of an audio signal for hearing-impaired users. This device compresses the level of the input audio signal so that the output sound falls within the audible dynamic range. Because low-level sound is output at a relatively high level, it can be heard clearly even when ambient noise is high or the user is elderly.

Patent Document 1: JP 2000-22473 A

However, dynamic range compression of the level affects high-level sound as well as low-level sound, so high-level audio signals are output at a relatively low level. High-level sound such as an explosion in a movie is more powerful, and therefore preferable, when output at its original high level. Moreover, dynamic range compression narrows the range of the output sound as a whole, producing an undesirable, so-called "flat" sound.

An object of the present invention is therefore to provide an audio signal adjusting device that yields output sound with characteristics more comfortable for the user.

The audio signal adjusting device of the present invention adjusts and outputs an input audio signal, and comprises: level determination means for determining the input level, which is the level of the input audio signal; input/output characteristic storage means for storing a level input/output ratio set for each input level; and level adjusting means for adjusting the level of the input audio signal based on the input level of that signal and the level input/output ratio stored in the input/output characteristic storage means.

In a preferred aspect, the level input/output ratios stored in the input/output characteristic storage means all have the same value for input levels other than a specific level of interest. It is further desirable to have correction means that corrects the span between an audio signal at the level of interest and an audio signal at another level so that the level change between the two becomes gradual.

In another preferred aspect, the device further comprises frequency separation means for dividing the input audio signal into a plurality of frequency bands, and the level input/output ratio stored in the input/output characteristic storage means is set for each frequency band and input level. In another preferred aspect, the device comprises scene determination means for determining whether the input audio signal relates to conversation, and the level adjusting means adjusts the level only of audio signals relating to conversation.

In another preferred aspect, the input/output characteristic storage means stores at least one of: a low-voice input/output ratio, in which the level input/output ratio for low input levels is larger than that for other input levels; a nighttime input/output ratio, in which the level input/output ratio for high input levels is smaller than that for other input levels; and a hearing-aid input/output ratio, in which the level input/output ratio for the high frequency band is larger than that for other frequency bands. Preferably, the storage means stores at least two of these three ratios, and the user can select which of the stored ratios is used for level adjustment. In another preferred aspect, the user can set the values of the level input/output ratio stored in the input/output characteristic storage means.

According to the present invention, a level input/output ratio can be set for each level of the input audio signal, so the ratio for a specific input level alone can be increased or decreased to suit the listening environment and the user's hearing. This yields output sound with characteristics more comfortable for the user.

Embodiments of the present invention are described below with reference to the drawings. FIG. 1 is a block diagram showing the configuration of an audio signal adjusting device 10, a basic embodiment of the present invention. The device 10 is mounted in equipment that reproduces and outputs audio signals, such as a DVD player, television receiver, or AV amplifier; it adjusts the level of an input audio signal Ain and outputs the result as an output audio signal Aout. The input audio signal Ain is fed to a level determination unit 12, which determines its level (volume) and passes Ain, together with the determination result, to a level adjustment unit 14. The level adjustment unit 14 adjusts the level according to the level of Ain and outputs the output audio signal Aout. The adjustment is based on input/output characteristics stored in an input/output characteristic storage unit 16.
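The signal flow of FIG. 1 can be sketched roughly as follows. This is only an illustration under assumptions the text does not state: the level is taken as the RMS of a block of samples, expressed in dB relative to a hypothetical reference `ref`, and `target_level_for` stands in for whatever characteristic the storage unit 16 holds. All function names are hypothetical.

```python
import math


def block_level_db(samples, ref=1.0):
    """Level determination: RMS level of a block in dB relative to `ref` (assumed)."""
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    if rms == 0.0:
        return float("-inf")
    return 20.0 * math.log10(rms / ref)


def adjust_block(samples, target_level_for):
    """Level adjustment: scale the block so its level becomes target_level_for(input level)."""
    level_in = block_level_db(samples)
    if level_in == float("-inf"):
        return list(samples)  # a silent block is passed through unchanged
    level_out = target_level_for(level_in)
    gain = 10.0 ** ((level_out - level_in) / 20.0)  # dB difference -> linear gain
    return [s * gain for s in samples]
```

Here the determination and adjustment steps are kept as separate functions to mirror units 12 and 14; a real device would more likely track the level continuously rather than per block.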

FIG. 2 shows an example of the input/output characteristics stored in the input/output characteristic storage unit 16. In this example the characteristics are stored as a table relating the level of the input audio signal (input level) to the level of the output audio signal (output level). The level adjustment unit 14 adjusts the level of the input audio according to this table. For example, when the input level is 12 dB, the level is adjusted so that the output level is 28 dB; when the input level is 48 dB, the output level is 48 dB. Although this embodiment stores the input/output characteristics as a table, they may of course be stored in another form, such as a graph or a mathematical expression.
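A table of this kind could be looked up as in the sketch below. Only the pairs 12 dB to 28 dB and 48 dB to 48 dB come from the text; the other table entries, the linear interpolation between entries, and the clamping outside the table are assumptions made for illustration.

```python
from bisect import bisect_right

# (input level dB, output level dB).
# Only 12 -> 28 and 48 -> 48 are taken from the description; the rest are hypothetical.
IO_TABLE = [
    (0.0, 24.0),
    (12.0, 28.0),
    (24.0, 32.0),
    (36.0, 36.0),
    (48.0, 48.0),
    (60.0, 60.0),
]


def output_level(input_db, table=IO_TABLE):
    """Return the output level for an input level, interpolating between table rows."""
    xs = [x for x, _ in table]
    if input_db <= xs[0]:
        return table[0][1]   # clamp below the table (assumed behaviour)
    if input_db >= xs[-1]:
        return table[-1][1]  # clamp above the table (assumed behaviour)
    i = bisect_right(xs, input_db)
    (x0, y0), (x1, y1) = table[i - 1], table[i]
    return y0 + (y1 - y0) * (input_db - x0) / (x1 - x0)
```

With these values, `output_level(12.0)` gives 28.0 and `output_level(48.0)` gives 48.0, matching the two examples in the text.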

As FIG. 2 shows, the level input/output ratio in this embodiment depends on the input level: it is large when the input level is low (below 36 dB) and small when the input level is high (36 dB or above). The reason for this characteristic is explained with reference to FIG. 3, which plots several input/output characteristics with input level on the horizontal axis and output level on the vertical axis. The broken line is the general characteristic widely used in the past, the dash-dot line is the characteristic obtained with dynamic range compression, and the solid line is the characteristic of this embodiment.

Many conventional audio signal adjusting devices do not adjust the level of the audio signal. The output level therefore equals the input level, and the level input/output ratio is constant regardless of the input level (see the broken line in FIG. 3). Low-level (quiet) signals are output at a low level and high-level (loud) signals at a high level, which preserves the wide, natural range inherent in the input signal. Depending on the environment in which the output is heard (the listening environment) and the user's hearing, however, output at the input level may be undesirable. When ambient noise in the listening environment is high, low-level sound inevitably becomes hard to hear. Conversely, in quiet surroundings such as late at night, high-level audio output at its original level can be a source of noise and disturb the neighbors. Elderly users and others with reduced hearing also have difficulty hearing low-level sound.

Techniques have therefore been proposed that apply dynamic range compression to the audio signal, raising low-level signals to a relatively high level and lowering high-level signals to a relatively low level (see the dash-dot line in FIG. 3). With such a technique, audio that is inherently low-level is output at a level that is easy to hear, so even a user with reduced hearing can hear it clearly, as can a listener in high ambient noise. However, the technique affects audio signals at all levels, not only low-level ones. High-level audio that is impressive precisely because it is heard loud is output at a relatively low level, and the overall range of levels narrows, producing a so-called "flat" sound.

In this embodiment, therefore, the level input/output ratio can be set for each input level: large for input levels to be emphasized and small for input levels to be attenuated. In the example of FIG. 2, to make low-level audio easier to hear, the ratio is made large for low input levels (below 36 dB) and left at the normal value for all other input levels (36 dB or above). With this setting, low-level sound can be heard clearly even when ambient noise is high or hearing has declined with age, while high-level sound is output at its original level, giving natural and powerful output. The user is thus provided with more comfortable acoustic characteristics.
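The difference between the three curves of FIG. 3 can be illustrated with simple piecewise-linear functions. The 36 dB knee and the 12 dB to 28 dB point come from the text; the slopes and the compression pivot are hypothetical values chosen only so the shapes are easy to compare.

```python
KNEE_DB = 36.0  # boundary between "low" and "high" input levels in the FIG. 2 example


def unity(level_db):
    """Conventional characteristic: output level equals input level."""
    return level_db


def compressed(level_db, pivot_db=36.0, slope=0.5):
    """Dynamic range compression: every level is pulled toward the pivot (assumed values)."""
    return pivot_db + (level_db - pivot_db) * slope


def low_boost(level_db):
    """Characteristic of the embodiment: only levels below the knee are raised.

    Dividing the distance below the knee by 3 is an assumption; it happens to
    reproduce the 12 dB -> 28 dB example given for FIG. 2.
    """
    if level_db >= KNEE_DB:
        return level_db
    return KNEE_DB - (KNEE_DB - level_db) / 3.0
```

For a 60 dB input, `compressed` returns 48.0 while `low_boost` leaves it at 60.0, which is the point the description makes about loud passages keeping their impact.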

FIG. 4 shows the result of adjusting an audio signal according to the input/output characteristics of FIG. 2. The horizontal axis is time and the vertical axis is level; the broken line is the input level and the solid line is the output level. Input audio normally spans a range of levels, including low levels (below 36 dB), which are hard to hear if output unchanged. In this embodiment, low-level audio signals are raised to a relatively high level according to the characteristics of FIG. 2, so originally quiet sounds such as whispers become clearly audible. High-level sound is output at its original level, so an explosion in a movie, for example, loses none of its impact.

As described above, this embodiment allows a level input/output ratio to be set for each input level. Sound at an input level to be emphasized can be output at a relatively high level, and sound at an input level to be attenuated can be output at a relatively low level, without losing the natural range of the sound. The user thereby obtains output sound suited to their own hearing and environment.

To obtain a more natural transition, the span between audio signals adjusted with different level input/output ratios may be corrected. FIG. 5 illustrates this correction. The horizontal axis is time and the vertical axis is level. The broken line is the input audio, the solid line is the output audio level-adjusted according to the input/output characteristics of FIG. 2, and the thick solid line is the output audio after correction. The squares and circles mark sampling points of the audio signal. In the example of FIG. 5, the audio signal from P2 onward is below 36 dB and is therefore adjusted with a higher-than-normal level input/output ratio (see the solid line in FIG. 5), whereas the audio signal up to P1 is adjusted with the normal ratio. If the signal adjusted with the high ratio (from P2 onward) simply follows the signal adjusted with the normal ratio (up to P1), the transition often sounds unnatural. P2, adjusted with the higher-than-normal ratio, is therefore joined by a straight line to P3, which lies three points before P2. This brings the transition between signals with different level input/output ratios closer to a natural shape and gives better acoustic characteristics. The correction method shown here is only an example, and other methods may be used. In some cases the audio signal may be output with only the level adjustment based on the input/output characteristics, without correction.
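The straight-line correction between P3 and P2 might look like the following sketch. It operates on a list of already level-adjusted output levels, one per sampling point; the function name, the list representation, and the handling of a P2 that lies fewer than three points from the start are assumptions.

```python
def smooth_transition(levels, p2_index, span=3):
    """Join P3 (span points before P2) to P2 with a straight line.

    `levels` holds level-adjusted output levels per sampling point.
    Points strictly between P3 and P2 are replaced by linear interpolation;
    P3, P2 and everything outside that span are left unchanged.
    """
    p3_index = max(p2_index - span, 0)  # assumed: clip at the start of the data
    out = list(levels)
    n = p2_index - p3_index
    for k in range(1, n):
        t = k / n
        out[p3_index + k] = levels[p3_index] + t * (levels[p2_index] - levels[p3_index])
    return out
```

For example, with `levels = [48.0, 44.0, 40.0, 37.0, 34.0, 33.0]` and `p2_index=4`, the points at indices 2 and 3 are moved onto the line from 44.0 (P3) to 34.0 (P2), while the endpoints and the later samples are untouched.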

Although FIG. 2 shows, as one example, input/output characteristics with a high level input/output ratio for low-level input signals, other characteristics may be used as long as a level input/output ratio is set for each input level. For example, the characteristics shown in FIGS. 6(a) to 6(c) may be used.

The user may also be allowed to set the input/output characteristics. For example, the audio signal adjusting device 10 may be provided with a display means that shows an input/output characteristic diagram like that of FIG. 3, and the user may set the desired characteristics by operating switches or other operating means while viewing the diagram. Alternatively, several characteristics may be prepared in advance, such as a low-voice characteristic in which only the low-level input/output ratio is increased and a nighttime characteristic in which only the high-level input/output ratio is decreased, and the user may select the desired one according to the situation.

Furthermore, although this embodiment sets the level input/output ratio for each input level, the ratio may also be set for each frequency band. That is, as shown in FIGS. 8 to 10, a level input/output ratio may be set for each input level in each of a plurality of frequency bands, and the level adjusted with the ratio corresponding to the frequency and level of the input audio. In the case of FIG. 8, for example, the same 12 dB input level is adjusted to 30 dB for an input signal at 1 kHz and to 12 dB for an input signal at 5 kHz.
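A per-band table could be organized as in the sketch below. The two data points (12 dB at 1 kHz to 30 dB, 12 dB at 5 kHz to 12 dB) are from the text; the band edges, the remaining table values, and the nearest-row lookup are hypothetical.

```python
INPUT_LEVELS_DB = [12, 24, 36, 48]

# Output level per band for each tabulated input level.
# Only mid/12 -> 30 and high/12 -> 12 follow the description; the rest is assumed.
OUTPUT_DB_BY_BAND = {
    "low": [12, 24, 36, 48],
    "mid": [30, 33, 36, 48],
    "high": [12, 24, 36, 48],
}


def band_of(freq_hz):
    """Classify a frequency into a band (band edges are hypothetical)."""
    if freq_hz < 300.0:
        return "low"
    if freq_hz < 3000.0:
        return "mid"
    return "high"


def band_output_level(freq_hz, input_db):
    """Look up the output level using the nearest tabulated input level."""
    idx = min(
        range(len(INPUT_LEVELS_DB)),
        key=lambda i: abs(INPUT_LEVELS_DB[i] - input_db),
    )
    return OUTPUT_DB_BY_BAND[band_of(freq_hz)][idx]
```

A nearest-row lookup keeps the example short; interpolating between rows, as one might for a single-band table, would work equally well here.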

Taking frequency into account in this way gives more appropriate acoustic characteristics. In a movie, for example, low-level audio may be human speech or may be background music (BGM) or sound effects. If speech and sound effects are output at similar levels, the speech may be drowned out by the sound effects and become hard to hear. In that case it is desirable to set the level input/output ratio so that only low-level audio at frequencies corresponding to the human voice is output at a relatively high level.

A more specific embodiment is described next with reference to FIG. 7, which shows a signal adjusting device 20. The device 20 is mounted in AV equipment such as a DVD player or television; it applies predetermined signal processing to the input audio and video signals and outputs them. Conventional techniques can be used for the video signal processing, so its description is omitted here. The following mainly describes the processing of the audio signal.

An audio signal from an external device 22 is input to an encoder 28 via a DIR (digital interface receiver) 26 or an AD converter 24 (hereinafter "ADC 24"). The DIR 26 is an input interface for digital signals and accepts digital audio input. The ADC 24 is an input interface for analog signals and converts the analog input into a digital signal.

The encoder 28 encodes the input signal and outputs it to a digital signal processor 30 (hereinafter "DSP 30"). The DSP 30 is a processor that applies various kinds of signal processing to digital signals, and functions as the level determination means, frequency separation means, level adjusting means, and correction means. That is, the DSP 30 divides the input audio signal into frequency bands, determines the level, and adjusts the level according to the result. It then applies correction so that the level-adjusted audio connects naturally. Level adjustment is based on the input/output characteristics stored in a memory 32, which are described in detail later.

The level-adjusted and corrected audio signal is output to an external device 50 or a speaker 52. The external device 50 is video/audio output equipment such as a television receiver, and the speaker 52 is, for example, a speaker attached to the external device 50.

When the external device 50 can handle digital signals, the level-adjusted audio signal is output via a DIT (digital interface transmission) 44, an output interface for digital signals. When the external device 50 cannot handle digital signals, or when the audio signal is output to the speaker 52, the level-adjusted audio signal is decoded by a decoder 40, converted to an analog signal by a DA converter 42 (hereinafter "DAC 42"), and output. Where necessary, it is amplified by an electronic volume 46 and a power amplifier 48. The electronic volume 46 is an IC that amplifies analog signals and whose gain can be controlled digitally.

An OSD (on-screen display) controller 36 is a processor that creates and outputs setting screens for the various settings. It creates setting screens, for example the input/output characteristic setting screen described later, and outputs them to the external device 50, which shows them on its display. The user makes settings while viewing the screen. Settings are entered via a remote control 54 or the like, and the operation is received as an operation signal by a remote control light receiving unit 56. The operation signal is passed to a CPU 34, which, according to the signal, modifies the setting values stored in the memory 32 or instructs the OSD controller 36 to update the setting screen shown on the display.

The CPU 34 is a processor that controls the signal adjusting device as a whole and outputs control signals to the DSP 30, the OSD controller 36, and other components as needed. The CPU 34 also determines, from the input audio and video signals, the type of scene to which the signal belongs. A program such as a movie normally consists of many scenes, including conversation scenes in which characters talk and scenes consisting only of video and BGM. The CPU 34 determines whether the input signal belongs to a conversation scene. This can be determined by, for example, frequency analysis of the input audio signal; for a movie, it can also be determined from the presence or absence of subtitle information. The result is output to the DSP 30.

The scene type is determined because level adjustment is sometimes applied only to particular scenes. For example, if the user wants to hear quiet dialogue in a movie clearly, only the audio of conversation scenes needs level adjustment; scenes with no dialogue, in which only BGM or sound effects are heard, need none. The CPU 34 therefore determines the scene type, and the DSP 30 decides according to the result whether to adjust the level.
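The division of work between the CPU 34 (scene determination) and the DSP 30 (conditional level adjustment) can be sketched as below. The text names subtitle presence and frequency analysis as possible cues but gives no concrete rule, so the `speech_energy_ratio` input, its threshold, and both function names are hypothetical.

```python
def is_conversation_scene(has_subtitle, speech_energy_ratio=None, threshold=0.5):
    """Scene determination (hypothetical rule).

    A scene counts as conversation if subtitle information is present, or if
    the share of signal energy in a speech band, supplied by some frequency
    analysis not shown here, reaches the threshold.
    """
    if has_subtitle:
        return True
    return speech_energy_ratio is not None and speech_energy_ratio >= threshold


def process_block(samples, conversation, adjust):
    """Apply the level adjustment `adjust` only to conversation scenes."""
    if conversation:
        return adjust(samples)
    return list(samples)  # other scenes pass through unadjusted
```

Passing the decision in as a flag keeps the adjustment routine itself unaware of how the scene was classified, which mirrors the description's split between the two processors.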

Next, the input/output characteristics stored in the memory 32 will be described. The memory 32 stores a normal input/output characteristic, in which the level input/output ratio is constant regardless of input level and frequency; a user-defined input/output characteristic, which is defined by the user; and preset fixed input/output characteristics.

The normal input/output characteristic is one in which the level input/output ratio is always constant regardless of the level and frequency of the input signal. When the user gives no instruction regarding level adjustment, level adjustment is performed based on this normal input/output characteristic.

The fixed input/output characteristics are prepared in advance. In this embodiment, three are provided: the low-voice characteristic shown in FIG. 8, the nighttime characteristic shown in FIG. 9, and the hearing-aid characteristic shown in FIG. 10. FIGS. 8 to 10 each show an input/output characteristic, with the X axis (left-right direction in the drawing) representing frequency, the Y axis (oblique direction in the drawing) representing input level, and the Z axis representing output level. The user can select, as needed, which of the three fixed input/output characteristics to apply. When the user selects one, the level of the audio signal is adjusted based on that characteristic.

The low-voice characteristic shown in FIG. 8 is suited to cases where the user wants to hear quiet speech clearly even when environmental noise is loud. Accordingly, the level input/output ratio is higher when the input level is low (less than 36 dB) and the frequency corresponds to the human voice (200 Hz to 2 kHz) than in all other cases. When the level is adjusted based on this characteristic, audio signals outside the specific frequency range (200 Hz to 2 kHz), such as BGM and sound effects, and audio signals within that range but at or above the predetermined level are adjusted with the normal level input/output ratio. In other words, for these sounds, high-level signals are output at a high level and low-level signals at a low level. In contrast, input signals within the specific frequency range (200 Hz to 2 kHz) and below the predetermined level (36 dB) are adjusted to a relatively high level before output. As a result, quiet dialogue can be heard clearly, and the content enjoyed comfortably, even when environmental noise or sound effects are loud.
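The low-voice characteristic described above can be sketched as a lookup keyed on frequency and input level. This is only an illustrative sketch: the 200 Hz to 2 kHz band and the 36 dB threshold come from the text, while the ratio values, the function names, and the assumption that the output level equals the input level multiplied by the level input/output ratio are hypothetical.

```python
# Values taken from the description of FIG. 8.
VOICE_BAND_HZ = (200.0, 2000.0)  # frequency range corresponding to the human voice
LOW_LEVEL_DB = 36.0              # inputs below this level are treated as quiet

# Hypothetical values: the source only says the ratio is "higher" for quiet speech.
NORMAL_RATIO = 1.0
BOOST_RATIO = 1.5


def low_voice_ratio(freq_hz: float, input_level_db: float) -> float:
    """Return the level input/output ratio for one frequency/level pair."""
    in_voice_band = VOICE_BAND_HZ[0] <= freq_hz <= VOICE_BAND_HZ[1]
    is_quiet = input_level_db < LOW_LEVEL_DB
    return BOOST_RATIO if (in_voice_band and is_quiet) else NORMAL_RATIO


def output_level_db(freq_hz: float, input_level_db: float) -> float:
    """Assumed model: output level = input level x level input/output ratio."""
    return input_level_db * low_voice_ratio(freq_hz, input_level_db)
```

With these assumed values, a 1 kHz component at 20 dB would be raised to 30 dB, while the same component at 36 dB or a 100 Hz component at any level would pass through unchanged. The hard step at 36 dB is a simplification; claim 3 describes a correction that makes the transition between levels gradual, which this sketch does not model.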

When the low-voice characteristic is selected, the device enters a scene-limited mode in which level adjustment is performed only for conversation scenes. In scene-limited mode, only the audio signal belonging to a conversation scene is level-adjusted with the low-voice characteristic; audio signals belonging to other scenes are level-adjusted with the normal input/output characteristic. Applying the low-voice characteristic only to conversation-scene audio lets the user hear quiet dialogue clearly while hearing other sounds at their natural levels, which provides favorable acoustic characteristics. In addition, performing frequency- and level-based level adjustment only for specific scenes reduces the amount of complex signal processing.

The nighttime characteristic shown in FIG. 9 is suited to viewing in a quiet environment, such as at night, without disturbing the neighbors. In this characteristic, the level input/output ratio is set lower than normal for input signals that are low in frequency (500 Hz or less) and high in level (greater than 96 dB), which have a large effect on the surroundings. When the level is adjusted based on this characteristic, high-level components of low-frequency (so-called deep bass) input signals are adjusted to a relatively low level before output. As a result, the volume of the deep bass, which most affects the surroundings, is reduced, so disturbance to others is lessened even in a quiet environment such as at night.
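As a rough illustration of the nighttime characteristic, the sketch below applies a reduced ratio only to loud, low-frequency components of a signal that has already been split into bands. The 500 Hz and 96 dB thresholds come from the text; the ratio values, the band representation (a mapping from band frequency to input level), and the multiplicative level model are assumptions made for this example.

```python
# Values taken from the description of FIG. 9.
LOW_FREQ_MAX_HZ = 500.0  # "low frequency" means 500 Hz or less
HIGH_LEVEL_DB = 96.0     # "high level" means greater than 96 dB

# Hypothetical values: the source only says the ratio is "lower than normal".
NORMAL_RATIO = 1.0
REDUCED_RATIO = 0.75


def night_ratio(freq_hz: float, input_level_db: float) -> float:
    """Return the level input/output ratio for one frequency/level pair."""
    is_bass = freq_hz <= LOW_FREQ_MAX_HZ
    is_loud = input_level_db > HIGH_LEVEL_DB
    return REDUCED_RATIO if (is_bass and is_loud) else NORMAL_RATIO


def adjust_bands(band_levels_db: dict[float, float]) -> dict[float, float]:
    """Adjust each band's level (assumed model: output = input x ratio).

    band_levels_db maps a band frequency in Hz to its input level in dB.
    """
    return {
        freq: level * night_ratio(freq, level)
        for freq, level in band_levels_db.items()
    }
```

For example, `adjust_bands({100.0: 100.0, 1000.0: 100.0})` would lower only the 100 Hz band under these assumed values and leave the 1 kHz band untouched, which matches the intent of attenuating loud deep bass while keeping the rest of the program natural.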

The hearing-aid characteristic shown in FIG. 10 is suited to users with reduced hearing ability, such as elderly users, who want to enjoy content comfortably. With reduced hearing ability, low-level sounds become hard to hear, and high-frequency sounds in particular become difficult to make out. This characteristic therefore applies dynamic range compression overall and also sets the level input/output ratio for high frequencies (1 kHz or more) at medium and low levels higher than normal. This allows even users with reduced hearing ability, such as the elderly, to hear audio clearly.

The user-defined input/output characteristic is an input/output characteristic defined by the user. To set it, the user operates the remote control 54 to request display of a setting screen. In response, the CPU 34 and the OSD controller 36 display an input/output characteristic setting screen on the display. The setting screen can take various forms; for example, it may show a three-dimensional graph with a frequency axis, an input level axis, and an output level axis, like those in FIGS. 8 to 10. While viewing the displayed graph, the user specifies changes to the level input/output ratio at the desired frequency and input level.

Next, the flow of audio signal level adjustment in this signal adjustment device will be described with reference to FIG. 11. Before playback of the audio signal, the user selects in advance the input/output characteristic to apply. Specifically, the user first selects the normal input/output characteristic (constant input/output ratio), a fixed input/output characteristic, or the user-defined input/output characteristic. If a fixed input/output characteristic is selected, the user further selects the low-voice, nighttime, or hearing-aid characteristic. If the low-voice characteristic is selected, the "scene-limited mode," which applies that characteristic only to audio belonging to conversation scenes, is set automatically.

If the user-defined input/output characteristic is selected, the user sets the desired level input/output ratios while viewing the setting screen. The values set are stored in the memory 32 as the user-defined input/output characteristic.
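One way to picture a stored user-defined characteristic is as a small table of level input/output ratios indexed by frequency band and input-level range. The sketch below is an assumption made for illustration only: the source does not describe the storage layout, and the class name, the band and level edges, and the default ratio of 1.0 are all hypothetical.

```python
from bisect import bisect_right


class UserDefinedCharacteristic:
    """Hypothetical table of level input/output ratios per band and level range."""

    def __init__(self, freq_edges_hz, level_edges_db, default_ratio=1.0):
        # Edges split each axis into len(edges) + 1 ranges.
        self.freq_edges_hz = list(freq_edges_hz)
        self.level_edges_db = list(level_edges_db)
        n_bands = len(self.freq_edges_hz) + 1
        n_levels = len(self.level_edges_db) + 1
        # Every cell starts at the default (normal) ratio.
        self.table = [[default_ratio] * n_levels for _ in range(n_bands)]

    def _cell(self, freq_hz, level_db):
        """Locate the table cell that contains the given frequency and level."""
        band = bisect_right(self.freq_edges_hz, freq_hz)
        level = bisect_right(self.level_edges_db, level_db)
        return band, level

    def set_ratio(self, freq_hz, level_db, ratio):
        """Store the ratio the user chose for the cell containing this point."""
        band, level = self._cell(freq_hz, level_db)
        self.table[band][level] = ratio

    def ratio(self, freq_hz, level_db):
        """Look up the stored ratio for a frequency/level pair."""
        band, level = self._cell(freq_hz, level_db)
        return self.table[band][level]
```

A setting made at one point on the graph, such as `set_ratio(1000, 20, 1.25)` on a table built with edges `[200, 2000]` and `[36, 96]`, would then apply to every input that falls in the same band and level range, while all other cells keep the default ratio.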

An audio signal input to the signal adjustment device 20 is input to the DSP 30 via the DIR 26 (or the ADC 24) and the encoder 28. In response to instructions from the CPU 34, the DSP 30 applies predetermined signal processing to the audio signal. Level adjustment is one such process.

The DSP 30 refers to the memory 32 and checks whether the normal input/output characteristic is selected (S10). If it is, level adjustment is performed based on the normal input/output characteristic (S22). Because the level input/output ratio of this characteristic is constant regardless of input level and frequency, neither frequency separation nor level determination of the input signal is performed. The DSP 30 adjusts the level with the predetermined level input/output ratio and then outputs the audio signal.

If the normal input/output characteristic is not selected, it is next determined whether the scene-limited mode is set (S12). This can be determined from the type of input/output characteristic selected: if it is the low-voice characteristic, the scene-limited mode is deemed to have been set automatically; otherwise, the scene-limited mode is deemed not to be set.

If the scene-limited mode is set, it is determined whether the input audio belongs to a conversation scene (S14). This can be determined from, for example, the result of frequency analysis of the input audio or the presence or absence of subtitle information. If it is not a conversation scene, the level is adjusted with the normal input/output characteristic (S22).

If the scene-limited mode is not set, or if it is set and the scene is a conversation scene, the input audio is separated by frequency (S16). The level of each frequency-separated audio signal is then determined (S18). Level adjustment of the input audio is then performed based on these results and the selected input/output characteristic (S20).
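The branching in steps S10 to S22 can be summarized as a small decision function. This is a minimal sketch of the control flow only, not of the DSP processing itself; the enum, the function name, and the returned tuple are hypothetical, and the scene determination result is assumed to be supplied by the caller.

```python
from enum import Enum


class Characteristic(Enum):
    NORMAL = "normal"
    LOW_VOICE = "low_voice"
    NIGHT = "night"
    HEARING_AID = "hearing_aid"
    USER_DEFINED = "user_defined"


def plan_level_adjustment(selected: Characteristic, is_conversation_scene: bool):
    """Return (characteristic to apply, whether S16/S18 analysis is needed)."""
    # S10: is the normal input/output characteristic selected?
    if selected is Characteristic.NORMAL:
        return Characteristic.NORMAL, False  # S22: constant ratio, no analysis

    # S12: scene-limited mode is implied by the low-voice characteristic.
    scene_limited = selected is Characteristic.LOW_VOICE

    # S14: in scene-limited mode, non-conversation scenes use the normal ratio.
    if scene_limited and not is_conversation_scene:
        return Characteristic.NORMAL, False  # S22

    # S16-S20: frequency separation, level determination, then adjustment
    # with the selected characteristic.
    return selected, True
```

For instance, with the low-voice characteristic selected, a BGM-only scene would fall through to the normal characteristic, while the nighttime characteristic would be applied regardless of the scene type.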

After level adjustment, the audio signal undergoes any other necessary signal processing and is output from the DSP 30. The output audio signal is sent to the external device 50 or the speaker 52 via the decoder 40, the DIT 44, the DAC 42, and so on. Because the output audio has been adjusted to an appropriate level for the viewing environment and the user's hearing characteristics, the user can enjoy comfortable viewing.

As described above, in this embodiment the level is adjusted with a level input/output ratio set for each input level, so output audio at a level appropriate to the viewing environment and the user's hearing characteristics can be obtained. As a result, the user can enjoy comfortable viewing.

FIG. 1 is a block diagram showing the configuration of an audio signal adjustment device that is a basic embodiment of the present invention.
FIG. 2 is a diagram showing an example of an input/output characteristic stored in the input/output characteristic storage section.
FIG. 3 is a graph showing multiple types of input/output characteristics.
FIG. 4 is a diagram showing the result of level adjustment based on the input/output characteristic of FIG. 2.
FIG. 5 is a diagram showing correction between audio signals.
FIG. 6 is a diagram showing an example of another input/output characteristic.
FIG. 7 is a block diagram showing the configuration of a signal adjustment device that is another embodiment of the present invention.
FIG. 8 is a diagram showing the low-voice characteristic.
FIG. 9 is a diagram showing the nighttime characteristic.
FIG. 10 is a diagram showing the hearing-aid characteristic.
FIG. 11 is a flowchart showing the flow of level adjustment for an audio signal.

Explanation of symbols

10: audio signal adjustment device; 12: level determination section; 14: level adjustment section; 16: input/output characteristic storage section; 20: signal adjustment device; 22, 50: external device; 30: DSP; 36: OSD controller; 52: speaker.

Claims

An audio signal adjustment device that adjusts and outputs an input audio signal,
Level determining means for determining an input level which is a level of an input audio signal;
Input / output characteristic storage means for storing a level input / output ratio set for each input level;
Level adjustment means for adjusting the level of the input audio signal based on the input level of the audio signal and the level input / output ratio stored in the input / output characteristic storage means;
An audio signal adjustment device comprising:

The audio signal adjustment device according to claim 1,
An audio signal adjustment device, wherein, among the level input/output ratios stored in the input/output characteristic storage means, the level input/output ratios corresponding to input levels other than a specific level of interest all have the same value.

The audio signal adjustment device according to claim 2, further comprising:
An audio signal adjustment device comprising correction means for correcting between the two audio signals so that the level change between an audio signal at the level of interest and an audio signal at any other level becomes gradual.

The audio signal adjustment device according to any one of claims 1 to 3, further comprising:
A frequency separation means for separating the input audio signal into a plurality of frequency bands;
The audio signal adjustment apparatus, wherein the level input / output ratio stored in the input / output characteristic storage means is set for each frequency band and input level.

The audio signal adjustment device according to any one of claims 1 to 4, further comprising:
Scene determination means for determining whether or not the input audio signal is an audio signal related to conversation,
An audio signal adjusting apparatus characterized by performing level adjustment by a level adjusting means only for an audio signal related to a conversation.

The audio signal adjustment device according to claim 1,
wherein the input/output characteristic storage means stores at least one of the following level input/output ratios:
a low-voice-compatible input/output ratio, in which the level input/output ratio corresponding to low input levels is larger than the level input/output ratio corresponding to other input levels;
a nighttime-compatible input/output ratio, in which the level input/output ratio corresponding to high input levels is smaller than the level input/output ratio corresponding to other input levels; and
a hearing-aid-compatible input/output ratio, in which the level input/output ratio corresponding to a high frequency band is larger than the level input/output ratio in other frequency bands.

The audio signal adjustment device according to claim 6,
wherein the input/output characteristic storage means stores at least two of the low-voice-compatible input/output ratio, the nighttime-compatible input/output ratio, and the hearing-aid-compatible input/output ratio, and
the user can select, from among the stored level input/output ratios, the type of level input/output ratio used for level adjustment.

The audio signal adjustment device according to any one of claims 1 to 7, further comprising:
An audio signal adjusting device characterized in that a user can set a level input / output ratio value stored in an input / output characteristic storage means.