JP2005020720A

JP2005020720A - Voice processing apparatus

Info

Publication number: JP2005020720A
Application number: JP2004159129A
Authority: JP
Inventors: Ryoko Sakakibara; 僚子榊原; Takashi Fujita; 剛史藤田; Norio Hatanaka; 規男畑中
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-05-30
Filing date: 2004-05-28
Publication date: 2005-01-20
Anticipated expiration: 2024-05-28
Also published as: JP4471159B2

Abstract

<P>PROBLEM TO BE SOLVED: To determine and correct the level of input voice data while keeping the original number of channels unchanged. <P>SOLUTION: The voice processing apparatus for determining and correcting the volume level of voice data comprises a controller 1300 for reducing the frequencies in the level determination and correction, thus making it possible to reduce the throughput and to perform inherent functions to the input voice data of multi channels while keeping the original number of channels. In addition, the provision of a plurality of level determining devices 1101 to 1103 and level correcting devices 1201 to 1203 would enable voice processing to be performed in free setting modes that meet the preferences of the user. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、デジタル信号処理技術に関し、音声データのレベル補正を行う音声処理装置に関する。 The present invention relates to a digital signal processing technique, and relates to a sound processing apparatus that performs level correction of sound data.

従来の音源は２チャンネルのステレオ音声がほとんどである。そのため、従来の音声データのレベル補正を行う音声処理装置においては、２チャンネルを想定した音声処理を行う。すなわち、入力された２チャンネルの音声データに対して音量レベル検出を行い、音量レベルに応じてレベル補正処理を行うことで、音声データのレベル補正を実現する。この方法は、例えば、図１７に示す音声処理装置によって実現される。 Conventional sound sources are mostly two-channel stereo sound. For this reason, a conventional audio processing device that performs level correction of audio data performs audio processing assuming two channels. That is, volume level detection is performed on the input 2-channel audio data, and level correction processing is performed according to the volume level, thereby realizing level correction of the audio data. This method is realized, for example, by a voice processing device shown in FIG.

従来の音声処理装置５０００は、レベル判定部５１００とレベル補正部５２００とを備える。レベル判定部５１００は、外部より入力される音声データの音量レベルと一定の基準音量との大小を判定する。レベル補正部５２００は、レベル判定部により判定された音量レベルに応じてレベル補正処理を行う。 The conventional audio processing device 5000 includes a level determination unit 5100 and a level correction unit 5200. The level determination unit 5100 determines the level of the volume level of audio data input from the outside and a certain reference volume. The level correction unit 5200 performs level correction processing according to the volume level determined by the level determination unit.

レベル判定部５１００は、入力される２チャンネル音声データから、各チャンネル１サンプルのデータを取り出す。取り出した２チャンネルのデータの比較を行い、大きいほうの値をレベル検出データとして適用する。次に、レベル検出データと音量レベルを判定するための基準音量との比較を行い、その結果をレベル判定結果信号としてレベル補正部５２００に送信する。 The level determination unit 5100 extracts data of one sample for each channel from the input two-channel audio data. The extracted two channels of data are compared, and the larger value is applied as level detection data. Next, the level detection data and the reference volume for determining the volume level are compared, and the result is transmitted to the level correction unit 5200 as a level determination result signal.

レベル補正部５２００は、レベル判定部５１００からのレベル判定結果信号に基づき、入力されるデータが基準音量よりも大きい場合は音量を小さくする処理を、小さい場合は音量を大きくする処理を行う。これらの処理の出力結果と前サンプルの出力データとの間で平滑化処理が行われ、平滑化済の出力が当該サンプルの出力データとされる。 Based on the level determination result signal from the level determination unit 5100, the level correction unit 5200 performs a process for decreasing the volume when the input data is larger than the reference volume, and performs a process for increasing the volume when the input data is small. A smoothing process is performed between the output results of these processes and the output data of the previous sample, and the smoothed output is used as the output data of the sample.

このような音声処理装置は、車載用オーディオ機器向けや、ＡＶ機器を深夜の限られた音量で動作させる場合に適用される。そうすると、２チャンネルの音声データのダイナミックレンジを圧縮することができ、小さな音も聴きやすい状態となる。したがって、車内などの雑音が多い視聴環境や音量を小さくして視聴する必要がある状況においても十分にオーディオを楽しむことができる。 Such an audio processing apparatus is applied to in-vehicle audio equipment or when operating an AV equipment at a limited volume at midnight. If it does so, the dynamic range of the audio | voice data of 2 channels can be compressed, and it will be in the state which is easy to hear a small sound. Therefore, the audio can be sufficiently enjoyed even in a noisy viewing environment such as in a car or in a situation where it is necessary to view with a reduced volume.

このような従来の音声処理を実施したものとして、特許文献１に記載された先行技術がある。この先行技術によれば、ＣＤ（コンパクトディスク）再生等による音響信号のダイナミックレンジを圧縮する際、歪率を悪化させずに過度的な圧縮特性を改善して、歪なく正常にダイナミックレンジを圧縮することができる。
特開平５−２７５９５０号 The prior art described in Patent Document 1 is one in which such conventional voice processing is performed. According to this prior art, when compressing the dynamic range of an acoustic signal due to CD (compact disc) playback, etc., the dynamic range is normally compressed without distortion by improving excessive compression characteristics without deteriorating the distortion rate. can do.
JP-A-5-275950

近年急速に普及しているＤＶＤなどを処理する場合、多チャンネルの音声データを処理する必要がある。従来の音声処理装置５０００を多チャンネルに適用する場合、多チャンネルの音声データを２チャンネルにダウンミックスしてから音声処理装置５０００に入力する方法が考えられる。しかしながら、多チャンネルの入力源であっても音声処理装置５０００からは２チャンネルの出力しか出力できないうえ、次のような不具合がある。 When processing a DVD or the like that has been rapidly spreading in recent years, it is necessary to process multi-channel audio data. When the conventional audio processing device 5000 is applied to multiple channels, a method of down-mixing multi-channel audio data to 2 channels and then inputting the audio data to the audio processing device 5000 can be considered. However, even with a multi-channel input source, only two channels of output can be output from the audio processing device 5000, and there are the following problems.

音声データでは、多チャンネル化に伴って多様化が促進される。その場合、万人に共通の設定で全ての音源の音声処理を行うのでは音声処理に対するユーザの希望に柔軟に対応しているとはいい難い。２チャンネルのステレオ音源の場合では、左チャンネルと右チャンネルとで設定を変化させる必要は特にない。しかしながら、例えばドルビーデジタルでは、左フロント、右フロント、センター、サラウンド２チャンネルの計５チャンネルと、低域効果音を記録したＬＦＥ（Low Frequency Effect）をそれぞれ混ぜ合わせることなく独立して記録・再生する。この場合、音源によっては、センターを特に響かせたい場合や、フロントを強調したい場合、ＬＦＥを強調したい場合など、最適な設定は多様化する。しかも、その設定はユーザの希望によりさらに多様化する。しかしながら、２チャンネルにダウンミックスする構成では、設定を自由に変化させるのは困難である。 In audio data, diversification is promoted as the number of channels increases. In that case, it is difficult to flexibly respond to the user's desire for voice processing if voice processing of all sound sources is performed with settings common to all. In the case of a two-channel stereo sound source, it is not particularly necessary to change the setting between the left channel and the right channel. However, in Dolby Digital, for example, the left front, right front, center, and surround 2 channels in total, and the LFE (Low Frequency Effect) that records the low-frequency effect sound are recorded and reproduced independently without mixing each other. . In this case, depending on the sound source, optimal settings are diversified, for example, when it is desired to make the center particularly resonate, when the front is emphasized, or when the LFE is emphasized. Moreover, the setting is further diversified according to the user's wishes. However, it is difficult to change the setting freely in the configuration of downmixing to two channels.

したがって、本発明の主たる目的は、入力される音声データが多チャンネルであっても、２チャンネルにダウンミックスを行うことなく、元のチャンネル数のまま、音量レベル補正のできる音声処理装置を提供することである。 Therefore, a main object of the present invention is to provide an audio processing apparatus capable of correcting the sound volume level with the original number of channels without downmixing to 2 channels even if the input audio data is multi-channel. That is.

また、本発明の他の目的は、入力される音声データの種類、またはユーザの好みなどにより、レベル判定の方法や、基準、レベル補正の補正度合いなどの設定を変化させることができるような音声処理装置を提供することである。 Another object of the present invention is such that the level determination method, the reference, and the correction level of the level correction can be changed according to the type of input audio data or the user's preference. It is to provide a processing device.

上記課題を解決するために発明の音声処理装置は、入力音声データの音量レベルを判定するレベル判定部と、前記レベル判定部の判定結果に基づいて、前記音量レベルの補正を行うレベル補正部と、前記レベル判定部がレベル判定を行う頻度を調整するレベル判定間引きコントローラとを備える。 In order to solve the above problems, an audio processing device of the invention includes a level determination unit that determines a volume level of input audio data, a level correction unit that corrects the volume level based on a determination result of the level determination unit, and And a level determination thinning controller for adjusting the frequency with which the level determination unit performs level determination.

以上説明したように本発明の音声処理装置によれば、処理量を削減でき、入力される音声データが多チャンネルであっても、２チャンネルにダウンミックスを行うことなく、元のチャンネル数のまま、音量レベル補正を行うことが可能となる。 As described above, according to the audio processing device of the present invention, the processing amount can be reduced, and even if the input audio data is multi-channel, the original number of channels is maintained without downmixing to 2 channels. , Volume level correction can be performed.

また、入力される音声データの種類、または使用者の好みなどにより、レベル判定の方法や、基準、レベル補正の補正度合いなどの設定を変化させることが可能となる。 In addition, it is possible to change settings such as a level determination method, a reference, and a correction level of level correction depending on the type of input audio data or the user's preference.

以下、本発明の好ましい実施の形態について図面を参照して説明する。 Hereinafter, preferred embodiments of the present invention will be described with reference to the drawings.

（実施の形態１）
図１は本発明の実施の形態１に係る音声処理装置１０００の構成を示す図である。音声処理装置１０００はレベル判定部１１００とレベル補正部１２００とレベル判定間引きコントローラ１３００とを備える。本発明は、レベル判定間引きコントローラ１３００を設ける点に特徴がある。 (Embodiment 1)
FIG. 1 is a diagram showing a configuration of a speech processing apparatus 1000 according to Embodiment 1 of the present invention. The speech processing apparatus 1000 includes a level determination unit 1100, a level correction unit 1200, and a level determination thinning controller 1300. The present invention is characterized in that a level determination thinning controller 1300 is provided.

レベル判定部１１００は各々が少なくとも１チャンネルの音声データに対してレベル判定が可能な複数のレベル判定器から構成される。同様に、レベル補正部１２００は、各々が少なくとも１チャンネルの音声データに対してレベル補正が可能な複数のレベル補正器から構成される。レベル判定間引きコントローラ１３００は入力される音声データに対し、レベル判定部１１００の動作を間引くことにより処理量の削減を行う。 The level determination unit 1100 includes a plurality of level determination units each capable of determining a level for audio data of at least one channel. Similarly, the level correction unit 1200 includes a plurality of level correctors each capable of performing level correction on at least one channel of audio data. The level determination thinning controller 1300 reduces the amount of processing by thinning out the operation of the level determination unit 1100 for the input audio data.

音声処理装置１０００では、レベル判定間引きコントローラ１３００を設けることにより、入力される音声データが多チャンネルであっても、２チャンネルにダウンミックスを行うことなく、元のチャンネル数のまま、音量レベル補正を行うことが可能となる。 In the audio processing apparatus 1000, by providing the level determination thinning controller 1300, even if the input audio data is multi-channel, the volume level correction is performed with the original number of channels without downmixing to two channels. Can be done.

図２を参照して実施の形態１に係る音声処理装置１０００についてさらに詳細に説明する。以下の説明では、入力音声データとして多チャンネルの音声データが入力された場合について説明する。図２において図１と同様の部分には同一の符号を付す。 The speech processing apparatus 1000 according to Embodiment 1 will be described in more detail with reference to FIG. In the following description, a case where multi-channel audio data is input as input audio data will be described. In FIG. 2, the same parts as those in FIG.

音声処理装置１０００は、レベル判定部１１００とレベル補正部１２００とレベル判定間引きコントローラ１３００と情報伝達器２０００と入力器１６００とを備える。 The speech processing apparatus 1000 includes a level determination unit 1100, a level correction unit 1200, a level determination thinning controller 1300, an information transmitter 2000, and an input unit 1600.

レベル判定部１１００は、各々が同様の構造を有する複数（この例では３つ）のレベル判定器１１０１〜１１０３を備える。同様に、レベル補正部１２００は、各々が同様の構造を有するレベル補正器１２０１〜１２０３を備える。各レベル判定器１１０１〜１１０３と各レベル補正器１２０１〜１２０３とは互いに一対一に対応する。音声処理装置１０００は、さらに、演算用係数記憶器１４００とレベル判定器数切替器１５００とを備える。演算用係数記憶器１４００は、第１、第２の補正パラメータにより決定される基準音量をそれぞれ記憶する。レベル判定間引きコントローラ１３００は、デコーダ別レベル判定間引きコントローラ切替器１３１０を備える。 The level determination unit 1100 includes a plurality (three in this example) of level determination units 1101 to 1103 each having a similar structure. Similarly, the level correction unit 1200 includes level correctors 1201 to 1203 each having a similar structure. The level determiners 1101 to 1103 and the level correctors 1201 to 1203 correspond one to one. The speech processing apparatus 1000 further includes a calculation coefficient memory 1400 and a level determination device number switch 1500. The calculation coefficient memory 1400 stores the reference sound volume determined by the first and second correction parameters. The level determination thinning controller 1300 includes a decoder-specific level determination thinning controller switching unit 1310.

入力器１６００は、音声処理装置１０００の使用者が行う入力操作を受けてその入力操作情報を情報伝達器２０００に送信する。 The input device 1600 receives an input operation performed by the user of the speech processing apparatus 1000 and transmits the input operation information to the information transmitter 2000.

以下、音声処理装置１０００の動作を説明する。レベル判定間引きコントローラ１３００は、情報伝達器２０００を介して、入力音声データの信号形態を示すデコーダ情報Ｓ０１を外部から受信する。本実施形態における信号形態は、例えば、ステレオ音声形態、ドルビーデジタル形態等が挙げられる。デコーダ情報Ｓ０１は、入力音声データＳ０６に付随する情報であり、入力音声データＳ０６とともに外部から音声処理装置１０００に入力される。 Hereinafter, the operation of the speech processing apparatus 1000 will be described. The level determination thinning controller 1300 receives decoder information S01 indicating the signal form of the input audio data from the outside via the information transmitter 2000. Examples of the signal form in this embodiment include a stereo sound form and a Dolby digital form. The decoder information S01 is information accompanying the input sound data S06, and is input to the sound processing apparatus 1000 from the outside together with the input sound data S06.

レベル判定部１１００には、第１のレベル補正パラメータＳ０７と第２のレベル補正パラメータＳ０８とが、情報伝達器２０００から供給される。レベル判定部１１００に受信されるパラメータＳ０７,Ｓ０８は、レベル判定器１１０１〜１１０３それぞれに入力される。第１,第２のレベル補正パラメータＳ０７,Ｓ０８については後述する。 The level determination unit 1100 is supplied with the first level correction parameter S07 and the second level correction parameter S08 from the information transmitter 2000. The parameters S07 and S08 received by the level determination unit 1100 are input to the level determination units 1101 to 1103, respectively. The first and second level correction parameters S07 and S08 will be described later.

レベル判定間引きコントローラ１３００は、デコーダ情報Ｓ０１に基づいて間引き頻度を決定する。具体的には、入力音声データの信号形態（ステレオ音声形態,ドルビーデジタル形態等）を示すデコーダ情報Ｓ０１に基づき、コントローラ１３００は、その信号形態の音声データの各チャンネルデータをレベル判定器１１０１〜１１０３の一つにおいて不具合なくレベル判定処理するのに適した間引き頻度を決定する。コントローラ１３００は、決定した間引き頻度を示すレベル判定間隔情報Ｓ０２をレベル判定部１１００に送信する。 The level determination thinning controller 1300 determines the thinning frequency based on the decoder information S01. Specifically, based on the decoder information S01 indicating the signal form (stereo sound form, Dolby digital form, etc.) of the input sound data, the controller 1300 determines the level determiners 1101-1103 for each channel data of the sound data in the signal form. In this case, a thinning frequency suitable for level determination processing without a defect is determined. The controller 1300 transmits level determination interval information S02 indicating the determined thinning frequency to the level determination unit 1100.

レベル判定器数切替器１５００は、その音声データの処理に適したレベル判定器１１００の数を示すレベル判定器数情報Ｓ０３を、情報伝達器２０００を介して受信する。レベル判定器数情報Ｓ０３は、音声処理装置１０００の使用者によって入力器１６００に入力されて情報伝達器２０００に供給される。ここで、音声データの処理に適したレベル判定器数とは、音声データのチャンネル数に一致するレベル判定器数をいい、レベル判定器数情報Ｓ０３によって、音声データの各チャンネルに一対一に対応するレベル判定器数が指定される。ただし、音声データの複数のチャンネルに対して一つのレベル判定器１１０１〜１１０３が指定されてもよい。 The level judgment device number switching device 1500 receives the level judgment device number information S03 indicating the number of level judgment devices 1100 suitable for the processing of the audio data via the information transmitter 2000. The level determination device number information S03 is input to the input device 1600 by the user of the speech processing apparatus 1000 and supplied to the information transmitter 2000. Here, the number of level determiners suitable for processing audio data means the number of level determiners that matches the number of channels of audio data, and corresponds to each channel of audio data on a one-to-one basis by the level determiner number information S03. The number of level determiners to be specified is specified. However, one level determiner 1101-1103 may be specified for a plurality of channels of audio data.

レベル判定器数切替器１５００は、指定されたレベル判定器数に基づいて、動作指示情報Ｓ０４を作成してレベル判定器１１００に送信する。その際、レベル判定器数切替器１５００は、レベル判定器数情報Ｓ０３に基づいて次のように処理する。レベル判定器数情報Ｓ０３は、上述したように入力される音声データ（多チャンネルデータ）の各チャンネルに対応するレベル判定器１１０１〜１１０３の数を指定するデータである。このような情報を示すレベル判定器数情報Ｓ０３を受信したレベル判定器数切替器１５００は、複数あるレベル判定器１１０１〜１１０３の中からレベル判定器数情報Ｓ０３に指定されるＸ個（Ｘ≦レベル判定器総数）のレベル判定器１１０１〜１１０Ｘを特定し、特定したレベル判定器１１０１〜１１０Ｘそれぞれに動作指示情報Ｓ０４を送信する。これにより、レベル判定器数切替器１５００は、入力された音声データの各チャンネルデータの判定を行うレベル判定器１１０１〜１１０Ｘを指定したうえで、指定したレベル判定器１１０１〜１１０Ｘに動作指示情報Ｓ０４を送信する。 Level discriminator number switching device 1500 creates operation instruction information S04 based on the specified number of level discriminator devices and transmits the operation instruction information S04 to level discriminator 1100. At that time, the level determination device number switching device 1500 performs the following processing based on the level determination device number information S03. The level determination device number information S03 is data for designating the number of level determination devices 1101 to 1103 corresponding to each channel of the audio data (multi-channel data) input as described above. The level decision unit number switching device 1500 that has received the level decision unit number information S03 indicating such information has X pieces (X ≦ X) designated in the level decision unit number information S03 from among a plurality of level decision units 1101 to 1103. Level determiners 1101 to 110X are specified, and the operation instruction information S04 is transmitted to each of the specified level determiners 1101 to 110X. As a result, the level determining device number switching device 1500 designates the level judging devices 1101 to 110X for judging each channel data of the input audio data, and then sends the operation instruction information S04 to the designated level judging devices 1101 to 110X. Send.

動作指示情報Ｓ０４を受信したレベル判定器１１０１〜１１０Ｘは、判定動作を実施する。具体的には、レベル判定器１１０１〜１１０Ｘは、動作指示情報Ｓ０４とレベル判定間隔情報Ｓ０２とに基づいて入力音声データのレベル判定を行う。その際、レベル判定器１１０１〜１１０Ｘは、レベル判定間隔情報Ｓ０２で指定される頻度でレベル判定処理を間引きながら処理を行う。具体的には、図３（Ａ）に示すように、音声処理装置１０００に設定されたサンプリング周期２１００の全ての周期点において判定処理を行うのではなく、図３（Ｂ）に示すように、サンプリング周期２１００で規定される全周期点２１００₁〜_nのうち任意の数の周期点２１００₁〜_m（ｍ＜ｎ）を任意の間引き周期に基づいて定期的に間引いたうえで、残余の周期点２１００_(m+1)〜_nにおいて判定処理を行う。例えば、全周期点２１００₁〜_nにおいて８個毎に１個の周期点においてのみ判定処理を行い、残余の７個の周期点においては判定処理を行わない（間引く）。 The level determiners 1101 to 110X that have received the operation instruction information S04 perform a determination operation. Specifically, the level determiners 1101 to 110X perform input audio data level determination based on the operation instruction information S04 and the level determination interval information S02. At this time, the level determiners 1101 to 110X perform processing while thinning out the level determination processing at a frequency specified by the level determination interval information S02. Specifically, as shown in FIG. 3A, determination processing is not performed at all the periodic points of the sampling period 2100 set in the audio processing apparatus 1000, but as shown in FIG. Any number of periodic points 2100 ₁ to _m (m <n) out of all periodic points 2100 ₁ to _n defined by the sampling period 2100 are periodically thinned based on an arbitrary thinning period, and then the remaining period Judgment processing is performed at points 2100 _{(m + 1)} to _n . For example, at all periodic points 2100 ₁ to _n , determination processing is performed only at one periodic point for every eight, and determination processing is not performed at the remaining seven periodic points (thinning out).

これにより全体としてのレベル判定回数が削減される。したがって、レベル判定器１１０１〜１１０３の処理が過負荷となることに起因する不具合を生じさせることなく、レベル判定を実施することが可能となる。 As a result, the number of level determinations as a whole is reduced. Therefore, the level determination can be performed without causing a problem caused by the processing of the level determination units 1101 to 1103 being overloaded.

レベル判定器１１０１〜１１０Ｘの動作をさらに詳細に説明する。レベル判定器１１０１〜１１０Ｘは、情報伝達器２０００を介して第１,第２のレベル補正パラメータＳ０７,Ｓ０８を受信すると、その第１,第２のレベル補正パラメータＳ０７,Ｓ０８に基づいて演算用係数アドレス情報Ｓ０９を生成して演算用係数記憶器１４００に送信する。第１,第２のレベル補正パラメータＳ０７,Ｓ０８は、音声処理装置１０００の使用者が入力器１６００に入力したのち、入力器１６００から情報伝達器２０００に供給される。演算用係数アドレス情報Ｓ０９とは次の情報のことである。演算用係数記憶器１４００には、各音声データの基準データの実体値が格納されている。演算用係数アドレス情報Ｓ０９は、演算用係数記憶器１４００における基準音声データの格納位置を指定するアドレス情報である。 The operation of the level determiners 1101 to 110X will be described in further detail. When the level determiners 1101 to 110X receive the first and second level correction parameters S07 and S08 via the information transmitter 2000, the calculation coefficients based on the first and second level correction parameters S07 and S08 are received. The address information S09 is generated and transmitted to the calculation coefficient memory 1400. The first and second level correction parameters S07 and S08 are supplied from the input device 1600 to the information transmitter 2000 after the user of the speech processing apparatus 1000 inputs them to the input device 1600. The calculation coefficient address information S09 is the following information. The calculation coefficient storage 1400 stores the actual value of the reference data of each audio data. The calculation coefficient address information S09 is address information for designating the storage position of the reference audio data in the calculation coefficient storage 1400.

演算用係数記憶器１４００は、演算用係数アドレス情報Ｓ０９を受信すると、記憶している基準音量データの全実体値データから、その演算用係数アドレス情報Ｓ０９でアドレス指定される基準音声データＳ１０の実体値を読み出してレベル判定器１１０１〜１１０Ｘに送信する。 When the calculation coefficient memory 1400 receives the calculation coefficient address information S09, the calculation coefficient address information S09 is used to calculate the entity of the reference audio data S10 that is addressed by the calculation coefficient address information S09. The value is read and transmitted to the level determiners 1101 to 110X.

レベル判定器１１０１〜１１０Ｘは、基準音量データＳ１０の実体値を受信すると、受信する入力音声データＳ０６の中から自らに割り当てられたチャンネルデータを取り出し、取り出したチャンネルデータと基準音量データＳ１０の実体値との間での大小を比較して、その比較結果であるレベル判定結果信号Ｓ１１をレベル補正器１２０１〜１２０Ｘに送信する。レベル判定結果信号Ｓ１１は、レベル判定器１１０１〜１１０Ｘそれぞれに対応するレベル補正器１２０１〜１２０Ｘに送信される。 When the level determiners 1101 to 110X receive the actual value of the reference volume data S10, the level determiners 1101 to 110X extract channel data assigned to them from the received input audio data S06, and the extracted channel data and the actual value of the reference volume data S10. And a level determination result signal S11, which is the comparison result, is transmitted to the level correctors 1201 to 120X. The level determination result signal S11 is transmitted to the level correctors 1201 to 120X corresponding to the level determiners 1101 to 110X, respectively.

レベル補正器１２０１〜１２０Ｘは、レベル判定結果信号Ｓ１１を受信すると、その判定結果に基づいて入力音声の補正を行い、出力音声データＳ１５を出力する。これにより、音声処理装置１０００による一連のレベル補正処理が完了する。 When the level correctors 1201 to 120X receive the level determination result signal S11, the level correctors 1201 to 120X correct the input sound based on the determination result and output the output sound data S15. Thereby, a series of level correction processing by the audio processing apparatus 1000 is completed.

図４は、音声処理装置１０００に入力される入力音声データのレベルとそれに対応する出力音声データのレベルとの関係をグラフ化したものである。グラフＧ１は、レベル補正を実施しなかった場合を示し、グラフＧ２は、レベル補正を実施した場合の補正パターンの一例を示す。 FIG. 4 is a graph showing the relationship between the level of input voice data input to the voice processing apparatus 1000 and the level of output voice data corresponding thereto. A graph G1 shows a case where level correction is not performed, and a graph G2 shows an example of a correction pattern when level correction is performed.

レベル補正器１２０１〜１２０３は、対応するレベル判定器１１０１〜１１０３からレベル判定結果信号Ｓ１１を受け取ると、そのレベル判定結果信号Ｓ１１を解読することで、そのレベル補正器１２０１〜１２０３に対応するチャンネルデータの音声レベルが基準音量データＳ１０の実体値よりも大きいか否かを判定する。 When the level correctors 1201 to 1203 receive the level determination result signal S11 from the corresponding level determiners 1101 to 1103, the level corrector 1201 to 1203 decodes the level determination result signal S11 to thereby obtain channel data corresponding to the level correctors 1201 to 1203. It is determined whether or not the sound level is greater than the actual value of the reference volume data S10.

大きいと判定する場合は、入力音声レベルは図４において基準音量データＳ１０よりも図中左側に位置する音声レベル領域Ｈ２にあると判断する。そして、その音声レベル領域Ｈ２に適した補正パターン（補正特性）に基づいて、チャンネルデータの補正を行うことで、入力音声データレベルに比べて出力音声データレベルを下げるように補正する。 When it is determined that the input sound level is high, it is determined that the input sound level is in the sound level region H2 located on the left side in FIG. 4 with respect to the reference volume data S10. Then, the channel data is corrected based on a correction pattern (correction characteristic) suitable for the audio level region H2, thereby correcting the output audio data level to be lower than the input audio data level.

一方、小さいと判断する場合は、入力音声レベルは図４において基準音量データＳ１０よりも図中右側に位置する音声レベル領域Ｈ１にあると判断する。そして、その音声レベル領域Ｈ１に適した補正パターン（補正特性）に基づいて、チャンネルデータの補正を行うことで、入力音声データレベルに比べて出力音声データレベルを所定量持ち上げるように補正する。 On the other hand, when it is determined that the input sound level is low, it is determined that the input sound level is in the sound level region H1 located on the right side in the drawing with respect to the reference volume data S10 in FIG. Then, by correcting the channel data based on the correction pattern (correction characteristic) suitable for the audio level region H1, the output audio data level is corrected to be increased by a predetermined amount compared to the input audio data level.

レベル判定部１１００の動作について、図５を参照してさらに詳細に説明する。レベル判定部１１００が有する各レベル判定器１１０１〜１１０３はレベル判定チャンネル指定器１１１１を備える。レベル判定チャンネル指定器１１１１は、情報伝達器２０００を介して、レベル判定チャンネル情報Ｓ０５を受信する。レベル判定チャンネル情報Ｓ０５は、音声処理装置１０００の使用者が入力器１６００を介して情報伝達器２０００に入力する情報であって、入力音声データＳ０６においてレベル判定を行うチャンネルを指定する情報である。 The operation of the level determination unit 1100 will be described in more detail with reference to FIG. Each of the level determination units 1101 to 1103 included in the level determination unit 1100 includes a level determination channel specifying unit 1111. The level determination channel designator 1111 receives the level determination channel information S05 via the information transmitter 2000. The level determination channel information S05 is information that the user of the voice processing apparatus 1000 inputs to the information transmitter 2000 via the input device 1600, and is information that specifies a channel for performing level determination in the input voice data S06.

レベル判定チャンネル指定器１１１１は、受信するレベル判定チャンネル情報Ｓ０５に基づき、このレベル判定チャンネル指定器１１１１が組み込まれたレベル判定器１１０１〜１１０３でレベル判定を行うチャンネルを指定する。担当チャンネルとして指定するチャンネルは、単一の場合もあれば複数の場合もある。例えば、ドルビーサラウンド方式の音声データの場合、レベル判定チャンネル指定器１１１１は、左右チャンネルの音声データをひとまとめにして単一の判定対象音声データとして取り扱う場合がある。この場合、レベル判定チャンネル指定器１１１１は、一対の音声データ（左右）を担当チャンネルとして決定する。以上説明した各判定チャンネルの指定は、レベル判定チャンネル情報Ｓ０５に基づいてレベル判定チャンネル指定器１１１１が実施する。 Based on the received level determination channel information S05, the level determination channel specification unit 1111 specifies a channel for which level determination is performed by the level determination units 1101 to 1103 in which the level determination channel specification unit 1111 is incorporated. There may be a single channel or a plurality of channels designated as the assigned channels. For example, in the case of Dolby Surround audio data, the level determination channel designator 1111 may handle the audio data of the left and right channels as a single determination target audio data. In this case, the level determination channel designator 1111 determines a pair of audio data (left and right) as the assigned channels. The determination of each determination channel described above is performed by the level determination channel designator 1111 based on the level determination channel information S05.

レベル判定器１１０１〜１１０３は、入力音声データＳ０６から、レベル判定チャンネル指定器１１１１で決定されたチャンネルのチャンネルデータを１サンプル読み込んで、レベル検出データとする。ここで、複数チャンネルがレベル判定チャンネルとして指定された場合は、例えば、その最大値が選択される。音量レベルを判定するための基準音量データＳ１０の実体値は演算用係数記憶器１４００に係数テーブルとなって格納されている。レベル判定器１１０１〜１１０３は、入力器１６００から情報伝達器２０００を介して受信される第１,第２のレベル補正パラメータＳ０７,Ｓ０８に基づいて決定される演算用係数記憶器１４００のアドレス位置に格納されている基準音量データの実体値を読み出す。 The level determiners 1101 to 1103 read one sample of the channel data of the channel determined by the level determination channel designator 1111 from the input audio data S06 and use it as level detection data. Here, when a plurality of channels are designated as level determination channels, for example, the maximum value is selected. The actual value of the reference volume data S10 for determining the volume level is stored as a coefficient table in the calculation coefficient memory 1400. The level determiners 1101 to 1103 are located at the address positions of the calculation coefficient memory 1400 determined based on the first and second level correction parameters S07 and S08 received from the input device 1600 via the information transmitter 2000. Read the actual value of the stored reference volume data.

第１のレベル補正パラメータＳ０７は、前述したように、入力音量レベルが基準音量データＳ１０よりも小さい音声レベル領域Ｈ１における出力音声データＳ１５の出力レベルを決定するパラメータであり、図４における補正パターンＧ２１に対応している。同様に、第２のレベル補正パラメータＳ０８は、入力音量レベルが基準音量Ｓ１０よりも大きい音声レベル領域Ｈ２における出力音声データＳ１５の出力レベルを決定するパラメータであり、図４における補正パターンＧ２２が対応している。 As described above, the first level correction parameter S07 is a parameter for determining the output level of the output sound data S15 in the sound level region H1 whose input sound volume level is smaller than the reference sound volume data S10, and the correction pattern G21 in FIG. It corresponds to. Similarly, the second level correction parameter S08 is a parameter for determining the output level of the output sound data S15 in the sound level region H2 in which the input sound volume level is larger than the reference sound volume S10, and corresponds to the correction pattern G22 in FIG. ing.

第１,第２のレベル補正パラメータＳ０７,Ｓ０８は、音声処理装置１０００の使用者によって入力器１６００に入力される使用者好みの補正特性情報に基づいて情報伝達器２０００が生成してレベル判定器１１０１〜１１０３に供給する。 The first and second level correction parameters S07 and S08 are generated by the information transmitter 2000 based on user-preferred correction characteristic information input to the input device 1600 by the user of the speech processing apparatus 1000, and the level determination device. 1101 to 1103.

ここで、補正パターンＧ２１は、次の算定式（第１の一次式）により規定される。 Here, the correction pattern G21 is defined by the following calculation formula (first primary formula).

Ｙ＝ａ₁Ｘ＋ｂ₁
Ｘ：音声処理装置の入力音声データ
Ｙ：音声処理装置の出力音声データ。
ａ₁：入力音声データＸの増幅係数であって、固定値が設定される
（本実施の形態では、ａ₁＝１）。
ｂ₁：第１のレベル補正パラメータＳ０７の値であって、入力器１６００
により任意に変更可能。 Y = a ₁ X + b ₁
X: input voice data of the voice processing device Y: output voice data of the voice processing device.
a ₁ : Amplification coefficient of input audio data X, which is set to a fixed value (a ₁ = 1 in the present embodiment).
b ₁ : the value of the first level correction parameter S07, and the input device 1600
Can be changed arbitrarily.

上記算定式より明らかなように、第１のレベル補正パラメータＳ０７は、補正パターンＧ２１におけるｙ切片であって、具体的には、出力音声データＳ１５を算定するために入力音声データＳ０６に加減算される値である。 As is clear from the above calculation formula, the first level correction parameter S07 is a y-intercept in the correction pattern G21, and specifically, is added to or subtracted from the input voice data S06 in order to calculate the output voice data S15. Value.

また、補正パターンＧ２２は、次の算定式（第２の一次式）により規定される。 The correction pattern G22 is defined by the following calculation formula (second primary formula).

Ｙ＝ａ₂Ｘ＋ｂ₂
Ｘ：音声処理装置の入力音声データ
Ｙ：音声処理装置の出力音声データ。
ａ₂：入力音声データＸの増幅係数であって、
固定値が設定される（本実施の形態では、ａ₂＝１／２）。
ｂ₂：第２レベル補正パラメータＳ０８の値であって、入力器１６００によ
り任意に変更可能。 Y = a ₂ X + b ₂
X: input sound data of the sound processing device Y: output sound data of the sound processing device.
a ₂ : Amplification coefficient of the input audio data X,
A fixed value is set (in the present embodiment, a ₂ = 1/2).
b ₂ : Value of the second level correction parameter S08, which can be arbitrarily changed by the input device 1600.

上記算定式より明らかなように、第２のレベル補正パラメータＳ０８は、補正パターンＧ２２のおけるｙ切片であって、具体的には、出力音声データＳ１５を算定するために入力音声データＳ０６に加減算される値である。 As is apparent from the above calculation formula, the second level correction parameter S08 is a y-intercept in the correction pattern G22. Specifically, the second level correction parameter S08 is added to or subtracted from the input voice data S06 in order to calculate the output voice data S15. Value.

基準音量データＳ１０は、隣接する音声レベル領域Ｈ１,Ｈ２どうしの間の境界となる音量レベルを示し、補正パターンＧ２１,Ｇ２２の両算定式の交点におけるＸの値（入力音声データＳ０６）として求められる。 The reference sound volume data S10 indicates a sound volume level that is a boundary between adjacent sound level regions H1 and H2, and is obtained as a value of X (input sound data S06) at the intersection of both calculation formulas of the correction patterns G21 and G22. .

第１,第２のレベル補正パラメータＳ０７,Ｓ０８の設定に基づいて補正パターンＧ２１と補正パターンＧ２２とが決定されると、交点となる基準音量データＳ１０は一意に決定される。したがって、レベル判定器１１０１〜１１０３は、第１のレベル補正パラメータＳ０７と第２のレベル補正パラメータＳ０８とを用いて、演算用係数記憶器１４００内の基準音量データＳ１０の実体値を特定することが可能となる。 When the correction pattern G21 and the correction pattern G22 are determined based on the settings of the first and second level correction parameters S07 and S08, the reference volume data S10 that is an intersection is uniquely determined. Therefore, the level determiners 1101 to 1103 can specify the actual value of the reference volume data S10 in the calculation coefficient storage unit 1400 using the first level correction parameter S07 and the second level correction parameter S08. It becomes possible.

図６は、第１のレベル補正パラメータＳ０７と第２のレベル補正パラメータＳ０８との間の相関に基づいて基準音量データＳ１０を特定する境界特定データを示すテーブルである。ここで、基準音量データＳ１０の境界特定データはステップ値からなり、値”０”から値”２１”までの２２ステップが設定されている。境界特定データは、基準レベルである０ｄｂからどれだけ−∞ｄｂに向けて変移するかを示す変移量として規定されている。値”０”は最も０ｄＢレベルに近い値を示し、値”２１”は、最も−∞ｄＢに近い値を示す。つまり、基準音量データＳ１０の境界特定データは、値が大きい程、−∞ｄＢに近づく。以下、このように規定される基準音量データＳ１０の境界特定データは、レベル補正オフセットと呼ばれる。 FIG. 6 is a table showing boundary specifying data for specifying the reference sound volume data S10 based on the correlation between the first level correction parameter S07 and the second level correction parameter S08. Here, the boundary specifying data of the reference volume data S10 is composed of step values, and 22 steps from the value “0” to the value “21” are set. The boundary specifying data is defined as a shift amount that indicates how much the reference level shifts from 0 db toward -∞ db. The value “0” indicates the value closest to the 0 dB level, and the value “21” indicates the value closest to −∞ dB. That is, the boundary specifying data of the reference volume data S10 approaches -∞ dB as the value increases. Hereinafter, the boundary specifying data of the reference volume data S10 defined in this way is referred to as a level correction offset.

なお、レベル補正オフセットは、演算用係数記憶器１４００に格納されている基準音量データＳ１０の実体値を示すものではない。基準音量データＳ１０の実体値は、レベル補正オフセットに対応させた状態で演算用係数記憶器１４００に格納されている。 The level correction offset does not indicate the actual value of the reference sound volume data S10 stored in the calculation coefficient storage 1400. The actual value of the reference volume data S10 is stored in the calculation coefficient memory 1400 in a state corresponding to the level correction offset.

図６のテーブルでは、
・使用者が入力器１６００に実施する第１のレベル補正パラメータＳ０７の調整量が０〜５の６ステップに設定されている、
・使用者が入力器１６００に実施する第１のレベル補正パラメータＳ０７の１ステップの調整に対して、レベル補正オフセットの実体値はその３倍のステップ間隔（３）で変動する、
・使用者が入力器１６００に実施する第２のレベル補正パラメータＳ０８の調整量が０〜６の７ステップに設定されている、
・使用者が入力器１６００に実施する第２のレベル補正パラメータＳ０８の１ステップの調整に対して、レベル補正オフセットの実体値は同一のステップ間隔（１）で変動する、
という条件が設定された場合が想定されている。 In the table of FIG.
The adjustment amount of the first level correction parameter S07 that the user performs on the input device 1600 is set to 6 steps of 0 to 5,
For the one-step adjustment of the first level correction parameter S07 performed by the user on the input device 1600, the actual value of the level correction offset fluctuates at a step interval (3) that is three times that.
The adjustment amount of the second level correction parameter S08 that the user performs on the input device 1600 is set to 7 steps of 0 to 6,
The actual value of the level correction offset varies at the same step interval (1) with respect to the one-step adjustment of the second level correction parameter S08 performed by the user on the input device 1600.
It is assumed that the condition is set.

以上の条件に基づいて作製された図６のテーブルでは、第１のレベル補正パラメータＳ０７の６ステップ調整量と第２のレベル補正パラメータＳ０８の７ステップ調整量とに基づき、レベル補正オフセットは、６×７＝４２通りの値を取ることになる。 In the table of FIG. 6 produced based on the above conditions, the level correction offset is 6 based on the 6-step adjustment amount of the first level correction parameter S07 and the 7-step adjustment amount of the second level correction parameter S08. X7 = 42 values are taken.

さらには、使用者が入力器１６００に実施する第１のレベル補正パラメータＳ０７の１ステップの調整に対して、レベル補正オフセットの実体値は、３ステップ間隔で変動する。これに対して、使用者が入力器１６００に実施する第２のレベル補正パラメータＳ０８の１ステップの調整に対して、レベル補正オフセットの実体値は、１ステップ間隔で変動する。 Furthermore, for one step adjustment of the first level correction parameter S07 performed by the user on the input device 1600, the actual value of the level correction offset varies at intervals of three steps. On the other hand, for the one-step adjustment of the second level correction parameter S08 performed by the user on the input device 1600, the actual value of the level correction offset varies at one-step intervals.

しかしながら、上述したように、レベル補正オフセットの実体値は、値”０”から値”２１”までの２２ステップが設定されている。そのため、図６のテーブルを詳細に見れば明らかなように、４２通りあるレベル補正オフセットにおいて、値が重複するものが多数存在する。したがって、演算用係数記憶器１４００では、図７に示すように、レベル補正オフセットの重複を排除してなる２２個のレベル補正オフセットの実体値が存在する。これら２２個のレベル補正オフセットの実体値に対応する２２個の基準音量データＳ１０の実体値Ｄ０〜Ｄ２１が存在する。これら２２個の基準音量データＳ１０の実体値データＤ０〜Ｄ２１が、演算用係数記憶器１４００内の２２個のメモリ領域に順次格納されている。 However, as described above, the actual value of the level correction offset has 22 steps from the value “0” to the value “21”. Therefore, as is apparent from the detailed table of FIG. 6, there are a large number of 42 level correction offsets with overlapping values. Therefore, as shown in FIG. 7, in the calculation coefficient memory 1400, there are 22 actual values of level correction offsets obtained by eliminating duplication of level correction offsets. There are 22 actual values D0 to D21 of the reference volume data S10 corresponding to the actual values of the 22 level correction offsets. The actual value data D0 to D21 of the 22 reference volume data S10 are sequentially stored in 22 memory areas in the calculation coefficient memory 1400.

このようにしてデータ量が削減された演算用係数記憶器１４００から、基準音量データＳ１０の前記実体値データＤ０〜Ｄ２１は、第１,第２のレベル補正パラメータＳ０７,Ｓ０８に基づいて次のようにして読み出される。 Based on the first and second level correction parameters S07 and S08, the actual value data D0 to D21 of the reference volume data S10 from the arithmetic coefficient memory 1400 with the data amount reduced in this way are as follows. Is read out.

図８は、基準音量データＳ１０の実体値データＤ０〜Ｄ２１を演算用係数記憶器１４００から読み出す工程を示すフローチャートである。 FIG. 8 is a flowchart showing a process of reading the actual value data D0 to D21 of the reference volume data S10 from the calculation coefficient memory 1400.

まず、入力器１６００の入力値に基づいて情報伝達器２０００が第１,第２のレベル補正パラメータＳ０７,Ｓ０８を生成して出力する。レベル判定チャンネル指定器１１１１は、情報伝達器２０００が出力する第１,第２のレベル補正パラメータＳ０７,Ｓ０８を取得する（Ｓ８０１,Ｓ８０２）。 First, the information transmitter 2000 generates and outputs the first and second level correction parameters S07 and S08 based on the input value of the input device 1600. The level determination channel designator 1111 acquires the first and second level correction parameters S07 and S08 output from the information transmitter 2000 (S801 and S802).

次に、取得した第１,第２のレベル補正パラメータＳ０７,Ｓ０８を次の算定式に代入することで、レベル補正オフセットが算定される（Ｓ８０３）。 Next, the level correction offset is calculated by substituting the acquired first and second level correction parameters S07 and S08 into the following calculation formula (S803).

レベル補正オフセット＝
（第１のレベル補正パラメータＳ０７と第２のレベル補正パラメータＳ０８との間の相対比）＋（第２のレベル補正パラメータＳ０８）
ここで、第１のレベル補正パラメータＳ０７と第２のレベル補正パラメータＳ０８との相対比Ｃ１とは、第１のレベル補正パラメータＳ０７の入力調整量に対する第１のレベル補正パラメータＳ０７の実体値の出力変化量Ｃ２と、第２のレベル補正パラメータＳ０８の入力調整量に対する第２のレベル補正パラメータＳ０８の実体値の出力変化量Ｃ３との間の相対比を示し、具体的には、次の算定式により求められる。 Level correction offset =
(Relative ratio between the first level correction parameter S07 and the second level correction parameter S08) + (second level correction parameter S08)
Here, the relative ratio C1 between the first level correction parameter S07 and the second level correction parameter S08 is the output of the actual value of the first level correction parameter S07 with respect to the input adjustment amount of the first level correction parameter S07. The relative ratio between the change amount C2 and the output change amount C3 of the actual value of the second level correction parameter S08 with respect to the input adjustment amount of the second level correction parameter S08 is shown. Is required.

Ｃ１＝Ｃ２／Ｃ３
本実施の形態では、Ｃ１＝３／１＝３となる。 C1 = C2 / C3
In this embodiment, C1 = 3/1 = 3.

Ｓ８０３で求められたレベル補正オフセットに基づき、レベル判定器１１０１〜１１０３は、演算用係数記憶器１４００から基準音量データＳ１０の実体値データＤ０〜Ｄ２１を読み出す。 Based on the level correction offset obtained in S803, the level determiners 1101 to 1103 read the entity value data D0 to D21 of the reference volume data S10 from the calculation coefficient storage 1400.

具体的には、次の算定式に示すように、レベル補正オフセットに演算用係数記憶器１４００の係数データサイズを乗算することで、係数読み出しアドレスオフセットを算定する（Ｓ８０４）。 Specifically, as shown in the following calculation formula, the coefficient read address offset is calculated by multiplying the level correction offset by the coefficient data size of the calculation coefficient memory 1400 (S804).

係数読み出しアドレスオフセット＝
レベル補正オフセット×係数データサイズ
係数読み出しアドレスオフセットとは、演算用係数記憶器１４００において基準音量Ｓ１０の実体値データＤ０〜Ｄ２１が格納されているメモリ領域の先頭アドレスからその実体値データＤｎ（０≦ｎ≦２１）が格納されているアドレス位置までの間隔を示すオフセットのことである。 Coefficient read address offset =
Level correction offset × coefficient data size The coefficient read address offset is the actual value data Dn (0 ≦ 0) from the start address of the memory area where the actual value data D0 to D21 of the reference volume S10 is stored in the coefficient storage 1400 for calculation. n ≦ 21) is an offset indicating an interval to the stored address position.

次に、Ｓ８０４で算定した係数読み出しアドレスオフセットを、演算用係数記憶器１４００の先頭アドレスに加算することで、演算用係数記憶器１４００においてその基準音量Ｓ１０の実体値データＤｎが格納されているメモリ領域のアドレスが特定される（Ｓ８０５）。 Next, the coefficient read address offset calculated in S804 is added to the head address of the calculation coefficient memory 1400, so that the actual value data Dn of the reference volume S10 is stored in the calculation coefficient memory 1400. The address of the area is specified (S805).

Ｓ８０５で算定したアドレスが演算用係数アドレス情報Ｓ０９となる。この演算用係数アドレス情報Ｓ０９が、レベル判定器１１０１〜１１０３から演算用係数１４００に送信される。演算用係数記憶器１４００では、演算用係数アドレス情報Ｓ０９で表されたアドレスに格納されている基準音量データＳ１０の実体値データＤｎが読み出されてレベル判定器１１０１〜１１０３に送信される。 The address calculated in S805 is the calculation coefficient address information S09. The calculation coefficient address information S09 is transmitted from the level determiners 1101 to 1103 to the calculation coefficient 1400. In the calculation coefficient storage 1400, the actual value data Dn of the reference volume data S10 stored at the address represented by the calculation coefficient address information S09 is read and transmitted to the level determination units 1101-1103.

レベル判定器１１０１〜１１０３では、レベル検出データと基準音量Ｓ１０との音量レベルを比較し、その結果であるレベル判定結果信号Ｓ１１が、レベル補正器１２０１〜１２０３に送信される。 The level determiners 1101 to 1103 compare the volume levels of the level detection data and the reference volume S10, and a level determination result signal S11 as a result thereof is transmitted to the level correctors 1201 to 1203.

このようにレベル判定器１１０１〜１１０３では、重複する基準音量データＳ１０のステップ値と基準音量データＳ１０の実体値データＤ０〜Ｄ２１との間の関係を、上述した読み出し方法を設定することで整理されている。したがって、演算用係数記憶器１４００において、基準音量データＳ１０の実体値データＤ０〜Ｄ２１を重複させることなく格納することができる。これにより、演算用係数記憶器１４００に必要となるメモリの容量を削減することができ、その分、低コスト化が実現される。 As described above, in the level determiners 1101 to 1103, the relationship between the overlapping step values of the reference volume data S10 and the entity value data D0 to D21 of the reference volume data S10 is arranged by setting the above-described reading method. ing. Therefore, the calculation coefficient storage 1400 can store the entity value data D0 to D21 of the reference volume data S10 without overlapping. As a result, the memory capacity required for the arithmetic coefficient memory 1400 can be reduced, and the cost can be reduced accordingly.

レベル補正器１２０１〜１２０３の動作の詳細を説明する。図９は、レベル補正器１２０１〜１２０３の構成を示す図である。レベル補正器１２０１〜１２０３は、処理チャンネル切替器１２１０と、チャンネル処理指定器１２２０と、レベル補正初期値切替器１２３０と、レベル補正値算出器１２４０と、レベル補正平滑器１２５０とを有する。 Details of the operations of the level correctors 1201 to 1203 will be described. FIG. 9 is a diagram illustrating the configuration of the level correctors 1201 to 1203. The level correctors 1201 to 1203 include a processing channel switcher 1210, a channel processing designator 1220, a level correction initial value switcher 1230, a level correction value calculator 1240, and a level correction smoother 1250.

レベル補正器１２０１〜１２０３は、情報伝達器２０００を介して、第１のレベル補正パラメータＳ０７と第２のレベル補正パラメータＳ０８と処理チャンネル情報Ｓ１２とチャンネル処理指定情報Ｓ１３とレベル補正初期値情報Ｓ１４とを受信する。なお、これら情報Ｓ０７,Ｓ０８,Ｓ１２,Ｓ１３,Ｓ１４は、音声処理装置１０００の使用者により入力器１６００に入力される。これら情報Ｓ１２,Ｓ１３,Ｓ１４は、図２に図示していない。 The level correctors 1201 to 1203 are connected to the first level correction parameter S07, the second level correction parameter S08, the processing channel information S12, the channel processing designation information S13, and the level correction initial value information S14 via the information transmitter 2000. Receive. These pieces of information S07, S08, S12, S13, and S14 are input to the input device 1600 by the user of the speech processing apparatus 1000. These information S12, S13, S14 are not shown in FIG.

処理チャンネル切替器１２１０は、受信する処理チャンネル情報Ｓ１２に基づいて、このレベル補正器１２０１〜１２０３でレベル補正を受け持つ音声データのチャンネルを決定する。 The processing channel switch 1210 determines the channel of the audio data that is responsible for level correction by the level correctors 1201 to 1203 based on the received processing channel information S12.

チャンネル処理指定器１２２０は、そのレベル補正器１２０１〜１２０３においてレベル補正処理を受け持つチャンネルにレベル補正処理を実施するか否かを決定する。処理実施の有無は、情報伝達器２０００を介してレベル補正器１２０１〜１２０３が受信するチャンネル処理指定情報Ｓ１３に基づいて行われる。チャンネル処理指定情報Ｓ１３は、音声処理装置１０００の使用者が設定して音声処理装置１０００に入力するデータである。 The channel processing designator 1220 determines whether or not to perform level correction processing on the channel responsible for level correction processing in the level correctors 1201 to 1203. Whether or not the process is performed is determined based on channel processing designation information S13 received by the level correctors 1201 to 1203 via the information transmitter 2000. The channel processing designation information S13 is data set by the user of the voice processing apparatus 1000 and input to the voice processing apparatus 1000.

レベル補正初期値切替器１２３０は、情報伝達器２０００を介して受信するレベル補正初期値情報Ｓ１４に基づき、レベル補正器１２０１〜１２０３のレベル補正初期値を決定する。 The level correction initial value switch 1230 determines the level correction initial values of the level correctors 1201 to 1203 based on the level correction initial value information S14 received via the information transmitter 2000.

レベル補正値算出器１２４０は、対応するレベル判定器１１０１〜１１０Ｘから受信するレベル判定結果信号Ｓ１１に基づいて次のようなレベル補正処理を行う。レベル判定結果信号結果信号Ｓ１１が、（入力音声データＳ０６＞基準音量データＳ１０）を示す場合は、レベル補正器１２０１〜１２０Ｘは、受け持つチャンネルデータの出力レベルを、第２のレベル補正パラメータＳ０８により決定される補正パターンＧ２２に従い、入力レベルより小さくする処理を実施する。反対に、（入力音声データＳ０６＜基準音量データＳ１０）を示す場合は、受け持つチャンネルデータの出力レベルを、第１のレベル補正パターンＳ０７により決定される補正パターンＧ２１に従い、入力レベルより大きくする処理を実施する。処理を受け持つチャンネルは、処理チャンネル切替器１２１０により指定される。 The level correction value calculator 1240 performs the following level correction processing based on the level determination result signal S11 received from the corresponding level determiners 1101 to 110X. When the level determination result signal result signal S11 indicates (input audio data S06> reference volume data S10), the level correctors 1201 to 120X determine the output level of the channel data to be handled by the second level correction parameter S08. In accordance with the correction pattern G22 to be performed, a process of making it smaller than the input level is performed. On the other hand, when (input audio data S06 <reference volume data S10) is indicated, a process of making the output level of the channel data to be handled larger than the input level according to the correction pattern G21 determined by the first level correction pattern S07. carry out. The channel responsible for processing is designated by the processing channel switch 1210.

レベル補正平滑器１２５０は、レベル補正値算出器１２４０から出力される補正処理済データと、前回のサンプリング時点において補正処理された処理済データとの間で平滑化処理を行い、その平滑化処理済データを、当該サンプル時点における出力音声データＳ１５として出力する。 The level correction smoother 1250 performs a smoothing process between the corrected data output from the level correction value calculator 1240 and the processed data corrected at the previous sampling time, and the smoothed process is completed. The data is output as output audio data S15 at the sample time.

平滑化処理の詳細を図１０のフローチャートを参照して説明する。以下に説明する平滑化処理は、特に記載しない処理以外は基本的にレベル補正平滑器１２５０により実施される。 The details of the smoothing process will be described with reference to the flowchart of FIG. The smoothing process described below is basically performed by the level correction smoother 1250 except for a process not particularly described.

レベル補正値算出器１２４０から今回のレベル補正算定値が入力されたのち（Ｓ１００１）、入力された今回のレベル補正算定値が入力処理最初のデータであるか否かが、レベル補正初期値切替器１２３０において判断される（Ｓ１００２）。 After the current level correction calculation value is input from the level correction value calculator 1240 (S1001), it is determined whether or not the input current level correction calculation value is the first data in the input process. The determination is made at 1230 (S1002).

Ｓ１００２で、処理最初のデータではないと判断する場合、レベル補正初期値切替器１２３０は、レベル補正平滑器１２５０において順次更新記憶している前回の平滑化後の補正値を、そのまま前回の平滑化後の補正値として用いる指示をレベル補正平滑器１２５０に出力する。 If it is determined in S1002 that the data is not the first data, the level correction initial value switch 1230 uses the previous smoothed correction value that is sequentially updated and stored in the level correction smoother 1250 as it is. An instruction to be used as a subsequent correction value is output to the level correction smoother 1250.

一方、Ｓ１００２で、今回のレベル補正算定値が処理最初のデータであると判断する場合、レベル補正初期値切替器１２３０は、レベル補正平滑器１２５０においてレベル補正初期値情報Ｓ１４により指定されるレベル補正初期値を前回の平滑化後の補正値として用いる指示をレベル補正平滑器１２５０に出力する（Ｓ１００３）。 On the other hand, if it is determined in S1002 that the current level correction calculation value is the first data of processing, the level correction initial value switch 1230 is the level correction specified by the level correction initial value information S14 in the level correction smoother 1250. An instruction to use the initial value as the correction value after the previous smoothing is output to the level correction smoother 1250 (S1003).

レベル補正初期値切替器１２３０において、上述したＳ１００１〜１００３の処理が実施されたのち、レベル補正平滑器１２５０では、次の算定式に基づいて、今回の平滑化後の補正値が算定される（Ｓ１００４）。 After the processing of S1001 to S1003 described above is performed in the level correction initial value switch 1230, the level correction smoother 1250 calculates the correction value after the current smoothing based on the following calculation formula ( S1004).

今回の平滑化後の補正値＝
（今回のレベル補正算出値−前回の平滑化後の補正値）×平滑回路収束時定数
今回の平滑化後の補正値がＳ１００４で算定されたのち、レベル補正平滑器１２５０では、次の算定式に基づいて、補正を担当するチャンネルデータにおける今回の補正処理後の出力音声レベル（平滑処理済）が算定される（Ｓ１００５）。 Correction value after smoothing this time =
(Current level correction calculation value−correction value after previous smoothing) × smoothing circuit convergence time constant After the current smoothing correction value is calculated in S1004, the level correction smoother 1250 uses the following calculation formula: Based on the above, the output sound level (smoothed) after the current correction processing in the channel data in charge of correction is calculated (S1005).

今回の出力音量レベル＝
今回のレベル補正算出値＋今回の平滑化後の補正値
補正を担当するチャンネルデータにおける今回の補正処理後の出力音声レベル（平滑処理済）がＳ１００５で算出されたのち、次の補正処理に移行するために、レベル補正平滑器１２５０では、今回の平滑化後の補正値が、前回の平滑化後の補正値として更新記録される（Ｓ１００６）。以上の処理が実施されたのち、今回の補正処理後の出力音声レベル（平滑処理済）が、レベル補正平滑器１２５０から外部に出力される（Ｓ１００７）。 This output volume level =
Current level correction calculated value + current smoothed correction value After the output sound level (smoothed) after the current correction process in the channel data in charge of correction is calculated in S1005, the process proceeds to the next correction process. Therefore, in the level correction smoother 1250, the correction value after the current smoothing is updated and recorded as the correction value after the previous smoothing (S1006). After the above processing is performed, the output sound level (smoothed) after the current correction processing is output from the level correction smoother 1250 to the outside (S1007).

図１１にレベル補正平滑器１２５０による平滑化処理の効果を示す。図１１は、音声処理装置１０００のサンプリング周期２１００において時系列に沿った周期時点２１００₁〜₃におけるレベル補正値算出器１２４０の補正出力とレベル補正平滑器１２５０の平滑出力とを示す。図において、Ｓ０５₁〜₃は、周期点２１００₁〜₃における入力音声データのサンプリング値であり、符号１００₁〜₃は、周期点２１００₁〜₃における上記補正出力であり、１１０₁〜₃は、周期点２１００₁〜₃における上記平滑出力であり、１２０₁〜₃は、周期点２１００₁〜₃におけるレベル補正算出値であり、１３０₁〜₃は、周期点２１００₁〜₃における平滑化後の補正値である。 FIG. 11 shows the effect of the smoothing process by the level correction smoother 1250. Figure 11 shows the smoothed output of the correction output and the level correction smoother 1250 level correction value calculator 1240 in the periodic time 2100 _{_1-3} along the time series in the sampling period 2100 of the sound processing apparatus 1000. In the figure, S05 ₁ to ₃ are sampling values of the input audio data at the periodic points 2100 ₁ to ₃ , the reference numerals 100 ₁ to ₃ are the correction outputs at the periodic points 2100 ₁ to ₃ , and 110 ₁ to ₃ are a the smooth output at periodic points 2100 _{_1-3,} 120 _{_1-3,} a level correction value calculated at periodic points 2100 _{_1-3,} 130 _{_1-3,} smoothed at periodic points 2100 _{_1-3} Is the correction value.

図１１から明らかなように、周期点２１００_2,3における平滑化後の補正値１３０_2,3は、同一の周期点２１００_2,3におけるレベル補正算出値１２０_2,3より小さくなったり、大きくなったりするものの、平滑出力１１０_2,3は、補正出力１００_2,3より変化量が小さくなる。ここでいう変化量とは、同一の周期点２１００_2,3におけるサンプリング値Ｓ０５_2,3に対する平滑出力１１０_2,3やレベル補正算出値１２０_2,3の変移量をいう。 As apparent from FIG. 11, the correction value 130 _2,3 after smoothing in cycle point 2100 _2,3, or smaller than the level correction calculation value 120 _2,3 in the same cycle points 2100 _2,3, large However, the amount of change in the smooth output 110 ₂ , ₃ is smaller than that in the correction output 100 _{2, 3} . The amount of change here refers to the amount of change in the smoothed output 110 _2,3 and the level correction calculated value 120 _2,3 with respect to the sampling value S05 _{2,3 at the} same periodic point 2100 _2,3 .

このような平滑化処理を行うことで、出力される出力音声データＳ１５は、レベル補正されているにも拘わらず聞き取りやすくなる。 By performing such a smoothing process, the output audio data S15 to be output can be easily heard regardless of the level correction.

なお、チャンネル処理指定器１２２０において、レベル補正をしないと決定された場合、そのレベル補正器１２０１〜１２０３では、担当するチャンネルのレベル補正を行わず、入力音声データＳ０６がそのまま出力音声データＳ１５として出力される。 When the channel processing designator 1220 determines not to perform level correction, the level correctors 1201 to 1203 do not perform level correction of the channel in charge, and the input audio data S06 is output as output audio data S15 as it is. Is done.

なお、上述したＳ１００２の処理では、今回のレベル補正算定値が処理最初のデータであると判断される場合、レベル補正初期値情報Ｓ１４により指定されるレベル補正初期値を、前回の平滑化後の補正値として用いる。 In the process of S1002 described above, when it is determined that the current level correction calculation value is the first data of the process, the level correction initial value specified by the level correction initial value information S14 is used after the previous smoothing. Used as a correction value.

この場合におけるレベル補正初期値は、
音量レベルの最大値（＝０ｄＢ）から音量レベルの最小値（＝−∞ｄＢ）の間で任意に設定することができる。 In this case, the level correction initial value is
It can be arbitrarily set between the maximum value of the volume level (= 0 dB) and the minimum value of the volume level (= −∞ dB).

レベル補正初期値の例として次の３例を説明する。
１．処理最初のレベル補正算定値
２．音量レベルの最大値（＝０ｄＢ）の時のレベル補正値
３．音量レベルの最小値（＝−∞ｄＢ）の時のレベル補正値
１．の場合、処理最初のサンプリング周期点２１００_Sにおける前回の平滑化処理後の補正値はこのサンプリング周期点２１００_Sにおけるレベル補正算出値と同じものである。そのため、（今回のレベル補正算出値−前回のレベル補正算出値＝０）となる。これにより、今回の平滑化後の補正値も０となる。その結果、平滑化処理が実施されない場合と同様にして、このサンプリング周期点２１００_Sにおける補正処理後の出力音声レベル（平滑処理済）は、補正処理後の出力音声レベルと同一となり、第２番目のサンプリング周期点２１００_S+1から実質的に平滑処理が実施されることになる。 The following three examples will be described as examples of the level correction initial value.
1. 1. First level correction calculation value for processing 2. Level correction value at maximum volume level (= 0 dB) Level correction value at minimum volume level (= -∞ dB) In this case, the correction value after the previous smoothing process at the first sampling cycle point 2100 _S is the same as the level correction calculation value at this sampling cycle point 2100 _S. Therefore, (current level correction calculated value−previous level correction calculated value = 0). Thereby, the correction value after the current smoothing is also zero. As a result, similarly to the case where the smoothing process is not performed, the output sound level after the correction process (smoothed) at the sampling cycle point 2100 _S is the same as the output sound level after the correction process, and the second Thus, the smoothing process is substantially performed from the sampling period point 2100 _{S + 1} .

２．の場合、処理最初のサンプリング周期点２１００_Sにおける前回の平滑化後の補正値は、音量レベルの最大値の時のレベル補正値であり、図４に示すように、全補正値中、最小の補正値となる。したがって、このサンプリング周期点２１００_S以降、サンプリング周期点２１００が時間的に移動するにつれて平滑化後の補正値が徐々に増加する制御となる。 2. In this case, the correction value after the previous smoothing at the processing first sampling cycle point 2100 _S is the level correction value at the maximum value of the volume level, and as shown in FIG. It becomes a correction value. Therefore, after this sampling cycle point 2100 _S , the smoothed correction value gradually increases as the sampling cycle point 2100 moves in time.

３．の場合、処理最初のサンプリング周期点２１００_Sにおける前回の平滑化後の補正値は、音量レベルの最小値の時のレベル補正値であり、図４に示すように、全補正値中、最大の補正値となる。したがって、このサンプリング周期点２１００_S以降、サンプリング周期点２１００が時間的に移動するにつれて平滑化後の補正値が徐々に減少する制御となる。 3. In this case, the correction value after the previous smoothing at the first sampling cycle point 2100 _S is the level correction value at the minimum value of the volume level, and as shown in FIG. It becomes a correction value. Therefore, after this sampling cycle point 2100 _S , the correction value after smoothing is gradually reduced as the sampling cycle point 2100 moves in time.

例えば、入力音声データＳ０６の先頭の音量レベルが低い場合は、補正初期値を最大にすると先頭の音声が聴きやすくなり、また、入力音声データＳ０６の先頭の音量レベルが高い場合は、補正初期値を最小、または元の音量レベルに設定すると、不自然なクリッピングを防ぐことができる。そのため、入力音声データＳ０６のレベルに応じて、または使用者の好みにより、補正初期値の設定を変更すればよい。 For example, when the volume level at the beginning of the input audio data S06 is low, it is easy to listen to the beginning sound when the correction initial value is maximized. When the volume level at the beginning of the input audio data S06 is high, the correction initial value is set. Setting to a minimum or original volume level can prevent unnatural clipping. Therefore, the setting of the correction initial value may be changed according to the level of the input audio data S06 or according to the user's preference.

音声処理装置１０００では、レベル判定部１１００の判定処理を間引くことにより、多チャンネルの音声データに対してレベル補正を行う場合であっても、処理量を削減することことで、不具合を生じさせることなく判定処理を実施することが可能となる。 In the audio processing apparatus 1000, even if level correction is performed on multi-channel audio data by thinning out the determination processing of the level determination unit 1100, a problem is caused by reducing the processing amount. It is possible to carry out the determination process without any problem.

さらには、レベル判定器１１０１〜１１０３を複数設けることにより、相関の少ないチャンネル同士を別のレベル判定器１１０１〜１１０３でレベル判定することが可能となる。これにより、入力される音声データの種類、視聴環境、使用者の好み等により設定を自由に変更することが可能となり、さまざまな状況、要望に対応することができる。 Furthermore, by providing a plurality of level determiners 1101 to 1103, it is possible to determine the levels of channels with little correlation with different level determiners 1101 to 1103. Accordingly, the setting can be freely changed according to the type of input audio data, viewing environment, user preference, and the like, and various situations and requests can be dealt with.

また、レベル補正器１２０１〜１２０３を複数設けることにより、入力される音声データの種類、視聴環境、ユーザの好みによりレベル補正度合いの設定を自由に変更できるようになり、さまざまな状況、要望に対応することができる。 In addition, by providing a plurality of level correctors 1201 to 1203, it becomes possible to freely change the level correction level setting according to the type of input audio data, viewing environment, and user preferences, and respond to various situations and demands. can do.

また、多チャンネルの音声データをどの程度の頻度で間引くかを決定するのにデコード情報を用いた場合には、音声データの形式に応じた最適な間引き頻度でのレベル補正が可能となる。 In addition, when decoding information is used to determine how often the multi-channel audio data is to be thinned out, level correction can be performed with the optimum thinning frequency according to the format of the audio data.

また、レベル判定器１１０１〜１１０３がレベル判定チャンネル指定器１１１１を備え、レベル補正器１２０１〜１２０３がチャンネル処理指定器１２２０を備えることにより、どのチャンネルをどのレベル判定器１１０１〜１１０３でレベル判定し、どのレベル補正器１２０１〜１２０３で補正するか（どの補正グラフを用いて補正するか）を自由に設定することができ、使用者の好み、音声データの種類に応じた最適な音声処理を行うことが可能となる。 Further, the level determination units 1101 to 1103 include the level determination channel designator 1111, and the level correctors 1201 to 1203 include the channel processing specification unit 1220, which level is determined by which level determination unit 1101 to 1103, It is possible to freely set which level corrector 1201 to 1203 (which correction graph is used for correction), and to perform optimum audio processing according to the user's preference and the type of audio data Is possible.

例えば、５．１ｃｈドルビーデジタルを例に取れば、左右フロントを用いたＬＦＥ（Low Frequency Effect）をレベル補正することなどが可能となる。 For example, taking 5.1ch Dolby Digital as an example, it is possible to correct the level of LFE (Low Frequency Effect) using the left and right fronts.

また、レベル補正器１２０１〜１２０３が処理チャンネル切替器１２１０を備えることにより、任意のチャンネルの処理に対して、レベル補正の実施を行うか行わないかを指定することが可能となり、入力される音声データの種類、視聴環境、使用者の好みにより設定を自由に変更できるようになり、さまざまな状況、要望に対応することができる。 In addition, since the level correctors 1201 to 1203 include the processing channel switcher 1210, it is possible to specify whether or not to perform level correction for processing of an arbitrary channel, and input audio. Settings can be changed freely according to the type of data, viewing environment, and user preferences, so that various situations and requests can be met.

また、レベル補正器１２０１〜１２０３がレベル補正初期値切替器１２３０を備えることにより、レベル補正初期値を自由に設定することが可能となる。レベル補正初期値は音声データの初期段階の補正レベルに影響を与える。例えば、初期段階ではあまり補正をかけない方を好む使用者が存在する一方、初期段階ではたいていの場合は音声レベルが小さいため大きめに補正をかけることを好む使用者も存在する。さらには、レベル補正初期値をどのように設定するかは、音源の性質によっても微妙に異なる。レベル補正初期値切替器１２３０を設けることにより、入力音声データの初期部分のデータの音量レベルにより、または使用者の好みによりレベル補正初期値の設定を自由に変更できるようになる。 Further, since the level correctors 1201 to 1203 include the level correction initial value switch 1230, the level correction initial value can be freely set. The initial level correction value affects the initial correction level of the audio data. For example, while there are users who prefer not to make much correction at the initial stage, there are also users who prefer to make a larger correction at the initial stage because the audio level is usually low. Furthermore, how to set the level correction initial value is slightly different depending on the nature of the sound source. By providing the level correction initial value switch 1230, the setting of the level correction initial value can be freely changed according to the volume level of the data of the initial portion of the input voice data or according to the user's preference.

なお、実施の形態１において、レベル判定チャンネル指定器１１１１、処理チャンネル切替器１２１０、チャンネル処理指定器１２２０、レベル補正初期値切替器１２３０などを全て備える必要は必ずしもない。例えば、レベル補正初期値が固定で良ければ、レベル補正初期値切替器１２３０を備える必要はない。また、処理チャンネル切替器１２１０とチャンネル処理指定器１２２０との双方を備える必要もない。レベル補正初期値切替器１２３０とチャンネル処理指定器１２２０を備えていない構成を、図１２に示す。 In the first embodiment, it is not always necessary to provide all of the level determination channel designator 1111, the processing channel switch 1210, the channel process designator 1220, the level correction initial value switch 1230, and the like. For example, if the level correction initial value may be fixed, the level correction initial value switch 1230 need not be provided. Further, it is not necessary to provide both the processing channel switch 1210 and the channel processing designator 1220. FIG. 12 shows a configuration in which the level correction initial value switching unit 1230 and the channel processing designation unit 1220 are not provided.

さらには、レベル補正部１２００は複数のレベル補正器１２０１〜１２０３を設ける必要は必ずしもなく、単一のレベル補正器を備えたものであってもよい。単一のレベル補正器を設ける場合は、レベル補正は１種類の補正パターンで行われる。しかしながら、チャンネルごとに基準音量データＳ１０より大きいか否かの判定は可能であるので、従来と比較すると自由な音声処理が可能となる。単一のレベル補正器を設ける場合の構成を図１３に示す。 Furthermore, the level correction unit 1200 is not necessarily provided with a plurality of level correctors 1201 to 1203, and may be provided with a single level corrector. When a single level corrector is provided, level correction is performed with one type of correction pattern. However, since it is possible to determine whether or not the volume is larger than the reference volume data S10 for each channel, free audio processing can be performed as compared with the related art. A configuration in the case of providing a single level corrector is shown in FIG.

また、実施の形態１においては各々のレベル判定器１１０１〜１１０３には全てのチャンネルの音声データが入力され、レベル判定チャンネル指定器１１１１によってレベル判定を行うチャンネルを指定している。しかしながら、入力音声データＳ０６の全てのチャンネルデータを、全てのレベル判定器１１０１〜１１０３に入力する必要はない。例えば６チャンネルデータを有する音声データが入力される場合、
・レベル判定器１１０１には、第１のチャンネルデータだけが独占的に供給される、
・レベル判定器１１０２には、第２〜第５のチャンネルデータだけが独占的に供給されたうえで、レベル判定チャンネル指定器１１１１によって、レベル判定するチャンネルデータが指定される、
・レベル判定器１１０３には、全てのチャンネルデータが供給されたうえで、レベル判定チャンネル指定器１１１１によってレベル判定するチャンネルデータが指定される、
という構成であっても良い。 In the first embodiment, audio data of all channels is input to each of the level determiners 1101 to 1103, and a channel for performing level determination is specified by the level determination channel specifying unit 1111. However, it is not necessary to input all the channel data of the input audio data S06 to all the level determiners 1101 to 1103. For example, when audio data having 6-channel data is input,
Only the first channel data is exclusively supplied to the level determiner 1101.
Only the second to fifth channel data are exclusively supplied to the level determination unit 1102, and then the level determination channel specification unit 1111 specifies the channel data for level determination.
The level determination unit 1103 is supplied with all channel data, and the level determination channel specification unit 1111 specifies channel data for level determination.
It may be configured as follows.

この場合、レベル判定器１１０１は第１のチャンネル専用なのでレベル判定チャンネル指定器１１１１を備える必要はない。このような構成はレベル補正器１２０１〜１２０３についても同様である。 In this case, since the level determination unit 1101 is dedicated to the first channel, it is not necessary to include the level determination channel designator 1111. Such a configuration is the same for the level correctors 1201 to 1203.

（実施の形態２）
図１４は本発明の実施の形態２に係る音声処理装置３０００の構成を示す図である。音声処理装置３０００では、レベル判定間引きコントローラ１３００が、デコーダ別レベル判定間引きコントローラ切替器ではなく、同時実行機能別レベル判定間引きコントローラ切替器１３２０を備えている点で実施の形態１と異なる。同時実行機能別レベル判定間引きコントローラ１３２０は、音声処理装置３０００の内部で生成されて情報伝達器２０００を介して受信する同時実行機能情報Ｓ１６に基づき、レベル判定器１１０１〜１１０３のレベル判定を行う間隔を決定する。例えば、音声処理装置３０００が、前述した本発明の補正処理とともに、処理１（バーチャルサウンド生成処理）と、処理２（出力帯域拡張処理）と、処理３（イコライザ処理）とを同時に実行する機能を備えている場合を想定する。この場合、図１５に示すように、同時実行機能別レベル判定間引きコントローラ切替器１３２０は、本発明の補正処理と同時に実行する他の処理の有無を判断したうえで、同時実行する処理が音声処理装置３０００に与える負荷の大きさに応じて、レベル判定器１１０１〜１１０３のレベル判定を行う間隔を決定する。図１５の場合の例として、処理１〜３が本発明の補正処理と全く同時に実行されない場合は、判定実行頻度（間引き間隔）は、２サンプリング周期に設定される。一方、全ての処理１〜３が本発明の補正処理と同時に実行される場合は、判定実行頻度（間引き間隔）は、３２サンプリング周期に設定される。 (Embodiment 2)
FIG. 14 is a diagram showing a configuration of a sound processing device 3000 according to Embodiment 2 of the present invention. The sound processing device 3000 is different from the first embodiment in that the level determination thinning controller 1300 includes a level determination thinning controller switching unit 1320 for each simultaneous execution function instead of a level determination thinning controller switching unit for each decoder. The simultaneous execution function level determination decimation controller 1320 generates the level determination of the level determination units 1101 to 1103 based on the simultaneous execution function information S16 generated inside the speech processing device 3000 and received via the information transmitter 2000. To decide. For example, the audio processing device 3000 has a function of simultaneously executing the processing 1 (virtual sound generation processing), the processing 2 (output band expansion processing), and the processing 3 (equalizer processing) together with the correction processing of the present invention described above. Assume that you have it. In this case, as shown in FIG. 15, the simultaneous execution function-specific level determination decimation controller switching unit 1320 determines whether or not there is another process executed simultaneously with the correction process of the present invention, and the simultaneously executed process is an audio process. In accordance with the magnitude of the load applied to the device 3000, the interval for performing the level determination of the level determiners 1101-1103 is determined. As an example in the case of FIG. 15, when the processes 1 to 3 are not executed at the same time as the correction process of the present invention, the determination execution frequency (decimation interval) is set to 2 sampling periods. On the other hand, when all the processes 1 to 3 are executed simultaneously with the correction process of the present invention, the determination execution frequency (decimation interval) is set to 32 sampling periods.

これにより、同時に実行される処理が多いときは間引き頻度を増やし、同時に実行される機能が少ないときは間引き頻度を減らすことことで、全体としての最適な処理量を実現できる。 As a result, it is possible to realize the optimum processing amount as a whole by increasing the thinning frequency when there are many processes executed simultaneously and decreasing the thinning frequency when there are few functions executed simultaneously.

（実施の形態３）
図１６は、本発明の実施の形態３に係る音声処理装置４０００の構成を示す図である。 (Embodiment 3)
FIG. 16 is a diagram showing a configuration of a speech processing apparatus 4000 according to Embodiment 3 of the present invention.

音声処理装置４０００は、上述した音声処理装置１０００または音声処理装置３０００の構成に加えて、レベル判定サンプル指定器１７００を有する。レベル判定サンプル指定器１７００は、レベル判定間引きコントローラ１３００による判定間引き制御において間引き位置を指定する乱数を発生させてレベル判定間引きコントローラ１３００に供給する。レベル判定間引きコントローラ１３００は、レベル判定サンプル指定器１７００から供給される乱数に基づいて判定間引き制御における間引き位置、言い換えれば、所定期間内における判定対象音声データサンプルの配置位置を設定する。 The voice processing device 4000 includes a level determination sample designator 1700 in addition to the configuration of the voice processing device 1000 or the voice processing device 3000 described above. The level determination sample designator 1700 generates a random number that specifies a thinning position in the determination thinning control by the level determination thinning controller 1300 and supplies the random number to the level determination thinning controller 1300. The level determination thinning controller 1300 sets the thinning position in the determination thinning control based on the random number supplied from the level determination sample designator 1700, in other words, the arrangement position of the determination target audio data sample within a predetermined period.

音声処理装置４０００では、レベル判定を行う判定対象音声データサンプルの位相をばらつかせることが可能となる。 In the audio processing device 4000, it is possible to vary the phase of the determination target audio data sample for performing the level determination.

前述した本発明の判定対象サンプルの間引き処理を実施する場合、次のような判定誤差が生じる可能性がある。すなわち、間引きサンプルの配置位置に基づく周波数（間引き周波数）と音声処理装置１０００,１３００の入力音声データＳ０６,Ｓ０５のサンプル周波数とが互いに整数倍になる状態で上記間引きを行うと、入力音声データＳ０６,Ｓ０５が正弦波などの定常波である場合、判定対象音声データの位置が固定されてしまってレベル判定部１１００による判定に誤差が生じる可能性がある。 When the above-described determination target sample thinning process of the present invention is performed, the following determination error may occur. That is, if the above-described thinning is performed in a state where the frequency based on the arrangement position of the thinned samples (thinning frequency) and the sample frequencies of the input voice data S06 and S05 of the voice processing apparatuses 1000 and 1300 are an integral multiple of each other, the input voice data S06. , S05 is a stationary wave such as a sine wave, the position of the determination target audio data is fixed, and an error may occur in the determination by the level determination unit 1100.

これに対して、音声処理装置４０００では、レベル判定を行う判定対象音声データサンプルの位相をばらつかせることが可能となるため、上記判定誤差を可及的に小さくすることができる。 On the other hand, since the audio processing device 4000 can vary the phase of the determination target audio data sample for performing the level determination, the determination error can be reduced as much as possible.

この発明を詳細にその最も好ましい実施の形態について説明したが、その好ましい実施形態についての部品の組み合わせと配列は、後に請求するこの発明の精神と範囲とに反することなく種々変更することができるものである。 Although the invention has been described in detail with respect to the most preferred embodiment thereof, the combination and arrangement of parts for the preferred embodiment can be variously modified without departing from the spirit and scope of the invention as claimed later. It is.

本発明の実施の形態１の音声処理装置の全体構成を示す図である。It is a figure which shows the whole structure of the speech processing unit of Embodiment 1 of this invention. 実施の形態１の音声処理装置の詳細構成例を示す図である。2 is a diagram illustrating a detailed configuration example of a speech processing apparatus according to Embodiment 1. FIG. 図３（Ａ）,図３（Ｂ）は、サンプリング周期２１００の説明図である。3A and 3B are explanatory diagrams of the sampling period 2100. FIG. 実施の形態１の音声処理装置への入力音声データの音量レベルとそれに対する出力音声データの音量レベルの一例を表すグラフである。4 is a graph showing an example of a volume level of input sound data to the sound processing apparatus of Embodiment 1 and a sound volume level of output sound data corresponding thereto. 実施の形態１のレベル判定器の構成を示す図である。FIG. 3 is a diagram illustrating a configuration of a level determiner according to the first embodiment. 本発明に第１のレベル補正パラメータと第２のレベル補正パラメータとの関係を示すテーブルである。4 is a table showing a relationship between a first level correction parameter and a second level correction parameter according to the present invention. 演算用係数記憶器のメモリ構成を示す図である。It is a figure which shows the memory structure of the coefficient memory | storage device for a calculation. 本発明の演算係数の設定方法を示すフローチャートである。It is a flowchart which shows the setting method of the calculation coefficient of this invention. 実施の形態１のレベル補正器の構成を示す図である。FIG. 3 is a diagram illustrating a configuration of a level corrector according to the first embodiment. レベル補正平滑器による平滑処理を示すフローチャートである。It is a flowchart which shows the smoothing process by a level correction | amendment smoother. レベル補正平滑器による平滑処理の効果を示す図である。It is a figure which shows the effect of the smoothing process by a level correction | amendment smoother. 実施の形態１の別の構成を示す図である。6 is a diagram showing another configuration of the first embodiment. FIG. 実施の形態１のさらに別の構成例を示す図である。6 is a diagram showing still another configuration example of the first embodiment. FIG. 本発明の実施の形態２の構成例を示す図である。It is a figure which shows the structural example of Embodiment 2 of this invention. 実施の形態２による判定頻度調整処理の説明図である。12 is an explanatory diagram of determination frequency adjustment processing according to Embodiment 2. FIG. 実施の形態３による判定頻度調整処理の説明図である。12 is an explanatory diagram of determination frequency adjustment processing according to Embodiment 3. FIG. 従来の音声処理装置の構成を示す図である。It is a figure which shows the structure of the conventional audio processing apparatus.

Explanation of symbols

１０００,３０００音声処理装置１１００,３１００レベル判定部
１１０１〜１１０３レベル判定器１１１０レベル判定チャンネル指定器
１１１１レベル判定チャンネル指定器１２００,３２００レベル補正部
１２０１〜１２０３レベル補正器１２１０処理チャンネル切替器
１２２０チャンネル処理指定器１２３０レベル補正初期値切替器
１２４０レベル補正値算出器１２５０レベル補正平滑器
１３００レベル判定間引きコントローラ
１３１０デコーダ別レベル判定間引きコントローラ切替器
１３２０同時実行機能別レベル判定間引きコントローラ切替器
１４００演算用係数記憶器１５００レベル判定部数切替器
１６００入力器２０００情報伝達器
２１００サンプリング周期Ｓ０１デコーダ情報
Ｓ０２レベル判定間隔情報Ｓ０３レベル判定部数情報
Ｓ０４動作指示情報Ｓ０５レベル判定チャンネル情報
Ｓ０６入力音声データＳ０７第１のレベル補正パラメータ１
Ｓ０８第２のレベル補正パラメータ２Ｓ０９演算用係数アドレス情報
Ｓ１０基準音量データＳ１１レベル判定結果信号
Ｓ１２処理チャンネル情報Ｓ１３チャンネル処理指定情報
Ｓ１４レベル補正初期値情報Ｓ１５出力音声データ
Ｓ１６同時実行機能情報
Ｇ１レベル補正を実施しなかった場合の入力音量レベルに対する出力音量レベルのグラフ
Ｇ２レベル補正を実施した場合の入力音量レベルに対する出力音量レベルのグラフの一例
１００₁〜₃ 補正出力１１０₁〜₃ 平滑出力
１２０₁〜₃ レベル補正算出値１３０₁〜₃ 平滑化後の補正値
Ｈ１音声レベル領域

1000, 3000 Audio processing device 1100, 3100 Level determination unit 1101 to 1103 Level determination unit 1110 Level determination channel specification unit 1111 Level determination channel specification unit 1200, 3200 Level correction unit 1201 to 1203 Level correction unit 1210 Processing channel switching unit 1220 Channel processing Designator 1230 Level correction initial value switcher 1240 Level correction value calculator 1250 Level correction smoother 1300 Level determination thinning controller 1310 Decoder-specific level determination thinning controller switcher 1320 Simultaneous determination function-specific level determination thinning controller switcher 1400 Coefficient storage for calculation Device 1500 Level determination copy number switch 1600 Input device 2000 Information transmitter 2100 Sampling cycle S01 Decoder information S02 Level determination interval information S 3 level judgment copy number information S04 operation instruction information S05 level determined channel information S06 input audio data S07 first level correction parameter 1
S08 Second level correction parameter 2 S09 Calculation coefficient address information S10 Reference volume data S11 Level determination result signal S12 Processing channel information S13 Channel processing designation information S14 Level correction initial value information S15 Output audio data S16 Simultaneous execution function information G1 Level correction Graph G2 of the output volume level with respect to the input volume level when the correction is not performed Example of a graph of the output volume level with respect to the input volume level when the level correction is performed 100 ₁ to ₃ Correction output 110 ₁ to ₃ Smooth output 120 ₁ to _3- level correction calculation value 130 ₁ to ₃ Correction value H1 after smoothing Audio level area

Claims

A level determination unit for determining the volume level of the input audio data;
A level correction unit that corrects the volume level based on the determination result of the level determination unit;
A level determination thinning controller that adjusts the frequency with which the level determination unit performs level determination;
An audio processing apparatus comprising:

The speech processing apparatus according to claim 1,
The audio data is composed of a plurality of channel data,
The audio processing apparatus, wherein the level determination unit includes a plurality of level determination units that perform level determination of one or more channel data.

The speech processing apparatus according to claim 1,
The level determination unit determines the audio level in synchronization with a sampling cycle of basic processing of the audio processing device,
The audio processing apparatus according to claim 1, wherein the level determination thinning controller periodically thins out arbitrary periodic points from the respective periodic points of the sampling period, and determines the audio level at the remaining periodic points.

The speech processing apparatus according to claim 1,
The level determination thinning controller changes a level determination thinning frequency based on decoder information indicating a data format of the input audio data.

The speech processing apparatus according to claim 1,
The level determination decimation controller changes the level determination decimation frequency according to other processing contents performed by the audio processing apparatus on the audio data simultaneously with the level correction processing by the level correction unit. Processing equipment.

The speech processing apparatus according to claim 2, wherein
The level correction unit has a plurality of level correctors corresponding to each of the plurality of level determiners.

The speech processing apparatus according to claim 2, wherein
The speech processing apparatus, wherein the level determination unit includes a level determination channel designator that determines channel data for performing level determination from the plurality of channel data.

The speech processing apparatus according to claim 6, wherein
At least one of the level correctors receives a plurality of the channel data, and the level corrector determines a channel data for performing channel correction from the plurality of input channel data. A speech processing apparatus comprising a processing designator.

The speech processing apparatus according to claim 6, wherein
A plurality of channel data of the input audio data is input to at least one of the level correctors, and the level corrector performs level correction on each of the input channel data. An audio processing device comprising a processing channel switch for determining whether or not.

The speech processing apparatus according to claim 6, wherein
At least one of the level correctors includes a level correction initial value switch for changing a correction initial value at the start of level correction.

The speech processing apparatus according to claim 1,
The level correction unit divides the input sound level into a plurality of sound level areas according to the level, and sets a correction pattern for each divided sound level area to correct the volume level. ,
The level determination unit includes a first linear expression (Y = a ₁ X + b ₁ ) indicating a correction pattern in which a boundary between the adjacent sound level regions is set to one of the adjacent sound level regions; A second linear expression (Y = a ₂ X + b ₂ ) indicating a correction pattern set in the other adjacent sound level region;
A speech processing device characterized in that it is calculated as an intersection of
X: Input voice data of the voice processing device
Y: Output audio data of the audio processing device

The speech processing apparatus according to claim 11,
The first level correction parameter for finely adjusting the value of b ₁ in the first primary expression and the second level correction parameter for finely adjusting the value of b _{2 in} the _second primary expression An input device that is input by a user of the audio processing device;
A coefficient storage for calculation in which entity value data of the boundary value is stored;
Further comprising
The level determination unit includes a table for specifying boundary specifying data based on a correlation between the first level correction parameter and the second level correction parameter, and the level determination unit is connected to the input device. The boundary specifying data is specified by referring to the table with the input first and second level correction parameters, and the boundary entity is obtained from the calculation coefficient storage unit based on the specified boundary specifying data. An audio processing apparatus, wherein value data is read, and an audio level region of the input audio data is specified based on the read boundary entity value data.

The speech processing apparatus according to claim 3, wherein
The level determination thinning unit arbitrarily sets the remaining periodic points for determining the audio level based on random numbers.