JP3186331B2

JP3186331B2 - Signal conversion method or apparatus, and recording medium

Info

Publication number: JP3186331B2
Application number: JP12291893A
Authority: JP
Inventors: 健三赤桐
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1993-05-25
Filing date: 1993-05-25
Publication date: 2001-07-11
Anticipated expiration: 2016-07-11
Also published as: JPH06334533A

Abstract

PURPOSE:To obtain a sound of high sound quality that people comfortably hear by converting characteristics of acoustic time signal information by varying a difference in the attitude value of a specific frequency component from other frequency components almost in a critical band. CONSTITUTION:Band-division filters 2-4 and modified DCT(MDCT) circuits 5a-5d as a converting means convert the acoustic time signal information into plural frequency components. Then a frequency component varying circuit 6, a mask circuit 10, a frequency shift peak detecting circuit 12, a dissonant frequency component detecting circuit 11, asking threshold curve detecting 16, and a minimum audible curve generating circuit 17 as an attribute varying means vary the difference in the attribute value of the frequency component, obtained from the acoustic time signal information that frequency components at least two places have different frequency resolution and time resolution among plural frequency components obtained from the converting means, from other frequency components almost in the critical band.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、例えばディジタルオー
ディオ機器に適用され、時間信号である入力オーディオ
信号に対して特に聴覚の性質を用いて音質を変更する
（すなわち時間信号情報の特性を変換する）信号変換方
法又は装置、並びにこれら方法又は装置により時間信号
情報の特性が変換された情報が記録される記録媒体に関
するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention is applied to, for example, digital audio equipment, and changes the sound quality of an input audio signal, which is a time signal, particularly by using the property of hearing (that is, converts the characteristics of time signal information). The present invention relates to a signal conversion method or apparatus, and a recording medium on which information obtained by converting characteristics of time signal information by the method or apparatus is recorded.

【０００２】[0002]

【従来の技術】従来より、音響信号情報の音質を変化さ
せる手法としては、例えば、フィルタ処理によって周波
数特性を変更する方式や、高次高調波を発生させる方
式、若しくはいわゆるコンプレサによってダイナミック
レンジを変更するなどの方式が用いられている。2. Description of the Related Art Conventionally, as a method of changing the sound quality of acoustic signal information, for example, a method of changing a frequency characteristic by a filtering process, a method of generating a higher-order harmonic, or a method of changing a dynamic range by a so-called compressor. For example, a method such as performing is used.

【０００３】[0003]

【発明が解決しようとする課題】しかし、上記フィルタ
を用いる方式の場合は、例えば中域を増強することでプ
レゼンスを上げるなどのフィルタの使用の仕方を変える
ことで音質を変化させるものであり、高次高調波を発生
させる方式の場合は、聞きやすい音を得るというよりも
効果音的な使用に供されるものである。また、上記コン
プレサによってダイナミックレンジを変更する方式は、
大きい音が耳を痛めたり小さい音が周囲の雑音にマスク
されないようにするというものである。これらの方式で
は、瞬時瞬時で変わってゆく音響信号情報の変化に対応
して聴覚的に心地好く聞こえる音にする最適なコントロ
ールは困難である。However, in the case of the method using the above-mentioned filter, the sound quality is changed by changing the way of using the filter, for example, by increasing the midrange to increase the presence. In the case of the method of generating higher-order harmonics, sound is used more effectively than sound that is easy to hear. The method of changing the dynamic range by the above compressor is as follows.
Loud sounds do not hurt your ears and quiet sounds are not masked by the surrounding noise. In these systems, it is difficult to optimally control a sound that is audibly comfortable in response to a change in acoustic signal information that changes instantaneously.

【０００４】そこで、本発明は、上述のような実情に鑑
みて提案されたものであり、人間の聴覚に照らして音質
に関して意味のある音声及び音響信号の変換が可能な信
号変換方法及び装置、並びに記録媒体を提供することを
目的とするものである。Accordingly, the present invention has been proposed in view of the above-described circumstances, and a signal conversion method and apparatus capable of converting meaningful voice and acoustic signals with respect to sound quality in light of human hearing. It is another object of the present invention to provide a recording medium.

【０００５】すなわち、本発明が解決しようとする課題
は、音響信号情報を聴覚的な原理を用いて瞬時瞬時に人
間にとって音質的に高品質に心地好く聞こえる音を作り
出す手法を与えることである。また、本発明の別の課題
は、既にディジタル化されて量子化雑音が付加されてし
まった音響信号情報からこの量子化雑音の聴覚的な影響
を減ずることにより、品質の向上を図ることである。ま
た、本発明の別の目的は、既にディジタル化されて量子
化雑音が付加されてしまったオーディオ信号情報からこ
の量子化雑音の聴覚的な影響を減じた後、本件出願人
が、先に、いわゆるコンパクトディスクのようなオーデ
ィオ機器の音質を向上させる技術として提案しているい
わゆる等ラウドネス特性やマスキング特性に合うように
量子化雑音のスペクトルを変更することによって聴感上
の雑音レベルを低減させる技術（以後この技術を例えば
スーパービットマッピング:Super Bit Mapping技術と呼
ぶことにする）、すなわち例えば特開平２−２０８１２
号公報、特開平２−１８５５５２号公報、特開平２−１
８５５５６号公報等に開示した技術を用いて、１６ビッ
トの語長を持つコンパクトディスクに記録するとき、聴
覚的な処理によって音質を向上させたデータを作ること
にある。当該スーパービットマッピング技術は、１６ビ
ットを越える語長を有するディジタル信号を１６ビット
長を有するコンパクトディスクの為に再量子化する場
合、音質向上を図ることができる。さらに、本発明の一
つの課題は、既に量子化雑音が付加されてしまったオー
ディオ信号情報について、聴覚的に音質を等価的に１６
ビット以上に一度向上させ、再び１６ビットに再量子化
する際、聴覚的に重要な周波数帯域のＳ／Ｎを１６ビッ
ト以上に保ったまま１６ビットとすることで、音質の向
上を図ることである。That is, the problem to be solved by the present invention is to provide a method of instantly and instantaneously producing a sound that sounds comfortable and high-quality for humans by using the aural principle of acoustic signal information. . Another object of the present invention is to improve the quality by reducing the auditory influence of quantization noise from audio signal information that has already been digitized and quantization noise has been added. . Another object of the present invention is to reduce the auditory effect of quantization noise from audio signal information that has already been digitized and quantization noise has been added. A technique for reducing the noise level on hearing by changing the spectrum of quantization noise to match so-called equal loudness characteristics and masking characteristics, which is proposed as a technology for improving the sound quality of audio equipment such as a so-called compact disc ( Hereinafter, this technique will be referred to as, for example, a super bit mapping technique.
JP, JP-A-2-185552, JP-A-2-1-1
When recording on a compact disk having a word length of 16 bits using the technique disclosed in Japanese Patent No. 85556 or the like, an object of the present invention is to produce data with improved sound quality by auditory processing. The super bit mapping technique can improve sound quality when a digital signal having a word length exceeding 16 bits is re-quantized for a compact disk having a 16-bit length. Further, one object of the present invention is to provide an audio signal information to which quantization noise has already been added, equivalently reducing the sound quality to 16 perceptually.
Improving the sound quality by improving the sound quality once by maintaining the S / N of the frequency band that is perceptually important at 16 bits or more when re-quantizing to 16 bits again by improving it to 16 bits or more. is there.

【０００６】[0006]

【課題を解決するための手段】本発明の信号変換方法
は、上述の目的を達成するために提案されたものであ
り、音響時間信号情報を周波数に変換して得られる周波
数成分を用いて当該音響時間信号を変換する信号変換方
法において、略臨界帯域内の各周波数成分について、近
接する周波数成分に基づく指標を異なる周波数幅で少な
くとも２つ求め、上記指標を用いて当該臨界帯域内の周
波数成分の領域を選択し、選択した上記領域の周波数成
分と当該臨界帯域内の他の周波数成分との相対的な大き
さを変更するものである。また、本発明の信号変換装置
は、上述の目的を達成するために提案されたものであ
り、音響時間信号情報を周波数に変換して得られる周波
数成分を用いて当該音響時間信号を変換する信号変換装
置において、略臨界帯域内の各周波数成分について、近
接する周波数成分に基づく指標を異なる周波数幅で少な
くとも２つ求める指標算出手段と、上記指標を用いて当
該臨界帯域内の周波数成分の領域を選択する領域選択手
段と、選択した上記領域の周波数成分と当該臨界帯域内
の他の周波数成分との相対的な大きさを変更する周波数
成分変更手段とを有するものである。A signal conversion method according to the present invention has been proposed to achieve the above-mentioned object, and uses a frequency component obtained by converting acoustic time signal information into a frequency. In the signal conversion method for converting an acoustic time signal, for each frequency component in a substantially critical band, at least two indices based on adjacent frequency components are obtained with different frequency widths, and the frequency component in the critical band is determined using the index. Is selected, and the relative magnitude between the frequency component of the selected region and other frequency components within the critical band is changed. Further, a signal conversion device of the present invention has been proposed to achieve the above object, and a signal for converting the acoustic time signal using a frequency component obtained by converting the acoustic time signal information into a frequency. In the conversion device, for each frequency component in the substantially critical band, an index calculating means for obtaining at least two indices based on adjacent frequency components with different frequency widths, and using the indices, a region of the frequency component in the critical band. It has a region selecting means for selecting and a frequency component changing means for changing a relative magnitude of a frequency component of the selected area and another frequency component in the critical band.

【０００７】ここで、上記音響時間信号情報を周波数成
分に変換する際には、前記音響時間信号情報を複数の帯
域に分割した後、それぞれの帯域信号を直交変換して複
数の周波数成分を得るようにする。なお、前記複数の周
波数成分の周波数分解能は低域ほど高いものである。Here, when converting the acoustic time signal information into frequency components, the acoustic time signal information is divided into a plurality of bands, and each band signal is orthogonally transformed to obtain a plurality of frequency components. To do. Note that the frequency resolution of the plurality of frequency components is higher as the frequency is lower.

【０００８】また、上記音響時間信号情報の特性を変え
る際には、音響時間信号情報から得られた複数の周波数
成分の少なくとも一つのローカルピークについて、略臨
界帯域内の他の周波数成分との間で属性の大きさの違い
を変えること、略臨界帯域幅の１０％から５０％の周波
数差を持つ周波数領域の他の周波数成分との間で属性の
大きさの違いを大きくすること、周波数成分から得られ
る２箇の移動ピーク値の差により、周波数成分の属性の
大きさの違いを変える周波数領域を決定すること、略臨
界帯域幅の５０％幅の移動ピーク値から略臨界帯域幅の
１０％幅の移動ピーク値を引いた値が負の周波数領域の
周波数成分を小さくするか又は削除すること、時間信号
情報の短時間エネルギを保存するように周波数成分の大
きさを調整すること、時間信号情報の短時間エネルギを
保存するように少なくとも一つのローカルピークの周波
数成分の大きさを調整すること、音響時間信号情報から
得られた周波数成分について、略臨界帯域内の他の周波
数成分の内、最小可聴限レベル又はマスキングスレッシ
ョールドレベルを越える周波数成分との間で属性の大き
さの違いを変えること、音響時間信号情報から得られた
周波数成分について、略臨界帯域内の他の周波数成分の
内、最小可聴限レベルとマスキングスレッショールドレ
ベルの大きいほうのレベルを越える周波数成分との間で
属性の大きさの違いを変えること、音響時間信号情報か
ら得られた周波数成分について、略臨界帯域内の他の周
波数成分の内、限定されたレベル範囲内の周波数成分と
の間で属性の大きさの違いを変えること、量子化雑音レ
ベルにより限定されたレベル範囲内の周波数成分との間
で属性の大きさの違いを変えること、などを行う。[0008] When changing the characteristics of the acoustic time signal information, at least one local peak of a plurality of frequency components obtained from the acoustic time signal information is interposed between other local frequency components within a substantially critical band. To change the size of the attribute, to increase the size of the attribute between other frequency components having a frequency difference of 10% to 50% of the critical bandwidth, and to increase the frequency component. From the difference between the two moving peak values obtained from the above, the frequency domain in which the difference in the magnitude of the attribute of the frequency component is changed is determined. The value obtained by subtracting the moving peak value of the% width reduces or eliminates the frequency components in the negative frequency region, and adjusts the magnitude of the frequency components so as to preserve the short-time energy of the time signal information. Adjusting the magnitude of the frequency component of at least one local peak so as to conserve the short-time energy of the time signal information, and for the frequency component obtained from the acoustic time signal information, other frequency components within a substantially critical band. Changing the difference in the magnitude of the attribute between the frequency component exceeding the minimum audible level or the masking threshold level, and for the frequency component obtained from the acoustic time signal information, Of the frequency components, changing the difference in the size of the attribute between the minimum audible level and the frequency component exceeding the larger masking threshold level, the frequency component obtained from the acoustic time signal information, Change the difference in attribute size between frequency components within a limited level range, among other frequency components within a substantially critical band If, by changing the difference magnitude of attributes between the frequency components in the level range which is limited by the quantization noise level, and the like performed.

【０００９】さらに、本発明の信号変換方法又は装置で
は、時間軸上に再合成された時間信号情報に対してノイ
ズシェイプ特性を有する再量子化処理を施すようにもし
ている。このとき、ノイズシェイプ特性は、最小可聴
限、等ラウドネス若しくはマスキング特性の少なくとも
ひとつに依存している。Further, in the signal conversion method or apparatus according to the present invention, requantization processing having a noise shape characteristic is performed on the time signal information recombined on the time axis. At this time, the noise shape characteristic depends on at least one of the minimum audibility, equal loudness, and masking characteristic.

【００１０】なお、本発明の信号変換方法又は装置にお
いては、前記属性が周波数成分の大きさである。[0010] In the signal conversion method or apparatus according to the present invention, the attribute is a magnitude of a frequency component.

【００１１】すなわち言い換えれば、本発明の信号変換
方法又は装置は、入力音響時間信号をフィルタ処理若し
くは直交変換を用いることにより周波数成分を得る。次
にこれらの周波数成分の隣接した成分毎の移動ピーク値
を、臨界帯域に関係した２つの異なる周波数幅で得て、
この２種類の移動ピーク値の差が生じる周波数帯域の周
波数成分の大きさを小さくすることにより、ローカルピ
ーク周波数成分と他の周波数成分との間の不協和度を低
減させる。入力音響時間信号を周波数軸上に展開するに
あたっては、フィルタなどにより複数の周波数帯域の時
間軸上成分を得た後、直交変換等によるブロック化周波
数分析手法を用いるか、いわゆるＱＭＦ(Quadrature Mi
rror Filter)や、ＣＱＦ(Conjugate Quadrature Filte
r) などの帯域分割フィルタをツリー構造に従属接続す
ることにより、低域から高域にかけて、徐々に周波数分
解能が低下し、逆に時間分解能が向上する帯域分割を行
う。That is, in other words, the signal conversion method or apparatus of the present invention obtains a frequency component by filtering or orthogonally transforming an input acoustic time signal. Next, moving peak values for adjacent components of these frequency components are obtained at two different frequency widths related to the critical band,
By reducing the magnitude of the frequency component in the frequency band where the difference between the two types of moving peak values occurs, the degree of dissonance between the local peak frequency component and other frequency components is reduced. In developing the input acoustic time signal on the frequency axis, a time domain component of a plurality of frequency bands is obtained by a filter or the like, and then a blocking frequency analysis method such as orthogonal transform is used, or a so-called QMF (Quadrature
rror Filter), CQF (Conjugate Quadrature Filte
By connecting the band division filters such as r) in a tree structure, the frequency resolution gradually decreases from the low band to the high band, and conversely the band resolution is improved.

【００１２】この時、低域の方が高域よりも長い時間の
ブロックでブロック化して直交変換若しくは時間軸上複
数サンプルのピーク値を取るようにしてもよい。ブロッ
クの周波数帯域幅及び時間幅は聴覚的に最適になるよう
に臨界帯域幅を充分満足する周波数分解能を与えるよう
にする。それぞれのブロックにおいて、分析により得ら
れているスペクトルは、その大きさと周波数により、マ
スキングスレッショールド（マスキングのしきい値）以
上か否かが判定され、マスキングスレッショールド以下
の場合には強さ，位相などの属性が変更されないように
する。このことは最小可聴限についても同様であり、最
小可聴限を下回る周波数成分については、たとえ移動ピ
ーク値の差がゼロではなくても変更しないようにする。At this time, the low-frequency band may be divided into blocks longer in time than the high-frequency band so as to obtain the peak values of a plurality of samples on the orthogonal transform or the time axis. The frequency bandwidth and the time width of the block should provide a frequency resolution that sufficiently satisfies the critical bandwidth so as to be acoustically optimal. In each block, the spectrum obtained by the analysis is determined based on its magnitude and frequency as to whether or not it is above a masking threshold (masking threshold). If it is below the masking threshold, the intensity is determined. , Phase, and other attributes are not changed. The same is true for the minimum audible limit, and the frequency components below the minimum audible limit are not changed even if the difference between the moving peak values is not zero.

【００１３】さらには、既に付加されてしまった量子化
雑音のレベルが同定もしくは予想し得る場合には、この
レベルの周波数成分については他の成分とは異なる処理
を行うことは、付加済みの量子化雑音を効果的に除去す
る上で有効である。例えば他の成分よりも大きな減衰率
を与えるか、完全に除去してしまうことは有効である。
更に、以上のように処理した音響信号情報を前記スーパ
ービットマッピング処理することによりビット長を減ず
ることは、限られた語長で記録再生伝送等を行う場合、
聴感的な音質の劣化をできるだけ防ぐ上で有効である。
以上述べた様に本発明は聴覚的な方法で音響信号情報の
周波数成分をコントロールすることにより上述の課題を
解決する。Further, when the level of the already added quantization noise is identifiable or predictable, performing a process different from that of the other components on the frequency component of this level is performed by using the added quantum noise. This is effective in effectively removing the formation noise. For example, it is effective to provide a larger attenuation rate than other components or to completely remove the components.
Further, reducing the bit length by performing the super bit mapping process on the audio signal information processed as described above, when performing recording and reproduction transmission with a limited word length,
This is effective in preventing audible deterioration of sound quality as much as possible.
As described above, the present invention solves the above-described problem by controlling the frequency components of the acoustic signal information in an auditory manner.

【００１４】また、本発明の記録媒体は、上記信号変換
方法又は装置により処理されて得られた変換データが記
録されてなるものである。この記録媒体としは、光磁気
ディスク、又は光ディスク、又は半導体メモリ、又はＩ
Ｃメモリーカードなどを挙げることができる。Further, a recording medium of the present invention is a medium on which converted data obtained by processing by the above signal conversion method or apparatus is recorded. The recording medium may be a magneto-optical disk, an optical disk, a semiconductor memory,
C memory card and the like.

【００１５】[0015]

【作用】本発明によれば、聴覚的に裏付けのある臨界帯
域内の周波数成分間の調和関係をコントロールすること
で、音声及び音響信号の音質を人間にとって有益なよう
に調整することができる。また、マスキングスレッショ
ールド及び最小可聴限以下の周波数成分については変更
を加えないようにすることは、音質的に無関係な不必要
な処理をできるだけ行わず、接続歪みなど余計な副作用
を防ぐ上で有効である。さらに、コンパクトディスクに
記録されるディジタルサンプルデータが１６ビットの語
長の分解能しかないにもかかわらず、聴覚的な周波数成
分の変更とスーパービットマッピング処理を組み合わせ
て１６ビット音響信号情報を作りだしコンパクトディス
クなどに記録することは、既に量子化雑音が付加された
音響信号情報及び、聴覚的に望ましくない周波数成分を
含む音響信号情報をコンパクトディスク、ディジタルオ
ーディオテープ等に記録する上で有効である。According to the present invention, the sound quality of voice and acoustic signals can be adjusted to be useful to humans by controlling the harmonic relationship between frequency components within a critical band that is acoustically supported. In addition, not changing the masking threshold and the frequency components below the minimum audible limit should not perform unnecessary processing unrelated to sound quality as much as possible, and prevent unnecessary side effects such as connection distortion. It is valid. Furthermore, despite the fact that the digital sample data recorded on the compact disc has only a 16-bit word-length resolution, the combination of the change of the audible frequency component and the super bit mapping process creates 16-bit sound signal information. Recording on a compact disk, digital audio tape, or the like is effective for recording acoustic signal information to which quantization noise has already been added and acoustic signal information containing an acoustically undesirable frequency component.

【００１６】[0016]

【実施例】以下、本発明の実施例について図面を参照し
ながら説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００１７】本発明の信号変換方法が適用される本実施
例の信号変換装置は、図１に示すように、音響時間信号
情報を複数の周波数成分に変換する変換手段としての後
述する帯域分割フィルタ２，３，４及びＭＤＣＴ回路５
ａ，５ｂ，５ｃ，５ｄと、当該変換手段から得られた複
数の周波数成分のうち、少なくとも２箇の周波数成分が
異なる周波数分解能と時間分解能を持つ音響時間信号情
報から得られた周波数成分について、略臨界帯域内の他
の周波数成分との間で属性の大きさの違いを変える属性
変更手段としての後述する周波数成分変更回路６及びマ
スク回路１０，周波数移動ピーク検出回路１２，不協和
周波数検出回路１１，マスキングスレショールドカーブ
検出回路１６，最小可聴カーブ発生回路１７とを有する
ものである。As shown in FIG. 1, the signal conversion apparatus of the present embodiment to which the signal conversion method of the present invention is applied includes a band division filter (described later) as conversion means for converting acoustic time signal information into a plurality of frequency components. 2, 3, 4 and MDCT circuit 5
a, 5b, 5c, 5d, and a plurality of frequency components obtained from the conversion means, at least two frequency components of which are obtained from acoustic time signal information having different frequency resolutions and time resolutions, A frequency component changing circuit 6 and a mask circuit 10, a frequency shift peak detecting circuit 12, and a dissonance frequency detecting circuit, which will be described later, serve as attribute changing means for changing the difference in the size of the attribute from other frequency components in the substantially critical band. 11, a masking threshold curve detection circuit 16, and a minimum audible curve generation circuit 17.

【００１８】先ず、図１は、本発明に係る信号変換方法
を実現する本実施例の信号変換装置の一実施例の概略構
成を示すブロック回路図である。以下、図１の具体的な
構成について詳細に説明する。First, FIG. 1 is a block circuit diagram showing a schematic configuration of one embodiment of a signal conversion device of the present embodiment for realizing a signal conversion method according to the present invention. Hereinafter, the specific configuration of FIG. 1 will be described in detail.

【００１９】すなわち、本実施例の信号変換装置は、音
声若しくは音響信号情報（音響時間信号情報）等の入力
ディジタル信号を、複数の周波数帯域に分割すると共
に、最低域の隣接した２帯域の帯域幅は同じで、より高
い周波数帯域ではその内の高い周波数帯域ほどバンド幅
を広く選定し、各周波数帯域毎に直交変換を行って、得
られた周波数軸上のスペクトルデータから、周波数領域
の移動ピークカーブと周波数領域のマスキングカーブの
情報を求める。That is, the signal converter of this embodiment divides an input digital signal such as voice or acoustic signal information (acoustic time signal information) into a plurality of frequency bands, The width is the same, and the higher the frequency band, the wider the bandwidth is selected in the higher frequency band, the orthogonal transform is performed for each frequency band, and the frequency domain shift is performed based on the obtained spectrum data on the frequency axis. Information on the peak curve and the masking curve in the frequency domain is obtained.

【００２０】上記周波数領域の移動ピークカーブの情報
からは、周波数成分間の調和関係から周波数成分を変更
することにより好ましい音質の変化が期待できる周波数
帯域を得る。また、周波数領域のマスキングカーブの情
報からは周波数領域の移動ピークカーブの情報から求ま
った周波数成分を変更することにより好ましい音質の変
化が期待できる周波数帯域のうち、マスキングにより実
質的に音質変化が期待できない周波数領域を求めて、周
波数成分を変化させる周波数帯域から除外する。最小可
聴限を下回る周波数成分についても変更の対象から除外
する。このようにして求められた周波数成分を変化させ
る周波数帯域内の周波数成分の大きさを小さくするか又
は除去する。From the information on the moving peak curve in the frequency domain, a frequency band in which a desirable change in sound quality can be expected by changing the frequency component based on the harmonic relationship between the frequency components is obtained. In addition, from the information on the masking curve in the frequency domain, in a frequency band in which a preferable change in the sound quality can be expected by changing the frequency component obtained from the information on the moving peak curve in the frequency domain, a substantial change in the sound quality is expected by the masking. A frequency region that cannot be obtained is obtained and excluded from a frequency band in which a frequency component is changed. Frequency components below the minimum audible limit are also excluded from the change. The magnitude of the frequency component in the frequency band in which the frequency component thus obtained is changed is reduced or eliminated.

【００２１】次に、周波数成分を逆直交変換して時間信
号情報を得、全帯域を合成フィルタでまとめることで全
帯域時間信号情報を得る。さらに、量子化を行うにあっ
たては、２０ｋＨｚ以下の帯域内の量子化雑音スペクト
ルを聴感的に最適化するスーパービットマッピング処理
を行う。Next, time signal information is obtained by performing an inverse orthogonal transformation of the frequency component, and the entire band is obtained by synthesizing the entire band with a synthesis filter. Further, when performing quantization, a super bit mapping process for audibly optimizing a quantization noise spectrum within a band of 20 kHz or less is performed.

【００２２】より詳細に図１において説明すると、入力
端子１には、例えばサンプリング周波数が４４．１ｋＨ
ｚの時、０〜２２ｋＨｚのオーデイオＰＣＭ信号が供給
されている。この入力信号は、例えばいわゆる上記ＣＱ
Ｆ等の帯域分割フイルタ２により０〜１１ｋＨｚ帯域と
１１ｋ〜２２ｋＨｚ帯域とに分割され、０〜１１ｋＨｚ
帯域の信号は同じくＣＱＦフイルタ等の帯域分割フイル
タ３により０〜５．５ｋＨｚ帯域と５．５ｋ〜１１ｋＨ
ｚ帯域とに分割される。更に０〜５．５ｋＨｚ帯域の信
号は同じくＣＱＦ等の帯域分割フイルタ４により０〜
２．７５ｋＨｚ帯域と２．７５〜５．５ｋＨｚ帯域とに
分割される。More specifically, referring to FIG. 1, the input terminal 1 has, for example, a sampling frequency of 44.1 kHz.
At the time of z, an audio PCM signal of 0 to 22 kHz is supplied. This input signal is, for example, the so-called CQ
The band is divided into a band of 0 to 11 kHz and a band of 11 to 22 kHz by a band dividing filter 2 such as F.
The band signal is also converted to a band of 0 to 5.5 kHz and a band of 5.5 to 11 kHz by a band division filter 3 such as a CQF filter.
divided into z bands. Further, the signals in the 0 to 5.5 kHz band are also converted to 0 to 5.5 kHz by a band division filter 4 such as CQF.
It is divided into a 2.75 kHz band and a 2.75 to 5.5 kHz band.

【００２３】帯域分割フイルタ２からの１１ｋ〜２２ｋ
Ｈｚ帯域の信号は直交変換回路の一例であるＭＤＣＴ
（モディファイド離散コサイン変換）回路５ａに送ら
れ、帯域分割フイルタ３からの５．５ｋ〜１１ｋＨｚ帯
域の信号はＭＤＣＴ回路５ｂに送られ、帯域分割フイル
タ４からの２．７５ｋＨｚ〜５．５ｋＨｚ帯域の信号は
ＭＤＣＴ回路５ｃに送られ、帯域分割フイルタ４からの
０ｋＨｚ〜２．７５ｋＨｚ帯域の信号はＭＤＣＴ回路５
ｄに送られることにより、それぞれＭＤＣＴ処理され
る。もちろん、これら直交変換回路としては、上記ＭＤ
ＣＴ以外にも高速フーリエ変換（ＦＦＴ），離散コサイ
ン変換（ＤＣＴ）などの直交変換を用いることができ
る。11k to 22k from band division filter 2
The signal in the Hz band is an MDCT which is an example of an orthogonal transformation circuit.
(Modified Discrete Cosine Transform) The signal in the 5.5 kHz to 11 kHz band from the band division filter 3 is sent to the MDCT circuit 5 b, and the signal in the 2.75 kHz to 5.5 kHz band from the band division filter 4 is sent to the circuit 5 a. Is sent to the MDCT circuit 5c, and the signal in the 0 kHz to 2.75 kHz band from the band division filter 4 is transmitted to the MDCT circuit 5c.
d to be subjected to MDCT processing. Of course, these orthogonal transform circuits include the MD
In addition to CT, orthogonal transform such as fast Fourier transform (FFT) and discrete cosine transform (DCT) can be used.

【００２４】ここで、上述したような帯域分割フィルタ
による入力ディジタル信号を複数の周波数帯域に分割す
る手法としては、例えば、上記ＣＱＦなどのフィルタを
用いる手法があり、これは、例えば、 Mark J. T. Smit
h and Thomas P. Barnwell,"Exact Reconstruction Tec
hniques for Tree-Structured Subband Coders,"IEEE T
rans. ASSP, Vol ASSP-34 No 3, June 1986, pp. 434-4
41. に述べられている。また、1976 R.E.Crochiere Dig
ital coding of speech in subbands BellSyst.Tech.
J. Vol.55,No.8 1976 には、ＱＭＦなどのフィルタを用
いた手法が述べられている。更にICASSP 83,BOSTON Pol
yphase Quadrature filters-A newsubband coding tech
nique Joseph H. Rothweiler には等バンド幅のフィル
タ分割手法が述べられている。Here, as a method of dividing an input digital signal into a plurality of frequency bands by the above-described band division filter, there is a method of using a filter such as the above-described CQF.
h and Thomas P. Barnwell, "Exact Reconstruction Tec
hniques for Tree-Structured Subband Coders, "IEEE T
rans.ASSP, Vol ASSP-34 No 3, June 1986, pp. 434-4
41. Also, 1976 RECrochiere Dig
ital coding of speech in subbands BellSyst.Tech.
J. Vol. 55, No. 8 1976 describes a method using a filter such as QMF. ICASSP 83, BOSTON Pol
yphase Quadrature filters-A newsubband coding tech
nique Joseph H. Rothweiler describes an equal-bandwidth filter partitioning method.

【００２５】また、上述した直交変換としては、例え
ば、入力オーディオ信号を所定単位時間（フレーム）で
ブロック化し、当該ブロック毎に例えば高速フーリエ変
換（ＦＦＴ）、コサイン変換（ＤＣＴ）、モディファイ
ドＤＣＴ変換（ＭＤＣＴ）等を行うことで、時間軸を周
波数軸に変換するような直交変換がある。上記ＭＤＣＴ
についてはICASSP 1987Subband/Transform Coding Usin
g Filter Bank DesignsBased on Time Domain Aliasing
Cancellation J.P.Princen A.B.Bradley Univ.of Surr
ey Royal Melbourne Inst.of Tech.に述べられている。As the above-described orthogonal transform, for example, an input audio signal is divided into blocks in a predetermined unit time (frame), and for each block, for example, a fast Fourier transform (FFT), a cosine transform (DCT), a modified DCT transform ( For example, there is an orthogonal transformation that transforms a time axis into a frequency axis by performing (MDCT) or the like. MDCT above
About ICASSP 1987 Subband / Transform Coding Usin
g Filter Bank DesignsBased on Time Domain Aliasing
Cancellation JPPrincen ABBradley Univ. Of Surr
ey Royal Melbourne Inst. of Tech.

【００２６】ここで、上記各ＭＤＣＴ回路５ａ、５ｂ、
５ｃ、５ｄに供給する各帯域毎のブロックについての標
準的な入力信号に対する具体例を図２に示す。Here, each of the MDCT circuits 5a, 5b,
FIG. 2 shows a specific example of a standard input signal for a block for each band supplied to 5c and 5d.

【００２７】この図２の具体例において、上述した４つ
のフイルタ出力信号は、各帯域ごとに別々の直交変換ブ
ロックサイズを持ち、それぞれの周波数での臨界帯域幅
を充分満足するような周波数分析を行う。これにより周
波数が高くなるほど周波数分解能は低くなるが、その代
わりに時間分解能が向上する。本実施例では、周波数分
解は臨界帯域をそれぞれ略１０分割する程度に選んでい
る。このことにより臨界帯域内の周波数成分の大きさの
コントロールが臨界帯域内周波数をかなり自由に限定し
て行うことができる様にしている。In the specific example shown in FIG. 2, the four filter output signals described above have different orthogonal transform block sizes for each band, and are subjected to frequency analysis that sufficiently satisfies the critical bandwidth at each frequency. Do. Thus, the higher the frequency, the lower the frequency resolution, but instead, the higher the time resolution. In the present embodiment, the frequency decomposition is selected so that the critical band is divided into approximately ten parts. Thus, the control of the magnitude of the frequency component in the critical band can be performed with the frequency in the critical band being restricted quite freely.

【００２８】すなわち、本実施例では、０Ｈｚから２．
７５ｋＨｚまでの帯域は、直交変換の時間ブロックサイ
ズを４６．４ｍｓｅｃとして、この帯域の最も狭い臨界
帯域幅１００Ｈｚの１０分の１の概略１０Ｈｚの周波数
分解能が得られるようにしている。同様にして、２．７
５ｋＨｚから５．５ｋＨｚ帯域は１１．６ｍｓｅｃの直
交変換の時間ブロックサイズを用いて４０Ｈｚの周波数
分解能を、５．５ｋＨｚから１１ｋＨｚ帯域は５．８ｍ
ｓｅｃの直交変換の時間ブロックサイズを用いて８０Ｈ
ｚの周波数分解能を、１１ｋＨｚから２２ｋＨｚ帯域は
２．９ｍｓｅｃの直交変換の時間ブロックサイズを用い
て１６０Ｈｚの周波数分解能を得ている。なお、１１ｋ
Ｈｚにおける臨界帯域幅は概略３ｋＨｚであるから、更
に直交変換ブロックサイズを半分にして３２０Ｈｚの周
波数分解能とすることは時間分解能を更に上げるうえで
有効である。表１には臨界帯域の中心周波数と帯域幅を
示している。That is, in the present embodiment, from 0 Hz to 2.
For the band up to 75 kHz, the time block size of the orthogonal transform is set to 46.4 msec so that a frequency resolution of approximately 10 Hz, which is one tenth of the narrowest critical bandwidth of 100 Hz, is obtained. Similarly, 2.7
The 5 kHz to 5.5 kHz band has a frequency resolution of 40 Hz using the orthogonal block time block size of 11.6 msec, and the 5.5 kHz to 11 kHz band has 5.8 m.
80H using the time block size of the orthogonal transform in sec
As for the frequency resolution of z, the frequency resolution of 160 Hz is obtained using the time block size of the orthogonal transform of 2.9 msec in the 11 kHz to 22 kHz band. In addition, 11k
Since the critical bandwidth in Hz is approximately 3 kHz, it is effective to further reduce the orthogonal transform block size to a frequency resolution of 320 Hz to further increase the time resolution. Table 1 shows the center frequency and the bandwidth of the critical band.

【００２９】[0029]

【表１】 [Table 1]

【００３０】再び図１に戻って、各ＭＤＣＴ回路５ａ，
５ｂ，５ｃ，５ｄにてＭＤＣＴ処理されて得られた周波
数成分或いはＭＤＣＴ係数データは、ローカルピーク周
波数成分と不協和の関係を持つ周波数成分の存在する周
波数領域を確定する周波数移動ピーク検出回路１２及び
不協和周波数検出回路１１と、マスキングスレショール
ドカーブを求めるマスキングスレショールドカーブ検出
回路１６に供給される。Referring back to FIG. 1, each MDCT circuit 5a,
The frequency component or MDCT coefficient data obtained by performing the MDCT processing in 5b, 5c, and 5d includes a frequency shift peak detecting circuit 12 that determines a frequency region in which a frequency component having a dissonance relation with the local peak frequency component exists; The signal is supplied to a dissonance frequency detection circuit 11 and a masking threshold curve detection circuit 16 for obtaining a masking threshold curve.

【００３１】ここで、上記周波数移動ピーク検出回路１
２の動作を以下に説明する。図３においては、判り易い
ように、３個の隣接周波数成分に関する移動ピーク値の
取り方を説明している。Here, the frequency shift peak detecting circuit 1
Operation 2 will be described below. FIG. 3 illustrates how to obtain moving peak values for three adjacent frequency components for easy understanding.

【００３２】先ず、成分ｓ１を中心とした移動ピーク値
は、当該成分ｓ１とその両隣の成分を含めた各成分ｓ
０，ｓ１，ｓ２の中の最大の大きさを持つ成分の大きさ
で移動ピーク値が定義される。次に、成分ｓ２を中心と
した移動ピーク値は、当該成分ｓ２とその両隣の成分を
含めた各成分ｓ１，ｓ２，ｓ３の中の最大の大きさを持
つ成分の大きさで移動ピーク値が定義される。このよう
にして次々にピーク値を求めて行くことにより、移動ピ
ークカーブが得られる。First, the moving peak value centered on the component s1 is calculated as the component s1 including the component s1 and its neighboring components.
The moving peak value is defined by the magnitude of the component having the largest magnitude among 0, s1, and s2. Next, the moving peak value around the component s2 is determined by the magnitude of the component having the largest magnitude among the components s1, s2, and s3 including the component s2 and its neighboring components. Defined. The moving peak curve is obtained by successively obtaining the peak values in this manner.

【００３３】図３では判り易いように、周波数成分は全
て同じ帯域幅を持ち、且つ移動ピークを求めるときの周
波数幅も等しくして図示してあるが、本実施例では、図
４に示すように、高域になるに従い、周波数成分の持つ
帯域幅は広がり且つその周波数での臨界帯域幅の１０％
若しくは５０％幅の周波数幅での移動ピーク値が求めら
れる。なお、図４において、図中ＢＥ１〜ＢＥ４はそれ
ぞれ帯域を示し、図中曲線Ｐ₁₀は臨界帯域幅の１０％幅
の移動ピークカーブを、曲線Ｐ₅₀は臨界帯域幅の５０％
幅の移動ピークカーブを示し、曲線ＳＤは周波数成分分
布を、ＣＢは各周波数における臨界帯域幅を示してい
る。ここでもしもピーク値が重複して定義された周波数
帯域ではピーク値の大きいほうが選ばれる。In FIG. 3, for the sake of clarity, all the frequency components have the same bandwidth, and the frequency width for obtaining the moving peak is also shown as being equal. In the present embodiment, however, as shown in FIG. In addition, as the frequency becomes higher, the bandwidth of the frequency component expands and becomes 10% of the critical bandwidth at that frequency.
Alternatively, a moving peak value in a frequency width of 50% width is obtained. In FIG. 4, reference numeral BE1~BE4 shows the band, respectively, the movement peak curve of 10% the width of the curve P ₁₀ is critical bandwidth in the figure, the curve P ₅₀ 50% of the critical bandwidth
A moving peak curve of the width is shown, a curve SD shows a frequency component distribution, and CB shows a critical bandwidth at each frequency. Here, in a frequency band in which the peak values are defined in an overlapping manner, the higher peak value is selected.

【００３４】なお、上記臨界帯域幅は、協和性、雑音の
大きさの感覚、マスキング特性など人間の聴覚特性を良
く理解できる物理量であり、本発明に関しては協和性に
ついての説明を図５を用いて説明する。図５は２つの周
波数成分の周波数差が、横軸（臨界帯域幅で正規化され
た周波数を示す軸）に示された周波数だけあるとき、こ
の２つの周波数成分がどの程度の協和性もしくは不協和
性を示すかを縦軸に表している。この結果によれば、２
つの周波数成分の周波数差が、臨界帯域幅の１０％から
５０％までの間（不協和音帯域ＮＨＢ）では不協和の感
覚が生じ（不協和音レベルＮＨＬ）、０％から１０％及
び５０％から１００％の周波数差（協和音帯域ＨＢ）で
は協和の感覚が生じる（協和音レベルＨＬ）。なお、こ
の臨界帯域幅は、前記表１のように高域ほど帯域幅が広
くなっている。Note that the critical bandwidth is a physical quantity that enables human hearing characteristics such as consonance, noise sensation, and masking characteristics to be well understood. For the present invention, FIG. Will be explained. FIG. 5 shows that when the frequency difference between the two frequency components is only the frequency indicated on the horizontal axis (the axis indicating the frequency normalized by the critical bandwidth), the degree of coordination or imbalance between the two frequency components The vertical axis indicates whether or not concordance is exhibited. According to this result, 2
When the frequency difference between the two frequency components is between 10% and 50% of the critical bandwidth (dissonance band NHB), a sense of dissonance occurs (dissonance level NHL), between 0% and 10% and between 50% and 100%. In the frequency difference (consonant band HB), a sense of consonance occurs (consonant level HL). As shown in Table 1, the higher the critical bandwidth, the wider the critical bandwidth.

【００３５】次に不協和帯域を検出する具体的手段を図
６を用いて説明する。図１における各ＭＤＣＴ回路５
ａ，５ｂ，５ｃ，５ｄにてＭＤＣＴ処理されて得られた
周波数成分或いはＭＤＣＴ係数データは、絶対値を取ら
れた後、図６に示す不協和帯域検出手段としての不協和
周波数検出回路１１の入力端子４１に与えられる。ここ
で、より長い時間幅を持つ低域側特性は、各高域時間に
共通に使用される。Next, specific means for detecting a dissonance band will be described with reference to FIG. Each MDCT circuit 5 in FIG.
The frequency components or MDCT coefficient data obtained by performing the MDCT processing at a, 5b, 5c, and 5d are subjected to an absolute value, and then are processed by the dissonance frequency detection circuit 11 shown in FIG. It is provided to an input terminal 41. Here, the low-frequency characteristic having a longer time width is commonly used for each high-frequency time.

【００３６】上記入力端子４１に与えられた周波数成分
から２つの異なった周波数幅を持った移動ピーク特性が
得られる。すなわち、臨界帯域幅の１０％幅の移動ピー
ク値を与える臨界帯域幅の１０％幅の移動ピーク検出回
路４２と臨界帯域幅の５０％幅の移動ピーク値を与える
臨界帯域幅の５０％幅の移動ピーク検出回路４３によっ
て２つの異なった周波数幅を持った移動ピーク特性が得
られる。From the frequency components applied to the input terminal 41, moving peak characteristics having two different frequency widths can be obtained. That is, the moving peak detecting circuit 42 having a width of 10% of the critical bandwidth giving a moving peak value having a width of 10% of the critical bandwidth and the moving peak detecting circuit 42 having a width of 50% of the critical bandwidth giving a moving peak value having a width of 50% of the critical bandwidth. The moving peak detection circuit 43 provides moving peak characteristics having two different frequency widths.

【００３７】これら臨界帯域幅の１０％幅の移動ピーク
値を与える臨界帯域幅の１０％幅の移動ピーク検出回路
４２と臨界帯域幅の５０％幅の移動ピーク値を与える臨
界帯域幅の５０％幅の移動ピーク検出回路４３で得られ
た移動ピークカーブは、その差を差検出回路４４によっ
て求められ、出力端子４５から取り出される。A moving peak detecting circuit 42 having a width of 10% of the critical bandwidth giving a moving peak value of 10% of the critical bandwidth, and a 50% of a critical bandwidth giving a moving peak value of 50% of the critical bandwidth. The difference of the moving peak curve obtained by the moving peak detecting circuit 43 of the width is obtained by the difference detecting circuit 44, and is taken out from the output terminal 45.

【００３８】このようにして求められた移動ピーク値の
差が、あるスレッショールドを越える周波数領域を不協
和周波数領域と定義する。A frequency region in which the difference between the moving peak values thus obtained exceeds a certain threshold is defined as a dissonant frequency region.

【００３９】しかしながら、その他の聴覚的効果すなわ
ちマスキング効果，等ラウドネス，最小可聴限を考える
とき、以上のようにして求められた不協和周波数領域に
含まれる周波数成分全てを操作の対象とする必要はな
い。すなわち、マスキング効果、等ラウドネス、最小可
聴限を考慮したときに、聴覚的に聞こえることがないと
判断される周波数成分は操作の対象から外してもほとん
ど影響がなく、また、等ラウドネスを考えたときに、効
果的である帯域のみを操作の対象とすることは演算量の
減少に役立つ。However, when considering other auditory effects, ie, masking effect, equal loudness, and minimum audibility, it is necessary to operate all the frequency components included in the dissonance frequency region obtained as described above. Absent. That is, when considering the masking effect, equal loudness, and minimum audibility, frequency components determined to be inaudible are hardly affected even when removed from the operation target, and the equal loudness is considered. At times, it is useful to reduce the amount of calculation by operating only the effective band.

【００４０】図１におけるマスク機能を有するマスク回
路１０、マスキングカーブ算出機能を有するマスキング
スレショールドカーブ検出回路１６、最小可聴限情報を
記憶する最小可聴カーブ発生回路１７は、以上説明した
様に、マスキング効果、最小可聴限を考慮したときに、
聴覚的に聞こえることがないと判断される周波数成分を
操作の対象から外す為に用いられる。As described above, the mask circuit 10 having the mask function, the masking threshold curve detection circuit 16 having the masking curve calculation function, and the minimum audible curve generating circuit 17 for storing the minimum audible information are shown in FIG. Considering the masking effect and minimum audibility,
It is used to exclude a frequency component determined not to be audible from the operation target.

【００４１】以下、より詳細に上記マスク回路１０での
マスク機能と、マスキングスレショールドカーブ検出回
路１６でのマスキングカーブ算出機能と、最小可聴カー
ブ発生回路１７での最小可聴限記憶機能につき説明す
る。Hereinafter, the mask function in the mask circuit 10, the masking curve calculation function in the masking threshold curve detection circuit 16, and the minimum audible limit storage function in the minimum audible curve generation circuit 17 will be described in more detail. .

【００４２】図７は上記マスキングスレショールドカー
ブ検出回路１６でのマスキングカーブ算出機能の一具体
例の概略構成を示すブロック回路図である。この図７に
おいて、入力端子７１には、図１における各ＭＤＣＴ回
路５ａ，５ｂ，５ｃ，５ｄからの周波数成分データが供
給されている。FIG. 7 is a block circuit diagram showing a schematic configuration of a specific example of a masking curve calculation function in the masking threshold curve detection circuit 16. In FIG. 7, an input terminal 71 is supplied with frequency component data from each of the MDCT circuits 5a, 5b, 5c and 5d in FIG.

【００４３】この周波数軸上の入力データは、臨界帯域
毎のエネルギ算出回路７２に送られて、ここで各臨界帯
域のエネルギが、各臨界帯域内の周波数成分の各振幅値
の総和を計算することにより求められる。この各臨界地
域毎のエネルギの代わりに、振幅値のピーク値、平均値
等が用いられることもある。このエネルギ算出回路７２
からの出力として、例えば各バンドの総和値のスペクト
ルを図８に図中ＳＢとして示している。ただし、この図
８では、図示を簡略化するため、分割帯域数を１２バン
ド（Ｂ1 〜Ｂ12）で表現している。The input data on the frequency axis is sent to the energy calculation circuit 72 for each critical band, where the energy of each critical band calculates the sum of the amplitude values of the frequency components in each critical band. It is required by Instead of the energy for each critical area, a peak value or an average value of the amplitude value may be used. This energy calculation circuit 72
For example, the spectrum of the sum value of each band is shown as SB in FIG. However, in FIG. 8, the number of divided bands is represented by 12 bands (B1 to B12) to simplify the illustration.

【００４４】ここで、上記スペクトルＳＢのいわゆるマ
スキングに於ける影響を考慮するために、該スペクトル
ＳＢに所定の重み付け関数を掛けて加算するような畳込
み（コンボリユーション）処理を施す。このため、上記
帯域毎のエネルギ算出回路７２の出力すなわち該スペク
トルＳＢの各値は、畳込みフイルタ回路７３に送られ
る。該畳込みフイルタ回路７３は、例えば、入力データ
を順次遅延させる複数の遅延素子と、これら遅延素子か
らの出力にフイルタ係数（重み付け関数）を乗算する複
数の乗算器（例えば各バンドに対応する２５個の乗算
器）と、各乗算器出力の総和をとる総和加算器とから構
成されるものである。この畳込み処理により、例えば図
８のＢ６で示されるバンドのスペクトルＳＢに対しては
図８の図中点線で示す部分の総和がとられる。なお、上
記マスキングとは、人間の聴覚上の特性により、ある信
号によって他の信号がマスクされて聞こえなくなる現象
をいうものであり、このマスキング効果には、時間軸上
のオーデイオ信号による継時マスキング効果と、周波数
軸上の信号による同時刻マスキング効果とがある。これ
らのマスキング効果により、マスキングされる部分に信
号情報もしくはノイズがあったとしても、これらは聞こ
えないことになる。このため、実際のオーデイオ信号で
は、このマスキングされる範囲内の信号情報及びノイズ
は操作対象とする必要がない。Here, in order to consider the influence on the so-called masking of the spectrum SB, a convolution (convolution) process is performed in which the spectrum SB is multiplied by a predetermined weighting function and added. Therefore, the output of the energy calculation circuit 72 for each band, that is, each value of the spectrum SB, is sent to the convolution filter circuit 73. The convolution filter circuit 73 includes, for example, a plurality of delay elements for sequentially delaying input data and a plurality of multipliers (for example, 25 corresponding to each band) for multiplying an output from these delay elements by a filter coefficient (weighting function). Multipliers) and a sum adder for summing the outputs of the multipliers. By this convolution process, for example, the sum of the portions indicated by the dotted lines in FIG. 8 is obtained for the spectrum SB of the band indicated by B6 in FIG. The above-mentioned masking is a phenomenon in which a certain signal masks another signal and becomes inaudible due to human auditory characteristics. The masking effect includes successive masking by an audio signal on a time axis. There is an effect and a simultaneous masking effect by a signal on the frequency axis. Due to these masking effects, even if there is signal information or noise in the masked portion, they will not be heard. Therefore, in the actual audio signal, the signal information and the noise within the masked range need not be operated.

【００４５】なお、上記畳込みフイルタ回路７３の各乗
算器の乗算係数（フイルタ係数）の一具体例を示すと、
任意のバンドに対応する乗算器Ｍの係数を１とすると
き、乗算器Ｍ−１で係数０．１５を、乗算器Ｍ−２で係
数０．００１９を、乗算器Ｍ−３で係数０．０００００
８６を、乗算器Ｍ＋１で係数０．４を、乗算器Ｍ＋２で
係数０．０６を、乗算器Ｍ＋３で係数０．００７を各遅
延素子の出力に乗算することにより、上記スペクトルＳ
Ｂの畳込み処理が行われる。ただし、Ｍは１〜２５の任
意の整数である。A specific example of the multiplication coefficient (filter coefficient) of each multiplier of the convolution filter circuit 73 is shown below.
Assuming that the coefficient of the multiplier M corresponding to an arbitrary band is 1, the multiplier M-1 has a coefficient of 0.15, the multiplier M-2 has a coefficient of 0.0019, and the multiplier M-3 has a coefficient of 0. 00000
86, the multiplier M + 1 multiplies the coefficient 0.4, the multiplier M + 2 multiplies the coefficient 0.06, and the multiplier M + 3 multiplies the coefficient 0.007 by the output of each delay element.
B convolution processing is performed. Here, M is an arbitrary integer of 1 to 25.

【００４６】次に、上記畳込みフイルタ回路７３の出力
は引算器７４に送られる。該引算器７４は、上記畳込ん
だ領域での後述する操作対象から外すことが可能な信号
情報もしくはノイズレベルに対応するレベルαを求める
ものである。なお、当該操作対象から外すことが可能な
信号情報もしくはノイズレベルに対応するレベルαは、
後述するように、逆コンボリユーション処理を行うこと
によって、クリテイカルバンド（臨界帯域幅）の各バン
ド毎の操作対象から外すことが可能な信号情報もしくは
ノイズレベルとなるようなレベルである。ここで、上記
引算器７４には、上記レベルαを求めるための許容関数
（マスキングレベルを表現する関数）が供給される。こ
の許容関数を増減させることで上記レベルαの制御を行
っている。当該許容関数は、次に説明するような（ｎ−
ａｉ）関数発生回路７５から供給されているものであ
る。Next, the output of the convolution filter circuit 73 is sent to a subtractor 74. The subtractor 74 is for obtaining a level α corresponding to signal information or a noise level which can be excluded from an operation target described later in the convolved area. The level α corresponding to signal information or noise level that can be excluded from the operation target is
As will be described later, by performing the inverse convolution process, the level is such that the signal information or the noise level can be excluded from the operation target for each band of the critical band (critical bandwidth). Here, an allowance function (a function expressing a masking level) for obtaining the level α is supplied to the subtractor 74. The level α is controlled by increasing or decreasing the allowable function. The permissible function is (n−
ai) It is supplied from the function generating circuit 75.

【００４７】すなわち、操作対象から外すことが可能な
信号情報もしくはノイズレベルに対応するレベルαは、
臨界帯域の帯域の低域から順に与えられる番号をｉとす
ると、次の（１）式で求めることができる。 α＝Ｓ−（ｎ−ａｉ）・・・（１）この（１）式において、ｎ，ａは定数でａ＞０、Ｓは畳
込み処理されたバークスペクトルの強度であり、（１）
式中(n-ai)が許容関数となる。本実施例ではｎ＝３８，
ａ＝１としている。That is, the level α corresponding to the signal information or noise level that can be excluded from the operation target is
Assuming that the number given in order from the lower band of the critical band is i, it can be obtained by the following equation (1). α = S− (n−ai) (1) In the equation (1), n and a are constants, a> 0, S is the intensity of the convolution-processed bark spectrum, and (1)
In the equation, (n-ai) is an allowable function. In this embodiment, n = 38,
a = 1.

【００４８】このようにして、上記レベルαが求めら
れ、このデータは、割算器７６に伝送される。当該割算
器７６では、上記畳込みされた領域での上記レベルαを
逆コンボリユーションするためのものである。したがっ
て、この逆コンボリユーション処理を行うことにより、
上記レベルαからマスキングスペクトルが得られるよう
になる。すなわち、このマスキングスペクトルが、操作
対象から外すことが可能な信号情報もしくはノイズスペ
クトルとなる。Thus, the level α is obtained, and this data is transmitted to the divider 76. In the divider 76, the level α in the convolved region is inversely convoluted. Therefore, by performing this inverse convolution processing,
A masking spectrum can be obtained from the level α. That is, this masking spectrum becomes signal information or a noise spectrum that can be excluded from the operation target.

【００４９】なお、上記逆コンボリユーション処理は、
複雑な演算を必要とするが、本実施例では簡略化した割
算器７６を用いて逆コンボリユーションを行っている。The above inverse convolution processing is
Although complicated operations are required, in this embodiment, inverse convolution is performed using a simplified divider 76.

【００５０】次に、上記マスキングスペクトルは、合成
回路７７を介して減算器７８に伝送される。ここで、当
該減算器７８には、上記臨界帯域毎のエネルギ検出回路
７２からの出力、すなわち前述したスペクトルＳＢが、
遅延回路７９を介して供給されている。したがって、こ
の減算器７８で上記マスキングスペクトルとスペクトル
ＳＢとの減算演算が行われることで、図９に示すよう
に、上記スペクトルＳＢは、該マスキングスペクトルＭ
Ｓのレベルで示すレベル以下がマスキングされることに
なる。Next, the masking spectrum is transmitted to a subtractor 78 via a synthesis circuit 77. Here, the output from the energy detection circuit 72 for each critical band, that is, the aforementioned spectrum SB is
It is supplied via a delay circuit 79. Therefore, the subtraction operation of the masking spectrum and the spectrum SB is performed by the subtracter 78, as shown in FIG.
The level below the level indicated by the level S is masked.

【００５１】当該減算器７８からの出力は、操作対象か
ら外すことが可能な信号情報若しくはノイズレベル補正
回路（図示は省略している）を介し、出力端子８１を介
して取り出され、上記マスク回路１０に送られて、ここ
で不協和周波数領域のうちで操作対象から外すことが可
能な周波数領域を除外する。The output from the subtracter 78 is taken out via an output terminal 81 via signal information or a noise level correction circuit (not shown) which can be excluded from the operation object, and is output from the mask circuit. The frequency range that can be excluded from the operation target is excluded from the discordant frequency range.

【００５２】なお、遅延回路７９は上記合成回路７７以
前の各回路での遅延量を考慮してエネルギ検出回路７２
からのスペクトルＳＢを遅延させるために設けられてい
る。The delay circuit 79 is provided with an energy detection circuit 72 in consideration of the amount of delay in each circuit before the synthesis circuit 77.
Is provided to delay the spectrum SB from.

【００５３】ところで、上述した合成回路７７での合成
の際には、最小可聴カーブ発生回路１７から供給される
図１０に示すような人間の聴覚特性であるいわゆる最小
可聴限カーブＲＣを示すデータと、上記マスキングスペ
クトルＭＳとを合成することができる。この最小可聴カ
ーブにおいて、信号もしくは雑音絶対レベルがこの最小
可聴限カーブ以下ならば該信号及び雑音は聞こえないこ
とになる。この最小可聴限カーブは、例えば再生時の再
生ボリュームの違いで異なるものとなるが、現実的なデ
ィジタルシステムでは、例えば１６ビットダイナミック
レンジへの音楽のはいり方にはさほど違いがないので、
例えば４ｋＨｚ付近の最も耳に聞こえやすい周波数帯域
の量子化雑音が聞こえないとすれば、他の周波数帯域で
はこの最小可聴限カーブのレベル以下の量子化雑音は聞
こえないと考えられる。したがって、このように例えば
システムの持つワードレングスの４ｋＨｚ付近の雑音が
聞こえない使い方をすると仮定し、この最小可聴限カー
ブＲＣとマスキングスペクトルＭＳとを共に合成するこ
とで、操作対象から外すことが可能な信号情報もしくは
ノイズレベルを得るようにすると、この場合の操作対象
から外すことが可能な信号情報もしくはノイズレベル
は、図１０中の斜線で示す部分までとすることができる
ようになる。By the way, at the time of synthesizing by the synthesizing circuit 77 described above, data indicating a so-called minimum audible curve RC which is a human auditory characteristic as shown in FIG. , And the masking spectrum MS. At this minimum audible curve, if the absolute signal or noise level is below this minimum audible curve, the signal and noise will not be heard. This minimum audible curve differs depending on, for example, the difference in the playback volume at the time of playback, but in a realistic digital system, for example, there is not much difference in how to enter music into the 16-bit dynamic range.
For example, if quantization noise in the most audible frequency band around 4 kHz is not heard, it is considered that quantization noise below the level of the minimum audible curve is not heard in other frequency bands. Therefore, for example, assuming that the system is used so that noise around 4 kHz of the word length of the system cannot be heard, it is possible to remove the minimum audible curve RC and the masking spectrum MS from the operation target by combining them together. If the appropriate signal information or noise level is obtained, the signal information or noise level that can be excluded from the operation target in this case can be up to the shaded portion in FIG.

【００５４】なお、本実施例では、上記最小可聴限カー
ブの４ｋＨｚのレベルを、例えば２０ビット相当の最低
レベルに合わせている。また、この図１０は、信号スペ
クトルＳＳも同時に示している。In this embodiment, the 4 kHz level of the minimum audible curve is adjusted to the lowest level corresponding to, for example, 20 bits. FIG. 10 also shows the signal spectrum SS.

【００５５】また、別の操作対象周波数成分の限定方法
としては、入力ディジタル信号情報に含まれている量子
化雑音のレベルにより、限定する場合がある。量子化雑
音レベルはスペクトルがほぼ白色の場合、語長によりほ
ぼ決定されるからこのレベル範囲の周波数成分を限定的
に操作対象とすることによって効果的に量子化雑音のう
ち、不協和を引き起こす成分を低減もしくは除去するこ
とができる。図１においては、量子化雑音レベル記憶機
能に、量子化雑音レベルを記憶させておくことにより、
このレベルの範囲に操作対象とする周波数成分を限定す
る。もちろんこの量子化レベルは最適になるように調整
をしてもよい。As another method of limiting the frequency component to be operated, there is a case where the frequency component is limited by the level of the quantization noise included in the input digital signal information. When the spectrum is almost white, the quantization noise level is almost determined by the word length. Therefore, by restricting the frequency components in this level range to the operation target, the component of the quantization noise that causes dissonance is effectively generated. Can be reduced or eliminated. In FIG. 1, by storing the quantization noise level in the quantization noise level storage function,
The frequency components to be operated are limited to this level range. Of course, this quantization level may be adjusted to be optimal.

【００５６】また、上記操作対象から外すことが可能な
信号情報もしくはノイズレベル補正回路では、図示を省
略する補正情報出力回路から送られてくる例えば等ラウ
ドネスカーブの情報に基づいて、上記減算器７８からの
出力における操作対象から外すことが可能な信号情報も
しくはノイズレベルを補正している。ここで、等ラウド
ネスカーブとは、人間の聴覚特性に関する特性曲線であ
り、例えば１ｋＨｚの純音と同じ大きさに聞こえる各周
波数での音の音圧を求めて曲線で結んだもので、ラウド
ネスの等感度曲線とも呼ばれる。またこの等ラウドネス
曲線は、図１０に示した最小可聴カーブＲＣと略同じ曲
線を描くものである。この等ラウドネス曲線において
は、例えば４ｋＨｚ付近では１ｋＨｚのところより音圧
が８〜１０ｄＢ下がっても１ｋＨｚと同じ大きさに聞こ
え、逆に、１０ｋＨｚ付近では１ｋＨｚでの音圧よりも
約１５ｄＢ高くないと同じ大きさに聞こえない。このた
め、上記最小可聴カーブのレベルを越えた信号もしくは
雑音は、該等ラウドネス曲線に応じたカーブで与えられ
る周波数特性でその大きさを評価されるのが良いことが
わかる。このようなことから、上記等ラウドネス曲線を
考慮して、演算量を削減するために操作対象から外すこ
とが可能な信号情報もしくはノイズを選定することは、
人間の聴覚特性に適合していることがわかる。In the signal information or noise level correction circuit which can be excluded from the operation target, the subtracter 78 is provided based on, for example, information on an equal loudness curve sent from a correction information output circuit not shown. The signal information or the noise level that can be excluded from the operation target in the output from is corrected. Here, the equal loudness curve is a characteristic curve relating to human auditory characteristics. For example, the loudness curve is obtained by calculating the sound pressure of sound at each frequency that sounds as loud as a pure tone of 1 kHz, and is connected by a curve. Also called a sensitivity curve. Further, this equal loudness curve draws substantially the same curve as the minimum audible curve RC shown in FIG. In this equal loudness curve, for example, at around 4 kHz, even if the sound pressure falls by 8 to 10 dB from the place of 1 kHz, it sounds as large as 1 kHz. It doesn't sound the same size. For this reason, it can be seen that the magnitude of a signal or noise exceeding the level of the minimum audible curve should be evaluated by a frequency characteristic given by a curve corresponding to the equal loudness curve. For this reason, in consideration of the above equal loudness curve, selecting signal information or noise that can be excluded from the operation target in order to reduce the amount of calculation is:
It can be seen that it is compatible with human hearing characteristics.

【００５７】図１に戻って、上記マスク回路１０は、以
上に説明した聴覚的効果を用いて、不必要な周波数帯域
での周波数成分の変更を行わないようにする。このマス
ク回路１０は出力としてローカルピーク成分との間で不
協和な関係を持つ周波数成分のうち聴覚的な音質向上に
効果的な操作が得られる成分情報を出す。図１の周波数
成分変更回路６は、この情報を基にして対象となる周波
数成分の大きさを変更する。Returning to FIG. 1, the mask circuit 10 uses the above-described auditory effect to prevent the frequency components from being changed in unnecessary frequency bands. The mask circuit 10 outputs, as an output, component information of the frequency components having a dissonant relationship with the local peak component that can be effectively operated to improve the auditory sound quality. The frequency component changing circuit 6 in FIG. 1 changes the size of the target frequency component based on this information.

【００５８】図１１には、上記周波数成分変更回路６に
おいて周波数成分の大きさを変更する様子を示してい
る。この図１１において、図中Band１〜Band４はマスク
回路１０により指定された周波数成分の大きさを変更す
る周波数領域であり、その変更の程度は各バンドの中央
部ほど大きくなっている。これは前記図５に示された不
協和度が周波数差により異なることを利用したものであ
る。また、図中Ｓｐ１〜Ｓｐ４は、各ローカルピークス
ペクトルの位置の利得を表しており、不協和周波数帯域
の周波数成分が小さくなったことにより、全体のエネル
ギが減少することを補償するためにこの周波数位置のス
ペクトルの大きさを大きくすることを示している。FIG. 11 shows how the frequency component changing circuit 6 changes the magnitude of the frequency component. In FIG. 11, Band1 to Band4 in the figure are frequency regions in which the magnitude of the frequency component specified by the mask circuit 10 is changed, and the degree of the change is larger in the center of each band. This utilizes the fact that the degree of dissonance shown in FIG. 5 differs depending on the frequency difference. In the drawing, Sp1 to Sp4 represent gains at the positions of the respective local peak spectra. In order to compensate for a decrease in the total energy due to a decrease in the frequency component of the dissonant frequency band, the frequencies are used. This indicates that the magnitude of the spectrum at the position is increased.

【００５９】このようにして周波数成分の大きさを変更
した周波数成分変更回路６の出力は、前記ＭＤＣＴの逆
変換を行うＩＭＤＣＴ回路９ａ，９ｂ，９ｃ，９ｄによ
って、周波数軸上から時間軸上へと変換される。これら
ＩＭＤＣＴ回路９ａ，９ｂ，９ｃ，９ｄからのＩＭＤＣ
Ｔ出力信号は、前記ＣＱＦとは逆の周波数合成（ＩＣＱ
Ｆ）機能を有する帯域合成フィルタ１３，１４，１５に
より周波数合成され全帯域時間信号となる。The output of the frequency component changing circuit 6 in which the magnitude of the frequency component is changed in this way is shifted from the frequency axis to the time axis by the IMDCT circuits 9a, 9b, 9c and 9d which perform the inverse transform of the MDCT. Is converted to IMDC from these IMDCT circuits 9a, 9b, 9c, 9d
The T output signal is frequency-combined (ICQ
F) The signals are frequency-synthesized by the band synthesizing filters 13, 14, and 15 having a function to be a full band time signal.

【００６０】これら帯域合成フィルタ１３，１４，１５
による全帯域信号は、周波数成分の変更によってダイナ
ミックレンジが、元の入力信号情報に比較して大きくな
っていることがあるので、コンパクトディスクに記録す
る場合には、１６ビットへの再量子化が必要となること
がある。なお、本件出願人は、先に、入力されたディジ
タルオーデイオ信号をオーディオ帯域内でのノイズシェ
イピングによって等ラウドネス特性に近いノイズ周波数
特性を与える再量子化を行いコンパクトディスクに１６
ビット再量子化信号を記録するような技術を、例えば前
述の特開平２−２０８１２号公報、特開平２−１８５５
５２号公報、特開平２−１８５５５６号公報にて開示し
ている。These band synthesis filters 13, 14, 15
In some cases, the dynamic range of the full-band signal due to the change of the frequency component is larger than that of the original input signal information. Therefore, when recording on a compact disc, requantization to 16 bits is required. May be required. The applicant of the present application first re-quantizes the input digital audio signal by noise shaping in the audio band to give a noise frequency characteristic close to the equal loudness characteristic, and performs 16-bit compact disc recording.
Techniques for recording a bit requantized signal are described in, for example, the above-mentioned JP-A-2-20812 and JP-A-2-1855.
No. 52, JP-A-2-185556.

【００６１】本発明ではこのような場合、本発明によっ
て処理された信号を更に上記ノイズシェイピングするこ
とによって１６ビットを越える特性をもつコンパクトデ
ィスク記録信号を得ることができる。According to the present invention, in such a case, the signal processed according to the present invention is further subjected to the noise shaping to obtain a compact disk recording signal having a characteristic exceeding 16 bits.

【００６２】以下、図１において上記ノイズシェイピン
グを行うノイズシェイパの動作を説明する。上記帯域合
成フィルタ１５から加算回路１８に供給された信号は、
帰還フィルタ２１の出力信号との差をとられる。加算回
路１８の出力は再量子化器１９及び第２の加算回路２０
に供給される。再量子化器１９は、入力信号語長よりも
少ない語長で出力されることで少ない情報量で信号を伝
送記録等を行おうとするものである。この再量子化器１
９の出力は当該ノイズシェイパの出力端子２２及び第２
の加算回路２０に供給される。第２の加算回路２０は再
量子化器１９の入力及び出力の信号の差を得るものであ
り、出力として量子化誤差が抽出される。第２の加算回
路２０の出力は帰還フィルタ２１に供給される。Hereinafter, the operation of the noise shaper for performing the noise shaping will be described with reference to FIG. The signal supplied from the band synthesis filter 15 to the addition circuit 18 is
The difference from the output signal of the feedback filter 21 is obtained. The output of the addition circuit 18 is a requantizer 19 and a second addition circuit 20.
Supplied to The requantizer 19 attempts to transmit and record a signal with a small amount of information by being output with a word length smaller than the word length of the input signal. This requantizer 1
9 is connected to the output terminal 22 of the noise shaper and the second terminal.
Is supplied to the adder circuit 20 of FIG. The second adding circuit 20 obtains the difference between the input and output signals of the requantizer 19, and extracts a quantization error as an output. The output of the second adding circuit 20 is supplied to a feedback filter 21.

【００６３】ここで、当該帰還フィルタ２１について図
１２にて詳細に説明する。この図１２において、端子５
０を介して帰還フィルタ２１に供給された信号は、遅延
素子５２，５３，５４，５５の直列回路に順次シフトし
てゆく。各遅延素子５２，５３，５４，５５の出力は、
乗算素子５６，５７，５８，５９と接続されており、こ
れら乗算素子５６，５７，５８，５９において各対応す
る係数入力端子６２，６３，６４，６５から供給される
フィルタ係数との積がとられる。これらの乗算素子５
６，５７，５８，５９の出力は、加算素子６０で加算さ
れて帰還フィルタの出力端子６１に導かれる。Here, the feedback filter 21 will be described in detail with reference to FIG. In FIG. 12, terminal 5
The signal supplied to the feedback filter 21 via 0 is sequentially shifted to a series circuit of the delay elements 52, 53, 54, and 55. The output of each delay element 52, 53, 54, 55 is
The multipliers 56, 57, 58, 59 are connected to each other, and the products of the multipliers 56, 57, 58, 59 are multiplied by the filter coefficients supplied from the corresponding coefficient input terminals 62, 63, 64, 65. Can be These multiplying elements 5
The outputs of 6, 57, 58, and 59 are added by an adding element 60 and guided to an output terminal 61 of a feedback filter.

【００６４】以上の加算回路１８、再量子化器１９、第
２の加算回路２０、及び帰還フイルタ２１より構成され
るノイズシェイパによって等ラウドネス特性に近いノイ
ズ周波数特性が与えられたディジタルオーデイオ信号
は、出力端子２２より出力される。この出力信号は、所
定の誤り訂正処理等がなされ、記録媒体（光磁気ディス
ク、光ディスク、半導体メモリ、ＩＣメモリーカード、
光ディスク）に記録される。The digital audio signal to which the noise frequency characteristic close to the equal loudness characteristic is given by the noise shaper constituted by the adder circuit 18, the requantizer 19, the second adder circuit 20, and the feedback filter 21 is output. Output from terminal 22. This output signal is subjected to a predetermined error correction process or the like, and is output to a recording medium (a magneto-optical disk, an optical disk, a semiconductor memory, an IC memory card,
Optical disc).

【００６５】なお、本発明実施例により形成された変換
データは記録媒体への記録の他に、伝送路を介して伝送
することも可能である。The converted data formed by the embodiment of the present invention can be transmitted via a transmission path in addition to being recorded on a recording medium.

【００６６】さらに、本発明は上記実施例のみに限定さ
れるものではなく、画像信号情報などにも適用できる。Further, the present invention is not limited to the above embodiment, but can be applied to image signal information and the like.

【００６７】[0067]

【発明の効果】本発明によれば、上述したようなことか
ら、音響時間信号情報を聴覚的な原理を用いて瞬時瞬時
に人間にとって音質的に高品質に心地好く聞こえる音を
作り出すことができる。また、既にディジタル化されて
量子化雑音が付加した音響時間信号情報から量子化雑音
の聴覚的な影響を減ずることにより、品質の向上を図る
ことができ、既にディジタル化されて量子化雑音が付加
されたオーディオ信号情報から量子化雑音の聴覚的な影
響を減じた後、コンパクトディスクのようなオーディオ
機器の音質を向上させる技術として例えば等ラウドネス
特性やマスキング特性に合うように量子化雑音のスペク
トルを変更することによって聴感上の雑音レベルを低減
させる技術を用いて、１６ビットの語長を持つコンパク
トディスクに記録するときに、聴覚的な処理によって音
質を向上させたデータを作ることができる。これによ
り、１６ビットを越える語長を有するディジタル信号を
１６ビット長を有するコンパクトディスクの為に再量子
化する場合、音質向上を図ることができる。さらに、本
発明によれば、既に量子化雑音が付加されたオーディオ
信号情報について、聴覚的に音質を等価的に１６ビット
以上に一度向上させ、再び１６ビットに再量子化する
際、聴覚的に重要な周波数帯域のＳ／Ｎを１６ビット以
上に保ったまま１６ビットとすることで、音質の向上を
図ることが可能となる。According to the present invention, from the above, it is possible to instantly and instantaneously produce a sound that sounds comfortable and high quality to humans by using the auditory principle of acoustic time signal information. it can. In addition, quality can be improved by reducing the auditory influence of quantization noise from the acoustic time signal information that has already been digitized and has quantization noise added, thereby improving the quality. After reducing the audible effects of quantization noise from the audio signal information obtained, as a technique for improving the sound quality of audio equipment such as compact discs, for example, the spectrum of quantization noise is adjusted to match the equal loudness characteristics and masking characteristics. When data is recorded on a compact disk having a word length of 16 bits by using a technique of reducing the noise level on the audibility by changing the data, data with improved sound quality can be produced by audible processing. Thus, when re-quantizing a digital signal having a word length exceeding 16 bits for a compact disk having a 16-bit length, sound quality can be improved. Furthermore, according to the present invention, when audio signal information to which quantization noise has already been added is perceptually improved in sound quality equivalently once to 16 bits or more and re-quantized to 16 bits again, By setting the S / N of the important frequency band to 16 bits while maintaining it at 16 bits or more, it is possible to improve sound quality.

[Brief description of the drawings]

【図１】本発明の時間信号情報の特性の変換方法（信号
変換方法）を実現する本実施例の信号変換装置の概略構
成例を示すブロック回路図である。FIG. 1 is a block circuit diagram illustrating a schematic configuration example of a signal conversion device according to an embodiment of the present invention that implements a method for converting characteristics of time signal information (signal conversion method) according to the present invention.

【図２】本発明に係る各帯域毎の時間ブロックを示す図
である。FIG. 2 is a diagram showing a time block for each band according to the present invention.

【図３】本発明に係る周波数移動ピークを示す図であ
る。FIG. 3 is a diagram showing a frequency shift peak according to the present invention.

【図４】本発明に係る周波数移動ピーク周波数特性の例
を示す図である。FIG. 4 is a diagram showing an example of a frequency shift peak frequency characteristic according to the present invention.

【図５】協和度と臨界帯域の関係を示す図である。FIG. 5 is a diagram showing a relationship between a degree of consonance and a critical band.

【図６】本発明実施例装置の不協和帯域検出回路の構成
例を示すブロック回路図である。FIG. 6 is a block circuit diagram showing a configuration example of a dissonance band detection circuit of the device according to the embodiment of the present invention.

【図７】本実施例装置のマスキングスレッショールドカ
ーブ検出回路の構成例を示すブロック回路図である。FIG. 7 is a block circuit diagram illustrating a configuration example of a masking threshold curve detection circuit of the device of the present embodiment.

【図８】各臨界帯域の信号成分の総和値を示す図であ
る。FIG. 8 is a diagram showing the sum of signal components in each critical band.

【図９】各臨界帯域の信号成分の総和値とマスキングス
レショールドを示す図である。FIG. 9 is a diagram illustrating a sum value of signal components in each critical band and a masking threshold.

【図１０】各臨界帯域の信号成分の総和値とマスキング
スレショールド、最小可聴限を示す図である。FIG. 10 is a diagram showing the sum of signal components in each critical band, a masking threshold, and a minimum audible limit.

【図１１】周波数成分の大きさを変える例を示す図であ
る。FIG. 11 is a diagram illustrating an example in which the magnitude of a frequency component is changed.

【図１２】ノイズシェーピングの為の帰還フィルタの構
成例を示す図である。FIG. 12 is a diagram illustrating a configuration example of a feedback filter for noise shaping.

[Explanation of symbols]

１，４１，７１・・・入力端子２，３，４・・・・・帯域分割フィルタ（ＣＱＦ）５ａ，５ｂ，５ｃ，５ｄ・・・・ＭＤＣＴ回路６・・・・・・・・・周波数成分変更回路１０・・・・・・・・マスク回路１１・・・・・・・・不協和周波数検出回路１２・・・・・・・・周波数移動ピーク検出回路１３，１４，１５・・帯域合成フィルタ１６・・・・・・・・マスキングスレッショールドカー
ブ検出回路１６１７・・・・・・・・最小可聴カーブ発生回路１８，２０・・・・・加算回路１９・・・・・・・・再量子化器２１・・・・・・・・帰還フィルタ２２，４５，６１，８１・・・出力端子４２・・・・・・・・臨界帯域の１０％幅の移動ピーク
検出回路４３・・・・・・・・臨界帯域の５０％幅の移動ピーク
検出回路４４・・・・・・・・差検出回路５２，５３，５４，５５・・・遅延素子５６，５７，５８，５９・・・乗算素子６０・・・・・・・・加算素子７２・・・・・・・・臨界帯域のエネルギ算出回路７３・・・・・・・・畳み込みフィルタ回路７５・・・・・・・・関数発生回路７４・・・・・・・・引算器７６・・・・・・・・割算器７７・・・・・・・・合成回路７８・・・・・・・・減算回路1, 41, 71 ... input terminal 2, 3, 4 ... band division filter (CQF) 5a, 5b, 5c, 5d ... MDCT circuit 6 ... frequency Component change circuit 10 Mask circuit 11 Dissonance frequency detection circuit 12 Frequency shift peak detection circuit 13, 14, 15 Band Synthetic filter 16 Masking threshold curve detection circuit 16 17 Minimum audible curve generation circuit 18, 20 Addition circuit 19 .. requantizer 21... Feedback filter 22, 45, 61, 81... Output terminal 42... 10% width of the critical band moving peak detection circuit 43. ........... Moving peak detection circuit having 50% width of critical band ·········· Difference detection circuits 52, 53, 54 and 55 ··· Delay elements 56, 57, 58 and 59 ··· Multiplication elements 60 ········ Addition elements 72 ···· Critical band energy calculation circuit 73 ······· Convolution filter circuit 75 ····· Function generation circuit 74 ················· ··········································· Subtraction circuit

───────────────────────────────────────────────────── フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) H03M 7/30 G10L 13/00 G11B 20/10 301 ──────────────────────────────────────────────────続き Continued on the front page (58) Field surveyed (Int.Cl. ⁷ , DB name) H03M 7/30 G10L 13/00 G11B 20/10 301

Claims

(57) [Claims]

An audio time signal information is obtained by converting it into a frequency.
Transforms the sound time signal using the frequency component
In the signal conversion method, for each frequency component substantially in the critical band,
Find at least two component-based indices with different frequency widths
Because the area of the frequency components within the critical band with the index
Select and select the frequency components of the selected region and other
A signal conversion method characterized by changing a relative magnitude with respect to a frequency component .

2. The method according to claim 1, wherein the acoustic time signal information is converted into a frequency.
Transforms the sound time signal using the frequency component
In the signal conversion device, for each frequency component within the substantially critical band,
Find at least two component-based indices with different frequency widths
Index calculating means, and using the above-mentioned index to calculate the frequency component region in the critical band.
Region selecting means to select, frequency components of the selected region and other within the critical band
Frequency component change that changes the relative size with the frequency component
Additional signal conversion apparatus characterized by having means.

3. The method according to claim 2 , wherein the region selecting means includes a region within the critical band.
Select the region corresponding to at least one local peak
Signal converting apparatus according to claim 2, characterized in that.

4. The method according to claim 1, wherein the index calculating means comprises two different
The frequency width is 10% and 50% of the critical bandwidth.
3. The signal converter according to claim 2 , wherein a wave number width is used .

5. The method according to claim 1, wherein the index calculating means transfers the index.
Using the moving peak value, the region selecting means increases the difference according to the difference between the moving peak values.
The region of the frequency component is selected.
2. The signal conversion device according to 2 .

Wherein said area selection means selects a frequency region where the value is negative minus the moving peak value of 10% the width of the critical bandwidth from the mobile peak value of 50% the width of the critical bandwidth
And the frequency component changing means is provided by the area selecting means.
Set the magnitude of the frequency component in the selected area within the above critical band.
6. The signal conversion device according to claim 5 , wherein the signal conversion device is relatively reduced or eliminated as compared with other frequency components .

7. The frequency component changing means, wherein, characterized in that to change the relative size of the <br/> frequency components within the critical band to store the short energy time signal information Item 3. The signal conversion device according to Item 2 .

8. The apparatus according to claim 7, wherein said frequency component changing means is selected.
Relative magnitude between the frequency component, a frequency component exceeding the minimum audible level or a masking threshold level of the other frequency components within the critical band of the serial area
Signal converter according to claim 2, wherein changing the.

9. The apparatus according to claim 6, wherein said frequency component changing means is selected.
The relative magnitude between the frequency components in the region and the frequency components within the level range defined by the quantization noise level
Signal converter according to claim 2, wherein changing the.

10. The signal conversion according to claim 2 , further comprising requantization processing means for performing a requantization process having a noise shape characteristic on the time signal information recombined on the time axis. apparatus.

11. A recording medium on which is recorded conversion data converted based on the signal conversion method according to claim 1.