JP2012249048A

JP2012249048A - Acoustic processing device

Info

Publication number: JP2012249048A
Application number: JP2011118772A
Authority: JP
Inventors: Kazunobu Kondo; 多伸近藤; Yasuyuki Umeyama; 康之梅山
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2011-05-27
Filing date: 2011-05-27
Publication date: 2012-12-13
Anticipated expiration: 2031-05-27
Also published as: JP5861275B2

Abstract

PROBLEM TO BE SOLVED: To easily recognize a time change of a localization direction of each frequency component of an acoustic signal.SOLUTION: A display control part 40 displays a frequency characteristic image 50, in which a spectrogram of an acoustic signal x is arranged in a region A to which a frequency axis F1 and a time axis T1 are set, and a localization image 60, in which an acoustic image point q of each frequency component corresponding to a focal point on the time axis T1 among the acoustic signals x is arranged in a region B to which a frequency axis F2 and a localization axis L indicating a localization direction θ are set. A frequency/time designation region 52 defined by an object band aF on the frequency axis F1 and an object period aT on the time axis T1 is arranged in the region A, and a target designation region 62 defined by an object bF on the frequency axis F2 and an object localization area bL on the localization axis L for the acoustic signals x in the object period aT is arranged in the region B according to an instruction from a user. A signal processing part 36 suppresses or emphasizes each frequency component indicated by the acoustic image point q in the target designation region 62 among the acoustic signals x in the object period aT.

Description

本発明は、音響信号に関連する情報を表示する技術に関する。 The present invention relates to a technique for displaying information related to an acoustic signal.

音響信号のうち所定の方向に音像が定位する成分を抑圧または強調する技術が従来から提案されている。例えば特許文献１や特許文献２には、音響信号の左右のチャネル間の類似度に応じて周波数毎に設定された係数値を音響信号の各周波数成分に乗算することで前方（中央）に定位する成分を抑圧する技術が開示されている。 Conventionally, a technique for suppressing or enhancing a component in which a sound image is localized in a predetermined direction in an acoustic signal has been proposed. For example, in Patent Document 1 and Patent Document 2, localization is performed forward (center) by multiplying each frequency component of the acoustic signal by a coefficient value set for each frequency according to the similarity between the left and right channels of the acoustic signal. A technique for suppressing the components to be generated is disclosed.

特許第３６７０５６２号公報Japanese Patent No. 3670562 特開２００９−１８８９７１号公報JP 2009-188971 A

ところで、音響信号の各周波数成分の定位方向の時間的な変化を視覚的に確認できれば、特定の期間内で所望の方向に定位する周波数成分（例えば歌唱曲の収録音のうち中央に定位する歌唱音）を抑圧または強調の対象として利用者が容易に選択できて便利である。しかし、例えば音響信号のスペクトログラムでは、音響信号の各周波数成分の時間的な変化が提示されるに過ぎず、各周波数成分の定位方向の時間的な変化までは確認できない。したがって、所望の方向に定位する成分を抑圧または強調の対象として選択することは困難である。また、時間軸と定位方向を示す定位軸とが設定された領域内に音響信号の定位方向の時間的な変化の画像を表示する構成も想定されるが、音響信号の周波数成分毎の定位方向の変化を確認できないという問題がある。以上の事情を考慮して、本発明は、音響信号の各周波数成分について定位方向の時間的な変化を視覚的に容易に確認できるようにすることを目的とする。 By the way, if the temporal change in the localization direction of each frequency component of the acoustic signal can be visually confirmed, a frequency component localized in a desired direction within a specific period (for example, a song localized in the center of the recorded sound of a song) This is convenient because the user can easily select (sound) as an object of suppression or emphasis. However, for example, in the spectrogram of the acoustic signal, only the temporal change of each frequency component of the acoustic signal is presented, and the temporal change of the localization direction of each frequency component cannot be confirmed. Therefore, it is difficult to select a component that is localized in a desired direction as an object of suppression or enhancement. In addition, a configuration in which an image of temporal changes in the localization direction of the acoustic signal is displayed in an area where the time axis and the localization axis indicating the localization direction are set is assumed, but the localization direction for each frequency component of the acoustic signal is also assumed. There is a problem that the change of can not be confirmed. In view of the above circumstances, an object of the present invention is to make it possible to easily visually confirm a temporal change in a localization direction for each frequency component of an acoustic signal.

以上の課題を解決するために本発明が採用する手段を説明する。なお、本発明の理解を容易にするために、以下の説明では、本発明の要素と後述の実施形態の要素との対応を括弧書で付記するが、本発明の範囲を実施形態の例示に限定する趣旨ではない。 Means employed by the present invention to solve the above problems will be described. In order to facilitate the understanding of the present invention, in the following description, the correspondence between the elements of the present invention and the elements of the embodiments described later will be indicated in parentheses, but the scope of the present invention will be exemplified in the embodiments. It is not intended to be limited.

本発明の音響処理装置は、利用者からの指示を受付ける受付手段（例えば入力装置２４）と、第１周波数軸（例えば周波数軸Ｆ1）と第１時間軸（例えば時間軸Ｔ1）とが設定された第１領域（例えば領域Ａ）内に音響信号のスペクトログラムを配置した周波数特性画像と、第２周波数軸（例えば周波数軸Ｆ2）と定位方向を示す定位軸（例えば定位軸Ｌ）とが設定された第２領域（例えば領域Ｂ）内に音響信号のうち第１時間軸上の着目点に対応する各周波数成分の音像点を配置した定位画像とを表示装置に表示させる手段であって、第１周波数軸上の対象帯域（例えば対象帯域ａF）と第１時間軸上の対象期間（例えば対象期間ａT）とで画定される周波数/時間指定領域を受付手段に対する指示に応じて第１領域に配置し、対象期間内の音響信号について、第２周波数軸上の対象帯域（例えば対象帯域ｂF）と定位軸上の対象定位域（例えば対象定位域ｂL）とで画定される目標指定領域を受付手段に対する指示に応じて第２領域に配置する表示制御手段と、対象期間内の音響信号のうち目標指定領域内の音像点が示す各周波数成分（目標成分）を抑圧または強調する信号処理手段とを具備する。 In the sound processing apparatus of the present invention, receiving means (for example, the input device 24) for receiving an instruction from the user, a first frequency axis (for example, the frequency axis F1), and a first time axis (for example, the time axis T1) are set. The frequency characteristic image in which the spectrogram of the acoustic signal is arranged in the first region (for example, the region A), the second frequency axis (for example, the frequency axis F2), and the localization axis (for example, the localization axis L) indicating the localization direction are set. Means for causing a display device to display a localization image in which sound image points of each frequency component corresponding to a point of interest on the first time axis in an acoustic signal in a second region (for example, region B) are arranged. A frequency / time designation area defined by a target band on one frequency axis (for example, target band aF) and a target period on the first time axis (for example, target period aT) is set as the first area in response to an instruction to the receiving means. Placed and acoustic signal within the target period A target designation area defined by a target band (for example, target band bF) on the second frequency axis and a target localization area (for example, target localization area bL) on the localization axis according to an instruction to the receiving means. And a signal processing means for suppressing or emphasizing each frequency component (target component) indicated by the sound image point in the target designated area of the acoustic signal within the target period.

以上の構成では、音響信号のスペクトログラムを配置した周波数特性画像と音響信号の着目点に対応する各周波数成分の音像点を配置した定位画像とが表示装置に表示されるから、音響信号の各周波数成分について定位方向の時間的な変化を利用者が視覚的に容易に確認することが可能である。また、周波数特性画像の周波数/時間指定領域の対象帯域に対応する目標指定領域が定位画像に配置されるから、音響信号の目標成分について周波数と定位方向と時間との関係を利用者が容易に把握できるという利点もある。なお、定位画像に音像点を配置する音響信号の範囲は任意であるが、例えば音響信号を複数の単位区間に区分する構成では、複数の単位区間のうち着目点に対応する所定個（単数または複数）の単位区間における各周波数成分の音像点を表示制御手段が第２領域内に配置する構成が好適である。また、目標指定領域および周波数/時間指定領域の指定の順序は任意である。すなわち、目標指定領域の指定後に周波数/時間指定領域を配置する構成と、周波数/時間指定領域の指定後に目標指定領域を配置する構成との双方が本発明の範囲に包含される。 In the above configuration, since the frequency characteristic image in which the spectrogram of the acoustic signal is arranged and the localization image in which the sound image point of each frequency component corresponding to the target point of the acoustic signal is displayed on the display device, each frequency of the acoustic signal is displayed. The user can easily visually confirm the temporal change in the localization direction of the component. In addition, since the target designation area corresponding to the target band of the frequency / time designation area of the frequency characteristic image is arranged in the localization image, the user can easily determine the relationship between the frequency, localization direction, and time for the target component of the acoustic signal. There is also an advantage that it can be grasped. Note that the range of the acoustic signal in which the sound image point is arranged in the localization image is arbitrary, but for example, in a configuration in which the acoustic signal is divided into a plurality of unit sections, a predetermined number (single or A configuration in which the display control means arranges the sound image points of each frequency component in a plurality of unit sections in the second region is preferable. The order of designation of the target designation area and the frequency / time designation area is arbitrary. That is, both the configuration in which the frequency / time designation region is arranged after the designation of the target designation region and the configuration in which the target designation region is arranged after the designation of the frequency / time designation region are included in the scope of the present invention.

本発明の前述の効果は、周波数特性画像と定位画像とを表示装置に並列（同時）に表示した場合に格別に顕著となる。ただし、周波数特性画像と定位画像とを例えば利用者からの指示に応じて選択的に表示することも可能である。 The above-described effect of the present invention becomes particularly remarkable when the frequency characteristic image and the localization image are displayed in parallel (simultaneously) on the display device. However, the frequency characteristic image and the localization image can be selectively displayed according to an instruction from the user, for example.

本発明の好適な態様において、表示制御手段は、音響信号を区分した単位区間の所定個毎の各周波数成分の音像点の分布を、第２周波数軸と定位軸とに交差する時間軸の方向に立体的に配列した定位画像を表示装置に表示させる。以上の態様では、音響信号ｘの各音像点の分布の時間的な変化を利用者が容易に把握できるという利点がある。なお、以上の態様の具体例は、例えば第５実施形態として後述される。 In a preferred aspect of the present invention, the display control means determines the distribution of the sound image points of each frequency component for each predetermined unit section in which the acoustic signal is divided, in the direction of the time axis that intersects the second frequency axis and the localization axis. A stereo image arranged three-dimensionally is displayed on a display device. In the above aspect, there exists an advantage that a user can grasp | ascertain the temporal change of distribution of each sound image point of the acoustic signal x easily. In addition, the specific example of the above aspect is later mentioned, for example as 5th Embodiment.

本発明の好適な態様において、表示制御手段は、対象帯域または対象定位域が相違する複数の目標指定領域を受付手段に対する指示に応じて第２領域内に重複可能に配置し、各目標指定領域に対応する複数の周波数/時間指定領域を第１領域内に重複可能に配置する。以上の態様では、対象帯域または対象定位域が相違する複数の目標指定領域が第２領域に重複可能に配置されるとともに各目標指定領域に対応する複数の周波数/時間指定領域が第１領域に重複可能に配置される。したがって、周波数と期間と定位方向とが相違する複数種の成分を目標成分として利用者が容易に指定できるという利点がある。なお、以上の態様の具体例は例えば第２実施形態として後述される。 In a preferred aspect of the present invention, the display control means arranges a plurality of target designation areas having different target bands or target localization areas in the second area in accordance with instructions to the reception means, and each target designation area A plurality of frequency / time designating areas corresponding to are arranged in the first area so as to overlap each other. In the above aspect, a plurality of target designation areas having different target bands or target localization areas are arranged so as to overlap the second area, and a plurality of frequency / time designation areas corresponding to each target designation area are in the first area. It is arranged so that it can overlap. Therefore, there is an advantage that the user can easily specify a plurality of types of components having different frequencies, periods, and localization directions as target components. In addition, the specific example of the above aspect is later mentioned as 2nd Embodiment, for example.

本発明の好適な態様において、表示制御手段は、目標指定領域に対応する複数の周波数/時間指定領域を、第１領域における第１時間軸上の相異なる時点に設定する。以上の態様では、１個の目標指定領域に対応する複数の周波数/時間指定領域が第１領域に配置されるから、例えば所定の帯域内で間欠的に発生する目標成分を容易かつ適切に指定することが可能である。なお、以上の態様の具体例は、例えば第３実施形態として後述される。 In a preferred aspect of the present invention, the display control means sets a plurality of frequency / time designation areas corresponding to the target designation area at different time points on the first time axis in the first area. In the above aspect, since a plurality of frequency / time specifying areas corresponding to one target specifying area are arranged in the first area, for example, target components generated intermittently within a predetermined band can be specified easily and appropriately. Is possible. In addition, the specific example of the above aspect is later mentioned, for example as 3rd Embodiment.

本発明の好適な態様において、表示制御手段は、第１領域内のスペクトログラムのうち目標指定領域内の各音像点に対応する周波数成分と他の周波数成分とを相異なる態様で表示する。以上の態様では、目標指定領域の内側の各音像点に対応する周波数成分と外側の各音像点に対応する周波数成分とが相異なる態様で表示されるから、目標成分として抑圧または強調の対象となる周波数成分を利用者が周波数特性画像から容易に把握できるという利点がある。なお、以上の態様の具体例は、例えば第６実施形態として後述される。 In a preferred aspect of the present invention, the display control means displays the frequency component corresponding to each sound image point in the target designated region and the other frequency components in the spectrogram in the first region in different manners. In the above aspect, since the frequency component corresponding to each sound image point inside the target designation region and the frequency component corresponding to each sound image point outside are displayed in different modes, the target component is the object of suppression or enhancement. There is an advantage that the user can easily grasp the frequency component from the frequency characteristic image. In addition, the specific example of the above aspect is later mentioned, for example as 6th Embodiment.

本発明の好適な態様において、表示制御手段は、周波数/時間指定領域および目標指定領域の一方の対象帯域について受付手段に付与された変更指示に応じて周波数/時間指定領域および目標指定領域の双方の対象帯域を変更する。以上の態様では、対象帯域を変更する場合の利用者の負担が軽減されるという利点がある。 In a preferred aspect of the present invention, the display control means includes both the frequency / time designation area and the target designation area according to a change instruction given to the reception means for one target band of the frequency / time designation area and the target designation area. Change the target bandwidth. In the above aspect, there exists an advantage that the burden of the user at the time of changing an object band is reduced.

本発明の好適な態様において、表示制御手段は、音響信号を構成する第１チャネル信号（例えば音響信号ｘL）および第２チャネル信号（例えば音響信号ｘR）の各々のスペクトログラムを第１領域内に配置し、定位軸上における目標指定領域の位置に応じて第１チャネル信号のスペクトログラム（例えばスペクトログラムＳL）と第２チャネル信号のスペクトログラム（例えばスペクトログラムＳR）とを相異なる表示態様に変化させる。以上の態様では、目標指定領域の位置に応じて各スペクトログラムの表示態様が制御されるから、目標成分の定位位置を利用者が周波数特性画像から直感的に把握できるという利点がある。なお、目標指定領域の位置に応じて表示態様を変化させる領域は、スペクトログラムの全体および一部の何れでもよい。すなわち、スペクトログラムの全体にわたる表示態様を第１チャネル信号と第２チャネル信号とで相違させる構成のほか、例えば、スペクトログラムのうち周波数/時間指定領域の範囲内の表示態様を第１チャネル信号と第２チャネル信号とで相違させる構成も採用され得る。なお、以上の態様の具体例は、例えば第４実施形態として後述される。 In a preferred aspect of the present invention, the display control means arranges spectrograms of the first channel signal (for example, the acoustic signal xL) and the second channel signal (for example, the acoustic signal xR) constituting the acoustic signal in the first region. Then, the spectrogram (for example, spectrogram SL) of the first channel signal and the spectrogram (for example, spectrogram SR) of the second channel signal are changed to different display modes according to the position of the target designation region on the localization axis. In the above mode, since the display mode of each spectrogram is controlled according to the position of the target designation region, there is an advantage that the user can intuitively grasp the localization position of the target component from the frequency characteristic image. Note that the region in which the display mode is changed according to the position of the target designation region may be either the whole or a part of the spectrogram. That is, in addition to the configuration in which the display mode over the entire spectrogram is different between the first channel signal and the second channel signal, for example, the display mode within the frequency / time designation region of the spectrogram is changed to the first channel signal and the second channel signal. A configuration that is different from that of the channel signal may also be adopted. In addition, the specific example of the above aspect is later mentioned as 4th Embodiment, for example.

本発明の好適な態様において、表示制御手段は、第２時間軸（例えば時間軸Ｔ2）が設定された第３領域（例えば領域Ｃ）内に信号処理手段による処理前または処理後の音響信号の波形を配置した波形画像を、周波数特性画像および定位画像とともに表示装置に表示させる。以上の態様では、音響信号の波形画像が周波数特性画像や定位画像とともに表示装置に表示されるから、目標成分の対象期間と音響信号の波形との関係で容易に把握できるという利点がある。また、第２時間軸上の対象期間を示す時間指定領域を表示制御手段が第３領域に配置する構成によれば、以上の効果は格別に顕著となる。 In a preferred aspect of the present invention, the display control means includes the acoustic signal before or after the processing by the signal processing means in the third area (for example, the area C) in which the second time axis (for example, the time axis T2) is set. The waveform image in which the waveform is arranged is displayed on the display device together with the frequency characteristic image and the localization image. In the above aspect, since the waveform image of the acoustic signal is displayed on the display device together with the frequency characteristic image and the localization image, there is an advantage that it can be easily grasped by the relationship between the target period of the target component and the waveform of the acoustic signal. In addition, according to the configuration in which the display control unit arranges the time designation area indicating the target period on the second time axis in the third area, the above effect becomes particularly remarkable.

本発明の好適な態様において、表示制御手段は、周波数/時間指定領域および時間指定領域の一方の対象期間について受付手段に付与された変更指示に応じて周波数/時間指定領域および時間指定領域の双方の対象期間を変更する。以上の態様では、対象期間を変更する場合の利用者の負担が軽減されるという利点がある。 In a preferred aspect of the present invention, the display control means includes both the frequency / time designation area and the time designation area in accordance with a change instruction given to the reception means for one target period of the frequency / time designation area and the time designation area. Change the target period. In the above aspect, there exists an advantage that the burden of the user at the time of changing a target period is reduced.

なお、前述の各形態における表示の態様とは、観察者が視覚的に認識できる画像の状態を意味する。具体的な表示態様としては、形状，寸法，階調，色彩，点滅の有無等が例示される。 In addition, the display mode in each of the above-described forms means an image state that can be visually recognized by an observer. Specific examples of display modes include shape, dimensions, gradation, color, presence / absence of blinking, and the like.

以上の各態様に係る音響処理装置は、処理係数列の生成に専用されるＤＳＰ（Digital Signal Processor）などのハードウェア（電子回路）によって実現されるほか、ＣＰＵ（Central Processing Unit）等の汎用の演算処理装置とプログラムとの協働によっても実現される。本発明に係るプログラムは、利用者からの指示を受付ける受付手段と、第１周波数軸と第１時間軸とが設定された第１領域内に音響信号のスペクトログラムを配置した周波数特性画像と、第２周波数軸と定位方向を示す定位軸とが設定された第２領域内に音響信号のうち第１時間軸上の着目点に対応する各周波数成分の音像点を配置した定位画像とを表示装置に表示させる手段であって、第１周波数軸上の対象帯域と第１時間軸上の対象期間とで画定される周波数/時間指定領域を受付手段に対する指示に応じて第１領域に配置し、対象期間内の音響信号について、第２周波数軸上の対象帯域と定位軸上の対象定位域とで画定される目標指定領域を受付手段に対する指示に応じて第２領域に配置する表示制御手段と、対象期間内の音響信号のうち目標指定領域内の音像点が示す各周波数成分を抑圧または強調する信号処理手段としてコンピュータを機能させる。以上のプログラムによれば、本発明に係る音響処理装置と同様の作用および効果が奏される。本発明のプログラムは、コンピュータが読取可能な記録媒体に格納された形態で利用者に提供されてコンピュータにインストールされるほか、通信網を介した配信の形態でサーバ装置から提供されてコンピュータにインストールされる。 The sound processing device according to each of the above aspects is realized by hardware (electronic circuit) such as a DSP (Digital Signal Processor) dedicated to generation of a processing coefficient sequence, and a general-purpose device such as a CPU (Central Processing Unit). This is also realized by cooperation between the arithmetic processing unit and the program. A program according to the present invention includes a receiving means for receiving an instruction from a user, a frequency characteristic image in which a spectrogram of an acoustic signal is arranged in a first region in which a first frequency axis and a first time axis are set, A localization device in which a sound image point of each frequency component corresponding to a point of interest on the first time axis in an acoustic signal is arranged in a second region in which two frequency axes and a localization axis indicating a localization direction are set. A frequency / time designation region defined by a target band on the first frequency axis and a target period on the first time axis is arranged in the first region according to an instruction to the reception unit, Display control means for arranging a target designation area defined by a target band on the second frequency axis and a target localization area on the localization axis in the second area for an acoustic signal within the target period in response to an instruction to the reception means; , Acoustic signal within the target period In other words, the computer is caused to function as signal processing means for suppressing or enhancing each frequency component indicated by the sound image point in the target designation region. According to the above program, the same operation and effect as the sound processing apparatus according to the present invention are exhibited. The program of the present invention is provided to a user in a form stored in a computer-readable recording medium and installed in the computer, or provided from a server device in a form of distribution via a communication network and installed in the computer. Is done.

第１実施形態に係る音響処理装置のブロック図である。1 is a block diagram of a sound processing apparatus according to a first embodiment. 第１実施形態における編集画面の模式図である。It is a schematic diagram of the edit screen in 1st Embodiment. 第２実施形態における編集画面の模式図である。It is a schematic diagram of the edit screen in 2nd Embodiment. 第３実施形態における編集画面の模式図である。It is a schematic diagram of the edit screen in 3rd Embodiment. 第４実施形態における編集画面の模式図である。It is a schematic diagram of the edit screen in 4th Embodiment. 第５実施形態における編集画面の模式図である。It is a schematic diagram of the edit screen in 5th Embodiment. 第６実施形態における編集画面の模式図である。It is a schematic diagram of the edit screen in 6th Embodiment.

＜Ａ：第１実施形態＞
図１は、本発明の第１実施形態に係る音響処理装置１００のブロック図である。図１に示すように、音響処理装置１００は、演算処理装置１２と記憶装置１４と信号供給装置２２と入力装置２４と表示装置２６と放音装置２８とを具備するコンピュータシステムで実現される。入力装置２４は、利用者からの指示を受付ける機器（受付手段）であり、例えば複数の操作子を含んで構成される。表示装置２６（例えば液晶表示機器）は、演算処理装置１２による制御のもとで各種の画像を表示する。 <A: First Embodiment>
FIG. 1 is a block diagram of a sound processing apparatus 100 according to the first embodiment of the present invention. As shown in FIG. 1, the sound processing device 100 is realized by a computer system including an arithmetic processing device 12, a storage device 14, a signal supply device 22, an input device 24, a display device 26, and a sound emitting device 28. The input device 24 is a device (accepting means) that accepts an instruction from a user, and includes, for example, a plurality of operators. The display device 26 (for example, a liquid crystal display device) displays various images under the control of the arithmetic processing device 12.

信号供給装置２２は、相異なる位置に存在する音源が発音した複数の音響（歌唱音や伴奏音）の混合音の波形を示す音響信号ｘ（ｘL，ｘR）を演算処理装置１２に供給する。左チャネルの音響信号ｘLおよび右チャネルの音響信号ｘRは、各音源に対応する音像が相異なる位置に定位するように収音または加工（例えば左右チャネル間の振幅比や遅延量を変更する処理）されたステレオ形式の信号である。周囲の音響を収音して音響信号ｘを生成する収音機器（ステレオマイク）や、可搬型または内蔵型の記録媒体（例えばＣＤ）から音響信号ｘを読出す再生装置や、通信網から音響信号ｘを受信する通信装置が信号供給装置２２として採用され得る。 The signal supply device 22 supplies the arithmetic processing device 12 with an acoustic signal x (xL, xR) indicating a waveform of a mixed sound of a plurality of sounds (singing sound and accompaniment sound) generated by sound sources present at different positions. The left channel acoustic signal xL and the right channel acoustic signal xR are collected or processed so that the sound images corresponding to each sound source are located at different positions (for example, processing for changing the amplitude ratio and delay amount between the left and right channels). Stereo signal. Sound collecting device (stereo microphone) that picks up surrounding sound and generates sound signal x, playback device that reads sound signal x from a portable or built-in recording medium (for example, CD), and sound from a communication network A communication device that receives the signal x may be employed as the signal supply device 22.

記憶装置１４は、演算処理装置１２が実行するプログラムや演算処理装置１２が使用する各種のデータを記憶する。半導体記録媒体や磁気記録媒体等の公知の記録媒体または複数種の記録媒体の組合せが記憶装置１４として任意に採用される。なお、音響信号ｘ（ｘL，ｘR）を記憶装置１４に格納した構成（したがって信号供給装置２２は省略される）も採用され得る。 The storage device 14 stores a program executed by the arithmetic processing device 12 and various data used by the arithmetic processing device 12. A known recording medium such as a semiconductor recording medium or a magnetic recording medium or a combination of a plurality of types of recording media is arbitrarily employed as the storage device 14. Note that a configuration in which the acoustic signal x (xL, xR) is stored in the storage device 14 (therefore, the signal supply device 22 is omitted) may be employed.

演算処理装置１２は、記憶装置１４に格納されたプログラムを実行することで、信号供給装置２２が供給する音響信号ｘから音響信号ｙ（ｙL，ｙR）を生成するための複数の機能（周波数分析部３２，係数列生成部３４，信号処理部３６，波形合成部３８，表示制御部４０）を実現する。左チャネルの音響信号ｙLおよび右チャネルの音響信号ｙRは、音響信号ｘの特定の成分（以下「目標成分」という）を他の成分に対して抑圧または強調したステレオ形式の信号である。音響信号ｘのうち所定の方向に音像が定位する各周波数成分が目標成分として抑圧または強調される。なお、演算処理装置１２の各機能を複数の集積回路に分散した構成や、専用の電子回路（ＤＳＰ）が各機能を実現する構成も採用され得る。図１の放音装置２８（例えばステレオスピーカやステレオヘッドホン）は、演算処理装置１２が生成した音響信号ｙ（ｙL，ｙR）に応じた音波を放射する。 The arithmetic processing device 12 executes a program stored in the storage device 14 to thereby generate a plurality of functions (frequency analysis) for generating the acoustic signal y (yL, yR) from the acoustic signal x supplied by the signal supply device 22. Unit 32, coefficient sequence generation unit 34, signal processing unit 36, waveform synthesis unit 38, display control unit 40). The left-channel acoustic signal yL and the right-channel acoustic signal yR are stereo signals in which a specific component (hereinafter referred to as “target component”) of the acoustic signal x is suppressed or enhanced with respect to other components. Each frequency component in which the sound image is localized in a predetermined direction in the acoustic signal x is suppressed or enhanced as a target component. A configuration in which each function of the arithmetic processing unit 12 is distributed over a plurality of integrated circuits, or a configuration in which a dedicated electronic circuit (DSP) realizes each function may be employed. The sound emitting device 28 (for example, a stereo speaker or a stereo headphone) in FIG. 1 emits a sound wave corresponding to the acoustic signal y (yL, yR) generated by the arithmetic processing device 12.

周波数分析部３２は、音響信号ｘLの周波数スペクトルＸLと音響信号ｘRの周波数スペクトルＸRとを時間軸上の単位区間（フレーム）毎に順次に生成する。周波数スペクトルＸLは、相異なる周波数（周波数帯域）ｆに対応する複数の周波数成分ＸL(f,m)の系列である。同様に、周波数スペクトルＸRは、複数の周波数成分ＸR(f,m)で構成される。記号ｍは時間軸上の各時点（単位区間の番号）を意味する。周波数スペクトルＸLおよび周波数スペクトルＸRの生成には、例えば短時間フーリエ変換等の公知の周波数分析技術が任意に採用される。 The frequency analysis unit 32 sequentially generates the frequency spectrum XL of the acoustic signal xL and the frequency spectrum XR of the acoustic signal xR for each unit section (frame) on the time axis. The frequency spectrum XL is a series of a plurality of frequency components XL (f, m) corresponding to different frequencies (frequency bands) f. Similarly, the frequency spectrum XR is composed of a plurality of frequency components XR (f, m). The symbol m means each time point (unit section number) on the time axis. For the generation of the frequency spectrum XL and the frequency spectrum XR, a known frequency analysis technique such as short-time Fourier transform is arbitrarily employed.

係数列生成部３４は、音響信号ｘの目標成分を抑圧または強調するための処理係数列Ｇ(m)を単位区間毎に順次に生成する。処理係数列Ｇ(m)は、相異なる周波数ｆに対応する複数の係数値ｇ(f,m)の系列である。各係数値ｇ(f,m)は、音響信号ｘの周波数成分ＸL(f,m)および周波数成分ＸR(f,m)に対する利得（スペクトルゲイン）に相当し、音響信号ｘの目標成分が抑圧または強調されるように選定される。各係数値ｇ(f,m)を選定する処理の詳細は後述する。 The coefficient sequence generator 34 sequentially generates a processing coefficient sequence G (m) for suppressing or enhancing the target component of the acoustic signal x for each unit section. The processing coefficient sequence G (m) is a series of a plurality of coefficient values g (f, m) corresponding to different frequencies f. Each coefficient value g (f, m) corresponds to a gain (spectral gain) for the frequency component XL (f, m) and the frequency component XR (f, m) of the acoustic signal x, and the target component of the acoustic signal x is suppressed. Or selected to be emphasized. Details of the process of selecting each coefficient value g (f, m) will be described later.

信号処理部３６は、係数列生成部３４が生成した処理係数列Ｇ(m)を周波数スペクトルＸLおよび周波数スペクトルＸRの各々に作用させることで音響信号ｙLの周波数スペクトルＹL（周波数成分ＹL(f,m)）と音響信号ｙRの周波数スペクトルＹR（周波数成分ＹR(f,m)）とを単位区間毎に生成する。具体的には、音響信号ｘLの周波数成分ＸL(f,m)と処理係数列Ｇ(m)の係数値ｇ(f,m)との乗算値が音響信号ｙLの周波数成分ＹL（f,m）として算定され、音響信号ｘRの周波数成分ＸR(f,m)と係数値ｇ(f,m)との乗算値が音響信号ｙRの周波数成分ＹR(f,m)として算定される。 The signal processing unit 36 applies the processing coefficient sequence G (m) generated by the coefficient sequence generation unit 34 to each of the frequency spectrum XL and the frequency spectrum XR, thereby causing the frequency spectrum YL (frequency component YL (f, f, m)) and the frequency spectrum YR (frequency component YR (f, m)) of the acoustic signal yR are generated for each unit section. Specifically, the product of the frequency component XL (f, m) of the acoustic signal xL and the coefficient value g (f, m) of the processing coefficient sequence G (m) is the frequency component YL (f, m) of the acoustic signal yL. ) And the product of the frequency component XR (f, m) of the acoustic signal xR and the coefficient value g (f, m) is calculated as the frequency component YR (f, m) of the acoustic signal yR.

波形合成部３８は、信号処理部３６が生成した周波数スペクトルＹLおよび周波数スペクトルＹRから音響信号ｙLおよび音響信号ｙRを生成する。具体的には、波形合成部３８は、単位区間毎の周波数スペクトルＹLを時間領域の信号に変換するとともに前後の単位区間で相互に連結することで音響信号ｙLを生成する。同様に、波形合成部３８は、単位区間毎の周波数スペクトルＹRから音響信号ｙRを生成する。波形合成部３８が生成した音響信号ｙ（ｙL，ｙR）が放音装置２８に供給されて音波として再生される。 The waveform synthesis unit 38 generates the acoustic signal yL and the acoustic signal yR from the frequency spectrum YL and the frequency spectrum YR generated by the signal processing unit 36. Specifically, the waveform synthesizer 38 generates the acoustic signal yL by converting the frequency spectrum YL for each unit section into a signal in the time domain and connecting them in the front and back unit sections. Similarly, the waveform synthesizer 38 generates an acoustic signal yR from the frequency spectrum YR for each unit section. The acoustic signal y (yL, yR) generated by the waveform synthesizer 38 is supplied to the sound emitting device 28 and reproduced as a sound wave.

図１の表示制御部４０は、利用者が目標成分を指定するために参照する図２の編集画面４５を表示装置２６に表示させる。図２に示すように、編集画面４５は、周波数特性画像５０と定位画像６０と波形画像７０とを含んで構成される。 The display control unit 40 in FIG. 1 causes the display device 26 to display the editing screen 45 in FIG. 2 that is referred to by the user for designating the target component. As shown in FIG. 2, the editing screen 45 includes a frequency characteristic image 50, a localization image 60, and a waveform image 70.

周波数特性画像５０は、相互に交差する時間軸Ｔ1と周波数軸Ｆ1とが設定された領域Ａを含む。領域Ａには、音響信号ｘLのスペクトログラムＳL（階調や色彩で強度が表現された周波数スペクトルＸLの時系列）と音響信号ｘRのスペクトログラムＳRとが並列に配置される。また、時間軸Ｔ1上の任意の時点（以下「着目点」という）を示す指示子（カーソル）５４が領域Ａに配置される。なお、信号処理部３６による処理後の音響信号ｙ（ｙL，ｙR）のスペクトログラムを領域Ａに配置する構成や、音響信号ｘおよび音響信号ｙの各々のスペクトログラムを利用者からの指示に応じて切替えて領域Ａに配置する構成も採用され得る。 The frequency characteristic image 50 includes a region A in which a time axis T1 and a frequency axis F1 intersecting each other are set. In the region A, the spectrogram SL of the acoustic signal xL (the time series of the frequency spectrum XL in which the intensity is expressed by gradation and color) and the spectrogram SR of the acoustic signal xR are arranged in parallel. In addition, an indicator (cursor) 54 indicating an arbitrary time point on the time axis T1 (hereinafter referred to as “point of interest”) is arranged in the region A. The spectrogram of the acoustic signal y (yL, yR) after processing by the signal processing unit 36 is arranged in the region A, and the spectrograms of the acoustic signal x and the acoustic signal y are switched according to an instruction from the user. In this case, a configuration arranged in the region A can also be adopted.

図２の定位画像６０は、相互に交差する定位軸Ｌと周波数軸Ｆ2とが設定された領域Ｂを含む。定位軸Ｌは、音像の定位方向θを示す座標軸である。定位軸Ｌの原点（θ＝０）は受聴点の正面方向に相当する。また、定位軸Ｌの原点に対して正側の領域（θ＞０）は正面に対して右側の方向に相当し、負側の領域（θ＜０）は正面方向の左側の方向に相当する。 The localization image 60 of FIG. 2 includes a region B where a localization axis L and a frequency axis F2 intersecting each other are set. The localization axis L is a coordinate axis indicating the localization direction θ of the sound image. The origin (θ = 0) of the localization axis L corresponds to the front direction of the listening point. Further, the positive region (θ> 0) with respect to the origin of the localization axis L corresponds to the right side direction with respect to the front surface, and the negative region (θ <0) corresponds to the left side direction with respect to the front direction. .

定位画像６０の領域Ｂには複数の音像点ｑが配置される。定位軸Ｌ上の定位方向θと周波数軸Ｆ2上の周波数ｆとに対応する座標に位置する音像点ｑは、その定位方向θに定位する周波数ｆの周波数成分が音響信号ｘに存在することを意味する。 A plurality of sound image points q are arranged in the region B of the localization image 60. The sound image point q located at coordinates corresponding to the localization direction θ on the localization axis L and the frequency f on the frequency axis F2 indicates that the frequency component of the frequency f localized in the localization direction θ exists in the acoustic signal x. means.

表示制御部４０は、音響信号ｘの各周波数ｆの定位方向θを単位区間毎に算定する。定位方向θの算定には、例えば以下の数式(1)が好適に利用される。数式(1)の記号|ＸL(f,m)|は音響信号ｘLの周波数成分ＸL(f,m)の振幅を意味し、記号|ＸR(f,m)|は周波数成分ＸR(f,m)の振幅を意味する。音響信号ｘのうち周波数特性画像５０の指示子５４が指定する着目点に対応する１個の単位区間について数式(1)で特定された各音像点ｑが領域Ｂに配置される。なお、数式(1)の意義や導出については、例えば、M. Vinyes, J. Bonada, A. Loscos, "Demixing Commercial Music Productions wia Human-Assisted Time-Frequency Masking"，Audio Engineering Society 120th Convention, France, 2006から理解される。
θ＝２・arctan（|ＸL(f,m)|/|ＸR(f,m)|）・２/π−１ ……(1) The display control unit 40 calculates the localization direction θ of each frequency f of the acoustic signal x for each unit section. For the calculation of the localization direction θ, for example, the following formula (1) is preferably used. The symbol | XL (f, m) | in Equation (1) means the amplitude of the frequency component XL (f, m) of the acoustic signal xL, and the symbol | XR (f, m) | is the frequency component XR (f, m). ) Amplitude. In the acoustic signal x, each sound image point q specified by Expression (1) for one unit section corresponding to the point of interest specified by the indicator 54 of the frequency characteristic image 50 is arranged in the region B. For the significance and derivation of Equation (1), see, for example, M. Vinyes, J. Bonada, A. Loscos, "Demixing Commercial Music Productions wia Human-Assisted Time-Frequency Masking", Audio Engineering Society 120th Convention, France, As understood from 2006.
θ = 2 · arctan (| XL (f, m) | / | XR (f, m) |) · 2 // − 1 (1)

利用者は、入力装置２４を適宜に操作することで領域Ｂ内の任意の領域（以下「目標指定領域」という）６２を指定することが可能である。表示制御部４０は、図２に示すように、利用者から指示された目標指定領域６２を表示装置２６に表示させる。目標指定領域６２は、定位軸Ｌ上の任意の範囲（以下「対象定位域」という）ｂLと周波数軸Ｆ2上の任意の範囲（以下「対象帯域」という）ｂFとで画定される矩形領域である。利用者は、目標指定領域６２の対象定位域ｂLと対象帯域ｂFとを入力装置２４の操作で任意に変更することが可能である。音響信号ｘのうち目標指定領域６２の内側に位置する音像点ｑに対応する各周波数ｆの周波数成分が目標成分として抑圧または強調の対象となる。すなわち、対象帯域ｂF内の音響信号ｘの各周波数成分（ＸL(f,m)，ＸR(f,m)）のうち対象定位域ｂL内の定位方向θに定位する目標成分が抑圧または強調される。 The user can designate an arbitrary area 62 (hereinafter referred to as “target designation area”) 62 within the area B by appropriately operating the input device 24. As shown in FIG. 2, the display control unit 40 causes the display device 26 to display a target designation area 62 instructed by the user. The target designation area 62 is a rectangular area defined by an arbitrary range (hereinafter referred to as “target localization area”) bL on the localization axis L and an arbitrary range (hereinafter referred to as “target band”) bF on the frequency axis F2. is there. The user can arbitrarily change the target localization area bL and the target band bF of the target designation area 62 by operating the input device 24. The frequency component of each frequency f corresponding to the sound image point q located inside the target designation area 62 in the acoustic signal x is the target of suppression or enhancement as the target component. That is, among the frequency components (XL (f, m), XR (f, m)) of the acoustic signal x in the target band bF, the target component that is localized in the localization direction θ in the target localization area bL is suppressed or emphasized. The

利用者は、入力装置２４を適宜に操作することで目標指定領域６２について任意の増幅率αを指定することが可能である。表示制御部４０は、利用者が指定した増幅率α（図２の例示では４０ｄＢ）を目標指定領域６２に付加する。目標指定領域６２の増幅率αが負数に設定された場合には目標成分が抑圧され、増幅率αが正数に設定された場合には目標成分が強調される。 The user can designate an arbitrary amplification factor α for the target designation region 62 by appropriately operating the input device 24. The display control unit 40 adds the amplification factor α (40 dB in the example of FIG. 2) designated by the user to the target designation region 62. When the amplification factor α of the target designation area 62 is set to a negative number, the target component is suppressed, and when the amplification factor α is set to a positive number, the target component is emphasized.

目標指定領域６２が領域Ｂ内に設定されて入力装置２４に所定の操作（例えば適用ボタンの押下）が付与されると、表示制御部４０は、図２に示すように、その目標指定領域６２に対応する周波数/時間指定領域５２を周波数特性画像５０の領域Ａ内に配置する。周波数/時間指定領域５２は、時間軸Ｔ1上の特定の範囲（以下「対象期間」という）ａTと周波数軸Ｆ1上の対象帯域ａFとで画定される矩形領域であり、音響信号ｘLおよび音響信号ｘRの各々のスペクトログラムに重なるように配置される。周波数/時間指定領域５２の対象帯域ａFは、目標指定領域６２の対象帯域ｂFと共通の帯域に設定される。 When the target designation area 62 is set in the area B and a predetermined operation (for example, pressing of the apply button) is given to the input device 24, the display control unit 40, as shown in FIG. A frequency / time designating area 52 corresponding to is arranged in the area A of the frequency characteristic image 50. The frequency / time designation region 52 is a rectangular region defined by a specific range (hereinafter referred to as “target period”) aT on the time axis T1 and a target band aF on the frequency axis F1, and includes the acoustic signal xL and the acoustic signal. It is arranged so as to overlap each spectrogram of xR. The target band aF in the frequency / time designation area 52 is set to a band common to the target band bF in the target designation area 62.

利用者は、周波数/時間指定領域５２の対象帯域ａFを入力装置２４の操作で任意に変更することが可能である。表示制御部４０は、周波数/時間指定領域５２の対象帯域ａFおよび目標指定領域６２の対象帯域ｂFの一方に対する変更の指示に応じて対象帯域ａFおよび対象帯域ｂFの双方を相互に連動して変更し、対象帯域ａFと対象帯域ｂFとを共通の帯域に維持する。他方、周波数/時間指定領域５２の対象期間ａTは、定位画像６０の目標指定領域６２が新規に設定された時点では所定の時間長（例えば音響信号ｘの全区間）に設定され、入力装置２４に対する利用者からの指示に応じて任意に変更される。 The user can arbitrarily change the target band aF in the frequency / time designation area 52 by operating the input device 24. The display control unit 40 changes both the target band aF and the target band bF in conjunction with each other in response to an instruction to change one of the target band aF in the frequency / time specifying area 52 and the target band bF in the target specifying area 62. The target band aF and the target band bF are maintained in a common band. On the other hand, the target period aT of the frequency / time specifying area 52 is set to a predetermined time length (for example, all sections of the acoustic signal x) when the target specifying area 62 of the localization image 60 is newly set. It is arbitrarily changed according to instructions from the user.

図２の波形画像７０は、時間軸Ｔ2が設定された領域Ｃを含む。時間軸Ｔ2は、周波数特性画像５０の時間軸Ｔ1と共通の座標軸である。領域Ｃには、音響信号ｘLの波形ＷLおよび音響信号ｘRの波形ＷRと、時間軸Ｔ2上の着目点を示す指示子（カーソル）７４とが配置される。波形画像７０の指示子７４と周波数特性画像５０の指示子５４とは音響信号ｘの共通の着目点を指示する。なお、信号処理部３６による処理後の音響信号ｙ（ｙL，ｙR）の波形を領域Ｃに配置する構成や、音響信号ｘおよび音響信号ｙの各々の波形を利用者からの指示に応じて切替えて領域Ｃに配置する構成も採用され得る。 The waveform image 70 in FIG. 2 includes a region C in which the time axis T2 is set. The time axis T2 is a coordinate axis common to the time axis T1 of the frequency characteristic image 50. In the area C, a waveform WL of the acoustic signal xL, a waveform WR of the acoustic signal xR, and an indicator (cursor) 74 indicating a point of interest on the time axis T2 are arranged. The indicator 74 of the waveform image 70 and the indicator 54 of the frequency characteristic image 50 indicate a common point of interest of the acoustic signal x. It should be noted that the waveform of the acoustic signal y (yL, yR) after processing by the signal processing unit 36 is arranged in the region C, and each waveform of the acoustic signal x and the acoustic signal y is switched according to an instruction from the user. In this case, a configuration arranged in the region C can also be adopted.

領域Ｃには時間指定領域７２が設定される。時間指定領域７２は、時間軸Ｔ2上の対象期間ｃTにわたる矩形領域である。時間指定領域７２の対象期間ｃTと周波数/時間指定領域５２の対象期間ａTとは共通する。利用者は、入力装置２４に対する操作で時間指定領域７２の対象期間ｃTを任意に変更することが可能である。表示制御部４０は、周波数/時間指定領域５２の対象期間ａTおよび時間指定領域７２の対象期間ｃTの一方に対する変更の指示に応じて対象期間ａTおよび対象期間ｃTの双方を相互に連動して変更し、対象期間ａTと対象期間ｃTとを共通の期間に維持する。 In the area C, a time designation area 72 is set. The time designation area 72 is a rectangular area over the target period cT on the time axis T2. The target period cT in the time designation area 72 and the target period aT in the frequency / time designation area 52 are common. The user can arbitrarily change the target period cT of the time designation area 72 by operating the input device 24. The display control unit 40 changes both the target period aT and the target period cT in response to an instruction to change one of the target period aT in the frequency / time specifying area 52 and the target period cT in the time specifying area 72. The target period aT and the target period cT are maintained in a common period.

利用者が入力装置２４に対する操作で音響の再生を指示すると、係数列生成部３４による処理係数列Ｇ(m)の生成と信号処理部３６による音響信号ｙの生成とが順次に実行されて音響信号ｙに応じた音波が放音装置２８から再生される。周波数特性画像５０の指示子５４と波形画像７０の指示子７４とが示す着目点（再生ポイント）は再生の進行とともに時間軸Ｔ1および時間軸Ｔ2に沿って刻々と移動する。また、利用者は、入力装置２４を適宜に操作して指示子５４および指示子７４を移動させることで所望の時点を着目点として指定することが可能である。定位画像６０の領域Ｂには着目点の１個の単位区間に対応する複数の音像点ｑが表示され、着目点の移動とともに随時に更新される。 When the user instructs to reproduce sound by operating the input device 24, the generation of the processing coefficient sequence G (m) by the coefficient sequence generation unit 34 and the generation of the acoustic signal y by the signal processing unit 36 are sequentially executed to generate the sound. A sound wave corresponding to the signal y is reproduced from the sound emitting device 28. The point of interest (reproduction point) indicated by the indicator 54 of the frequency characteristic image 50 and the indicator 74 of the waveform image 70 moves along the time axis T1 and the time axis T2 as the reproduction proceeds. In addition, the user can designate a desired time point as a point of interest by appropriately operating the input device 24 and moving the indicator 54 and the indicator 74. In the region B of the localization image 60, a plurality of sound image points q corresponding to one unit section of the target point are displayed and updated as needed as the target point moves.

音響の再生中または停止中の任意の時点で、利用者は、定位画像６０の領域Ｂに目標指定領域６２を新規に設定し、または既存の目標指定領域６２を変更することが可能である。例えば、利用者は、着目点について定位画像６０に表示される各音像点ｑの分布や周波数特性画像６０に表示されるスペクトログラム（ＳL，ＳR）を参照するとともに放音装置２８からの再生音を聴取しながら、自身が抑圧または強調を希望する周波数ｆおよび定位方向θに対応する音像点ｑを内包するように目標指定領域６２を設定または変更し、音響信号ｘのうち目標成分を抑圧または強調したい期間を周波数/時間指定領域５２の対象期間ａTまたは時間指定領域７２の対象期間ｃTとして指定する。 The user can newly set the target designation area 62 in the area B of the localization image 60 or change the existing target designation area 62 at any time point during reproduction or stop of the sound. For example, the user refers to the distribution of each sound image point q displayed on the localization image 60 and the spectrogram (SL, SR) displayed on the frequency characteristic image 60 with respect to the point of interest, and the reproduced sound from the sound emitting device 28. While listening, the target designation area 62 is set or changed so as to include the sound image point q corresponding to the frequency f and the localization direction θ desired to be suppressed or enhanced, and the target component of the acoustic signal x is suppressed or enhanced. The desired period is specified as the target period aT in the frequency / time specifying area 52 or the target period cT in the time specifying area 72.

指示子５４および指示子７４の示す着目点が対象期間ａT（ｃT）の外側に存在する場合には定位画像６０の領域Ｂに目標指定領域６２は表示されず（各音像点ｑは表示される）、着目点が対象期間ａTの内側に位置する場合には、図２の例示の通り、定位画像６０の領域Ｂに各音像点ｑとともに目標指定領域６２が表示される。すなわち、着目点の移動（再生の進行）とともに目標指定領域６２の表示／非表示が刻々と変化する。 When the point of interest indicated by the indicator 54 and the indicator 74 exists outside the target period aT (cT), the target designation region 62 is not displayed in the region B of the localization image 60 (each sound image point q is displayed). When the point of interest is located inside the target period aT, the target designation area 62 is displayed together with each sound image point q in the area B of the localization image 60 as illustrated in FIG. That is, the display / non-display of the target designation area 62 changes every moment as the point of interest moves (progress of reproduction).

対象期間ａT（ｃT）の外側の各単位区間について、係数列生成部３４は、処理係数列Ｇ(m)の全部の係数値ｇ(f,m)を所定の数値γ1に設定する。数値γ1は、信号処理部３６による処理の前後で強度を変化させない数値（例えば１）に設定される。他方、対象期間ａT（ｃT）の内側の各単位区間について、係数列生成部３４は、音響信号ｘのうちその単位区間に対応する目標指定領域６２で指定された目標成分が増幅率αに応じて抑圧または強調されるように処理係数列Ｇ(m)の各係数値ｇ(f,m)を設定する。 For each unit section outside the target period aT (cT), the coefficient sequence generator 34 sets all coefficient values g (f, m) of the processing coefficient sequence G (m) to a predetermined numerical value γ1. The numerical value γ1 is set to a numerical value (for example, 1) that does not change the intensity before and after the processing by the signal processing unit 36. On the other hand, for each unit section inside the target period aT (cT), the coefficient sequence generation unit 34 determines that the target component specified in the target specifying area 62 corresponding to the unit section of the acoustic signal x corresponds to the amplification factor α. Each coefficient value g (f, m) of the processing coefficient sequence G (m) is set so as to be suppressed or emphasized.

例えば、増幅率αが負数（目標成分の抑圧）である場合、係数列生成部３４は、目標指定領域６２の外側の音像点ｑに対応する各周波数ｆの係数値ｇ(f,m)を前述の数値γ1に設定し、目標指定領域６２の内側の音像点ｑに対応する各周波数ｆ（すなわち目標成分の各周波数ｆ）の係数値ｇ(f,m)を数値γg1に設定する。増幅率αの絶対値が大きい（目標成分の抑圧の度合が大きい）ほど数値γg1が小さい数値となるように、数値γg1は１未満の正数の範囲内で増幅率αに応じて可変に設定される。したがって、音響信号ｘの目標成分を他の成分と比較して抑圧した音響信号ｙが信号処理部３６による処理で生成される。 For example, when the amplification factor α is a negative number (suppression of the target component), the coefficient sequence generator 34 calculates the coefficient value g (f, m) of each frequency f corresponding to the sound image point q outside the target designation area 62. The coefficient value g (f, m) of each frequency f (that is, each frequency f of the target component) corresponding to the sound image point q inside the target designation area 62 is set to the numerical value γg1. The numerical value γg1 is variably set within the range of positive numbers less than 1 according to the amplification factor α so that the larger the absolute value of the amplification factor α (the greater the degree of suppression of the target component), the smaller the numerical value γg1. Is done. Therefore, an acoustic signal y in which the target component of the acoustic signal x is suppressed by comparing with other components is generated by processing by the signal processing unit 36.

他方、増幅率αが正数（目標成分の強調）である場合、係数列生成部３４は、目標指定領域６２の外側の音像点ｑに対応する各周波数ｆの係数値ｇ(f,m)を数値γ0に設定し、目標指定領域６２の内側の音像点ｑに対応する各周波数ｆの係数値ｇ(f,m)を数値γg2に設定する。数値γ0は、信号処理部３６による処理で強度を低下させる数値（例えば０）に設定される。他方、数値γg2は、増幅率αの絶対値が大きい（目標成分の強調の度合が大きい）ほど大きい数値となるように１未満の正数の範囲内で増幅率αに応じて可変に設定される。したがって、音響信号ｘの目標成分を他の成分に対して強調した音響信号ｙが信号処理部３６による処理で生成される。 On the other hand, when the amplification factor α is a positive number (emphasis of the target component), the coefficient sequence generator 34 generates a coefficient value g (f, m) of each frequency f corresponding to the sound image point q outside the target designation area 62. Is set to the numerical value γ0, and the coefficient value g (f, m) of each frequency f corresponding to the sound image point q inside the target designation area 62 is set to the numerical value γg2. The numerical value γ0 is set to a numerical value (for example, 0) that decreases the strength by the processing by the signal processing unit 36. On the other hand, the numerical value γg2 is set variably in accordance with the amplification factor α within a positive number range of less than 1 so that the absolute value of the amplification factor α is larger (the degree of emphasis of the target component is larger). The Therefore, an acoustic signal y in which the target component of the acoustic signal x is emphasized with respect to other components is generated by processing by the signal processing unit 36.

なお、目標指定領域６２の外側の音像点ｑに対応する各係数値ｇ(f,m)を数値γ0と数値γ1との中間値（例えば０.５）に設定し、目標指定領域６２の内側の音像点ｑに対応する各係数値ｇ(f,m)を増幅率α（負数または正数）に応じて設定することで、音響信号ｘの目標成分を他の成分に対して抑圧または強調することも可能である。 Each coefficient value g (f, m) corresponding to the sound image point q outside the target designation area 62 is set to an intermediate value (for example, 0.5) between the numerical value γ0 and the numerical value γ1, and the inside of the target designation area 62 is set. By setting each coefficient value g (f, m) corresponding to the sound image point q in accordance with the amplification factor α (negative or positive), the target component of the acoustic signal x is suppressed or enhanced with respect to other components. It is also possible to do.

以上に説明した第１実施形態では、音響信号ｘのうち周波数特性画像５０の指示子５４や波形画像７０の指示子７４が示す着目点に対応する単位区間の各周波数成分の各音像点ｑが定位画像６０の領域Ｂに配置される。したがって、音響信号ｘの各周波数成分について定位方向θの時間的な変化を視覚的に容易に確認できるという利点がある。また、対象帯域ａFおよび対象期間ａTで規定される周波数/時間指定領域５２を含む周波数特性画像５０と、対象帯域ｂFおよび対象定位域ｂLで規定される目標指定領域６２を含む定位画像６０とが、表示装置２６に並列に表示される。したがって、音響信号ｘのうち抑圧または強調の対象となる目標成分について、周波数ｆの範囲（対象帯域ａFおよび対象帯域ｂF）と定位方向θの範囲（対象定位域ｂL）と時間範囲（対象期間ａT）との関係を利用者が容易に把握できるという利点もある。更に第１実施形態では、波形画像７０も周波数特性画像５０や定位画像６０とともに表示装置２６に表示されるから、目標成分の対象期間ｃTを音響信号ｘの波形（ＷL，ＷR）との関係で容易に把握できるという利点もある。 In the first embodiment described above, each sound image point q of each frequency component of the unit section corresponding to the point of interest indicated by the indicator 54 of the frequency characteristic image 50 and the indicator 74 of the waveform image 70 in the acoustic signal x is obtained. It is arranged in the region B of the localization image 60. Therefore, there is an advantage that a temporal change in the localization direction θ can be easily visually confirmed for each frequency component of the acoustic signal x. In addition, a frequency characteristic image 50 including a frequency / time designation region 52 defined by the target band aF and the target period aT, and a localization image 60 including a target designation region 62 defined by the target band bF and the target localization region bL. Are displayed in parallel on the display device 26. Accordingly, for the target component to be suppressed or emphasized in the acoustic signal x, the range of the frequency f (target band aF and target band bF), the range of the localization direction θ (target localization area bL), and the time range (target period aT). There is also an advantage that the user can easily grasp the relationship with Furthermore, in the first embodiment, since the waveform image 70 is also displayed on the display device 26 together with the frequency characteristic image 50 and the localization image 60, the target period cT of the target component is related to the waveform (WL, WR) of the acoustic signal x. There is also an advantage that it can be easily grasped.

＜Ｂ：第２実施形態＞
本発明の第２実施形態を以下に説明する。なお、以下に例示する各形態において作用や機能が第１実施形態と同等である要素については、以上の説明で参照した符号を流用して各々の詳細な説明を適宜に省略する。 <B: Second Embodiment>
A second embodiment of the present invention will be described below. In addition, about the element which an effect | action and a function are equivalent to 1st Embodiment in each form illustrated below, each reference detailed in the above description is diverted and each detailed description is abbreviate | omitted suitably.

図３は、第２実施形態の編集画面４５のうち周波数特性画像５０および定位画像６０の模式図である。なお、周波数特性画像５０のスペクトログラム（ＳL，ＳR）の図示を図３では便宜的に省略した。 FIG. 3 is a schematic diagram of the frequency characteristic image 50 and the localization image 60 in the editing screen 45 of the second embodiment. Note that the spectrograms (SL, SR) of the frequency characteristic image 50 are omitted in FIG. 3 for convenience.

図３に示すように、第２実施形態の表示制御部４０は、対象帯域ｂFまたは対象定位域ｂLが相違する複数の目標指定領域６２（６２１，６２２）を利用者からの指示に応じて定位画像６０の領域Ｂ内に設定する。利用者は、対象帯域ｂFと対象定位域ｂLと増幅率αとを目標指定領域６２１および目標指定領域６２２の各々について個別に変更することが可能である。目標指定領域６２１と目標指定領域６２２とは、領域Ｂ内に相互に離間して配置されるほか、図３の例示のように相互に重複する位置にも配置され得る。 As shown in FIG. 3, the display control unit 40 according to the second embodiment localizes a plurality of target designation areas 62 (621, 622) having different target bands bF or target localization areas bL according to instructions from the user. Set in area B of image 60. The user can individually change the target band bF, the target localization area bL, and the amplification factor α for each of the target designation area 621 and the target designation area 622. The target designation area 621 and the target designation area 622 can be arranged in the area B so as to be separated from each other, and can also be arranged at positions overlapping each other as illustrated in FIG.

図３に示すように、周波数特性画像５０の領域Ａ内には、目標指定領域６２１に対応する周波数/時間指定領域５２１と目標指定領域６２２に対応する周波数/時間指定領域５２２とが設定される。利用者は、対象帯域ａFおよび対象期間ａTを周波数/時間指定領域５２１および周波数/時間指定領域５２２の各々について個別に設定することが可能である。したがって、周波数/時間指定領域５２１と周波数/時間指定領域５２２とは、領域Ａ内に相互に離間して配置されるほか、図３の例示のように相互に重複する位置にも配置され得る。 As shown in FIG. 3, a frequency / time designation area 521 corresponding to the target designation area 621 and a frequency / time designation area 522 corresponding to the target designation area 622 are set in the area A of the frequency characteristic image 50. . The user can individually set the target band aF and the target period aT for each of the frequency / time designation area 521 and the frequency / time designation area 522. Therefore, the frequency / time designation area 521 and the frequency / time designation area 522 can be arranged apart from each other in the area A, and can also be arranged at positions overlapping each other as illustrated in FIG.

時間軸Ｔ1上で周波数/時間指定領域５２１のみが存在する時点ｔaを着目点として指示子５４および指示子７４が示す状態では、定位画像６０の領域Ｂには目標指定領域６２１のみが配置される。他方、周波数/時間指定領域５２１および周波数/時間指定領域５２２の双方が存在する時点ｔbが着目点として指示された状態では、図３に示すように、目標指定領域６２１および目標指定領域６２２の双方が定位画像６０の領域Ｂに配置される。また、着目点として時点ｔcが指示された状態では目標指定領域６２２のみが領域Ｂに配置される。 In a state where the indicator 54 and the indicator 74 indicate the time point ta at which only the frequency / time specifying region 521 exists on the time axis T1 as the point of interest, only the target specifying region 621 is arranged in the region B of the localization image 60. . On the other hand, in the state where the point in time tb in which both the frequency / time specifying area 521 and the frequency / time specifying area 522 exist is designated as the point of interest, both the target specifying area 621 and the target specifying area 622 are shown in FIG. Are arranged in the region B of the localization image 60. Further, only the target designation area 622 is arranged in the area B when the time point tc is designated as the point of interest.

音響信号ｘのうち目標指定領域６２１および目標指定領域６２２の双方が指定された単位区間について、係数列生成部３４は、目標指定領域６２１の処理係数列ＧA(m)と目標指定領域６２２の処理係数列ＧB(m)との各々を第１実施形態と同様の方法で生成し、処理係数列ＧA(m)と処理係数列ＧB(m)との合成（例えば相互に対応する係数値ｇ(f,m)の加算または乗算）で処理係数列Ｇ(m)を生成する。 For the unit section in which both the target designation area 621 and the target designation area 622 are designated in the acoustic signal x, the coefficient sequence generation unit 34 processes the processing coefficient series GA (m) of the target designation area 621 and the target designation area 622. Each of the coefficient sequences GB (m) is generated by the same method as in the first embodiment, and the processing coefficient sequence GA (m) and the processing coefficient sequence GB (m) are combined (for example, coefficient values g ( A processing coefficient sequence G (m) is generated by addition or multiplication of f, m).

第２実施形態においても第１実施形態と同様の効果が実現される。また、第２実施形態では、複数の目標指定領域６２（６２１，６２２）が定位画像６０の領域Ｂに配置されるとともに各目標指定領域６２に対応する複数の周波数/時間指定領域５２（５２１，５２２）が周波数特性画像５０の領域Ａに配置される。したがって、周波数ｆ（対象帯域ａF）と発生期間（対象期間ａT）と定位方向（対象定位域ｂL）とが相違する複数種の目標成分を容易に抑圧または強調の対象として指定することが可能である。 In the second embodiment, the same effect as in the first embodiment is realized. In the second embodiment, a plurality of target designation areas 62 (621, 622) are arranged in the area B of the localization image 60, and a plurality of frequency / time designation areas 52 (521, 521) corresponding to the target designation areas 62 are provided. 522) is arranged in the region A of the frequency characteristic image 50. Therefore, a plurality of types of target components having different frequencies f (target band aF), generation period (target period aT), and localization direction (target localization area bL) can be easily specified as suppression or enhancement targets. is there.

＜Ｃ：第３実施形態＞
図４は、第３実施形態の編集画面４５のうち周波数特性画像５０および定位画像６０の模式図である。なお、図３と同様に、周波数特性画像５０のスペクトログラム（ＳL，ＳR）の図示は図４でも省略されている。 <C: Third Embodiment>
FIG. 4 is a schematic diagram of the frequency characteristic image 50 and the localization image 60 in the editing screen 45 of the third embodiment. As in FIG. 3, the spectrogram (SL, SR) of the frequency characteristic image 50 is not shown in FIG.

第３実施形態の表示制御部４０は、定位画像６０の１個の目標指定領域６２に対応する複数の周波数/時間指定領域５２（５２A，５２B）を利用者からの指示に応じて周波数特性画像５０の領域Ａ内に配置する。利用者は、周波数/時間指定領域５２Aと周波数/時間指定領域５２Bとの各々について対象期間ａTを個別に設定することが可能である。したがって、周波数/時間指定領域５２Aと周波数/時間指定領域５２Bとは時間軸Ｔ1上で相異なる時点（相互に間隔をあけた位置）に配置され得る。他方、周波数軸Ｆ1上の対象帯域ａFは周波数/時間指定領域５２Aと周波数/時間指定領域５２Bとで共通する。 The display control unit 40 of the third embodiment displays a plurality of frequency / time designation areas 52 (52A, 52B) corresponding to one target designation area 62 of the localization image 60 in accordance with an instruction from the user. 50 regions A are arranged. The user can individually set the target period aT for each of the frequency / time specifying area 52A and the frequency / time specifying area 52B. Therefore, the frequency / time designation area 52A and the frequency / time designation area 52B can be arranged at different time points (positions spaced from each other) on the time axis T1. On the other hand, the target band aF on the frequency axis F1 is common to the frequency / time designation area 52A and the frequency / time designation area 52B.

第３実施形態においても第１実施形態と同様の効果が実現される。また、第３実施形態では、１個の目標指定領域６２に対応する複数の周波数/時間指定領域５２（５２A，５２B）が周波数特性画像５０の領域Ａに配置されるから、例えば所定の帯域内の周波数ｆで間欠的に発生する目標成分（例えば間奏をあけて発生する歌唱音）を適切に指定することが可能である。 In the third embodiment, the same effect as in the first embodiment is realized. In the third embodiment, since a plurality of frequency / time designation areas 52 (52A, 52B) corresponding to one target designation area 62 are arranged in the area A of the frequency characteristic image 50, for example, within a predetermined band It is possible to appropriately specify a target component (for example, a singing sound generated with an interval) intermittently generated at a frequency f.

なお、第２実施形態のように領域Ｂ内に複数の目標指定領域６２（６２１，６２２）が指定され得る構成に第３実施形態を適用することも可能である。例えば、目標指定領域６２１に対応する複数の周波数/時間指定領域５２と目標指定領域６２２に対応する複数の周波数/時間指定領域５２とを、相異なる表示態様（例えば色彩）で領域Ａ内に配置する構成が好適である。 Note that the third embodiment can be applied to a configuration in which a plurality of target designation areas 62 (621, 622) can be designated in the area B as in the second embodiment. For example, a plurality of frequency / time designation areas 52 corresponding to the target designation area 621 and a plurality of frequency / time designation areas 52 corresponding to the target designation area 622 are arranged in the area A in different display modes (for example, colors). The structure which does is suitable.

＜Ｄ：第４実施形態＞
図５は、第４実施形態の編集画面４５のうち周波数特性画像５０および定位画像６０の模式図である。第４実施形態の表示制御部４０は、周波数特性画像５０の領域Ａ内に配置される音響信号ｘLのスペクトログラムＳLおよび音響信号ｘRのスペクトログラムＳRの各々の表示態様（以下の例示では階調）を、定位画像６０における目標指定領域６２の定位軸Ｌ上の位置（すなわち目標成分の定位方向θ）に応じて変化させる。 <D: Fourth Embodiment>
FIG. 5 is a schematic diagram of the frequency characteristic image 50 and the localization image 60 in the editing screen 45 of the fourth embodiment. The display control unit 40 according to the fourth embodiment displays each display mode (gradation in the following example) of the spectrogram SL of the acoustic signal xL and the spectrogram SR of the acoustic signal xR arranged in the region A of the frequency characteristic image 50. The position is changed according to the position of the target designation region 62 on the localization axis L in the localization image 60 (that is, the localization direction θ of the target component).

図５の部分(a)に示すように、定位軸Ｌ上の原点に対して正側（右方向）に目標指定領域６２が設定された場合（すなわち、右方向に定位する成分が目標成分として指定された場合）、表示制御部４０は、領域Ａ内のスペクトログラムＳRをスペクトログラムＳLと比較して低階調で表示する。目標指定領域６２の重心（図心）が原点から右方向に離間するほどスペクトログラムＳRの階調はスペクトログラムＳLと比較して低下する。同様に、図５の部分(b)に示すように、目標指定領域６２の重心が原点から負側（左方向）に離間するほど、スペクトログラムＳLはスペクトログラムＳRと比較して低階調で表示される。目標指定領域６２の重心が定位軸Ｌ上の原点に位置する場合、スペクトログラムＳLとスペクトログラムＳRとは相等しい階調で表示される。 As shown in part (a) of FIG. 5, when the target designation region 62 is set on the positive side (right direction) with respect to the origin on the localization axis L (that is, the component localized in the right direction is used as the target component). When designated), the display control unit 40 displays the spectrogram SR in the region A in a low gradation compared with the spectrogram SL. As the center of gravity (centroid) of the target designation area 62 is separated from the origin in the right direction, the gradation of the spectrogram SR decreases as compared with the spectrogram SL. Similarly, as shown in part (b) of FIG. 5, the spectrogram SL is displayed at a lower gradation as compared to the spectrogram SR as the center of gravity of the target designation area 62 is further away from the origin toward the negative side (leftward). The When the center of gravity of the target designation area 62 is located at the origin on the localization axis L, the spectrogram SL and the spectrogram SR are displayed with the same gradation.

第４実施形態においても第１実施形態と同様の効果が実現される。また、第４実施形態では、スペクトログラムＳRおよびスペクトログラムＳLの各々の表示態様が目標指定領域６２の定位軸Ｌ上の位置に応じて変化するから、目標成分の定位方向を利用者が周波数特性画像５０から直感的に把握できるという利点がある。 In the fourth embodiment, the same effect as in the first embodiment is realized. Further, in the fourth embodiment, since the display modes of the spectrogram SR and the spectrogram SL change according to the position on the localization axis L of the target designation region 62, the user can specify the localization direction of the target component in the frequency characteristic image 50. There is an advantage that it can be grasped intuitively.

なお、以上の説明では、目標指定領域６２の指定に並行して随時にスペクトログラムＳRおよびスペクトログラムＳLの表示態様を変更したが、例えば目標指定領域６２の指定後に入力装置２４に対して所定の操作（例えば確定ボタンの押下）が付与された場合にスペクトログラムＳRおよびスペクトログラムＳLの表示態様を変更する構成も採用され得る。 In the above description, the display mode of the spectrogram SR and the spectrogram SL is changed at any time in parallel with the designation of the target designation area 62. For example, after the designation of the target designation area 62, a predetermined operation ( For example, a configuration that changes the display mode of the spectrogram SR and the spectrogram SL when the confirmation button is pressed) may be employed.

また、以上の説明では、スペクトログラムＳRおよびスペクトログラムＳLの各々の全体的な表示態様を目標指定領域６２の位置に応じて相違させたが、スペクトログラムＳRやスペクトログラムＳLの部分的な表示態様を相違させる構成も採用される。例えば、目標指定領域６２に対応する周波数/時間指定領域５２の範囲内の表示態様のみを目標指定領域６２の位置に応じてスペクトログラムＳRおよびスペクトログラムＳLとで相違させ、他の領域の表示態様は共通させることも可能である。 In the above description, the overall display modes of the spectrogram SR and the spectrogram SL are made different according to the position of the target designation area 62. However, the partial display modes of the spectrogram SR and the spectrogram SL are made different. Is also adopted. For example, only the display mode within the range of the frequency / time specification region 52 corresponding to the target specification region 62 is made different between the spectrogram SR and the spectrogram SL according to the position of the target specification region 62, and the display mode of other regions is common. It is also possible to make it.

＜Ｅ：第５実施形態＞
図６は、第５実施形態における定位画像６０の模式図である。図６に示すように、第６実施形態の表示制御部４０が表示装置２６に表示させる定位画像６０は、音響信号ｘの相異なる時点（単位区間）に対応する複数の単位画像Ｕが、周波数軸Ｆ2と定位軸Ｌとに交差する時間軸Ｔ0に沿って立体的に配列された画像である。音響信号ｘの各時点に対応する１個の単位画像Ｕには、音響信号ｘのうちその時点の単位区間に対応する複数の音像点ｑと目標指定領域６２とが配置される。複数の単位画像Ｕのうち指示子５４および指示子７４が示す着目点に対応する１個の単位画像Ｕが最前面に表示されるように各単位画像Ｕの時間軸Ｔ0上の位置が順次に変更される。なお、各単位画像Ｕを透過表示することも可能である。 <E: Fifth Embodiment>
FIG. 6 is a schematic diagram of a localization image 60 in the fifth embodiment. As shown in FIG. 6, the localization image 60 displayed on the display device 26 by the display control unit 40 according to the sixth embodiment includes a plurality of unit images U corresponding to different time points (unit sections) of the acoustic signal x. It is an image arranged three-dimensionally along a time axis T0 intersecting the axis F2 and the localization axis L. In one unit image U corresponding to each time point of the acoustic signal x, a plurality of sound image points q and target designation areas 62 corresponding to the unit section at that time point in the acoustic signal x are arranged. The position of each unit image U on the time axis T0 is sequentially arranged so that one unit image U corresponding to the point of interest indicated by the indicator 54 and the indicator 74 among the plurality of unit images U is displayed in the foreground. Be changed. Each unit image U can be displayed in a transparent manner.

第５実施形態においても第１実施形態と同様の効果が実現される。また、第５実施形態では、音響信号ｘの相異なる時点での音像点ｑを配置した複数の単位画像Ｕが時間軸Ｔ0に沿って立体的に配置されるから、音像点ｑの分布と目標指定領域６２との関係の時間的な遷移を定位画像６０から容易に把握できるという利点がある。 In the fifth embodiment, the same effect as in the first embodiment is realized. In the fifth embodiment, since the plurality of unit images U in which the sound image points q at different times of the acoustic signal x are disposed are arranged three-dimensionally along the time axis T0, the distribution of the sound image points q and the target There is an advantage that the temporal transition of the relationship with the designated area 62 can be easily grasped from the localization image 60.

＜Ｆ：第６実施形態＞
図７は、第６実施形態における周波数特性画像５０および定位画像６０の部分的な模式図である。図７に示すように、周波数軸Ｆ2上の複数の周波数ｆは、目標指定領域６２の内側の音像点ｑに対応する周波数ｆaと、目標指定領域６２の外側の音像点ｑに対応する周波数ｆbとに区分される。目標指定領域６２の対象帯域ｂFの外側の周波数ｆが周波数ｆaに選別されるほか、対象帯域ｂFの内側の周波数ｆでも、対象定位域ｂLの外側に位置する音像点ｑに対応する周波数ｆは周波数ｆbに選別される。 <F: Sixth Embodiment>
FIG. 7 is a partial schematic diagram of the frequency characteristic image 50 and the localization image 60 in the sixth embodiment. As shown in FIG. 7, a plurality of frequencies f on the frequency axis F 2 are a frequency fa corresponding to the sound image point q inside the target designation area 62 and a frequency fb corresponding to the sound image point q outside the target designation area 62. It is divided into and. In addition to selecting the frequency f outside the target band bF in the target designation area 62 as the frequency fa, the frequency f corresponding to the sound image point q located outside the target localization area bL is also the frequency f inside the target band bF. The frequency fb is selected.

図７には、周波数特性画像５０のスペクトログラムＳLのうち音響信号ｘの１個の単位区間に対応する周波数スペクトルＸLの各周波数成分ＸL(f,m)が模式的に図示されている。図７に示すように、表示制御部４０は、周波数スペクトルＸLのうち音像点ｑが目標指定領域６２の内側に位置する各周波数ｆaの周波数成分ＸL(fa,m)と、音像点ｑが目標指定領域６２の外側に位置する各周波数ｆbの周波数成分ＸL(fb,m)とを、相異なる表示態様（例えば階調）で表示する。図７では、周波数ｆaの周波数成分ＸL(fa,m)を黒色（低階調）で表示し、周波数ｆbの周波数成分ＸL(fb,m)を白色（高階調）で表示した場合が想定されている。なお、図７ではスペクトログラムＳL（周波数スペクトルＸL）のみを図示したが、スペクトログラムＳRについても同様に表示される。 FIG. 7 schematically shows each frequency component XL (f, m) of the frequency spectrum XL corresponding to one unit section of the acoustic signal x in the spectrogram SL of the frequency characteristic image 50. As shown in FIG. 7, the display control unit 40 sets the frequency component XL (fa, m) of each frequency fa in the frequency spectrum XL where the sound image point q is located inside the target designation region 62 and the sound image point q as the target. The frequency components XL (fb, m) of the respective frequencies fb located outside the designated area 62 are displayed in different display modes (for example, gradation). In FIG. 7, it is assumed that the frequency component XL (fa, m) of the frequency fa is displayed in black (low gradation) and the frequency component XL (fb, m) of the frequency fb is displayed in white (high gradation). ing. Although only the spectrogram SL (frequency spectrum XL) is shown in FIG. 7, the spectrogram SR is also displayed in the same manner.

第６実施形態においても第１実施形態と同様の効果が実現される。また、第６実施形態では、目標指定領域６２の内側の音像点ｑに対応する周波数成分ＸL(fa,m)と、目標指定領域６２の外側の音像点ｑに対応する周波数成分ＸL(fb,m)とが相異なる態様で周波数特性画像５０に表示されるから、目標成分として抑圧または強調の対象となる周波数成分ＸL(f,m)を利用者が周波数特性画像５０から容易に把握できるという利点がある。 In the sixth embodiment, the same effect as in the first embodiment is realized. In the sixth embodiment, the frequency component XL (fa, m) corresponding to the sound image point q inside the target designation region 62 and the frequency component XL (fb, f) corresponding to the sound image point q outside the target designation region 62 are used. m) is displayed on the frequency characteristic image 50 in a different manner from that of the frequency characteristic image 50, so that the user can easily grasp the frequency component XL (f, m) to be suppressed or emphasized as the target component from the frequency characteristic image 50. There are advantages.

なお、以上の例示では周波数ｆaの周波数成分ＸL(fa,m)と周波数ｆbの周波数成分ＸL(fb,m)とで階調を相違させたが、周波数成分ＸL(fa,m)と周波数成分ＸL(fb,m)とを視覚的に区別する方法は任意である。例えば、周波数成分ＸL(fa,m)と周波数成分ＸL(fb,m)とで色彩を相違させた構成や、周波数成分ＸL(fa,m)および周波数成分ＸL(fb,m)の一方の表示を点滅させることも可能である。 In the above example, the gradation is different between the frequency component XL (fa, m) of the frequency fa and the frequency component XL (fb, m) of the frequency fb, but the frequency component XL (fa, m) and the frequency component are different. A method of visually distinguishing XL (fb, m) is arbitrary. For example, the frequency component XL (fa, m) and the frequency component XL (fb, m) have different colors, or one of the frequency component XL (fa, m) and the frequency component XL (fb, m) is displayed. It is also possible to blink.

＜Ｇ：変形例＞
以上の形態には様々な変形が加えられる。具体的な変形の態様を以下に例示する。以下の例示から任意に選択された２以上の態様は併合され得る。 <G: Modification>
Various modifications are added to the above embodiment. Specific modifications are exemplified below. Two or more aspects arbitrarily selected from the following examples may be merged.

（１）前述の各形態では、音響信号ｘLのスペクトログラムＳLと音響信号ｘRのスペクトログラムＳRとを領域Ａに配置したが、音響信号ｘLと音響信号ｘRとの混合信号（モノラル信号）のスペクトログラムや音響信号ｘLおよび音響信号ｘRの一方のスペクトログラムを領域Ａに配置することも可能である。波形画像７０についても同様であり、音響信号ｘLと音響信号ｘRの混合信号の波形や音響信号ｘLおよび音響信号ｘRの一方の波形を領域Ｃに配置した構成も採用される。また、波形画像７０を省略した構成も採用される。 (1) In each of the above-described embodiments, the spectrogram SL of the acoustic signal xL and the spectrogram SR of the acoustic signal xR are arranged in the region A. However, the spectrogram or acoustic of the mixed signal (monaural signal) of the acoustic signal xL and the acoustic signal xR is used. It is also possible to place one spectrogram of the signal xL and the acoustic signal xR in the region A. The same applies to the waveform image 70, and a configuration in which the waveform of the mixed signal of the acoustic signal xL and the acoustic signal xR or one of the acoustic signal xL and the acoustic signal xR is arranged in the region C is also employed. Moreover, the structure which abbreviate | omitted the waveform image 70 is also employ | adopted.

（２）前述の各形態では、音響信号ｘの１個の単位区間内の各周波数成分に対応する音像点ｑを領域Ｂに配置したが、音響信号ｘの複数の単位区間にわたる各周波数成分の音像点ｑを領域Ｂに配置することも可能である。例えば、複数の単位区間にわたる定位方向θの平均値や加算値が周波数ｆ毎に算定され、各数値に対応する音像点ｑが領域Ｂに配置される。 (2) In each of the above-described embodiments, the sound image point q corresponding to each frequency component in one unit section of the acoustic signal x is arranged in the region B, but each frequency component over a plurality of unit sections of the acoustic signal x It is also possible to arrange the sound image point q in the region B. For example, an average value and an added value of the localization direction θ over a plurality of unit sections are calculated for each frequency f, and a sound image point q corresponding to each numerical value is arranged in the region B.

（３）前述の各形態では、領域Ａと領域Ｂと領域Ｃとを並列に表示したが、各領域を選択的に表示することも可能である。例えば、表示制御部４０は、領域Ａと領域Ｂと領域Ｃとのうち利用者が入力装置２４の操作で選択した少なくともひとつの領域を表示装置２６に表示させる。 (3) In each of the above-described embodiments, the area A, the area B, and the area C are displayed in parallel, but each area can be selectively displayed. For example, the display control unit 40 causes the display device 26 to display at least one region selected by the user by operating the input device 24 among the regions A, B, and C.

（４）周波数/時間指定領域５２や目標指定領域６２や時間指定領域７２の形状は任意であり、前述の各形態に例示した矩形状には限定されない。 (4) The shapes of the frequency / time designation area 52, the target designation area 62, and the time designation area 72 are arbitrary, and are not limited to the rectangular shapes exemplified in the above embodiments.

（５）第１実施形態ないし第６実施形態のなかから任意に選択された２以上の形態を併合することも可能である。 (5) Two or more modes arbitrarily selected from the first to sixth embodiments may be merged.

１００……音響処理装置、１２……演算処理装置、１４……記憶装置、２２……信号供給装置、２４……入力装置、２６……表示装置、２８……放音装置、３２……周波数分析部、３４……係数列生成部、３６……信号処理部、３８……波形合成部、４０……表示制御部、４５……編集画面、５０……周波数特性画像、５２……周波数/時間指定領域、５４……指示子、６０……定位画像、６２……目標指定領域、７０……波形画像、７２……時間指定領域、７４……指示子。
DESCRIPTION OF SYMBOLS 100 ... Sound processing device, 12 ... Arithmetic processing device, 14 ... Memory | storage device, 22 ... Signal supply device, 24 ... Input device, 26 ... Display device, 28 ... Sound emission device, 32 ... Frequency Analysis unit 34... Coefficient sequence generation unit 36... Signal processing unit 38... Waveform synthesis unit 40... Display control unit 45 45 Edit screen 50. Time designation area, 54... Indicator, 60... Localization image, 62... Target designation area, 70.

Claims

A reception means for receiving instructions from the user;
A frequency characteristic image in which a spectrogram of an acoustic signal is arranged in a first area in which a first frequency axis and a first time axis are set, and a second area in which a second frequency axis and a localization axis indicating a localization direction are set Means for displaying a localization image in which sound image points of each frequency component corresponding to the point of interest on the first time axis in the acoustic signal are displayed in parallel on a display device, the first frequency axis A frequency / time designation area defined by an upper target band and a target period on the first time axis is arranged in the first area in response to an instruction to the receiving means, and the acoustic signal within the target period Display control means for arranging a target designation area defined by the target band on the second frequency axis and the target localization area on the localization axis in the second area in response to an instruction to the reception means;
A sound processing apparatus comprising: a signal processing unit that suppresses or emphasizes each frequency component indicated by a sound image point in the target designated region of the sound signal within the target period.

The display control means arranges a plurality of target designation areas having different target bands or target localization areas in the second area in accordance with an instruction to the reception means, and each target designation area is arranged in each target designation area. The sound processing device according to claim 1, wherein a plurality of corresponding frequency / time designation regions are arranged so as to overlap in the first region.

3. The sound according to claim 1, wherein the display control unit sets a plurality of the frequency / time designation areas corresponding to the target designation area at different time points on the first time axis in the first area. Processing equipment.

The display control means arranges spectrograms of each of the first channel signal and the second channel signal constituting the acoustic signal in the first region, and the display control unit performs the spectrogram according to the position of the target designation region on the localization axis. The acoustic processing device according to any one of claims 1 to 3, wherein the spectrogram of the first channel signal and the spectrogram of the second channel signal are changed to different display modes.

The display control means includes a waveform image in which a waveform of an acoustic signal before or after processing by the signal processing means is arranged in a third region in which a second time axis is set, together with the frequency characteristic image and the localization image. The sound processing device according to claim 1, wherein the sound processing device is displayed on the display device.