JP2006139158A - Sound signal synthesizer and synthesizing/reproducing apparatus

Info

Publication number: JP2006139158A
Application number: JP2004329918A
Authority: JP
Inventors: Toshio Motegi; 敏雄茂出木
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2004-11-15
Filing date: 2004-11-15
Publication date: 2006-06-01

Abstract

PROBLEM TO BE SOLVED: To provide a sound signal synthesizer that, when synthesizing a plurality of sound signals, can create a novel sound signal while retaining the influence of the sound signals before synthesis, and a sound signal synthesizing/reproducing apparatus that can reproduce the sound signal at the same time as synthesizing it.

SOLUTION: A block reading means 10 reads the plurality of sound signals from a sound signal memory section 61 in units of sound blocks, each consisting of a prescribed number of samples, and a frequency conversion means 20 applies a Fourier transform to the sound blocks read from the respective sound signals to obtain spectral blocks. A block synthesizing means 30 generates a synthesized spectral block by synthesizing the spectral blocks obtained from the respective sound signals, and a synthesized sound block is generated from it by inverse Fourier transform. The generated synthesized sound blocks are sequentially output by an output means 50, and the synthesized sound signal is recorded in a synthesized sound signal memory section 62.

COPYRIGHT: (C)2006,JPO&NCIPI

Description

The present invention relates to techniques for synthesizing and reproducing acoustic signals, suitable for the field of packaged music playback for listening in consumer and professional use with CDs, DVDs, and the like, and for the field of background music (BGM) distributed for commercial purposes by broadcasters, operators of public facilities, and the like.

Conventionally, in general music production, a plurality of acoustic signals are often superimposed on one another as waveforms (see, for example, Patent Documents 1 and 2).
Patent Document 1: Japanese Patent No. 2671825
Patent Document 2: Japanese Patent No. 3092592

However, with the conventional method, the acoustic signal obtained by synthesis is merely the sum of the original acoustic signals. That is, when acoustic signals played on different instruments are synthesized, the result sounds like the two instruments playing together, and no new music is created.

Accordingly, an object of the present invention is to provide an acoustic signal synthesizing apparatus that, when a plurality of acoustic signals are synthesized, can create a novel acoustic signal while retaining the influence of the acoustic signals before synthesis, and an acoustic signal synthesizing and reproducing apparatus that can reproduce the signal at the same time as synthesizing it.

To solve the above problem, the present invention provides an apparatus that synthesizes a plurality of acoustic signals, each composed of a time-series sample sequence, to generate a synthesized acoustic signal, the apparatus comprising: block reading means for reading a predetermined number of samples from each of the plurality of acoustic signals as an acoustic block; frequency conversion means for performing a Fourier transform on each read acoustic block to generate a spectrum block; block synthesizing means for synthesizing the spectrum blocks by multiplying mutually corresponding components of the spectrum blocks obtained from the respective acoustic signals, thereby generating a synthesized spectrum block; frequency inverse conversion means for performing an inverse Fourier transform on the generated synthesized spectrum block to generate a synthesized acoustic block; and output means for outputting the generated synthesized acoustic blocks in time-series order.

According to the present invention, each of a plurality of acoustic signals is frequency-analyzed in predetermined units, the resulting frequency components are synthesized with each other, and the synthesized frequency components are converted back into an acoustic signal by inverse frequency conversion. As a result, the synthesized acoustic signal is novel while retaining the influence of each acoustic signal before synthesis.

Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
(1. Configuration of the synthesizer)
FIG. 1 is a functional block diagram showing the configuration of an acoustic signal synthesizer according to the present invention. In FIG. 1, 10 is a block reading means, 20 is a frequency conversion means, 30 is a block synthesizing means, 40 is a frequency inverse conversion means, 50 is an output means, 60 is a storage means, 61 is an acoustic signal storage unit, and 62 is a synthesized acoustic signal storage unit.

The block reading means 10 reads a predetermined number of samples as one block from each original acoustic signal to be synthesized. The frequency conversion means 20 applies a Fourier transform to each block read by the block reading means 10 to generate a spectrum block. The block synthesizing means 30 synthesizes the spectrum blocks obtained for the respective acoustic signals into one synthesized spectrum block. The frequency inverse conversion means 40 applies an inverse Fourier transform to the synthesized spectrum block to generate one synthesized acoustic block. The output means 50 sequentially outputs the synthesized acoustic blocks. The storage means 60 includes the acoustic signal storage unit 61, which stores the plurality of acoustic signals to be synthesized, and the synthesized acoustic signal storage unit 62, which stores the synthesized acoustic signal, and also stores other information needed for processing. In practice, each means shown in FIG. 1 is realized by installing a dedicated program on hardware such as a computer and its peripheral devices; that is, the computer executes the function of each means in accordance with the dedicated program.

(2.1. Processing operation of the synthesizer)
Next, the processing operation of the acoustic signal synthesizer shown in FIG. 1 will be described. The present invention synthesizes two or more acoustic signals; the following describes the simplest case, in which two acoustic signals are synthesized. First, the block reading means 10 reads a predetermined number of samples as one acoustic block from each of two externally designated acoustic signals. When having the apparatus read the acoustic signals, the operator designates one of them as the reference acoustic signal. The number of samples per acoustic block can be set as appropriate, but about 4096 samples is desirable at a sampling frequency of 44.1 kHz. The block reading means 10 therefore reads the reference acoustic signal P and the acoustic signal Q sequentially as acoustic blocks of 4096 samples each. Acoustic blocks are read so that their samples overlap with those of adjacent blocks. For example, if the first acoustic block covers sample numbers 1 to 4096, the second covers 2049 to 6144 and the third covers 4097 to 8192, so that adjacent acoustic blocks overlap by 2048 samples. The sections are overlapped in this way to prevent noise from occurring at block boundaries. So that the signal level of the overlapping samples does not become discontinuous after synthesis, each block is multiplied by a window function when it is Fourier transformed, as described later.
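The half-overlapping read pattern described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the apparatus itself: the function name `block_ranges` is hypothetical, sample numbers are 1-based to match the text, and a trailing partial block is simply dropped because the text does not say how it is handled.

```python
def block_ranges(total_samples, block_size=4096):
    """Return 1-based (first, last) sample numbers of half-overlapping blocks."""
    hop = block_size // 2  # adjacent blocks share block_size / 2 samples
    ranges = []
    first = 1
    # Assumption: only complete blocks are emitted; a trailing partial block is skipped.
    while first + block_size - 1 <= total_samples:
        ranges.append((first, first + block_size - 1))
        first += hop
    return ranges


print(block_ranges(8192))  # [(1, 4096), (2049, 6144), (4097, 8192)]
```

With 8192 samples and 4096-sample blocks, the sketch reproduces the three sample ranges given in the example above.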

Subsequently, the frequency conversion means 20 applies a Fourier transform to each read acoustic block to obtain a spectrum block. Specifically, the acoustic signal x(i) is processed according to [Formula 1] below to obtain the real part A(j) and imaginary part B(j) of the transformed data.

[Formula 1]
$$A(j) = \sum_{i=0}^{N-1} x(i) \cos(2\pi i j / N)$$
$$B(j) = \sum_{i=0}^{N-1} x(i) \sin(2\pi i j / N)$$

In [Formula 1], i is a serial number assigned to the N samples in the acoustic block and takes the integer values i = 0, 1, 2, …, N−1. Likewise, j is a serial number assigned to the frequency values in ascending order and takes the integer values j = 0, 1, 2, …, N−1. Before the transform, the acoustic signal x(i) is multiplied by a window function (Hanning window) W(i) = 0.5 − 0.5·cos(2πi/N) as a weight. Such a window function is a well-known technique, used to reduce the high-frequency noise that arises from cutting the waveform into sections for frequency analysis, and to join the analysis sections after the inverse Fourier transform without discontinuities in signal level.
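As an illustration of [Formula 1] and the window weighting, the following sketch evaluates the sums directly. It is a hedged example rather than the patent's implementation: the names `hann` and `spectrum_block` are hypothetical, and the direct O(N²) summation is used only for clarity (a practical system would presumably use a fast Fourier transform, as the later reference to powers of 2 suggests).

```python
import math


def hann(n):
    """Hanning window W(i) = 0.5 - 0.5*cos(2*pi*i/N) for i = 0..N-1."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def spectrum_block(x, use_window=True):
    """Return (A, B) of [Formula 1] for one acoustic block x (direct summation)."""
    n = len(x)
    w = hann(n) if use_window else [1.0] * n
    xw = [xi * wi for xi, wi in zip(x, w)]  # window applied as a weight
    a = [sum(xw[i] * math.cos(2 * math.pi * i * j / n) for i in range(n))
         for j in range(n)]
    b = [sum(xw[i] * math.sin(2 * math.pi * i * j / n) for i in range(n))
         for j in range(n)]
    return a, b


a, b = spectrum_block([1.0, 0.0, 0.0, 0.0], use_window=False)
print(a, b)
```

The `use_window=False` switch is an addition for testing only; the text always applies the window.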

Executing the processing of [Formula 1] yields a spectrum block expressed as a spectrum, that is, as components corresponding to frequencies. Next, the block synthesizing means 30 synthesizes the spectrum block obtained from the reference acoustic signal P with the spectrum block obtained from the acoustic signal Q. This synthesis is performed by a product operation between the spectrum blocks. Specifically, using the real part Ap(j) and imaginary part Bp(j) of the reference acoustic signal P and the real part Aq(j) and imaginary part Bq(j) of the acoustic signal Q, all obtained by [Formula 1], the real part A′(j) and imaginary part B′(j) of the synthesized value are calculated by [Formula 2] below.

[Formula 2]
$$A'(j) = A_p(j) \cdot C \{E_q(j) / E_p(j)\}^{1/4}$$
$$B'(j) = B_p(j) \cdot C \{E_q(j) / E_p(j)\}^{1/4}$$
where $E_p(j) = A_p(j)^2 + B_p(j)^2$ and $E_q(j) = A_q(j)^2 + B_q(j)^2$.

In [Formula 2], C is a proportional coefficient for amplitude correction and is set as appropriate. Because {Eq(j)/Ep(j)} in the first and second expressions of [Formula 2] is usually smaller than 1, multiplying by it lowers the signal level. The proportional coefficient C is multiplied in to correct for this, so C needs to be greater than 1. The product operation between spectrum blocks referred to above means that the real part Ap(j) and imaginary part Bp(j), which are the frequency components of the reference acoustic signal P, are multiplied by the spectral intensity Eq(j) of the acoustic signal Q.
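A per-bin sketch of [Formula 2] may make the operation concrete. Since Ep(j) and Eq(j) are squared magnitudes, the magnitude of the result works out to C·{Ep(j)·Eq(j)}^(1/4), that is, C times the geometric mean of the two signals' magnitudes, while the ratio of real to imaginary part (the phase) is that of the reference signal P. The function name `synthesize_bin` and the `eps` guard for an empty reference bin are assumptions added for illustration; the text does not say how Ep(j) = 0 is handled.

```python
def synthesize_bin(ap, bp, aq, bq, c=1.0, eps=1e-12):
    """Apply [Formula 2] to one frequency bin j and return (A', B')."""
    ep = ap * ap + bp * bp  # spectral intensity of reference signal P
    eq = aq * aq + bq * bq  # spectral intensity of signal Q
    if ep < eps:
        # Assumption: a bin with no reference energy stays silent
        # (avoids division by zero; not specified in the text).
        return 0.0, 0.0
    gain = c * (eq / ep) ** 0.25
    return ap * gain, bp * gain


# Example: |P| = 5 and |Q| = 20 give |result| = sqrt(5 * 20) = 10 for C = 1.
print(synthesize_bin(3.0, 4.0, 0.0, 20.0))
```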

Eq(j) is calculated for each block, but with some acoustic signals the subsequent spectrum synthesis can produce granular noise, because the spectrum becomes discontinuous along the time axis. As a countermeasure, it is desirable to use the average of the Eq(j) values calculated for several blocks near the target block. In this embodiment, the average of three Eq(j) values, namely that of the target block and those of the blocks up to two blocks earlier, is used as Eq(j) in the first and second expressions of [Formula 2].
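The three-block averaging of Eq(j) could be kept in a small rolling buffer, as in the sketch below. The class name `IntensitySmoother` is hypothetical, and averaging over fewer blocks at the very start of the signal (before two past blocks exist) is an assumption; the text does not describe the start-up case.

```python
from collections import deque


class IntensitySmoother:
    """Average E_q(j) over the current block and up to two past blocks."""

    def __init__(self, history=3):
        # deque(maxlen=...) discards the oldest block automatically.
        self._blocks = deque(maxlen=history)

    def update(self, eq):
        """Add the current block's intensities and return the per-bin average."""
        self._blocks.append(list(eq))
        count = len(self._blocks)  # assumption: 1 or 2 during start-up
        return [sum(block[j] for block in self._blocks) / count
                for j in range(len(eq))]


smoother = IntensitySmoother()
for eq in ([3.0, 0.0], [6.0, 3.0], [0.0, 3.0], [9.0, 0.0]):
    print(smoother.update(eq))
```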

Next, the frequency inverse conversion means 40 applies an inverse Fourier transform to the synthesized spectrum block to obtain a synthesized acoustic block. Specifically, x′(i) is calculated according to [Formula 3] below, using the real part A′(j) and imaginary part B′(j) of the spectrum obtained by [Formula 2].

[Formula 3]
$$x'(i) = \frac{1}{N} \left\{ \sum_{j=0}^{N-1} A'(j) \cos(2\pi i j / N) - \sum_{j=0}^{N-1} B'(j) \sin(2\pi i j / N) \right\}$$

[Formula 3] yields each sample x′(i) of the synthesized acoustic block. The output means 50 records the synthesized acoustic blocks in an output file one after another. By carrying out the above processing over all samples of the acoustic signals, all synthesized acoustic blocks are recorded in the output file and the synthesized acoustic signal is obtained. The synthesized acoustic signal is output to and stored in the synthesized acoustic signal storage unit 62 in the storage means 60.
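The inverse transform of [Formula 3] can be sketched in the same direct-summation style. One point on sign conventions is worth noting: the minus sign before the sine sum recovers the original samples when the imaginary part is taken as −Σ x(i)·sin(2πij/N), the usual discrete Fourier transform convention. With B(j) taken exactly as the positive sine sum written in [Formula 1], the same expression would return the block circularly reversed in time. The sketch below assumes the usual convention for its forward transform; the function names `forward_dft` and `inverse_dft` are hypothetical.

```python
import math


def forward_dft(x):
    """Forward transform with the usual DFT sign: imaginary part = -sum(x * sin)."""
    n = len(x)
    re = [sum(x[i] * math.cos(2 * math.pi * i * j / n) for i in range(n))
          for j in range(n)]
    im = [-sum(x[i] * math.sin(2 * math.pi * i * j / n) for i in range(n))
          for j in range(n)]
    return re, im


def inverse_dft(a, b):
    """[Formula 3]: x'(i) = (1/N) * (sum_j A'(j) cos(..) - sum_j B'(j) sin(..))."""
    n = len(a)
    return [(sum(a[j] * math.cos(2 * math.pi * i * j / n) for j in range(n))
             - sum(b[j] * math.sin(2 * math.pi * i * j / n) for j in range(n))) / n
            for i in range(n)]


x = [1.0, 2.0, 0.0, -1.0]
re, im = forward_dft(x)
print(inverse_dft(re, im))  # approximately the original block
```

Because the gain in [Formula 2] is the same real factor for the real and imaginary parts of a bin, either sign convention leads to the same synthesized block as long as the forward and inverse transforms form a matching pair.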

The above example describes synthesizing two acoustic signals. When three or more acoustic signals are synthesized, [Formula 4] to [Formula 6] below, which generalize [Formula 1] to [Formula 3], are used instead. In [Formula 4] to [Formula 6], each of the K acoustic signals to be synthesized is assigned a number k (k = 0, 1, 2, …, K−1), and the reference acoustic signal is k = 0.

[Formula 4]
$$A(k, j) = \sum_{i=0}^{N-1} x(k, i) \cos(2\pi i j / N)$$
$$B(k, j) = \sum_{i=0}^{N-1} x(k, i) \sin(2\pi i j / N)$$

In [Formula 4], i, j, and N are the same as in [Formula 1].

[Formula 5]
$$A'(j) = A(0, j) \cdot C \{S(j) / E(0, j)\}^{1/4}$$
$$B'(j) = B(0, j) \cdot C \{S(j) / E(0, j)\}^{1/4}$$
where $S(j) = \{E(1, j) \cdot E(2, j) \cdots E(K-1, j)\}^{1/(K-1)}$ and $E(k, j) = A(k, j)^2 + B(k, j)^2$.

In [Formula 5], the real part A′(j) and imaginary part B′(j) of the synthesized value are calculated using the real part A(k, j) and imaginary part B(k, j) of each acoustic signal obtained by [Formula 4]. The correction coefficient C in [Formula 5] is the same as in [Formula 2].
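For K signals, S(j) in [Formula 5] is the geometric mean of the intensities of the K−1 non-reference signals, so for K = 2 it reduces to the single intensity used in [Formula 2]. The following per-bin sketch illustrates this; the function names and the `eps` guard for an empty reference bin are assumptions, not part of the text.

```python
import math


def geometric_mean_intensity(e_non_ref):
    """S(j) = (E(1,j) * ... * E(K-1,j)) ** (1 / (K-1)) for one bin j."""
    return math.prod(e_non_ref) ** (1.0 / len(e_non_ref))


def synthesize_bin_multi(a, b, c=1.0, eps=1e-12):
    """Apply [Formula 5] to one bin; a[k], b[k] belong to signal k, k = 0 is the reference."""
    e = [ak * ak + bk * bk for ak, bk in zip(a, b)]
    if e[0] < eps:
        # Assumption: a bin with no reference energy stays silent.
        return 0.0, 0.0
    s = geometric_mean_intensity(e[1:])
    gain = c * (s / e[0]) ** 0.25
    return a[0] * gain, b[0] * gain


print(synthesize_bin_multi([3.0, 0.0, 0.0], [4.0, 20.0, 20.0]))
```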

S(j) is calculated for each block, but with some acoustic signals the subsequent spectrum synthesis can produce granular noise, because the spectrum becomes discontinuous along the time axis. As a countermeasure, it is desirable to use the average of the S(j) values calculated for several blocks near the target block. In this embodiment, the average of three S(j) values, namely that of the target block and those of the blocks up to two blocks earlier, is used as S(j) in the first and second expressions of [Formula 5].

Next, the acoustic signal synthesizer applies an inverse Fourier transform to the synthesized spectrum block to obtain a synthesized acoustic block. Specifically, x′(i) is calculated according to [Formula 6] below, using the real part A′(j) and imaginary part B′(j) of the spectrum obtained by [Formula 5].

[Formula 6]
$$x'(i) = \frac{1}{N} \left\{ \sum_{j=0}^{N-1} A'(j) \cos(2\pi i j / N) - \sum_{j=0}^{N-1} B'(j) \sin(2\pi i j / N) \right\}$$

In [Formula 6], each sample x′(i) of the synthesized acoustic signal is calculated using the real part A′(j) and imaginary part B′(j) of the spectrum obtained by [Formula 5].

(2.2. Processing when the acoustic signals differ in performance time)
When the acoustic signals k (k = 0, 1, 2, …, K−1) to be synthesized differ in performance time, no problem arises if the performance time of the reference acoustic signal is the shorter one. If the performance time of the reference acoustic signal is the longer one, or if the time axes are to be aligned with each other, the following processing is performed.

First, the number of samples per block of the reference acoustic signal (k = 0) is set to a value N(0) that is a power of 2 (for example, 4096) so that the fast Fourier transform can be applied, and the number of blocks B to be analyzed is calculated. For each acoustic signal other than the reference acoustic signal (hereinafter, a non-reference acoustic signal), the number of samples N(k) to be analyzed per block is determined so that the number of blocks is also B. When Fourier analysis is performed on a non-reference acoustic signal, N(k) samples are extracted as an acoustic block and resampled (time-axis scaling) to N(0) samples. Specifically, for each sample x(k, i) (i = 0, …, N−1) of a non-reference acoustic signal with k ≥ 1, x″(k, i) is obtained according to [Formula 7] below. This processing is performed by the block reading means 10.

[Formula 7]
$$x''(k, i) = x(k, \mathrm{INT}\{i \cdot N(k) / N(0)\})$$

In [Formula 7], INT{ } denotes the integer obtained by discarding the fractional part of the value in the braces. The resulting x″(k, i) are the resampled samples.
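[Formula 7] amounts to picking, for each output index, the input sample at the truncated scaled index. A minimal sketch follows; the function name `resample_block` is hypothetical, and the output index is assumed to run over i = 0, …, N(0)−1 so that exactly N(0) samples are produced.

```python
def resample_block(x_k, n0):
    """Resample an N(k)-sample block to n0 samples per [Formula 7]."""
    n_k = len(x_k)
    # Integer floor division implements INT{i * N(k) / N(0)} for non-negative i.
    return [x_k[(i * n_k) // n0] for i in range(n0)]


print(resample_block([0, 1, 2, 3, 4, 5], 4))  # [0, 1, 3, 4]
print(resample_block([10, 20], 4))            # [10, 10, 20, 20]
```

The largest index used is INT{(N(0)−1)·N(k)/N(0)}, which never exceeds N(k)−1, so the lookup stays inside the extracted block.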

Subsequently, a Fourier transform is applied to the N(0) resampled samples. This is done by substituting x″(k, i) for x(k, i) in [Formula 4]. The frequency-dimension elements A(k, j) and B(k, j) thus obtained are then scaled by N(0)/N(k) to correct the frequency scaling that accompanies the time-axis scaling of the acoustic signal. The correction is made by applying [Formula 8] below to the real and imaginary components for k ≥ 1 and j = 0, …, N(0)/2−1.

[Formula 8]
$$A''(k, j) = A(k, j \cdot N(0) / N(k))$$
$$B''(k, j) = B(k, j \cdot N(0) / N(k))$$

[Formula 8] yields a spectrum consisting of the corrected frequency components A″(k, j) and B″(k, j). This processing is performed by the frequency conversion means 20. When the spectra are synthesized, the frequency components of the synthesized spectrum block are obtained by substituting A″(k, j) and B″(k, j), obtained for each non-reference acoustic signal, for A(k, j) and B(k, j) in [Formula 5].
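The index remapping of [Formula 8] might be sketched as below. Several details are assumptions, because the text leaves them open: the scaled index j·N(0)/N(k) is truncated to an integer in the same way as INT{ } in [Formula 7], source bins at or above N(0)/2 are treated as unavailable and left at zero, and only the lower half j = 0, …, N(0)/2−1 is returned. The function name `rescale_spectrum` is hypothetical.

```python
def rescale_spectrum(a, b, n_k):
    """Remap bins per [Formula 8]; len(a) == len(b) == N(0), n_k == N(k)."""
    n0 = len(a)
    half = n0 // 2
    a2 = [0.0] * half
    b2 = [0.0] * half
    for j in range(half):
        src = (j * n0) // n_k  # assumption: truncate the scaled index
        if src < half:         # assumption: out-of-range bins stay zero
            a2[j] = a[src]
            b2[j] = b[src]
    return a2, b2


a = [float(v) for v in range(8)]
b = [0.0] * 8
print(rescale_spectrum(a, b, 4))   # index doubled: bins 0 and 2 are reachable
print(rescale_spectrum(a, b, 16))  # index halved: each source bin used twice
```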

The acoustic signal synthesizer according to the present invention can process each channel whether each acoustic signal consists of a single channel or of multiple channels. When the channels of an original acoustic signal differ in signal level, the frequency components of the spectrum block obtained for each channel are scaled with a channel-specific weight before being synthesized with the corresponding channel of the other acoustic signals. Specifically, the scaling is applied to E(k, j) calculated in [Formula 5].

(3.1 Configuration of the reproducing apparatus)
Next, the acoustic signal reproducing apparatus according to the present invention will be described. FIG. 2 is a configuration diagram showing an embodiment of the acoustic signal reproducing apparatus according to the present invention. In FIG. 2, 70 is a synthesized block storage means, 80 is a sound device driver, 81 is a sound device, 82 is a timer, and 90 is a synthesis ratio setting means.

The block reading means 10 through the frequency inverse conversion means 40 are exactly the same as those in the acoustic signal synthesizer shown in FIG. 1. The synthesized block input means 51 differs from the output means 50 shown in FIG. 1 in its output destination: it sends synthesized acoustic blocks to the synthesized block storage means 70 rather than to the synthesized acoustic signal storage unit 62. The synthesized block input means 51 does not simply pass blocks along; as described later, it also controls the input of synthesized acoustic blocks when the synthesized block storage means 70 has no free space. The synthesized block storage means 70 has a plurality of buffer memories for storing synthesized acoustic blocks and handles the stored blocks in FIFO (first-in, first-out) fashion, in which the information that arrives first leaves first. That is, it stores the synthesized acoustic blocks in the order in which the synthesized block input means 51 supplies them and passes them to the sound device driver 80 in that order. The sound device driver 80 drives the sound device 81 to reproduce the synthesized acoustic blocks as sound, and the sound device 81 D/A-converts the synthesized acoustic blocks, which are digital data, and reproduces them as sound. The sound device driver 80 and the sound device 81 thus function as synthesized acoustic block reproduction means. The timer 82 is used to synchronize the reproduction of the acoustic signal by the sound device with the reproduction of an acoustic signal by an external device, and is shared with the timer that manages time in the computer. The synthesis ratio setting means 90 sets the ratio at which the plurality of acoustic signals are synthesized.
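The FIFO behavior of the synthesized block storage means 70 can be illustrated with a small bounded queue. This is only a sketch under assumptions: the class name `SynthesizedBlockStore` and its methods are hypothetical, and the default capacity of four blocks follows the figure given for this embodiment in the description of the processing operation.

```python
from collections import deque


class SynthesizedBlockStore:
    """Bounded first-in, first-out store for synthesized acoustic blocks."""

    def __init__(self, capacity=4):
        self._capacity = capacity
        self._queue = deque()

    def has_space(self):
        return len(self._queue) < self._capacity

    def put(self, block):
        """Store a block; return False (and store nothing) when the store is full."""
        if not self.has_space():
            return False
        self._queue.append(block)
        return True

    def get(self):
        """Remove and return the oldest block, or None when the store is empty."""
        return self._queue.popleft() if self._queue else None


store = SynthesizedBlockStore(capacity=2)
print(store.put("block-1"), store.put("block-2"), store.put("block-3"))
print(store.get())
```

Returning `False` from `put` stands in for the input control performed by the synthesized block input means 51 when no buffer is free.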

(3.2 Processing operation of the reproducing apparatus)
Next, the processing operation of the reproducing apparatus shown in FIG. 2 will be described. First, the user of the reproducing apparatus sets the synthesis ratio of the plurality of acoustic signals with the synthesis ratio setting means 90. Specifically, a value corresponding to the proportional coefficient C in [Formula 2] is set. The synthesis ratio thus set is passed from the synthesis ratio setting means 90 to the block synthesizing means 30.

After the setting is made, the acoustic signal reproducing apparatus starts processing. First, the block reading means 10 reads the plurality of acoustic signals block by block. Subsequently, the frequency conversion means 20 converts the acoustic blocks into spectrum blocks.

Next, the block synthesizing means 30 synthesizes the spectrum blocks according to [Formula 5]. If a synthesis ratio has been set, the spectral intensity E(k, j) of each acoustic signal is multiplied by the set ratio when S(j) is calculated.

Subsequently, the frequency inverse conversion means 40 applies an inverse Fourier transform to the resulting spectrum blocks to generate synthesized acoustic blocks. The synthesized acoustic blocks obtained by the frequency inverse conversion means 40 are stored one after another in the synthesized block storage means 70 by the synthesized block input means 51. In this embodiment, the synthesized block storage means 70 can hold up to four blocks, and the sound device driver 80 does not start processing until four blocks have been stored. As shown in FIG. 3, once four synthesized acoustic blocks have been stored in the synthesized block storage means 70, the sound device driver 80 reproduces the first of the stored synthesized acoustic blocks as sound. Specifically, the sound device 81 D/A-converts the data of the synthesized acoustic block and outputs it to a speaker. A synthesized acoustic block that has been reproduced is deleted from the synthesized block storage means 70.

When a synthesized acoustic block is deleted and space becomes available in the synthesized block storage means 70, the synthesized block input means 51 puts a new synthesized acoustic block into the synthesized block storage means 70, which is thereby filled to its maximum capacity again. In practice, synthesized acoustic blocks are put into the synthesized block storage means 70 by the CPU functioning as the synthesized block input means 51. The synthesized block input means 51 does not simply put synthesized acoustic blocks into the synthesized block storage means 70; when the synthesized block storage means 70 has no free space, it sends a message to suspend processing to the block reading means 10, the frequency conversion means 20, the block synthesizing means 30, and the frequency inverse conversion means 40, thereby controlling the input of synthesized acoustic blocks into the synthesized block storage means 70.

Meanwhile, the sound device driver 80 sequentially reproduces the first of the synthesized acoustic blocks accumulated in the synthesized block storage means 70. Each time it finishes reproducing one synthesized acoustic block, the sound device driver 80 sends a message permitting execution of their respective processing to the block reading means 10, the frequency transform means 20, the block synthesis means 30, the frequency inverse transform means 40, and the synthesized block input means 51.

The processing in the reproducing apparatus is summarized in the flowchart of FIG. 4. First, the synthesized block input means 51 checks whether a free buffer memory exists in the synthesized block storage means 70 (step S1). If no free buffer memory exists, a message to suspend processing is sent to the block reading means 10, the frequency transform means 20, the block synthesis means 30, the frequency inverse transform means 40, and the synthesized block input means 51, and the apparatus waits to receive a reproduction end message from the sound device driver 80 (step S2). When a reproduction end message arrives from the sound device driver 80, the synthesized acoustic block whose reproduction has finished is deleted from the buffer memory that held it, and that buffer is marked as free (step S3). Because the reproduction end message from the sound device driver 80 is sent at the same time to the block reading means 10, the frequency transform means 20, the block synthesis means 30, the frequency inverse transform means 40, and the synthesized block input means 51, these means resume processing and the acoustic signals are synthesized (step S4). The resulting synthesized acoustic block is then stored in the free buffer memory (step S5). Meanwhile, the sound device 81 continually searches the buffer memories in the synthesized block storage means 70 (step S6) and, when a synthesized acoustic block is present, reproduces it (step S7). It waits for the reproduction of that one synthesized acoustic block to finish (step S8) and, once reproduction has ended, sends a reproduction end message to the block reading means 10, the frequency transform means 20, the block synthesis means 30, the frequency inverse transform means 40, and the synthesized block input means 51 (step S9).
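The interplay of steps S1 to S9 can be sketched as two threads sharing a bounded buffer. This is an illustrative simulation under stated assumptions, not the patented implementation: a `threading.Condition` stands in for the suspend and reproduction end messages, `synthesize` is a hypothetical placeholder for the block reading, Fourier transform, synthesis and inverse transform chain, and `TOTAL_BLOCKS` is an arbitrary signal length.

```python
import threading
from collections import deque

CAPACITY = 4        # buffer memories in the synthesized block storage means
TOTAL_BLOCKS = 10   # hypothetical length of the synthesized signal, in blocks

buffers = deque()
cond = threading.Condition()
played = []


def synthesize(n):
    # Placeholder for reading, Fourier transform, synthesis and inverse transform (S4).
    return f"block{n}"


def input_side():
    for n in range(TOTAL_BLOCKS):
        with cond:
            # S1/S2: if no buffer is free, suspend until a reproduction end message arrives.
            while len(buffers) >= CAPACITY:
                cond.wait()
        block = synthesize(n)        # S4: synthesis resumes
        with cond:
            buffers.append(block)    # S5: store the block in a free buffer
            cond.notify_all()


def playback_side():
    for _ in range(TOTAL_BLOCKS):
        with cond:
            while not buffers:       # S6: look for a stored block
                cond.wait()
            block = buffers[0]
        played.append(block)         # S7/S8: reproduce the first block to completion
        with cond:
            buffers.popleft()        # S3: the finished block's buffer becomes free
            cond.notify_all()        # S9: reproduction end message


producer = threading.Thread(target=input_side)
consumer = threading.Thread(target=playback_side)
producer.start()
consumer.start()
producer.join()
consumer.join()
```

With a single input thread and a single playback thread, the buffer never holds more than `CAPACITY` blocks, and the blocks are reproduced in the order in which they were synthesized.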

FIG. 1 is a functional block diagram of the acoustic signal synthesizing apparatus according to the present invention.
FIG. 2 is a functional block diagram of the acoustic signal synthesizing and reproducing apparatus according to the present invention.
FIG. 3 is a diagram showing the acoustic signal synthesizing and reproducing apparatus in a state in which synthesized acoustic blocks have been accumulated.
FIG. 4 is a flowchart showing the processing operation of the acoustic signal synthesizing and reproducing apparatus.

Explanation of symbols

10 ... Block reading means
20 ... Frequency transform means
30 ... Block synthesis means
40 ... Frequency inverse transform means
50 ... Output means
51 ... Synthesized block input means
60 ... Storage means
61 ... Acoustic signal storage unit
62 ... Synthesized acoustic signal storage unit
70 ... Synthesized block storage means
80 ... Sound device driver
81 ... Sound device
82 ... Timer
90 ... Synthesis ratio setting means

Claims

An apparatus for synthesizing a plurality of acoustic signals, each composed of a time-series sample sequence, to generate a synthesized acoustic signal, the apparatus comprising:
block reading means for reading a predetermined number of samples as an acoustic block from each of the plurality of acoustic signals;
frequency transform means for performing a Fourier transform on each read acoustic block to generate a spectrum block;
block synthesis means for synthesizing the spectrum blocks obtained from the respective acoustic signals by performing a product operation on their corresponding components, thereby generating a synthesized spectrum block;
frequency inverse transform means for performing an inverse Fourier transform on the generated synthesized spectrum block to generate a synthesized acoustic block; and
output means for outputting the generated synthesized acoustic blocks in time-series order.

The apparatus for synthesizing acoustic signals according to claim 1,
wherein the block reading means reads the acoustic blocks so that they overlap by a predetermined number of samples on the time axis of the acoustic signal, and modifies the sample values in each read acoustic block with a predetermined weight function.

The apparatus for synthesizing acoustic signals according to claim 1,
wherein the block synthesis means performs the synthesis using an average value of the corresponding components of one or more spectrum blocks that precede, in time, the spectrum blocks obtained from the respective acoustic signals.

The apparatus for synthesizing acoustic signals according to claim 1, wherein:
the block reading means reads the acoustic blocks using a different number of samples per acoustic block for each acoustic signal so that every acoustic signal yields the same number of blocks, and then, taking one of the plurality of acoustic signals as a reference acoustic signal, deletes some of the samples in the acoustic blocks of the other acoustic signals so that their sample counts match that of the acoustic blocks of the reference acoustic signal;
the frequency transform means performs a Fourier transform on the acoustic blocks from which some samples have been deleted, and applies scaling in the frequency-axis direction to the resulting spectrum blocks; and
the block synthesis means processes the spectrum blocks that have undergone the scaling in the frequency-axis direction.

The apparatus for synthesizing acoustic signals according to claim 1,
wherein the block synthesis means performs the synthesis after scaling the frequency components of the spectrum block obtained from each acoustic signal based on a weight set in advance for that acoustic signal.

The apparatus for synthesizing acoustic signals according to claim 5,
wherein, when each acoustic signal is composed of a plurality of channels,
the block synthesis means scales the frequency components of the spectrum block obtained for each channel with a different weight for each channel.

An apparatus for synthesizing a plurality of acoustic signals, each composed of a time-series sample sequence, and acoustically reproducing the result, the apparatus comprising:
block reading means for reading a predetermined number of samples as an acoustic block from each of the plurality of acoustic signals;
frequency transform means for performing a Fourier transform on each read acoustic block to generate a spectrum block;
block synthesis means for synthesizing the spectrum blocks obtained from the respective acoustic signals by performing a product operation on their corresponding components, thereby generating a synthesized spectrum block;
frequency inverse transform means for performing an inverse Fourier transform on the generated synthesized spectrum block to generate a synthesized acoustic block;
synthesized block storage means for accumulating two or more synthesized acoustic blocks;
synthesized block input means for inputting the generated synthesized acoustic block into the synthesized block storage means; and
synthesized block reproduction means for acoustically reproducing the first of the synthesized acoustic blocks present in the synthesized block storage means, deleting that synthesized acoustic block from the synthesized block storage means after its reproduction has finished so as to make room in the synthesized block storage means for a new synthesized acoustic block, and, if a next synthesized acoustic block is stored, acoustically reproducing that next synthesized acoustic block in continuation of the first synthesized acoustic block.

The apparatus for synthesizing and reproducing acoustic signals according to claim 7,
wherein, when inputting a synthesized acoustic block,
the synthesized block input means, if the synthesized block storage means has no room to accept a new synthesized acoustic block, sends a message to suspend operation to the block reading means, the frequency transform means, the block synthesis means, and the frequency inverse transform means, and thereby performs control so that each of these means is suspended in its current state.

The apparatus for synthesizing and reproducing acoustic signals according to claim 7,
wherein, each time it finishes reproducing one synthesized acoustic block, the synthesized block reproduction means performs control so that the block reading means, the frequency transform means, the block synthesis means, and the frequency inverse transform means resume the operations that each of them had suspended.

A computer program for causing a computer to function as:
block reading means for reading a predetermined number of samples as an acoustic block from each of a plurality of acoustic signals;
frequency transform means for performing a Fourier transform on each read acoustic block to generate a spectrum block;
block synthesis means for synthesizing the spectrum blocks obtained from the respective acoustic signals by performing a product operation on their corresponding components, thereby generating a synthesized spectrum block;
frequency inverse transform means for performing an inverse Fourier transform on the generated synthesized spectrum block to generate a synthesized acoustic block; and
output means for outputting the generated synthesized acoustic blocks in time-series order.

A computer program for causing a computer to function as:
block reading means for reading a predetermined number of samples as an acoustic block from each of a plurality of acoustic signals;
frequency transform means for performing a Fourier transform on each read acoustic block to generate a spectrum block;
block synthesis means for synthesizing the spectrum blocks obtained from the respective acoustic signals by performing a product operation on their corresponding components, thereby generating a synthesized spectrum block;
frequency inverse transform means for performing an inverse Fourier transform on the generated synthesized spectrum block to generate a synthesized acoustic block;
synthesized block storage means for accumulating two or more synthesized acoustic blocks;
synthesized block input means for inputting the generated synthesized acoustic block into the synthesized block storage means; and
synthesized block reproduction means for acoustically reproducing the first of the synthesized acoustic blocks present in the synthesized block storage means, deleting that synthesized acoustic block from the synthesized block storage means after its reproduction has finished so as to make room in the synthesized block storage means for a new synthesized acoustic block, and, if a next synthesized acoustic block is stored, acoustically reproducing that next synthesized acoustic block in continuation of the first synthesized acoustic block.
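As an illustration of the processing chain recited in the claims (block reading, Fourier transform, component-wise product, inverse Fourier transform, output in time order), the following is a minimal sketch. It rests on several assumptions that the claims do not fix: the product is taken as a product of complex spectral components, blocks do not overlap and no weight function is applied, and a naive discrete Fourier transform is used in place of a fast one. The function names `dft`, `idft` and `synthesize` are hypothetical.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform of a sequence of samples."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Naive inverse DFT; returns the real parts as the time-domain block."""
    n = len(spectrum)
    return [(sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * t / n)
                 for k in range(n)) / n).real
            for t in range(n)]


def synthesize(signals, block_size):
    """Transform each signal block by block, multiply components, and invert."""
    n_blocks = min(len(s) for s in signals) // block_size
    out = []
    for b in range(n_blocks):
        start = b * block_size
        # One spectrum block per acoustic signal.
        spectra = [dft(s[start:start + block_size]) for s in signals]
        # Product of corresponding components across all spectrum blocks.
        product = spectra[0]
        for spec in spectra[1:]:
            product = [p * q for p, q in zip(product, spec)]
        # Synthesized acoustic blocks are appended in time-series order.
        out.extend(idft(product))
    return out


# A unit impulse has a flat spectrum of ones, so the product leaves the
# second block essentially unchanged (up to floating-point rounding).
mixed = synthesize([[1, 0, 0, 0], [1, 2, 3, 4]], block_size=4)
```

Under the complex-product assumption, multiplying the spectra of two blocks corresponds to circular convolution of those blocks in the time domain, which is why the impulse in the usage line acts as an identity.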