JP2017026795A

JP2017026795A - Transmission system, encoding device, decoding device, and method and program therefor

Info

Publication number: JP2017026795A
Application number: JP2015144740A
Authority: JP
Inventors: 尚植松; Hisashi Uematsu; 修一坂本; Shuichi Sakamoto; ホルヘトレビーニョ; Trevino Jorge; 陽一鈴木; Yoichi Suzuki
Original assignee: Tohoku University NUC; Nippon Telegraph and Telephone Corp
Current assignee: Tohoku University NUC; Nippon Telegraph and Telephone Corp
Priority date: 2015-07-22
Filing date: 2015-07-22
Publication date: 2017-02-02
Anticipated expiration: 2035-07-22
Also published as: JP6355207B2

Abstract

PROBLEM TO BE SOLVED: To provide a transmission system that efficiently transmits sound space information.SOLUTION: An encoding device 100 generates a matrix M having observation signals S-Sof respective microphones and values normalized by autocovariance of the observation signals (110), sorts the N eigenvalues λand eigenvectors eof a square matrix MM so that they are in descending order with respect to the eigenvalues λ(140), obtains a matrix E'=[e(1)e(2) to e(N')] consisting only of N' eigenvectors e(n'), corresponding to N' eigenvalues, respectively, on the basis of a relationship between a predetermined threshold and values corresponding to the eigenvalues after the sorting (150), calculates a variance of the observation signals and a matrix μ' that is a product of the matrix M and matrix E' at a dimension reduction unit 160, and determines a difference between the observation signals S-Sand an approximate value as an error at an error calculation unit 170. A decoding device includes a decoding unit that decodes the observation signals using the matrix μ' and the matrix E', N variances σ, and the error.SELECTED DRAWING: Figure 3

Description

本発明は、複数のマイクロホンを用いて収音した音を伝送する技術に関する。なお、三次元空間上に複数のマイクロホンを配置し、音を収音することで、その観測信号には、音場の情報が含まれるため、音場の情報を伝送する技術に関すると言ってもよい。なお、音場の情報とは、例えばインパルス応答などであり、音空間情報、または、音の三次元空間情報を含む特性と言ってもよい。 The present invention relates to a technique for transmitting sound collected using a plurality of microphones. It should be noted that, by arranging a plurality of microphones in a three-dimensional space and collecting sound, the observation signal includes information on the sound field. Good. The sound field information is, for example, an impulse response or the like, and may be referred to as characteristics including sound space information or sound three-dimensional space information.

複数のマイクロホンを用いて収音した音を伝送する技術として非特許文献１〜３が知られている。非特許文献１〜３では、音声情報を圧縮し伝送する。 Non-Patent Documents 1 to 3 are known as techniques for transmitting sound collected using a plurality of microphones. In Non-Patent Documents 1 to 3, audio information is compressed and transmitted.

M.Hans, R.Schafer, “Lossless compression of digital audio”, Signal Processing Magazine, IEEE, 2001年, vol. 18, pp.21-32.M. Hans, R. Schafer, “Lossless compression of digital audio”, Signal Processing Magazine, IEEE, 2001, vol. 18, pp. 21-32. J.Herre, “Form joing stereo to spatial audio coding recent progress and standardization”, In 6th Intl. Conf. on Digital Audio Effects, 2004年, p.6.J. Herre, “Form joing stereo to spatial audio coding recent progress and standardization”, In 6th Intl. Conf. On Digital Audio Effects, 2004, p. 6. J.Breebaart, et.al, “Backgroud, Concept, and Architecture for the Recent MPEG Surround Standard on Multichannel Audio Compression”, J. Audio Eng. Soc., 2007年, vol.55(5), pp. 331-351.J. Breebaart, et.al, “Backgroud, Concept, and Architecture for the Recent MPEG Surround Standard on Multichannel Audio Compression”, J. Audio Eng. Soc., 2007, vol. 55 (5), pp. 331-351 .

しかしながら、非特許文献１〜３の音声圧縮の技術は、複数のマイクロホンを隣接して並べた際のマイクロホン間の相関は考慮されていない。すなわち、複数のマイクロホンを隣接して並べた場合には、各マイクロホンには非常に相関の高い信号が入力されることとなるが、従来の技術ではこれらは独立の信号として扱っている。 However, the sound compression techniques of Non-Patent Documents 1 to 3 do not consider the correlation between microphones when a plurality of microphones are arranged adjacent to each other. That is, when a plurality of microphones are arranged adjacent to each other, a highly correlated signal is input to each microphone, but these are handled as independent signals in the conventional technology.

本発明は、複数のマイクロホン間の相関を考慮して、音空間情報を効率的に伝送する伝送システム、符号化装置、復号装置、それらの方法及びプログラムを提供することを目的とする。 An object of the present invention is to provide a transmission system, an encoding device, a decoding device, and a method and a program for efficiently transmitting sound space information in consideration of the correlation between a plurality of microphones.

上記の課題を解決するために、本発明の一態様によれば、伝送システムは、Nを2以上の整数の何れかとし、N個のマイクロホンで収音した観測信号行列Sを符号化装置から復号装置へ伝送する。符号化装置は、n=1,2,…,Nとし、各マイクロホンの各観測信号s_nと、各観測信号s_nの平均s^- _nとの差分を各観測信号s_nの自己共分散σ_{s_ns_n}で正規化した値m_nを要素とする行列Mを生成する正規化部と、正方行列M^TMのN個の固有値λ_n及び固有ベクトルe_nを、固有値λ_nに関して降順となるように並び替える並び替え部と、N'<N、n'=1,2,…,N'とし、並び替え後の固有値をλ(n)、λ(1)>λ(2)>…>λ(N)、λ(n)に対応する固有ベクトルをe(n)とし、所定の閾値と固有値λ(n')に対応する値との大小関係に基づき、N'個の固有値λ(n')にそれぞれ対応するN'個の固有ベクトルe(n')のみからなる行列E'=[e(1)e(2)…e(N')]を得る打切り部と、N個の各観測信号s_nの分散σ_{s_ns_n}と、行列Mと行列E'との積である行列μ'と、行列E'とを用いて観測信号行列Sの近似値行列S'を求め、観測信号行列Sと近似値行列S'との差分を誤差Rとして求める誤差算出部と、を含む。復号装置は、行列μ'と行列E'とN個の分散σ_{s_ns_n}と誤差Rとを用いて、観測信号行列Sを復号する復号部を含む。 In order to solve the above problem, according to one aspect of the present invention, a transmission system uses N as an integer equal to or greater than 2, and an observation signal matrix S collected by N microphones from an encoding device. Transmit to the decoding device. Encoding device, n = 1, 2, ..., and N, and each of the observed signal s _n of each microphone, the average s of each observed signal s _n ^- autocovariance of the observed signal s _n a difference between _n sigma A normalization unit that generates a matrix M having elements m _n normalized by _{s_ns_n} , and N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M are arranged in descending order with respect to the eigenvalue λ _n. N '<N, n' = 1,2, ..., N ', and the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N ), The eigenvector corresponding to λ (n) is e (n), and N ′ eigenvalues λ (n ′) are respectively determined based on the magnitude relationship between the predetermined threshold and the value corresponding to the eigenvalue λ (n ′). and the corresponding n 'eigenvectors e (n') consisting only of matrix E '= [e (1) e (2) ... e (n')] truncation unit to obtain, for each of the n observed signal s _n and variance _σ s_ns_n, a 'matrix μ is the product of' the matrix M matrix E, obtains a 'approximate value matrix S of the observation signal matrix S by using the' matrix E, the observed signal matrix S Including the difference between the Nitinol matrix S 'and the error calculating section for obtaining an error R, a. The decoding device includes a decoding unit that decodes the observed signal matrix S using the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n,} and the error R.

上記の課題を解決するために、本発明の他の態様によれば、符号化装置は、Nを2以上の整数の何れかとし、N個のマイクロホンで収音した観測信号行列Sに対応する符号を生成する。符号化装置は、n=1,2,…,Nとし、各観測信号ｓ_nと、各観測信号ｓ_nの時間平均ｓ^- _nとの差分を各観測信号ｓ_nの自己共分散σ_{s_ns_n}で正規化した値m_nを要素とする行列Mを生成する正規化部と、正方行列M^TMのN個の固有値λ_n及び固有ベクトルe_nを、固有値λ_nに関して降順となるように並び替える並び替え部と、N'<N、n'=1,2,…,N'とし、並び替え後の固有値をλ(n)、λ(1)>λ(2)>…>λ(N)、λ(n)に対応する固有ベクトルをe(n)とし、所定の閾値以下となるN'個の固有値λ(n')に対応する固有ベクトルe(n')のみからなる行列E'を得る打切り部と、N個の各観測信号ｓ_nの分散σ_{s_ns_n}と、行列Mと行列E'との積である行列μ'と、行列E'とを用いて観測信号行列Sの近似値行列S'を求め、観測信号行列Sと近似値行列S'との差分を誤差Rとして求める誤差算出部と、を含む。 In order to solve the above-described problem, according to another aspect of the present invention, the encoding device corresponds to an observation signal matrix S collected by N microphones, where N is any integer greater than or equal to 2. Generate a code. Encoding device, n = 1,2, ..., and N, and each of the observed signal s _n, times the average s of each observed signal s _n ^- in autocovariance sigma _{S_ns_n} of each observation signal the difference between _n s _n A normalization unit that generates a matrix M having elements of normalized values m _n and an array that rearranges the N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M in descending order with respect to the eigenvalues λ _n. And N ′ <N, n ′ = 1, 2,..., N ′, and the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N), The eigenvector corresponding to λ (n) is e (n), and a truncation unit that obtains a matrix E ′ composed of only eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′) that are equal to or less than a predetermined threshold And an approximate value matrix S ′ of the observed signal matrix S using the variance σ _{s_ns_n} of each of the N observed signals s _n , the matrix μ ′ that is the product of the matrix M and the matrix E ′, and the matrix E ′. And an error calculation unit that calculates a difference between the observed signal matrix S and the approximate value matrix S ′ as an error R.

上記の課題を解決するために、本発明の他の態様によれば、復号装置は、Nを2以上の整数の何れかとし、N個のマイクロホンで収音した観測信号行列Sに対応する符号を受け取り、観測信号行列Sを復号する。復号装置は、n=1,2,…,Nとし、各観測信号s_nと、各観測信号s_nの時間平均s^- _nとの差分を各観測信号s_nの自己共分散σ_{s_ns_n}で正規化した値m_nを要素とする行列をMとし、正方行列M^TMのN個の固有値λ_n及び固有ベクトルe_nを、固有値λ_nに関して降順となるように並び替えて得られる、並び替え後の固有値をλ(n)とし、並び替え後の固有値をλ(n)、λ(1)>λ(2)>…>λ(N)、λ(n)に対応する固有ベクトルをe(n)とし、N'<N、n'=1,2,…,N'とし、所定の閾値以下となるN'個の固有値λ(n')に対応する固有ベクトルe(n')のみからなる行列をE'とし、N個の各観測信号s_nの分散σ_{s_ns_n}と、行列μ'=ME'と行列E'とを用いて得られる観測信号行列Sの近似値行列をS'とし、観測信号行列Sと近似値行列S'との差分を誤差Rとし、行列μ'と行列E'とN個の分散σ_{s_ns_n}と誤差Rとを用いて、観測信号行列Sを復号する復号部を含む。 In order to solve the above-described problem, according to another aspect of the present invention, a decoding device uses a code corresponding to an observation signal matrix S collected by N microphones, where N is any integer of 2 or more. And the observed signal matrix S is decoded. Decoding apparatus, n = 1,2, ..., and N, and each of the observed signal s _n, times the average s of each observed signal s _n ^- normal differences between _n in autocovariance sigma _{S_ns_n} of each observed signal s _n M is a matrix whose elements are generalized values m _n , and is obtained by rearranging the N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M in descending order with respect to the eigenvalues λ _n Is the eigenvalue of λ (n), the rearranged eigenvalue is λ (n), λ (1)> λ (2)>...> λ (N), and the eigenvector corresponding to λ (n) is e (n) N ′ <N, n ′ = 1, 2,..., N ′, and a matrix consisting only of eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′) that are equal to or less than a predetermined threshold _Let E 'be the approximate value matrix of the observed signal matrix S obtained using the variance σ _{s_ns_n} of each of the N observed signals s _n and the matrix μ' = ME 'and the matrix E', and the observed signal matrix The difference between S and the approximate value matrix S ′ is the error R, and the observed signal row is calculated using the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n} and the error R. A decoding unit for decoding the sequence S is included.

上記の課題を解決するために、本発明の他の態様によれば、伝送方法は、Nを2以上の整数の何れかとし、N個のマイクロホンで収音した観測信号行列Sを符号化装置から復号装置へ伝送する。伝送方法は、n=1,2,…,Nとし、符号化装置が、各マイクロホンの各観測信号s_nと、各観測信号s_nの平均s^- _nとの差分を各観測信号s_nの自己共分散σ_{s_ns_n}で正規化した値m_nを要素とする行列Mを生成する正規化ステップと、符号化装置が、正方行列M^TMのN個の固有値λ_n及び固有ベクトルe_nを、固有値λ_nに関して降順となるように並び替える並び替えステップと、N'<N、n'=1,2,…,N'とし、並び替え後の固有値をλ(n)、λ(1)>λ(2)>…>λ(N)、λ(n)に対応する固有ベクトルをe(n)とし、符号化装置が、所定の閾値と固有値λ(n')に対応する値との大小関係に基づき、N'個の固有値λ(n')にそれぞれ対応するN'個の固有ベクトルe(n')のみからなる行列E'=[e(1)e(2)…e(N')]を得る打切りステップと、符号化装置が、N個の各観測信号s_nの分散σ_{s_ns_n}と、行列Mと行列E'との積である行列μ'と、行列E'とを用いて観測信号行列Sの近似値行列S'を求め、観測信号行列Sと近似値行列S'との差分を誤差Rとして求める誤差算出ステップと、復号装置が、行列μ'と行列E'とN個の分散σ_{s_ns_n}と誤差Rとを用いて、観測信号行列Sを復号する復号ステップとを含む。 In order to solve the above-described problem, according to another aspect of the present invention, a transmission method uses N as an integer equal to or greater than 2, and encodes an observation signal matrix S collected by N microphones. To the decoding device. Transmission method, n = 1, 2, ..., and N, the coding apparatus, and the observed signal s _n of each microphone, the average s of each observed signal s _n ^- a difference between _n of each observed signal s _n A normalization step for generating a matrix M whose elements are values m _n normalized by the autocovariance σ _{s_ns_n} , and an encoding device _converts the N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M into eigenvalues. A rearrangement step for rearranging λ _{n in} descending order, N ′ <N, n ′ = 1, 2,..., N ′, and eigenvalues after the rearrangement are λ (n), λ (1)> λ (2)>...> The eigenvectors corresponding to λ (N) and λ (n) are set to e (n), and the encoding device has a magnitude relationship between the predetermined threshold and the value corresponding to the eigenvalue λ (n ′). Based on the matrix E ′ = [e (1) e (2)... E (N ′)] consisting only of N ′ eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′), respectively. a truncation obtaining, encoding apparatus, the variance sigma _{S_ns_n} of n each observed signal s _n, which is the product of the the matrix M matrix E 'line An error calculation step of obtaining an approximate value matrix S ′ of the observation signal matrix S using μ ′ and the matrix E ′, and obtaining a difference between the observation signal matrix S and the approximate value matrix S ′ as an error R, and a decoding device , A decoding step of decoding the observed signal matrix S using the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n,} and the error R.

上記の課題を解決するために、本発明の他の態様によれば、符号化方法は、Nを2以上の整数の何れかとし、N個のマイクロホンで収音した観測信号行列Sに対応する符号を生成する。符号化方法は、n=1,2,…,Nとし、正規化部が、各観測信号s_nと、各観測信号s_nの時間平均s^- _nとの差分を各観測信号s_nの自己共分散σ_{s_ns_n}で正規化した値m_nを要素とする行列Mを生成する正規化ステップと、並び替え部が、正方行列M^TMのN個の固有値λ_n及び固有ベクトルe_nを、固有値λ_nに関して降順となるように並び替える並び替えステップと、N'<N、n'=1,2,…,N'とし、並び替え後の固有値をλ(n)、λ(1)>λ(2)>…>λ(N)、λ(n)に対応する固有ベクトルをe(n)とし、打切り部が、所定の閾値以下となるN'個の固有値λ(n')に対応する固有ベクトルe(n')のみからなる行列E'を得る打切りステップと、誤差算出部が、N個の各観測信号s_nの分散σ_{s_ns_n}と、行列Mと行列E'との積である行列μ'と、行列E'とを用いて観測信号行列Sの近似値行列S'を求め、観測信号行列Sと近似値行列S'との差分を誤差Rとして求める誤差算出ステップと、を含む。 In order to solve the above-described problem, according to another aspect of the present invention, an encoding method corresponds to an observation signal matrix S collected by N microphones, where N is any integer greater than or equal to 2. Generate a code. Encoding method, n = 1, 2, ..., and N, normalization section, and each of the observed signal s _n, times the average s of each observed signal s _n ^- a difference between _n own respective observation signal s _n A normalization step for generating a matrix M whose elements are values m _n normalized by the covariance σ _{s_ns_n} , and a reordering unit _converts the N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M into eigenvalues λ _A rearrangement step for rearranging _{n in} descending order, N ′ <N, n ′ = 1, 2,..., N ′, and eigenvalues after the rearrangement are λ (n), λ (1)> λ ( 2)>...> The eigenvector corresponding to λ (N), λ (n) is e (n), and the eigenvector e corresponding to N ′ eigenvalues λ (n ′) whose truncation part is equal to or less than a predetermined threshold value A truncation step for obtaining a matrix E ′ consisting only of (n ′), and an error calculating unit, a variance σ _{s_ns_n} of each of the N observation signals s _n , and a matrix μ ′ that is a product of the matrix M and the matrix E ′ , Find approximate matrix S 'of observed signal matrix S using matrix E' and approximate observed signal matrix S Including, an error calculation step of obtaining an error R the difference between the matrix S '.

上記の課題を解決するために、本発明の他の態様によれば、復号方法は、Nを2以上の整数の何れかとし、N個のマイクロホンで収音した観測信号行列Sに対応する符号を受け取り、観測信号行列Sを復号する。復号方法は、n=1,2,…,Nとし、各観測信号s_nと、各観測信号s_nの時間平均s^- _nとの差分を各観測信号s_nの自己共分散σ_{s_ns_n}で正規化した値m_nを要素とする行列をMとし、正方行列M^TMのN個の固有値λ_n及び固有ベクトルe_nを、固有値λ_nに関して降順となるように並び替えて得られる、並び替え後の固有値をλ(n)とし、並び替え後の固有値をλ(n)、λ(1)>λ(2)>…>λ(N)、λ(n)に対応する固有ベクトルをe(n)とし、N'<N、n'=1,2,…,N'とし、所定の閾値以下となるN'個の固有値λ(n')に対応する固有ベクトルe(n')のみからなる行列をE'とし、N個の各観測信号s_nの分散σ_{s_ns_n}と、行列μ'=ME'と、行列E'とを用いて得られる観測信号行列Sの近似値行列をS'とし、観測信号行列Sと近似値行列S'との差分を誤差Rとし、復号部が、行列μ'と行列E'とN個の分散σ_{s_ns_n}と誤差Rとを用いて、観測信号行列Sを復号する復号ステップを含む。 In order to solve the above-described problem, according to another aspect of the present invention, a decoding method uses a code corresponding to an observation signal matrix S collected by N microphones, where N is any integer greater than or equal to 2. And the observed signal matrix S is decoded. Decoding method, n = 1,2, ..., and N, and each of the observed signal s _n, times the average s of each observed signal s _n ^- normal differences between _n in autocovariance sigma _{S_ns_n} of each observed signal s _n M is a matrix whose elements are generalized values m _n , and is obtained by rearranging the N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M in descending order with respect to the eigenvalues λ _n Is the eigenvalue of λ (n), the rearranged eigenvalue is λ (n), λ (1)> λ (2)>...> λ (N), and the eigenvector corresponding to λ (n) is e (n) N ′ <N, n ′ = 1, 2,..., N ′, and a matrix consisting only of eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′) that are equal to or less than a predetermined threshold _Let E 'be the approximate value matrix of the observation signal matrix S obtained using the variance σ _{s_ns_n} of each of the N observation signals s _n , the matrix μ' = ME ', and the matrix E', and the observation signal The difference between the matrix S and the approximate value matrix S ′ is an error R, and the decoding unit uses the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n,} and the error R. A decoding step of decoding the observed signal matrix S.

本発明によれば、音空間情報を効率的に伝送することができるという効果を奏する。 According to the present invention, there is an effect that sound space information can be efficiently transmitted.

固有値を対角要素とする対角行列の近似行列を説明するための図。The figure for demonstrating the approximation matrix of the diagonal matrix which uses an eigenvalue as a diagonal element. 第一実施形態に係る伝送システムの機能ブロック図。The functional block diagram of the transmission system which concerns on 1st embodiment. 第一実施形態に係る符号化装置の機能ブロック図。The functional block diagram of the encoding apparatus which concerns on 1st embodiment. 第一実施形態に係る符号化装置の処理フローの例を示す図。The figure which shows the example of the processing flow of the encoding apparatus which concerns on 1st embodiment. 第一実施形態に係る復号装置の機能ブロック図。The functional block diagram of the decoding apparatus which concerns on 1st embodiment. 第一実施形態に係る復号装置の処理フローの例を示す図。The figure which shows the example of the processing flow of the decoding apparatus which concerns on 1st embodiment.

以下、本発明の実施形態について、説明する。なお、以下の説明に用いる図面では、同じ機能を持つ構成部や同じ処理を行うステップには同一の符号を記し、重複説明を省略する。以下の説明において、ベクトルや行列の各要素単位で行われる処理は、特に断りが無い限り、そのベクトルやその行列の全ての要素に対して適用されるものとする。 Hereinafter, embodiments of the present invention will be described. In the drawings used for the following description, constituent parts having the same function and steps for performing the same process are denoted by the same reference numerals, and redundant description is omitted. In the following description, it is assumed that processing performed for each element of a vector or matrix is applied to all elements of the vector or matrix unless otherwise specified.

＜第一実施形態のポイント＞
第一実施形態の音空間情報圧縮方法は、符号化ステップと、復号ステップとからなる。まず、符号化ステップについて説明する。 <Points of first embodiment>
The sound space information compression method according to the first embodiment includes an encoding step and a decoding step. First, the encoding step will be described.

S(j)をN個のマイクロホンから入力されたフレーム長Lの観測信号からなる行列（以下、観測信号行列ともいう）とする。この場合、観測信号行列S(j)は式(1)で表現される Let S (j) be a matrix (hereinafter also referred to as an observation signal matrix) composed of observation signals of frame length L input from N microphones. In this case, the observed signal matrix S (j) is expressed by equation (1)

jはフレームのインデックス、lはフレームj内のサンプルのインデックス、nはマイクロホンのインデックス、l=0,1,…,L-1、n=1,2,…,Nである。なお、以下の説明において、特に断りのない限り、フレームj毎に処理を行うものとし、フレームのインデックスjを省略する。 j is the index of the frame, l is the index of the sample in frame j, n is the index of the microphone, l = 0, 1,..., L−1, n = 1, 2,. In the following description, unless otherwise specified, processing is performed for each frame j, and the index j of the frame is omitted.

ここで、観測信号s_n(l)からその平均s^- _n(マイクロホンn毎の観測信号の平均であり、例えば、時間平均であり、s_n(0),s_n(1),…,s_n(L-1)の平均値)を減算し、自己共分散σ_{s_ns_n}(ただし、下付添え字におけるA_BはA_Bを表し、s_ns_nはs_ns_nを表す)で正規化された値m_n(l)を要素とする行列Mを式(2),(3)のように定義する。 Here, the observed signal s _n (l) from the average s ^- _n (the average of the observation signal of each microphone n, for example, a time _{average, s n (0), s} n (1), ..., s _n (L-1) mean value) is calculated by subtracting an, autocovariance _σ s_ns_n (However, A_B in subscript represents a _B, s_ns_n is s _n s _n represents an) in normalized value m A matrix M having _n (l) as an element is defined as in equations (2) and (3).

ここで、カルーネン・レーベ(Karhunen-Loeve)変換を導入する。この変換は、相関の高い信号を互いに独立な成分に分解するものである。カルーネン・レーベ変換により、次式により表されるN行N列の正方行列M^TMを得る。 Here we introduce the Karhunen-Loeve transformation. This conversion decomposes highly correlated signals into components independent of each other. An N-by-N square matrix M ^T M expressed by the following equation is obtained by Karoonen-Loeve transform.

ただし、正方行列M^TMの要素σ_{m_j,m_k}(ただし、j=1,2,…,N、k=1,2,…,Nであり、下付添え字m_jm_kは、m_jm_kを表す)の値は、マイクロホンj,k間の相関の程度を示している。この正方行列M^TMを、さらに次式により固有値分解する。 However, the element σ _{m_j, m_k} (where j = 1,2, ..., N, k = 1,2, ..., N of the square matrix M ^T M, and the subscript m_jm_k is m _j m _k The value of (represents) indicates the degree of correlation between the microphones j and k. This square matrix M ^T M is further subjected to eigenvalue decomposition according to the following equation.

ただし、Eは、N個の固有ベクトルe_n=[e_n,1,e_n,2,…,e_n,N]^T(要素数N)を要素とする正方行列である。Λは、N個の固有値λ_nを対角要素とする対角行列である。さらに、固有値λ_n及び固有ベクトルe_nを、固有値λ_nに関して降順となるよう並び替える。並び替え後のN個の固有値λ_nをλ(1),λ(2),…,λ(N)と表現すると、λ(1)＞λ(2)＞…＞λ(N)である。固有値λ_nに関して降順に並び替えた後のN個の固有ベクトルe_nをe(1),e(2),…,e(N)と表現し、e(n)=[e(n,1),e(n,2),…,e(n,N)]^Tと表現する。 Here, E is a square matrix having N eigenvectors e _n = [e _{n, 1} , e _{n, 2} ,..., E _{n, N} ] ^T (number of elements N) as elements. Λ is a diagonal matrix having N eigenvalues λ _n as diagonal elements. Furthermore, the eigenvalues lambda _n and eigenvectors e _n, rearranges to be a descending order with respect to the eigenvalue lambda _n. When the rearranged N eigenvalues λ _n are _expressed as λ (1), λ (2),..., Λ (N), λ (1)> λ (2)>. N eigenvectors e _n after sorting in descending order with respect to the eigenvalue λ _n are _expressed as e (1), e (2),..., E (N), and e (n) = [e (n, 1) , e (n, 2),..., e (n, N)] ^T.

次に、行列Mと行列Eの積として得られるL行N列の行列μを、次式で定義する。 Next, an L × N matrix μ obtained as a product of the matrix M and the matrix E is defined by the following equation.

直交性のために次式(7)が成り立つので、正規化された元の行列Mは、式(8)によって求めることができる。 Since the following equation (7) holds because of the orthogonality, the normalized original matrix M can be obtained by equation (8).

降順に並べた固有値λ(n)をある閾値で打ち切って対角行列Λを近似することができる。この場合、対角行列Λの近似行列Λ'、行列Eとその近似行列E'は式(9)の関係になる。 The diagonal matrix Λ can be approximated by truncating eigenvalues λ (n) arranged in descending order at a certain threshold. In this case, the approximate matrix Λ ′ and the matrix E of the diagonal matrix Λ have the relationship of Equation (9).

なお、式(9)を満たすのであれば、閾値及びN'はどのような値に設定してもよい。例えば、固有値λ(1)からλ(n')までの総和x'に対して閾値y₁(0<y₁<1)を設けてもよいし(図１のS4の(x'<y₁?)に相当)、固有値λ(1)からλ(n')までの固有値の個数n'に対して閾値y₂(1≦y₂≦N)を設けてもよいし(図１のS4の(n'<y₂?)に相当)、各固有値λ(n')に対して閾値y₃(0<y₃<1)を設けてもよい(図１のS4の(λ(n')>y₃?)に相当)。さらに、これらの条件を組合せてもよい（例えば、上述の条件の少なくとも２つ以上の条件に対して、and条件またはor条件を設定する）。閾値y₁,y₂,y₃の大きさや条件は、実験やシミュレーションにより、伝送に先立ち予め適切なものを求めておけばよい。なお、固有値λ(1)からλ(N')までの総和xが1以上となると、対角行列Λを適切に近似することができないため、総和xが1未満となるようにN'を設定する(図１のＳ３及びＳ６参照)。 Note that the threshold and N ′ may be set to any value as long as Expression (9) is satisfied. For example, a threshold value y ₁ (0 <y ₁ <1) may be provided for the sum x ′ from eigenvalues λ (1) to λ (n ′) ((x ′ <y ₁ of S4 in FIG. 1). )), Or a threshold value y ₂ (1 ≦ y ₂ ≦ N) may be provided for the number n ′ of eigenvalues from eigenvalues λ (1) to λ (n ′) (in S4 of FIG. 1). (equivalent to n ′ <y ₂ ?)), a threshold value y ₃ (0 <y ₃ <1) may be provided for each eigenvalue λ (n ′) ((λ (n ′) of S4 in FIG. 1). > y ₃ ?)). Furthermore, these conditions may be combined (for example, an and condition or an condition is set for at least two conditions described above). The magnitudes and conditions of the thresholds y ₁ , y ₂ , and y ₃ may be determined in advance prior to transmission through experiments and simulations. Note that if the sum x from the eigenvalues λ (1) to λ (N ') is 1 or more, the diagonal matrix Λ cannot be approximated appropriately, so N' is set so that the sum x is less than 1. (Refer to S3 and S6 in FIG. 1).

式(6)のN行N列の行列Eの代わりにN行N'列の行列E'を用いることで、次式により、L行N列の行列μから次元を削減したL行N'列の行列μ'を得ることができる（式(10)）。 By using the N-row N'-column matrix E 'instead of the N-row N-column matrix E in Equation (6), the L-row N'-column is reduced in dimension from the L-row N-column matrix μ by the following equation: Can be obtained (Equation (10)).

ここから、L行N列の行列Mの近似であるL行N列の行列M'を次式(11)のように求めることができる。 From this, a matrix M ′ of L rows and N columns, which is an approximation of the matrix M of L rows and N columns, can be obtained as in the following equation (11).

観測信号行列Sの近似値行列であるS'は、式(12)によって求めることができる。 S ′, which is an approximate value matrix of the observation signal matrix S, can be obtained by Expression (12).

ここで、要素s'_n(l)は次式(13)となる。 Here, the element s ′ _n (l) is expressed by the following equation (13).

観測信号行列Sとその近似値行列S'の誤差をRと定義する（式(14)） The error between the observed signal matrix S and its approximate value matrix S 'is defined as R (Equation (14))

ここで、誤差Rは、観測信号行列Sと同様にL行N列の行列であるが、観測信号行列Sとその近似値行列S'との差分であり、誤差Rの各要素は小さな値となる。そのため、誤差Rの各要素は、観測信号行列Sの各要素よりも少ないビット数で表現することができ、観測信号行列Sを送るよりも伝送量を削減することができる。以上から、元の観測信号行列Sを伝送する代わりに、μ',E',R,およびσ²=[σ_{s_1s_1} σ_{s_2 s_2} … σ_{s_n s_n}]を伝送することで、伝送量を削減することが可能となる。 Here, the error R is a matrix of L rows and N columns like the observation signal matrix S, but is a difference between the observation signal matrix S and its approximate value matrix S ′, and each element of the error R is a small value. Become. Therefore, each element of the error R can be expressed with a smaller number of bits than each element of the observation signal matrix S, and the transmission amount can be reduced as compared with sending the observation signal matrix S. From the above, instead of transmitting the original observed signal matrix S, μ ′, E ′, R, and σ ² = [σ _{s_1s_1} σ _{s_2 s_2} … σ _{s_n s_n} ] are transmitted to reduce the transmission amount Is possible.

符号化ステップでは、観測信号行列Sを用いてμ',E',R,およびσ²を算出し、観測信号行列Sに代えて、μ',E',R,およびσ²を伝送する。 In the encoding step, μ ′, E ′, R, and σ ² are calculated using the observed signal matrix S, and μ ′, E ′, R, and σ ² are transmitted instead of the observed signal matrix S.

復号ステップでは、μ',E',R,およびσ²を受け取り、これらの値を用いて、次式(15)により、観測信号行列Sを復号する。 In the decoding step, μ ′, E ′, R, and σ ² are received, and using these values, the observed signal matrix S is decoded by the following equation (15).

ただし、「.*」は、行列の要素同士の乗算を表す演算子であり、 However, ". *" Is an operator that represents the multiplication of matrix elements.

である。また、 It is. Also,

である。つまり、Σ²は、L個のσ²を要素として持つ行列である。 It is. That is, Σ ² is a matrix having L σ ² as elements.

以下、上述の伝送方法を実現する構成について説明する。 Hereinafter, a configuration for realizing the above transmission method will be described.

＜第一実施形態に係る伝送システム１０＞
図２は第一実施形態に係る伝送システム１０の機能図を示す。 <Transmission system 10 according to the first embodiment>
FIG. 2 shows a functional diagram of the transmission system 10 according to the first embodiment.

伝送システム１０は、符号化装置１００と復号装置２００とを含む。符号化装置１００と復号装置２００とは、通信回線１１を介して通信可能とされている。符号化装置１００は、N個のマイクロホン１２−ｎでそれぞれ収音したN個の観測信号s_nを入力とし、符号(μ',E',R,σ²)を生成し、通信回線１１を介して復号装置２００に伝送する。ただし、n=1,2,…,Nであり、s_n=[s_n(0) s_n(1) … s_n(L-1)]である。復号装置２００は、符号(μ',E',R,σ²)を受け取り、この符号を用いてN個の観測信号s_nを復号する。例えば、復号装置２００は、N個のスピーカ１３−ｎにそれぞれN個の観測信号s_nを出力し、N個のスピーカ１３−ｎはそれぞれN個の観測信号s_nを再生し、N個のマイクロホン１２−ｎが配置された音場を再現する。 The transmission system 10 includes an encoding device 100 and a decoding device 200. The encoding device 100 and the decoding device 200 can communicate with each other via the communication line 11. Encoding apparatus 100 inputs the N number of N picked up respectively by the microphone 12-n of the observation signal s _n, code (μ ', E', R , σ 2) generates, the communication line 11 To the decoding device 200. However, n = 1, 2,..., N, and s _n = [s _n (0) s _n (1)... S _n (L−1)]. Decoding device 200 receives a code (μ ', E', R , σ 2), to decode the N pieces of observed signal s _n with the sign. For example, decoding device 200, respectively to the N speakers 13-n and outputs N observation signals s _n, N speakers 13-n plays the N pieces of observed signal s _n respectively of N The sound field where the microphone 12-n is arranged is reproduced.

＜符号化装置１００＞
図３は符号化装置１００の機能ブロック図を、図４はその処理フローを示す。 <Encoder 100>
FIG. 3 is a functional block diagram of the encoding apparatus 100, and FIG. 4 shows a processing flow thereof.

符号化装置１００は、正規化部１１０、分散計算部１２０、カルーネン・レーベ変換部１３０、並び替え部１４０、打切り部１５０、次元削減部１６０及び誤差算出部１７０を含む。 The encoding apparatus 100 includes a normalization unit 110, a variance calculation unit 120, a Karoonen-Labe conversion unit 130, a rearrangement unit 140, a truncation unit 150, a dimension reduction unit 160, and an error calculation unit 170.

＜正規化部１１０＞
正規化部１１０は、N個のマイクロホン１２−ｎでそれぞれ収音したN個の観測信号s_nを受け取り、これらの値に基づき式(1')に従って観測信号行列Sを生成し、分散計算部１２０及び誤差算出部１７０に出力する。 <Normalization unit 110>
Normalizing unit 110 receives the N number of N picked up respectively by the microphone 12-n of the observation signal s _n, and generates an observation signal matrix S according formula based on these values (1 '), the variance calculator 120 and the error calculation unit 170.

また、正規化部１１０は、N個の観測信号s_nを用いて、式(2),(3)に従って行列Mを生成し、カルーネン・レーベ変換部１３０、打切り部１５０、次元削減部１６０に出力する。 Further, the normalization unit 110, using N observation signals s _n, formula (2), to generate a matrix M according to (3), Karhunen-Loeve transform unit 130, abort section 150, the dimension reduction unit 160 Output.

行列Mの各要素m_n(l)は、各観測信号s_n(l)と各観測信号s_nの平均s^- _n(例えば時間平均)との差分を各観測信号s_nの自己共分散σ_{s_ns_n}で正規化した値である。 Each element m _n matrix M (l), the average s of each observed signal s _n and the observed signal s _n (l) ^- _n (eg, time average) auto-covariance of the observed signal s _n a difference between σ It is a value normalized by _{s_ns_n} .

＜分散計算部１２０＞
分散計算部１２０は、観測信号行列Sを受け取り、各観測信号s_nの分散σ_{s_ns_n}を計算し（Ｓ１２０）、分散σ=[σ_{S_1S_1} σ_{S_2S_2} … σ_{S_NS_N}]を符号化装置１００の出力値として出力するとともに、誤差算出部１７０に出力する。 <Distributed calculation unit 120>
Variance calculator 120 receives the observed signal matrix S, the variance sigma _{S_ns_n} of each observed signal s _n calculated (S120), variance _{_{σ = [σ S_1S_1 σ S_2S_2 ...}} σ S_NS_N] as the output value of the encoding apparatus 100 At the same time, it is output to the error calculator 170.

＜カルーネン・レーベ変換部１３０＞
カルーネン・レーベ変換部１３０は、行列Mを受け取り、カルーネン・レーベ変換し（Ｓ１３０）、式(4)により表される正方行列M^TMを得、並び替え部１４０に出力する。 <Kalunen-Labe conversion unit 130>
The Karoonen-Loeve transform unit 130 receives the matrix M, performs the Karoonen-Labe transform (S130), obtains a square matrix M ^T M expressed by Equation (4), and outputs the square matrix M ^T M to the rearrangement unit 140.

＜並び替え部１４０＞
並び替え部１４０は、正方行列M^TMを受け取り、固有値分解し(式(5))、正方行列M^TMのN個の固有値λ_n及び固有ベクトルe_nを得る。 <Sort section 140>
Rearranging unit 140 receives the square matrix M ^T M, eigenvalue decomposition (Equation (5)) to give the N eigenvalues lambda _n and eigenvectors e _n square matrix M ^T M.

さらに、並び替え部１４０は、固有値λ_nに関して降順となるように並び替え（Ｓ１４０）、並び替え後のN個の固有値λ(1),λ(2),…,λ(N)（ただし、λ(1)＞λ(2)＞…＞λ(N)）を対角要素とする対角行列Λ=diag(λ(1),λ(2),…,λ(N))とN個の固有ベクトルe(1),e(2),…,e(N)からなるL行N列の行列E=[e(1) e(2) … e(N)]を打切り部１５０に出力する。なお、固有ベクトルe(n)は固有値λ(n)に対応する固有ベクトルである。 Further, the rearrangement unit 140 rearranges the eigenvalues λ _n so as to be in descending order (S140), and the N eigenvalues λ (1), λ (2),. Diagonal matrix Λ = diag (λ (1), λ (2), ..., λ (N)) with N as λ (1)> λ (2)>...> λ (N)) L = N matrix E = [e (1) e (2)... E (N)] consisting of eigenvectors e (1), e (2),. . The eigenvector e (n) is an eigenvector corresponding to the eigenvalue λ (n).

＜打切り部１５０＞
打切り部１５０は、並び替え後のN個の固有値λ(n)を対角要素とする対角行列Λと、並び替え後のN個の固有ベクトルe(1),e(2),…,e(N)からなるL行N列の行列E=[e(1) e(2) … e(N)]を受け取り、所定の閾値yと固有値λ(n')に対応する値との大小関係に基づき、N'を設定し（図１のＳ６参照）、N'個の固有値λ(n')にそれぞれ対応するN'個の固有ベクトルe(n')のみからなる行列E'=[e(1) e(2) … e(N')]を得（Ｓ１５０）、符号化装置１００の出力値として出力するとともに、次元削減部１６０及び誤差算出部１７０に出力する。ただし、N'<N、n'=1,2,…,N'である。固有値λ(n')に対応する値とは、例えば、(i)固有値λ(1)からλ(n')までの総和x'、(ii)固有値λ(1)からλ(n')までの固有値の個数n'、(iii)固有値λ(n')自体、(iv)上述の(i)〜(iii)の組合せである(図１のＳ４参照)。所定の閾値yとは、例えば、図１のy₁,y₂,y₃またはそれらの組合せである。 <Cutoff part 150>
The truncation unit 150 includes a diagonal matrix Λ whose diagonal elements are N eigenvalues λ (n) after rearrangement, and N eigenvectors e (1), e (2),. An L-by-N matrix E = [e (1) e (2) ... e (N)] consisting of (N), and a magnitude relationship between a predetermined threshold value y and a value corresponding to the eigenvalue λ (n ′) N ′ is set based on (see S6 in FIG. 1), and a matrix E ′ = [e () consisting of only N ′ eigenvectors e (n ′) respectively corresponding to N ′ eigenvalues λ (n ′). 1) e (2)... E (N ′)] is obtained (S150), and is output as an output value of the encoding apparatus 100, and is also output to the dimension reduction unit 160 and the error calculation unit 170. However, N ′ <N, n ′ = 1, 2,..., N ′. The values corresponding to the eigenvalue λ (n ′) are, for example, (i) the sum x ′ from the eigenvalue λ (1) to λ (n ′), and (ii) from the eigenvalue λ (1) to λ (n ′). The number of eigenvalues n ′, (iii) the eigenvalue λ (n ′) itself, (iv) a combination of the above (i) to (iii) (see S4 in FIG. 1). The predetermined threshold value y is, for example, y ₁ , y ₂ , y _{3 in} FIG. 1 or a combination thereof.

なお、固有値λ(1)からλ(N')までの総和xが1以上となると、対角行列Λを適切に近似することができないため、総和xが1未満となるようにN'を設定する(図１のＳ３及びＳ６参照)。また、N'<NとなるようにN'を設定する(図１のＳ３及びＳ６参照)。 Note that if the sum x from the eigenvalues λ (1) to λ (N ') is 1 or more, the diagonal matrix Λ cannot be approximated appropriately, so N' is set so that the sum x is less than 1. (Refer to S3 and S6 in FIG. 1). Also, N ′ is set so that N ′ <N (see S3 and S6 in FIG. 1).

＜次元削減部１６０＞
次元削減部１６０は、行列Mと行列E'とを受け取り、式(10)に従って、行列Mと行列E'との積である行列μ'=ME'を得（Ｓ１６０）、符号化装置１００の出力値として出力するとともに、誤差算出部１７０に出力する。 <Dimension reduction unit 160>
The dimension reduction unit 160 receives the matrix M and the matrix E ′, obtains a matrix μ ′ = ME ′, which is a product of the matrix M and the matrix E ′, according to the equation (10) (S160), and While outputting as an output value, it outputs to the error calculation part 170.

＜誤差算出部１７０＞
誤差算出部１７０は、観測信号行列Sと行列μ'と行列E'と分散σ²とを受け取り、これらの用いて、式(11)〜(13)に従い、観測信号行列Sの近似値行列S'を求める。 <Error calculation unit 170>
The error calculation unit 170 receives the observation signal matrix S, the matrix μ ′, the matrix E ′, and the variance σ ^2, and uses them to approximate the observation signal matrix S according to the equations (11) to (13). Ask for '.

さらに、誤差算出部１７０は、式(14)に従い、観測信号行列Sと近似値行列S'との差分を誤差Rとして求め（Ｓ１７０）、符号化装置１００の出力値として出力する。 Further, the error calculation unit 170 obtains a difference between the observed signal matrix S and the approximate value matrix S ′ as an error R according to the equation (14) (S170), and outputs it as an output value of the encoding device 100.

符号化装置１００は、図示しない送信部からフレーム毎に符号(μ',E',R,σ²)を伝送する。 The encoding apparatus 100 transmits a code (μ ′, E ′, R, σ ² ) for each frame from a transmission unit (not shown).

＜復号装置２００＞
図５は復号装置２００の機能ブロック図を、図６はその処理フローを示す。 <Decoding device 200>
FIG. 5 shows a functional block diagram of the decoding apparatus 200, and FIG. 6 shows a processing flow thereof.

復号装置２００は、復号部２１０、音信号生成部２２０を含む。復号部２１０は、近似行列計算部２１１、観測信号予測部２１２を含む。 The decoding device 200 includes a decoding unit 210 and a sound signal generation unit 220. The decoding unit 210 includes an approximate matrix calculation unit 211 and an observation signal prediction unit 212.

＜復号部２１０＞
復号部２１０は、符号(μ',E',R,σ²)を受け取り、これらの値を用いて、次式により、観測信号行列Sを復号し（Ｓ２１０）、音信号生成部２２０に出力する。 <Decoding unit 210>
The decoding unit 210 receives the code (μ ′, E ′, R, σ ² ), and uses these values to decode the observation signal matrix S according to the following equation (S210) and outputs it to the sound signal generation unit 220. To do.

である。例えば以下のように観測信号行列Sを復号する。 It is. For example, the observation signal matrix S is decoded as follows.

復号部２１０の近似行列計算部２１１は、行列μ'と行列E'とを用いて、式(11)に従い、行列Mの近似行列M'を求め（Ｓ２１１）、観測信号予測部２１２に出力する。 The approximate matrix calculation unit 211 of the decoding unit 210 obtains an approximate matrix M ′ of the matrix M using the matrix μ ′ and the matrix E ′ according to the equation (11) (S211), and outputs it to the observation signal prediction unit 212. .

復号部２１０の観測信号予測部２１２は、近似行列M'と誤差Rと分散σを受け取り、近似行列M'と分散σとを用いて、式(12),(13)に従い、観測信号行列Sの近似値行列S'を求める。 The observation signal prediction unit 212 of the decoding unit 210 receives the approximation matrix M ′, the error R, and the variance σ, and uses the approximation matrix M ′ and the variance σ to observe the observation signal matrix S according to the equations (12) and (13). An approximate value matrix S ′ is obtained.

さらに、観測信号予測部２１２は、近似値行列S'と誤差Rとを用いて、次式により、観測信号行列Sを求め（Ｓ２１２）、音信号生成部２２０に出力する。 Further, the observation signal prediction unit 212 uses the approximate value matrix S ′ and the error R to obtain the observation signal matrix S according to the following equation (S212) and outputs it to the sound signal generation unit 220.

＜音信号生成部２２０＞
音信号生成部２２０は、観測信号行列Sを受け取り、各スピーカ１３−ｎの観測信号（再生信号）s_nに分離し、それぞれ各スピーカ１３−ｎに出力する。 <Sound signal generation unit 220>
The sound signal generator 220 receives the observed signal matrix S, separates the observed signal (reproduced signal) s _n of the speakers 13-n, respectively and outputs to the speaker 13-n.

各スピーカ１３−ｎは、観測信号（再生信号）s_nを受け取り、再生する。 Each speaker 13-n receives the observed signal (reproduced signal) s _n, to play.

＜効果＞
以上の構成により、音空間情報を効率的に伝送することができる。誤差Rは、観測信号行列Sと同様にL行N列の行列であるが、観測信号行列Sとその近似値行列S'との差分であり、誤差Rの各要素は小さな値となる。そのため、誤差Rの各要素は、観測信号行列Sの各要素よりも少ないビット数で表現することができ、観測信号行列Sを送るよりも伝送量を削減することができる。以上から、元の観測信号行列Sを伝送する代わりに、符号(μ',E',R,σ²)を伝送することで、伝送量を削減することが可能となる。 <Effect>
With the above configuration, sound space information can be efficiently transmitted. The error R is a matrix of L rows and N columns similarly to the observation signal matrix S, but is a difference between the observation signal matrix S and its approximate value matrix S ′, and each element of the error R has a small value. Therefore, each element of the error R can be expressed with a smaller number of bits than each element of the observation signal matrix S, and the transmission amount can be reduced as compared with sending the observation signal matrix S. From the above, it is possible to reduce the transmission amount by transmitting the code (μ ′, E ′, R, σ ² ) instead of transmitting the original observation signal matrix S.

＜その他の変形例＞
本発明は上記の実施形態及び変形例に限定されるものではない。例えば、上述の各種の処理は、記載に従って時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。その他、本発明の趣旨を逸脱しない範囲で適宜変更が可能である。 <Other variations>
The present invention is not limited to the above-described embodiments and modifications. For example, the various processes described above are not only executed in time series according to the description, but may also be executed in parallel or individually as required by the processing capability of the apparatus that executes the processes. In addition, it can change suitably in the range which does not deviate from the meaning of this invention.

＜プログラム及び記録媒体＞
また、上記の実施形態及び変形例で説明した各装置における各種の処理機能をコンピュータによって実現してもよい。その場合、各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、上記各装置における各種の処理機能がコンピュータ上で実現される。 <Program and recording medium>
In addition, various processing functions in each device described in the above embodiments and modifications may be realized by a computer. In that case, the processing contents of the functions that each device should have are described by a program. Then, by executing this program on a computer, various processing functions in each of the above devices are realized on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。コンピュータで読み取り可能な記録媒体としては、例えば、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリ等どのようなものでもよい。 The program describing the processing contents can be recorded on a computer-readable recording medium. As the computer-readable recording medium, for example, any recording medium such as a magnetic recording device, an optical disk, a magneto-optical recording medium, and a semiconductor memory may be used.

また、このプログラムの流通は、例えば、そのプログラムを記録したＤＶＤ、ＣＤ−ＲＯＭ等の可搬型記録媒体を販売、譲渡、貸与等することによって行う。さらに、このプログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することにより、このプログラムを流通させてもよい。 The program is distributed by selling, transferring, or lending a portable recording medium such as a DVD or CD-ROM in which the program is recorded. Further, the program may be distributed by storing the program in a storage device of the server computer and transferring the program from the server computer to another computer via a network.

このようなプログラムを実行するコンピュータは、例えば、まず、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、一旦、自己の記憶部に格納する。そして、処理の実行時、このコンピュータは、自己の記憶部に格納されたプログラムを読み取り、読み取ったプログラムに従った処理を実行する。また、このプログラムの別の実施形態として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよい。さらに、このコンピュータにサーバコンピュータからプログラムが転送されるたびに、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。また、サーバコンピュータから、このコンピュータへのプログラムの転送は行わず、その実行指示と結果取得のみによって処理機能を実現する、いわゆるＡＳＰ（Application Service Provider）型のサービスによって、上述の処理を実行する構成としてもよい。なお、プログラムには、電子計算機による処理の用に供する情報であってプログラムに準ずるもの（コンピュータに対する直接の指令ではないがコンピュータの処理を規定する性質を有するデータ等）を含むものとする。 A computer that executes such a program first stores, for example, a program recorded on a portable recording medium or a program transferred from a server computer in its storage unit. When executing the process, this computer reads the program stored in its own storage unit and executes the process according to the read program. As another embodiment of this program, a computer may read a program directly from a portable recording medium and execute processing according to the program. Further, each time a program is transferred from the server computer to the computer, processing according to the received program may be executed sequentially. Also, the program is not transferred from the server computer to the computer, and the above-described processing is executed by a so-called ASP (Application Service Provider) type service that realizes the processing function only by the execution instruction and result acquisition. It is good. Note that the program includes information provided for processing by the electronic computer and equivalent to the program (data that is not a direct command to the computer but has a property that defines the processing of the computer).

また、コンピュータ上で所定のプログラムを実行させることにより、各装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 In addition, although each device is configured by executing a predetermined program on a computer, at least a part of these processing contents may be realized by hardware.

Claims

N is any integer of 2 or more, and is a transmission system for transmitting an observation signal matrix S collected by N microphones from an encoding device to a decoding device,
The encoding device includes:
n = 1, 2, ..., and N, and each of the observed signal s _n of each microphone, the average s of each observed signal s _n ^- regular self-covariance sigma _{S_ns_n} of each observed signal the difference between _n s _n A normalization unit that generates a matrix M whose elements are generalized values m _n ,
A reordering unit that reorders the N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M in descending order with respect to the eigenvalues λ _n ;
N ′ <N, n ′ = 1, 2,..., N ′, and the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N), λ (n) Let e (n) be the eigenvector corresponding to, and based on the magnitude relationship between the predetermined threshold and the value corresponding to the eigenvalue λ (n ′), N ′ pieces of N ′ corresponding to N ′ eigenvalues λ (n ′) respectively A truncation unit for obtaining a matrix E ′ = [e (1) e (2)... E (N ′)] consisting only of eigenvectors e (n ′);
An approximate value matrix of the observed signal matrix S using a variance σ _{s_ns_n} of each of the N observed signals s _n , a matrix μ ′ that is a product of the matrix M and the matrix E ′, and the matrix E ′ An error calculator that calculates S ′ and calculates the difference between the observed signal matrix S and the approximate value matrix S ′ as an error R, and
The decoding device
A decoding unit for decoding the observed signal matrix S using the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n} and the error R;
Transmission system.

An encoding device that generates a code corresponding to an observation signal matrix S collected by N microphones, where N is any integer of 2 or more,
n = 1, 2, ..., and N, and each of the observed signal s _n, the time average s of each observed signal s _n ^- normalized by auto-covariance sigma _{S_ns_n} of the difference between _n each observed signal s _n A normalization unit that generates a matrix M having elements of value m _n ,
A reordering unit that reorders the N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M in descending order with respect to the eigenvalues λ _n ;
N ′ <N, n ′ = 1, 2,..., N ′, and the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N), λ (n) An eigenvector corresponding to e (n), and a truncation unit for obtaining a matrix E ′ consisting only of eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′) that are equal to or less than a predetermined threshold value;
An approximate value matrix of the observed signal matrix S using a variance σ _{s_ns_n} of each of the N observed signals s _n , a matrix μ ′ that is a product of the matrix M and the matrix E ′, and the matrix E ′ An error calculation unit that calculates S ′ and calculates a difference between the observed signal matrix S and the approximate value matrix S ′ as an error R.
Encoding device.

The encoding device according to claim 2, comprising:
M ′ = μ ′ (E ′) ^T , each element of M ′ is m ′ _n , each approximate value of each observation signal s _n is s ′ _n , and the error calculation unit is
s ' _n = m' _n σ _{s_ns_n}
To obtain the approximate value matrix S ′ = [s ′ ₁ s ′ ₂ ... S ′ _N ],
Encoding device.

N is any integer of 2 or more, receives a code corresponding to an observation signal matrix S collected by N microphones, and decodes the observation signal matrix S,
n = 1,2, ..., and N, and each of the observed signal s _n, times the average s of each observed signal s _n ^- _n value difference normalized with the auto-covariance sigma _{S_ns_n} of each observed signal s _n and m A matrix having _n as an element is M, and N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M are rearranged so as to be in descending order with respect to the eigenvalue λ _n. (n), the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N), the eigenvector corresponding to λ (n) is e (n), and N ′ <N, n ′ = 1, 2,..., N ′, and E ′ a matrix consisting only of eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′) that are equal to or less than a predetermined threshold value, An approximate value matrix of the observed signal matrix S obtained by using the variance σ _{s_ns_n} of each of the N observed signals s _n , the matrix μ ′ = ME ′, and the matrix E ′ is defined as S ′, and the observed signal matrix S and The difference from the approximate value matrix S ′ is the error R,
A decoding unit that decodes the observed signal matrix S using the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n,} and the error R;
Decoding device.

N is any integer of 2 or more, and is a transmission method for transmitting an observation signal matrix S collected by N microphones from an encoding device to a decoding device,
n = 1, 2, ..., and N, the coding apparatus, and the observed signal s _n of each microphone, the average s of each observed signal s _n ^- self of each observed signal the difference between _n s _n A normalization step for generating a matrix M having elements m _n normalized by the covariance σ _{s_ns_n} ;
The encoding apparatus, the N eigenvalues lambda _n and eigenvectors e _n square matrix M ^T M, and rearranging step for rearranging so that descending with respect to the eigenvalue lambda _n,
N ′ <N, n ′ = 1, 2,..., N ′, and the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N), λ (n) The eigenvector corresponding to e (n) is set to e (n), and the encoding device determines each of N ′ eigenvalues λ (n ′) based on a magnitude relationship between a predetermined threshold and a value corresponding to the eigenvalue λ (n ′). A truncation step to obtain a matrix E ′ = [e (1) e (2)... E (N ′)] consisting only of the corresponding N ′ eigenvectors e (n ′);
The encoding apparatus, the variance sigma _{S_ns_n} of N each observed signal s _n, a 'matrix μ is the product of' the matrix E and the matrix M, the observed signal using said matrix E ' Obtaining an approximate value matrix S ′ of the matrix S, and calculating an error R as a difference between the observed signal matrix S and the approximate value matrix S ′;
The decoding apparatus includes a decoding step of decoding the observed signal matrix S using the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n} and the error R,
Transmission method.

An encoding method for generating a code corresponding to an observation signal matrix S collected by N microphones, where N is any integer of 2 or more,
n = 1, 2, ..., and N, normalization section, and each of the observed signal s _n, the time of each observation signal s _n Mean s ^- wherein a difference between _n each observed signal s _n autocovariance σ of a normalization step for generating a matrix M having elements m _n normalized by _{s_ns_n} ;
Rearranging portion, a N eigenvalues lambda _n and eigenvectors e _n square matrix M ^T M, and rearranging step for rearranging so that descending with respect to the eigenvalue lambda _n,
N ′ <N, n ′ = 1, 2,..., N ′, and the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N), λ (n) The eigenvector corresponding to is e (n), and the truncation unit obtains a matrix E ′ consisting only of eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′) that are equal to or less than a predetermined threshold. When,
Error calculation section, and the variance sigma _{S_ns_n} of N each observed signal s _n, a 'matrix μ is the product of' the matrix E and the matrix M, the observed signal using said matrix E 'Toto Calculating an approximate value matrix S ′ of the matrix S, and calculating an error R as a difference between the observed signal matrix S and the approximate value matrix S ′.
Encoding method.

N is any integer of 2 or more, receives a code corresponding to the observed signal matrix S collected by N microphones, and decodes the observed signal matrix S,
n = 1,2, ..., and N, and each of the observed signal s _n, times the average s of each observed signal s _n ^- _n value difference normalized with the auto-covariance sigma _{S_ns_n} of each observed signal s _n and m A matrix having _n as an element is M, and N eigenvalues λ _n and eigenvectors e _n of the square matrix M ^T M are rearranged so as to be in descending order with respect to the eigenvalue λ _n. (n), the rearranged eigenvalues are λ (n), λ (1)> λ (2)>...> λ (N), the eigenvector corresponding to λ (n) is e (n), and N ′ <N, n ′ = 1, 2,..., N ′, and E ′ a matrix consisting only of eigenvectors e (n ′) corresponding to N ′ eigenvalues λ (n ′) that are equal to or less than a predetermined threshold value, An approximate value matrix of the observed signal matrix S obtained by using the variance σ _{s_ns_n} of each of the N observed signals s _n , the matrix μ ′ = ME ′, and the matrix E ′ is defined as S ′, and approximated to the observed signal matrix S The difference from the value matrix S ′ is the error R,
The decoding unit includes a decoding step of decoding the observation signal matrix S using the matrix μ ′, the matrix E ′, the N variances σ _{s_ns_n} and the error R,
Decryption method.

A program for causing a computer to function as the encoding device according to claim 2 or claim 3 or the decoding device according to claim 4.