JP2017212552A

JP2017212552A - Channel number converter and program thereof

Info

Publication number: JP2017212552A
Application number: JP2016103664A
Authority: JP
Inventors: 岳大杉本; Takehiro Sugimoto; 大出　訓史; Norifumi Oide; 訓史大出; 一穂小野; Kazuo Ono; 陽佐々木; Akira Sasaki; 小森　智康; Tomoyasu Komori; 智康小森; 北島　周; Shu Kitajima; 周北島
Original assignee: Nippon Hoso Kyokai NHK; Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2016-05-24
Filing date: 2016-05-24
Publication date: 2017-11-30
Anticipated expiration: 2036-05-24
Also published as: JP6684651B2

Abstract

PROBLEM TO BE SOLVED: To provide a channel number converter performing appropriate channel number conversion of each program along the content thereof.SOLUTION: A channel number converter 10 includes a weighting coefficient calculation unit 13 for receiving a multichannel sound signal and a reference signal corresponding to the content thereof, and calculating the weighting coefficient of each channel signal of the multichannel sound signal contained in the reference signal, a correction coefficient calculation unit 14 for calculating correction coefficients for correcting each channel signal of the multichannel sound signal based on the weighting coefficients, a correction coefficient application unit 15 for applying the correction coefficients to the multichannel sound signal, and a channel number conversion unit 17 for converting the multichannel sound signal, to which the correction coefficients are applied, into a reproduction channel signal of desired channel number, by a predetermined channel number conversion method.SELECTED DRAWING: Figure 1

Description

本発明は、マルチチャンネル音声信号のチャンネル数を変換するチャンネル数変換装置およびそのプログラムに関する。 The present invention relates to a channel number conversion device that converts the number of channels of a multi-channel audio signal and a program thereof.

現在、２２．２ｃｈなどのマルチチャンネル音声放送（非特許文献１）の実用化が進められている。マルチチャンネル音声放送により、高い臨場感を持った音声の再生を実現することができる。しかし、一般的な家庭の場合、例えば、２ｃｈステレオ等、２２．２ｃｈより少ないチャンネル数のみを再生可能な環境である場合が多いと想定される。このため、マルチチャンネル音声放送を家庭で再生するには、家庭で再生できるチャンネル数に合わせて、音声信号のチャンネル数を変換する必要がある。このような場合、一般的にダウンミックスやレンダリング等を行って再生環境に応じたチャンネル数に変換する技術が知られている（非特許文献１、２）。 Currently, the practical use of multi-channel audio broadcasting (Non-Patent Document 1) such as 22.2ch is being promoted. With multi-channel audio broadcasting, it is possible to realize audio reproduction with high presence. However, in the case of a general home, for example, it is assumed that there are many cases where it is possible to reproduce only the number of channels smaller than 22.2 ch, such as 2 ch stereo. For this reason, in order to reproduce multi-channel audio broadcasting at home, it is necessary to convert the number of channels of audio signals in accordance with the number of channels that can be reproduced at home. In such a case, a technique is generally known in which downmixing or rendering is performed to convert the number of channels according to the reproduction environment (Non-Patent Documents 1 and 2).

しかし、番組の内容に関わらず一意にチャンネル数を変換する一般的なチャンネル数変換方法では、チャンネル数変換後の音声信号が番組制作者の意図に沿ったものとならない可能性がある。これに対し、マルチチャンネル音声放送を実施する場合には、番組制作者が２ｃｈステレオ用の番組を別に制作して、マルチチャンネルと２ｃｈステレオを同時に放送するいわゆるサイマル放送の実施が検討されている。 However, in a general channel number conversion method in which the number of channels is uniquely converted regardless of the contents of the program, there is a possibility that the audio signal after the channel number conversion does not conform to the intention of the program producer. On the other hand, when multi-channel audio broadcasting is performed, implementation of so-called simulcast where a program producer separately produces a program for 2ch stereo and simultaneously broadcasts multi-channel and 2ch stereo is being studied.

「デジタル放送における映像符号化、音声符号化及び多重化方式標準規格 VIDEO CODING, AUDIO CODING AND MULTIPLEXING SPECIFICATIONS FOR DIGITAL BROADCASTING ARIB STANDARD ARIB STD-B32 3.6版」，平成２８年（２０１６年）３月２５日，一般社団法人電波産業会“Video Coding, Audio Coding and Multiplexing SPECIFICATIONS FOR DIGITAL BROADCASTING ARIB STANDARD ARIB STD-B32 Version 3.6” in Digital Broadcasting, March 25, 2016 Japan Radio Industry Association ISO/IEC 23008-3,“Information technology High efficiency coding and media delivery in heterogeneous environments”ISO / IEC 23008-3, “Information technology High efficiency coding and media delivery in heterogeneous environments”

ところで、家庭環境によってスピーカ数やその配置は様々であり、各家庭での再生環境に応じたチャンネル数での放送を聴取したいというニーズがある。しかし、あらゆるチャンネル数、スピーカ配置に対応するサイマル放送を実施することは不可能である。また、各家庭での再生環境（スピーカ数、スピーカ配置）に応じた一律なチャンネル数変換方法によってチャンネル数を変換した場合、番組ごとに、番組制作者の意図通りに変換を行うことは難しい。その為、サイマル放送が実施されたとしても、各家庭では、番組ごとに適切にチャンネル数変換された音声再生ができない可能性が高い。 By the way, the number of speakers and their arrangement vary depending on the home environment, and there is a need to listen to broadcasting on the number of channels corresponding to the playback environment in each home. However, it is impossible to carry out simultaneous broadcasting corresponding to any number of channels and speaker arrangements. Further, when the number of channels is converted by a uniform channel number conversion method corresponding to the reproduction environment (number of speakers, speaker arrangement) in each home, it is difficult to convert for each program as intended by the program producer. For this reason, even if simulcasting is carried out, there is a high possibility that each home will not be able to reproduce the sound with the appropriate number of channels converted for each program.

そこでこの発明は、上述の課題を解決することのできるチャンネル数変換装置およびそのプログラムを提供することを目的としている。 SUMMARY OF THE INVENTION Accordingly, an object of the present invention is to provide a channel number conversion device and a program thereof that can solve the above-described problems.

本発明の一態様によれば、チャンネル数変換装置は、マルチチャンネル音声信号と前記マルチチャンネル音声信号に対応する参照信号とを入力し、前記参照信号の各チャンネルに含まれる前記マルチチャンネル音声信号の各チャンネル信号に対応する重み付け係数をそれぞれ計算する重み付け係数計算部と、前記重み付け係数に基づいて前記マルチチャンネル音声信号の各チャンネル信号に乗じる補正係数を計算する補正係数計算部と、前記補正係数を前記マルチチャンネル音声信号に適用する補正係数適用部と、前記補正係数を適用したマルチチャンネル音声信号を、所定のチャンネル数変換方法によって、所望のチャンネル数の再生チャンネル信号に変換するチャンネル数変換部と、を備える。 According to an aspect of the present invention, the channel number conversion device receives a multi-channel audio signal and a reference signal corresponding to the multi-channel audio signal, and the multi-channel audio signal included in each channel of the reference signal. A weighting coefficient calculator for calculating a weighting coefficient corresponding to each channel signal, a correction coefficient calculator for calculating a correction coefficient to be multiplied to each channel signal of the multi-channel audio signal based on the weighting coefficient, and the correction coefficient A correction coefficient applying unit to be applied to the multi-channel audio signal; a channel number converting unit for converting the multi-channel audio signal to which the correction coefficient is applied into a reproduction channel signal having a desired number of channels by a predetermined channel number conversion method; .

本発明の一態様によれば、前記重み付け係数計算部は、前記参照信号と前記マルチチャンネル音声信号の各チャンネル信号との間の遅延を補正する遅延補正部、を備えてもよい。 According to an aspect of the present invention, the weighting coefficient calculation unit may include a delay correction unit that corrects a delay between the reference signal and each channel signal of the multi-channel audio signal.

本発明の一態様によれば、前記重み付け係数計算部は、前記マルチチャンネル音声信号と前記参照信号とを入力し、前記マルチチャンネル音声信号の各チャンネル信号に対する重み付け比を分析する重み付け比分析部、を備えてもよい。 According to an aspect of the present invention, the weighting coefficient calculator receives the multichannel audio signal and the reference signal, and analyzes a weighting ratio for each channel signal of the multichannel audio signal; May be provided.

本発明の一態様によれば、前記重み付け係数計算部は、前記参照信号の各チャンネル信号のエネルギーと、前記参照信号の各チャンネルに対応した前記マルチチャンネル音声信号の各チャンネル信号に前記重み付け比を乗じた信号のエネルギーの和とが等しくなるように前記重み付け比を補正する重み付け比補正部、を備えてもよい。 According to an aspect of the present invention, the weighting coefficient calculator calculates the weighting ratio for the energy of each channel signal of the reference signal and each channel signal of the multichannel audio signal corresponding to each channel of the reference signal. You may provide the weighting ratio correction | amendment part which correct | amends the said weighting ratio so that the sum of the energy of the multiplied signal may become equal.

本発明の一態様によれば、前記重み付け係数計算部は、マルチチャンネル音声信号の各チャンネル信号を、各チャンネル信号の類似度に基づいてグルーピングし、そのグループに所属する前記チャンネル信号に基づいて当該グループを代表するグループ信号を生成するグルーピング部、をさらに備え、前記重み付け係数計算部は、前記グループ信号についての重み付け係数を計算してもよい。 According to an aspect of the present invention, the weighting coefficient calculation unit groups each channel signal of the multi-channel audio signal based on the similarity of each channel signal, and based on the channel signal belonging to the group A grouping unit that generates a group signal representing a group may be further included, and the weighting coefficient calculation unit may calculate a weighting coefficient for the group signal.

本発明の一態様によれば、前記グルーピング部は、前記グループに所属する前記チャンネル信号に基づいて、前記チャンネル信号の平均、前記チャンネル信号の類似度の重心にあるチャンネル信号、前記チャンネル信号のうち最大のエネルギーを有するチャンネル信号の何れかを、前記グループ信号として生成してもよい。 According to an aspect of the present invention, the grouping unit includes, based on the channel signals belonging to the group, an average of the channel signals, a channel signal at the centroid of the similarity of the channel signals, and the channel signals Any one of the channel signals having the maximum energy may be generated as the group signal.

本発明の一態様によれば、前記重み付け係数計算部は、前記マルチチャンネル音声信号の各チャンネル信号の中から、前記参照信号との間の相互相関係数に基づいて１つまたは複数のチャンネル信号を選択する基準チャンネル信号選択部、をさらに備え、前記重み付け係数計算部は、前記選択されたチャンネル信号の重み付け係数が、それ以外のチャンネル信号の重み付け係数よりも大きくなることを拘束条件として、重み付け係数を計算してもよい。 According to an aspect of the present invention, the weighting coefficient calculation unit includes one or more channel signals based on a cross-correlation coefficient between the channel signals of the multi-channel audio signal and the reference signal. A reference channel signal selection unit that selects the weighting factor, and the weighting factor calculation unit performs weighting with a constraint that the weighting factor of the selected channel signal is larger than the weighting factors of the other channel signals. A coefficient may be calculated.

本発明の一態様によれば、前記重み付け係数計算部は、前記マルチチャンネル音声信号の各チャンネル信号の中から、前記参照信号との間の相互相関係数に基づいて１つまたは複数のチャンネル信号を選択する基準チャンネル信号選択部、をさらに備え、前記重み付け係数計算部は、前記基準チャンネル信号選択部が選択したチャンネル信号についてのみ重み付け係数を計算してもよい。 According to an aspect of the present invention, the weighting coefficient calculation unit includes one or more channel signals based on a cross-correlation coefficient between the channel signals of the multi-channel audio signal and the reference signal. A reference channel signal selection unit that selects the reference channel signal, and the weighting coefficient calculation unit may calculate the weighting coefficient only for the channel signal selected by the reference channel signal selection unit.

本発明の一態様によれば、前記補正係数計算部は、前記参照信号の全エネルギーまたは前記参照信号の各チャンネルに対応する前記マルチチャンネル音声信号の各チャンネル信号に前記重み付け係数を適用した信号のエネルギーの和と、前記マルチチャンネル音声信号の各チャンネル信号に前記補正係数を適用した信号のエネルギーの和とが等しくなるように前記補正係数を計算してもよい。 According to an aspect of the present invention, the correction coefficient calculation unit is configured to calculate a signal obtained by applying the weighting coefficient to each channel signal of the multi-channel audio signal corresponding to the total energy of the reference signal or each channel of the reference signal. The correction coefficient may be calculated so that the sum of energy is equal to the sum of energy of signals obtained by applying the correction coefficient to each channel signal of the multi-channel audio signal.

本発明の一態様によれば、前記補正係数適用部は、前記マルチチャンネル音声信号のチャンネル数に対応する補正係数の所定の初期値と、前記補正係数計算部が計算した補正係数であって前記参照信号のチャンネル数に対応する補正係数と、に基づいて、補間法により、前記再生チャンネル信号のチャンネル数に応じて前記補正係数計算部が計算した補正係数を修正してもよい。 According to an aspect of the present invention, the correction coefficient application unit includes a predetermined initial value of a correction coefficient corresponding to the number of channels of the multi-channel audio signal, and a correction coefficient calculated by the correction coefficient calculation unit, Based on the correction coefficient corresponding to the number of channels of the reference signal, the correction coefficient calculated by the correction coefficient calculation unit according to the number of channels of the reproduction channel signal may be corrected by an interpolation method.

本発明の一態様によれば、前記チャンネル数変換装置は、前記参照信号を、所定のチャンネル数変換方法によってモノ信号に変換するモノ信号変換部、をさらに備えてもよい。 According to an aspect of the present invention, the channel number conversion device may further include a mono signal conversion unit that converts the reference signal into a mono signal by a predetermined channel number conversion method.

本発明の一態様によれば、コンピュータを、上記の何れか１つに記載のチャンネル数変換装置、として機能させるためのプログラムである。 According to one aspect of the present invention, there is provided a program for causing a computer to function as the channel number conversion device described in any one of the above.

本発明のチャンネル数変換装置によれば、マルチチャンネル音声信号とそれより少ないチャンネル数の音声信号が同時に提供された場合に、任意のチャンネル数に制作意図に沿ったチャンネル数の変換を実現することができる。 According to the channel number conversion apparatus of the present invention, when a multi-channel audio signal and an audio signal having a smaller number of channels are simultaneously provided, conversion of the number of channels according to the production intention can be realized for an arbitrary number of channels. Can do.

本発明に係る第一実施形態におけるチャンネル数変換装置の一例を示すブロック図である。It is a block diagram which shows an example of the channel number converter in 1st embodiment which concerns on this invention. 本発明に係る第一実施形態における重み付け係数計算部の一例を示すブロック図である。It is a block diagram which shows an example of the weighting coefficient calculation part in 1st embodiment which concerns on this invention. 本発明に係る第一実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。It is a flowchart which shows an example of the channel number conversion process in 1st embodiment which concerns on this invention. 本発明に係る第二実施形態におけるチャンネル数変換装置の一例を示すブロック図である。It is a block diagram which shows an example of the channel number converter in 2nd embodiment which concerns on this invention. 本発明に係る第二実施形態における重み付け係数計算部の一例を示すブロック図である。It is a block diagram which shows an example of the weighting coefficient calculation part in 2nd embodiment which concerns on this invention. 本発明に係る第二実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。It is a flowchart which shows an example of the channel number conversion process in 2nd embodiment which concerns on this invention. 本発明に係る第三実施形態におけるチャンネル数変換装置の一例を示すブロック図である。It is a block diagram which shows an example of the channel number converter in 3rd embodiment which concerns on this invention. 本発明に係る第三実施形態における重み付け係数計算部の一例を示すブロック図である。It is a block diagram which shows an example of the weighting coefficient calculation part in 3rd embodiment which concerns on this invention. 本発明に係る第三実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。It is a flowchart which shows an example of the channel number conversion process in 3rd embodiment which concerns on this invention. 本発明に係る第四実施形態におけるチャンネル数変換装置の一例を示すブロック図である。It is a block diagram which shows an example of the channel number converter in 4th embodiment which concerns on this invention. 本発明に係る第四実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。It is a flowchart which shows an example of the channel number conversion process in 4th embodiment which concerns on this invention.

＜第一実施形態＞
以下、本発明の第一実施形態によるチャンネル数変換装置を図１〜図３を参照して説明する。
図１は、本発明に係る第一実施形態におけるチャンネル数変換装置の一例を示すブロック図である。図１に示すようにチャンネル数変換装置１０は、参照信号入力部１１、マルチチャンネル音声信号入力部１２、重み付け係数計算部１３、補正係数計算部１４、補正係数適用部１５、再生チャンネル情報取得部１６、チャンネル数変換部１７、記憶部１８と、を含む。
図１は、チャンネル数変換装置１０に２２．２ｃｈのマルチチャンネル音声信号と２ｃｈの参照信号を入力し、チャンネル数変換後の５．１ｃｈの再生チャンネル音声信号を出力する様子を示している。
チャンネル数変換装置１０は、所定のマルチチャンネル音声信号（ｎチャンネル）を、そのマルチチャンネル音声信号のチャンネル数よりも少ないチャンネル数の音声信号（ｌチャンネル）（以下、参照信号と呼ぶ）を参照して、所望のチャンネル数の再生音声信号（ｍチャンネル）に変換する装置である。以下、所定のマルチチャンネル音声信号として８ＫＳＨＶ用の２２．２ｃｈ音響システム、参照信号として２ｃｈステレオ、再生音声信号を５．１ｃｈの場合を例に説明を行う。しかし、マルチチャンネル音声信号、参照信号、再生音声信号の各チャンネル数は、この例のチャンネル数に限らない。また、チャンネル数変換装置１０は、コンピュータによって構成されており、例えば、テレビなどの放送受信機やホームシアターなどのメディアの再生装置に組み込まれていてもよい。 <First embodiment>
Hereinafter, a channel number conversion apparatus according to a first embodiment of the present invention will be described with reference to FIGS.
FIG. 1 is a block diagram showing an example of a channel number conversion apparatus according to the first embodiment of the present invention. As shown in FIG. 1, the channel number conversion apparatus 10 includes a reference signal input unit 11, a multi-channel audio signal input unit 12, a weighting coefficient calculation unit 13, a correction coefficient calculation unit 14, a correction coefficient application unit 15, and a reproduction channel information acquisition unit. 16, a channel number conversion unit 17, and a storage unit 18.
FIG. 1 shows a state where a 22.2 ch multi-channel audio signal and a 2 ch reference signal are input to the channel number converter 10 and a 5.1 ch playback channel audio signal after the channel number conversion is output.
The channel number converter 10 refers to a predetermined multi-channel audio signal (n channel) with an audio signal (l channel) having a channel number smaller than the number of channels of the multi-channel audio signal (hereinafter referred to as a reference signal). Thus, it is a device that converts it into a reproduced audio signal (m channels) of a desired number of channels. In the following, description will be given by taking as an example the case where the predetermined multi-channel audio signal is a 22.2ch acoustic system for 8K SHV, the reference signal is 2ch stereo, and the reproduced audio signal is 5.1ch. However, the number of channels of the multi-channel audio signal, reference signal, and reproduced audio signal is not limited to the number of channels in this example. The channel number conversion device 10 is configured by a computer, and may be incorporated in a broadcast receiver such as a television or a media playback device such as a home theater, for example.

以下に、チャンネル数変換装置１０について、詳細に説明する。
参照信号入力部１１は、参照信号を入力する。例えば、参照信号入力部１１は、サイマル放送で放送された２２．２ｃｈのマルチチャンネル音声信号と２ｃｈステレオの音声信号のうち２ｃｈステレオの音声信号を入力する。
マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を入力する。例えば、サイマル放送で放送された２２．２ｃｈの音声信号を入力する。
サイマル放送で放送される各チャンネル数に対応する音声信号は、それぞれ専用の技術者が作成してもよい。例えば、２２．２ｃｈの音声信号については、２２．２ｃｈの専用の技術者が、２２．２ｃｈに対応するスピーカ配置等の再生環境によって、放送内容に適した（番組作成者の意図が反映された）３次元の音が再現されるように各チャンネルの音声信号を作成する。一方、２ｃｈの音声信号については、２ｃｈの専用の技術者が、２ｃｈの再生環境によって放送内容に適した音声が再現されるように２つのチャンネルそれぞれの音声信号を作成する。このとき作成される２２．２ｃｈと２ｃｈの各音声信号は、番組によって表現したい内容に適するように作成される。例えばナレーションが支配的なドキュメンタリー番組と音楽番組とでは、同じ２ｃｈであっても、各チャンネルに対して作成される音声信号の性質が異なる。従って、２２．２ｃｈの音声信号と２ｃｈの音声信号との関係は、２２．２ｃｈの音声信号から２ｃｈの音声信号へ、所定の（一種類の）ダウンミックス係数によって変換できる関係であるとは限らない。 Hereinafter, the channel number conversion apparatus 10 will be described in detail.
The reference signal input unit 11 inputs a reference signal. For example, the reference signal input unit 11 inputs a 2ch stereo audio signal among 22.2 ch multichannel audio signals and 2ch stereo audio signals broadcast by simulcast.
The multichannel audio signal input unit 12 inputs a multichannel audio signal. For example, a 22.2 ch audio signal broadcast by simulcast is input.
A dedicated engineer may create audio signals corresponding to the number of channels broadcast by simulcast. For example, for 22.2ch audio signals, a 22.2ch dedicated engineer is suitable for the broadcast content depending on the playback environment such as the speaker layout corresponding to 22.2ch (the intention of the program creator was reflected). ) Create an audio signal for each channel so that a three-dimensional sound is reproduced. On the other hand, for 2ch audio signals, a 2ch dedicated engineer creates audio signals for each of the two channels so that audio suitable for the broadcast content is reproduced in a 2ch playback environment. Each of the 22.2ch and 2ch audio signals created at this time is created so as to suit the contents to be expressed by the program. For example, a documentary program in which narration is dominant and a music program have different characteristics of an audio signal created for each channel even if they are the same 2ch. Therefore, the relationship between the 22.2ch audio signal and the 2ch audio signal is not always a relationship that can be converted from the 22.2ch audio signal to the 2ch audio signal by a predetermined (one type) downmix coefficient. Absent.

重み付け係数計算部１３は、参照信号の個々のチャンネルの信号に含まれるマルチチャンネル音声信号の各チャンネル信号の重み付け係数を計算する。例えば、参照信号を（Ｌ、Ｒ）、マルチチャンネル音声信号を（ＦＣ、ＦＬｃ、ＦＲｃ、ＦＬ、ＦＲ、ＳｉＬ、ＳｉＲ、ＢＬ、ＢＲ、ＢＣ、ＬＦＥ１、ＬＦＥ２、ＴｐＦＣ、ＴｐＦＬ、ＴｐＦＲ、ＴｐＳｉＬ、ＴｐＳｉＲ、ＴｐＣ、ＴｐＢＬ、ＴｐＢＲ、ＴｐＢＣ、ＢｔＦＣ、ＢｔＦＬ、ＢｔＦＲ）とすると、重み付け係数計算部１３は、Ｌを（ａ_１,１×ＦＣ、ａ_２,１×ＦＬｃ、・・・・、ａ_２４,１×ＢｔＦＲ）、Ｒを（ａ_１,２×ＦＣ、ａ_２,２×ＦＬｃ、・・・・、ａ_２４,２×ＢｔＦＲ）と表した場合、各係数（ａ_１,１〜ａ_１,２４、ａ_２,１〜ａ_２,２４）の値を計算する。後述する重み付け係数計算部１３ａ、１３ｂに示す構成についても同様である。 The weighting coefficient calculator 13 calculates a weighting coefficient for each channel signal of the multichannel audio signal included in the signal of each channel of the reference signal. For example, reference signals (L, R), multi-channel audio signals (FC, FLc, FRc, FL, FR, SiL, SiR, BL, BR, BC, LFE1, LFE2, TpFC, TpFL, TpFR, TpSiL, TpSiR) , TpC, TpBL, TpBR, TpBC, BtFC, BtFL, BtFR), the weighting coefficient calculation unit 13 sets L to (a _1,1 × FC, a _2,1 × FLc,..., A _{24, 1} × BtFR), R a _{_{(a 1,2 × FC, a 2,2}} × FLc, when expressed _{····, a 24,2} × BtFR) and, the coefficients _{(a 1,1} ~a _{_1, 24,} to calculate the value of _{a 2,1} ~a _2,24). The same applies to configurations shown in weighting coefficient calculators 13a and 13b described later.

補正係数計算部１４は、重み付け係数計算部１３が計算した重み付け係数を用いて、マルチチャンネル音声信号の各チャンネル信号に乗じる補正係数を計算する。より具体的には、補正係数計算部１４は、参照信号とマルチチャンネル音声信号の各チャンネル信号に重み付け係数を適用して生成した信号のエネルギーまたはラウドネスを指標として、マルチチャンネル音声信号の各チャンネル信号に適用する補正係数を計算する。
または、補正係数計算部１４は、参照信号の各チャンネル信号に対するマルチチャンネル音声信号の各チャンネル信号の重み付け係数の二乗和に基づいて、マルチチャンネル音声信号の各チャンネル信号に適用する補正係数を計算する。 The correction coefficient calculation unit 14 uses the weighting coefficient calculated by the weighting coefficient calculation unit 13 to calculate a correction coefficient to be multiplied to each channel signal of the multichannel audio signal. More specifically, the correction coefficient calculation unit 14 uses the energy or loudness of a signal generated by applying a weighting coefficient to each channel signal of the reference signal and the multichannel audio signal as an index, and outputs each channel signal of the multichannel audio signal. Calculate the correction factor applied to.
Alternatively, the correction coefficient calculation unit 14 calculates a correction coefficient to be applied to each channel signal of the multichannel audio signal based on the square sum of the weighting coefficients of each channel signal of the multichannel audio signal with respect to each channel signal of the reference signal. .

補正係数適用部１５は、補正係数計算部１４が上記したいずれかの方法で計算した補正係数をマルチチャンネル音声信号に適用する。このとき、補正係数適用部１５は、再生する音声信号のチャンネル数に応じて補正係数を修正し、修正後の補正係数をマルチチャンネル音声信号に適用する。
再生チャンネル情報取得部１６は、再生チャンネル音声信号の情報として、例えば、再生チャンネル音声信号のチャンネル数（再生チャンネル数）の情報を取得する。 The correction coefficient application unit 15 applies the correction coefficient calculated by the correction coefficient calculation unit 14 by any of the methods described above to the multichannel audio signal. At this time, the correction coefficient application unit 15 corrects the correction coefficient according to the number of channels of the audio signal to be reproduced, and applies the corrected correction coefficient to the multi-channel audio signal.
The playback channel information acquisition unit 16 acquires, for example, information on the number of channels of the playback channel audio signal (number of playback channels) as information on the playback channel audio signal.

チャンネル数変換部１７は、補正係数適用部１５が補正係数を適用した後のマルチチャンネル音声信号を入力し、例えば、後述するチャンネル数変換処理により、再生チャンネル数に合わせて、マルチチャンネル音声信号をチャンネル数変換する。
記憶部１８は、チャンネル数変換処理に必要な種々のデータを記憶する。 The channel number conversion unit 17 receives the multi-channel audio signal after the correction coefficient application unit 15 applies the correction coefficient, and converts the multi-channel audio signal according to the number of reproduction channels by, for example, channel number conversion processing described later. Convert the number of channels.
The storage unit 18 stores various data necessary for the channel number conversion process.

次に図２を用いて重み付け係数計算部１３について詳しく説明する。
図２は、本発明に係る第一実施形態における重み付け係数計算部の一例を示すブロック図である。
図２に示すように重み付け係数計算部１３は、遅延補正部１３１と、重み付け比分析部１３２と、重み付け比補正部１３３と、を含む。
遅延補正部１３１は、参照信号の各チャンネル信号に対する、マルチチャンネル音声信号の各チャンネル信号の遅延を補正する。２つの信号の時間的なずれは、進んだり遅れたり様々な場合が考えられるが、これらをまとめて遅延と記載する。遅延補正部１３１は、参照信号の各チャンネルに含まれているマルチチャンネル音声信号の各チャンネル信号に対応する信号の、マルチチャンネル音声信号を構成する当該チャンネル信号に対する遅延を、例えば、相互相関関数によって計算する。この理由は、参照信号においては、例えば、マルチチャンネル音声信号で表現される３次元的な音を表現するために、マルチチャンネル音声信号に含まれるあるチャンネル信号に係る音について時間軸方向にずらして構成する場合（例えば、マルチチャンネル音声信号において後方から出力される音と前方から出力される音とが重ならないように、参照信号においては後方からの音を少し遅延させるなど）があるためである。遅延補正部１３１は、マルチチャンネル音声信号の各チャンネル信号の遅延を計算した遅延量分だけ補正し、参照信号の各チャンネル信号に含まれるマルチチャンネル音声信号に対応する信号の位相と、マルチチャンネル音声信号の当該チャンネル信号の位相とを揃える。 Next, the weighting coefficient calculator 13 will be described in detail with reference to FIG.
FIG. 2 is a block diagram showing an example of the weighting coefficient calculator in the first embodiment according to the present invention.
As shown in FIG. 2, the weighting coefficient calculation unit 13 includes a delay correction unit 131, a weighting ratio analysis unit 132, and a weighting ratio correction unit 133.
The delay correction unit 131 corrects the delay of each channel signal of the multi-channel audio signal with respect to each channel signal of the reference signal. The time lag between the two signals may be advanced or delayed, and various cases are considered. These are collectively referred to as a delay. The delay correction unit 131 determines the delay of the signal corresponding to each channel signal of the multichannel audio signal included in each channel of the reference signal with respect to the channel signal constituting the multichannel audio signal, for example, by a cross-correlation function. calculate. This is because, in the reference signal, for example, in order to express a three-dimensional sound expressed by a multi-channel audio signal, the sound related to a certain channel signal included in the multi-channel audio signal is shifted in the time axis direction. This is because there are cases in which the sound is output from the rear in the multi-channel audio signal and the sound output from the front in the reference signal, for example, so that the sound from the rear is slightly delayed in the reference signal. . The delay correction unit 131 corrects the delay of each channel signal of the multi-channel audio signal by the calculated delay amount, and corrects the phase of the signal corresponding to the multi-channel audio signal included in each channel signal of the reference signal, and the multi-channel audio. Align the phase of the signal with the channel signal.

重み付け比分析部１３２は、マルチチャンネル音声信号の各チャンネル信号に対する重み付け比を、重回帰分析、正準相関分析などのいずれかの多変量解析の方法を用いて分析する。または、重み付け比分析部１３２は、遺伝的アルゴリズム、深層学習等の機械学習によって重み付け比を分析してもよい。
重み付け比補正部１３３は、重み付け比分析部１３２が分析した重み付け比を、参照信号を構成する各チャンネル信号のエネルギーに基づいて補正し、補正後の値を重み付け係数として出力する。具体的には、重み付け比補正部１３３は、参照信号の各チャンネル信号のエネルギーと、重み付け比分析部１３２が分析した重み付け比をマルチチャンネル音声信号の各チャンネルに乗じ、乗じて得た擬似参照信号の各チャンネル信号のエネルギーとが等しくなるように前記重み付け比を補正する。 The weighting ratio analysis unit 132 analyzes the weighting ratio of the multichannel audio signal with respect to each channel signal by using any multivariate analysis method such as multiple regression analysis or canonical correlation analysis. Alternatively, the weighting ratio analysis unit 132 may analyze the weighting ratio by machine learning such as a genetic algorithm or deep learning.
The weighting ratio correction unit 133 corrects the weighting ratio analyzed by the weighting ratio analysis unit 132 based on the energy of each channel signal constituting the reference signal, and outputs the corrected value as a weighting coefficient. Specifically, the weighting ratio correction unit 133 multiplies each channel of the multichannel audio signal by the energy of each channel signal of the reference signal and the weighting ratio analyzed by the weighting ratio analysis unit 132, and obtains the pseudo reference signal obtained by multiplication. The weighting ratio is corrected so that the energy of each channel signal becomes equal.

次に図３を用いて、チャンネル数変換処理の詳細について説明を行う。
図３は、本発明に係る第一実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。
前提として、ある番組について、チャンネル数に応じて作成されたマルチチャンネル音声信号および参照信号が同時に放送されており、チャンネル数変換装置１０は両方の信号を入力する。
まず、ステップＳ１１で、マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を入力する。マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を重み付け係数計算部１３に出力する。また、ステップＳ１１と並行して、ステップＳ１２で、参照信号入力部１１は、参照信号を入力する。参照信号入力部１１は、入力したされた２ｃｈステレオの参照信号を、ＬＲそれぞれのチャンネル信号に分離する（ｐ_ｊ、１≦ｊ≦２、ｌ＝２）。ここで、ｐ_ｊは分離後の参照信号の各チャンネル信号である。参照信号入力部１１は、参照信号の各チャンネル信号を重み付け係数計算部１３に出力する。 Next, details of the channel number conversion process will be described with reference to FIG.
FIG. 3 is a flowchart showing an example of the channel number conversion process in the first embodiment according to the present invention.
As a premise, for a certain program, a multi-channel audio signal and a reference signal created according to the number of channels are broadcast simultaneously, and the channel number conversion device 10 inputs both signals.
First, in step S11, the multichannel audio signal input unit 12 inputs a multichannel audio signal. The multichannel audio signal input unit 12 outputs the multichannel audio signal to the weighting coefficient calculation unit 13. In parallel with step S11, in step S12, the reference signal input unit 11 inputs a reference signal. The reference signal input unit 11 separates the input 2ch stereo reference signal into LR channel signals (p _j , 1 ≦ j ≦ 2, l = 2). Here, p _j is each channel signal of the reference signal after separation. The reference signal input unit 11 outputs each channel signal of the reference signal to the weighting coefficient calculation unit 13.

次に、ステップＳ１３で、重み付け係数計算部１３では、遅延補正部１３１が、参照信号とマルチチャンネル音声信号とを入力して、マルチチャンネル音声信号の遅延を補正する。例えば、遅延補正部１３１は、例えば、相互相関関数を計算することにより参照信号のチャンネル信号ごとにマルチチャンネル音声信号の各チャンネル信号に対応する遅延を計算し、マルチチャンネル音声信号の各チャンネルを補正する。遅延補正部１３１は、遅延補正後のマルチチャンネル音声信号を重み付け比分析部１３２に出力する。遅延の補正を行うのは、重み付け比をより正確に計算するためである。 Next, in step S13, in the weighting coefficient calculation unit 13, the delay correction unit 131 inputs the reference signal and the multichannel audio signal, and corrects the delay of the multichannel audio signal. For example, the delay correction unit 131 calculates a delay corresponding to each channel signal of the multichannel audio signal for each channel signal of the reference signal, for example, by calculating a cross correlation function, and corrects each channel of the multichannel audio signal. To do. The delay correction unit 131 outputs the multichannel audio signal after delay correction to the weighting ratio analysis unit 132. The reason for correcting the delay is to calculate the weighting ratio more accurately.

次に、ステップＳ１４で、重み付け比分析部１３２は、遅延補正後のマルチチャンネル音声信号と参照信号とを入力し、参照信号の各チャンネル信号に対するマルチチャンネル音声信号の各チャンネル信号の重み付け比を、重回帰分析等を用いて計算する。具体的には、重み付け比分析部１３２は、２ｃｈ（参照信号）の各チャンネル信号を適切に構成するための、２２．２ｃｈ（マルチチャンネル音声信号）の音声信号（ｑ_ｉ、１≦ｉ≦２４、ｎ＝２４）のチャンネル間の重み付け比（ａ_ｉｊ、１≦ｉ≦２４、１≦ｊ≦２）を計算する。ここで、擬似参照信号のチャンネル信号ｐ＾_ｊは、以下の式（１）で表すことができる。 Next, in step S14, the weighting ratio analysis unit 132 inputs the multichannel audio signal after delay correction and the reference signal, and calculates the weighting ratio of each channel signal of the multichannel audio signal to each channel signal of the reference signal. Calculate using multiple regression analysis. Specifically, the weighting ratio analysis unit 132 is configured to appropriately configure each channel signal of 2ch (reference signal), and 22.2ch (multichannel audio signal) audio signal (q _i , 1 ≦ i ≦ 24). , N = 24), the weighting ratio between channels (a _ij , 1 ≦ i ≦ 24, 1 ≦ j ≦ 2) is calculated. Here, the channel signal p ^ _j of the pseudo reference signal can be expressed by the following equation (1).

式（１）より、重み付け比「ａ_ｉｊ」は、２ｃｈの各チャンネル信号に含まれる２２．２ｃｈ音声信号の各チャンネルのレベル比に対応する。重み付け比分析部１３２は、計算した重み付け比を重み付け比補正部１３３に出力する。 From equation (1), the weighting ratio “a _ij ” corresponds to the level ratio of each channel of the 22.2 ch audio signal included in each channel signal of 2 ch. The weighting ratio analysis unit 132 outputs the calculated weighting ratio to the weighting ratio correction unit 133.

次に、ステップＳ１５で、重み付け比補正部１３３は、重み付け比分析部１３２が分析した重み付け比と参照信号とを入力し、重み付け比を、エネルギーに基づいて補正する。例えば、２ｃｈの参照信号のＬｃｈのエネルギー（Ｅ_Ｌｃｈ）は、以下の式（２）で表すことができる。 Next, in step S15, the weighting ratio correction unit 133 inputs the weighting ratio and the reference signal analyzed by the weighting ratio analysis unit 132, and corrects the weighting ratio based on energy. For example, the Lch energy (E _Lch ) of the 2ch reference signal can be expressed by the following equation (2).

また、重み付け比を用いて表した擬似参照信号のＬｃｈエネルギーは、以下の式（３）で表すことができる。 Further, the Lch energy of the pseudo reference signal expressed using the weighting ratio can be expressed by the following equation (3).

重み付け比補正部１３３は、式（４）によって、参照信号のＬｃｈのエネルギーと擬似参照信号のＬｃｈのエネルギーが等しくなるような定数ｃ_１を計算する。 The weighting ratio correction unit 133 calculates a constant c ₁ such that the Lch energy of the reference signal and the Lch energy of the pseudo reference signal are equal to each other using Expression (4).

次に、重み付け比補正部１３３は、重み付け比ａ_ｉｊ（１≦ｉ≦２４、１≦ｊ≦２）のそれぞれに定数ｃ_ｊの平方根を乗じて重み付け比を補正する。補正後の重み付け比が重み付け係数である。ステップＳ１６で、重み付け係数計算部１３は、重み付け比補正部１３３が補正して得られた重み付け係数を補正係数計算部１４へ出力する。 Next, the weighting ratio correction unit 133 corrects the weighting ratio by multiplying each of the weighting ratios a _ij (1 ≦ i ≦ 24, 1 ≦ j ≦ 2) by the square root of the constant c _j . The corrected weighting ratio is a weighting coefficient. In step S <b> 16, the weighting coefficient calculation unit 13 outputs the weighting coefficient obtained by the correction by the weighting ratio correction unit 133 to the correction coefficient calculation unit 14.

なお、重み付け比補正部１３３は、補正の基準となる指標としてエネルギー以外にもラウドネスや振幅などを用いてもよい。例えば、重み付け比補正部１３３は、参照信号の各チャンネル信号のラウドネスの和と擬似参照信号の各チャンネル信号のラウドネスの和が等しくなるような定数ｃ_ｊを算出してもよい。 Note that the weighting ratio correction unit 133 may use loudness, amplitude, or the like in addition to energy as an index serving as a correction reference. For example, the weighting ratio correction unit 133 may calculate a constant c _j such that the sum of the loudness of each channel signal of the reference signal is equal to the sum of the loudness of each channel signal of the pseudo reference signal.

次に、ステップＳ１７で、補正係数計算部１４は、補正係数を計算する。具体的には、補正係数計算部１４は、２ｃｈステレオ信号全体のエネルギーと、補正係数を適用した２２．２ｃｈ音声信号の全エネルギーの和を等しくするための補正係数ｂ_ｉ（１≦ｉ≦２４）を、以下の式（５）によって計算する。 Next, in step S17, the correction coefficient calculator 14 calculates a correction coefficient. Specifically, the correction coefficient calculation unit 14 corrects the correction coefficient b _i (1 ≦ i ≦ 24) for equalizing the sum of the energy of the entire 2ch stereo signal and the total energy of the 22.2ch audio signal to which the correction coefficient is applied. ) Is calculated by the following equation (5).

例えば、式５より、以下の式（６）を導出することができる。
（ｂ_１）^２＝ｃ_１×（ａ_１１）^２＋ｃ_２×（ａ_２１）^２・・・（６）
他のｂ_２〜ｂ_２４の値についても同様に式（５）から導出することができる。 For example, the following formula (6) can be derived from the formula 5.
(B ₁ ) ² = c ₁ × (a ₁₁ ) ² + c ₂ × (a ₂₁ ) ² (6)
Other values of b _{2 to} b ₂₄ can be similarly derived from the equation (5).

なお、補正係数計算部１４は、補正係数算出の基準となる指標としてエネルギー以外にもラウドネスや振幅などを用いてもよい。例えば、補正係数計算部１４は、参照信号の全ラウドネスと補正係数を適用したマルチチャンネル音声信号の全ラウドネスが等しくなるような補正係数ｂ_ｉを算出してもよい。あるいは、補正係数計算部１４は、参照信号の全チャンネル信号の振幅の２乗和と補正係数を適用したマルチチャンネル音声信号の全チャンネル信号の振幅の２乗和が等しくなるような補正係数ｂ_ｉを算出してもよい。補正係数計算部１４は、計算した補正係数ｂ_１〜ｂ_２４を補正係数適用部１５に出力する。 Note that the correction coefficient calculation unit 14 may use loudness, amplitude, etc. in addition to energy as an index serving as a reference for calculating the correction coefficient. For example, the correction coefficient calculation unit 14, the total loudness of a multichannel audio signal according to the total loudness and the correction coefficient of the reference signal may calculate the correction coefficient b _i as equal. Alternatively, the correction coefficient calculator 14 corrects the correction coefficient b _i so that the sum of squares of the amplitudes of all the channel signals of the reference signal is equal to the sum of squares of the amplitudes of all the channel signals of the multichannel audio signal to which the correction coefficient is applied. May be calculated. The correction coefficient calculation unit 14 outputs the calculated correction coefficients b _{1 to} b ₂₄ to the correction coefficient application unit 15.

次に、ステップＳ１８で、補正係数適用部１５は、再生チャンネル数に応じて、例えば線形補間を利用して補正係数を修正する。ステップＳ１３〜Ｓ１７の過程を経て計算した補正係数ｂ_ｉは、２２．２ｃｈのマルチチャンネル音声信号と２ｃｈの参照信号とから得られた、いわば、２２．２ｃｈを２ｃｈに変換するのに適した補正係数（参照信号のチャンネル数に対応する補正係数）である。２２．２ｃｈのマルチチャンネル音声信号を、所定のチャンネル数変換方法を用いて再生チャンネル音声信号（５．１ｃｈ）にダウンミックスすると、例えば、規格等で定められたダウンミックス係数によって機械的に変換されることになる。このため、変換後の５．１ｃｈの再生チャンネル音声信号は、番組製作者が５．１ｃｈを用いて表現する音声として意図する音声信号とは乖離する可能性がある。本実施形態では、５．１ｃｈより少ないチャンネル数の番組製作者の意図が反映された参照信号を教師データとして、２２．２ｃｈのマルチチャンネル音声信号を参照信号に近づけるための補正係数ｂ_ｉを計算する。しかし、上記のステップで得られたこの補正係数をそのまま適用２２．２ｃｈに適用すると、適用後のマルチチャンネル音声信号には、２ｃｈへの変換が想定された片寄りが生じるため、補正係数適用部１５は、２ｃｈへの片寄りを緩和する修正を行う。そして、補正係数適用部１５は、補正係数ｂ_ｉを、より５．１ｃｈに適した補正係数（再生チャンネル信号のチャンネル数に対応する補正係数）となるよう修正する。 Next, in step S18, the correction coefficient application unit 15 corrects the correction coefficient using, for example, linear interpolation according to the number of reproduction channels. The correction coefficient b _i calculated through the process of steps S13 to S17 is obtained from the 22.2 ch multi-channel audio signal and the 2 ch reference signal, so to speak, a correction suitable for converting 22.2 ch into 2 ch. A coefficient (a correction coefficient corresponding to the number of channels of the reference signal). When a 22.2 ch multi-channel audio signal is downmixed to a playback channel audio signal (5.1 ch) using a predetermined channel number conversion method, for example, it is mechanically converted by a downmix coefficient defined by a standard or the like. Will be. For this reason, there is a possibility that the 5.1 channel playback channel audio signal after conversion is different from the audio signal intended as the audio expressed by the program producer using 5.1 channel. In the present embodiment, a correction signal b _i for approximating a 22.2 ch multi-channel audio signal to the reference signal is calculated using the reference signal reflecting the intention of the program producer with the number of channels smaller than 5.1 ch as teacher data. To do. However, if the correction coefficient obtained in the above step is applied to the 22.2 ch as it is, the multi-channel audio signal after application has a shift that is assumed to be converted to 2 ch. 15 performs a correction to alleviate the shift to 2ch. Then, the correction coefficient application unit 15, the correction coefficient b _i, is modified to become more correction coefficients suitable for 5.1ch (correction coefficient corresponding to the number of channels of the reproduction channel signal).

例えば、２２．２ｃｈ（ｎチャンネル）を２２．２ｃｈ（ｎチャンネル）へ変換するときの補正係数を「１．０」、２２．２ｃｈ（ｎチャンネル）を２ｃｈ（ｌチャンネル）へ変換するときの補正係数を「ｂ_ｉ」とした場合、５．１ｃｈ（ｍチャンネル）のチャンネル数が両者の間であることに基づき、２２．２ｃｈ（ｎチャンネル）を５．１ｃｈ（ｍチャンネル）へ変換するときの補正係数ｂ＾_ｉがそれら両方の補正係数の中間の値（例えば、ｂ＾_ｉ＝（ｂ_ｉ＋１）÷２）と考える。ここで、ｂ_ｉ≦ｂ＾_ｉ≦１．０である。すると、線形補間の関係から、以下の式（６）が導出できる。
（ｂ＾_ｉ−ｂ_ｉ）÷（１．０−ｂ_ｉ）＝（ｍ−ｌ）÷（ｎ−ｌ）
・・・（６）
これを変形すると、以下の式（７）が導出できる。
ｂ＾_ｉ＝ｂ_ｉ＋（１−ｂ_ｉ）×（ｍ−ｌ）÷（ｎ−ｌ）・・・（７）
なお、上記の線形補間の説明でｌ＜ｍ＜ｎであることを前提に説明を行ったが、式（７）は、ｍ＞ｌでもｍ＜ｌでも適用可能であり、ｍとｌの大小関係には制約がない。また、ここでは、線形補間を用いて５．１ｃｈ用に補正係数を修正する場合を例に説明を行ったが、他の補間法（多項式補間など）を用いて補正係数を修正してもよい。
次に、ステップＳ１９で、補正係数適用部１５は、５．１ｃｈに適した補正係数ｂ＾_ｉを２２．２ｃｈのマルチチャンネル音声信号に適用する。具体的には、補正係数適用部１５は、ｂ＾_ｉをｑ_ｉに乗じる。そして、補正係数適用部１５は、適用後のｂ＾_ｉ×ｑ_ｉ（１≦ｉ≦２４）をチャンネル数変換部１７に出力する。 For example, the correction coefficient when converting 22.2 ch (n channel) to 22.2 ch (n channel) is “1.0”, and the correction coefficient when converting 22.2 ch (n channel) to 2 ch (l channel) When the coefficient is “b _i ”, when converting the number of channels of 5.1 ch (m channel) between the two, 22.2 ch (n channel) is converted to 5.1 ch (m channel). It is assumed that the correction coefficient b _i is an intermediate value between the two correction coefficients (for example, b _i = (b _i +1) / 2). Here, b _i ≦ b _i ≦ 1.0. Then, the following equation (6) can be derived from the relationship of linear interpolation.
(B ^ _i −b _i ) ÷ (1.0−b _i ) = (m−l) ÷ (n−l)
... (6)
By transforming this, the following equation (7) can be derived.
b _i = b _i + (1−b _i ) × (m−l) ÷ (n−l) (7)
In the above description of linear interpolation, it was assumed that l <m <n. However, Equation (7) can be applied to both m> l and m <l. There are no restrictions on the relationship. In addition, here, the case where the correction coefficient is corrected for 5.1ch using linear interpolation has been described as an example, but the correction coefficient may be corrected using another interpolation method (polynomial interpolation or the like). .
Next, in step S19, the correction coefficient application unit 15 applies the correction coefficient b _i suitable for 5.1ch to the 22.2ch multi-channel audio signal. Specifically, the correction coefficient application unit 15 multiplies b ^ _i by q _i . Then, the correction coefficient application unit 15 outputs b ^ _i × q _i (1 ≦ i ≦ 24) after application to the channel number conversion unit 17.

なお、上記の例では、２２．２ｃｈを２２．２ｃｈへ変換するときの補正係数を「１．０」と仮定したが、これに限定されない。例えば、マルチチャンネル音声信号の各チャンネル信号ｑ_ｉの重要度などに応じて初期値（マルチチャンネル音声信号のチャンネル数に対応する補正係数の初期値）を設定することが可能である。例えば、記憶部１８にマルチチャンネル音声信号のチャンネル信号ｑ_ｉ（１≦ｉ≦２４）ごとに補正係数の初期値が記録されていて、補正係数適用部１５は、記憶部１８からチャンネル信号ｑ_ｉ（１≦ｉ≦２４）ごとの補正係数の初期値を読み出すことで、ｂ＾_ｉを計算してもよい。例えば、記憶部１８には、ｑ_１の補正係数の初期値が「１．０」、ｑ_２の補正係数の初期値が「０．９」、・・・、ｑ_２４の補正係数の初期値が「０．８」などと記録されていてもよい。 In the above example, the correction coefficient when converting 22.2 ch to 22.2 ch is assumed to be “1.0”, but the present invention is not limited to this. For example, an initial value (an initial value of a correction coefficient corresponding to the number of channels of the multichannel audio signal) can be set according to the importance of each channel signal q _i of the multichannel audio signal. For example, the initial value of the correction coefficient is recorded for each channel signal q _i (1 ≦ i ≦ 24) of the multi-channel audio signal in the storage unit 18, and the correction coefficient application unit 15 receives the channel signal q _i from the storage unit 18. B ^ _i may be calculated by reading the initial value of the correction coefficient for each (1 ≦ i ≦ 24). For example, the storage unit 18, the initial value is "1.0" in the correction factor _{q 1,} the initial value is "0.9" in the correction factor _{q 2,} · · _·, the initial value of the correction factor _{q 24} May be recorded as “0.8” or the like.

次に、ステップＳ２０で、チャンネル数変換部１７は、入力した補正係数適用後のマルチチャンネル音声信号を、後述する所定のチャンネル数変換方法（一般的なチャンネル数変換方法）でチャンネル数変換し、再生チャンネル音声信号を出力する。 Next, in step S20, the channel number conversion unit 17 converts the input multi-channel audio signal after application of the correction coefficient by the predetermined channel number conversion method (general channel number conversion method) described later, Outputs playback channel audio signals.

本実施形態によれば、マルチチャンネル音声信号とそれより少ないチャンネル数の音声信号（参照信号）が同時に提供された場合に、少ないチャンネル数の参照信号を基準にして、より制作意図に沿ったチャンネル数変換を実現することができる。 According to the present embodiment, when a multi-channel audio signal and an audio signal (reference signal) having a smaller number of channels are simultaneously provided, a channel that is more in line with the production intention with reference to the reference signal having a smaller number of channels. Number conversion can be realized.

＜第二実施形態＞
以下、本発明の第二実施形態による重み付け係数計算部を、図４〜６を参照して説明する。
図４は、本発明に係る第二実施形態におけるチャンネル数変換装置の一例を示すブロック図である。
図４に示すようにチャンネル数変換装置１０ａは、参照信号入力部１１、マルチチャンネル音声信号入力部１２、重み付け係数計算部１３ａ、補正係数計算部１４、補正係数適用部１５、再生チャンネル情報取得部１６、チャンネル数変換部１７、記憶部１８と、を含む。このように、第二実施形態によるチャンネル数変換装置１０ａは、第一実施形態の重み付け係数計算部１３に代えて重み付け係数計算部１３ａを備えている。なお、第二実施形態によるチャンネル数変換装置１０ａの他の構成は、第一実施形態のチャンネル数変換装置１０と同様である。 <Second embodiment>
Hereinafter, the weighting coefficient calculator according to the second embodiment of the present invention will be described with reference to FIGS.
FIG. 4 is a block diagram showing an example of the channel number conversion apparatus according to the second embodiment of the present invention.
As shown in FIG. 4, the channel number conversion apparatus 10a includes a reference signal input unit 11, a multichannel audio signal input unit 12, a weighting coefficient calculation unit 13a, a correction coefficient calculation unit 14, a correction coefficient application unit 15, and a reproduction channel information acquisition unit. 16, a channel number conversion unit 17, and a storage unit 18. As described above, the channel number conversion device 10a according to the second embodiment includes the weighting coefficient calculation unit 13a instead of the weighting coefficient calculation unit 13 of the first embodiment. In addition, the other structure of the channel number converter 10a by 2nd embodiment is the same as that of the channel number converter 10 of 1st embodiment.

図５は、本発明に係る第二実施形態における重み付け係数計算部の一例を示すブロック図である。
図５に示すように、本実施形態による重み付け係数計算部１３ａは、遅延補正部１３１と、重み付け比分析部１３２と、重み付け比補正部１３３と、グルーピング部１３４と、を含む。
グルーピング部１３４は、マルチチャンネル音声信号の各チャンネル信号を、各チャンネル信号の類似度に基づいてグルーピングする。例えば、グルーピング部１３４は、各チャンネル音声信号間の相互相関係数を計算し、相互相関係数の大きいチャンネル同士をグルーピングする。また、グルーピング部１３４は、主成分分析、クラスタ分析等の方法、あるいはそれら両方の方法を用いて音声信号の性質が似たチャンネル同士をグルーピングしてもよい。さらに、グルーピング部１３４は、同じグループに所属するチャンネル音声信号を代表するグループ信号を生成する。なお、本実施形態では、重み付け係数計算部１３ａは、このグループ信号について重み付け係数を計算する。グループ信号の生成方法には以下のような方法がある。例えば、グルーピング部１３４は、同じグループに所属するチャンネル信号の平均を生成し、生成した信号をグループ信号として扱ってもよい。また、例えば、グルーピング部１３４は、同じグループに所属するチャンネル信号の類似度の重心にある信号を選択し、選択した信号をグループ信号として扱ってもよい。さらに、例えば、グルーピング部１３４は、同じグループに所属するチャンネル音声信号のうち、最大エネルギーを有するチャンネル音声信号を選択し、選択した信号をグループ信号として扱ってもよい。 FIG. 5 is a block diagram showing an example of a weighting coefficient calculator in the second embodiment according to the present invention.
As shown in FIG. 5, the weighting coefficient calculation unit 13 a according to the present embodiment includes a delay correction unit 131, a weighting ratio analysis unit 132, a weighting ratio correction unit 133, and a grouping unit 134.
The grouping unit 134 groups each channel signal of the multi-channel audio signal based on the similarity of each channel signal. For example, the grouping unit 134 calculates a cross-correlation coefficient between each channel audio signal, and groups channels having a large cross-correlation coefficient. Further, the grouping unit 134 may group channels having similar audio signal properties using methods such as principal component analysis and cluster analysis, or both methods. Further, the grouping unit 134 generates a group signal representing channel audio signals belonging to the same group. In this embodiment, the weighting coefficient calculator 13a calculates a weighting coefficient for this group signal. There are the following methods for generating a group signal. For example, the grouping unit 134 may generate an average of channel signals belonging to the same group and treat the generated signal as a group signal. Further, for example, the grouping unit 134 may select a signal at the center of gravity of the similarity of channel signals belonging to the same group, and handle the selected signal as a group signal. Further, for example, the grouping unit 134 may select a channel audio signal having the maximum energy among the channel audio signals belonging to the same group and treat the selected signal as a group signal.

遅延補正部１３１、重み付け比分析部１３２、重み付け比補正部１３３の機能は、第一実施形態と同様であり、参照信号の各チャンネル信号に対する、マルチチャンネル音声信号の各グループ信号の遅延を補正する。重み付け比分析部１３２は、参照信号の各チャンネル信号におけるマルチチャンネル音声信号の各グループ信号の重み付け比を多変量解析で分析する。重み付け比補正部１３３は、参照信号の各チャンネル信号のエネルギーと、重み付け比分析部１３２が分析した重み付け比をマルチチャンネル音声信号の各グループ信号に乗じて得た擬似参照信号の各チャンネルのエネルギー（参照信号のチャンネルに対応する擬似参照信号のチャンネルのエネルギー）とが等しくなるような定数ｃ_ｊを計算する。グループに所属する各チャンネル音声信号には、グループ信号に与えられた重み付け比を、グループに所属するチャンネル数に応じて等分した値を付与する。 The functions of the delay correction unit 131, the weighting ratio analysis unit 132, and the weighting ratio correction unit 133 are the same as in the first embodiment, and correct the delay of each group signal of the multichannel audio signal with respect to each channel signal of the reference signal. . The weighting ratio analysis unit 132 analyzes the weighting ratio of each group signal of the multichannel audio signal in each channel signal of the reference signal by multivariate analysis. The weighting ratio correction unit 133 energizes each channel signal of the pseudo reference signal obtained by multiplying each group signal of the multichannel audio signal by the energy of each channel signal of the reference signal and the weighting ratio analyzed by the weighting ratio analysis unit 132. A constant c _j is calculated such that the energy of the channel of the pseudo reference signal corresponding to the channel of the reference signal is equal. Each channel audio signal belonging to a group is given a value obtained by equally dividing the weighting ratio given to the group signal according to the number of channels belonging to the group.

次に、図６を用いて、第二実施形態におけるチャンネル数変換処理について説明を行う。図６は、本発明に係る第二実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。
なお、図３で説明した処理と同様の処理については簡単に説明を行う。まず、ステップＳ１１で、マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を入力する。マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を重み付け係数計算部１３ａに出力する。また、ステップＳ１２で、参照信号入力部１１は、参照信号を入力する。参照信号入力部１１は、参照信号の各チャンネル信号を重み付け係数計算部１３ａに出力する。次に、ステップＳ１２１で、重み付け係数計算部１３ａでは、グルーピング部１３４がマルチチャンネル音声信号の各チャンネル信号を、例えば、主成分分析等の方法を用いてグルーピングする。グルーピング部１３４は、グルーピング後のグループ信号を遅延補正部１３１に出力する。次に、ステップＳ１３で、遅延補正部１３１が、参照信号とマルチチャンネル音声信号とを入力して、マルチチャンネル音声信号の遅延を補正する。遅延補正部１３１は、遅延補正後のマルチチャンネル音声信号（各グループ信号）を重み付け比分析部１３２に出力する。 Next, the channel number conversion process in the second embodiment will be described with reference to FIG. FIG. 6 is a flowchart showing an example of channel number conversion processing in the second embodiment according to the present invention.
A process similar to the process described with reference to FIG. 3 will be briefly described. First, in step S11, the multichannel audio signal input unit 12 inputs a multichannel audio signal. The multichannel audio signal input unit 12 outputs the multichannel audio signal to the weighting coefficient calculator 13a. In step S12, the reference signal input unit 11 inputs a reference signal. The reference signal input unit 11 outputs each channel signal of the reference signal to the weighting coefficient calculation unit 13a. Next, in step S121, in the weighting coefficient calculator 13a, the grouping unit 134 groups each channel signal of the multi-channel audio signal using a method such as principal component analysis. The grouping unit 134 outputs the group signal after grouping to the delay correction unit 131. Next, in step S13, the delay correction unit 131 inputs the reference signal and the multichannel audio signal, and corrects the delay of the multichannel audio signal. The delay correction unit 131 outputs the multichannel audio signal (each group signal) after delay correction to the weighting ratio analysis unit 132.

次に、ステップＳ１４で、重み付け比分析部１３２は、遅延補正後のマルチチャンネル音声信号と参照信号とを入力し、参照信号の各チャンネル信号に対するマルチチャンネル音声信号の各グループ信号の重み付け比を、重回帰分析等を用いて計算する。重み付け比分析部１３２は、分析した重み付け比を重み付け比補正部１３３に出力する。次に、ステップＳ１５で、重み付け比補正部１３３は、重み付け比分析部１３２が分析した重み付け比と参照信号とを入力し、重み付け比をエネルギーに基づいて補正する。第二実施形態では、重み付け比補正部１３３は、参照信号の各チャンネル信号のエネルギーと、重み付け比をマルチチャンネル音声信号の各グループ信号に乗じて得た擬似参照信号の各チャンネルのエネルギーとが等しくなるような定数ｃ_ｊを計算し、各重み付け比ａ_ｉｊにｃ_ｊの平方根を乗じた重み付け係数を計算する。このとき、重み付け比補正部１３３は、同じグループ信号に所属するチャンネル信号のそれぞれに同じ重み付け係数を付与する。そして、ステップＳ１６で、重み付け係数計算部１３は、重み付け比補正部１３３が計算した重み付け係数を補正係数計算部１４へ出力する。 Next, in step S14, the weighting ratio analysis unit 132 inputs the multichannel audio signal after delay correction and the reference signal, and calculates the weighting ratio of each group signal of the multichannel audio signal to each channel signal of the reference signal. Calculate using multiple regression analysis. The weighting ratio analysis unit 132 outputs the analyzed weighting ratio to the weighting ratio correction unit 133. Next, in step S15, the weighting ratio correction unit 133 inputs the weighting ratio analyzed by the weighting ratio analysis unit 132 and the reference signal, and corrects the weighting ratio based on energy. In the second embodiment, the weighting ratio correction unit 133 equals the energy of each channel signal of the reference signal and the energy of each channel of the pseudo reference signal obtained by multiplying each group signal of the multichannel audio signal by the weighting ratio. A constant c _j is calculated, and a weighting coefficient obtained by multiplying each weighting ratio a _ij by the square root of c _j is calculated. At this time, the weighting ratio correction unit 133 assigns the same weighting coefficient to each channel signal belonging to the same group signal. In step S <b> 16, the weighting coefficient calculation unit 13 outputs the weighting coefficient calculated by the weighting ratio correction unit 133 to the correction coefficient calculation unit 14.

次に、ステップＳ１７で、補正係数計算部１４は、第一実施形態と同様に補正係数を計算する。次に、ステップＳ１８で、補正係数適用部１５は、線形補間等により、再生チャンネル数に応じて補正係数を修正する。ステップＳ１９で、補正係数適用部１５は、修正後の補正係数をマルチチャンネル音声信号に適用する。補正係数適用部１５は、補正係数適用後のマルチチャンネル音声信号をチャンネル数変換部１７に出力する。次に、ステップＳ２０で、チャンネル数変換部１７は、所定の方法で補正係数適用後のマルチチャンネル音声信号をチャンネル数変換する。チャンネル数変換装置１０ａは、チャンネル数変換後の再生チャンネル音声信号を再生機器等に出力する。 Next, in step S17, the correction coefficient calculation unit 14 calculates a correction coefficient in the same manner as in the first embodiment. Next, in step S18, the correction coefficient application unit 15 corrects the correction coefficient according to the number of reproduction channels by linear interpolation or the like. In step S19, the correction coefficient application unit 15 applies the corrected correction coefficient to the multichannel audio signal. The correction coefficient application unit 15 outputs the multi-channel audio signal after applying the correction coefficient to the channel number conversion unit 17. Next, in step S20, the channel number conversion unit 17 converts the number of channels of the multichannel audio signal after applying the correction coefficient by a predetermined method. The channel number converter 10a outputs the playback channel audio signal after the channel number conversion to a playback device or the like.

第二実施形態によれば、第一実施形態と同様の効果を得ることができる。また、第二実施形態によれば、マルチチャンネル音声信号に含まれるチャンネル音声信号のうち、音声信号の特性が似ているチャンネル音声信号をグルーピングして、遅延補正処理や重み付け係数の算出処理を行うので、第一実施形態に比べ、計算量を抑えることができる。 According to the second embodiment, the same effect as the first embodiment can be obtained. In addition, according to the second embodiment, among channel audio signals included in a multi-channel audio signal, channel audio signals having similar audio signal characteristics are grouped to perform delay correction processing and weighting coefficient calculation processing. Therefore, the amount of calculation can be suppressed as compared with the first embodiment.

＜第三実施形態＞
以下、本発明の第三実施形態による重み付け係数計算部を、図７〜９を参照して説明する。
図７は、本発明に係る第三実施形態におけるチャンネル数変換装置の一例を示すブロック図である。
図７に示すようにチャンネル数変換装置１０ｂは、参照信号入力部１１、マルチチャンネル音声信号入力部１２、重み付け係数計算部１３ｂ、補正係数計算部１４、補正係数適用部１５、再生チャンネル情報取得部１６、チャンネル数変換部１７、記憶部１８と、を含む。このように、第二実施形態によるチャンネル数変換装置１０ａは、第一実施形態の重み付け係数計算部１３に代えて重み付け係数計算部１３ｂを備えている。なお、第二実施形態によるチャンネル数変換装置１０ｂの他の構成は、第一実施形態のチャンネル数変換装置１０と同様である。 <Third embodiment>
Hereinafter, the weighting coefficient calculator according to the third embodiment of the present invention will be described with reference to FIGS.
FIG. 7 is a block diagram showing an example of the channel number conversion apparatus according to the third embodiment of the present invention.
As shown in FIG. 7, the channel number conversion device 10b includes a reference signal input unit 11, a multichannel audio signal input unit 12, a weighting coefficient calculation unit 13b, a correction coefficient calculation unit 14, a correction coefficient application unit 15, and a reproduction channel information acquisition unit. 16, a channel number conversion unit 17, and a storage unit 18. As described above, the channel number conversion device 10a according to the second embodiment includes the weighting coefficient calculation unit 13b instead of the weighting coefficient calculation unit 13 of the first embodiment. In addition, the other structure of the channel number converter 10b by 2nd embodiment is the same as that of the channel number converter 10 of 1st embodiment.

図８は、本発明に係る第三実施形態における重み付け係数計算部の一例を示すブロック図である。
図８に示すように、本実施形態による重み付け係数計算部１３ｂは、遅延補正部１３１ｂと、重み付け比分析部１３２ｂと、重み付け比補正部１３３と、基準チャンネル信号選択部１３５と、チャンネル分類部１３６と、を含む。
基準チャンネル信号選択部１３５は、マルチチャンネル音声信号に含まれるチャンネル信号のうち、参照信号のチャンネル信号との間の相互相関係数が最も大きいチャンネル信号（基準チャンネル信号）を選択する。基準チャンネル信号選択部１３５は、選択した基準チャンネルの情報を出力する。
チャンネル分類部１３６は、マルチチャンネル音声信号に含まれるチャンネル信号を、基準チャンネル音声信号とそれ以外（非基準チャンネル信号）に分類する。 FIG. 8 is a block diagram showing an example of the weighting coefficient calculator in the third embodiment according to the present invention.
As shown in FIG. 8, the weighting coefficient calculation unit 13b according to the present embodiment includes a delay correction unit 131b, a weighting ratio analysis unit 132b, a weighting ratio correction unit 133, a reference channel signal selection unit 135, and a channel classification unit 136. And including.
The reference channel signal selection unit 135 selects a channel signal (reference channel signal) having the largest cross-correlation coefficient with the channel signal of the reference signal among the channel signals included in the multi-channel audio signal. The reference channel signal selection unit 135 outputs information on the selected reference channel.
The channel classification unit 136 classifies the channel signals included in the multi-channel audio signal into a reference channel audio signal and other (non-reference channel signal).

遅延補正部１３１ｂは、参照信号のチャンネル信号に対する、基準チャンネル信号および非基準チャンネル信号郡のうち少なくとも一方の遅延を補正する。なお、非基準チャンネル信号郡に含まれる各チャンネル音声信号の遅延は、個別に補正してもよい。
重み付け比分析部１３２ｂは、基準チャンネル信号の重み付け比が、非基準チャンネル信号群のどの重み付け係数よりも大きいという拘束条件を課したうえで、重み付け比を計算する。 The delay correction unit 131b corrects the delay of at least one of the standard channel signal and the non-standard channel signal group with respect to the channel signal of the reference signal. The delay of each channel audio signal included in the non-reference channel signal group may be individually corrected.
The weighting ratio analysis unit 132b calculates the weighting ratio after imposing a constraint that the weighting ratio of the reference channel signal is larger than any weighting coefficient of the non-reference channel signal group.

重み付け比補正部１３３の機能は、第一実施形態と同様であり、参照信号の各チャンネル信号のエネルギーと、重み付け比分析部１３２ｂが分析した重み付け比をマルチチャンネル音声信号の各チャンネル信号に乗じて得た擬似参照信号の各チャンネルのエネルギー（参照信号のチャンネルに対応する擬似参照信号のチャンネルのエネルギー）とが等しくなるような定数ｃ_ｊを計算する。 The function of the weighting ratio correction unit 133 is the same as that of the first embodiment, and the channel signal of the multichannel audio signal is multiplied by the energy of each channel signal of the reference signal and the weighting ratio analyzed by the weighting ratio analysis unit 132b. A constant c _j is calculated such that the energy of each channel of the obtained pseudo reference signal is equal to the energy of the channel of the pseudo reference signal corresponding to the channel of the reference signal.

次に図９を用いて、第三実施形態におけるチャンネル数変換処理について説明を行う。図９は、本発明に係る第三実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。
なお、図３、６で説明した処理と同様の処理については簡単に説明を行う。まず、ステップＳ１１で、マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を入力する。マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を重み付け係数計算部１３ｂに出力する。また、ステップＳ１２で、参照信号入力部１１は、参照信号を入力する。参照信号入力部１１は、参照信号の各チャンネル信号を重み付け係数計算部１３ｂに出力する。次に、ステップＳ１２２で、重み付け係数計算部１３ｂでは、基準チャンネル信号選択部１３５が基準チャンネル信号を選択する。例えば、番組が報道番組の場合、２ｃｈの参照信号において支配的なのはダイアログ音声信号だと考えられる。基準チャンネル信号選択部１３５は、２２．２ｃｈの各チャンネル信号のうち、ダイアログ音声信号を多く含むチャンネル（相互相関係数が最も大きいチャンネル信号）を基準チャンネル信号として選択する。基準チャンネル信号選択部１３５は、選択した基準チャンネル信号の情報（どのチャンネルを選択したか）をチャンネル分類部１３６に出力する。ステップＳ１２３で、チャンネル分類部１３６は、基準チャンネル信号の情報とマルチチャンネル音声信号とを入力し、マルチチャンネル音声信号に含まれるチャンネル信号を、基準チャンネル信号と非基準チャンネル信号とに分類する。チャンネル分類部１３６は、分類した基準チャンネル信号と非基準チャンネル信号とを遅延補正部１３１ｂへ出力する。次に、ステップＳ１２４で、遅延補正部１３１ｂは、参照信号の各チャンネル信号に対する基準チャンネル信号の遅延を補正する。次に、ステップＳ１２５で、遅延補正部１３１ｂは、参照信号の各チャンネル信号に対する非基準チャンネル信号の遅延を補正する。このとき、遅延補正部１３１ｂは、非基準チャンネル信号のそれぞれについて遅延の補正を行ってもよい。なお、ステップＳ１２４〜Ｓ１２５の処理は両方行うことが好ましいが、どちらか一方、例えば、基準チャンネル信号の遅延の補正処理（ステップＳ１２４）だけを行ってもよい。遅延補正部１３１ｂは、基準チャンネル信号と非基準チャンネル信号とを重み付け比分析部１３２ｂへ出力する。次に、ステップＳ１４１で、重み付け比分析部１３２ｂは、基準チャンネル信号の重み付け比が非基準チャンネル信号に含まれるチャンネル信号のどの重み付け比よりも大きいという拘束条件下で、重み付け比を重回帰分析等により計算する。重み付け比分析部１３２ｂは、分析した重み付け比を重み付け比補正部１３３に出力する。次に、ステップＳ１５で、重み付け比補正部１３３は、重み付け比分析部１３２が分析した重み付け比と参照信号とを入力し、重み付け比をエネルギーに基づいて補正する。ステップＳ１６で、重み付け係数計算部１３は、重み付け比補正部１３３による補正後の重み付け係数を補正係数計算部１４へ出力する。 Next, the channel number conversion process in the third embodiment will be described with reference to FIG. FIG. 9 is a flowchart showing an example of channel number conversion processing in the third embodiment according to the present invention.
A process similar to the process described with reference to FIGS. 3 and 6 will be briefly described. First, in step S11, the multichannel audio signal input unit 12 inputs a multichannel audio signal. The multichannel audio signal input unit 12 outputs the multichannel audio signal to the weighting coefficient calculation unit 13b. In step S12, the reference signal input unit 11 inputs a reference signal. The reference signal input unit 11 outputs each channel signal of the reference signal to the weighting coefficient calculation unit 13b. Next, in step S122, in the weighting coefficient calculator 13b, the reference channel signal selector 135 selects a reference channel signal. For example, when the program is a news report program, it is considered that the dialog audio signal is dominant in the reference signal of 2ch. The reference channel signal selection unit 135 selects a channel (a channel signal having the largest cross-correlation coefficient) containing a lot of dialog audio signals as a reference channel signal among 22.2ch channel signals. The reference channel signal selection unit 135 outputs information on the selected reference channel signal (which channel has been selected) to the channel classification unit 136. In step S123, the channel classification unit 136 receives the reference channel signal information and the multi-channel audio signal, and classifies the channel signal included in the multi-channel audio signal into a reference channel signal and a non-reference channel signal. The channel classification unit 136 outputs the classified reference channel signal and non-reference channel signal to the delay correction unit 131b. Next, in step S124, the delay correction unit 131b corrects the delay of the reference channel signal with respect to each channel signal of the reference signal. Next, in step S125, the delay correction unit 131b corrects the delay of the non-reference channel signal with respect to each channel signal of the reference signal. At this time, the delay correction unit 131b may correct the delay for each of the non-reference channel signals. Note that it is preferable to perform both of the processes of steps S124 to S125, but either one may be performed, for example, only the process of correcting the delay of the reference channel signal (step S124). The delay correction unit 131b outputs the reference channel signal and the non-reference channel signal to the weighting ratio analysis unit 132b. Next, in step S141, the weighting ratio analysis unit 132b performs multiple regression analysis on the weighting ratio under the constraint that the weighting ratio of the reference channel signal is larger than any weighting ratio of the channel signal included in the non-reference channel signal. Calculate according to The weighting ratio analysis unit 132b outputs the analyzed weighting ratio to the weighting ratio correction unit 133. Next, in step S15, the weighting ratio correction unit 133 inputs the weighting ratio analyzed by the weighting ratio analysis unit 132 and the reference signal, and corrects the weighting ratio based on energy. In step S <b> 16, the weighting coefficient calculation unit 13 outputs the weighting coefficient corrected by the weighting ratio correction unit 133 to the correction coefficient calculation unit 14.

次に、ステップＳ１７で、補正係数計算部１４は、補正係数（ｂ_ｉ）を計算する。次に、ステップＳ１８で、補正係数適用部１５は、再生チャンネル数に応じて補正係数を修正する。ステップＳ１９で、補正係数適用部１５は、修正後の補正係数（ｂ＾_ｉ）をマルチチャンネル音声信号に適用し、補正係数適用後のマルチチャンネル音声信号をチャンネル数変換部１７に出力する。次に、ステップＳ２０で、チャンネル数変換部１７は、所定の方法でマルチチャンネル音声信号をチャンネル数変換する。チャンネル数変換装置１０ｂは、チャンネル数変換後の再生チャンネル音声信号を再生機器等に出力する。 Next, in step S17, the correction coefficient calculation unit 14 calculates a correction coefficient (b _i ). Next, in step S18, the correction coefficient application unit 15 corrects the correction coefficient according to the number of reproduction channels. In step S 19, the correction coefficient applying unit 15 applies the corrected correction coefficient (b _i ) to the multichannel audio signal, and outputs the multichannel audio signal after applying the correction coefficient to the channel number converting unit 17. Next, in step S20, the channel number conversion unit 17 converts the number of channels of the multichannel audio signal by a predetermined method. The channel number converter 10b outputs the playback channel audio signal after the channel number conversion to a playback device or the like.

第三実施形態によれば、第一実施形態と同様の効果を得ることができる。また、第三実施形態によれば、マルチチャンネル音声信号に含まれるチャンネル信号のうち、音声信号の特性が似ているチャンネル信号（基準チャンネル信号）だけを選択して、遅延補正処理や重み付け係数の算出処理を行うので、第一実施形態に比べ、計算量を抑えることができる。第三実施形態は、例えば、テレビの対談番組などのダイアログ音声信号が支配的な番組のマルチチャンネル音声信号を再生チャンネル音声信号に変換するような場面で用いることができる。 According to the third embodiment, the same effect as that of the first embodiment can be obtained. Further, according to the third embodiment, only the channel signal (reference channel signal) having similar audio signal characteristics is selected from the channel signals included in the multi-channel audio signal, and the delay correction process and the weighting coefficient are selected. Since the calculation process is performed, the amount of calculation can be reduced compared to the first embodiment. The third embodiment can be used in a situation where, for example, a multi-channel audio signal of a program in which a dialog audio signal is dominant, such as a TV talk program, is converted into a reproduction channel audio signal.

また、基準チャンネル信号だけに限定して以降の処理（重み付け係数の計算など）を行い、非基準チャンネル信号については処理を行わない（非基準チャンネル信号群のそれぞれの信号の強さを「０」として扱う）といった実施形態でもよい。また、基準チャンネル信号選択部１３５は、参照信号のチャンネル信号との間の相互相関係数が最も大きい（一つの）チャンネル信号を選択するだけではなく、相互相関係数が大きいチャンネル信号を、相互相関係数が大きい順に複数選択して、あるいは、相互相関係数が所定の閾値以上のチャンネル信号を選択して、選択した複数のチャンネル信号を基準チャンネル信号としてもよい。
なお、ダイアログ音声信号に含まれる音声は必ずしもダイアログ（対話）の音声に限られない。主に人の声で構成される音声信号をダイアログ音声信号としてよい。 Further, the following processing (weighting coefficient calculation, etc.) is performed only for the reference channel signal, and processing is not performed for the non-reference channel signal (the intensity of each signal of the non-reference channel signal group is “0”). May be used as an embodiment). The reference channel signal selection unit 135 not only selects the channel signal having the largest cross-correlation coefficient with the channel signal of the reference signal, but also selects the channel signal having a large cross-correlation coefficient. A plurality of selected channel signals may be selected as a reference channel signal by selecting a plurality of channels in descending order of correlation coefficient or selecting a channel signal having a cross-correlation coefficient equal to or greater than a predetermined threshold.
Note that the sound included in the dialog sound signal is not necessarily limited to the sound of the dialog (dialog). An audio signal mainly composed of a human voice may be used as the dialog audio signal.

＜第四実施形態＞
以下、本発明の第四実施形態による重み付け係数計算部を、図１０〜１１を参照して説明する。
図１０は、本発明に係る第四実施形態におけるチャンネル数変換装置の一例を示すブロック図である。
図１０に示すようにチャンネル数変換装置１０ｃは、参照信号入力部１１、マルチチャンネル音声信号入力部１２、重み付け係数計算部１３、補正係数計算部１４、補正係数適用部１５、再生チャンネル情報取得部１６、チャンネル数変換部１７、記憶部１８と、モノ信号変換部１９と、を含む。このように、第四実施形態によるチャンネル数変換装置１０ｃは、第一実施形態の構成に加え、モノ信号変換部１９を備えている。なお、第四実施形態によるチャンネル数変換装置１０ｃの他の構成は、第一実施形態のチャンネル数変換装置１０と同様である。また、重み付け係数計算部１３の構成については、図２を用いて説明したものと同様である。 <Fourth embodiment>
Hereinafter, the weighting coefficient calculation unit according to the fourth embodiment of the present invention will be described with reference to FIGS.
FIG. 10 is a block diagram showing an example of the channel number conversion apparatus in the fourth embodiment according to the present invention.
As shown in FIG. 10, the channel number conversion device 10c includes a reference signal input unit 11, a multi-channel audio signal input unit 12, a weighting coefficient calculation unit 13, a correction coefficient calculation unit 14, a correction coefficient application unit 15, and a reproduction channel information acquisition unit. 16, a channel number conversion unit 17, a storage unit 18, and a mono signal conversion unit 19. As described above, the channel number converter 10c according to the fourth embodiment includes the mono signal converter 19 in addition to the configuration of the first embodiment. In addition, the other structure of the channel number converter 10c by 4th embodiment is the same as that of the channel number converter 10 of 1st embodiment. Further, the configuration of the weighting coefficient calculator 13 is the same as that described with reference to FIG.

モノ信号変換部１９は、参照信号を所定の方法によって、参照信号をモノ信号にダウンミックスする。例えば、モノ信号変換部１９は、参照信号入力部１１が入力した２ｃｈの音声信号を１ｃｈの音声信号に変換する。２ｃｈの音声信号から１ｃｈの音声信号への変換には、公知のダウンミックス法を用いてもよい。 The mono signal converter 19 downmixes the reference signal into a mono signal by a predetermined method. For example, the mono signal converter 19 converts a 2ch audio signal input by the reference signal input unit 11 into a 1ch audio signal. A known downmix method may be used for conversion from a 2ch audio signal to a 1ch audio signal.

第四実施形態では、モノ信号に対して重み付け比を計算したり、補正係数を計算したりするため、第一実施形態で説明したそれらの値の計算式と異なる部分がある。例えば、重み付け比補正部１３３は、以下の式（８）から定数ｃを計算する。 In the fourth embodiment, since a weighting ratio is calculated for a mono signal or a correction coefficient is calculated, there are portions different from the calculation formulas of those values described in the first embodiment. For example, the weighting ratio correction unit 133 calculates the constant c from the following equation (8).

また、補正係数計算部１４は、第一実施形態と同様、重み付け係数計算部１３が計算した重み付け係数を用いて補正係数を計算するが、第四実施形態の場合、補正係数計算部１４ｃは、参照信号を変換したモノ信号に対する重み付け係数のみを入力する。従って、第一実施形態と異なり、２ｃｈ分の重み付け係数を統合する必要が無い。具体的には、補正係数計算部１４は、上記の式（８）から算出される以下の関係式（９）によって、補正係数ｂ_１を導出する。
（ｂ_１）^２＝ｃ×（ａ_１）^２・・・（９）
補正係数計算部１４は、他の補正係数ｂ_２〜ｂ_２４の値についても同様に計算する。 The correction coefficient calculator 14 calculates the correction coefficient using the weighting coefficient calculated by the weighting coefficient calculator 13 as in the first embodiment. In the fourth embodiment, the correction coefficient calculator 14c Only the weighting coefficient for the mono signal converted from the reference signal is input. Therefore, unlike the first embodiment, it is not necessary to integrate the weighting coefficients for 2ch. Specifically, the correction coefficient calculation unit 14 derives the correction coefficient b ₁ by the following relational expression (9) calculated from the above expression (8).
(B ₁ ) ² = c × (a ₁ ) ² (9)
The correction coefficient calculation unit 14 similarly calculates the values of the other correction coefficients b _{2 to} b ₂₄ .

次に図１１を用いて、第四実施形態におけるチャンネル数変換処理について説明を行う。図１１は、本発明に係る第四実施形態におけるチャンネル数変換処理の一例を示すフローチャートである。
なお、図３で説明した処理と同様の処理については簡単に説明を行う。まず、ステップＳ１１で、マルチチャンネル音声信号入力部１２は、マルチチャンネル音声信号を入力する。また、ステップＳ１２で、参照信号入力部１１は、参照信号を入力する。参照信号入力部１１は、参照信号をモノ信号変換部１９へ出力する。次に、ステップＳ１２６で、モノ信号変換部１９は、参照信号を１ｃｈのモノ信号に変換し、モノ信号を重み付け係数計算部１３へ出力する。次に重み付け係数計算部１３では、遅延補正部１３１が、モノ信号とマルチチャンネル音声信号とを入力する。ステップＳ１３で、遅延補正部１３１は、モノ信号に対するマルチチャンネル音声信号の遅延を補正する。次に、ステップＳ１４で、重み付け比分析部１３２は、遅延補正後のマルチチャンネル音声信号とモノ信号とを入力し、モノ信号に対するマルチチャンネル音声信号の各チャンネル信号の重み付け比を、重回帰分析等を用いて計算する。重み付け比分析部１３２は、分析した重み付け比を重み付け比補正部１３３に出力する。次に、ステップＳ１５で、重み付け比補正部１３３は、重み付け比分析部１３２が分析した重み付け比とモノ信号とを入力し、重み付け比をエネルギーに基づいて補正する。第四実施形態では、重み付け比補正部１３３は、モノ信号のエネルギーと、重み付け比をマルチチャンネル音声信号の各チャンネル信号に乗じて得た擬似モノ信号のエネルギーとが等しくなるような定数ｃを計算し、各重み付け比ａ_ｉにｃの平方根を乗じた重み付け係数を計算する。ステップＳ１６で、重み付け係数計算部１３は、重み付け係数（補正係数と同じ値）を補正係数計算部１４へ出力する。補正係数計算部１４は、補正係数を補正係数適用部１５へ出力する。 Next, the channel number conversion process in the fourth embodiment will be described with reference to FIG. FIG. 11 is a flowchart showing an example of channel number conversion processing in the fourth embodiment according to the present invention.
A process similar to the process described with reference to FIG. 3 will be briefly described. First, in step S11, the multichannel audio signal input unit 12 inputs a multichannel audio signal. In step S12, the reference signal input unit 11 inputs a reference signal. The reference signal input unit 11 outputs the reference signal to the mono signal conversion unit 19. Next, in step S126, the mono signal conversion unit 19 converts the reference signal into a 1ch mono signal, and outputs the mono signal to the weighting coefficient calculation unit 13. Next, in the weighting coefficient calculation unit 13, the delay correction unit 131 inputs a mono signal and a multi-channel audio signal. In step S13, the delay correction unit 131 corrects the delay of the multi-channel audio signal with respect to the mono signal. Next, in step S14, the weighting ratio analysis unit 132 inputs the multichannel audio signal and the mono signal after delay correction, and performs a multiple regression analysis or the like on the weighting ratio of each channel signal of the multichannel audio signal to the mono signal. Calculate using. The weighting ratio analysis unit 132 outputs the analyzed weighting ratio to the weighting ratio correction unit 133. Next, in step S15, the weighting ratio correction unit 133 inputs the weighting ratio and the mono signal analyzed by the weighting ratio analysis unit 132, and corrects the weighting ratio based on energy. In the fourth embodiment, the weighting ratio correction unit 133 calculates a constant c such that the mono signal energy is equal to the pseudo mono signal energy obtained by multiplying each channel signal of the multichannel audio signal by the weighting ratio. Then, a weighting coefficient obtained by multiplying each weighting ratio a _i by the square root of c is calculated. In step S <b> 16, the weighting coefficient calculator 13 outputs the weighting coefficient (the same value as the correction coefficient) to the correction coefficient calculator 14. The correction coefficient calculation unit 14 outputs the correction coefficient to the correction coefficient application unit 15.

次に、ステップＳ１８で、補正係数適用部１５は、再生チャンネル数に応じて補正係数を修正する。ステップＳ１９で、補正係数適用部１５は、修正後の補正係数をマルチチャンネル音声信号に適用する。次に、ステップＳ２０で、チャンネル数変換部１７は、所定のレンダリング方法で補正係数適用後のマルチチャンネル音声信号を補正係数適用部１５から入力し、チャンネル数変換する。チャンネル数変換装置１０ｃは、チャンネル数変換後の再生チャンネル音声信号を再生機器等に出力する。 Next, in step S18, the correction coefficient application unit 15 corrects the correction coefficient according to the number of reproduction channels. In step S19, the correction coefficient application unit 15 applies the corrected correction coefficient to the multichannel audio signal. Next, in step S20, the channel number conversion unit 17 inputs the multichannel audio signal after applying the correction coefficient by the predetermined rendering method from the correction coefficient application unit 15, and converts the number of channels. The channel number conversion device 10c outputs the playback channel audio signal after the channel number conversion to a playback device or the like.

第四実施形態によれば、第一実施形態と同様の効果を得ることができる。また、補正係数を計算等の処理を、モノ信号を参照して行うので、第一実施形態に比べ加え、計算量を抑えることができる。なお、第四実施形態は、第二実施形態、または、第三実施形態に適用してもよい。 According to the fourth embodiment, the same effect as that of the first embodiment can be obtained. Further, since the processing such as calculation of the correction coefficient is performed with reference to the mono signal, the amount of calculation can be reduced as compared with the first embodiment. Note that the fourth embodiment may be applied to the second embodiment or the third embodiment.

なお、上述のチャンネル数変換装置１０、１０ａ、１０ｂ、１０ｃは、内部にコンピュータシステムを有している。そして、チャンネル数変換装置１０等の動作の過程は、プログラムの形式でコンピュータ読み取り可能な記録媒体に記憶されており、このプログラムをコンピュータシステムが読み出して実行することによって、上記処理が行われる。ここでいうコンピュータシステムとは、ＣＰＵ及び各種メモリやＯＳ、周辺機器等のハードウェアを含むものである。 Note that the above-described channel number conversion apparatuses 10, 10a, 10b, and 10c have a computer system therein. The operation process of the channel number conversion device 10 and the like is stored in a computer-readable recording medium in the form of a program, and the above processing is performed by the computer system reading and executing this program. The computer system here includes a CPU, various memories, an OS, and hardware such as peripheral devices.

また、「コンピュータシステム」は、ＷＷＷシステムを利用している場合であれば、ホームページ提供環境（あるいは表示環境）も含むものとする。
また、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ−ＲＯＭ等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。さらに「コンピュータ読み取り可能な記録媒体」とは、インターネット等のネットワークや電話回線等の通信回線を介してプログラムを送信する場合の通信線のように、短時間の間、動的にプログラムを保持するもの、その場合のサーバやクライアントとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含むものとする。また上記プログラムは、前述した機能の一部を実現するためのものであってもよく、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであってもよい。 Further, the “computer system” includes a homepage providing environment (or display environment) if a WWW system is used.
The “computer-readable recording medium” refers to a storage device such as a flexible medium, a magneto-optical disk, a portable medium such as a ROM and a CD-ROM, and a hard disk incorporated in a computer system. Furthermore, the “computer-readable recording medium” dynamically holds a program for a short time like a communication line when transmitting a program via a network such as the Internet or a communication line such as a telephone line. In this case, a volatile memory in a computer system serving as a server or a client in that case, and a program that holds a program for a certain period of time are also included. The program may be a program for realizing a part of the functions described above, and may be a program capable of realizing the functions described above in combination with a program already recorded in a computer system.

その他、本発明の趣旨を逸脱しない範囲で、上記した実施の形態における構成要素を周知の構成要素に置き換えることは適宜可能である。また、この発明の技術範囲は上記の実施形態に限られるものではなく、本発明の趣旨を逸脱しない範囲において種々の変更を加えることが可能である。 In addition, it is possible to appropriately replace the components in the above-described embodiments with known components without departing from the spirit of the present invention. The technical scope of the present invention is not limited to the above-described embodiment, and various modifications can be made without departing from the spirit of the present invention.

１０、１０ａ、１０ｂ、１０ｃ・・・チャンネル数変換装置
１１・・・参照信号入力部
１２・・・マルチチャンネル音声信号入力部
１３、１３ａ、１３ｂ・・・重み付け係数計算部
１３１、１３１ｂ・・・遅延補正部
１３２、１３２ｂ・・・重み付け比分析部
１３３・・・重み付け比補正部
１３４・・・グルーピング部
１３５・・・基準チャンネル信号選択部
１３６・・・チャンネル分類部
１４・・・補正係数計算部
１５・・・補正係数適用部
１６・・・再生チャンネル情報取得部
１７・・・チャンネル数変換部
１８・・・記憶部
１９・・・モノ信号変換部 10, 10a, 10b, 10c ... Channel number conversion device 11 ... Reference signal input unit 12 ... Multi-channel audio signal input unit 13, 13a, 13b ... Weighting coefficient calculation unit 131, 131b ... Delay correction unit 132, 132b ... Weighting ratio analysis unit 133 ... Weighting ratio correction unit 134 ... Grouping unit 135 ... Reference channel signal selection unit 136 ... Channel classification unit 14 ... Correction coefficient calculation 15 ... Correction coefficient application unit 16 ... Playback channel information acquisition unit 17 ... Channel number conversion unit 18 ... Storage unit 19 ... Mono signal conversion unit

Claims

Weighting coefficient calculation for inputting a multichannel audio signal and a reference signal corresponding to the multichannel audio signal, and calculating a weighting coefficient corresponding to each channel signal of the multichannel audio signal included in each channel of the reference signal And
A correction coefficient calculator that calculates a correction coefficient to be multiplied to each channel signal of the multi-channel audio signal based on the weighting coefficient;
A correction coefficient applying unit that applies the correction coefficient to the multi-channel audio signal;
A channel number conversion unit that converts the multichannel audio signal to which the correction coefficient is applied into a reproduction channel signal having a desired number of channels by a predetermined channel number conversion method;
A channel number conversion device comprising:

A delay correction unit that corrects a delay between the reference signal and each channel signal of the multi-channel audio signal;
The channel number conversion apparatus according to claim 1, further comprising:

The weighting coefficient calculation unit inputs the multichannel audio signal and the reference signal, and analyzes a weighting ratio for each channel signal of the multichannel audio signal;
The channel number conversion apparatus according to claim 1 or 2, further comprising:

The weighting coefficient calculation unit includes a sum of energy of each channel signal of the reference signal and a signal energy obtained by multiplying each channel signal of the multichannel audio signal corresponding to each channel of the reference signal by the weighting ratio. A weighting ratio correction unit for correcting the weighting ratio to be equal;
The channel number conversion device according to claim 3, further comprising:

The weighting factor calculator is
A grouping unit for grouping each channel signal of the multi-channel audio signal based on the similarity of each channel signal, and generating a group signal representing the group based on the channel signal belonging to the group;
The weighting coefficient calculator calculates a weighting coefficient for the group signal;
5. The channel number conversion device according to claim 2, wherein the number of channels is converted.

The grouping unit may be any one of an average of the channel signals, a channel signal at the center of gravity of the similarity of the channel signals, and a channel signal having the maximum energy among the channel signals based on the channel signals belonging to the group. Or as the group signal,
The number-of-channels conversion device according to claim 5.

The weighting factor calculator is
A reference channel signal selection unit that selects one or a plurality of channel signals based on a cross-correlation coefficient with the reference signal from each channel signal of the multi-channel audio signal;
The weighting coefficient calculation unit calculates a weighting coefficient on the condition that the weighting coefficient of the selected channel signal is larger than the weighting coefficients of other channel signals.
5. The channel number conversion device according to claim 2, wherein the number of channels is converted.

The weighting factor calculator is
A reference channel signal selection unit that selects one or a plurality of channel signals based on a cross-correlation coefficient with the reference signal from each channel signal of the multi-channel audio signal;
The weighting coefficient calculator calculates a weighting coefficient only for the channel signal selected by the reference channel signal selector.
5. The channel number conversion device according to claim 2, wherein the number of channels is converted.

The correction coefficient calculation unit includes the sum of the energy of the signal obtained by applying the weighting coefficient to each channel signal of the multichannel audio signal corresponding to the total energy of the reference signal or each channel of the reference signal, and the multichannel audio Calculating the correction coefficient so that the sum of energy of signals obtained by applying the correction coefficient to each channel signal of the signal is equal;
9. The channel number converter according to claim 1, wherein the number of channels is converted.

The correction coefficient application unit corrects the predetermined initial value of the correction coefficient corresponding to the number of channels of the multi-channel audio signal and the correction coefficient calculated by the correction coefficient calculation unit and corresponding to the number of channels of the reference signal. And correcting the correction coefficient calculated by the correction coefficient calculation unit according to the number of channels of the reproduction channel signal by interpolation based on the coefficient,
10. The channel number conversion apparatus according to claim 1, wherein

11. The channel number conversion apparatus according to claim 1, further comprising: a mono signal conversion unit that converts the reference signal into a mono signal by a predetermined channel number conversion method.

A program for causing a computer to function as the channel number conversion device according to any one of claims 1 to 11.