JP5843705B2

JP5843705B2 - Audio control device, audio reproduction device, television receiver, audio control method, program, and recording medium

Info

Publication number: JP5843705B2
Application number: JP2012138097A
Authority: JP
Inventors: 藤井　修; 修藤井
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 2012-06-19
Filing date: 2012-06-19
Publication date: 2016-01-13
Anticipated expiration: 2032-06-19
Also published as: JP2014003493A

Description

本発明は、複数のスピーカから出力される音声を制御することが可能な音声制御装置、音声再生装置、テレビジョン受像機、音声制御方法、プログラム、および記録媒体に関する。 The present invention relates to an audio control device, an audio reproduction device, a television receiver, an audio control method, a program, and a recording medium that can control audio output from a plurality of speakers.

従来、映像コンテンツを視聴する際、臨場感あるサラウンド効果を得るため、５．１チャンネルなどのマルチチャンネル音響方式が採用された音響システム（以下、「マルチチャンネル音響システム」と称する）が利用されている。マルチチャンネル音響システムにおいては、複数のチャンネルを含み、各チャンネルに対応する理想的な音声出力位置が予め定められたマルチチャンネル音声信号が利用される。マルチチャンネル音声信号を構成する各チャンネルの音声信号が表す音声を、そのチャンネルに対応する音声出力位置に配置されたスピーカから出力することによって、あたかも音場の中に居るような感覚をユーザに与えることができる。 Conventionally, in order to obtain a realistic surround effect when viewing video content, an acoustic system (hereinafter referred to as “multi-channel acoustic system”) employing a 5.1 channel multi-channel acoustic system has been used. Yes. In a multi-channel sound system, a multi-channel audio signal including a plurality of channels and having an ideal audio output position corresponding to each channel is used in advance. By outputting the sound represented by the sound signal of each channel constituting the multi-channel sound signal from the speaker arranged at the sound output position corresponding to the channel, the user feels as if they are in the sound field. be able to.

特に、近年では、スーパーハイビジョンへの利用が予定されているマルチチャンネル音響方式として、２２．２チャンネルのマルチチャンネル音響方式が策定されている。この２２．２チャンネルのマルチチャンネル音響方式は、３次元空間における３つのチャンネル層（上層９チャンネル、中層１０チャンネル、下層３チャンネル）を含んでおり、さらに、ＬＦＥ（Low Frequency Effects：低域効果）用の２つのチャンネルを含んで構成されている。これにより、２２．２チャンネルの音響方式を採用したマルチチャンネル音響システムは、自然かつ高品質な３次元音響効果が得られるとされている。 In particular, in recent years, a 22.2 channel multi-channel sound system has been formulated as a multi-channel sound system scheduled to be used for Super Hi-Vision. This 22.2 channel multi-channel sound system includes three channel layers (upper layer 9 channels, middle layer 10 channels, lower layer 3 channels) in a three-dimensional space, and LFE (Low Frequency Effects). It is configured to include two channels. As a result, a multi-channel sound system employing a 22.2 channel sound system is said to provide a natural and high-quality three-dimensional sound effect.

このようなマルチチャンネル音響システムにおいて、良好なサラウンド効果を得るためには、そのチャンネル数に応じた数のスピーカを、予め定められた適切な位置に配置する必要がある。 In such a multi-channel sound system, in order to obtain a favorable surround effect, it is necessary to arrange a number of speakers corresponding to the number of channels at appropriate positions determined in advance.

しかしながら、実際には、設置スペース、設置コスト等の制約により、音響方式のチャンネル数よりも少ない数のスピーカしか設置できない場合や、スピーカを予め定められた位置に設置できない場合が多い。 However, in reality, there are many cases where only a smaller number of speakers than the number of channels of the acoustic system can be installed due to restrictions such as installation space and installation cost, or speakers cannot be installed at predetermined positions.

そこで、従来、マルチチャンネル音響システムに関し、予め定められた数のスピーカを予め定められた適切な位置に配置できない場合であっても、出来る限り良好なサラウンド効果を得ることを可能とするための、様々な技術が考案されている。 Therefore, conventionally, with respect to a multi-channel sound system, even when a predetermined number of speakers cannot be arranged at a predetermined appropriate position, it is possible to obtain the best possible surround effect. Various techniques have been devised.

例えば、下記特許文献１には、音響再生装置において、スピーカの位置が通常モードかＮＦＬ（ニアフィールドリスニング）モードかを検出し、ＮＦＬモードの場合は、上記音響信号のチャンネル数に応じた各方向の仮想音源がその方向から聞こえるように、上記音響信号を空間音場処理することが記載されている。この技術によると、スピーカを聴取者の近くに置いても本来の音質および音場感を表現することができるとされている。 For example, in Patent Document 1 below, in a sound reproduction device, it is detected whether the position of a speaker is a normal mode or an NFL (near field listening) mode, and in the NFL mode, each direction according to the number of channels of the acoustic signal It is described that the acoustic signal is subjected to spatial sound field processing so that the virtual sound source can be heard from that direction. According to this technology, it is said that the original sound quality and sound field feeling can be expressed even if a speaker is placed near the listener.

また、下記特許文献２には、複数のスピーカの各々の位置を判定し、各スピーカの位置に応じて、ステレオ信号の出力先を切り替えることが記載されている。この技術によると、複数のスピーカがどのチャンネルのスピーカであるかを考慮しなくても、確実にサラウンド効果を得ることができるとされている。 Japanese Patent Application Laid-Open No. 2004-228561 describes that the position of each of a plurality of speakers is determined and the output destination of a stereo signal is switched according to the position of each speaker. According to this technology, the surround effect can be surely obtained without considering which channel the plurality of speakers are.

特開２００８−３１２０９６号公報（２００８年１２月２５日公開）JP 2008-312096 A (released on December 25, 2008) 特開２００７−６０２５３号公報（２００７年３月８日公開）JP 2007-60253 A (published March 8, 2007)

しかしながら、従来技術では、上記した２２．２チャンネルの音響方式のような、３次元空間に複数のチャンネルが配置されたマルチチャンネル音響方式には対応していない。すなわち、従来技術では、スピーカの水平方向の位置に応じて、音響効果を制御することができたとしても、スピーカの垂直（上下）方向の位置に応じて音響効果を制御することはできない。このため、従来技術では、３次元空間における適切な位置（すなわち、水平方向および垂直方向の双方とも適切な位置）にスピーカが配置されていない場合において、上記マルチチャンネル音響方式用の音声信号を再生する場合、コンテンツに本来含まれる垂直方向と水平方向の立体的なサラウンド効果を良好に得ることができない。 However, the conventional technology does not support a multi-channel sound method in which a plurality of channels are arranged in a three-dimensional space, such as the 22.2 channel sound method described above. That is, in the related art, even if the acoustic effect can be controlled according to the horizontal position of the speaker, the acoustic effect cannot be controlled according to the vertical (vertical) direction position of the speaker. For this reason, in the prior art, when the speaker is not arranged at an appropriate position in the three-dimensional space (that is, an appropriate position in both the horizontal direction and the vertical direction), the audio signal for the multi-channel acoustic system is reproduced. In this case, the three-dimensional surround effect in the vertical and horizontal directions originally included in the content cannot be obtained satisfactorily.

本発明は、上記の問題に鑑みてなされたものであり、その目的は、３次元空間において適切な位置にスピーカが配置されていない場合であっても、良好なサラウンド効果を得ることにある。 The present invention has been made in view of the above problems, and an object thereof is to obtain a good surround effect even when a speaker is not disposed at an appropriate position in a three-dimensional space.

複数のチャンネルを含むマルチチャンネル音声信号であって、各チャンネルに対応する音声出力位置が予め定められたマルチチャンネル音声信号を、複数の出力チャンネルを介して出力する音声制御装置において、
前記出力チャンネル毎に、当該出力チャンネルに接続されたスピーカの３次元空間における位置と、前記複数のチャンネルの各々に対応する音声出力位置とに基づいて、前記複数のチャンネルの中から、当該出力チャンネルに割り当てる主チャンネルを決定する主チャンネル決定手段と、
前記出力チャンネル毎に、前記複数のチャンネルの中から、当該出力チャンネルに割り当てる副チャンネルを決定する副チャンネル決定手段と、
前記出力チャンネル毎に、当該出力チャンネルに割り当てた前記主チャンネルの音声信号と、当該出力チャンネルに割り当てた前記副チャンネルの音声信号とを用いて、当該出力チャンネルを介して出力する出力音声信号を生成する出力音声信号生成手段とを備えることを特徴とする。 In a multi-channel audio signal including a plurality of channels, wherein a multi-channel audio signal having a predetermined audio output position corresponding to each channel is output via a plurality of output channels,
For each of the output channels, the output channel is selected from the plurality of channels based on the position of the speaker connected to the output channel in the three-dimensional space and the audio output position corresponding to each of the plurality of channels. A main channel determining means for determining a main channel to be assigned to
Sub-channel determining means for determining a sub-channel assigned to the output channel from the plurality of channels for each of the output channels;
For each output channel, an output audio signal to be output via the output channel is generated using the audio signal of the main channel assigned to the output channel and the audio signal of the sub-channel assigned to the output channel. And an output audio signal generating means.

本音声制御装置によれば、出力チャンネル毎に、スピーカの位置に応じた適切なチャンネルを主チャンネルとして決定することができる。このため、各スピーカから、そのスピーカの位置に応じた適切なチャンネルの音声を、主音声として出力することができる。特に、本音声制御装置によれば、３次元空間における垂直方向の位置を考慮して、主チャンネルを決定しているため、３次元空間において立体的に配置された複数のチャンネルの中から、スピーカの位置に応じた適切なチャンネルを主チャンネルとして決定することができる。 According to this audio control apparatus, an appropriate channel corresponding to the position of the speaker can be determined as the main channel for each output channel. For this reason, the sound of the appropriate channel according to the position of the speaker can be output from each speaker as the main sound. In particular, according to the audio control apparatus, the main channel is determined in consideration of the position in the vertical direction in the three-dimensional space, and therefore the speaker is selected from a plurality of channels arranged three-dimensionally in the three-dimensional space. It is possible to determine an appropriate channel according to the position as the main channel.

すなわち、本音声制御装置によれば、スピーカの水平方向および垂直方向の位置に応じて、音響効果を制御することができるため、３次元空間において適切な位置（水平方向および垂直方向の双方の適切な位置）にスピーカが配置されていない場合であっても、良好なサラウンド効果を得ることができる。 That is, according to the sound control device, the acoustic effect can be controlled according to the position of the speaker in the horizontal direction and the vertical direction, so that the appropriate position in the three-dimensional space (appropriate in both the horizontal direction and the vertical direction). Even if a speaker is not disposed at a desired position, a good surround effect can be obtained.

また、本音声制御装置によれば、出力チャンネル毎に、対応する主チャンネルをスピーカの位置に応じて決定しているため、例えばユーザがスピーカの接続端子や設置位置を間違えたとしても、そのスピーカの位置に応じた適切なチャンネルを、主チャンネルとして決定することができる。すなわち、各スピーカがどのように接続されたとしても、各スピーカからその位置に応じた適切なチャンネルの音声を出力することができる。 In addition, according to the audio control device, for each output channel, the corresponding main channel is determined according to the position of the speaker. For example, even if the user mistakes the connection terminal or installation position of the speaker, the speaker An appropriate channel in accordance with the position of can be determined as the main channel. That is, no matter how the speakers are connected, the sound of an appropriate channel corresponding to the position can be output from each speaker.

また、本音声制御装置によれば、複数のチャンネルの各々について音声出力位置が予め定められており、殆どの処理を、この音声出力位置に基づいて行っているため、処理をパターン化することが可能であり、複雑な演算処理等を行う必要なく、より高速な処理を実現することができる。 Further, according to the audio control apparatus, since the audio output position is predetermined for each of the plurality of channels, and most of the processing is performed based on the audio output position, the processing can be patterned. It is possible, and higher-speed processing can be realized without having to perform complicated arithmetic processing.

上記音声制御装置において、前記出力音声信号生成手段は、前記出力チャンネル毎に、当該出力チャンネルに割り当てた主チャンネルの音声信号と、当該出力チャンネルに割り当てた副チャンネルの音声信号とをダウンミックスすることにより、当該出力チャンネルを介して出力する出力音声信号を生成することが好ましい。 In the audio control device, the output audio signal generation means downmixes, for each output channel, the audio signal of the main channel assigned to the output channel and the audio signal of the subchannel assigned to the output channel. Thus, it is preferable to generate an output audio signal to be output via the output channel.

この構成によれば、各スピーカから、そのスピーカの位置に応じた適切な主チャンネルの音声が主音声としてダウンミックスされた出力音声信号を出力することができるため、良好なサラウンド効果を得ることができる。
・上記音声制御装置において、前記出力音声信号生成手段は、前記出力チャンネル毎に、当該出力チャンネルに割り当てた副チャンネルの音声信号が表す音声が、該副チャンネルに対応する音声出力位置に定位するように、当該出力チャンネルに割り当てた副チャンネルの音声信号の周波数特性を、頭部伝達関数を用いて補正した後、当該出力チャンネルに割り当てた主チャンネルの音声信号と、当該出力チャンネルに割り当てた副チャンネルの信号とをダウンミックスすることにより、当該出力チャンネルを介して出力する出力音声信号を生成することが好ましい。 According to this configuration, it is possible to output an output audio signal in which the sound of the appropriate main channel corresponding to the position of the speaker is downmixed as the main audio from each speaker, so that a favorable surround effect can be obtained. it can.
In the audio control device, the output audio signal generation unit is configured to localize the audio represented by the audio signal of the subchannel assigned to the output channel at the audio output position corresponding to the subchannel for each output channel. After correcting the frequency characteristics of the audio signal of the sub-channel assigned to the output channel using the head related transfer function, the audio signal of the main channel assigned to the output channel and the sub-channel assigned to the output channel It is preferable to generate an output audio signal to be output via the output channel by down-mixing the above signal.

特に、上記音声制御装置において、前記出力音声信号生成手段は、前記出力チャンネル毎に、当該出力チャンネルに割り当てた副チャンネルの音声信号を、前記３次元空間における水平方向の頭部伝達関数と、前記３次元空間における垂直方向の頭部伝達関数とを用いて補正することが好ましい。 In particular, in the audio control device, the output audio signal generating means, for each output channel, the audio signal of the subchannel assigned to the output channel, the head related transfer function in the horizontal direction in the three-dimensional space, and the It is preferable to perform correction using a vertical head related transfer function in a three-dimensional space.

この構成によれば、各チャンネルの音声を、３次元空間における所定の音声出力位置に定位させることができる。すなわち、各チャンネルの音声を、３次元空間における水平方向のみならず垂直方向についても、所定の音声出力位置に定位させることができる。これにより、視聴者に対し、各チャンネルの音声を、上記所定の音声出力位置から聞こえてくるように感じさせることができる。このため、３次元空間において適切な位置にスピーカが配置されていない場合であっても、より良好かつ立体的なサラウンド効果を得ることができる。 According to this configuration, the sound of each channel can be localized at a predetermined sound output position in the three-dimensional space. That is, the sound of each channel can be localized at a predetermined sound output position not only in the horizontal direction but also in the vertical direction in the three-dimensional space. This makes it possible for the viewer to feel that the sound of each channel is heard from the predetermined sound output position. For this reason, even if the speaker is not disposed at an appropriate position in the three-dimensional space, a better and three-dimensional surround effect can be obtained.

上記音声制御装置において、前記主チャンネル決定手段は、前記出力チャンネル毎に、当該出力チャンネルに接続されたスピーカの位置の、基準位置からの方向である第１の方向と、前記複数のチャンネルの各々の前記音声出力位置の、前記基準位置からの方向である第２の方向とに基づいて、前記第２の方向が前記第１の方向に最も近似するチャンネルを、当該出力チャンネルに割り当てる主チャンネルとして決定することが好ましい。 In the audio control device, the main channel determination means includes, for each output channel, a first direction that is a direction from a reference position of a speaker connected to the output channel, and each of the plurality of channels. Based on the second direction, which is the direction from the reference position, of the audio output position, the channel whose second direction is the closest to the first direction is the main channel assigned to the output channel. It is preferable to determine.

この構成によれば、より効率的かつ適切に、各出力チャンネルに割り当てる主チャンネルを決定することができる。 According to this configuration, the main channel to be assigned to each output channel can be determined more efficiently and appropriately.

上記音声制御装置において、前記複数のチャンネルは、予め領域毎にグループ化されており、前記副チャンネル決定手段は、前記出力チャンネル毎に、当該出力チャンネルに割り当てた前記主チャンネルに対応する前記グループに含まれているチャンネルを、当該出力チャンネルに割り当てる副チャンネルとして決定することが好ましい。 In the audio control apparatus, the plurality of channels are grouped in advance for each region, and the sub-channel determination unit assigns, for each output channel, the group corresponding to the main channel assigned to the output channel. Preferably, the included channel is determined as a sub-channel assigned to the output channel.

この構成によれば、より効率的かつ適切に、各出力チャンネルに割り当てる副チャンネルを決定することができる。 According to this configuration, it is possible to determine the sub-channel assigned to each output channel more efficiently and appropriately.

上記音声制御装置において、前記出力チャンネル毎に、当該出力チャンネルに接続されたスピーカの位置を検出する検出手段をさらに備えることが好ましい。 The voice control device preferably further includes detection means for detecting a position of a speaker connected to the output channel for each output channel.

この構成によれば、より効率的かつ確実に、スピーカの位置を特定することができる。 According to this configuration, the position of the speaker can be specified more efficiently and reliably.

また、本発明に係る音声再生装置は、上記音声制御装置を備えたことを特徴とする。 In addition, a sound reproducing device according to the present invention includes the above sound control device.

本音声再生装置によれば、上記音声制御装置と同様の効果を奏する音声再生装置を提供することができる。 According to this audio reproduction device, it is possible to provide an audio reproduction device that has the same effect as the audio control device.

また、本発明に係るテレビジョン受像機は、上記音声制御装置を備えたことを特徴とする。 In addition, a television receiver according to the present invention includes the above-described audio control device.

本テレビジョン受像機によれば、上記音声制御装置と同様の効果を奏するテレビジョン受像機を提供することができる。 According to the present television receiver, it is possible to provide a television receiver that has the same effect as the sound control device.

また、本発明に係る音声制御方法は、複数のチャンネルを含むマルチチャンネル音声信号であって、各チャンネルに対応する音声出力位置が予め定められたマルチチャンネル音声信号を、複数の出力チャンネルを介して出力する音声制御方法において、前記出力チャンネル毎に、当該出力チャンネルに接続されたスピーカの３次元空間における位置と、前記複数のチャンネルの各々に対応する音声出力位置とに基づいて、前記複数のチャンネルの中から、当該出力チャンネルに割り当てる主チャンネルを決定する主チャンネル決定工程と、前記出力チャンネル毎に、前記複数のチャンネルの中から、当該出力チャンネルに割り当てる副チャンネルを決定する副チャンネル決定工程と、前記出力チャンネル毎に、当該出力チャンネルに割り当てた前記主チャンネルの音声信号と、当該出力チャンネルに割り当てた前記副チャンネルの音声信号とを用いて、当該出力チャンネルを介して出力する出力音声信号を生成する出力音声信号生成工程とを含むことを特徴とする。 The audio control method according to the present invention is a multi-channel audio signal including a plurality of channels, and a multi-channel audio signal having a predetermined audio output position corresponding to each channel is transmitted via the plurality of output channels. In the audio control method for outputting, for each of the output channels, the plurality of channels based on a position in a three-dimensional space of a speaker connected to the output channel and an audio output position corresponding to each of the plurality of channels. A main channel determining step for determining a main channel to be assigned to the output channel, a sub channel determining step for determining a sub channel to be assigned to the output channel from the plurality of channels for each output channel, Assigned to the output channel for each output channel An output audio signal generating step of generating an output audio signal to be output through the output channel using the audio signal of the main channel and the audio signal of the sub-channel assigned to the output channel. And

本音声制御方法によれば、当該音声制御方法を実行することにより、上記音声制御装置と同様の効果を奏することができる。 According to this voice control method, the same effect as the voice control apparatus can be obtained by executing the voice control method.

また、本発明に係るプログラムは、コンピュータを上記音声制御装置として機能させるためのプログラムであって、前記コンピュータを前記音声制御装置が備える前記各手段として機能させる。 The program according to the present invention is a program for causing a computer to function as the voice control device, and causes the computer to function as each unit included in the voice control device.

本プログラムによれば、コンピュータが当該プログラムを実行することにより、このコンピュータは、上記音声制御装置と同様の効果を奏することができる。 According to this program, when the computer executes the program, the computer can achieve the same effects as the voice control device.

また、本発明に係る記録媒体は、上記プログラムを記録しているコンピュータ読み取り可能な記録媒体である。 A recording medium according to the present invention is a computer-readable recording medium in which the program is recorded.

本記録媒体によれば、上記音声制御装置と同様の効果を奏することを実現可能とする上記プログラムを当該記録媒体により提供することができる。 According to this recording medium, it is possible to provide the program that makes it possible to realize the same effects as the sound control apparatus.

本発明に係る音声制御装置、テレビジョン受像機、音声制御方法、プログラム、および記録媒体によれば、３次元空間において適切な位置にスピーカが配置されていない場合であっても、良好なサラウンド効果を得ることができる。 According to the sound control device, the television receiver, the sound control method, the program, and the recording medium according to the present invention, even when the speaker is not disposed at an appropriate position in the three-dimensional space, a good surround effect is obtained. Can be obtained.

実施形態に係るテレビジョン受像機の構成を示すブロック図である。It is a block diagram which shows the structure of the television receiver which concerns on embodiment. 本実施形態のテレビジョン受像機が備える音声制御回路の構成を示すブロック図である。It is a block diagram which shows the structure of the audio | voice control circuit with which the television receiver of this embodiment is provided. 実施形態に係る音声制御回路による音声制御処理の手順を示すフローチャートである。It is a flowchart which shows the procedure of the audio | voice control process by the audio | voice control circuit which concerns on embodiment. 記憶部に記憶されているベクトルデータベースの一例を示す。An example of the vector database memorize | stored in the memory | storage part is shown. 図４に示すベクトルデータベースに定義されている各音声出力位置を示す模式図である。It is a schematic diagram which shows each audio | voice output position defined in the vector database shown in FIG. 記憶部に記憶されている３次元空間における水平方向（前後方向および左右方向）の頭部伝達関数の一例を示す。An example of the head-related transfer function in the horizontal direction (front-rear direction and left-right direction) in the three-dimensional space stored in the storage unit is shown. 図４に示すベクトルデータベースに定義されている各音声出力位置の方向を示す。The direction of each voice output position defined in the vector database shown in FIG. 4 is shown.

以下、本発明の実施形態について、図面を用いて説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

（テレビジョン受像機１００の概要）
はじめに、図１を参照して、実施形態に係るテレビジョン受像機１００の概要および構成について説明する。図１は、実施形態に係るテレビジョン受像機１００の構成を示すブロック図である。 (Outline of the television receiver 100)
First, with reference to FIG. 1, the outline | summary and structure of the television receiver 100 which concern on embodiment are demonstrated. FIG. 1 is a block diagram illustrating a configuration of a television receiver 100 according to the embodiment.

図１に示すテレビジョン受像機１００は、地上デジタル放送やＢＳ／ＣＳデジタル放送等の各種映像コンテンツを、対応するアンテナおよびチューナによって受信し、受信した映像コンテンツを再生することによって、これら各種映像コンテンツを視聴者に視聴させることができる。 The television receiver 100 shown in FIG. 1 receives various video contents such as terrestrial digital broadcasts and BS / CS digital broadcasts by a corresponding antenna and tuner, and reproduces the received video contents. Can be viewed by the viewer.

特に、テレビジョン受像機１００は、複数の外部スピーカ１５０を接続することで、マルチチャンネル音響システムを構築して、複数の外部スピーカ１５０から、映像コンテンツの音声をマルチチャンネル出力することが可能となっている。 In particular, the television receiver 100 can connect a plurality of external speakers 150 to construct a multi-channel sound system and output the audio of the video content from the plurality of external speakers 150 in a multi-channel manner. ing.

（テレビジョン受像機１００の構成）
図１に示すように、テレビジョン受像機１００は、チューナユニット１０２、ディスプレイ駆動回路１０４、ディスプレイ１０６、音声出力回路１０８、外部機器インタフェース１１０、通信インタフェース１１２、音声制御回路１２０、およびマイク１２２を備えている。 (Configuration of television receiver 100)
As shown in FIG. 1, the television receiver 100 includes a tuner unit 102, a display driving circuit 104, a display 106, an audio output circuit 108, an external device interface 110, a communication interface 112, an audio control circuit 120, and a microphone 122. ing.

（チューナユニット）
チューナユニット１０２は、チューナ、デモジュレータ、デマルチプレクサ、ビデオデコーダ、オーディオデコーダ等を有して構成されている。チューナは、地上デジタル放送やＢＳ／ＣＳデジタル放送等の各種映像コンテンツの放送波を、外部アンテナを介して受信する。デモジュレータは、チューナによって受信された放送波を復調することによって映像コンテンツのＴＳ（Transport Stream）を得る。デマルチプレクサは、デモジュレータによって得られたＴＳから映像コンテンツの映像信号および音声信号を分離する。 (Tuner unit)
The tuner unit 102 includes a tuner, a demodulator, a demultiplexer, a video decoder, an audio decoder, and the like. The tuner receives broadcast waves of various video contents such as terrestrial digital broadcast and BS / CS digital broadcast via an external antenna. The demodulator obtains a TS (Transport Stream) of the video content by demodulating the broadcast wave received by the tuner. The demultiplexer separates the video signal and the audio signal of the video content from the TS obtained by the demodulator.

そして、デマルチプレクサによってＴＳから分離された映像信号（圧縮ビデオデータ）は、ビデオデコーダによってデコードされ、デジタルビデオデータとなって、ディスプレイ駆動回路１０４へ供給される。一方、デマルチプレクサによってＴＳから分離された音声信号（圧縮音声データ）は、オーディオデコーダによってデコードされ、デジタルオーディオデータとなって、音声出力回路１０８へ供給される。 Then, the video signal (compressed video data) separated from the TS by the demultiplexer is decoded by the video decoder, converted into digital video data, and supplied to the display driving circuit 104. On the other hand, the audio signal (compressed audio data) separated from the TS by the demultiplexer is decoded by the audio decoder and is supplied to the audio output circuit 108 as digital audio data.

（外部機器インタフェース）
外部機器インタフェース１１０は、外部機器へのアクセスを制御する。例えば、テレビジョン受像機１００は、ＢＤレコーダやＨＤＤレコーダ等の外部機器を接続することにより、これらの外部機器によって再生された映像コンテンツの映像をディスプレイ１０６に表示したり、その音声を外部スピーカ１５０から出力したりすることができる。その際、レコーダへのアクセスは、この外部機器インタフェース１１０によって制御されることとなる。 (External device interface)
The external device interface 110 controls access to the external device. For example, the television receiver 100 is connected to an external device such as a BD recorder or an HDD recorder, so that the video of the video content reproduced by the external device is displayed on the display 106, or the sound is displayed on the external speaker 150. Can be output. At that time, access to the recorder is controlled by the external device interface 110.

（通信インタフェース）
通信インタフェース１１２は、ＬＡＮやインターネット等の通信ネットワークへのアクセスを制御する。例えば、テレビジョン受像機１００は、インターネットに接続することにより、インターネットからダウンロードした映像コンテンツの映像をディスプレイ１０６に表示したり、その音声を外部スピーカ１５０から出力したりすることができる。その際、インターネットへのアクセスは、この通信インタフェース１１２によって制御されることとなる。 (Communication interface)
The communication interface 112 controls access to a communication network such as a LAN or the Internet. For example, the television receiver 100 can display a video of video content downloaded from the Internet on the display 106 or output the sound from the external speaker 150 by connecting to the Internet. At that time, access to the Internet is controlled by the communication interface 112.

（ディスプレイ駆動回路、ディスプレイ）
ディスプレイ駆動回路１０４は、供給された映像信号に応じてディスプレイ１０６を駆動することにより、映像コンテンツを構成する映像を、ディスプレイ１０６に表示させる。本実施形態のテレビジョン受像機１００は、ディスプレイ１０６として、液晶ディスプレイを採用しているが、これ以外にも、有機ＥＬディスプレイ、プラズマディスプレイ、ブラウン管ディスプレイ等が採用され得る。 (Display drive circuit, display)
The display drive circuit 104 drives the display 106 in accordance with the supplied video signal, thereby causing the display 106 to display the video constituting the video content. The television receiver 100 of the present embodiment employs a liquid crystal display as the display 106, but other than this, an organic EL display, a plasma display, a cathode ray tube display, or the like may be employed.

（音声出力回路、スピーカ）
音声出力回路１０８は、供給された音声信号に応じて外部スピーカ１５０を駆動することにより、映像コンテンツを構成する音声を外部スピーカ１５０から出力させる。テレビジョン受像機１００には、いくつかのスピーカが内蔵されているだけでなく、複数の外部スピーカ１５０を接続することが可能となっている。テレビジョン受像機１００に接続する外部スピーカ１５０の数は、ユーザによって任意に変更され得る。 (Audio output circuit, speaker)
The audio output circuit 108 drives the external speaker 150 in accordance with the supplied audio signal, thereby outputting the audio constituting the video content from the external speaker 150. The television receiver 100 has not only a few built-in speakers but also a plurality of external speakers 150 that can be connected. The number of external speakers 150 connected to the television receiver 100 can be arbitrarily changed by the user.

例えば、テレビジョン受像機１００は、２２．２チャンネルのマルチチャンネル音響システムを構築することができるように、２４個の出力チャンネルを備えており、各出力チャンネルに対して、外部スピーカ１５０を接続することが可能となっている。すなわち、テレビジョン受像機１００は、２４個の外部スピーカ１５０を接続することが可能となっている。 For example, the television receiver 100 includes 24 output channels so that a 22.2 channel multi-channel sound system can be constructed, and an external speaker 150 is connected to each output channel. It is possible. That is, the television receiver 100 can connect 24 external speakers 150.

例えば、当該テレビジョン受像機１００に２４個の外部スピーカ１５０が接続された場合、これに応じて、音声出力回路１０８は、映像コンテンツを構成する音声を、当該２４個の外部スピーカ１５０から出力させることが可能となっている。また、外部スピーカ１５０は、テレビに内蔵されているスピーカであっても良い。 For example, when 24 external speakers 150 are connected to the television receiver 100, in response to this, the audio output circuit 108 causes the audio constituting the video content to be output from the 24 external speakers 150. It is possible. The external speaker 150 may be a speaker built in the television.

しかしながら、設置スペースや設置コスト等の制約により、テレビジョン受像機１００に対して、２４個の外部スピーカ１５０を接続することができず、より少ない数の外部スピーカ１５０が接続される場合がある。例えば、テレビジョン受像機１００に内蔵の２チャンネルのスピーカのみの場合や、５.１チャンネルのシアターシステムを外部に接続する場合などもある。 However, due to restrictions such as installation space and installation cost, 24 external speakers 150 cannot be connected to the television receiver 100, and a smaller number of external speakers 150 may be connected. For example, there may be a case where only a two-channel speaker built in the television receiver 100 is connected, or a case where a 5.1-channel theater system is connected to the outside.

（音声制御回路）
音声制御回路１２０は、テレビジョン受像機１００に接続された外部スピーカ１５０の数および位置に応じて、各外部スピーカ１５０に供給される音声信号を制御する。 (Voice control circuit)
The audio control circuit 120 controls the audio signal supplied to each external speaker 150 according to the number and position of the external speakers 150 connected to the television receiver 100.

例えば、音声制御回路１２０は、当該音声制御回路１２０に入力された音声信号（以下、「入力音声信号」と示す）のチャンネル数（ｍとする）が、上記外部スピーカ１５０の数（ｎとする）よりも多い場合、上記ｍチャンネルの入力音声信号をダウンミックスすることにより、ｎチャンネルの音声信号（以下、「出力音声信号」と示す）を生成する。 For example, in the audio control circuit 120, the number of channels (m) of the audio signal (hereinafter referred to as “input audio signal”) input to the audio control circuit 120 is the number (n) of the external speakers 150. ), The n-channel audio signal (hereinafter referred to as “output audio signal”) is generated by downmixing the m-channel input audio signal.

そして、音声制御回路１２０は、生成されたｎチャンネルの出力音声信号を、音声出力回路１０８へ供給する。これにより、ｎ個の外部スピーカ１５０の各々に対して、対応するチャンネルの音声信号が供給され、各外部スピーカ１５０から、対応するチャンネルの音声が出力されることとなる。 Then, the audio control circuit 120 supplies the generated n-channel output audio signal to the audio output circuit 108. As a result, the audio signal of the corresponding channel is supplied to each of the n external speakers 150, and the audio of the corresponding channel is output from each external speaker 150.

例えば、音声制御回路１２０は、当該音声制御回路１２０に２２．２チャンネルの入力音声信号が入力されたにも関わらず、テレビジョン受像機１００に対して外部スピーカ１５０が２つしか接続されていない場合、２２．２チャンネルの入力音声信号をダウンミックスすることにより、２チャンネルの出力音声信号を生成する。これにより、２つの外部スピーカ１５０各々からは、当該外部スピーカ１５０に対応する複数のチャンネルの音声が出力されることとなる。 For example, the audio control circuit 120 has only two external speakers 150 connected to the television receiver 100 even though 22.2 channel input audio signals have been input to the audio control circuit 120. In this case, the output audio signal of 2 channels is generated by downmixing the input audio signal of 22.2 channels. As a result, audio from a plurality of channels corresponding to the external speaker 150 is output from each of the two external speakers 150.

このように、本実施形態のテレビジョン受像機１００は、入力音声信号のチャンネル数に対する外部スピーカ１５０の数が少ない場合であっても、これら複数の外部スピーカ１５０から複数のチャンネルの音声を出力することによって、良好な音響効果を得ることができる。 As described above, the television receiver 100 according to the present embodiment outputs audio of a plurality of channels from the plurality of external speakers 150 even when the number of external speakers 150 is small with respect to the number of channels of the input audio signal. Thus, a good acoustic effect can be obtained.

（マイク１２２）
マイク１２２には、テレビジョン受像機１００周辺の音声が入力される。後述するように、テレビジョン受像機１００は、各外部スピーカ１５０から出力され、且つこのマイク１２２から入力された音声に基づいて、各外部スピーカ１５０の位置を検出することが可能となっている。なお、マイク１２２は、テレビジョン受像機１００に内蔵されたものであってもよく、テレビジョン受像機１００に外部接続されたものであってもよい。 (Microphone 122)
Sound around the television receiver 100 is input to the microphone 122. As will be described later, the television receiver 100 can detect the position of each external speaker 150 based on the sound output from each external speaker 150 and input from the microphone 122. The microphone 122 may be built in the television receiver 100 or may be externally connected to the television receiver 100.

（音声制御回路の構成）
次に、図２を参照して、本実施形態のテレビジョン受像機１００が備える音声制御回路１２０の構成について具体的に説明する。図２は、本実施形態のテレビジョン受像機１００が備える音声制御回路１２０の構成を示すブロック図である。 (Configuration of voice control circuit)
Next, with reference to FIG. 2, the structure of the audio | voice control circuit 120 with which the television receiver 100 of this embodiment is provided is demonstrated concretely. FIG. 2 is a block diagram illustrating a configuration of the audio control circuit 120 included in the television receiver 100 of the present embodiment.

図２に示すように、音声制御回路１２０は、記憶部２００、検出部２０２、主チャンネル決定部２０４、副チャンネル決定部２０８、出力音声信号生成部２１０、および音声信号出力部２１２を備える。 As shown in FIG. 2, the audio control circuit 120 includes a storage unit 200, a detection unit 202, a main channel determination unit 204, a sub channel determination unit 208, an output audio signal generation unit 210, and an audio signal output unit 212.

（記憶部２００）
記憶部２００は、ベクトルデータベースおよび頭部伝達関数（ＨＲＴＦ： Head Related Transfer Function）を記憶する。ベクトルデータベースには、マルチチャンネル音響システムのチャンネル毎に、３次元空間における音声出力位置が定義されている。各音声出力位置は、良好な音響効果（サラウンド効果）を得るために、その位置での音声出力が推奨されている位置である。 (Storage unit 200)
The storage unit 200 stores a vector database and a head related transfer function (HRTF). In the vector database, an audio output position in a three-dimensional space is defined for each channel of the multi-channel acoustic system. Each sound output position is a position where sound output at that position is recommended in order to obtain a good acoustic effect (surround effect).

例えば、本実施形態のテレビジョン受像機１００は、２４個の外部スピーカ１５０を接続することにより、２２．２チャンネルの音響システムを構築することが可能となっている。これに応じて、ベクトルデータベースには、２２．２チャンネルを構成する各チャンネルの音声出力位置（すなわち、２４箇所の音声出力位置）が定義されている。 For example, the television receiver 100 according to the present embodiment can construct a 22.2 channel sound system by connecting 24 external speakers 150. Correspondingly, the vector database defines the audio output positions (that is, 24 audio output positions) of each channel constituting 22.2 channels.

頭部伝達関数は、音声が出力されてから、当該音声が視聴者の頭部を経由して視聴者の鼓膜に至るまでの音声の伝達特性を示すものであり、音声の周波数特性を相対音圧レベルで示したものである。人間の頭部の構造は、方向毎に異なるため、音声が出力された位置の方向によって、その音声が鼓膜に到達した時の周波数特性が異なる。この周波数特性の違いにより、人間は、音声が出力された位置の方向を特定することができるようになっている。この原理を利用して、頭部伝達関数においては、視聴者を基準とする音声の出力位置がある方向毎に、異なる周波数特性が定義されている。したがって、この頭部伝達関数に基づいて、音声の周波数特性を制御することにより、その音声の仮想的な出力位置がある方向を自在に調整することが可能となる。 The head-related transfer function indicates the transfer characteristic of the sound from when the sound is output until the sound passes through the viewer's head and reaches the viewer's eardrum. This is indicated by the pressure level. Since the structure of the human head differs depending on the direction, the frequency characteristics when the sound reaches the eardrum differ depending on the direction of the position where the sound is output. Due to the difference in frequency characteristics, a human can specify the direction of the position where the sound is output. Using this principle, in the head-related transfer function, different frequency characteristics are defined for each direction in which the audio output position is based on the viewer. Therefore, by controlling the frequency characteristics of the sound based on this head-related transfer function, the direction in which the virtual output position of the sound is located can be freely adjusted.

特に、本実施形態のテレビジョン受像機１００においては、記憶部２００には、３次元空間における水平方向（前後方向および左右方向）の頭部伝達関数と、３次元空間における垂直方向（上下方向）の頭部伝達関数とが、記憶されている。テレビジョン受像機１００は、この２つの頭部伝達関数を用いることにより、３次元空間におけるいずれの方向にも、音声の仮想的な出力位置がある方向を自在に調整することが可能となっている。 In particular, in the television receiver 100 of the present embodiment, the storage unit 200 includes a head-related transfer function in the horizontal direction (front-rear direction and left-right direction) in a three-dimensional space and a vertical direction (up-down direction) in a three-dimensional space. Are stored. By using these two head-related transfer functions, the television receiver 100 can freely adjust the direction in which the virtual output position of the sound is in any direction in the three-dimensional space. Yes.

（検出部２０２）
検出部２０２は、利用可能な出力チャンネル毎に、外部スピーカ１５０の位置を検出する。ここで、「利用可能な出力チャンネル」とは、外部スピーカ１５０が接続されている出力チャンネルのことを意味するが、これに限らず、ユーザによって利用することが選択された出力チャンネル等であってもよい。 (Detector 202)
The detection unit 202 detects the position of the external speaker 150 for each available output channel. Here, “available output channel” means an output channel to which the external speaker 150 is connected, but is not limited to this, and is an output channel or the like selected by the user. Also good.

例えば、検出部２０２は、上記出力チャンネル毎に、以下の検出処理（１−１）〜（１−５）を行うことにより、当該出力チャンネルに接続されている外部スピーカ１５０の位置を検出する。例えば、テレビジョン受像機１００には、２４個の外部スピーカ１５０を接続することが可能となっているが、２個の外部スピーカ１５０しか接続されていない場合には、検出部２０２は、この２個のスピーカ１５０の各位置を検出することになる。 For example, the detection unit 202 detects the position of the external speaker 150 connected to the output channel by performing the following detection processes (1-1) to (1-5) for each output channel. For example, it is possible to connect 24 external speakers 150 to the television receiver 100, but when only two external speakers 150 are connected, the detection unit 202 detects the two external speakers 150. Each position of the single speaker 150 is detected.

（１−１）検出部２０２は、位置検出用の音声信号をメモリ等の記録媒体から読み出し、当該音声信号を、出力チャンネルを指定して、音声信号出力部２１２へ供給する。この音声信号は、インパルスまたは正弦波等であることが好ましい。 (1-1) The detection unit 202 reads an audio signal for position detection from a recording medium such as a memory, and supplies the audio signal to the audio signal output unit 212 by designating an output channel. The audio signal is preferably an impulse or a sine wave.

（１−２）これにより、当該音声信号は、音声出力回路１０８へ供給され、当該音声信号が、指定された出力チャンネルから出力される。これにより、当該音声信号に基づく音声が、指定された出力チャンネルに接続されている外部スピーカ１５０から出力される。 (1-2) Thereby, the audio signal is supplied to the audio output circuit 108, and the audio signal is output from the designated output channel. Thereby, the sound based on the sound signal is output from the external speaker 150 connected to the designated output channel.

（１−３）検出部２０２は、外部スピーカ１５０から出力された音声を、マイク１２２によって集音する。このとき、検出部２０２は、スピーカの空間位置情報を得るために少なくとも３つ以上のマイク１２２を用いることがより好ましい。 (1-3) The detection unit 202 collects sound output from the external speaker 150 by the microphone 122. At this time, it is more preferable that the detection unit 202 uses at least three or more microphones 122 in order to obtain speaker spatial position information.

（１−４）検出部２０２は、マイク１２２毎に、上記音声についての、外部スピーカ１５０からの出力時刻とマイク１２２からの入力時刻との差、または、外部スピーカ１５０からの出力レベルとマイク１２２からの入力レベルとの差、の少なくともいずれか一方に基づいて、外部スピーカ１５０から当該マイク１２２までの距離を検出する。 (1-4) For each microphone 122, the detection unit 202, for each microphone 122, the difference between the output time from the external speaker 150 and the input time from the microphone 122, or the output level from the external speaker 150 and the microphone 122. The distance from the external speaker 150 to the microphone 122 is detected on the basis of at least one of the difference from the input level from.

（１−５）検出部２０２は、予め定められた基準位置（例えば、ディスプレイ１０６の中央位置）に対する各マイク１２２の位置と、外部スピーカ１５０から各マイク１２２までの距離とに基づいて、上記基準位置に対する外部スピーカ１５０の空間座標位置を特定する。 (1-5) The detection unit 202 determines the reference based on the position of each microphone 122 with respect to a predetermined reference position (for example, the center position of the display 106) and the distance from the external speaker 150 to each microphone 122. The spatial coordinate position of the external speaker 150 with respect to the position is specified.

（主チャンネル決定部２０４）
主チャンネル決定部２０４は、利用可能な出力チャンネル毎に、当該出力チャンネルに接続されている外部スピーカ１５０の空間座標位置（すなわち、検出部２０２によって検出された位置）に応じて、当該出力チャンネルに割り当てる主チャンネルを決定する。 (Main channel determination unit 204)
For each available output channel, the main channel determination unit 204 sets the output channel according to the spatial coordinate position of the external speaker 150 connected to the output channel (that is, the position detected by the detection unit 202). Determine the main channel to assign.

テレビジョン受像機１００においては、記憶部２００に記憶されているベクトルデータベースにおいて、複数のチャンネルの各々に対し、３次元空間における複数の音声出力位置が予め定められている。そこで、主チャンネル決定部２０４は、出力チャンネル毎に、当該出力チャンネルに接続されている外部スピーカ１５０の位置と、ベクトルデータベースにおいて定められている複数のチャンネルの各々の音声出力位置とに基づいて、上記複数のチャンネルの中から、当該出力チャンネルに割り当てる主チャンネルを決定する。 In the television receiver 100, in the vector database stored in the storage unit 200, a plurality of audio output positions in a three-dimensional space are predetermined for each of a plurality of channels. Therefore, the main channel determination unit 204, for each output channel, based on the position of the external speaker 150 connected to the output channel and the audio output position of each of the plurality of channels defined in the vector database, A main channel to be assigned to the output channel is determined from the plurality of channels.

例えば、主チャンネル決定部２０４は、その音声出力位置の基準位置（視聴者の位置）からの方向（第２の方向）が、基準位置（視聴者の位置）からの外部スピーカ１５０の位置の方向（第１の方向）に最も近似するチャンネルを、当該出力チャンネルに割り当てるる主チャンネルとして決定する。 For example, the main channel determination unit 204 determines that the direction (second direction) of the audio output position from the reference position (viewer position) is the position of the external speaker 150 from the reference position (viewer position). The channel closest to (first direction) is determined as the main channel to be assigned to the output channel.

例えば、テレビジョン受像機１００に内蔵するスピーカを利用する場合、チャンネルＦＬｃ，チャンネルＦＲｃ（図５参照）が、主チャンネルとして割り当てられ得る。 For example, when a speaker built in the television receiver 100 is used, the channel FLc and the channel FRc (see FIG. 5) can be assigned as main channels.

また、５チャンネルのマルチチャンネル音響方式に準拠して、５つの外部スピーカを床面に配置して利用する場合、チャンネルＦＬ，ＦＲ，ＦＣ，ＢＬ，ＢＲ（図５参照）が、主チャンネルとして割り当てられ得る。 Also, in accordance with the 5-channel multi-channel sound system, when five external speakers are arranged on the floor and used, channels FL, FR, FC, BL, BR (see FIG. 5) are assigned as main channels. Can be.

また、５チャンネルのマルチチャンネル音響方式に準拠して、５つの外部スピーカを天井から吊り下げて利用する場合、チャンネルＴｐＦＬ，ＴｐＦＲ，ＴｐＦＣ，ＴｐＢＬ，ＴｐＢＲ（図５参照）が、主チャンネルとして割り当てられ得る。 Further, in accordance with the 5-channel multi-channel sound system, when five external speakers are suspended from the ceiling, channels TpFL, TpFR, TpFC, TpBL, and TpBR (see FIG. 5) are assigned as main channels. obtain.

（副チャンネル決定部２０８）
副チャンネル決定部２０８は、出力チャンネル毎に、上記複数のチャンネル（すなわち、ベクトルデータベースに定義されている複数のチャンネル）の中から、当該出力チャンネルに割り当てる副チャンネルを決定する。 (Sub-channel determination unit 208)
The subchannel determination unit 208 determines, for each output channel, a subchannel to be assigned to the output channel from among the plurality of channels (that is, a plurality of channels defined in the vector database).

本実施形態のテレビジョン受像機１００においては、上記複数のチャンネルは、予め領域毎にグループ化されている。特に、本実施形態のテレビジョン受像機１００においては、上記グループ化は、主チャンネルの組み合わせ毎に定義されている。この定義付けは、例えば、テレビジョン受像機１００が備えるメモリ等（例えば、記憶部２００）に記憶されている。そして、副チャンネル決定部２０８は、出力チャンネル毎に、当該出力チャンネルに割り当てる主チャンネルが決定されたことに応じて、当該主チャンネルに対応する上記グループに含まれているチャンネルを、当該出力チャンネルに割り当てる副チャンネルとして決定する。 In the television receiver 100 of the present embodiment, the plurality of channels are grouped in advance for each area. In particular, in the television receiver 100 of the present embodiment, the grouping is defined for each combination of main channels. This definition is stored in, for example, a memory (for example, the storage unit 200) included in the television receiver 100. Then, in response to the determination of the main channel to be assigned to the output channel for each output channel, the sub-channel determination unit 208 sets the channel included in the group corresponding to the main channel as the output channel. Decide as subchannel to assign.

例えば、図５に示すチャンネルＦＬ（フロント左側）とチャンネルＦＲ（フロント右側）とが主チャンネルとして決定された場合には、副チャンネル決定部２０８は、ＦＬを主チャンネルとする出力チャンネルには、３次元空間における左側（図５に示すｘ軸負方向側）の領域のチャンネル群と、３次元空間における中央部（すなわち、ｘ座標が０）の領域のチャンネル群とを、当該出力チャンネルに割り当てる副チャンネルとして決定し、ＦＲを主チャンネルとする出力チャンネルには、３次元空間における右側（図５に示すｘ軸正方向側）の領域のチャンネル群と、３次元空間における中央部（すなわち、ｘ座標が０）の領域のチャンネル群とを、当該出力チャンネルに割り当てる副チャンネルとして決定する、といった具合である。 For example, when the channel FL (front left side) and the channel FR (front right side) shown in FIG. 5 are determined as main channels, the sub-channel determination unit 208 sets 3 as the output channel having FL as the main channel. A channel group in the region on the left side (in the negative x-axis direction shown in FIG. 5) in the three-dimensional space and a channel group in the central region (that is, the x coordinate is 0) in the three-dimensional space are assigned to the output channel. The output channel determined as a channel and having FR as the main channel includes a group of channels on the right side in the three-dimensional space (the x-axis positive direction side in FIG. 5) and the central portion in the three-dimensional space (ie, the x coordinate) Is determined as a sub-channel to be assigned to the output channel.

なお、上記例における、上記中央部の領域のチャンネル群のように、その音声出力位置が、複数のスピーカの位置の中間位置となり得るチャンネルは、複数のグループに属し得る。このようなチャンネルの音声は、双方スピーカから出力することが好ましい場合があるからである。 Note that, like the channel group in the central area in the above example, channels whose audio output positions can be intermediate positions of the positions of a plurality of speakers can belong to a plurality of groups. This is because it may be preferable to output the sound of such a channel from both speakers.

また、上記定義付けにおいて、全てのチャンネルが必ずしもいずれかのグループに属している必要はなく、いずれのグループにも属さないチャンネル（すなわち、いずれのスピーカからもその音声が出力されないチャンネル）が存在してもよい。スピーカの配置によっては、そのチャンネルの音声を出力しないほうが、自然なサラウンド効果を得られる場合があるからである。 Also, in the above definition, not all channels need to belong to any group, and there are channels that do not belong to any group (that is, channels whose sound is not output from any speaker). May be. This is because, depending on the arrangement of the speakers, it may be possible to obtain a natural surround effect without outputting the sound of that channel.

（出力音声信号生成部２１０）
出力音声信号生成部２１０は、出力チャンネル毎に、当該出力チャンネルに割り当てた主チャンネル（すなわち、主チャンネル決定部２０４によって決定された主チャンネル）の音声信号と、当該出力チャンネルに割り当てた副チャンネル（すなわち、副チャンネル決定部２０８によって決定された副チャンネル）の音声信号とを用いて、当該出力チャンネルを介して出力する出力音声信号を生成する。 (Output audio signal generator 210)
For each output channel, the output audio signal generation unit 210 outputs the audio signal of the main channel assigned to the output channel (that is, the main channel determined by the main channel determination unit 204) and the sub-channel assigned to the output channel ( That is, an output audio signal to be output through the output channel is generated using the audio signal of the subchannel determined by the subchannel determination unit 208.

例えば、出力音声信号生成部２１０は、出力チャンネル毎に、当該出力チャンネルに割り当てた主チャンネルの音声信号と、当該出力チャンネルに割り当てた副チャンネルの信号とをダウンミックスすることにより、出力音声信号を生成する。 For example, for each output channel, the output audio signal generation unit 210 down-mixes the audio signal of the main channel assigned to the output channel and the signal of the sub channel assigned to the output channel, thereby converting the output audio signal. Generate.

特に、出力音声信号生成部２１０は、出力チャンネル毎に、当該出力チャンネルに割り当てた副チャンネルの音声信号が表す音声が、該副チャンネルに対応する音声出力位置に定位するように、当該出力チャンネルに割り当てた副チャンネルの音声信号の周波数特性を、頭部伝達関数を用いて補正した後、当該出力チャンネルに割り当てた主チャンネルの音声信号と、当該出力チャンネルに割り当てた副チャンネルの信号とをダウンミックスすることにより、出力音声信号を生成することも可能である。 In particular, for each output channel, the output audio signal generation unit 210 sets the output channel so that the audio represented by the audio signal of the sub channel assigned to the output channel is localized at the audio output position corresponding to the sub channel. After correcting the frequency characteristics of the assigned sub-channel audio signal using the head-related transfer function, the main channel audio signal assigned to the output channel and the sub-channel signal assigned to the output channel are downmixed. By doing so, it is also possible to generate an output audio signal.

（音声信号出力部２１２）
音声信号出力部２１２は、出力音声信号生成部２１０によって生成された、各チャンネルの出力音声信号を、音声出力回路１０８へ出力する。 (Audio signal output unit 212)
The audio signal output unit 212 outputs the output audio signal of each channel generated by the output audio signal generation unit 210 to the audio output circuit 108.

（音声制御処理の手順）
次に、図３を参照して、音声制御回路１２０による音声制御処理の手順について説明する。図３は、実施形態に係る音声制御回路１２０による音声制御処理の手順を示すフローチャートである。 (Voice control processing procedure)
Next, with reference to FIG. 3, the procedure of the voice control process by the voice control circuit 120 will be described. FIG. 3 is a flowchart illustrating the procedure of the voice control process by the voice control circuit 120 according to the embodiment.

以下では、テレビジョン受像機１００において、すでに複数の外部スピーカ１５０が接続されている出力チャンネルが、音声制御回路１２０によって認識されているものとして、当該音声制御回路１２０による音声制御処理を説明する。 Hereinafter, in the television receiver 100, the audio control process by the audio control circuit 120 will be described on the assumption that the output channel to which the plurality of external speakers 150 are already connected is recognized by the audio control circuit 120.

まず、検出部２０２が、利用可能な出力チャンネル毎に、当該出力チャンネルに接続されている外部スピーカ１５０の位置を検出する（ステップＳ３０２）。 First, the detection unit 202 detects the position of the external speaker 150 connected to the output channel for each available output channel (step S302).

次に、主チャンネル決定部２０４が、上記出力チャンネル毎に、ステップＳ３０２で検出された外部スピーカ１５０の位置と、ベクトルデータベースにおいて定義されている、複数のチャンネルの各々の音声出力位置とに基づいて、上記複数のチャンネルの中から、当該出力チャンネルに割り当てる主チャンネルを決定する（ステップＳ３０４）。 Next, the main channel determination unit 204 determines, for each output channel, the position of the external speaker 150 detected in step S302 and the audio output positions of each of the plurality of channels defined in the vector database. The main channel to be assigned to the output channel is determined from the plurality of channels (step S304).

次に、副チャンネル決定部２０８が、上記出力チャンネル毎に、ステップＳ３０４で主チャンネルが決定されたことに応じて、当該出力チャンネルに割り当てる副チャンネルを決定する（ステップＳ３０６）。 Next, in response to the determination of the main channel in step S304 for each output channel, the subchannel determination unit 208 determines a subchannel to be assigned to the output channel (step S306).

その後、入力音声信号が音声制御回路１２０に入力されると（ステップＳ３０８）、出力音声信号生成部２１０は、出力チャンネル毎に、ステップＳ３０４で割り当てた主チャンネルの音声信号と、ステップＳ３０６で割り当てた副チャンネルの音声信号とを用いて、出力音声信号を生成する（ステップＳ３１０）。 Thereafter, when the input sound signal is input to the sound control circuit 120 (step S308), the output sound signal generation unit 210 assigns the sound signal of the main channel assigned in step S304 and the sound signal assigned in step S306 for each output channel. An output audio signal is generated using the sub-channel audio signal (step S310).

そして、音声信号出力部２１２が、ステップＳ３１０で生成された各出力音声信号を、音声出力回路１０８へ出力して（ステップＳ３１２）、音声制御回路１２０は、音声制御処理を終了する。 Then, the audio signal output unit 212 outputs each output audio signal generated in step S310 to the audio output circuit 108 (step S312), and the audio control circuit 120 ends the audio control process.

なお、テレビジョン受像機１００は、ステップＳ３０４で割り当てた主チャンネルを特定するための情報、およびステップＳ３０６で割り当てた副チャンネルを特定するための情報を、当該テレビジョン受像機１００が備えるメモリ等に記憶させておいてもよい。これにより、次回以降の音声制御処理において、ステップＳ３０２〜Ｓ３０６の処理を省略することができる。そして、テレビジョン受像機１００は、スピーカの構成が変更された場合、ステップＳ３０２〜Ｓ３０６の処理を行うことで、主チャンネルおよび副チャンネルを改めて決定するとよい。 The television receiver 100 stores the information for specifying the main channel assigned in step S304 and the information for specifying the subchannel assigned in step S306 in a memory or the like included in the television receiver 100. It may be memorized. Thereby, the process of step S302-S306 can be abbreviate | omitted in the audio | voice control process after the next time. Then, when the configuration of the speaker is changed, the television receiver 100 may determine the main channel and the sub channel anew by performing the processes of steps S302 to S306.

（ベクトルデータベースおよび頭部伝達関数の一例）
ここで、図４および図５を参照してベクトルデータベースの一例について説明する。図４は、記憶部２００に記憶されているベクトルデータベースの一例を示す。図５は、図４に示すベクトルデータベースに定義されている各音声出力位置を示す模式図である。 (Example of vector database and head-related transfer function)
Here, an example of the vector database will be described with reference to FIG. 4 and FIG. FIG. 4 shows an example of a vector database stored in the storage unit 200. FIG. 5 is a schematic diagram showing each audio output position defined in the vector database shown in FIG.

図４に示す例では、ベクトルデータベースには、２２．２チャンネルの各々の音声出力位置が定義されている。この２２．２チャンネルの音響システムは、３次元空間における３つのチャンネル層（上層９チャンネル、中層１０チャンネル、下層３チャンネル）を含んでおり、さらに、ＬＦＥ（Low Frequency Effects：低域効果）用の２つのチャンネルを含んで構成されている。これに応じて、ベクトルデータベースには、２４個の音声出力位置が定義されている。 In the example shown in FIG. 4, the audio output positions of 22.2 channels are defined in the vector database. This 22.2 channel acoustic system includes three channel layers (upper layer 9 channels, middle layer 10 channels, lower layer 3 channels) in a three-dimensional space, and for LFE (Low Frequency Effects). It is configured to include two channels. Accordingly, 24 voice output positions are defined in the vector database.

ベクトルデータベースにおいて、各音声出力位置に対しては、その位置に応じた識別子の組み合わせからなる名称が付与されている。具体的には、上記識別子として、「Ｌ」，「Ｒ」，「Ｔｐ」，「Ｂｔ」，「Ｓｉ」，「Ｂ」，「Ｆ」，「Ｃ」が用いられており、順に、左側，右側，上側，下側，側部，後側，前側，中央を意味する。本実施形態では、チャンネルの名称とその音声出力位置の名称とを同名としている。例えば、フロント右上部のチャンネルおよびその音声出力位置には、「ＴｐＦＲ」という名称が付与されている。 In the vector database, each voice output position is given a name composed of a combination of identifiers corresponding to the position. Specifically, “L”, “R”, “Tp”, “Bt”, “Si”, “B”, “F”, “C” are used as the identifiers, It means right side, upper side, lower side, side, rear side, front side, and center. In this embodiment, the name of the channel and the name of the audio output position are the same. For example, the name “TpFR” is assigned to the channel in the upper right part of the front and its audio output position.

また、ベクトルデータベースにおいて、各音声出力位置は、空間座標によって定義されている。また、各音声出力位置は、原点（０，０，０）からのベクトルで表すこともできる。この例では、ディスプレイ１０６における表示面の中心位置（音声出力位置ＦＣ）が、原点に設定されている。例えば、音声出力位置ＢＬは、ベクトルＶ_{ＦＣ−ＢＬ}と表すことができる。また、音声出力位置ＢＲは、ベクトルＶ_{ＦＣ−ＢＲ}と表すことができる。 In the vector database, each audio output position is defined by spatial coordinates. Each audio output position can also be represented by a vector from the origin (0, 0, 0). In this example, the center position (audio output position FC) of the display surface of the display 106 is set as the origin. For example, the audio output position BL can be expressed as a vector V _FC-BL . The audio output position BR can be expressed as a vector V _FC-BR .

（主チャンネルおよび副チャンネルの決定方法の一例）
主チャンネル決定部２０４は、このベクトルデータベースに基づいて、出力チャンネル毎に、主チャンネルを決定する。例えば、テレビジョン受像機１００が備える出力チャンネルＡにスピーカＡが接続され、出力チャンネルＢにスピーカＢが接続されているとする。ここで、スピーカＡは、音声出力位置ＦＬの近傍にある位置ｓｐＡに配置され、スピーカＢは、音声出力位置ＦＲの近傍にある位置ｓｐＢに配置されているとする。 (Example of main channel and sub-channel determination method)
The main channel determination unit 204 determines a main channel for each output channel based on this vector database. For example, it is assumed that the speaker A is connected to the output channel A included in the television receiver 100 and the speaker B is connected to the output channel B. Here, it is assumed that the speaker A is disposed at a position spA near the sound output position FL, and the speaker B is disposed at a position spB near the sound output position FR.

この場合、主チャンネル決定部２０４は、スピーカＡの位置ｓｐＡに対応するベクトルＶ_ＳＰＡの方向と、音声出力位置ＦＬに対応するベクトルＶ_ＦＬの方向とが、互いに近似することから、音声出力位置ＦＬ（フロント左側）が設定されているチャンネルＦＬを、出力チャンネルＡに割り当てる主チャンネルとして決定する。 In this case, the main channel determination unit 204 approximates the direction of the vector V _SPA corresponding to the position spA of the speaker A and the direction of the vector V _FL corresponding to the audio output position FL, so that the audio output position FL The channel FL in which (front left side) is set is determined as the main channel assigned to the output channel A.

また、主チャンネル決定部２０４は、スピーカＢの位置ｓｐＢに対応するベクトルＶ_ＳＰＢの方向と、音声出力位置ＦＲに対応するベクトルＶ_ＦＲの方向とが、互いに類似することから、音声出力位置ＦＲ（フロント右側）が設定されているチャンネルＦＲを、出力チャンネルＢに割り当てる主チャンネルとして決定する。 In addition, the main channel determination unit 204 is similar to the direction of the vector V _SPB corresponding to the position spB of the speaker B and the direction of the vector V _FR corresponding to the audio output position FR, so that the audio output position FR ( The channel FR in which the front right side) is set is determined as the main channel assigned to the output channel B.

そして、副チャンネル決定部２０４は、チャンネルＦＬに予め対応付けられているチャンネル（例えば、３次元空間における左側（ｘ軸負方向側）の領域のチャンネル、および、３次元空間における中央部の領域のチャンネル）を、出力チャンネルＡに割り当てる副チャンネルとして決定する。 Then, the sub-channel determination unit 204 preliminarily associates the channel with the channel FL (for example, the channel in the left side (x-axis negative direction side) in the three-dimensional space and the central region in the three-dimensional space. Channel) is determined as a sub-channel to be assigned to output channel A.

また、副チャンネル決定部２０４は、チャンネルＦＲに予め対応付けられているチャンネル（例えば、３次元空間における右側（ｘ軸正方向側）の領域のチャンネル、および、３次元空間における中央部の領域のチャンネル）を、出力チャンネルＢに割り当てる副チャンネルとして決定する。 Further, the sub-channel determination unit 204 preliminarily associates the channel FR with the channel (for example, the channel in the right side (x-axis positive direction side) in the three-dimensional space and the central region in the three-dimensional space. Channel) is determined as a sub-channel to be assigned to the output channel B.

そして、出力音声信号生成部２１０は、出力チャンネルＡに割り当てた主チャンネル（チャンネルＦＬ）の音声信号と、出力チャンネルＡに割り当てた副チャンネルの音声信号とをダウンミックスすることにより、出力チャンネルＡの出力音声信号を生成する。 Then, the output audio signal generation unit 210 downmixes the audio signal of the main channel (channel FL) assigned to the output channel A and the audio signal of the subchannel assigned to the output channel A, so that the output channel A An output audio signal is generated.

同様に、出力音声信号生成部２１０は、出力チャンネルＢに割り当てた主チャンネル（チャンネルＦＲ）の音声信号と、出力チャンネルＢに割り当てた副チャンネルの音声信号とをダウンミックスすることにより、出力チャンネルＢの出力音声信号を生成する。 Similarly, the output audio signal generation unit 210 downmixes the audio signal of the main channel (channel FR) assigned to the output channel B and the audio signal of the sub-channel assigned to the output channel B, so that the output channel B The output audio signal is generated.

（出力音声信号の生成方法の一例）
上記のように、複数のチャンネルの各々について主チャンネルおよび副チャンネルが決定されると、音声信号生成部２１０は、出力チャンネル毎に、当該出力チャンネルに割り当てた主チャンネルの音声信号と、当該出力チャンネルに割り当てた副チャンネルの音声信号とをダウンミックスすることにより、当該出力チャンネルを介して出力する出力音声信号を生成することが可能となる。 (Example of output audio signal generation method)
As described above, when the main channel and the sub channel are determined for each of the plurality of channels, the audio signal generation unit 210, for each output channel, the audio signal of the main channel assigned to the output channel, and the output channel By downmixing the audio signal of the sub-channel assigned to the output channel, it is possible to generate an output audio signal that is output via the output channel.

例えば、上記出力チャンネルＡについて、主チャンネルであるチャンネルＦＬの音声信号をＳ_ＦＬとし、３次元空間の左側の領域に音声出力位置が配置されている副チャンネルであるチャンネルＴｐＦＬ，ＴｐＢＬ，・・・の音声信号をＳ_ＴｐＦＬ，Ｓ_ＴｐＢＬ，・・・とし、３次元空間の中央部の領域に音声出力位置が配置されている副チャンネルであるチャンネルＦＣ，ＢＣ，・・・の音声信号をＳ_ＦＣ，Ｓ_ＢＣ，・・・とした場合、出力音声信号生成部２１０は、以下の数式（数１）を用いて、これらの音声信号をダウンミックスすることにより、当該出力チャンネルＡの出力音声信号Ｓ_ＦＬ’’を生成することができる。 For example, for the output channels A, the audio signal of the channel FL is the main channel and S _FL, 3-dimensional channel audio output position on the left side of the region is sub-channels that are arranged in space TpFL, TpBL, ··· the audio signal _{_S TpFL,} _S _TpBL, and ..., channel FC audio output position is subchannels is arranged in the region of the central portion of the three-dimensional space, BC, an audio signal ... _{S FC} , S _BC ,..., The output audio signal generation unit 210 downmixes these audio signals using the following formula (Equation 1), thereby outputting the output audio signal S of the output channel A. _FL '' can be generated.

Ｓ_ＦＬ’’＝ａ×（Ｓ_ＦＬ＋１／√２Ｓ_ＴｐＦＬ＋１／√２Ｓ_ＴｐＢＬ＋・・・＋１／√２Ｓ_ＦＣ＋１／√２Ｓ_ＢＣ＋・・・）・・・（数１）
同様に、上記出力チャンネルＢについて、主チャンネルであるチャンネルＦＲの音声信号をＳ_ＦＲとし、３次元空間の右側の領域に音声出力位置が配置されている副チャンネルであるチャンネルＴｐＦＲ，ＴｐＢＲ，・・・の音声信号をＳ_ＴｐＦＲ，Ｓ_ＴｐＢＲ，・・・とし、３次元空間の中央部の領域に音声出力位置が配置されている副チャンネルであるチャンネルＦＣ，ＢＣ，・・・の音声信号をＳ_ＦＣ，Ｓ_ＢＣ，・・・とした場合、出力音声信号生成部２１０は、以下の数式（数２）を用いて、これらの音声信号をダウンミックスすることにより、当該出力チャンネルＢの出力音声信号Ｓ_ＦＲ’’を生成することができる。 S _FL ″ = a × (S _FL + 1 / √2S _TpFL + 1 / √2S _TpBL +... + 1 / √2S _FC + 1 / √2S _BC +...) (Equation 1)
Similarly, for the output channel B, and audio signals of the channels FR is the main channel and S _FR, channel TpFR by-channel audio output located on the right side of the region of the three-dimensional space is arranged, TPBR, · · _.. , S _TpFR , S _TpBR ,..., And the audio signals of channels FC, BC,..., _Which are sub-channels whose audio output positions are arranged in the central region of the three-dimensional space. _{In the case of FC} , S _BC ,..., The output audio signal generation unit 210 downmixes these audio signals using the following formula (Equation 2), thereby outputting the output audio signal of the output channel B. S _FR ″ can be generated.

Ｓ_ＦＲ’’＝ａ×（Ｓ_ＦＲ＋１／√２Ｓ_ＴｐＦＲ＋１／√２Ｓ_ＴｐＢＲ＋・・・＋１／√２Ｓ_ＦＣ＋１／√２Ｓ_ＢＣ＋・・・）・・・（数２）
但し、上記数式（数１）および（数２）において、ａは、オーバーフロー（Ｓ_ＦＬ’’およびＳ_ＦＲ’’が１以上となること）を低減するための係数であり、例えば、０〜１の範囲内の値が用いられる。なお、オーバーフローの低減は、各ダウンミックス係数を調整することによって実現されてもよい。 S _FR ″ = a × (S _FR + 1 / √2S _TpFR + 1 / √2S _TpBR +... + 1 / √2S _FC + 1 / √2S _BC +...) (Equation 2)
However, in the above formulas (Equation 1) and (Equation 2), a is a coefficient for reducing overflow (S _FL ″ and S _FR ″ become 1 or more), for example, 0 to 1 A value within the range is used. The overflow reduction may be realized by adjusting each downmix coefficient.

音声信号生成部２１０によって生成された、出力チャンネルＡの出力音声信号Ｓ_ＦＬ’’および出力チャンネルＢの出力音声信号Ｓ_ＦＲ’’は、音声信号出力部２１２によって、音声出力回路１０８へ出力される。これにより、出力チャンネルＡに接続されたスピーカＡから、出力音声信号Ｓ_ＦＬ’’に応じた音声（すなわち、出力チャンネルＡに割り当てた各チャンネルの音声）が出力されるとともに、出力チャンネルＢに接続されたスピーカＢから、出力音声信号Ｓ_ＦＲ’’に応じた音声（すなわち、出力チャンネルＢに割り当てた各チャンネルの音声）が出力される。 The output audio signal S _FL ″ of the output channel A and the output audio signal S _FR ″ of the output channel B generated by the audio signal generation unit 210 are output to the audio output circuit 108 by the audio signal output unit 212. . As a result, audio corresponding to the output audio signal S _FL ″ (that is, audio of each channel assigned to the output channel A) is output from the speaker A connected to the output channel A and connected to the output channel B. The sound corresponding to the output sound signal S _FR ″ (that is, the sound of each channel assigned to the output channel B) is output from the speaker B.

なお、上記例では、副チャンネルに適用するダウンミックス係数として１／√２を用いているが、これに限らない。例えば、上記ダウンミックス係数として、従来のマルチチャンネル音響システムにおいてよく利用されているダウンミックス係数（１／２、１／２√２等）を用いてもよい。また、副チャンネルの音声出力位置が、主チャンネルの音声出力位置から離れているほど（ベクトル方向の角度差が大きいほど）、その副チャンネルの音声信号の出力が弱まるように、その副チャンネルに適用するダウンミックス係数を低めてもよい。また、上記例では、主チャンネルの音声信号にはダウンミックス係数を乗じていないが、主チャンネルの音声信号にダウンミックス係数を乗じてもよい。また、各副チャンネルの音声信号に対し、主チャンネルとの水平方向の位置関係に応じたダウンミックス係数と、主チャンネルとの垂直方向の位置関係に応じたダウンミックス係数とを、それぞれ乗じるようにしてもよい。 In the above example, 1 / √2 is used as the downmix coefficient applied to the sub-channel, but the present invention is not limited to this. For example, as the downmix coefficient, a downmix coefficient (1/2, 1 / 2√2, etc.) often used in a conventional multi-channel sound system may be used. Also, it is applied to the sub channel so that the audio output position of the sub channel becomes weaker as the audio output position of the sub channel is farther from the audio output position of the main channel (the greater the angle difference in the vector direction). The downmix coefficient to be performed may be lowered. In the above example, the audio signal of the main channel is not multiplied by the downmix coefficient, but the audio signal of the main channel may be multiplied by the downmix coefficient. In addition, the audio signal of each sub-channel is multiplied by a downmix coefficient corresponding to the horizontal positional relationship with the main channel and a downmix coefficient corresponding to the vertical positional relationship with the main channel. May be.

（効果）
このように、本実施形態のテレビジョン受像機１００は、利用可能な出力チャンネル毎に、当該出力チャンネルに接続されたスピーカの３次元空間における垂直方向の位置を考慮し、当該出力チャンネルに割り当てる主チャンネルを決定し、当該主チャンネルの音声を主音声としてダウンミックスすることにより、出力音声信号を生成する。 (effect)
As described above, the television receiver 100 according to the present embodiment considers the position of the speaker connected to the output channel in the three-dimensional space in the three-dimensional space for each available output channel, and assigns the main channel to the output channel. An output audio signal is generated by determining a channel and downmixing the audio of the main channel as the main audio.

これにより、本実施形態のテレビジョン受像機１００によれば、スピーカの水平方向および垂直方向の双方の位置に応じて音響効果を制御することができるため、３次元空間において適切な位置（水平方向および垂直方向の双方の適切な位置）にスピーカが配置されていない場合であっても、良好なサラウンド効果を得ることができる。 As a result, according to the television receiver 100 of the present embodiment, the acoustic effect can be controlled according to the position of both the horizontal direction and the vertical direction of the speaker. Even if the speaker is not disposed at an appropriate position in both the vertical direction and the vertical direction, a good surround effect can be obtained.

（頭部伝達関数を用いた処理の一例）
上記例では、頭部伝達関数を用いた処理（音像定位処理）を行わずに、出力音声信号を生成する例を説明したが、出力音声信号生成部２１０は、頭部伝達関数を用いた処理（音像定位処理）を行い、出力音声信号を生成することもできる。以下、図６および図７を参照して、出力音声信号生成部２１０による頭部伝達関数を用いた処理の一例について説明する。 (Example of processing using head-related transfer functions)
In the above example, the example in which the output sound signal is generated without performing the process using the head-related transfer function (sound image localization process) has been described. However, the output sound signal generation unit 210 performs the process using the head-related transfer function. (Sound image localization processing) can be performed to generate an output audio signal. Hereinafter, an example of processing using the head-related transfer function by the output audio signal generation unit 210 will be described with reference to FIGS. 6 and 7.

図６は、記憶部２００に記憶されている頭部伝達関数の一例を示す。図６では、一例として、記憶部２００に記憶されている頭部伝達関数のうち、３次元空間における水平方向（前後方向および左右方向）の頭部伝達関数を示す。実際には、記憶部２００には、３次元空間における垂直方向（上下方向）の頭部伝達関数も記憶されている。 FIG. 6 shows an example of the head-related transfer function stored in the storage unit 200. FIG. 6 shows, as an example, a head-related transfer function in the horizontal direction (front-rear direction and left-right direction) in the three-dimensional space among the head-related transfer functions stored in the storage unit 200. Actually, the storage unit 200 also stores a head-related transfer function in the vertical direction (vertical direction) in the three-dimensional space.

図７は、図４に示すベクトルデータベースに定義されている各音声出力位置の基準位置からの方向を示す。図７では、視聴者の位置（図５に示す立方体形状の３次元空間の中心Ｃ）を基準位置（原点（０，０，０））とする、各音声出力位置および各音声出力位置の基準位置からの方向が示されている。各音声出力位置は空間座標で表されており、各音声出力位置の基準位置からの方向は、基準位置からの方位角と仰角とによって表されている。ここで、上記原点から見て、ｚ軸負方向（前方向）の仰角θ_ｅｌを０°、ｙ軸正方向（上方向）の仰角θ_ｅｌを９０°とする。また、上記原点から見て、ｚ軸負方向（前方向）の方位角θ_ａｚを０°、ｘ軸正方向（右方向）の方位角θ_ａｚを９０°、ｚ軸正方向（後方向）の方位角θ_ａｚを１８０°、ｘ軸負方向（左方向）の方位角θ_ａｚを２７０°とする。 FIG. 7 shows the direction from the reference position of each audio output position defined in the vector database shown in FIG. In FIG. 7, each audio output position and the reference of each audio output position with the viewer position (center C of the cubic three-dimensional space shown in FIG. 5) as the reference position (origin (0, 0, 0)). The direction from the position is shown. Each audio output position is represented by spatial coordinates, and the direction of each audio output position from the reference position is represented by an azimuth angle and an elevation angle from the reference position. Here, as viewed from the origin, 0 ° elevation theta _el of z-axis negative direction (forward direction), the elevation angle theta _el of y-axis positive direction (upward direction) to 90 °. Further, when viewed from the origin, 0 ° azimuth angle theta _az in the z-axis negative direction (front direction), 90 ° azimuth angle theta _az of x-axis positive direction (the right direction), the z-axis positive direction (rear direction) 180 ° azimuth angle theta _az of the azimuth angle theta _az of x-axis negative direction (leftward) and 270 °.

出力音声信号生成部２１０は、各副チャンネルの音声を予め定められた音声出力位置から聞こえるようにするために、頭部伝達関数を用いて各副チャンネルの音声信号の周波数特性を補正することにより、各副チャンネルの音声を、当該副チャンネルに音声出力位置に定位させることができる。 The output audio signal generation unit 210 corrects the frequency characteristics of the audio signal of each subchannel using the head-related transfer function so that the audio of each subchannel can be heard from a predetermined audio output position. The sound of each subchannel can be localized at the sound output position of the subchannel.

特に、出力音声信号生成部２１０は、各副チャンネルの音声を、３次元空間における音声出力位置から聞こえるようにするために、水平方向の頭部伝達関数と垂直方向の頭部伝達関数とを用いて、各副チャンネルの音声信号の周波数特性を補正することにより、各副チャンネルの音声を、当該副チャンネルに対応する音声出力位置に定位させることができる。 In particular, the output audio signal generation unit 210 uses a horizontal head-related transfer function and a vertical head-related transfer function so that the audio of each subchannel can be heard from the audio output position in the three-dimensional space. Thus, by correcting the frequency characteristics of the audio signal of each sub-channel, the audio of each sub-channel can be localized at the audio output position corresponding to the sub-channel.

例えば、音声信号生成部２１０は、ある出力チャンネルの出力音声信号を生成する際、当該出力チャンネルに対応する副チャンネル毎に、以下の処理（２−１）〜（２−７）を行うことにより、当該副チャンネルの音声信号の周波数特性を補正する。そして、音声信号生成部２１０は、当該副チャンネルの補正後の音声信号を用いて、当該出力チャンネルの出力音声信号を生成する。 For example, when the audio signal generation unit 210 generates an output audio signal of a certain output channel, the audio signal generation unit 210 performs the following processes (2-1) to (2-7) for each sub-channel corresponding to the output channel. The frequency characteristic of the audio signal of the subchannel is corrected. Then, the audio signal generation unit 210 generates an output audio signal of the output channel using the corrected audio signal of the subchannel.

（２−１）基準位置（視聴者の位置）からの、当該副チャンネルの仮想的な音源（音声出力位置）の水平方向（方位角）に対応付けられている周波数特性を、記憶部２００に記憶されている水平方向の頭部伝達関数から得る。 (2-1) The frequency characteristic associated with the horizontal direction (azimuth angle) of the virtual sound source (audio output position) of the subchannel from the reference position (viewer position) is stored in the storage unit 200. Obtained from the stored horizontal head-related transfer function.

（２−２）基準位置（視聴者の位置）からの、当該副チャンネルの仮想的な音源（音声出力位置）の垂直方向（仰角）に対応付けられている周波数特性を、記憶部２００に記憶されている垂直方向の頭部伝達関数から得る。 (2-2) The frequency characteristic associated with the vertical direction (elevation angle) of the virtual sound source (audio output position) of the sub-channel from the reference position (viewer position) is stored in the storage unit 200. Obtained from the vertical head-related transfer function.

（２−３）基準位置（視聴者の位置）からの、実際に音声が出力される位置（主チャンネルの音声出力位置）の水平方向（方位角）に対応付けられている周波数特性を、記憶部２００に記憶されている水平方向の頭部伝達関数から得る。 (2-3) Stores frequency characteristics associated with the horizontal direction (azimuth angle) of the position (audio output position of the main channel) where audio is actually output from the reference position (viewer position). Obtained from the horizontal head-related transfer function stored in unit 200.

（２−４）基準位置（視聴者の位置）からの、実際に音声が出力される位置（主チャンネルの音声出力位置）の垂直方向（仰角）に対応付けられている周波数特性を、記憶部２００に記憶されている垂直方向の頭部伝達関数から得る。 (2-4) The frequency characteristics associated with the vertical direction (elevation angle) of the position (audio output position of the main channel) where the sound is actually output from the reference position (viewer position) It is obtained from the vertical head related transfer function stored in 200.

（２−５）上記（２−１）で得られた周波数特性と上記（２−２）で得られた周波数特性との差分（水平方向についての周波数特性の差分）を求める。 (2-5) The difference (frequency characteristic difference in the horizontal direction) between the frequency characteristic obtained in (2-1) and the frequency characteristic obtained in (2-2) is obtained.

（２−６）上記（２−３）で得られた周波数特性と上記（２−４）で得られた周波数特性との差分（垂直方向についての周波数特性の差分）を求める。 (2-6) A difference (frequency characteristic difference in the vertical direction) between the frequency characteristic obtained in the above (2-3) and the frequency characteristic obtained in the above (2-4) is obtained.

（２−７）上記（２−５）で得られた差分、および、上記（２−６）で得られた差分を用いて当該副チャンネルの音声信号の周波数特性を補正する。 (2-7) The frequency characteristic of the audio signal of the subchannel is corrected using the difference obtained in (2-5) above and the difference obtained in (2-6) above.

例えば、主チャンネルがＦＬであり、副チャンネルがＴｐＦＬの場合、音声信号生成部２１０は、方位角θ_ａｚについての、周波数特性ｆ_ａｚ（３１５°）と、ｆ_ａｚ（３１５°）との差分（このような同値の差分は１とする。）を求め、仰角θ_ｅｌについての、周波数特性ｆ_ｅｌ（０°）とｆ_ｅｌ（４５°）との差分を求める。そして、音声信号生成部２１０は、求められた差分｛ｆ_ａｚ（３１５°）−ｆ_ａｚ（３１５°）｝および｛ｆ_ｅｌ（０°）−ｆ_ｅｌ（４５°）｝を、副チャンネルＴｐＦＬの音声信号に乗じる。これにより、副チャンネルＴｐＦＬの音声は、音声出力位置ＴｐＦＬに定位される。すなわち、副チャンネルＴｐＦＬの音声は、あたかも音声出力位置ＴｐＦＬから聞こえてくるかの如く、音声出力位置ＦＬから出力される。 For example, when the main channel is FL and the sub-channel is TpFL, the audio signal generation unit 210 determines the difference between the frequency characteristics f _az (315 °) and f _az (315 °) for the azimuth angle θ _az ( Such a difference of the same value is set to 1.) and the difference between the frequency characteristics f _el (0 °) and f _el (45 °) with respect to the elevation angle θ _el is obtained. Then, the audio signal generation unit 210 uses the obtained differences {f _az (315 °) −f _az (315 °)} and {f _el (0 °) −f _el (45 °)} for the subchannel TpFL. Multiply the audio signal. Thereby, the sound of the subchannel TpFL is localized at the sound output position TpFL. That is, the sound of the subchannel TpFL is output from the sound output position FL as if it is heard from the sound output position TpFL.

以下、副チャンネルの音声信号の補正処理を含んだ、出力音声信号の生成処理に用いる、数式の一例を説明する。 Hereinafter, an example of a mathematical expression used for output audio signal generation processing including subchannel audio signal correction processing will be described.

例えば、上記出力チャンネルＢについて、主チャンネルであるチャンネルＦＲの音声信号をＳ_ＦＲとし、３次元空間の右側の領域に音声出力位置が配置されている副チャンネルであるチャンネルＴｐＦＲ，ＴｐＢＲ，・・・の音声信号をＳ_ＴｐＦＲ，Ｓ_ＴｐＢＲ，・・・とした場合、出力音声信号生成部２１０は、以下の数式（数３）を用いて、上記各副チャンネルの音声信号の補正処理を行うとともに、上記各音声信号をダウンミックスすることにより、当該出力チャンネルＢの出力音声信号Ｓ_ＦＲ’を生成することができる。 For example, for the output channel B, and audio signals of the channels FR is the main channel and S _FR, 3-dimensional channel audio output located on the right side of the region is sub-channels that are arranged in space TpFR, TpBR, ··· the audio signal _{_S TpFR,} _S _TpBR, when a., the output audio signal generation unit 210, using the following equation (equation 3), performs correction processing of the audio signals of the respective sub-channel, By downmixing each of the audio signals, an output audio signal S _FR ′ of the output channel B can be generated.

Ｓ_ＦＲ’＝ａ×｛Ｓ_ＦＲ＋１／√２｛ｆ_ａｚ（４５°）−ｆ_ａｚ（０°）｝｛ｆ_ｅｌ（４５°）−ｆ_ｅｌ（４５°）｝Ｓ_ＴｐＦＲ＋１／√２｛ｆ_ａｚ（１３５°）−ｆ_ａｚ（０°）｝｛ｆ_ｅｌ（４５°）−ｆ_ｅｌ（４５°）｝Ｓ_ＴｐＢＲ＋・・・｝・・・（数３）
但し、上記数式（数３）において、ａは、オーバーフロー（Ｓ_ＦＲ’が１以上となること）を防止するための係数であり、例えば、０〜１の範囲内の値が用いられる。なお、オーバーフローの防止は、各ダウンミックス係数を調整することによって実現されてもよい。 S _FR '= a × {S _FR + 1 / √2 {f _az (45 °) −f _az (0 °)} {f _el (45 °) −f _el (45 °)} S _TpFR + 1 / √2 { f _az (135 °) −f _az (0 °)} {f _el (45 °) −f _el (45 °)} S _TpBR +...} ( _Equation 3)
However, in the above mathematical formula (Equation 3), a is a coefficient for preventing overflow (S _FR 'becomes 1 or more), and for example, a value in the range of 0 to 1 is used. In addition, prevention of overflow may be realized by adjusting each downmix coefficient.

（効果）
このように、本実施形態のテレビジョン受像機１００は、利用可能な出力チャンネル毎に、当該出力チャンネルに対応する各副チャンネルの音声が予め定められた音声出力位置に定位するように、当該出力チャンネルに対応する各副チャンネルの音声信号の周波数特性を、頭部伝達関数を用いて補正する。 (effect)
As described above, the television receiver 100 according to the present embodiment outputs the output so that the sound of each sub-channel corresponding to the output channel is localized at a predetermined sound output position for each available output channel. The frequency characteristic of the audio signal of each subchannel corresponding to the channel is corrected using the head-related transfer function.

これにより、本実施形態のテレビジョン受像機１００によれば、各副チャンネルの音声を、３次元空間における所定の音声出力位置に定位させることができる。すなわち、各副チャンネルの音声を、３次元空間における水平方向のみならず垂直方向についても、所定の音声出力位置に定位させることができる。これにより、視聴者に対し、各副チャンネルの音声を、上記所定の音声出力位置から聞こえてくるように感じさせることができる。このため、３次元空間において適切な位置にスピーカが配置されていない場合であっても、より良好かつ立体的なサラウンド効果を得ることができる。 Thereby, according to the television receiver 100 of this embodiment, the sound of each subchannel can be localized at a predetermined sound output position in the three-dimensional space. That is, the sound of each subchannel can be localized at a predetermined sound output position not only in the horizontal direction but also in the vertical direction in the three-dimensional space. Thereby, it is possible to make the viewer feel that the audio of each sub-channel is heard from the predetermined audio output position. For this reason, even if the speaker is not disposed at an appropriate position in the three-dimensional space, a better and three-dimensional surround effect can be obtained.

（プログラム、記憶媒体）
実施形態で説明したテレビジョン受像機１００の各機能は、集積回路（ＩＣチップ）上に形成された論理回路によってハードウェア的に実現してもよいし、ＣＰＵ（Central Processing Unit）を用いてソフトウェア的に実現してもよい。 (Program, storage medium)
Each function of the television receiver 100 described in the embodiment may be realized in hardware by a logic circuit formed on an integrated circuit (IC chip), or software using a CPU (Central Processing Unit). May be realized.

例えば、テレビジョン受像機１００は、各機能を実現するプログラムの命令を実行するＣＰＵ、上記プログラムを格納したＲＯＭ（Read Only Memory）、上記プログラムを展開するＲＡＭ（Random Access Memory）、上記プログラム及び各種データを格納するメモリ等の各種記憶装置（記録媒体）を備えている。そして、上記ＣＰＵが、上記各種記憶装置に格納されているプログラムを読み出し、このプログラムを実行することによって、音声制御回路１２０の各機能を実現することができる。 For example, the television receiver 100 includes a CPU that executes instructions of a program that realizes each function, a ROM (Read Only Memory) that stores the program, a RAM (Random Access Memory) that develops the program, the program, and various programs. Various storage devices (recording media) such as a memory for storing data are provided. Then, the CPU reads out the programs stored in the various storage devices and executes the programs, whereby each function of the voice control circuit 120 can be realized.

上記記録媒体としては、例えば、磁気テープやカセットテープ等のテープ類、フロッピー（登録商標）ディスク／ハードディスク等の磁気ディスクやＣＤ−ＲＯＭ／ＭＯ／ＭＤ／ＤＶＤ／ＣＤ−Ｒ等の光ディスクを含むディスク類、ＩＣカード（メモリカードを含む）／光カード等のカード類、マスクＲＯＭ／ＥＰＲＯＭ／ＥＥＰＲＯＭ（登録商標）／フラッシュＲＯＭ等の半導体メモリ類、あるいはＰＬＤ（Programmable logic device）やＦＰＧＡ（Field Programmable Gate Array）等の論理回路類等を用いることができる。 Examples of the recording medium include tapes such as magnetic tapes and cassette tapes, magnetic disks such as floppy (registered trademark) disks / hard disks, and disks including optical disks such as CD-ROM / MO / MD / DVD / CD-R. IC cards (including memory cards) / optical cards, semiconductor memories such as mask ROM / EPROM / EEPROM (registered trademark) / flash ROM, PLD (Programmable logic device) and FPGA (Field Programmable Gate) Logic circuits such as (Array) can be used.

なお、上記プログラムは、通信ネットワークを介してテレビジョン受像機１００に供給されてもよい。この通信ネットワークは、少なくとも上記プログラムをテレビジョン受像機１００に伝送可能であればよく、その種類はどのようなものであっても良い。例えば、通信ネットワークとしては、インターネット、イントラネット、エキストラネット、ＬＡＮ、ＩＳＤＮ、ＶＡＮ、ＣＡＴＶ通信網、仮想専用網（Virtual Private Network）、電話回線網、移動体通信網、衛星通信網等が利用可能である。 The program may be supplied to the television receiver 100 via a communication network. The communication network only needs to be able to transmit at least the program to the television receiver 100, and may be of any type. For example, the Internet, intranet, extranet, LAN, ISDN, VAN, CATV communication network, virtual private network, telephone line network, mobile communication network, satellite communication network, etc. can be used as the communication network. is there.

また、上記プログラムをテレビジョン受像機１００に供給するための伝送媒体としても、どのような種類のものを利用しても良い。例えば、伝送媒体として、ＩＥＥＥ１３９４、ＵＳＢ、電力線搬送、ケーブルＴＶ回線、電話線、ＡＤＳＬ（Asymmetric Digital Subscriber Line）回線等の有線によるものを利用しても良い。また、伝送媒体として、ＩｒＤＡやリモコンのような赤外線、Ｂｌｕｅｔｏｏｔｈ（登録商標）、ＩＥＥＥ８０２１１無線、ＨＤＲ（High Data Rate）、ＮＦＣ（Near Field Communication）、ＤＬＮＡ、携帯電話網、衛星回線、地上波デジタル網等の無線によるものを利用しても良い。 Also, any kind of transmission medium for supplying the program to the television receiver 100 may be used. For example, a wired medium such as IEEE1394, USB, power line carrier, cable TV line, telephone line, ADSL (Asymmetric Digital Subscriber Line) line may be used as the transmission medium. As transmission media, infrared rays such as IrDA and remote control, Bluetooth (registered trademark), IEEE80211 wireless, HDR (High Data Rate), NFC (Near Field Communication), DLNA, cellular phone network, satellite line, terrestrial digital network You may use the thing by radio | wireless.

（補足事項）
本発明は上述した各実施形態に限定されるものではなく、請求項に示した範囲で種々の変更が可能であり、また、異なる実施形態にそれぞれ開示された技術的手段を適宜組み合わせて得られる実施形態についても本発明の技術的範囲に含まれる。 (Supplementary information)
The present invention is not limited to the above-described embodiments, and various modifications can be made within the scope of the claims, and the technical means disclosed in different embodiments can be appropriately combined. Embodiments are also included in the technical scope of the present invention.

（本発明の適用対象）
例えば、実施形態では、本発明をテレビジョン受像機に適用した例を説明したが、少なくとも実施形態で説明したような音声制御処理を行うことが可能なものであれば、これ以外の装置（例えば、チューナ装置、オーディオレコーダ／プレーヤ（音声再生装置）、オーディオアンプ、ＨＤＤレコーダ／プレーヤ、ＢＤレコーダ／プレーヤ等）にも、本発明を適用することができる。 (Application object of the present invention)
For example, in the embodiment, the example in which the present invention is applied to a television receiver has been described. However, any device (for example, any device) can be used as long as the sound control process described in the embodiment can be performed. The present invention can also be applied to tuner devices, audio recorders / players (sound reproduction devices), audio amplifiers, HDD recorders / players, BD recorders / players, and the like.

また、実施形態では、本発明を２２．２チャンネルの音響システムに適用した例を説明したが、これ以外のマルチチャンネル音響システム（５．１チャンネルの音響システム等）にも、本発明を適用することができる。 In the embodiment, the example in which the present invention is applied to a 22.2 channel sound system has been described. However, the present invention is also applied to other multi-channel sound systems (such as a 5.1 channel sound system). be able to.

（スピーカの位置の検出方法）
また、実施形態では、スピーカの位置を検出する方法として、位置検出用の音声による方法を採用しているが、これに限らない。例えば、スピーカの位置を検出する方法として、カメラから得られた画像、赤外線センサ、超音波センサ、レーザセンサ等による方法を採用してもよい。また、スピーカの位置を、ユーザに設定させる構成を採用してもよい。例えば、３次元空間のイメージ画像（例えば、図５に示す立法体のようなもの）をグラフィカルに表示し、このイメージ画像に対し、スピーカの位置をユーザに設定させるようにしてもよい。 (Speaker position detection method)
In the embodiment, as a method for detecting the position of the speaker, a method using sound for position detection is employed, but the method is not limited thereto. For example, as a method of detecting the position of the speaker, a method using an image obtained from a camera, an infrared sensor, an ultrasonic sensor, a laser sensor, or the like may be employed. Moreover, you may employ | adopt the structure which makes a user set the position of a speaker. For example, an image image in a three-dimensional space (for example, the legitimate body shown in FIG. 5) may be displayed graphically, and the position of the speaker may be set for the image image by the user.

（スピーカ）
実施形態では、外部スピーカ１２０を対象のスピーカとしたが、テレビジョン受像機１００は、内蔵スピーカを備えていてもよく、この場合、内蔵スピーカを、実施形態で説明した外部スピーカ１２０と同様に扱ってもよい。すなわち、内蔵スピーカを含めて、マルチチャンネル音響システムを構築してもよい。 (Speaker)
In the embodiment, the external speaker 120 is the target speaker. However, the television receiver 100 may include a built-in speaker. In this case, the built-in speaker is handled in the same manner as the external speaker 120 described in the embodiment. May be. That is, a multi-channel sound system may be constructed including a built-in speaker.

（基準位置）
図５に示す立方体形状の３次元空間の中心Ｃ（図５参照）を基準位置（視聴者の位置）として、各音声出力位置を規定したが、これに限らず、例えば、実際の視聴者の位置を検出して、検出された位置を基準位置をとして、各音声出力位置を規定してもよい。 (Reference position)
Each voice output position is defined with the center C (see FIG. 5) of the cubic three-dimensional space shown in FIG. 5 as a reference position (viewer position). A position may be detected, and each sound output position may be defined using the detected position as a reference position.

本発明は、マルチチャンネル音響システムに組み込むことが可能な、テレビジョン受像機、チューナ装置、オーディオプレーヤー、オーディオアンプ、ＨＤＤレコーダ／プレーヤ、ＢＤレコーダ／プレーヤ、パーソナルコンピュータ等の各種機器に適用することができる。 The present invention can be applied to various devices such as a television receiver, a tuner device, an audio player, an audio amplifier, an HDD recorder / player, a BD recorder / player, and a personal computer that can be incorporated into a multi-channel sound system. it can.

１００テレビジョン受像機
１０２チューナユニット
１０４ディスプレイ駆動回路
１０６ディスプレイ
１０８音声出力回路
１１０外部機器インタフェース
１１２通信インタフェース
１２０音声制御回路（音声制御装置）
１２２マイク
１５０外部スピーカ
２００記憶部
２０２検出部（検出手段）
２０４主チャンネル決定部（主チャンネル決定手段）
２０８副チャンネル決定部（副チャンネル決定手段）
２１０出力音声信号生成部（出力音声信号生成手段）
２１２音声信号出力部 DESCRIPTION OF SYMBOLS 100 Television receiver 102 Tuner unit 104 Display drive circuit 106 Display 108 Audio | voice output circuit 110 External apparatus interface 112 Communication interface 120 Audio | voice control circuit (audio | voice control apparatus)
122 Microphone 150 External speaker 200 Storage unit 202 Detection unit (detection means)
204 Main channel determination unit (main channel determination means)
208 Sub-channel determination unit (sub-channel determination means)
210 Output audio signal generator (output audio signal generator)
212 Audio signal output unit

Claims

In a multi-channel audio signal including a plurality of channels, wherein a multi-channel audio signal having a predetermined audio output position corresponding to each channel is output via a plurality of output channels,
For each of the output channels, and the position in the three-dimensional space of the speaker connected to the output channel, based on a sound output position corresponding to each of the plurality of channels contained in the multi-channel audio signal, the Main channel determining means for determining a main channel to be assigned to the output channel from the plurality of channels included in the multi-channel audio signal ;
Sub-channel determining means for determining, for each output channel, a sub-channel assigned to the output channel from among the plurality of channels included in the multi-channel audio signal ;
For each output channel, an output audio signal to be output via the output channel is generated using the audio signal of the main channel assigned to the output channel and the audio signal of the sub-channel assigned to the output channel. and an output audio signal generation means for,
The plurality of channels included in the multi-channel audio signal are:
It is grouped in advance by area,
The sub-channel determining means includes
For each output channel, a channel included in the group corresponding to the main channel assigned to the output channel is assigned to a sub-channel assigned to the output channel.
A voice control device characterized by being determined as a channel .

The output audio signal generating means is
For each output channel, by downmixing the audio signal of the main channel assigned to the output channel and the audio signal of the subchannel assigned to the output channel, an output audio signal output via the output channel is obtained. The voice control device according to claim 1, wherein the voice control device is generated.

The output audio signal generating means is
For each output channel, the frequency of the audio signal of the subchannel assigned to the output channel so that the sound represented by the audio signal of the subchannel assigned to the output channel is localized at the audio output position corresponding to the subchannel. After correcting the characteristics using the head-related transfer function, the audio signal of the main channel assigned to the output channel and the signal of the sub-channel assigned to the output channel are downmixed, so that The voice control apparatus according to claim 2, wherein an output voice signal to be output is generated.

The output audio signal generating means is
For each output channel, the audio signal of the subchannel assigned to the output channel is corrected using the horizontal head related transfer function in the three-dimensional space and the vertical head related transfer function in the three-dimensional space. The voice control device according to claim 3.

The main channel determining means includes
For each of the output channels, a first direction which is a direction from a reference position of a speaker connected to the output channel, and the audio of each of the plurality of channels included in the multi-channel audio signal Based on an output position and a second direction that is a direction from the reference position, a channel included in the multi-channel audio signal in which the second direction is closest to the first direction is The voice control device according to any one of claims 1 to 4, wherein the voice control device is determined as a main channel to be assigned to an output channel.

Wherein for each output channel, the sound control apparatus according to any one of claims 1 to 5, further comprising a detection means for detecting the position of the speaker connected to the output channel.

The group is set for each combination of main channels according to audio output position information corresponding to each channel included in the multi-channel audio signal.
The voice control apparatus according to any one of claims 1 to 6, wherein

An audio reproduction device comprising the audio control device according to claim 1.

A television receiver comprising the voice control device according to any one of claims 1 to 7.

In a multichannel audio signal including a plurality of channels, a multichannel audio signal having a predetermined audio output position corresponding to each channel is output via the plurality of output channels.
For each of the output channels, and the position in the three-dimensional space of the speaker connected to the output channel, based on a sound output position corresponding to each of the plurality of channels contained in the multi-channel audio signal, the A main channel determining step for determining a main channel to be assigned to the output channel from the plurality of channels included in the multi-channel audio signal ;
A sub-channel determination step for determining, for each output channel, a sub-channel to be allocated to the output channel from among the plurality of channels included in the multi-channel audio signal ;
For each output channel, an output audio signal to be output via the output channel is generated using the audio signal of the main channel assigned to the output channel and the audio signal of the sub-channel assigned to the output channel. and an output audio signal generation step of viewing including,
The plurality of channels included in the multi-channel audio signal are:
It is grouped in advance by area,
In the sub-channel determination step,
For each output channel, a channel included in the group corresponding to the main channel assigned to the output channel is assigned to a sub-channel assigned to the output channel.
A voice control method characterized by determining as a channel .

A program for causing a computer to function as the voice control device according to any one of claims 1 to 7, wherein the program causes the computer to function as each unit included in the voice control device.

The computer-readable recording medium which has recorded the program of Claim 11.