JP6204684B2

JP6204684B2 - Acoustic signal reproduction device

Info

Publication number: JP6204684B2
Application number: JP2013079623A
Authority: JP
Inventors: 大出　訓史; 訓史大出; 靖茂中山
Original assignee: Japan Broadcasting Corp; NHK Engineering System Inc
Current assignee: Japan Broadcasting Corp; NHK Engineering System Inc
Priority date: 2013-04-05
Filing date: 2013-04-05
Publication date: 2017-09-27
Anticipated expiration: 2033-04-05
Also published as: JP2014204323A

Description

The present invention relates to an acoustic signal reproduction device for a multi-channel sound system having a plurality of acoustic space layers.

In addition to the 2-channel and 5.1-channel sound systems currently used in program production, a number of sound systems have been proposed, including "three-dimensional (stereophonic) sound systems" that go beyond 5.1 channels, such as 7.1-channel and 22.2-channel systems. ITU-R, an international standardization body for audio, has defined, as an ITU-R Recommendation, the requirements for a three-dimensional sound system (advanced multichannel audio system) beyond the 5.1-channel sound system (Non-Patent Document 1), and further sound systems are expected to be proposed. Expressing these sound systems in a common format yields a flexible system that is applicable to next-generation audio systems and can be put to use in a variety of fields.

Non-Patent Document 1: "Performance requirements for an advanced multichannel stereophonic sound system for use with or without accompanying picture", ITU-R Recommendation BS.1909

As a common format capable of expressing various sound systems, "acoustic signals having a single acoustic space layer" are under study. Here, the sound constructed by a plurality of spatially arranged channel signals is defined as a single acoustic space layer. In conventional program production, all the sounds needed for a program are placed in a single acoustic space layer. If the acoustic space layer that has so far been kept as one is instead divided into several layers during production, and the format of an "acoustic signal having a plurality of acoustic space layers" is used, the received acoustic signal can easily be transformed, converted, or replaced to suit the recipient in a program exchange or the environment in the home. Hereinafter, "multi-channel sound system" means a "sound system having a plurality of acoustic space layers".

For example, broadcast programs transmitted using a multi-channel sound system are produced in various sound systems and reproduced in various reproduction environments. There is also demand for services that use information from before the broadcast program is finalized, such as raising only the volume of the speech in an environment with loud household noise or outdoors. However, if the sound to be controlled is present only between certain times and is silent otherwise, transmitting an acoustic channel signal that is silent for long stretches of the program incurs transmission cost. In addition, the required sound quality differs within a program, for example between narration and music, and always transmitting a sound system with a high computational cost is costly.

Accordingly, an object of the present invention, made in view of these points, is to provide an acoustic signal reproduction device that supports a multi-channel sound system having a plurality of acoustic space layers and that can reproduce an acoustic space layer from the minimum necessary data by time-dividing that acoustic space layer.

To solve the problems described above, an acoustic signal reproduction device according to the present invention is a reproduction device for a multi-channel acoustic signal including a plurality of acoustic space layers. Metadata included in the multi-channel acoustic signal includes the number of acoustic space layers, the number of channels and the content of each acoustic space layer, the arrangement of each acoustic channel signal in each acoustic space layer, and a plurality of time-divided reproduction times for each acoustic space layer. The device comprises a decoding unit that, based on the reproduction time of each acoustic space layer described in the metadata, decodes only the acoustic channel signals corresponding to the reproduction time among the acoustic space layers transmitted in a time-division manner, and a reproduction time adjustment unit that reproduces the decoded acoustic channel signals in accordance with the reproduction time.

Preferably, acoustic space layers having the same content are defined as a group based on the content of each acoustic space layer included in the metadata, and the decoding unit decodes only the acoustic channel signals included in the acoustic space layer selected from the group.

Preferably, in a section where the acoustic channel signal of the selected acoustic space layer is silent, the decoding unit decodes the acoustic channel signal of another acoustic space layer included in the group.

The acoustic signal reproduction device according to the present invention supports a multi-channel sound system having a plurality of acoustic space layers, and can reproduce an acoustic space layer from the minimum necessary data by time-dividing that acoustic space layer.

FIG. 1 is a diagram showing the configuration of an acoustic signal reproduction device according to an embodiment of the present invention.
FIG. 2 is a diagram showing an example of the acoustic space layers included in a multi-channel acoustic signal.
FIG. 3 is a diagram showing an example of the acoustic channel signals and metadata in a multi-channel acoustic signal.
FIG. 4 is a diagram showing another example of the acoustic channel signals and metadata in a multi-channel acoustic signal.
FIG. 5 is a diagram showing another example of the acoustic channel signals and metadata in a multi-channel acoustic signal.
FIG. 6 is a diagram showing the configuration of an acoustic signal creation device according to an embodiment of the present invention.

Embodiments of the present invention are described in detail below with reference to the drawings. The present invention addresses multi-channel acoustic signals, that is, "acoustic signals having a plurality of acoustic space layers". The applicant has filed a Korean patent application (10-2012-0112984) concerning "acoustic signals having a single acoustic space layer" and a Japanese patent application (Japanese Patent Application No. 2013-010544) concerning "acoustic signals having a plurality of acoustic space layers".

FIG. 1 is a diagram showing the configuration of an acoustic signal reproduction device according to an embodiment of the present invention. The acoustic signal reproduction device 10 includes a demultiplexer 11 (DEMUX), a decoding unit 12, a reproduction time adjustment unit 15, and a reproduction channel conversion unit 13. The output signal of the acoustic signal reproduction device 10 is reproduced as sound by speakers 14.

The demultiplexer 11 separates the input multi-channel acoustic data stream into metadata and acoustic channel signals. The demultiplexer 11 outputs the acoustic channel signals to the decoding unit 12, and outputs the metadata to the decoding unit 12, the reproduction time adjustment unit 15, and the reproduction channel conversion unit 13.

FIG. 2 is a diagram showing an example of the acoustic space layers included in a multi-channel acoustic signal (acoustic data stream) in the present embodiment. The multi-channel acoustic signal of FIG. 2 comprises two acoustic space layers. The first acoustic space layer 200 consists of stereo (2ch) acoustic channels (Lch 210, Rch 220) and reproduces Music (background sound, ambient sound, music, and the like). The second acoustic space layer 300 consists of stereo (2ch) acoustic channels (Lch 310, Rch 320) and reproduces Dialogue (actors' lines, conversation, and the like).

FIG. 3 is a diagram showing an example of the acoustic channel signals and metadata in the multi-channel acoustic signal shown in FIG. 2. The metadata (Sound Essence 000) in FIG. 3 states that the multi-channel acoustic signal has two acoustic space layers (Sound Field). The first acoustic space layer 200 (Sound Field 01) is described as a 2ch stereo configuration in which acoustic channel signal 210 (Channel 01) is Lch and acoustic channel signal 220 (Channel 02) is Rch. The second acoustic space layer 300 (Sound Field 02) is also a 2ch stereo configuration, but its acoustic channel signals are time-divided into Dialogue 01 (310, 320) and Dialogue 02 (330, 340), and the reproduction time of each acoustic channel signal is described in the metadata. That is, the second acoustic space layer 300 is an acoustic space layer containing acoustic channel signals that are time-divided with the silent sections (null sections) removed. The metadata states that, from time 0:00:00, acoustic channel signal 310 (Channel 01) is reproduced from Lch and acoustic channel signal 320 (Channel 02) from Rch, and that, from time 0:01:00, acoustic channel signal 330 (Channel 01) is reproduced from Lch and acoustic channel signal 340 (Channel 02) from Rch. Describing the reproduction times in the metadata makes it unnecessary to transmit the silent sections (null sections) of each channel, which reduces the transmission cost.
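The metadata layout of FIG. 3 can be pictured as a small data structure. The following Python sketch is illustrative only: the class names, the field names, and all segment durations (20 s, 30 s, 120 s) are assumptions, since the text gives only the start times 0:00:00 and 0:01:00.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    name: str             # e.g. "Dialogue 01"
    start_s: int          # reproduction time from the metadata, in seconds
    duration_s: int       # hypothetical length; not given in the source
    channels: List[str]   # channel arrangement, e.g. ["L", "R"]


@dataclass
class SoundField:
    field_id: int
    content: str          # e.g. "Music" or "Dialogue"
    segments: List[Segment]


def transmitted_seconds(sound_field: SoundField) -> int:
    """Seconds of audio actually carried for a layer (null sections excluded)."""
    return sum(seg.duration_s for seg in sound_field.segments)


# Sound Essence 000 of FIG. 3, with assumed durations.
music = SoundField(1, "Music", [Segment("Music", 0, 120, ["L", "R"])])
dialogue = SoundField(2, "Dialogue", [
    Segment("Dialogue 01", 0, 20, ["L", "R"]),    # starts at 0:00:00
    Segment("Dialogue 02", 60, 30, ["L", "R"]),   # starts at 0:01:00
])
sound_essence = [music, dialogue]

print(transmitted_seconds(dialogue))  # 50 of the 120 s in this assumed program
```

With these assumed numbers, the dialogue layer carries 50 s of audio rather than 120 s, which is the kind of saving the null-section omission is meant to provide.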

The decoding unit 12 decodes the acoustic channel signals of each acoustic space layer based on the reproduction times of the acoustic space layers described in the metadata included in the multi-channel acoustic signal, and groups the acoustic channel signals by acoustic space layer based on the metadata. By referring to the metadata, the decoding unit 12 may skip decoding of any acoustic channel signal that falls outside the times and channels to be reproduced. This reduces the power consumed by decoding.
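The selective decoding described for the decoding unit 12 might look like the sketch below. It is a minimal illustration, not the patented implementation: the segment dictionary keys, the `decode_for_window` function, and the stub decoder are all hypothetical, and the durations are assumed.

```python
from typing import Callable, Dict, List, Tuple

Segment = Dict[str, object]  # hypothetical keys: name, start_s, duration_s, payload


def decode_for_window(
    segments: List[Segment],
    window: Tuple[int, int],
    decode: Callable[[bytes], List[int]],
) -> Dict[str, List[int]]:
    """Decode only the segments whose reproduction time overlaps the window."""
    win_start, win_end = window
    decoded = {}
    for seg in segments:
        seg_end = seg["start_s"] + seg["duration_s"]
        if seg["start_s"] < win_end and seg_end > win_start:
            decoded[seg["name"]] = decode(seg["payload"])
        # Segments outside the window are never passed to the decoder.
    return decoded


calls = []


def stub_decode(payload: bytes) -> List[int]:
    """Stand-in for a real audio decoder; records each call."""
    calls.append(payload)
    return list(payload)


segments = [
    {"name": "Dialogue 01", "start_s": 0, "duration_s": 20, "payload": b"\x01\x02"},
    {"name": "Dialogue 02", "start_s": 60, "duration_s": 30, "payload": b"\x03\x04"},
]
result = decode_for_window(segments, (50, 100), stub_decode)
print(sorted(result))  # ['Dialogue 02']
```

Only one decoder call is made for the 50 s to 100 s window, which mirrors the power saving mentioned in the text.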

The reproduction time adjustment unit 15 adjusts the reproduction time of the decoded acoustic channel signals based on the reproduction times of the acoustic space layers described in the metadata included in the multi-channel acoustic signal. The reproduction time adjustment unit 15 temporarily stores the grouped acoustic channel signals in memory and adjusts their reproduction time by delaying them until the reproduction time written in the metadata.
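The store-and-delay behaviour of the reproduction time adjustment unit 15 can be sketched with a priority queue keyed on the reproduction time. The class and method names below are hypothetical, and real-time clock handling is left out.

```python
import heapq
import itertools
from typing import List, Tuple


class ReproductionTimeAdjuster:
    """Holds decoded segments in memory until their reproduction time arrives."""

    def __init__(self) -> None:
        self._heap: list = []
        self._order = itertools.count()  # tie-breaker so samples are never compared

    def store(self, start_s: int, samples: List[float]) -> None:
        heapq.heappush(self._heap, (start_s, next(self._order), samples))

    def release_due(self, now_s: int) -> List[Tuple[int, List[float]]]:
        """Return every stored segment whose reproduction time is <= now_s."""
        due = []
        while self._heap and self._heap[0][0] <= now_s:
            start_s, _, samples = heapq.heappop(self._heap)
            due.append((start_s, samples))
        return due


adjuster = ReproductionTimeAdjuster()
adjuster.store(60, [0.5, 0.5])   # arrives early, reproduced at 0:01:00
adjuster.store(0, [0.25, 0.25])  # reproduced at 0:00:00

print(adjuster.release_due(0))   # [(0, [0.25, 0.25])]
print(adjuster.release_due(30))  # []
print(adjuster.release_due(60))  # [(60, [0.5, 0.5])]
```

A heap keeps the earliest reproduction time at the front, so segments can arrive in any order and still be released in time order.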

The reproduction channel conversion unit 13 sums the time-adjusted acoustic channel signals for each reproduction channel and generates the acoustic channel signal to be input to each reproduction speaker.
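As a rough illustration of the per-channel summation performed by the reproduction channel conversion unit 13, the sketch below adds the samples of every layer that feeds the same reproduction channel. It assumes the layers already use the speaker channel labels ("L", "R"); the function name is hypothetical, and no downmix coefficients or gain limiting are applied.

```python
from typing import Dict, List


def mix_to_speakers(layers: List[Dict[str, List[float]]]) -> Dict[str, List[float]]:
    """Sum the samples of all acoustic space layers for each reproduction channel."""
    mixed: Dict[str, List[float]] = {}
    for layer in layers:
        for channel, samples in layer.items():
            out = mixed.setdefault(channel, [])
            if len(out) < len(samples):
                # Pad with silence so layers of different length can be added.
                out.extend([0.0] * (len(samples) - len(out)))
            for i, sample in enumerate(samples):
                out[i] += sample
    return mixed


music_layer = {"L": [0.5, 0.25], "R": [0.25, 0.25]}
dialogue_layer = {"L": [0.25], "R": [0.5]}

print(mix_to_speakers([music_layer, dialogue_layer]))
# {'L': [0.75, 0.25], 'R': [0.75, 0.25]}
```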

Thus, according to the present embodiment, the decoding unit 12 decodes the acoustic channel signals of the acoustic space layers transmitted in a time-division manner, based on the reproduction times of the acoustic space layers described in the metadata included in the multi-channel acoustic signal, and the reproduction time adjustment unit 15 adjusts the reproduction time of the decoded acoustic channel signals. This supports a multi-channel sound system having a plurality of acoustic space layers and allows an acoustic space layer to be reproduced from the minimum necessary data by time-dividing that layer. Because only the minimum necessary acoustic signal data is transmitted and can be reproduced from the desired time, transmission cost can be kept down. Furthermore, by not transmitting acoustic channel signals for periods that carry no information, transmission and exchange of the desired broadcast program can be achieved at low transmission cost.

FIG. 4 is a diagram showing another example of the acoustic channel signals and metadata in a multi-channel acoustic signal. The metadata (Sound Essence 000) in FIG. 4 states that the multi-channel acoustic signal has two acoustic space layers (Sound Field), that the first acoustic space layer 200 (Sound Field 01) reproduces Music (background sound, ambient sound, music, and the like), that the second acoustic space layer 300 (Sound Field 02) and the third acoustic space layer 400 (Sound Field 03) are defined as a Dialogue (actors' lines, conversation, and the like) group, and that whichever of the second acoustic space layer 300 and the third acoustic space layer 400 is selected is reproduced. In this case, for example, the second acoustic space layer 300 is Japanese speech, and the third acoustic space layer 400 is audio description, English speech, or the like. Of the second acoustic space layer 300 and the third acoustic space layer 400, the decoding unit 12 normally decodes the second acoustic space layer 300, but it can switch to decoding the speech of the third acoustic space layer 400 according to the metadata settings or input from the user. In this example, when the viewer selects the third acoustic space layer 400, the decoding unit 12 decodes the third acoustic space layer 400 only in the second section (Dialogue B2), and the reproduction time adjustment unit 15 adjusts the reproduction time of the third acoustic space layer 400. In the other sections (the sections where the third acoustic space layer 400 is silent), the acoustic channel signals of the second acoustic space layer 300, the other acoustic space layer in the group, are reproduced. Because Dialogue A2 and Dialogue B2, which both occupy the second section, differ in length, different reproduction times are set for them. Knowing the reproduction times within the program in this way can also be applied to correcting time offsets between the video signal and the acoustic signal.
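The group behaviour of FIG. 4 (use the selected layer when it has audio, otherwise fall back to another layer of the same group) can be sketched as follows. The function names, the section names Dialogue A1 and Dialogue A3, and every start and end time are assumptions made for illustration; only Dialogue A2 and Dialogue B2 are named in the text.

```python
from typing import Dict, List, Optional, Tuple

Section = Tuple[str, int, int]  # (name, start_s, end_s), times are assumed


def section_at(sections: List[Section], t: int) -> Optional[str]:
    """Name of the section covering time t, or None if the layer is silent."""
    for name, start, end in sections:
        if start <= t < end:
            return name
    return None


def layer_to_reproduce(
    group: Dict[int, List[Section]], selected: int, t: int
) -> Optional[Tuple[int, str]]:
    """Prefer the selected layer; fall back to other layers of the group."""
    order = [selected] + [layer for layer in group if layer != selected]
    for layer in order:
        name = section_at(group[layer], t)
        if name is not None:
            return layer, name
    return None  # every layer in the group is silent at time t


dialogue_group = {
    300: [("Dialogue A1", 0, 20), ("Dialogue A2", 60, 90), ("Dialogue A3", 120, 140)],
    400: [("Dialogue B2", 60, 100)],
}

print(layer_to_reproduce(dialogue_group, 400, 10))   # (300, 'Dialogue A1')
print(layer_to_reproduce(dialogue_group, 400, 70))   # (400, 'Dialogue B2')
print(layer_to_reproduce(dialogue_group, 400, 110))  # None
```

With layer 400 selected, only the second section comes from layer 400, and the remaining dialogue comes from layer 300, as in the description of FIG. 4.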

In this way, the metadata defines a group of acoustic space layers containing acoustic channel signals transmitted in a time-division manner, the decoding unit 12 decodes the acoustic channel signals of the selected acoustic space layer, and the reproduction time adjustment unit 15 adjusts the reproduction time of the acoustic channel signals of the acoustic space layer selected from the group. For example, when performers who speak different languages are interviewed, describing the reproduction times in the metadata means that nothing is transmitted for periods that contain no speech and transmission takes place only for periods that do contain speech, so that only the minimum necessary data is transmitted.

In addition, in sections where the acoustic channel signal of the selected acoustic space layer is silent, the reproduction time adjustment unit 15 adjusts the reproduction time of the acoustic channel signals of another acoustic space layer in the group. As a result, in a multilingual program, for example, the program can be broadcast while automatically switching, among the switchable acoustic space layers, to the acoustic space layer that contains valid audio data.

FIG. 5 is a diagram showing another example of the acoustic channel signals and metadata in a multi-channel acoustic signal. FIG. 5 shows an example in which the sound system is switched within a single program. The required sound quality differs even within the same program, between scenes such as narration that do not particularly need a multi-channel sound system and scenes such as music or live sports coverage that call for a strong sense of presence. As shown in FIG. 5, the acoustic space transmitted within a single program can be switched, for example by using 2-channel acoustic space layers (Sound Field 01, Sound Field 03) for scenes that are mainly speech and a 5-channel acoustic space layer (Sound Field 02) for music performance scenes. Describing in the metadata which sound source is reproduced at which time in the program in this way allows the acoustic signal to be transmitted efficiently.

FIG. 6 is a diagram showing the configuration of an acoustic signal creation device according to an embodiment of the present invention. The acoustic signal creation device 20 includes a mixer 21, an encoding unit 22, and a multiplexer 23 (MUX).

The mixer 21 mixes a plurality of acoustic signals and outputs them to the encoding unit 22 as acoustic channel signals for each acoustic space layer.

The encoding unit 22 encodes the acoustic channel signals of each acoustic space layer from the mixer 21 and outputs them to the multiplexer 23.

The multiplexer 23 (multiplexing unit) multiplexes the acoustic channel signals of the time-divided acoustic space layers with metadata that includes the reproduction times of the acoustic space layers. It multiplexes the metadata input by the program producer or the like with the encoded acoustic channel signals to create a multi-channel acoustic signal having a plurality of acoustic space layers. To deliver the multi-channel acoustic signal by broadcast or transmission, the multiplexer 23 multiplexes the multi-channel acoustic signal and transmits it to remote locations such as homes over radio waves, an IP line, or the like. The multiplexer 23 may also define, in the metadata, a group of acoustic space layers containing acoustic channel signals transmitted in a time-division manner.
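To make the multiplexing step concrete, the sketch below packs metadata and encoded segment payloads into a single byte stream and splits them apart again, as the demultiplexer 11 would on the receiving side. The container layout (a 4-byte big-endian length before a JSON header and before each payload) is purely an assumption for illustration; the source does not specify a stream format, and the function names are hypothetical.

```python
import json
import struct
from typing import List, Tuple


def multiplex(metadata: dict, payloads: List[bytes]) -> bytes:
    """Pack metadata and encoded segments into one length-prefixed stream."""
    header = json.dumps(metadata).encode("utf-8")
    stream = struct.pack(">I", len(header)) + header
    for payload in payloads:
        stream += struct.pack(">I", len(payload)) + payload
    return stream


def demultiplex(stream: bytes) -> Tuple[dict, List[bytes]]:
    """Split the stream back into metadata and encoded segments."""
    (header_len,) = struct.unpack_from(">I", stream, 0)
    pos = 4
    metadata = json.loads(stream[pos:pos + header_len].decode("utf-8"))
    pos += header_len
    payloads = []
    while pos < len(stream):
        (payload_len,) = struct.unpack_from(">I", stream, pos)
        pos += 4
        payloads.append(stream[pos:pos + payload_len])
        pos += payload_len
    return metadata, payloads


# Only the non-silent dialogue segments are placed in the stream.
metadata = {"sound_fields": 2, "dialogue_start_s": [0, 60]}
encoded = [b"dialogue-01", b"dialogue-02"]

stream = multiplex(metadata, encoded)
restored_metadata, restored = demultiplex(stream)
print(restored_metadata == metadata, restored == encoded)  # True True
```

Because the silent sections never enter `encoded`, the stream length depends only on the metadata and the audio that is actually present.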

Thus, according to the present embodiment, the multiplexer 23 multiplexes the acoustic channel signals representing the acoustic signals of the acoustic space layers with metadata that includes the reproduction times of the acoustic space layers. This allows the acoustic signal reproduction device to support a multi-channel sound system having a plurality of acoustic space layers and to reproduce an acoustic space layer from the minimum necessary data by time-dividing that layer. Because only the minimum necessary acoustic signal data is transmitted and can be reproduced from the desired time, transmission cost can be kept down. Furthermore, by not transmitting acoustic channel signals for periods that carry no information, transmission and exchange of the desired broadcast program can be achieved at low transmission cost.

Although the present invention has been described based on the drawings and embodiments, it should be noted that those skilled in the art can readily make various variations and modifications based on the present disclosure. Such variations and modifications are therefore included in the scope of the present invention. For example, the functions included in each functional unit, each step, and so on can be rearranged so long as no logical contradiction arises, and a plurality of functional units or steps can be combined into one or divided.

Description of symbols
10 Acoustic signal reproduction device
11 Demultiplexer
12 Decoding unit
13 Reproduction channel conversion unit
14 Speaker
15 Reproduction time adjustment unit
20 Acoustic signal creation device
21 Mixer
22 Encoding unit
23 Multiplexer (multiplexing unit)

Claims

1. A reproduction device for a multi-channel acoustic signal including a plurality of acoustic space layers, wherein
metadata included in the multi-channel acoustic signal includes the number of acoustic space layers, the number of channels and the content of each acoustic space layer, the arrangement of each acoustic channel signal in each acoustic space layer, and a plurality of time-divided reproduction times for each acoustic space layer,
the acoustic signal reproduction device comprising:
a decoding unit that, based on the reproduction time of each acoustic space layer described in the metadata, decodes only the acoustic channel signals corresponding to the reproduction time among the acoustic space layers transmitted in a time-division manner; and
a reproduction time adjustment unit that reproduces the decoded acoustic channel signals in accordance with the reproduction time.

2. The acoustic signal reproduction device according to claim 1, wherein acoustic space layers having the same content are defined as a group based on the content of each acoustic space layer included in the metadata, and the decoding unit decodes only the acoustic channel signals included in the acoustic space layer selected from the group.

3. The acoustic signal reproduction device according to claim 2, wherein, in a section in which the acoustic channel signal of the selected acoustic space layer is silent, the decoding unit decodes the acoustic channel signal of another acoustic space layer included in the group.