JP2020092439A

JP2020092439A - Audio metadata provision device and audio data reproduction device capable of supporting dynamic format conversion, method executed by the same, and computer readable recording medium with dynamic format conversion recorded

Info

Publication number: JP2020092439A
Application number: JP2020018862A
Authority: JP
Inventors: ユ、ジェ、ヒョン; Jae Hyoun Yoo; イ、テ、ジン; Tae-Jin Lee; イ、ソク、ジン; Seok Jin Lee
Original assignee: Electronics and Telecommunications Research Institute ETRI
Current assignee: Electronics and Telecommunications Research Institute ETRI
Priority date: 2014-09-24
Filing date: 2020-02-06
Publication date: 2020-06-11
Anticipated expiration: 2035-09-16
Also published as: KR20220044457A; KR20190076934A; KR20160035963A; JP2021170798A; KR102380279B1; KR102533824B1; KR20230071107A; KR101993348B1; JP2016072973A; JP6663147B2; KR20210033963A; KR102231750B1; JP6912612B2; JP7166398B2

Abstract

To provide an audio metadata provision device and an audio data reproduction device, capable of supporting dynamic format conversion.SOLUTION: Dynamic format conversion information 310 is the information in which a plurality of format conversion systems K320, M330, L340 between a first format established by a writer of multichannel audio data for each reproduction section and a second format according to a reproduction environment of the multichannel audio data are established for each reproduction section of the multichannel audio data. An audio metadata provision device provides metadata including dynamic format conversion information. Multichannel audio data reproduction device identifies dynamic format conversion information with an audio metadata and converts the multichannel audio data of the first format into a second format with the identified dynamic format conversion information. The multichannel audio data reproduction device reproduces the converted multichannel audio data.SELECTED DRAWING: Figure 3

Description

本発明は多チャネルオーディオデータ再生方法に関し、より詳しくは、多チャネルオーディオデータの様々なフォーマット間の変換方法に関する。 The present invention relates to a method for reproducing multi-channel audio data, and more particularly to a method for converting multi-channel audio data between various formats.

３ＤＴＶ、３Ｄシネマ、ＵＨＤＴＶなど次世代コンテンツ再生環境に対する研究開発が持続しながら、オーディオも多チャネルラウドスピーカを用いる音響再生環境に素早い変化が行われている。 While the research and development on the next-generation content reproduction environment such as 3DTV, 3D cinema, and UHDTV continues, the audio reproduction environment using the multi-channel loudspeaker is rapidly changing.

映画館及びＨＤＴＶのための立体音響の５．１チャネルシステム以後に上りチャネルを含む様々なマルチャネルオーディオシステムが導入され、ＩＴＵ−Ｒ（ＩｎｔｅｒｎａｔｉｏｎａｌＴｅｌｅｃｏｍｍｕｎｉｃａｔｉｏｎＵｎｉｏｎＲａｄｉｏｃｏｍｍｕｎｉｃａｔｉｏｎｓＳｅｃｔｏｒ）では、最近ＲｅｃｏｍｍｅｎｄａｔｉｏｎＢＳ．２０５１を制定して１０．２チャネル、１３．１チャネル、２２．２チャネルなどをはじめとする総８個の多チャネルフォーマットを次世代オーディオシステム（ａｄｖａｎｃｅｄｓｏｕｎｄｓｙｓｔｅｍ）として定義した。したがって、これからは様々なフォーマットにベースを置いたオーディオコンテンツが製造される可能性が極めて高まっている。 After the 5.1 channel system of stereophonic sound for movie theaters and HDTV, various mal-channel audio systems including upstream channels have been introduced, and recently, in ITU-R (International Telecommunications Union Radiocommunications Sector), Recommendation BS. 2051 was established, and a total of eight multi-channel formats including 10.2 channels, 13.1 channels, 22.2 channels, etc. were defined as a next-generation audio system (advanced sound system). Therefore, it is extremely likely that audio contents based on various formats will be manufactured.

このような環境では、１つのフォーマットに製造されたコンテンツが異なるフォーマットに再生する可能性も極めて高いため、コンテンツ間の適切な変換方法が求められている。従来には、コンテンツの多チャネルオーディオフォーマットから再生環境側の新しい多チャネルオーディオフォーマットにフォーマット変換することにおいて一括的な変換を行った。しかし、このような一括の変換方法は、コンテンツ著作者の著作意図を毀損し、意図とは異なる変換を行う恐れがあるという短所がある。 In such an environment, there is a very high possibility that contents produced in one format will be reproduced in different formats, and therefore an appropriate conversion method between contents is required. In the past, batch conversion was performed in the format conversion from the multi-channel audio format of contents to the new multi-channel audio format on the playback environment side. However, such a batch conversion method has a disadvantage in that the intention of the content creator is impaired and there is a risk of performing conversion different from the intention.

本発明の目的は、多チャネルオーディオデータの様々なフォーマット間に著作者の著作意図が完全に保持されるようにフォーマットを変換する動的フォーマット変換方法を提供するためのオーディオメタデータ提供装置、方法及び動的フォーマット変換方法によりフォーマットを変換して再生する装置、方法、並びに動的フォーマット変換方法が記録された記録媒体を提案する。 An object of the present invention is to provide an audio metadata providing apparatus and method for providing a dynamic format conversion method for converting a format so that the author's copyright intent is completely preserved between various formats of multi-channel audio data. An apparatus and method for converting and reproducing a format by the dynamic format conversion method, and a recording medium on which the dynamic format conversion method is recorded are proposed.

本発明の目的は、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境による第２フォーマットとの間の変換を行うことのできる動的フォーマット変換情報が含まれたオーディオメタデータを生成できるオーディオメタデータ提供装置及び方法を提供する。 The object of the present invention includes dynamic format conversion information capable of converting between the first format set by the author of multi-channel audio data and the second format according to the reproduction environment of multi-channel audio data. An audio metadata providing apparatus and method capable of generating audio metadata.

本発明の目的は、多チャネルオーディオデータ及び動的フォーマット変換情報が含まれたオーディオメタデータを識別して第１フォーマットから第２フォーマットに多チャネルオーディオデータを変換した後再生する多チャネルオーディオデータ再生装置及び方法を提供する。 An object of the present invention is to reproduce multi-channel audio data by identifying multi-channel audio data and audio metadata including dynamic format conversion information, converting the multi-channel audio data from the first format to the second format, and then reproducing the converted multi-channel audio data. An apparatus and method are provided.

本発明の目的は、多チャネルオーディオデータ及び動的フォーマット変換情報が含まれたオーディオメタデータが記録されたコンピュータで読み出し可能な記録媒体を提供する。 An object of the present invention is to provide a computer-readable recording medium on which audio metadata including multi-channel audio data and dynamic format conversion information is recorded.

本発明の一実施形態に係るオーディオメタデータ提供装置は、多チャネルオーディオデータで多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の動的フォーマット変換情報を識別する変換情報識別部と、前記識別された動的フォーマット変換情報を含むオーディオメタデータを生成するオーディオメタデータ生成部とを含む。 An audio metadata providing apparatus according to an exemplary embodiment of the present invention provides multi-channel audio data between a first format set by an author of multi-channel audio data and a second format based on a reproduction environment of multi-channel audio data. Of the dynamic format conversion information, and an audio metadata generation section for generating audio metadata including the identified dynamic format conversion information.

前記動的フォーマット変換情報は、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の複数のフォーマット変換方式が多チャネルオーディオデータの再生区間ごとに設定されたものである。 In the dynamic format conversion information, a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are the multi-channel audio data. It is set for each playback section.

前記複数のフォーマット変換方式が設定された再生区間は、互いに同一の再生長さを有するか、又は互いに異なる再生長さを有してもよい。 The reproduction sections in which the plurality of format conversion methods are set may have the same reproduction length or different reproduction lengths.

前記多チャネルオーディオデータの再生環境は、前記多チャネルオーディオデータが再生するスピーカのレイアウトに基づいて決定されてもよい。 The reproduction environment of the multi-channel audio data may be determined based on the layout of the speakers reproduced by the multi-channel audio data.

前記複数のフォーマット変換方式は、前記第１フォーマットから第２フォーマットに変換するためのマトリックスを含んでもよい。 The plurality of format conversion methods may include a matrix for converting the first format to the second format.

前記動的フォーマット変換情報は、多チャネルオーディオデータの再生区間ごとに相異に設定されるか、又は部分的に繰り返されるように設定されてもよい。 The dynamic format conversion information may be set differently for each reproduction section of multi-channel audio data, or may be set to be partially repeated.

本発明の一実施形態に係る多チャネルオーディオデータ再生装置は、第１フォーマットにより製作された多チャネルオーディオデータ、及びオーディオメタデータから多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の動的フォーマット変換情報を識別するデータ識別部と、前記動的フォーマット変換情報を用いて前記第１フォーマットの多チャネルオーディオデータを第２フォーマットに変換するオーディオデータ変換部と、前記第２フォーマットに変換された多チャネルオーディオデータを再生するオーディオデータ再生部とを含む。 A multi-channel audio data reproducing apparatus according to an exemplary embodiment of the present invention includes multi-channel audio data produced according to a first format, and first format and multi-channel audio set by an author of multi-channel audio data from audio metadata. A data identification unit for identifying dynamic format conversion information between the multi-channel audio data of the first format and the second format based on the reproduction environment of the data, and the multi-channel audio data of the first format to the second format by using the dynamic format conversion information. An audio data conversion unit for converting and an audio data reproduction unit for reproducing the multi-channel audio data converted into the second format are included.

前記多チャネルオーディオデータ変換部の再生区間は、互いに同一の再生長さを有するか、又は互いに異なる再生長さを有してもよい。 The reproduction sections of the multi-channel audio data conversion unit may have the same reproduction length or different reproduction lengths.

前記多チャネルオーディオデータ変換部のフォーマット変換方式は、多チャネルオーディオデータの再生区間ごとに相異に変換するか、又は部分的に繰り返されるように変換してもよい。 The format conversion method of the multi-channel audio data conversion unit may be differently converted for each reproduction section of the multi-channel audio data, or may be partially repeated.

本発明の一実施形態に係るオーディオメタデータ提供方法は、多チャネルオーディオデータで多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の動的フォーマット変換情報を識別するステップと、前記識別された動的フォーマット変換情報を含むオーディオメタデータを生成するステップとを含む。 According to an embodiment of the present invention, there is provided an audio metadata providing method between a first format set by an author of multi-channel audio data and a second format based on a reproduction environment of multi-channel audio data. Identifying the dynamic format conversion information, and generating audio metadata including the identified dynamic format conversion information.

本発明の一実施形態に係る多チャネルオーディオデータ再生方法は、第１フォーマットにより製作された多チャネルオーディオデータ、及びオーディオメタデータから多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の動的フォーマット変換情報を識別するステップと、前記動的フォーマット変換情報を用いて前記第１フォーマットの多チャネルオーディオデータを第２フォーマットに変換するステップと、前記第２フォーマットに変換された多チャネルオーディオデータを再生するステップとを含む。 A multi-channel audio data reproducing method according to an exemplary embodiment of the present invention is directed to a multi-channel audio data produced according to a first format and a first format and a multi-channel audio set by an author of multi-channel audio data from audio metadata. Identifying dynamic format conversion information to and from a second format based on a data reproduction environment; and converting the first format multi-channel audio data to a second format using the dynamic format conversion information. And reproducing the multi-channel audio data converted into the second format.

本発明の一実施形態に係るコンピュータで読み出し可能な記録媒体は、１つ以上のチャネルから構成された多チャネルオーディオデータと、多チャネルオーディオデータで多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の動的フォーマット変換情報が含まれたオーディオメタデータとが記録される。 A computer-readable recording medium according to an exemplary embodiment of the present invention includes multi-channel audio data composed of one or more channels, and a first format of multi-channel audio data set by an author of the multi-channel audio data. And audio metadata including dynamic format conversion information between the second format and the second format based on the reproduction environment of the multi-channel audio data are recorded.

本発明の一実施形態によると、多チャネルオーディオデータの様々なフォーマット間に著作者の著作意図が完全に保持されるよう、フォーマットを変換する動的フォーマット変換方法を提供するためのオーディオメタデータ提供装置、方法、及び動的フォーマット変換方法によりフォーマットを変換して再生する装置、方法、並びに動的フォーマット変換方法が記録された記録媒体を提供することができる。 According to one embodiment of the present invention, an audio metadata providing method for converting a format so that the author's copyright intent is completely preserved between various formats of multi-channel audio data. It is possible to provide an apparatus, a method, and an apparatus, method, and a recording medium on which the dynamic format conversion method is recorded, by which the format is converted and reproduced.

本発明の一実施形態によると、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の変換を行うことのできる動的フォーマット変換情報が含まれたオーディオメタデータを生成できるオーディオメタデータ提供装置及び方法を提供することができる。 According to an embodiment of the present invention, a dynamic format conversion capable of converting between a first format set by an author of multi-channel audio data and a second format based on a reproduction environment of multi-channel audio data. An audio metadata providing apparatus and method capable of generating audio metadata containing information can be provided.

本発明の一実施形態によると、多チャネルオーディオデータ及び動的フォーマット変換情報が含まれたオーディオメタデータを識別し、第１フォーマットから第２フォーマットに多チャネルオーディオデータを変換した後、再生する多チャネルオーディオデータ再生装置及び方法を提供することができる。 According to an embodiment of the present invention, audio metadata including multi-channel audio data and dynamic format conversion information is identified, and multi-channel audio data is converted from a first format to a second format and then reproduced. A channel audio data reproducing apparatus and method can be provided.

本発明の一実施形態によると、多チャネルオーディオデータ及び動的フォーマット変換情報が含まれたオーディオメタデータが記録されたコンピュータで読み出し可能な記録媒体を提供することができる。 According to an embodiment of the present invention, it is possible to provide a computer-readable recording medium on which audio metadata including multi-channel audio data and dynamic format conversion information is recorded.

本発明の一実施形態に係るオーディオメタデータ提供装置とオーディオメタデータ及び多チャネルオーディオデータ再生装置を示す図である。FIG. 1 is a diagram showing an audio metadata providing apparatus, an audio metadata and multi-channel audio data reproducing apparatus according to an embodiment of the present invention. 本発明の一実施形態に係る多チャネルオーディオデータのフォーマットを一括的に変換する一例を示す図である。It is a figure which shows an example which converts the format of the multi-channel audio data which concerns on one Embodiment of this invention collectively. 本発明の一実施形態に係る動的フォーマット変換情報に多チャネルオーディオデータのフォーマットを変換する一例を示す図である。It is a figure which shows an example which converts the format of multi-channel audio data into the dynamic format conversion information which concerns on one Embodiment of this invention. 本発明の一実施形態に係る１つ以上の動的フォーマット変換情報を含むオーディオメタデータを示す図である。FIG. 5 is a diagram illustrating audio metadata including one or more pieces of dynamic format conversion information according to an embodiment of the present invention. 本発明の一実施形態に係るマトリックス方式を用いてフォーマット間の変換を行う実施形態を説明するための図である。It is a figure for explaining the embodiment which performs conversion between formats using the matrix method concerning one embodiment of the present invention. 本発明の一実施形態に係るオーディオメタデータ提供装置が動的フォーマット変換情報の含まれたオーディオメタデータを提供する動作を示したフローチャートである。6 is a flowchart illustrating an operation of providing audio metadata including dynamic format conversion information by an audio metadata providing apparatus according to an exemplary embodiment of the present invention. 本発明の一実施形態に係る多チャネルオーディオデータ再生装置が多チャネルオーディオデータのフォーマットを変換した後、これを再生する動作を示したフローチャートである。6 is a flowchart showing an operation of the multi-channel audio data reproducing device according to an embodiment of the present invention, which converts the format of multi-channel audio data and then reproduces the same.

以下、本発明の実施形態について添付の図面を参照しながら詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

図１は、本発明の一実施形態に係るオーディオメタデータ提供装置１１０とオーディオメタデータ１４０及び多チャネルオーディオデータ再生装置１６０を示す図である。 FIG. 1 is a diagram showing an audio metadata providing apparatus 110, an audio metadata 140, and a multi-channel audio data reproducing apparatus 160 according to an embodiment of the present invention.

図１を参考すると、オーディオメタデータ提供装置１１０は、動的フォーマット変換情報を識別する変換情報識別部１２０及び識別された動的フォーマット変換情報を含むオーディオメタデータ１４０を生成するオーディオメタデータ生成部１３０を含む。動的フォーマット変換情報は、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の複数のフォーマット変換方式が多チャネルオーディオデータの再生区間ごとに設定されたものである。 Referring to FIG. 1, the audio metadata providing apparatus 110 includes a conversion information identifying unit 120 for identifying dynamic format conversion information and an audio metadata generating unit for generating audio metadata 140 including the identified dynamic format conversion information. Including 130. As the dynamic format conversion information, a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are used to reproduce the multi-channel audio data. It is set for each section.

本発明の一実施形態によると、変換情報識別部１２０は、多チャネルオーディオデータの著作者から動的フォーマット変換情報を識別する。更なる実施形態によると、変換情報識別部１２０は、１つ以上のオーディオメタデータから複数の動的フォーマット変換情報を識別する。 According to one embodiment of the present invention, the conversion information identifying unit 120 identifies the dynamic format conversion information from the author of the multi-channel audio data. According to a further embodiment, the conversion information identification unit 120 identifies a plurality of dynamic format conversion information from one or more audio metadata.

本発明の一実施形態によると、変換情報識別部１２０で識別された動的フォーマット変換情報に基づいてオーディオメタデータを生成するオーディオメタデータ生成部１３０が提供される。オーディオメタデータ生成部１３０は、オーディオメタデータに識別された複数の動的フォーマット変換情報を含む。本発明の一実施形態によると、オーディオメタデータ生成部１３０は、動的フォーマット変換情報の各フォーマット変換方式をマトリックスの形態に含むことができる。更なる実施形態によると、オーディオメタデータ生成部１３０は、メタデータに識別された動的フォーマット変換情報と共に、メタデータに一般的に含まれる情報（例えば、著作者、レコード名、発売年度など）を含んでもよい。 According to an embodiment of the present invention, an audio metadata generation unit 130 is provided that generates audio metadata based on the dynamic format conversion information identified by the conversion information identification unit 120. The audio metadata generation unit 130 includes a plurality of pieces of dynamic format conversion information identified by the audio metadata. According to an embodiment of the present invention, the audio metadata generator 130 may include each format conversion method of the dynamic format conversion information in the form of a matrix. According to further embodiments, the audio metadata generator 130 may include information typically included in the metadata along with the dynamic format conversion information identified in the metadata (eg, author, record name, release year, etc.). May be included.

本発明の一実施形態によると、オーディオメタデータ提供装置１１０は、多チャネルオーディオデータ提供装置の一部の構成として含まれている。 According to an embodiment of the present invention, the audio metadata providing device 110 is included as a part of the multi-channel audio data providing device.

オーディオメタデータ提供装置１１０から動的フォーマット変換情報１５０を含むオーディオメタデータ１４０が提供される。本発明の一実施形態によると、オーディオメタデータ１４０は、動的フォーマット変換情報１５０だけではなく、メタデータに一般的に含まれる情報を含んでもよい。本発明の他の一実施形態によると、オーディオメタデータは、多チャネルオーディオデータと共に提供され得る。本発明の更なる一実施形態によると、オーディオメタデータ１４０は、リアルタイムで多チャネルオーディオデータ再生装置１６０に送信されたり、又は、多チャネルオーディオデータ再生装置１６０に予め送信されて多チャネルオーディオデータ再生装置１６０のバッファ、メモリのような格納媒体に格納され得る。又は、オーディオメタデータ１４０は、ＣＤ−ＲＯＭ、ＣＤ−ＲＷ、ＤＶＤ−Ｒ、ＤＶＤ−ＲＷなどのような光記録媒体に格納されて配布され得る。 The audio metadata 140 including the dynamic format conversion information 150 is provided from the audio metadata providing device 110. According to an embodiment of the present invention, the audio metadata 140 may include not only the dynamic format conversion information 150 but also information generally included in the metadata. According to another embodiment of the invention, audio metadata may be provided with multi-channel audio data. According to a further embodiment of the present invention, the audio metadata 140 may be transmitted to the multi-channel audio data reproducing device 160 in real time, or may be pre-transmitted to the multi-channel audio data reproducing device 160 to reproduce the multi-channel audio data. It may be stored in a storage medium such as a buffer or a memory of the device 160. Alternatively, the audio metadata 140 may be stored and distributed in an optical recording medium such as a CD-ROM, a CD-RW, a DVD-R, a DVD-RW, or the like.

多チャネルオーディオデータを動的フォーマット変換情報によってフォーマット間に変換した後、これを再生できる多チャネルオーディオデータ再生装置１６０が提供される。
多チャネルオーディオデータ再生装置１６０は、動的フォーマット変換情報を識別するデータ識別部１７０、識別された動的フォーマット変換情報でフォーマット間の変換を行うオーディオデータ変換部１８０、及び変換された多チャネルオーディオデータを再生するオーディオデータ再生部１９０を含む。 There is provided a multi-channel audio data reproducing device 160 capable of reproducing multi-channel audio data after converting the multi-channel audio data between formats according to dynamic format conversion information.
The multi-channel audio data reproducing device 160 includes a data identifying unit 170 for identifying the dynamic format conversion information, an audio data converting unit 180 for converting between the formats by the identified dynamic format conversion information, and the converted multi-channel audio. The audio data reproducing unit 190 for reproducing data is included.

本発明の一実施形態によると、データ識別部１７０は、オーディオメタデータ１４０で多チャネルオーディオデータの再生環境に基づいた第２フォーマットに該当する動的フォーマット変換情報を識別する。多チャネルオーディオデータの再生環境は、多チャネルオーディオデータが再生するスピーカのレイアウトに基づいて決定される。本発明の一実施形態によると、データ識別部１７０は、オーディオメタデータに記録された１つ以上の動的フォーマット変換情報のうち、第２フォーマットに対応する動的フォーマット変換情報を選択して識別することができる。 According to an exemplary embodiment of the present invention, the data identifying unit 170 identifies the dynamic format conversion information corresponding to the second format based on the reproduction environment of the multi-channel audio data in the audio metadata 140. The reproduction environment of the multi-channel audio data is determined based on the layout of the speaker that reproduces the multi-channel audio data. According to an embodiment of the present invention, the data identifying unit 170 selects and identifies dynamic format conversion information corresponding to the second format from among the one or more dynamic format conversion information recorded in the audio metadata. can do.

本発明の一実施形態によると、オーディオデータ変換部１８０は、識別した動的フォーマット変換情報によって多チャネルオーディオデータを多チャネルオーディオ著作者が設定した第１フォーマットから多チャネルオーディオデータの再生環境に基づいた第２フォーマットに変換する。動的フォーマット変換情報は、第１フォーマットと第２フォーマットとの間の複数のフォーマット変換方式が多チャネルオーディオデータの再生区間ごとに設定されたものである。 According to an embodiment of the present invention, the audio data conversion unit 180 determines the multi-channel audio data from the first format set by the multi-channel audio author according to the identified dynamic format conversion information based on the reproduction environment of the multi-channel audio data. Converted to the second format. The dynamic format conversion information is information in which a plurality of format conversion methods between the first format and the second format are set for each reproduction section of the multi-channel audio data.

本発明の一実施形態によると、オーディオデータ変換部１８０は、再生時間により動的フォーマット変換情報から再生時間を含む再生区間を識別し、動的フォーマット変換情報で当該再生区間に設定されたフォーマット変換方式を識別して第１フォーマットと第２フォーマットとの間のフォーマット変換を行う。本発明の一実施形態によると、複数のフォーマット変換方式が設定された再生区間は、互いに同一の再生長さを有するか、互いに異なる再生長さを有し得る。本発明の一実施形態によると、オーディオデータ変換部１８０は、動的フォーマット変換情報によって再生区間ごとに互いに異なるフォーマット変換方式を用いて変換するか、又は部分的にフォーマット変換方式を繰り返されるように用いて変換することができる。 According to an embodiment of the present invention, the audio data conversion unit 180 identifies a reproduction section including the reproduction time from the dynamic format conversion information according to the reproduction time, and performs the format conversion set in the reproduction section with the dynamic format conversion information. The system is identified and the format conversion between the first format and the second format is performed. According to an exemplary embodiment of the present invention, the reproduction sections in which the plurality of format conversion methods are set may have the same reproduction length or different reproduction lengths. According to an exemplary embodiment of the present invention, the audio data conversion unit 180 may perform conversion using different format conversion methods for each playback section according to the dynamic format conversion information, or may partially repeat the format conversion method. Can be converted using

本発明の一実施形態によると、第２フォーマットに変換された多チャネルオーディオデータを再生するオーディオデータ再生部１９０が提供される。第２フォーマットは多チャネルオーディオデータの再生環境に基づいて、多チャネルオーディオデータの再生環境は多チャネルオーディオデータが再生するスピーカのレイアウトに基づいて決定される。オーディオデータ再生部１９０は、１つ以上のスピーカの出力部から構成される。オーディオデータ再生部１９０は、第２フォーマットに変換された多チャネルオーディオデータに対して各チャネルに対応するスピーカからオーディオデータを出力する。 According to an embodiment of the present invention, an audio data reproducing unit 190 for reproducing multi-channel audio data converted into the second format is provided. The second format is determined based on the reproduction environment of the multi-channel audio data, and the reproduction environment of the multi-channel audio data is determined based on the layout of the speaker that reproduces the multi-channel audio data. The audio data reproducing unit 190 is composed of output units of one or more speakers. The audio data reproducing unit 190 outputs audio data from the speaker corresponding to each channel with respect to the multi-channel audio data converted into the second format.

本発明の一実施形態によると、オーディオデータ再生部１９０は、出力部に接続されたスピーカの個数を把握して多チャネルオーディオデータの再生環境を識別する。さらに、オーディオデータ再生部１９０は、スピーカの個数だけではなく、各スピーカの位置を識別したり、ユーザから再生環境に関する情報が入力されることで再生環境を識別することができる。 According to an embodiment of the present invention, the audio data reproducing unit 190 identifies the reproduction environment of multi-channel audio data by grasping the number of speakers connected to the output unit. Further, the audio data reproducing unit 190 can identify not only the number of speakers but also the position of each speaker, or the reproduction environment by inputting information regarding the reproduction environment from the user.

図２は、本発明の一実施形態に係る多チャネルオーディオデータのフォーマットを一括的に変換する一例を示す図である。 FIG. 2 is a diagram showing an example of collectively converting the format of multi-channel audio data according to an embodiment of the present invention.

多チャネルオーディオデータは、多チャネルオーディオデータの著作者が設定した多チャネルオーディオデータフォーマットの第１フォーマットに合わせて製作される。多チャネルオーディオデータを再生する側の多チャネルオーディオデータフォーマットの第２フォーマットは、多チャネルオーディオデータの再生環境に基づく。多チャネルオーディオデータの再生環境は、多チャネルオーディオデータが再生するスピーカのレイアウトに基づいて決定されるため、第２フォーマットは、多チャネルオーディオデータの第１フォーマットと異なり得る。本発明の一実施形態によると、多チャネルオーディオデータの再生環境に基づく第２フォーマットが第１フォーマットと異なる場合、多チャネルオーディオデータ再生装置のオーディオデータ変換部は、一括的なフォーマット変換方式２００により変換を行うことができる。 The multi-channel audio data is produced according to the first multi-channel audio data format set by the author of the multi-channel audio data. The second format of the multi-channel audio data format on the side of reproducing the multi-channel audio data is based on the reproduction environment of the multi-channel audio data. Since the reproduction environment of the multi-channel audio data is determined based on the layout of the speaker in which the multi-channel audio data is reproduced, the second format may be different from the first format of the multi-channel audio data. According to an embodiment of the present invention, when the second format based on the reproduction environment of the multi-channel audio data is different from the first format, the audio data conversion unit of the multi-channel audio data reproduction device uses the collective format conversion method 200. The conversion can be done.

図２を参照すると、第１フォーマットは、１０．２チャネルフォーマットであると仮定する。一括的なフォーマット変換方式２００によると、第２フォーマットが５．１チャネルフォーマットである場合、聴者の左側の前面スピーカＬは第１フォーマットの左側の前面スピーカＬと左側上段スピーカＬＨの線形結合（ｌｉｎｅａｒｃｏｍｂｉｎａｔｉｏｎ）として決定される。他の例として、第２フォーマットが７．１チャネルフォーマットである場合、右側の後面スピーカＲＢは、第１フォーマットの右側の後面スピーカＲＢと中央スピーカＣＨの線形結合に決定される。 Referring to FIG. 2, the first format is assumed to be 10.2 channel format. According to the collective format conversion method 200, when the second format is the 5.1 channel format, the left front speaker L of the listener is a linear combination of the left front speaker L and the left upper speaker LH of the first format. combination)). As another example, when the second format is the 7.1 channel format, the right rear speaker RB is determined to be a linear combination of the right rear speaker RB and the center speaker CH of the first format.

一括的なフォーマット変換方式２００によると、フォーマット変換方式はチャネル間の線形結合であることから、非線形変換はできない。また、再生区間ごとにフォーマット変換方式は変化されない。本発明の一実施形態によると、多チャネルオーディオデータの再生区間ごとに１つ以上のフォーマット変換方式が設定された動的フォーマット変換情報が提供される。また、第１フォーマットと第２フォーマットとの間の非線形変換をサポートするフォーマット変換方式が提供される。 According to the batch format conversion method 200, since the format conversion method is a linear combination between channels, non-linear conversion cannot be performed. Further, the format conversion method does not change for each reproduction section. According to an embodiment of the present invention, dynamic format conversion information in which one or more format conversion methods are set for each reproduction section of multi-channel audio data is provided. Also provided is a format conversion scheme that supports non-linear conversion between the first format and the second format.

図３は、本発明の一実施形態に係る多チャネルオーディオデータのフォーマット変換を行うことのできる動的フォーマット変換情報を示す図である。 FIG. 3 is a diagram showing dynamic format conversion information capable of performing format conversion of multi-channel audio data according to an embodiment of the present invention.

図３を参考すると、動的フォーマット変換情報３１０は、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の複数のフォーマット変換方式（例えば、フォーマット変換方式Ｋ３２０、Ｌ３４０、Ｍ３３０）が多チャネルオーディオデータの再生区間ごとに設定されたものである。 Referring to FIG. 3, the dynamic format conversion information 310 includes a plurality of format conversion methods between a first format set by an author of multi-channel audio data and a second format based on a reproduction environment of multi-channel audio data. (For example, format conversion methods K320, L340, M330) are set for each reproduction section of multi-channel audio data.

本発明の一実施形態によると、それぞれのフォーマット変換方式は、同一の第２フォーマットにフォーマットを変換する。ただし、変換する方式は互いに異なってもよい。図３を参照して説明すると、フォーマット変換方式Ｋ３２０は、第１フォーマットの複数の左側スピーカＬｅｆｔ_１とＬｅｆｔ_２の線形結合で第２フォーマットの左側スピーカＬｅｆｔの出力データを決定する。フォーマット変換方式Ｍ３３０は、第１フォーマットの複数の左側スピーカのうちの１つであるＬｅｆｔ_１だけで第２フォーマットの左側スピーカＬｅｆｔの出力データを決定する。本発明の一実施形態によると、それぞれの変換方式は非線形変換を含むことができる。 According to one embodiment of the present invention, each format conversion scheme converts the format to the same second format. However, the conversion methods may be different from each other. Referring to FIG. 3, the format conversion method K320 determines the output data of the left speaker Left in the second format by linearly combining the left speakers Left ₁ and Left ₂ in the first format. The format conversion method M330 determines the output data of the left speaker Left of the second format only by Left ₁ which is one of the left speakers of the first format. According to an embodiment of the present invention, each conversion scheme may include a non-linear conversion.

本発明の一実施形態によると、多チャネルオーディオデータ再生装置は、動的フォーマット変換情報から再生区間ごとに設定されたフォーマット変換方式を識別して変換することができる。図３を参照すると、多チャネルオーディオデータ再生装置は、再生区間ｔ＝０からｔ＝ｔ_１まで多チャネルオーディオデータをフォーマット変換方式Ｋ３２０によって変換する。多チャネルオーディオデータ再生装置は、以後の再生区間ｔ＝ｔ_１からｔ＝ｔ_２まで多チャネルオーディオデータをフォーマット変換方式Ｍ３３０によって変換する。同様に、多チャネルオーディオデータ再生装置は、再生区間ｔ＝ｔ_３からｔ＝ｔ_４まではフォーマット変換方式Ｌ３４０によって変換を行い、以後の再生区間でも同じ作業を繰り返す。 According to an embodiment of the present invention, the multi-channel audio data reproducing device can identify and convert the format conversion method set for each reproduction section from the dynamic format conversion information. Referring to FIG. 3, the multi-channel audio data reproducing device converts the multi-channel audio data from the reproduction section t=0 to t=t ₁ by the format conversion method K320. The multi-channel audio data reproducing device converts the multi-channel audio data from the subsequent reproduction section t=t ₁ to t=t ₂ by the format conversion method M330. Similarly, the multi-channel audio data reproducing device performs conversion by the format conversion method L340 from the reproduction section t=t ₃ to t=t _4, and repeats the same operation in the subsequent reproduction sections.

本発明の一実施形態によると、動的フォーマット変換情報３１０は、多チャネルオーディオデータの再生区間ごとにフォーマット変換方式を相異に設定するか、又は部分的に繰り返されるように設定してもよい。図３を参照すると、フォーマット変換方式Ｋ３２０は、再生区間ｔ＝０からｔ＝ｔ_１だけではなく、再生区間ｔ＝ｔ_２からｔ＝ｔ_３でも再び設定され得る。本発明の一実施形態によると、フォーマット変換方式は、一括的なフォーマット変換方式や線形結合による変換だけではなく、非線形変換も含むことができる。 According to an exemplary embodiment of the present invention, the dynamic format conversion information 310 may be set such that the format conversion method is set differently for each reproduction section of the multi-channel audio data or set to be partially repeated. .. Referring to FIG. 3, the format conversion method K320 may be set not only in the reproduction sections t=0 to t=t ₁ but also in the reproduction sections t=t ₂ to t=t ₃ . According to an embodiment of the present invention, the format conversion method may include a non-linear conversion as well as a batch format conversion method or a linear combination conversion.

本発明の一実施形態によると、フォーマット変換方式が設定されたそれぞれの再生区間は互いに同一の再生長さを有するか、互いに異なる再生長さを有し得る。図３を参照すると、再生区間ｔ＝ｔ_１からｔ＝ｔ_２と再生区間ｔ＝ｔ_７からｔ＝ｔ_８は互いに同じ再生長さを有し得る。 According to an exemplary embodiment of the present invention, the respective reproduction sections for which the format conversion method is set may have the same reproduction length or different reproduction lengths. Referring to FIG. 3, the reproduction sections t=t ₁ to t=t ₂ and the reproduction sections t=t ₇ to t=t ₈ may have the same reproduction length.

図４は、本発明の一実施形態に係る１つ以上の動的フォーマット変換情報を含むオーディオメタデータを示す図である。 FIG. 4 is a diagram illustrating audio metadata including one or more pieces of dynamic format conversion information according to an exemplary embodiment of the present invention.

図４を参考すると、多チャネルオーディオデータの再生環境が様々であるため、オーディオメタデータ１４０は、１つ以上の動的フォーマット変換情報４２０、４３０を含む。
多チャネルオーディオデータ再生装置１６０は、多チャネルオーディオデータの再生環境に基づいた第２フォーマットに該当する動的フォーマット変換情報を選択し、多チャネルオーディオデータのフォーマットを変換する。再生環境は、多チャネルオーディオデータが再生するスピーカのレイアウトに基づいて決定される。 Referring to FIG. 4, the audio metadata 140 includes one or more pieces of dynamic format conversion information 420 and 430 due to various playback environments of multi-channel audio data.
The multi-channel audio data reproducing device 160 selects the dynamic format conversion information corresponding to the second format based on the reproduction environment of the multi-channel audio data, and converts the format of the multi-channel audio data. The reproduction environment is determined based on the layout of the speaker that reproduces the multi-channel audio data.

図４を参照すると、多チャネルオーディオデータの著作者が設定した第１フォーマットが２２．２チャネルフォーマットであり、多チャネルオーディオデータの再生環境に基づいた第２フォーマットが１０．２チャネルフォーマットであると仮定する。多チャネルオーディオデータ再生装置１６０のデータ識別部１７０は、オーディオメタデータの複数の動的フォーマット変換情報４２０、４３０のうち第２フォーマットに対応する動的フォーマット変換情報（１）４２０を識別する。同様に、多チャネルオーディオデータの再生環境に基づいた第２フォーマットが５．１チャネルフォーマットであれば、多チャネルオーディオデータ再生装置のデータ識別部１７０は動的フォーマット変換情報（２）４３０を識別する。 Referring to FIG. 4, the first format set by the author of multi-channel audio data is 22.2 channel format, and the second format based on the reproduction environment of multi-channel audio data is 10.2 channel format. I assume. The data identifying unit 170 of the multi-channel audio data reproducing device 160 identifies the dynamic format conversion information (1) 420 corresponding to the second format among the plurality of dynamic format conversion information 420 and 430 of the audio metadata. Similarly, if the second format based on the reproduction environment of multi-channel audio data is the 5.1-channel format, the data identification unit 170 of the multi-channel audio data reproduction device identifies the dynamic format conversion information (2) 430. ..

先に仮定した１０．２チャネルフォーマットで、オーディオデータ変換部１８０は、識別された動的フォーマット変換情報（１）４２０によって多チャネルオーディオデータのフォーマットを変換する。すなわち、オーディオデータ変換部１８０は、再生区間ごとに設定された複数のフォーマット変換方式４４０に基づいて多チャネルオーディオデータを再生区間ｔ＝０からｔ＝ｔ_１まではフォーマット変換方式Ｋ４５０によって変換し、再生区間ｔ＝ｔ_１からｔ＝ｔ２まではフォーマット変換方式Ｍ４６０によって変換する。本発明の一実施形態によると、動的フォーマット変換情報は、多チャネルオーディオデータの再生区間ごとに相異に設定されるか、又は部分的に繰り返されるように設定され得る。また、フォーマット変換方式が設定されたそれぞれの再生区間の再生長さも互いに異なるか、同じであってもよい。図４を参考すると、フォーマット変換方式Ｋ４５０は、再生区間ｔ＝_０からｔ＝ｔ１で用いられるが、その後の再生区間でも繰り返し用いてもよい。また、再生区間ｔ＝０からｔ＝ｔ_１と再生区間ｔ＝ｔ_１からｔ＝ｔ_２の再生長さも互いに異なるか、同じであってもよい。 The audio data conversion unit 180 converts the format of multi-channel audio data according to the identified dynamic format conversion information (1) 420 using the 10.2 channel format assumed above. That is, the audio data conversion unit 180 converts the multi-channel audio data from the reproduction section t=0 to t=t ₁ by the format conversion method K450 based on the plurality of format conversion methods 440 set for each reproduction section, The reproduction section t=t ₁ to t=t 2 is converted by the format conversion method M460. According to an exemplary embodiment of the present invention, the dynamic format conversion information may be set differently or partially repeated for each reproduction section of the multi-channel audio data. Also, the reproduction lengths of the respective reproduction sections for which the format conversion method is set may be different from each other or may be the same. Referring to FIG. 4, the format conversion method K450 is used in the reproduction section t= ₀ to t=t1, but may be repeatedly used in the subsequent reproduction section. Further, the reproduction lengths of the reproduction sections t=0 to t=t ₁ and the reproduction sections t=t ₁ to t=t ₂ may be different from each other or may be the same.

図５は、本発明の一実施形態に係るマトリックス方式を用いてフォーマット間の変換を行う実施形態を説明するための図である。 FIG. 5 is a diagram illustrating an embodiment in which conversion between formats is performed using a matrix method according to an embodiment of the present invention.

図５を参考すると、動的フォーマット変換情報でそれぞれのフォーマット変換方式は、変換マトリックス５３０、５４０に格納され得る。変換マトリックスは、多チャネルオーディオデータの著作者が設定した第１フォーマットから多チャネルオーディオデータの再生環境に基づいた第２フォーマットに変換するためのマトリックスである。オーディオデータ変換部は、第１フォーマットチャネルマトリックスを変換マトリックスに適用して第２フォーマットチャネルマトリックスを出力することで、第１フォーマットから第２フォーマットに変換することができる。 Referring to FIG. 5, each format conversion method in the dynamic format conversion information may be stored in the conversion matrices 530 and 540. The conversion matrix is a matrix for converting the first format set by the author of multi-channel audio data to the second format based on the reproduction environment of multi-channel audio data. The audio data conversion unit can convert the first format to the second format by applying the first format channel matrix to the conversion matrix and outputting the second format channel matrix.

図５を参照すると、多チャネルオーディオデータの著作者は、１０．２チャネルフォーマット（第１フォーマット）に多チャネルオーディオデータを製作５１０したと仮定し、多チャネルオーディオデータの再生環境は、５．１チャネルフォーマット（第２フォーマット）であると仮定する。この場合、フォーマット変換５５０を参考すると、オーディオデータ変換部は、第１フォーマットチャネルマトリックス５８０（チャネルマトリックスの各元素は各チャネルに対応する）を変換マトリックス５７０に適用して第２フォーマットチャネルマトリックス５６０を出力する方式によりフォーマットを変換する。したがって、図５に示す場合、第１フォーマットの１０．２チャネルフォーマットは１２個のチャネルを有し、第２フォーマットの５．１チャネルフォーマットは６個のチャネルを有するため、フォーマット変換方式に関する情報を含む変換マトリックス５３０、５４０は６行１２列に構成される。 Referring to FIG. 5, it is assumed that the author of the multi-channel audio data has produced the multi-channel audio data 510 in the 10.2 channel format (first format), and the reproduction environment of the multi-channel audio data is 5.1. It is assumed that the channel format is the second format. In this case, referring to the format conversion 550, the audio data conversion unit applies the first format channel matrix 580 (each element of the channel matrix corresponds to each channel) to the conversion matrix 570 to form the second format channel matrix 560. The format is converted according to the output method. Therefore, in the case of FIG. 5, the 10.2 channel format of the first format has 12 channels, and the 5.1 channel format of the second format has 6 channels. The conversion matrixes 530 and 540 including them are arranged in 6 rows and 12 columns.

また、オーディオデータ変換部は、再生区間ごとに設定されたフォーマット変換方式に合わせて変換マトリックス５７０を交換して変換することができる。例えば、図５に示す動的フォーマット変換情報５２０で、再生区間ｔ＝０からｔ＝ｔ_１まで変換方式Ｋが設定されているため、当該の再生区間でオーディオデータ変換部は、変換マトリックス５７０を変換方式Ｋに対する変換マトリックス５３０に設定して変換を行う。再生区間ｔ＝ｔ_１からｔ＝ｔ_２まで変換方式Ｍが設定されているため、当該の再生区間でオーディオデータ変換部は、変換マトリックス５７０を変換方式Ｍに対する変換マトリックス５４０に設定して変換を行う。 Further, the audio data conversion unit can exchange and convert the conversion matrix 570 according to the format conversion method set for each reproduction section. For example, in the dynamic format conversion information 520 shown in FIG. 5, since the conversion method K is set from the reproduction section t=0 to t=t ₁ , the audio data conversion section sets the conversion matrix 570 in the reproduction section. The conversion is performed by setting the conversion matrix 530 for the conversion method K. Since the conversion method M is set from the reproduction section t=t ₁ to t=t ₂ , the audio data conversion unit sets the conversion matrix 570 in the conversion matrix 540 for the conversion method M and performs conversion in the reproduction section. To do.

図６は、本発明の一実施形態に係るオーディオメタデータ提供装置が動的フォーマット変換情報の含まれたオーディオメタデータを提供する動作を示したフローチャートである。 FIG. 6 is a flowchart illustrating an operation of providing audio metadata including dynamic format conversion information by the audio metadata providing apparatus according to an exemplary embodiment of the present invention.

ステップＳ６１０において、オーディオメタデータ提供装置は、動的フォーマット変換情報を識別する。動的フォーマット変換情報は、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の複数のフォーマット変換方式が多チャネルオーディオデータの再生区間ごとに設定されたものである。本発明の一実施形態によると、オーディオメタデータ提供装置は、多チャネルオーディオデータの著作者から動的フォーマット変換情報を識別する。本発明の他の実施形態によると、多チャネルオーディオデータ再生装置は、１つ以上のオーディオメタデータから複数の動的フォーマット変換情報を識別する。 In step S610, the audio metadata providing apparatus identifies the dynamic format conversion information. As the dynamic format conversion information, a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are used to reproduce the multi-channel audio data. It is set for each section. According to an embodiment of the present invention, an audio metadata providing apparatus identifies dynamic format conversion information from an author of multi-channel audio data. According to another embodiment of the present invention, a multi-channel audio data reproducing device identifies a plurality of dynamic format conversion information from one or more audio metadata.

ステップＳ６２０において、オーディオメタデータ提供装置は、識別された動的フォーマット変換情報を含むオーディオメタデータを生成する。ここで、オーディオメタデータ提供装置は、オーディオメタデータに一般的に含まれる情報（例えば、著作者、レコード名、発売年度など）を含んでもよい。本発明の一実施形態によると、オーディオメタデータ提供装置は、複数の動的フォーマット変換情報をオーディオメタデータに含み得る。本発明の一実施形態によると、オーディオメタデータ提供装置は、動的フォーマット変換情報の各フォーマット変換方式をマトリックスの形態（例えば、図５に示す変換マトリックス５３０、５４０）にオーディオメタデータに記録することができる。 In operation S620, the audio metadata providing apparatus generates audio metadata including the identified dynamic format conversion information. Here, the audio metadata providing device may include information generally included in the audio metadata (for example, author, record name, release year, etc.). According to an embodiment of the present invention, the audio metadata providing apparatus may include a plurality of dynamic format conversion information in the audio metadata. According to an embodiment of the present invention, the audio metadata providing apparatus records each format conversion method of the dynamic format conversion information in the audio metadata in the form of a matrix (for example, the conversion matrices 530 and 540 shown in FIG. 5). be able to.

図７は、本発明の一実施形態に係る多チャネルオーディオデータ再生装置が多チャネルオーディオデータのフォーマットを変換した後、これを再生する動作を示したフローチャートである。 FIG. 7 is a flowchart showing an operation of the multi-channel audio data reproducing apparatus according to an embodiment of the present invention, which converts the format of multi-channel audio data and then reproduces the converted format.

ステップＳ７１０において、多チャネルオーディオデータ再生装置は、多チャネルオーディオデータ及びオーディオメタデータを受信する。本発明の一実施形態によると、オーディオメタデータは、多チャネルオーディオデータと共に提供されたり、別に提供されてもよい。本発明の一実施形態によると、オーディオメタデータは、リアルタイムで多チャネルオーディオデータ再生装置に受信されたり、又は、多チャネルオーディオデータ再生装置に予め送信されて多チャネルオーディオデータ再生装置のバッファ、メモリのような格納媒体に格納されてもよい。また、オーディオメタデータは、ＣＤ−ＲＯＭ、ＣＤ−ＲＷ、ＤＶＤ−Ｒ、ＤＶＤ−ＲＷなどのような光記録媒体に格納されて受信され得る。 In step S710, the multi-channel audio data reproducing device receives multi-channel audio data and audio metadata. According to an embodiment of the present invention, the audio metadata may be provided together with the multi-channel audio data or separately. According to an embodiment of the present invention, the audio metadata is received by the multi-channel audio data reproducing device in real time, or is transmitted to the multi-channel audio data reproducing device in advance, and the buffer and the memory of the multi-channel audio data reproducing device are transmitted. It may be stored in a storage medium such as. Also, the audio metadata can be stored and received in an optical recording medium such as a CD-ROM, a CD-RW, a DVD-R, a DVD-RW, or the like.

ステップＳ７２０において、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットが異なる場合、多チャネルオーディオデータ再生装置は、オーディオメタデータで動的フォーマット変換情報を識別することになる。本発明の一実施形態によると、オーディオメタデータは、１つ以上の動的フォーマット変換情報を含んでもよく、この場合、多チャネルオーディオデータ再生装置は、多チャネルオーディオデータ再生装置の第２フォーマットに対応する動的フォーマット変換情報を識別することができる。多チャネルオーディオデータの再生環境は、多チャネルオーディオデータが再生するスピーカのレイアウトに基づいて決定される。 In step S720, if the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are different, the multi-channel audio data reproduction device uses the dynamic format in the audio metadata. The conversion information will be identified. According to an embodiment of the present invention, the audio metadata may include one or more dynamic format conversion information, in which case the multi-channel audio data reproducing device is in the second format of the multi-channel audio data reproducing device. The corresponding dynamic format conversion information can be identified. The reproduction environment of the multi-channel audio data is determined based on the layout of the speaker that reproduces the multi-channel audio data.

オーディオメタデータで識別した動的フォーマット変換情報は、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとの間の複数のフォーマット変換方式が多チャネルオーディオデータの再生区間ごとに設定されたものである。動的フォーマット変換情報の複数のフォーマット変換方式が設定された再生区間は、互いに同一の再生長さを有するか、互いに異なる再生長さを有し得る。動的フォーマット変換情報の再生区間ごとに設定されたフォーマット変換方式は、相異に設定されるか、又は部分的に繰り返されるように設定され得る。 The dynamic format conversion information identified by the audio metadata includes a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data. It is set for each reproduction section of multi-channel audio data. The reproduction sections in which a plurality of format conversion methods of the dynamic format conversion information are set may have the same reproduction length or different reproduction lengths. The format conversion method set for each reproduction section of the dynamic format conversion information may be set differently or set to be partially repeated.

ステップＳ７３０において、多チャネルオーディオデータ再生装置は、識別した動的フォーマット変換情報によって多チャネルオーディオデータの著作者が設定した第１フォーマットで多チャネルオーディオデータの再生環境に基づいた第２フォーマットに変換を行う。本発明の一実施形態によると、変換する再生区間は、動的フォーマット変換情報によって同一の再生長さを有するか、互いに異なる再生長さを有し得る。本発明の一実施形態によると、フォーマット変換方式は、多チャネルオーディオデータの再生区間ごとに相異に変換したり、部分的に繰り返されるように変換してもよい。 In step S730, the multi-channel audio data reproduction device converts the identified dynamic format conversion information into the second format based on the reproduction environment of the multi-channel audio data in the first format set by the author of the multi-channel audio data. To do. According to an embodiment of the present invention, the playback sections to be converted may have the same playback length or different playback lengths according to the dynamic format conversion information. According to an exemplary embodiment of the present invention, the format conversion method may perform different conversion for each reproduction section of multi-channel audio data, or may perform partial conversion.

ステップＳ７４０において、多チャネルオーディオデータ再生装置は、変換された多チャネルオーディオデータを再生する。多チャネルオーディオデータ再生装置は、第２フォーマットに変換された多チャネルオーディオデータに対して各チャネルに対応するスピーカからオーディオデータを出力する。本発明の一実施形態によると、多チャネルオーディオデータの著作者が設定した第１フォーマットと多チャネルオーディオデータの再生環境に基づいた第２フォーマットとが互いに同一な場合、多チャネルオーディオデータ再生装置は変換を行うことなく、多チャネルオーディオデータを再生することができる。 In step S740, the multi-channel audio data reproducing device reproduces the converted multi-channel audio data. The multi-channel audio data reproducing device outputs audio data from the speaker corresponding to each channel with respect to the multi-channel audio data converted into the second format. According to an embodiment of the present invention, when the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are the same, the multi-channel audio data reproducing device is Multi-channel audio data can be played back without conversion.

本発明の実施形態に係る方法は、多様なコンピュータ手段を介して様々な処理を実行することができるプログラム命令の形態で実現され、コンピュータで読取可能な記録媒体に記録されてもよい。コンピュータ読取可能な媒体は、プログラム命令、データファイル、データ構造などのうち１つまたはその組合せを含んでもよい。媒体に記録されるプログラム命令は、本発明の目的のために特別に設計されて構成されたものでもよく、コンピュータソフトウェア分野の技術を有する当業者にとって公知のものであり、使用可能なものであってもよい。コンピュータ読取可能な記録媒体の例としては、ハードディスク、フロッピー（登録商標）ディスク及び磁気テープのような磁気媒体、ＣＤ−ＲＯＭ、ＤＶＤのような光記録媒体、光ディスクのような光磁気媒体、及びＲＯＭ、ＲＡＭ、フラッシュメモリなどのようなプログラム命令を保存して実行するように特別に構成されたハードウェア装置が含まれてもよい。プログラム命令の例には、コンパイラによって作られるような機械語コードだけでなく、インタープリタなどを用いてコンピュータによって実行できる高級言語コードが含まれる。前記したハードウェア装置は、本発明の動作を行うために１つ以上のソフトウェアモジュールとして動作するように構成されてもよく、その逆も同様である。 The method according to the embodiments of the present invention may be implemented in the form of program instructions capable of performing various processes via various computer means, and may be recorded in a computer-readable recording medium. Computer readable media may include one or a combination of program instructions, data files, data structures, and so on. The program instructions recorded on the medium may be those specially designed and configured for the purpose of the present invention, and are known and usable by those skilled in the art of computer software. May be. Examples of computer-readable recording media include hard disks, magnetic media such as floppy (registered trademark) disks and magnetic tapes, optical recording media such as CD-ROMs and DVDs, magneto-optical media such as optical disks, and ROMs. , RAM, flash memory, etc., and hardware devices specially configured to store and execute program instructions may be included. Examples of program instructions include not only machine language code such as that produced by a compiler, but also high level language code that can be executed by a computer using an interpreter or the like. The hardware devices described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.

上述したように、本発明を限定された実施形態と図面によって説明したが、本発明は、上記の実施形態に限定されることなく、本発明が属する分野における通常の知識を有する者であれば、このような実施形態から多様な修正及び変形が可能である。 As described above, the present invention has been described by the limited embodiments and the drawings. However, the present invention is not limited to the above embodiments, and a person having ordinary knowledge in the field to which the present invention belongs can be used. Various modifications and variations are possible from such an embodiment.

したがって、本発明の範囲は、開示された実施形態に限定されるものではなく、特許請求の範囲だけではなく特許請求の範囲と均等なものなどによって定められるものである。 Therefore, the scope of the present invention is not limited to the disclosed embodiments, but is defined not only by the claims but also by equivalents to the claims.

３１０：動的フォーマット変換情報
３２０：フォーマット変換方式Ｋ
３３０：フォーマット変換方式Ｍ
３４０：フォーマット変換方式Ｌ 310: Dynamic format conversion information 320: Format conversion method K
330: Format conversion method M
340: Format conversion method L

Claims

Dynamic data between the multi-channel audio data produced by the first format and the first format set by the author of the multi-channel audio data from the audio metadata and the second format based on the reproduction environment of the multi-channel audio data. A data identification section for identifying format conversion information,
An audio data conversion unit for converting the multi-channel audio data of the first format into a second format using the dynamic format conversion information;
An audio data reproducing unit for reproducing the multi-channel audio data converted into the second format,
Including,
In the dynamic format conversion information, a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are the multi-channel audio data. A multi-channel audio data reproducing device which is set for each reproducing section.

The multi-channel audio data reproducing apparatus according to claim 1, wherein the reproduction sections of the multi-channel audio data converting unit have the same reproduction length or different reproduction lengths.

The multi-channel audio data according to claim 1, wherein the format conversion method of the multi-channel audio data conversion unit performs different conversion for each reproduction section of the multi-channel audio data or conversion so as to be partially repeated. Playback device.

The multi-channel audio data reproducing apparatus according to claim 1, wherein the reproduction environment of the multi-channel audio data is determined based on a layout of a speaker that reproduces the multi-channel audio data.

Identifying dynamic format conversion information in the multi-channel audio data between a first format set by the author of the multi-channel audio data and a second format based on the playback environment of the multi-channel audio data;
Generating audio metadata including the identified dynamic format conversion information;
Including,
In the dynamic format conversion information, a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are the multi-channel audio data. A method for providing audio metadata, which is set for each playback section.

The audio metadata providing method according to claim 5, wherein the reproduction sections in which the plurality of format conversion methods are set have the same reproduction length or different reproduction lengths.

Dynamic data between the first format set by the author of the multi-channel audio data from the multi-channel audio data produced by the first format and the audio metadata and the second format based on the reproduction environment of the multi-channel audio data. Identifying the format conversion information,
Converting the multi-channel audio data of the first format into a second format using the dynamic format conversion information;
Playing the multi-channel audio data converted to the second format;
Including,
In the dynamic format conversion information, a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are used to reproduce the multi-channel audio data. A multi-channel audio data playback method that is set for each section.

The multi-channel audio data reproducing method according to claim 7, wherein the reproduction sections of the multi-channel audio data converting unit have the same reproduction length or different reproduction lengths.

8. The multi-channel audio data according to claim 7, wherein the format conversion method of the multi-channel audio data conversion unit performs different conversion for each reproduction section of the multi-channel audio data or conversion so as to be partially repeated. How to play.

The multi-channel audio data reproducing method according to claim 7, wherein the reproduction environment of the multi-channel audio data is determined based on a layout of a speaker that reproduces the multi-channel audio data.