KR102231750B1

KR102231750B1 - Audio metadata encoding and audio data playing apparatus for supporting dynamic format conversion, and method for performing by the appartus, and computer-readable medium recording the dynamic format conversions

Info

Publication number: KR102231750B1
Application number: KR1020190073485A
Authority: KR
Inventors: 유재현; 이태진; 이석진
Original assignee: 한국전자통신연구원; 경기대학교 산학협력단
Priority date: 2014-09-24
Filing date: 2019-06-20
Publication date: 2021-03-25
Also published as: KR102380279B1; KR20230071107A; KR102533824B1; KR20160035963A; JP6663147B2; JP2020092439A; KR20190076934A; KR20220044457A; JP7166398B2; JP6912612B2; JP2021170798A; JP2016072973A; KR101993348B1; KR20210033963A

Abstract

동적 포맷 변환을 지원하는 오디오 메타데이터 제공 장치 및 오디오 데이터 재생 장치, 상기 장치가 수행하는 방법 그리고 상기 동적 포맷 변환들이 기록된 컴퓨터에서 판독 가능한 기록매체가 개시된다. 동적 포맷 변환 정보는 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 따른 제2 포맷 간의 복수의 포맷 변환 방식들이 다채널 오디오 데이터의 재생 구간 별로 설정된 것이다. 오디오 메타데이터 제공 장치 및 제공방법은 동적 포맷 변환 정보를 포함하는 메타데이터를 제공한다. 다채널 오디오 데이터 재생 장치는 오디오 메타데이터에서 동적 포맷 변환 정보를 식별한다. 식별한 동적 포맷 변환 정보에 의하여, 다채널 오디오 데이터 재생 장치는 다채널 오디오 데이터의 저작자가 설정한 제1 포맷의 다채널 오디오 데이터를 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷으로 변환한다. 다채널 오디오 데이터 재생 장치는 변환된 다채널 오디오 데이터를 재생한다. Disclosed are an audio metadata providing apparatus and audio data reproducing apparatus supporting dynamic format conversion, a method performed by the apparatus, and a computer-readable recording medium in which the dynamic format conversions are recorded. In the dynamic format conversion information, a plurality of format conversion methods between a first format set by an author of the multi-channel audio data and a second format according to a reproduction environment of the multi-channel audio data are set for each reproduction section of the multi-channel audio data. An apparatus and method for providing audio metadata provides metadata including dynamic format conversion information. The multi-channel audio data reproducing apparatus identifies dynamic format conversion information in audio metadata. Based on the identified dynamic format conversion information, the multi-channel audio data reproducing apparatus converts multi-channel audio data of the first format set by the author of the multi-channel audio data into a second format based on a reproduction environment of the multi-channel audio data. The multi-channel audio data reproducing apparatus reproduces the converted multi-channel audio data.

Description

An audio metadata providing device and audio data reproducing device supporting dynamic format conversion, a method performed by the device, and a recording medium readable by a computer on which the dynamic format conversions are recorded (AUDIO METADATA ENCODING AND AUDIO DATA PLAYING APPARATUS FOR SUPPORTING DYNAMIC FORMAT) CONVERSION, AND METHOD FOR PERFORMING BY THE APPARTUS, AND COMPUTER-READABLE MEDIUM RECORDING THE DYNAMIC FORMAT CONVERSIONS}

본 발명은 다채널 오디오 데이터 재생 방법에 관한 것으로, 보다 구체적으로는 다채널 오디오 데이터의 다양한 포맷들 간의 변환 방법에 관한 것이다.The present invention relates to a method for reproducing multi-channel audio data, and more particularly, to a method for converting multi-channel audio data between various formats.

3DTV, 3D씨네마, UHDTV 등 차세대 콘텐츠 재생 환경에 대한 연구개발이 지속되면서 오디오도 다채널 라우드스피커를 사용하는 음향 재생 환경으로 변화가 빠르게 이루어지고 있다. As research and development on next-generation content playback environments such as 3DTV, 3D cinema, and UHDTV continues, audio is rapidly changing to a sound playback environment using multi-channel loudspeakers.

영화관 및 HDTV를 위한 입체 음향인 5.1채널 시스템 이후 상향 채널을 포함한 다양한 멀티채널 오디오 시스템이 도입되었고, ITU-R에서는 최근 Recommendation BS.2051을 제정하여 10.2채널, 13.1채널, 22.2채널 등을 비롯한 총 8개의 다채널 포맷을 차세대 오디오 시스템(advanced sound system)으로 정의하였다. 따라서 앞으로는 다양한 포맷에 기반을 둔 오디오 콘텐츠들이 제작될 가능성이 매우 높아졌다. After the 5.1-channel system, which is a stereoscopic sound for movie theaters and HDTVs, various multi-channel audio systems including upstream channels have been introduced, and ITU-R recently established Recommendation BS.2051 for a total of 8 channels including 10.2 channels, 13.1 channels, and 22.2 channels. Four multi-channel formats were defined as a next-generation audio system (advanced sound system). Therefore, it is highly likely that audio contents based on various formats will be produced in the future.

이러한 환경에서는 하나의 포맷으로 제작된 콘텐츠가 다른 포맷에서 재생될 가능성 또한 매우 높기 때문에, 콘텐츠 간 적절한 변환 방법이 요구된다. 종래에는 콘텐츠의 다채널 오디오 포맷으로부터 재생 환경 측의 새로운 다채널 오디오 포맷으로 포맷 변환을 함에 있어서 일괄적인 변환을 수행하였다. 그러나, 이러한 일괄 변환 방법은 콘텐츠 저작자의 저작 의도를 훼손할 수 밖에 없으며, 의도와 다른 변환을 수행할 수도 있다는 단점을 가지고 있다. In such an environment, the possibility that contents produced in one format can be reproduced in another format is also very high, and thus an appropriate conversion method between contents is required. Conventionally, in converting the format from a multi-channel audio format of content to a new multi-channel audio format on the side of the reproduction environment, batch conversion has been performed. However, such a batch conversion method has a disadvantage in that it inevitably damages the content author's authoring intention, and a conversion different from the intention may be performed.

본 발명은 다채널 오디오 데이터의 다양한 포맷 간에 저작자의 저작 의도가 완벽히 유지될 수 있게 포맷을 변환하는 동적 포맷 변환 방법을 제공하기 위한 오디오 메타데이터 제공 장치, 방법 및 동적 포맷 변환 방법에 따라 포맷을 변환하여 재생하는 장치, 방법 그리고 동적 포맷 변환 방법이 기록된 기록매체를 제안한다.The present invention converts the format according to an audio metadata providing apparatus, a method, and a dynamic format conversion method for providing a dynamic format conversion method for converting a format so that the author's authoring intention can be completely maintained between various formats of multi-channel audio data. Thus, we propose a recording medium on which a reproducing apparatus, a method, and a dynamic format conversion method are recorded.

본 발명은 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 따른 제2 포맷 간의 변환을 수행할 수 있는 동적 포맷 변환 정보가 포함된 오디오 메타데이터를 생성할 수 있는 오디오 메타데이터 제공 장치 및 방법을 제공한다.The present invention is capable of generating audio metadata including dynamic format conversion information capable of performing conversion between a first format set by an author of multi-channel audio data and a second format according to a reproduction environment of multi-channel audio data. An apparatus and method for providing audio metadata are provided.

본 발명은 다채널 오디오 데이터 및 동적 포맷 변환 정보가 담긴 오디오 메타데이터를 식별하여 제1 포맷에서 제2 포맷으로 다채널 오디오 데이터를 변환한 후 재생하는 다채널 오디오 데이터 재생 장치 및 방법을 제공한다.The present invention provides an apparatus and method for reproducing multi-channel audio data after converting multi-channel audio data from a first format to a second format by identifying multi-channel audio data and audio metadata including dynamic format conversion information.

본 발명은 다채널 오디오 데이터 및 동적 포맷 변환 정보가 포함된 오디오 메타데이터가 기록된 컴퓨터에서 판독 가능한 기록 매체를 제공한다.The present invention provides a computer-readable recording medium in which multi-channel audio data and audio metadata including dynamic format conversion information are recorded.

본 발명의 일실시예에 따른 오디오 메타데이터 제공 장치는 다채널 오디오 데이터에서 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 동적 포맷 변환 정보를 식별하는 변환 정보 식별부; 및 상기 식별된 동적 포맷 변환 정보를 포함하는 오디오 메타데이터를 생성하는 오디오 메타데이터 생성부를 포함한다. The apparatus for providing audio metadata according to an embodiment of the present invention provides dynamic format conversion information between a first format set by an author of multi-channel audio data from multi-channel audio data and a second format based on a reproduction environment of the multi-channel audio data. A conversion information identification unit to identify; And an audio metadata generator that generates audio metadata including the identified dynamic format conversion information.

상기 동적 포맷 변환 정보는 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 복수의 포맷 변환 방식들이 다채널 오디오 데이터의 재생 구간 별로 설정된 것이다. The dynamic format conversion information is that a plurality of format conversion methods between a first format set by an author of the multi-channel audio data and a second format based on a reproduction environment of the multi-channel audio data are set for each reproduction section of the multi-channel audio data.

상기 복수의 포맷 변환 방식이 설정된 재생 구간은, 서로 동일한 재생 길이를 가지거나 또는 서로 다른 재생 길이를 가질 수 있다. The reproduction sections in which the plurality of format conversion schemes are set may have the same reproduction length or different reproduction lengths.

상기 다채널 오디오 데이터의 재생 환경은, 상기 다채널 오디오 데이터가 재생되는 스피커들의 레이아웃에 기초하여 결정될 수 있다. The reproduction environment of the multi-channel audio data may be determined based on a layout of speakers through which the multi-channel audio data is reproduced.

상기 복수의 포맷 변환 방식들은, 상기 제1 포맷에서 제2 포맷으로 변환하기 위한 매트릭스를 포함할 수 있다.The plurality of format conversion methods may include a matrix for converting the first format to a second format.

상기 동적 포맷 변환 정보는, 다채널 오디오 데이터의 재생 구간 별로 서로 다르게 설정되거나 또는 부분적으로 반복되게 설정될 수 있다.The dynamic format conversion information may be set differently or partially repeated for each reproduction section of multi-channel audio data.

본 발명의 일실시예에 따른 다채널 오디오 데이터 재생 장치는 제1 포맷에 따라 제작된 다채널 오디오 데이터 및 오디오 메타데이터로부터 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 동적 포맷 변환 정보를 식별하는 데이터 식별부; 상기 동적 포맷 변환 정보를 이용하여 상기 제1 포맷의 다채널 오디오 데이터를 제2 포맷으로 변환하는 오디오 데이터 변환부; 및 상기 제2 포맷으로 변환된 다채널 오디오 데이터를 재생하는 오디오 데이터 재생부를 포함한다. The apparatus for reproducing multi-channel audio data according to an embodiment of the present invention reproduces the multi-channel audio data and the first format set by the author of the multi-channel audio data from the multi-channel audio data and audio metadata produced according to the first format. A data identification unit for identifying dynamic format conversion information between second formats based on an environment; An audio data conversion unit converting the multi-channel audio data of the first format into a second format using the dynamic format conversion information; And an audio data reproducing unit for reproducing multi-channel audio data converted into the second format.

상기 다채널 오디오 데이터 변환부의 재생 구간은, 서로 동일한 재생 길이를 가지거나 또는 서로 다른 재생 길이를 가질 수 있다.The reproduction sections of the multi-channel audio data converter may have the same reproduction length or different reproduction lengths.

상기 다채널 오디오 데이터 변환부의 포맷 변환 방식은, 다채널 오디오 데이터의 재생 구간 별로 서로 다르게 변환하거나 또는 부분적으로 반복되게 변환할 수 있다.The format conversion method of the multi-channel audio data conversion unit may be converted differently or partially repetitively converted for each reproduction section of the multi-channel audio data.

상기 다채널 오디오 데이터의 재생 환경은, 상기 다채널 오디오 데이터가 재생되는 스피커들의 레이아웃에 기초하여 결정될 수 있다.The reproduction environment of the multi-channel audio data may be determined based on a layout of speakers through which the multi-channel audio data is reproduced.

본 발명의 일실시예에 따른 오디오 메타데이터 제공 방법은 다채널 오디오 데이터에서 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 동적 포맷 변환 정보를 식별하는 단계; 및 상기 식별된 동적 포맷 변환 정보를 포함하는 오디오 메타데이터를 생성하는 단계를 포함한다.In the method of providing audio metadata according to an embodiment of the present invention, dynamic format conversion information between a first format set by an author of the multi-channel audio data and a second format based on a reproduction environment of the multi-channel audio data in multi-channel audio data. Identifying; And generating audio metadata including the identified dynamic format conversion information.

상기 복수의 포맷 변환 방식이 설정된 재생 구간은, 서로 동일한 재생 길이를 가지거나 또는 서로 다른 재생 길이를 가질 수 있다.The reproduction sections in which the plurality of format conversion schemes are set may have the same reproduction length or different reproduction lengths.

본 발명의 일실시예에 따른 다채널 오디오 데이터 재생 방법은 제1 포맷에 따라 제작된 다채널 오디오 데이터 및 오디오 메타데이터로부터 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 동적 포맷 변환 정보를 식별하는 단계와; 상기 동적 포맷 변환 정보를 이용하여 상기 제1 포맷의 다채널 오디오 데이터를 제2 포맷으로 변환하는 단계; 및 상기 제2 포맷으로 변환된 다채널 오디오 데이터를 재생하는 단계를 포함한다.A method of reproducing multi-channel audio data according to an embodiment of the present invention is to reproduce a first format and multi-channel audio data set by an author of multi-channel audio data from multi-channel audio data and audio metadata produced according to a first format. Identifying dynamic format conversion information between the second formats based on the environment; Converting multi-channel audio data of the first format into a second format using the dynamic format conversion information; And reproducing the multi-channel audio data converted into the second format.

본 발명의 일실시예에 따른 컴퓨터에서 판독 가능한 기록 매체는 하나 이상의 채널로 구성된 다채널 오디오 데이터 및 다채널 오디오 데이터에서 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 동적 포맷 변환 정보가 포함된 오디오 메타데이터가 기록된다.A computer-readable recording medium according to an embodiment of the present invention includes multi-channel audio data composed of one or more channels, and a reproduction environment for multi-channel audio data and a first format set by the author of the multi-channel audio data from the multi-channel audio data. Audio metadata including dynamic format conversion information between the second formats based on is recorded.

본 발명의 일실시예에 따르면, 다채널 오디오 데이터의 다양한 포맷 간에 저작자의 저작 의도가 완벽히 유지될 수 있게 포맷을 변환하는 동적 포맷 변환 방법을 제공하기 위한 오디오 메타데이터 제공 장치, 방법 및 동적 포맷 변환 방법에 따라 포맷을 변환하여 재생하는 장치, 방법 그리고 동적 포맷 변환 방법이 기록된 기록 매체가 제공된다.According to an embodiment of the present invention, an audio metadata providing apparatus, method, and dynamic format conversion for providing a dynamic format conversion method for converting a format so that the author's authoring intention can be completely maintained between various formats of multi-channel audio data. An apparatus, a method for converting and reproducing a format according to the method, and a recording medium in which a method for dynamic format conversion is recorded are provided.

본 발명의 일실시예에 따르면, 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 변환을 수행할 수 있는 동적 포맷 변환 정보가 포함된 오디오 메타데이터를 생성할 수 있는 오디오 메타데이터 제공 장치 및 방법이 제공된다.According to an embodiment of the present invention, an audio meta including dynamic format conversion information capable of performing conversion between a first format set by an author of multi-channel audio data and a second format based on a reproduction environment of multi-channel audio data An apparatus and method for providing audio metadata capable of generating data are provided.

본 발명의 일실시예에 따르면, 다채널 오디오 데이터 및 동적 포맷 변환 정보가 담긴 오디오 메타데이터를 식별하여 제1 포맷에서 제2 포맷으로 다채널 오디오 데이터를 변환한 후 재생하는 다채널 오디오 데이터 재생 장치 및 방법이 제공된다.According to an embodiment of the present invention, a multi-channel audio data reproducing apparatus that identifies multi-channel audio data and audio metadata containing dynamic format conversion information, converts multi-channel audio data from a first format to a second format, and reproduces it. And a method is provided.

본 발명의 일실시예에 따르면, 다채널 오디오 데이터 및 동적 포맷 변환 정보가 포함된 오디오 메타데이터가 기록된 컴퓨터에서 판독 가능한 기록 매체가 제공된다.According to an embodiment of the present invention, a computer-readable recording medium in which audio metadata including multi-channel audio data and dynamic format conversion information is recorded is provided.

도 1은 본 발명의 일실시예에 따른 오디오 메타데이터 제공 장치와 오디오 메타데이터 및 다채널 오디오 데이터 재생 장치를 도시한 도면이다.
도 2는 본 발명의 일실시예에 따른 다채널 오디오 데이터의 포맷을 일괄적으로 변환하는 일례를 도시한 도면이다.
도 3는 본 발명의 일실시예에 따른 동적 포맷 변환 정보로 다채널 오디오 데이터의 포맷을 변환하는 일례를 도시한 도면이다.
도 4는 본 발명의 일실시예에 따른 하나 이상의 동적 포맷 변환 정보를 포함한 오디오 메타데이터를 도시한 도면이다.
도 5는 본 발명의 일실시예에 따른 매트릭스 방식을 이용하여 포맷 간의 변환을 수행하는 실시예를 설명하기 위한 도면이다.
도 6은 본 발명의 일실시예에 따른 오디오 메타데이터 제공 장치가 동적 포맷 변환 정보가 포함된 오디오 메타데이터를 제공하는 동작을 나타낸 흐름도이다.
도 7은 본 발명의 일실시예에 따른 다채널 오디오 데이터 재생 장치가 다채널 오디오 데이터의 포맷을 변환한 이후 이를 재생하는 동작을 나타낸 흐름도이다.1 is a diagram illustrating an apparatus for providing audio metadata and an apparatus for reproducing audio metadata and multi-channel audio data according to an embodiment of the present invention.
2 is a diagram illustrating an example of collectively converting the format of multi-channel audio data according to an embodiment of the present invention.
3 is a diagram illustrating an example of converting a format of multi-channel audio data using dynamic format conversion information according to an embodiment of the present invention.
4 is a diagram illustrating audio metadata including one or more dynamic format conversion information according to an embodiment of the present invention.
5 is a diagram for explaining an embodiment of performing conversion between formats using a matrix method according to an embodiment of the present invention.
6 is a flowchart illustrating an operation of providing audio metadata including dynamic format conversion information by an audio metadata providing apparatus according to an embodiment of the present invention.
7 is a flowchart illustrating an operation of reproducing multi-channel audio data after converting the format of the multi-channel audio data according to an embodiment of the present invention.

이하, 본 발명의 실시예를 첨부된 도면을 참조하여 상세하게 설명한다. Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일실시예에 따른 오디오 메타데이터 제공 장치(110)와 오디오 메타데이터(140) 및 다채널 오디오 데이터 재생 장치(160)를 도시한 도면이다.1 is a diagram illustrating an audio metadata providing apparatus 110, an audio metadata 140, and a multi-channel audio data reproducing apparatus 160 according to an embodiment of the present invention.

도 1을 참고하면, 오디오 메타데이터 제공 장치(110)는 동적 포맷 변환 정보를 식별하는 변환 정보 식별부(120) 및 식별된 동적 포맷 변환 정보를 포함하는 오디오 메타데이터(140)를 생성하는 오디오 메타데이터 생성부(130)를 포함한다. 동적 포맷 변환 정보는 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 복수의 포맷 변환 방식들이 다채널 오디오 데이터의 재생 구간 별로 설정된 것이다. Referring to FIG. 1, the audio metadata providing apparatus 110 includes a conversion information identification unit 120 for identifying dynamic format conversion information and an audio metadata for generating audio metadata 140 including the identified dynamic format conversion information. It includes a data generating unit 130. In the dynamic format conversion information, a plurality of format conversion methods between a first format set by an author of the multi-channel audio data and a second format based on a reproduction environment of the multi-channel audio data are set for each reproduction section of the multi-channel audio data.

본 발명의 일실시예에 따르면, 변환 정보 식별부(120)는 다채널 오디오 데이터의 저작자에게서 동적 포맷 변환 정보를 식별할 수 있다. 또 다른 실시예에 따르면, 변환 정보 식별부(120)는 하나 이상의 오디오 메타데이터로부터 복수의 동적 포맷 변환 정보를 식별할 수 있다. According to an embodiment of the present invention, the conversion information identification unit 120 may identify dynamic format conversion information from an author of multi-channel audio data. According to another embodiment, the conversion information identification unit 120 may identify a plurality of dynamic format conversion information from one or more audio metadata.

본 발명의 일실시예에 따르면, 변환 정보 식별부(120)에서 식별된 동적 포맷 변환 정보를 기초로 오디오 메타데이터를 생성하는 오디오 메타데이터 생성부(130)가 제공된다. 오디오 메타데이터 생성부(130)는 오디오 메타데이터에 식별된 복수의 동적 포맷 변환 정보를 포함할 수 있다. 본 발명의 일실시예에 따르면, 오디오 메타데이터 생성부(130)는 동적 포맷 변환 정보의 각 포맷 변환 방식을 매트릭스의 형태로 포함할 수 있다. 또 다른 실시예에 따르면, 오디오 메타데이터 생성부(130)는 메타데이터에 식별된 동적 포맷 변환 정보와 함께 메타데이터에 일반적으로 포함되는 정보(예를 들어, 저작자, 음반 제목, 출시 년도 등)를 포함할 수 있다. According to an embodiment of the present invention, there is provided an audio metadata generation unit 130 that generates audio metadata based on dynamic format conversion information identified by the conversion information identification unit 120. The audio metadata generator 130 may include a plurality of dynamic format conversion information identified in the audio metadata. According to an embodiment of the present invention, the audio metadata generator 130 may include each format conversion method of dynamic format conversion information in the form of a matrix. According to another embodiment, the audio metadata generation unit 130 stores information generally included in the metadata (eg, author, album title, release year, etc.) together with the dynamic format conversion information identified in the metadata. Can include.

본 발명의 일실시예에 따르면, 오디오 메타데이터 제공 장치(110)는 다채널 오디오 데이터 제공 장치의 일부 구성으로 포함될 수 있다.According to an embodiment of the present invention, the audio metadata providing device 110 may be included as a part of the multi-channel audio data providing device.

오디오 메타데이터 제공 장치(110)로부터 동적 포맷 변환 정보(150)를 포함하는 오디오 메타데이터(140)가 제공된다. 본 발명의 일실시예에 따르면, 오디오 메타데이터(140)는 동적 포맷 변환 정보(150)뿐만 아니라 메타데이터에 일반적으로 포함되는 정보를 포함할 수 있다. 본 발명의 다른 일실시예에 따르면, 오디오 메타데이터는 다채널 오디오 데이터와 함께 제공될 수 있다. 본 발명의 또 다른 일실시예에 따르면, 오디오 메타데이터(140)는 실시간으로 다채널 오디오 데이터 재생 장치(160)에 전송되거나, 또는 다채널 오디오 데이터 재생 장치(160)에 미리 전송되어 다채널 오디오 데이터 재생 장치(160)의 버퍼, 메모리 같은 저장 매체에 저장될 수 있다. 또는 오디오 메타데이터(140)는 CD-ROM, CD-RW, DVD-R, DVD-RW 등과 같은 광 기록 매체에 저장되어 배포될 수 있다. Audio metadata 140 including dynamic format conversion information 150 is provided from the audio metadata providing device 110. According to an embodiment of the present invention, the audio metadata 140 may include not only the dynamic format conversion information 150 but also information generally included in the metadata. According to another embodiment of the present invention, audio metadata may be provided together with multi-channel audio data. According to another embodiment of the present invention, the audio metadata 140 is transmitted to the multi-channel audio data reproducing apparatus 160 in real time, or previously transmitted to the multi-channel audio data reproducing apparatus 160 to provide multi-channel audio. It may be stored in a storage medium such as a buffer or memory of the data reproducing apparatus 160. Alternatively, the audio metadata 140 may be stored and distributed in an optical recording medium such as a CD-ROM, a CD-RW, a DVD-R, or a DVD-RW.

다채널 오디오 데이터를 동적 포맷 변환 정보에 의하여 포맷 간에 변환을 수행한 후 이를 재생할 수 있는 다채널 오디오 데이터 재생 장치(160)가 제공된다. 다채널 오디오 데이터 재생 장치(160)는 동적 변환 정보를 식별하는 데이터 식별부(170)와 식별된 동적 포맷 변환 정보로 포맷 간의 변환을 수행하는 오디오 데이터 변환부(180) 및 변환된 다채널 오디오 데이터를 재생하는 오디오 데이터 재생부(190)를 포함한다. A multi-channel audio data reproducing apparatus 160 capable of reproducing multi-channel audio data after performing conversion between formats according to dynamic format conversion information is provided. The multi-channel audio data reproducing apparatus 160 includes a data identification unit 170 for identifying dynamic conversion information, an audio data conversion unit 180 for performing conversion between formats using the identified dynamic format conversion information, and the converted multi-channel audio data. It includes an audio data reproducing unit 190 for reproducing.

본 발명의 일실시예에 따르면, 데이터 식별부(170)는 오디오 메타데이터(140)에서 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷에 해당하는 동적 포맷 변환 정보를 식별한다. 다채널 오디오 데이터의 재생 환경은 다채널 오디오 데이터가 재생되는 스피커들의 레이아웃에 기초하여 결정된다. 본 발명의 일실시예에 따르면, 데이터 식별부(170)는 오디오 메타데이터에 기록된 하나 이상의 동적 포맷 변환 정보 중 제2 포맷에 대응 되는 동적 포맷 변환 정보를 선택하여 식별할 수 있다. According to an embodiment of the present invention, the data identification unit 170 identifies dynamic format conversion information corresponding to a second format based on a reproduction environment of multi-channel audio data from the audio metadata 140. The reproduction environment of the multi-channel audio data is determined based on the layout of speakers in which the multi-channel audio data is reproduced. According to an embodiment of the present invention, the data identification unit 170 may select and identify dynamic format conversion information corresponding to the second format from among one or more dynamic format conversion information recorded in audio metadata.

본 발명의 일실시예에 따르면, 오디오 데이터 변환부(180)는 식별한 동적 포맷 변환 정보에 의해 다채널 오디오 데이터를 다채널 오디오 저작자가 설정한 제1 포맷에서 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷으로 변환한다. 동적 포맷 변환 정보는 제1 포맷과 제2 포맷간의 복수의 포맷 변환 방식들이 다채널 오디오 데이터의 재생 구간 별로 설정된 것이다. According to an embodiment of the present invention, the audio data conversion unit 180 converts the multi-channel audio data based on the identified dynamic format conversion information into the multi-channel audio data in a first format set by the multi-channel audio author based on a reproduction environment of multi-channel audio data. Convert to the second format. In the dynamic format conversion information, a plurality of format conversion methods between the first format and the second format are set for each reproduction section of the multi-channel audio data.

본 발명의 일실시예에 따르면, 오디오 데이터 변환부(180)는 재생 시간에 따라서 동적 포맷 변환 정보로부터 재생 시간을 포함하는 재생 구간을 식별하고, 동적 포맷 변환 정보에서 해당 재생 구간에 설정된 포맷 변환 방식을 식별하여 제1 포맷과 제2 포맷간의 포맷 변환을 수행한다. 본 발명의 일실시예에 따르면, 복수의 포맷 변환 방식이 설정된 재생 구간은 서로 동일한 재생 길이를 가지거나 서로 다른 재생 길이를 가질 수 있다. 본 발명의 일실시예에 따르면, 오디오 데이터 변환부(180)는 동적 포맷 변환 정보에 의하여 재생 구간 별로 서로 다른 포맷 변환 방식을 사용하여 변환하거나 또는 부분적으로 포맷 변환 방식을 반복되게 사용하여 변환할 수 있다. According to an embodiment of the present invention, the audio data conversion unit 180 identifies a playback section including a playback time from dynamic format conversion information according to a playback time, and a format conversion method set in the playback section from the dynamic format conversion information. Is identified to perform format conversion between the first format and the second format. According to an embodiment of the present invention, playback sections in which a plurality of format conversion schemes are set may have the same playback length or different playback lengths. According to an embodiment of the present invention, the audio data conversion unit 180 may convert by using different format conversion methods for each playback section according to the dynamic format conversion information or partially by repeatedly using the format conversion method. have.

본 발명의 일실시예에 따르면, 제2 포맷으로 변환된 다채널 오디오 데이터를 재생하는 오디오 데이터 재생부(190)가 제공된다. 제2 포맷은 다채널 오디오 데이터의 재생환경에 기초하며, 다채널 오디오 데이터의 재생 환경은 다채널 오디오 데이터가 재생되는 스피커들의 레이아웃에 기초하여 결정된다. 오디오 데이터 재생부(190)는 하나 이상의 스피커 출력부로 구성된다. 오디오 데이터 재생부(190)는 제2 포맷으로 변환된 다채널 오디오 데이터에 대하여 각 채널에 대응되는 스피커로 오디오 데이터를 출력한다. According to an embodiment of the present invention, an audio data reproducing unit 190 for reproducing multi-channel audio data converted into a second format is provided. The second format is based on a reproduction environment of the multi-channel audio data, and the reproduction environment of the multi-channel audio data is determined based on the layout of speakers in which the multi-channel audio data is reproduced. The audio data reproducing unit 190 is composed of one or more speaker output units. The audio data reproducing unit 190 outputs audio data to a speaker corresponding to each channel with respect to the multi-channel audio data converted into the second format.

본 발명의 일실시예에 따르면, 오디오 데이터 재생부(190)는 출력부에 연결된 스피커의 개수를 파악하여 다채널 오디오 데이터의 재생 환경을 식별할 수 있다. 더 나아가서, 오디오 데이터 재생부(190)는 스피커의 개수뿐만 아니라 각 스피커의 위치를 식별하거나 사용자로부터 재생 환경에 대한 정보를 입력 받음으로써 재생 환경을 식별할 수 있다.According to an embodiment of the present invention, the audio data reproducing unit 190 may identify a reproduction environment of multi-channel audio data by grasping the number of speakers connected to the output unit. Furthermore, the audio data reproducing unit 190 may identify the reproducing environment not only by identifying the number of speakers, but also by identifying the location of each speaker or receiving information on the reproducing environment from a user.

도 2는 본 발명의 일실시예에 따른 다채널 오디오 데이터의 포맷을 일괄적으로 변환하는 일례를 도시한 도면이다.2 is a diagram illustrating an example of collectively converting the format of multi-channel audio data according to an embodiment of the present invention.

다채널 오디오 데이터는 다채널 오디오 데이터의 저작자가 설정한 다채널 오디오 데이터 포맷인 제1 포맷에 맞추어 제작된다. 다채널 오디오 데이터를 재생하는 측의 다채널 오디오 데이터 포맷인 제2 포맷은 다채널 오디오 데이터의 재생 환경에 기초한다. 다채널 오디오 데이터의 재생 환경은 다채널 오디오 데이터가 재생되는 스피커들의 레이아웃에 기초하여 결정되므로, 제2 포맷은 다채널 오디오 데이터의 제1 포맷과 다를 수 있다. 본 발명의 일실시예에 따르면, 다채널 오디오 데이터의 재생 환경에 기초하는 제2 포맷이 제1 포맷과 다른 경우, 다채널 오디오 데이터 재생 장치의 오디오 데이터 변환부는 일괄적인 포맷 변환 방식(200)에 따라서 변환을 수행할 수 있다. The multi-channel audio data is produced according to the first format, which is a multi-channel audio data format set by the author of the multi-channel audio data. The second format, which is a multi-channel audio data format on a side that reproduces multi-channel audio data, is based on a reproduction environment of multi-channel audio data. Since the reproduction environment of the multi-channel audio data is determined based on the layout of speakers in which the multi-channel audio data is reproduced, the second format may be different from the first format of the multi-channel audio data. According to an embodiment of the present invention, when the second format based on the reproduction environment of multi-channel audio data is different from the first format, the audio data conversion unit of the multi-channel audio data reproducing apparatus uses the batch format conversion method 200. Therefore, you can perform the conversion.

도 2를 예로 들면, 제1 포맷은 10.2채널 포맷이라 가정하자. 일괄적인 포맷 변환 방식(200)에 따르면, 제2 포맷이 5.1채널 포맷인 경우, 청자의 좌측 전면 스피커 L은 제1 포맷의 좌측 전면 스피커 L과 좌측 상단 스피커 LH의 선형 결합(linear combination)으로 결정된다. 또 다른 예로써, 제2 포맷이 7.1채널 포맷인 경우, 우측 후면 스피커 RB는 제1 포맷의 우측 후면 스피커 RB와 중앙 스피커 CH의 선형 결합으로 결정된다. Taking FIG. 2 as an example, it is assumed that the first format is a 10.2 channel format. According to the batch format conversion method 200, when the second format is a 5.1-channel format, the left front speaker L of the listener is determined as a linear combination of the left front speaker L and the upper left speaker LH of the first format. do. As another example, when the second format is a 7.1-channel format, the right rear speaker RB is determined by a linear combination of the right rear speaker RB and the center speaker CH of the first format.

일괄적인 포맷 변환 방식(200)에 따르면, 포맷 변환 방식은 채널간의 선형 결합으로 주어지게 되어 비선형 변환은 불가하다. 또한, 재생 구간 별로 포맷 변환 방식이 변화할 수 없다. 본 발명의 일실시예에 따르면, 다채널 오디오 데이터의 재생 구간 별로 하나 이상의 포맷 변환 방식이 설정된 동적 변환 정보가 제공된다. 또한, 제1 포맷과 제2 포맷간의 비선형 변환을 지원하는 포맷 변환 방식이 제공된다. According to the batch format conversion method 200, the format conversion method is given by a linear combination between channels, and thus nonlinear conversion is impossible. In addition, the format conversion method cannot be changed for each playback section. According to an embodiment of the present invention, dynamic conversion information in which at least one format conversion method is set for each reproduction section of multi-channel audio data is provided. In addition, a format conversion method supporting nonlinear conversion between a first format and a second format is provided.

도 3은 본 발명의 일실시예에 따른 다채널 오디오 데이터의 포맷 변환을 수행할 수 있는 동적 포맷 변환 정보를 도시한 도면이다.3 is a diagram illustrating dynamic format conversion information capable of performing format conversion of multi-channel audio data according to an embodiment of the present invention.

도 3을 참고하면, 동적 포맷 변환 정보(310)는 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷간의 복수의 포맷 변환 방식들(예를 들어, 포맷 변환 방식 K(320), L(340) 그리고 M(330))이 다채널 오디오 데이터의 재생 구간 별로 설정된 것이다. Referring to FIG. 3, the dynamic format conversion information 310 includes a plurality of format conversion methods (for example, a first format set by an author of multi-channel audio data and a second format based on a reproduction environment of the multi-channel audio data). , Format conversion methods K (320), L (340) and M (330)) are set for each reproduction section of multi-channel audio data.

본 발명의 일실시예에 따르면, 각각의 포맷 변환 방식은 동일한 제2 포맷으로 포맷을 변환한다. 다만, 변환 하는 방식은 서로 다를 수 있다. 도 3을 예로 들면, 포맷 변환 방식 K(320)는 제1 포맷의 복수의 좌측 스피커 Left₁과 Left₂의 선형 결합으로 제2 포맷의 좌측 스피커 Left의 출력 데이터를 결정한다. 포맷 변환 방식 M(330)은 제1 포맷의 복수의 좌측 스피커 중 하나인 Left₁ 만으로 제2 포맷의 좌측 스피커 Left의 출력 데이터를 결정한다. 본 발명의 일실시예에 따르면, 각각의 변환 방식은 비선형 변환을 포함할 수 있다. According to an embodiment of the present invention, each format conversion method converts the format into the same second format. However, the conversion method may be different. Referring to FIG. 3 as an example, the format conversion method K 320 determines output data of the left speaker Left of the second format by linear combination of a _{plurality of left speakers Left 1} and Left _{2 of the first format.} The format conversion method M 330 determines the output data of the left speaker Left of the second format using only _{Left 1,} which is one of the plurality of left speakers of the first format. According to an embodiment of the present invention, each transformation method may include a nonlinear transformation.

본 발명의 일실시예에 따르면, 다채널 오디오 데이터 재생 장치는 동적 포맷 변환 정보로부터 재생 구간 별로 설정된 포맷 변환 방식을 식별하여 변환할 수 있다. 도 3을 예로 들면, 다채널 오디오 데이터 재생 장치는 재생 구간 t=0에서 t=t₁까지 다채널 오디오 데이터를 포맷 변환 방식 K(320)에 의하여 변환을 수행한다. 다채널 오디오 데이터 재생 장치는 이후의 재생 구간 t=t₁에서 t=t₂까지 다채널 오디오 데이터를 포맷 변환 방식 M(330)에 의하여 변환을 수행한다. 마찬가지로, 다채널 오디오 데이터 재생 장치는 재생 구간 t=t₃에서 t=t₄까지는 포맷 변환 방식 L(340)에 의하여 변환을 수행하고, 이후의 재생 구간에서도 같은 작업을 반복한다.According to an embodiment of the present invention, the apparatus for reproducing multi-channel audio data may identify and convert a format conversion method set for each reproduction section from dynamic format conversion information. Referring to FIG. 3 as an example, the multi-channel audio data reproducing apparatus converts multi-channel audio data from a reproduction period t = 0 to t = t ₁ by a format conversion method K (320). The multi-channel audio data reproducing apparatus converts multi-channel audio data from a subsequent reproduction period t = t ₁ to t = t ₂ by the format conversion method M 330. Similarly, the multi-channel audio data reproducing apparatus _{performs conversion by the format conversion method L 340 from the reproduction period t=t 3} to t=t ₄ , and repeats the same operation in the subsequent reproduction period.

본 발명의 일실시예에 따르면, 동적 포맷 변환 정보(310)는 다채널 오디오 데이터의 재생 구간 별로 포맷 변환 방식을 서로 다르게 설정하거나 또는 부분적으로 반복되게 설정할 수 있다. 도3을 예로 들면, 포맷 변환 방식 K(320)는 재생 구간 t=0에서 t=t₁뿐만 아니라, 재생 구간 t=t₂에서 t=t₃에서도 다시 설정 될 수 있다. 본 발명의 일실시예에 따르면, 포맷 변환 방식은 일괄적인 포맷 변환 방식이나 선형 결합(linear combination)에 의한 변환뿐만 아니라 비선형 변환도 포함할 수 있다.According to an embodiment of the present invention, the dynamic format conversion information 310 may set a format conversion method differently or partially repetitively for each reproduction section of multi-channel audio data. Referring to FIG. 3 as an example, the format conversion method K 320 may be set again _{not only in t=t 1} in the reproduction period t=0, but also in t=t ₃ in the reproduction period t=t _2. According to an embodiment of the present invention, the format conversion method may include nonlinear conversion as well as conversion by a batch format conversion method or a linear combination.

본 발명의 일실시예에 따르면, 포맷 변환 방식이 설정된 각각의 재생 구간은 서로 동일한 재생 길이를 가지거나 서로 다른 재생 길이를 가질 수 있다. 도3을 예로 들면, 재생 구간 t=t₁에서 t=t₂와 재생 구간 t=t₇에서 t=t₈은 서로 같은 재생 길이를 가질 수가 있다. According to an embodiment of the present invention, each playback section in which the format conversion method is set may have the same playback length or different playback lengths. FIG example 3 example, the playback period t = t ₁ at t = t ₂ and the playback period t = t ₇ at t = t ₈ are allowed to have the play time of each other.

도 4는 본 발명의 일실시예에 따른 하나 이상의 동적 포맷 변환 정보를 포함한 오디오 메타데이터를 도시한 도면이다. 4 is a diagram illustrating audio metadata including one or more dynamic format conversion information according to an embodiment of the present invention.

도 4를 참고하면, 다채널 오디오 데이터의 재생 환경이 다양하기 때문에, 오디오 메타데이터(140)는 하나 이상의 동적 포맷 변환 정보(420, 430)를 포함할 수 있다. 다채널 오디오 데이터 재생 장치(160)는 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷에 해당하는 동적 변환 정보를 선택하여 다채널 오디오 데이터를 변환한다. 재생 환경은 다채널 오디오 데이터가 재생되는 스피커들의 레이아웃에 기초하여 결정된다. Referring to FIG. 4, since the reproduction environment of multi-channel audio data is diverse, the audio metadata 140 may include one or more dynamic format conversion information 420 and 430. The multi-channel audio data reproducing apparatus 160 converts multi-channel audio data by selecting dynamic conversion information corresponding to a second format based on a reproduction environment of the multi-channel audio data. The reproduction environment is determined based on the layout of speakers in which multi-channel audio data is reproduced.

도 4를 예로 들면, 다채널 오디오 데이터의 저작자가 설정한 제1 포맷이 22.2채널 포맷이고, 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷이 10.2채널 포맷이라 가정하자. 다채널 오디오 데이터 재생 장치의 데이터 식별부(170)는 오디오 메타데이터의 복수의 동적 포맷 변환 정보들(420, 430) 중에서 제2 포맷에 대응하는 동적 포맷 변환 정보1(420)을 식별한다. 마찬가지로 다채널 오디오 데이터의 재생환경에 기초한 제2 포맷이 5.1채널 포맷이라면, 다채널 오디오 데이터 재생 장치의 데이터 식별부(170)는 동적 포맷 변환 정보2(430)를 식별할 것이다. Referring to FIG. 4 as an example, it is assumed that a first format set by an author of multi-channel audio data is a 22.2 channel format, and a second format based on a reproduction environment of multi-channel audio data is a 10.2 channel format. The data identification unit 170 of the multi-channel audio data reproducing apparatus identifies the dynamic format conversion information 1 420 corresponding to the second format from among the plurality of dynamic format conversion information 420 and 430 of the audio metadata. Similarly, if the second format based on the reproduction environment of the multi-channel audio data is a 5.1-channel format, the data identification unit 170 of the multi-channel audio data reproducing apparatus will identify the dynamic format conversion information 2 430.

앞서 가정한 10.2채널 포맷에서, 오디오 데이터 변환부(180)는 식별된 동적 포맷 변환 정보1(420)에 의하여 다채널 오디오 데이터를 변환한다. 즉, 오디오 데이터 변환부(180)는 재생 구간 별로 설정된 복수의 포맷 변환 방식들(440)을 기초로 다채널 오디오 데이터를 재생 구간 t=0에서 t=t₁까지는 포맷 변환 방식 K(450)에 의하여 변환하고, 재생 구간 t=t₁에서 t=t₂까지는 포맷 변환 방식 M(460)에 의하여 변환한다. 본 발명의 일실시예에 따르면, 동적 포맷 변환 정보는 다채널 오디오 데이터의 재생 구간 별로 서로 다르게 설정되거나 또는 부분적으로 반복되게 설정될 수 있다. 또한, 포맷 변환 방식이 설정된 각각의 재생 구간의 재생 길이도 서로 다르거나 같을 수 있다. 도 4를 참고하면, 포맷 변환 방식 K(450)는 재생 구간 t=0에서 t=t₁에서 사용되었지만 그 이후의 재생 구간에서도 반복되어 사용할 수 있다. 또한, 재생 구간 t=0에서 t=t₁과 재생 구간 t=t₁에서 t=t₂의 재생 길이도 서로 다르거나 같을 수 있다. In the previously assumed 10.2 channel format, the audio data conversion unit 180 converts multi-channel audio data according to the identified dynamic format conversion information 1 420. That is, the audio data conversion unit 180 converts the multi-channel audio data from the playback period t=0 to t=t ₁ to the format conversion method K 450 based on the plurality of format conversion methods 440 set for each playback period. And the reproduction period t=t ₁ to t=t _{2 is} converted according to the format conversion method M (460). According to an embodiment of the present invention, the dynamic format conversion information may be set differently or partially repeated for each reproduction section of multi-channel audio data. In addition, the reproduction lengths of each reproduction section in which the format conversion method is set may be different from each other or may be the same. Referring to FIG. 4, the format conversion method K 450 is used in the reproduction period t=0 and t=t ₁ , but may be repeatedly used in the subsequent reproduction period. It is also possible to be the same or different from each other play time of the playback section t = 0 at t = t _1, and the playback period t = t ₁ at t = t _2.

도 5는 본 발명의 일실시예에 따른 매트릭스 방식을 이용하여 포맷 간의 변환을 수행하는 실시예를 설명하기 위한 도면이다.5 is a diagram for explaining an embodiment of performing conversion between formats using a matrix method according to an embodiment of the present invention.

도 5를 참고하면, 동적 포맷 변환 정보에서 각각의 포맷 변환 방식들은 변환 매트릭스들(530,540)로 저장될 수 있다. 변환 매트릭스들은 다채널 오디오데이터의 저작자가 설정한 제1 포맷에서 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷으로 변환하기 위한 매트릭스이다. 오디오 데이터 변환부는 제1 포맷 채널 매트릭스를 변환 매트릭스에 적용하여 제2 포맷 채널 매트릭스를 출력함으로써 제1 포맷에서 제2 포맷으로 변환할 수 있다. Referring to FIG. 5, in the dynamic format conversion information, each format conversion method may be stored as conversion matrices 530 and 540. The conversion matrices are for converting from a first format set by an author of multi-channel audio data to a second format based on a reproduction environment of multi-channel audio data. The audio data converter may convert the first format channel matrix from the first format to the second format by applying the first format channel matrix to the conversion matrix and outputting the second format channel matrix.

도 5를 예로 들면, 다채널 오디오 데이터의 저작자는 10.2채널 포맷(제1 포맷)으로 다채널 오디오 데이터를 제작(510)했다고 가정하고, 다채널 오디오 데이터의 재생 환경은 5.1채널 포맷(제2 포맷)이라고 가정하자. 이 경우 포맷 변환(550)을 참고하면, 오디오 데이터 변환부는 제1 포맷 채널 매트릭스(580)(채널 매트릭스의 각 원소는 각 채널에 대응된다)를 변환 매트릭스(570)에 적용하여 제2 포맷 채널 매트릭스(560)를 출력하는 방식으로 포맷을 변환한다. 따라서, 도 5의 경우, 제1 포맷인 10.2채널 포맷은 열두 개의 채널을 가지고 제2 포맷인 5.1채널 포맷은 여섯개의 채널을 가지므로, 포맷 변환 방식에 대한 정보를 담은 변환 매트릭스들(530,540)은 6행 12열로 구성된다. Referring to FIG. 5 as an example, it is assumed that the author of multi-channel audio data has produced 510 multi-channel audio data in a 10.2-channel format (first format), and the reproduction environment of the multi-channel audio data is a 5.1-channel format (second format). ). In this case, referring to the format conversion 550, the audio data conversion unit applies the first format channel matrix 580 (each element of the channel matrix corresponds to each channel) to the conversion matrix 570 to obtain a second format channel matrix. Convert the format in a way that outputs 560. Accordingly, in the case of FIG. 5, since the 10.2 channel format, which is the first format, has twelve channels, and the 5.1 channel format, which is the second format, has six channels, the conversion matrices 530 and 540 containing information on the format conversion method are It consists of 6 rows and 12 columns.

또한, 오디오 데이터 변환부는 재생 구간별로 설정된 포맷 변환 방식에 맞추어 변환 매트릭스(570)를 교체하여 변환할 수 있다. 예를 들어, 도 5의 동적 포맷 변환 정보(520)에서, 재생 구간 t=0에서 t=t₁까지 변환 방식 K가 설정되어 있으므로, 해당 재생 구간에서 오디오 데이터 변환부는 변환 매트릭스(570)를 변환 방식 K에 대한 변환 매트릭스(530)로 설정하여 변환을 수행한다. 재생 구간 t=t₁에서 t=t₂까지 변환 방식 M이 설정되어 있으므로, 해당 재생 구간에서 오디오 데이터 변환부는 변환 매트릭스(570)를 변환 방식 M에 대한 변환 매트릭스(540)로 설정하여 변환을 수행한다. In addition, the audio data conversion unit may convert the conversion matrix 570 by replacing the conversion matrix 570 according to a format conversion method set for each reproduction section. For example, in the dynamic format conversion information 520 of FIG. 5, _{since the conversion method K is set from the playback period t=0 to t=t 1} , the audio data conversion unit transforms the conversion matrix 570 in the playback period. Transformation is performed by setting the transformation matrix 530 for method K. Since the conversion method M is set from the playback period t=t ₁ to t=t ₂ , the audio data conversion unit performs the conversion by setting the conversion matrix 570 as the conversion matrix 540 for the conversion method M in the corresponding playback period. do.

도 6은 본 발명의 일실시예에 따른 오디오 메타데이터 제공 장치가 동적 포맷 변환 정보가 포함된 오디오 메타데이터를 제공하는 동작을 나타낸 흐름도이다.6 is a flowchart illustrating an operation of providing audio metadata including dynamic format conversion information by an audio metadata providing apparatus according to an embodiment of the present invention.

단계(610)에서, 오디오 메타데이터 제공 장치는 동적 포맷 변환 정보를 식별한다. 동적 포맷 변환 정보는 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 복수의 포맷 변환 방식들이 다채널 오디오 데이터의 재생 구간 별로 설정된 것이다. 본 발명의 일실시예에 따르면, 오디오 메타데이터 제공장치는 다채널 오디오 데이터의 저작자에게서 동적 포맷 변환 정보를 식별할 수 있다. 본 발명의 다른 실시예에 따르면, 다채널 오디오 데이터 재생 장치는 하나 이상의 오디오 메타데이터로부터 복수의 동적 포맷 변환 정보를 식별할 수 있다.In step 610, the apparatus for providing audio metadata identifies dynamic format conversion information. In the dynamic format conversion information, a plurality of format conversion methods between a first format set by an author of the multi-channel audio data and a second format based on a reproduction environment of the multi-channel audio data are set for each reproduction section of the multi-channel audio data. According to an embodiment of the present invention, the apparatus for providing audio metadata may identify dynamic format conversion information from an author of multi-channel audio data. According to another embodiment of the present invention, the multi-channel audio data reproducing apparatus may identify a plurality of dynamic format conversion information from one or more audio metadata.

단계(620)에서, 오디오 메타데이터 제공 장치는 식별된 동적 포맷 변환 정보를 포함하는 오디오 메타데이터를 생성한다. 이때, 오디오 메타데이터 제공 장치는 오디오 메타데이터에 일반적으로 포함되는 정보(예를 들어, 저작자, 음반 제목, 출시 년도 등)를 포함할 수 있다. 본 발명의 일실시예에 따르면, 오디오 메타데이터 제공 장치는 복수의 동적 포맷 변환 정보를 오디오 메타데이터에 포함할 수 있다. 본 발명의 일실시예에 따르면, 오디오 메타데이터 제공 장치는 동적 포맷 변환 정보의 각 포맷 변환 방식을 매트릭스의 형태(예를 들어, 도 5의 변환 매트릭스들(530, 540))로 오디오 메타데이터에 기록할 수 있다.In step 620, the apparatus for providing audio metadata generates audio metadata including the identified dynamic format conversion information. In this case, the apparatus for providing audio metadata may include information (eg, author, album title, release year, etc.) generally included in the audio metadata. According to an embodiment of the present invention, the apparatus for providing audio metadata may include a plurality of dynamic format conversion information in the audio metadata. According to an embodiment of the present invention, the audio metadata providing apparatus converts each format conversion method of dynamic format conversion information into audio metadata in the form of a matrix (for example, conversion matrices 530 and 540 of FIG. 5). Can be recorded.

도 7은 본 발명의 일실시예에 따른 다채널 오디오 데이터 재생 장치가 다채널 오디오 데이터의 포맷을 변환한 이후 이를 재생하는 동작을 나타낸 흐름도이다.7 is a flowchart illustrating an operation of reproducing multi-channel audio data after converting the format of the multi-channel audio data by the apparatus for reproducing multi-channel audio data according to an embodiment of the present invention.

단계(710)에서, 다채널 오디오 데이터 재생 장치는 다채널 오디오 데이터 및 오디오 메타데이터를 수신한다. 본 발명의 일실시예에 따르면, 오디오 메타데이터는 다채널 오디오 데이터와 함께 제공되거나 별도로 제공될 수 있다. 본 발명의 일실시예에 따르면, 오디오 메타데이터는 실시간으로 다채널 오디오 데이터 재생 장치로 수신되거나, 또는 다채널 오디오 데이터 재생 장치에 미리 전송되어 다채널 오디오 데이터 재생 장치의 버퍼, 메모리 같은 저장 매체에 저장될 수 있다. 또한 오디오 메타데이터는 CD-ROM, CD-RW, DVD-R, DVD-RW 등과 같은 광 기록 매체에 저장되어 수신될 수 있다. In step 710, the multi-channel audio data reproducing apparatus receives multi-channel audio data and audio metadata. According to an embodiment of the present invention, audio metadata may be provided together with or separately provided with multi-channel audio data. According to an embodiment of the present invention, audio metadata is received by a multi-channel audio data reproducing apparatus in real time, or transmitted in advance to a multi-channel audio data reproducing apparatus, and stored in a storage medium such as a buffer or a memory of the multi-channel audio data reproducing apparatus. Can be saved. In addition, audio metadata may be stored and received in an optical recording medium such as CD-ROM, CD-RW, DVD-R, DVD-RW, or the like.

단계(720)에서, 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷이 다를 경우, 다채널 오디오 데이터 재생 장치는 오디오 메타데이터에서 동적 포맷 변환 정보를 식별하게 된다. 본 발명의 일실시예에 따르면, 오디오 메타데이터는 하나 이상의 동적 포맷 변환 정보를 포함할 수 있으며, 이 경우, 다채널 오디오 데이터 재생 장치는 다채널 오디오 데이터 재생 장치의 제2 포맷에 대응하는 동적 포맷 변환 정보를 식별한다. 다채널 오디오 데이터의 재생 환경은 다채널 오디오 데이터가 재생되는 스피커들의 레이아웃에 기초하여 결정된다. In step 720, if the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are different, the multi-channel audio data reproducing apparatus provides dynamic format conversion information in the audio metadata. Will be identified. According to an embodiment of the present invention, the audio metadata may include one or more dynamic format conversion information. In this case, the multi-channel audio data reproducing apparatus is a dynamic format corresponding to the second format of the multi-channel audio data reproducing apparatus. Identify conversion information. The reproduction environment of the multi-channel audio data is determined based on the layout of speakers in which the multi-channel audio data is reproduced.

오디오 메타데이터에서 식별한 동적 포맷 변환 정보는 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷 간의 복수의 포맷 변환 방식들이 다채널 오디오 데이터의 재생 구간 별로 설정된 것이다. 동적 포맷 변환 정보의 복수의 포맷 변환 방식이 설정된 재생 구간은 서로 동일한 재생 길이를 가지거나 서로 다른 재생 길이를 가질 수 있다. 동적 포맷 변환 정보의 재생 구간 별로 설정된 포맷 변환 방식은 서로 다르게 설정되거나 또는 부분적으로 반복되게 설정될 수 있다. The dynamic format conversion information identified in the audio metadata includes a plurality of format conversion methods between the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data. It is set very much. Playback sections in which a plurality of format conversion schemes of the dynamic format conversion information are set may have the same playback length or different playback lengths. Format conversion methods set for each reproduction section of the dynamic format conversion information may be set differently or may be set to be partially repeated.

단계(730)에서, 다채널 오디오 데이터 재생 장치는 식별한 동적 포맷 변환 정보에 의하여 다채널 오디오 데이터의 저작자가 설정한 제1 포맷에서 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷으로 변환을 수행한다. 본 발명의 일실시예에 따르면, 변환하는 재생 구간은 동적 변환 정보에 의하여 서로 동일한 재생 길이를 가지거나 서로 다른 재생 길이를 가질 수 있다. 본 발명의 일실시예에 따르면, 포맷 변환 방식은 다채널 오디오 데이터의 재생 구간 별로 서로 다르게 변환하거나 부분적으로 반복되게 변환할 수 있다. In step 730, the multi-channel audio data reproducing apparatus performs conversion from the first format set by the author of the multi-channel audio data to a second format based on the reproduction environment of the multi-channel audio data according to the identified dynamic format conversion information. do. According to an embodiment of the present invention, the reproduction sections to be converted may have the same reproduction length or different reproduction lengths according to the dynamic conversion information. According to an embodiment of the present invention, the format conversion method may be converted differently or partially repeated for each reproduction section of multi-channel audio data.

단계(740)에서, 다채널 오디오 데이터 재생 장치는 변환된 다채널 오디오 데이터를 재생한다. 다채널 오디오 데이터 재생 장치는 제2 포맷으로 변환된 다채널 오디오 데이터에 대하여 각 채널에 대응되는 스피커로 오디오 데이터를 출력한다. 본 발명의 일실시예에 따르면, 다채널 오디오 데이터의 저작자가 설정한 제1 포맷과 다채널 오디오 데이터의 재생 환경에 기초한 제2 포맷이 서로 동일할 경우, 다채널 오디오 데이터 재생 장치는 변환을 수행하지 않고 다채널 오디오 데이터를 재생할 수 있다. In step 740, the multi-channel audio data reproducing apparatus reproduces the converted multi-channel audio data. The multi-channel audio data reproducing apparatus outputs audio data to a speaker corresponding to each channel with respect to multi-channel audio data converted into a second format. According to an embodiment of the present invention, when the first format set by the author of the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data are the same, the multi-channel audio data reproducing apparatus performs conversion. Multi-channel audio data can be played back

본 발명의 실시 예에 따른 방법들은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. Methods according to an embodiment of the present invention may be implemented in the form of program instructions that can be executed through various computer means and recorded in a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, etc. alone or in combination. The program instructions recorded in the medium may be specially designed and configured for the present invention, or may be known and usable to those skilled in computer software.

이상과 같이 본 발명은 비록 한정된 실시예와 도면에 의해 설명되었으나, 본 발명은 상기의 실시예에 한정되는 것은 아니며, 본 발명이 속하는 분야에서 통상의 지식을 가진 자라면 이러한 기재로부터 다양한 수정 및 변형이 가능하다.As described above, although the present invention has been described by limited embodiments and drawings, the present invention is not limited to the above embodiments, and various modifications and variations from these descriptions are those of ordinary skill in the field to which the present invention pertains. This is possible.

그러므로, 본 발명의 범위는 설명된 실시예에 국한되어 정해져서는 아니 되며, 후술하는 특허청구범위뿐 아니라 이 특허청구범위와 균등한 것들에 의해 정해져야 한다.Therefore, the scope of the present invention is limited to the described embodiments and should not be defined, but should be defined by the claims to be described later as well as those equivalent to the claims.

310 : 동적 포맷 변환 정보
320 : 포맷 변환 방식 K
330 : 포맷 변환 방식 M
340 : 포맷 변환 방식 L
310: Dynamic format conversion information
320: Format conversion method K
330: format conversion method M
340: format conversion method L

Claims

Identifying dynamic format conversion information between a first format set by an author of the multi-channel audio data in the multi-channel audio data and a second format based on a reproduction environment of the multi-channel audio data; And
Generating audio metadata including the identified dynamic format conversion information
Including,
The reproduction environment of the multi-channel audio data,
The multi-channel audio data is determined based on a layout of speakers to be reproduced, and the layout of the speakers includes a position of each of the speakers and a number of channels corresponding to the speakers,
When the second format based on the reproduction environment of the multi-channel audio data is different from the first format, dynamic conversion information is selected according to the second format based on the reproduction environment of the multi-channel audio data, and the selected dynamic format conversion information According to a method for providing audio metadata, a first format of multi-channel audio data is converted into a second format.

The method of claim 1,
The dynamic format conversion information,
Audio metadata providing method including a matrix for converting the first format to a second format.

The method of claim 1,
The dynamic format conversion information,
A method of providing audio metadata that is set differently or partially repeated for each reproduction section of multi-channel audio data.

The method of claim 1,
The layout of the speakers,
Audio metadata providing method including the location of each of the speakers and the number of the speakers.

Identifying dynamic format conversion information between the first format set by the author of the multi-channel audio data and the second format based on a reproduction environment of the multi-channel audio data from the multi-channel audio data and audio metadata produced according to the first format ;
Converting multi-channel audio data of the first format into a second format using the dynamic format conversion information; And
Reproducing multi-channel audio data converted to the second format
Including,
The reproduction environment of the multi-channel audio data,
The multi-channel audio data is determined based on a layout of speakers to be reproduced, and the layout of the speakers includes a position of each of the speakers and a number of channels corresponding to the speakers,
When the second format based on the reproduction environment of the multi-channel audio data is different from the first format, dynamic conversion information is selected according to the second format based on the reproduction environment of the multi-channel audio data, and the selected dynamic format conversion information According to a method for reproducing multi-channel audio data, a first format of multi-channel audio data is converted into a second format.

The method of claim 5,
The dynamic format conversion information,
A method of reproducing multi-channel audio data including a matrix for converting the first format to a second format.

The method of claim 5,
The dynamic format conversion information,
A method of reproducing multi-channel audio data that is set differently or partially repeated for each reproduction section of multi-channel audio data.

The method of claim 5,
The layout of the speakers,
A method of reproducing multi-channel audio data, including positions of each of the speakers and the number of the speakers.

The method of claim 5,
The reproduction section of the multi-channel audio data,
A method of reproducing multi-channel audio data having the same reproduction length or different reproduction lengths.

The method of claim 9,
The format conversion method for the multi-channel audio data,
A method of reproducing multi-channel audio data that converts differently or partially repetitively for each reproduction section of multi-channel audio data.

Multi-channel audio data composed of one or more channels, and
Audio metadata including dynamic format conversion information between the first format set by the author of the multi-channel audio data in the multi-channel audio data and the second format based on the reproduction environment of the multi-channel audio data
Is recorded,
The reproduction environment of the multi-channel audio data,
The multi-channel audio data is determined based on a layout of speakers to be reproduced, and the layout of the speakers includes a position of each of the speakers and a number of channels corresponding to the speakers,
When the second format based on the reproduction environment of the multi-channel audio data is different from the first format, dynamic conversion information is selected according to the second format based on the reproduction environment of the multi-channel audio data, and the selected dynamic format conversion information A computer-readable recording medium in which a first format of multi-channel audio data is converted into a second format according to the method.