WO2019147041A1 - Method for generating binaural stereo audio and apparatus therefor - Google Patents
Method for generating binaural stereo audio and apparatus therefor Download PDFInfo
- Publication number
- WO2019147041A1 WO2019147041A1 PCT/KR2019/001019 KR2019001019W WO2019147041A1 WO 2019147041 A1 WO2019147041 A1 WO 2019147041A1 KR 2019001019 W KR2019001019 W KR 2019001019W WO 2019147041 A1 WO2019147041 A1 WO 2019147041A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- layer
- binaural
- output
- dimensional
- stereo
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 45
- 238000012545 processing Methods 0.000 claims abstract description 16
- 230000005236 sound signal Effects 0.000 claims description 7
- 210000004556 brain Anatomy 0.000 claims 2
- 230000000694 effects Effects 0.000 description 39
- 238000010586 diagram Methods 0.000 description 13
- 230000006870 function Effects 0.000 description 9
- 238000004891 communication Methods 0.000 description 5
- 238000013016 damping Methods 0.000 description 4
- 238000003860 storage Methods 0.000 description 4
- 238000004091 panning Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 125000002066 L-histidyl group Chemical group [H]N1C([H])=NC(C([H])([H])[C@](C(=O)[*])([H])N([H])[H])=C1[H] 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000004886 head movement Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/002—Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
- H04S3/004—For headphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
Definitions
- the present invention relates to a technique for generating binaural stereo audio, and more particularly, to a technique for generating a binaural stereo audio that can be reproduced in general by combining binaural output based on a three-dimensional layer and audio output based on a plane layer .
- contents including multichannel audio signals such as 7.1 channel, 10.2 channel, 11.1 channel and 22.2 channel more than 5.1 channel is increasing.
- user terminals possessed by users using contents can reproduce audio signals in stereo form, such as stereo speakers, headphones and earphones, high-quality multi-channel audio signals need to be converted into stereo-type audio signals .
- Korean Patent Laid-Open No. 10-2015-0013073 discloses a technique related to " binaural rendering method and apparatus for multi-channel audio signal ".
- a binaural stereo audio generating method comprising: generating a 3D layer binaural output by performing 3D layer binaural encoding corresponding to a 3D binaural layer; Performing audio processing corresponding to a planar layer to produce a planar layer audio output; And combining the three-dimensional layer binaural output and the planar layer audio output to produce a binaural stereo output.
- the plane layer performs a surround layer binaural encoding to generate a surround layer binaural output, a surround layer that provides the generated surround layer binaural output to the plane layer audio output, And a proximity stereo layer for generating the plane layer audio output corresponding to the stereo signal.
- the three-dimensional layer binaural output corresponds to a three-dimensional vector for a binaural point located on an eight-channel-based three-dimensional cubic (Cubic) composed of four up channels and four down channels Lt; / RTI >
- generating a binaural stereo output comprises applying a three-dimensional weight to the three-dimensional layer binaural output, applying a plane weight to the planar layer audio output, wherein the three-dimensional weight and the plane weight They can be set independently of each other.
- the step of generating a binaural stereo output may include adding the sub-woofer output corresponding to the sub-woofer layer together with the three-dimensional layer binaural output and the planar layer audio output to generate the binaural stereo output have.
- the cubic cubic can be generated by changing the positions of the eight dynamic speakers corresponding to the vertexes of the cubic cubic, corresponding to the size parameters for the 3D binaural layer.
- the three-dimensional vector is included in the three-dimensional cubic, and can be generated based on a reference listening point corresponding to the center of the two-dimensional plane corresponding to the surround layer.
- the generating of the 3D layer binaural output may include generating the 3D layer binaural output by applying direction information of the 3D vector to the 3D cubic that is rotated according to the head tracking information,
- the head tracking information may be obtained corresponding to at least one of a tracking input based on a head tracking module and a user input based on a user interface.
- the cubic cubic can be rotated corresponding to the rotation parameter of at least one of pan, tilt, and roll.
- planar layer may be located between the four up channels and the four down channels.
- the apparatus for generating binaural stereo audio may perform a three-dimensional layer binaural encoding corresponding to a three-dimensional binaural layer to generate a three-dimensional layer binaural output,
- the plane layer performs a surround layer binaural encoding to generate a surround layer binaural output, a surround layer that provides the generated surround layer binaural output to the plane layer audio output, And a proximity stereo layer for generating the plane layer audio output corresponding to the stereo signal.
- the three-dimensional layer binaural output corresponds to a three-dimensional vector for a binaural point located on an eight-channel-based three-dimensional cubic (Cubic) composed of four up channels and four down channels Lt; / RTI >
- the processor applies a three-dimensional weight to the three-dimensional layer binaural output, applies a plane weight to the plane layer audio output, and the three-dimensional weight and the plane weight may be set independently of each other.
- the processor may generate the binaural stereo output by summing the sub-woofer output corresponding to the sub-woofer layer with the three-dimensional layer binaural output and the planar layer audio output.
- the cubic cubic can be generated by changing the positions of the eight dynamic speakers corresponding to the vertexes of the cubic cubic, corresponding to the size parameters for the 3D binaural layer.
- the three-dimensional vector is included in the three-dimensional cubic, and can be generated based on a reference listening point corresponding to the center of the two-dimensional plane corresponding to the surround layer.
- the processor applies the direction information of the three-dimensional vector to the rotated cubic bikes corresponding to the head tracking information to generate the three-dimensional layer binaural output, wherein the head tracking information is based on the head tracking module A tracking input, and a user input based on a user interface.
- the cubic cubic can be rotated corresponding to the rotation parameter of at least one of pan, tilt, and roll.
- planar layer may be located between the four up channels and the four down channels.
- the present invention can provide a binaural engine which can easily adjust or adjust a sound element for generating an effective binaural effect.
- the present invention can improve compatibility with various kinds of contents based on natural upmix and downmix.
- FIG. 1 is a view illustrating a structure of a binaural engine according to an embodiment of the present invention.
- FIG. 2 is a view showing a structure of a conventional binaural engine.
- FIG. 3 is a block diagram illustrating a binaural stereo audio generating apparatus according to an embodiment of the present invention.
- FIG. 4 is a diagram illustrating a detailed structure for generating a three-dimensional layer binaural output according to an embodiment of the present invention.
- FIG. 5 is a diagram illustrating an example of an 8-channel based three-dimensional cubic according to the present invention.
- FIG. 6 is a diagram illustrating an example of 3D cubic with various sizes generated by changing the positions of dynamic speakers according to the present invention.
- FIG. 7 is a diagram showing an example of a three-dimensional vector according to the present invention.
- FIG. 8 is a diagram illustrating an example of applying direction information of a three-dimensional vector to rotated cubic cubes corresponding to head tracking information according to the present invention.
- FIG. 9 is a diagram showing an example of a rotation parameter according to the present invention.
- FIG. 10 illustrates a detailed structure for generating a surround layer binaural output according to an exemplary embodiment of the present invention.
- FIG. 11 is a diagram illustrating an example of a 5-channel based surround layer according to the present invention.
- FIG. 12 is a diagram illustrating a detailed structure for generating a stereo signal according to an embodiment of the present invention.
- FIG. 13 to 14 are views showing an example of a proximity stereo layer according to the present invention.
- 15 is a diagram illustrating an example of a plane layer positioned between an up channel and a down channel of a three-dimensional cubic according to the present invention.
- 16 to 17 are views showing an example of a proximity stereo layer used as a part of a channel of the surround layer according to the present invention.
- FIG. 18 is a diagram illustrating a detailed structure for generating a subwoofer output according to an embodiment of the present invention.
- FIG. 19 is a view showing an example of a structure of a combination of a 3D binaural layer, a plane layer, and a sub-woofer layer according to the present invention.
- 20 is a diagram showing an example of a sound represented by a conventional binaural engine.
- 21 is a view showing an example of a sound represented by the binaural engine according to the present invention.
- 22 is a flowchart illustrating a method of generating binaural stereo audio according to an embodiment of the present invention.
- FIG. 1 is a view showing a structure of a binaural engine according to an embodiment of the present invention
- FIG. 2 is a view showing a structure of a conventional binaural engine.
- a conventional binaural engine decodes a binaural encoded binaural output for a multi-channel audio file through a binaural encoder 210 through a dedicated player 220 to provide.
- the binaural encoding according to the related art uses a fixed speaker disposed at a certain distance from the listening position, it is difficult to adjust the position of the speaker to increase or decrease the image of the space.
- the conventional binaural engine is an engine specialized for content including both video and audio like surround movie contents.
- the binaurally encoded content can be reproduced only by using the dedicated player 220, the efficiency may be reduced in terms of utilization. For example, it is necessary to deliver sufficient loudness to the listener according to the characteristics of the music contents.
- the binaural encoder 210 shown in FIG. 2 has a limitation in providing a sound effect optimized for music contents.
- the conventional binaural engine uses only one encoder specialized for the effect mainly used according to the contents, it is impossible to apply various effects to the production. For example, since music contents often do not use subwoofers in nature, it has been rarely attempted to provide a bass reproduction element according to a subwoofer to music contents through a conventional binaural engine.
- the binaural engine according to an embodiment of the present invention shown in FIG. 1 mixes output including various binaural sound effects and output from audio processing to include more dramatic directing It is possible to generate an internal stereo output.
- binaural encoding is performed with a binaural encoder 111 corresponding to a multi-channel three-dimensional binaural layer 110 as shown in FIG. 1 to generate a three-dimensional layer binaural output .
- binaural encoding may be performed with the binaural encoder 121 corresponding to the surround layer 120 to generate a surround layer binaural output.
- the stereo bus 131 corresponding to the proximity stereo layer 130 may generate an audio output corresponding to the stereo signal. It is also possible to generate the subwoofer output on the LFE bus 141, which corresponds to the subwoofer layer 140.
- the respective outputs that is, the three-dimensional layer binaural output, the surround layer binaural output, the audio output corresponding to the stereo signal, and the sub-woofer output are summed through the binaural mixer 150, Output can be generated.
- the binaural stereo output coupled through the binaural mixer 150 can be output to the listener or user in a reproducible form via the general-purpose decoder.
- the binaural engine according to the present invention mixes outputs from various encoders to generate binaural stereo audio, so that the binaural engine can be used in a general form not specific to specific contents, Compatibility can be provided.
- a 3D layer binaural output, a stereo output, and a subwoofer output together with a surround layer binaural output that can be generated based on the motion of an object included in the image It is possible to provide a more dramatic sound production.
- music content that includes audio only, it provides dynamic music by mixing a stereo output or a subwoofer output with a 3D layer binaural output based on a 3D binaural layer You may.
- FIG. 3 is a block diagram illustrating a binaural stereo audio generating apparatus according to an embodiment of the present invention.
- the apparatus for generating binaural stereo audio includes a communication unit 310, a processor 320, and a memory 330.
- the communication unit 310 transmits and receives information necessary for generating binaural stereo audio through a communication network such as a network.
- the communication unit 310 receives the source or content that can be input for generating binaural stereo audio, head tracking information to be applied for binaural encoding, and information related to user input, It can provide binaural stereo audio equivalent to an in-built stereo output.
- the processor 320 performs three-dimensional layer binaural encoding corresponding to the three-dimensional binaural layer to generate a three-dimensional layer binaural output.
- a binaural encoder 420 corresponding to a three-dimensional cubic method may be used to generate a three-dimensional binaural layer, Dimensional layer binaural encoding corresponding to a plurality of channels included in the binaural layer.
- the 3D binaural layer may include four up channels 411 and four down channels 412 corresponding to 8-channel based cubic cubes.
- the three-dimensional layer binaural output 430 may correspond to an output generated by binaural encoding the 8-channel based audio, and may be output corresponding to 2 channels as shown in FIG.
- the two channels corresponding to the three-dimensional layer binaural output 430 may correspond to the left channel and the right channel, respectively.
- the 3-dimensional binaural layer may not be limited thereto. That is, the binaural engine or the binaural engine according to an embodiment of the present invention may include another usable three-dimensional binaural layer or a three-dimensional binaural layer to be developed in the future.
- the three-dimensional layer binaural output corresponds to a three-dimensional vector for a binaural point located on an eight-channel-based three-dimensional cubic (Cubic) composed of four up channels and four down channels Lt; / RTI >
- an 8-channel based three-dimensional cubic may include four dynamic speakers 511 to 514 corresponding to four up channels and four dynamic speakers corresponding to four down channels (515 to 518) may be a hexahedron structure.
- the positions of the eight dynamic speakers 511 to 518 can be changed, the range of the binaural effect caused by the three-dimensional cubic can also be changed dynamically.
- an immersive sound may be implemented with eight dynamic speakers by generating three-dimensional cubic using a conventional binaural Vbap (Vector base amplitude panning) scheme. That is, a position value for X, Y, and Z is given to each of the eight dynamic speakers, and a vector-based virtual track point based on the center of cubic cubic can be expressed. At this time, the virtual track point can be represented corresponding to the parameter value included in the head tracking information.
- Vbap Vector base amplitude panning
- the 3D cubic can be generated by changing the position of the eight dynamic speakers corresponding to the vertices of the 3D cubic by changing the size parameter for the 3D binaural layer. That is, it is possible to efficiently generate three-dimensional cubic by changing the position of dynamic speakers of the variable system rather than the fixed system freely according to the size parameter.
- the 3D cubic bins 610, 620, and 630 having various ranges as shown in FIG. 6 can be generated.
- the three-dimensional vector is included in the three-dimensional cubic and can be generated based on the reference listening point corresponding to the center of the two-dimensional plane corresponding to the surround layer.
- a reference listening point 700 which virtually expresses the position of a user or a listener listening to binaural stereo audio, is composed of a three-dimensional cubic 710 having eight dynamic speakers as vertices, But may be located at a center portion on the surround layer 720.
- the binaural point 730 is located on the upper surface of the cubic 710, as shown in FIG. 7, a three-dimensional vector 740 corresponding to the three-dimensional layer binaural output 7 in the direction from the reference listening point 700 shown in FIG. 7 to the binaural point 730.
- the surround layer 720 corresponds to an element for creating a surround image corresponding to a surround effect. In FIG. 7, But it may not be limited to a planar shape.
- the binaural point 730 when the binaural point 730 is located on the 3D cubic 710 higher than the surround layer 720 where the reference listening point 700 is located, Can be formed at the top of the. Also, when the binaural point 730 is located on the three-dimensional cubic 710 lower than the surround layer 720 where the reference listening point 700 is located, the output sound may be formed at the bottom of the listener.
- the direction information of the three-dimensional vector can be applied to the rotated cubic by corresponding to the head tracking information to generate the three-dimensional layer binaural output. That is, since the binaural point is set based on the listener's head corresponding to the reference listening point, the position of the binaural point on the cubic bicycle can also be changed if the listener's head position or angle is changed.
- the 3D cubic 710 shown in FIG. 7 is rotated as shown in FIG. 8 in accordance with the head tracking information.
- the direction information of the three-dimensional vector 740 shown in FIG. 7 can be directly applied to the three-dimensional cubic as shown in FIG. 8 to detect the position of the changed binaural point according to the rotation.
- the head tracking information corresponds to data obtained by tracking the head movement of the user or listener, and may be obtained corresponding to at least one of a tracking input based on a separate head tracking module and a user input based on the user interface.
- the head tracking module can measure the distance or angle of movement of the user's head and generate and transmit the head tracking information.
- the head tracking information may be artificially provided by the user or the listener through the user interface. That is, the user or the listener may input the head tracking information based on the user interface regardless of whether the head tracking information is received by the head tracking module in order to artificially rotate the spatial image. At this time, the user or the listener may input and modify the head tracking information while listening to the mixing process of generating the binaural stereo output or the binaural stereo output varying according to the inputted information.
- the cubic cubic can be rotated corresponding to the rotation parameter of at least one of pan, tilt, and roll.
- the listener rotates the head corresponding to at least one of pan, tilt, and roll as shown in FIG. 9, the value is obtained as a rotation parameter, .
- the effect of rotating the three-dimensional cubic according to the head tracking information or moving it in the up, down, left, and right directions can be mixed with the flat layer audio output in the future to generate the binaural stereo output. Therefore, it is possible to produce an immersive effect based on head tracking more efficiently than a conventional method of rotating or moving a surround layer, a proximity stereo layer, a sub-woofer layer, or the like corresponding to a flat layer.
- the processor 320 performs audio processing corresponding to the plane layer to produce a plane layer audio output.
- the planar layer corresponds to a layer having a structure different from that of the three-dimensional binaural layer, and may correspond to an element that produces an image corresponding to a surround effect or a stereo effect.
- the plane layer performs a surround layer binaural encoding to generate a surround layer binaural output, a surround layer to provide the generated surround layer binaural output as a plane layer audio output, and a surround layer to provide a stereo signal Or a proximity stereo layer that produces a corresponding flat layer audio output.
- a binaural encoder 1020 may be used to perform surround layer binaural encoding corresponding to a 5-channel or 7-channel 1010 surround layer.
- two channels corresponding to a proximity stereo layer may be included in the surround layer to perform 7-channel based surround layer binaural encoding.
- the surround layer may correspond to a structure including five speakers 1111 to 1115, for example, as shown in Fig.
- the surround layer binaural output 1030 may correspond to a binaural point located on the surround layer. If the listener is listening to a sound at a reference listening point located at the center of the surround layer, the surround layer binaural output 1030 is binaurally encoded as if it were sounding at a binaural point on the surround layer, Can be generated.
- the surround layer binaural output 1030 may be output corresponding to two channels as shown in FIG.
- the two channels corresponding to the surround layer binaural output 1030 may correspond to the left channel and the right channel, respectively.
- the channel of the surround layer is not limited to five channels or seven channels (1010).
- the surround layer is shown in a rectangular plane shape, but it is not limited thereto and can be expressed in various forms such as a line thickness, a planar shape, and a distance from a reference listening point.
- audio processing may be performed corresponding to the proximity stereo layer of the two channels 1210 based on a Stereo Bus 1220. That is, a stereo signal 1230 corresponding to a plane layer audio output may correspond to an output produced by processing 2-channel 1210 based stereo audio, and may be output corresponding to two channels.
- the proximity stereo layer corresponds to an element for producing a stereo image corresponding to the stereo effect, and may be included as a part of the surround layer.
- a surround stereo layer corresponding to two speakers 1311, 1312, 1411 and 1412 on a surround layer based on five speakers is included so that a total of seven Or a layer structure including speakers.
- the proximity stereo layer may be disposed at a distance from the reference listening point 1300 located on the surround layer.
- a proximity stereo layer may be used as the left and right side speakers of the reference listening point 1400.
- the stereo signal output corresponding to the proximity stereo layer can provide a damping feeling that is difficult to produce with spatial parameters used in binaural encoding.
- the binaural stereo output according to an embodiment of the present invention may provide a damping feeling while providing an immersive effect by binaural encoding.
- a planar layer audio output corresponding to a surround layer binaural output or a planar layer audio output corresponding to a stereo signal can be used for output corresponding to an output containing only a different sound effect Lt; / RTI > That is, the plane layer audio output may include various values other than the output corresponding to the three-dimensional layer, rather than the three-dimensional layer binaural output.
- the planar layer may be located between four up channels and four down channels corresponding to cubic cubic.
- the planar layers 1510 to 1530 include four up-channels included in a cubic cubic corresponding to a three-dimensional binaural layer and four down- May be located between the channels.
- the four up channels may correspond to four speakers located at the top of the cubic cubic
- the four down channels may correspond to the four speakers located at the bottom of the cubic cubic.
- the flat layers 1510 to 1530 may be located within a height range of a hexahedron corresponding to cubic cubic.
- each of the speakers included in the surround layer or the adjacent stereo layer corresponding to the flat layers 1510 to 1530 may be located between the four up channels included in the 3D cubic and the four down channels .
- the flat layers 1510 to 1530 are shown in the form of planes for convenience of explanation in FIG. 15, but the shape of the planar layers according to an embodiment of the present invention may not be limited to the planar form.
- FIGS. 16 and 17 show a structure in which the three-dimensional cubic and the flat layer 1610 corresponding to the three-dimensional binaural layer are viewed from above, respectively.
- the speaker of the proximity stereo layer included in the flat layer 1610 1622 and 1622 are also located between the up channel and the down channel of the cubic cubic.
- the processor 320 combines the three-dimensional layer binaural output and the planar layer audio output to produce a binaural stereo output. That is, by mixing an immersive element by a three-dimensional layer binaural output and a near-playback element and an object element by a flat layer audio output, a binaural stereo output capable of generating a binaural effect can be generated have.
- a binaural stereo output may be generated using only a three-dimensional layer binaural output.
- the subwoofer output corresponding to the subwoofer layer can be added together with the three-dimensional layer binaural output and the planar layer audio output to generate a binaural stereo output.
- the subwoofer outputs it is possible to maximize the immersive effect corresponding to the binaural stereo output, and to produce a dynamic bass reproduction element.
- a single channel or two-channel 1810 signal included in a sub-woofer layer may be processed based on an LFE bus (Low Frequency Effects Bus) 1820. That is, the subwoofer output 1830 may correspond to an output produced by processing a single channel or two channel (1810) based audio, and may correspond to a single channel or two channels as shown in FIG.
- LFE bus Low Frequency Effects Bus
- the subwoofer layer may correspond to a single channel, such as 5.1 channel, 7.1 channel and 11.1 channel, or may correspond to two channels, such as 10.2 channel and 22.2 channel.
- the sub-woofer layer can be located separately from the three-dimensional cubic or planar layer corresponding to the three-dimensional binaural layer.
- the sub-woofer layer 1940 is located at a distance from the three-dimensional cubic 1910, the surround layer 1920, and the proximity stereo layer 1930 corresponding to the three-dimensional binaural layer Can be located.
- the structure shown in FIG. 19 corresponds to an embodiment, and is not limited to a structure in which respective layers are combined.
- the three-dimensional weight can be applied to the three-dimensional layer binaural output
- the plane weight can be applied to the plane layer audio output
- the three-dimensional weight and the plane weight can be set independently of each other. That is, it is possible to generate a more dramatic binaural stereo output by adjusting the size of the layer-by-layer output and then performing the mixing, thereby maximizing the binaural effect.
- the present invention can support natural upmix and downmix functions based on the processor 320 having the above-described functions, it is possible to improve compatibility between contents supporting various kinds of sounds. For example, you can downmix a surround image represented by three-dimensional cubic into a surround layer. Also, the surround layer may be downmixed back to the adjacent stereo layer. As described above, by downmixing based on the area, the sound quality of the sound can be preserved more effectively.
- the memory 330 stores the 3D layer binaural output and the plane layer audio output.
- the memory 330 stores various information generated in the process of generating the binaural stereo audio according to an embodiment of the present invention, as described above.
- the memory 330 may be configured independently of the binaural stereo audio generation device to support the binaural stereo audio generation function. At this time, the memory 330 may operate as a separate mass storage and may include a control function for performing operations.
- a binaural stereo audio generating apparatus can store information in a memory on which a memory is mounted.
- the memory is a computer-readable medium.
- the memory may be a volatile memory unit, and in other embodiments, the memory may be a non-volatile memory unit.
- the storage device is a computer-readable medium.
- the storage device may include, for example, a hard disk device, an optical disk device, or any other mass storage device.
- Such a binaural stereo audio generator can maximize the binaural effect by mixing various sound elements.
- compatibility with various kinds of contents can be improved based on natural upmix and downmix.
- FIG. 20 is a view showing an example of a sound represented by a conventional binaural engine
- FIG. 21 is a view illustrating an example of a sound represented by a binaural engine according to the present invention.
- a binaural mix method using a conventional binaural engine has a limitation in expressing the proximity of sound.
- binaural mixing corresponds to providing a spatial image of sound, so there is no way to control the volume of sound to represent the proximity of sound through binaural mixing.
- Vbap Vector base amplitude panning
- the binaural engine according to the present invention mixes the flat layer audio output generated using the surround layer 2110 and the proximity stereo layer 2120 in addition to the three-dimensional binaural layer . That is, in the conventional binaural engine, the proximity expression of the sound, which is controlled only by the volume of the sound, can be controlled through the surround layer binaural output and the stereo signal.
- the engineer can express the actual sound direction 2130 corresponding to the intended sound direction 2010. That is, by transmitting the reference listening point 2100, it is possible to produce a sound that seems to be transmitted through the listener's body.
- 22 is a flowchart illustrating a method of generating binaural stereo audio according to an embodiment of the present invention.
- a binaural stereo audio generating method performs three-dimensional layer binaural encoding corresponding to a three-dimensional binaural layer to generate a three-dimensional layer binaural output (S2210).
- a binaural encoder 420 corresponding to a three-dimensional cubic method may be used to generate a three-dimensional binaural layer, Dimensional layer binaural encoding corresponding to a plurality of channels included in the binaural layer.
- the 3D binaural layer may include four up channels 411 and four down channels 412 corresponding to 8-channel based cubic cubes.
- the three-dimensional layer binaural output 430 may correspond to an output generated by binaural encoding the 8-channel based audio, and may be output corresponding to 2 channels as shown in FIG.
- the two channels corresponding to the three-dimensional layer binaural output 430 may correspond to the left channel and the right channel, respectively.
- the 3-dimensional binaural layer may not be limited thereto. That is, the binaural engine or the binaural engine according to an embodiment of the present invention may include another usable three-dimensional binaural layer or a three-dimensional binaural layer to be developed in the future.
- the three-dimensional layer binaural output corresponds to a three-dimensional vector for a binaural point located on an eight-channel-based three-dimensional cubic (Cubic) composed of four up channels and four down channels Lt; / RTI >
- an 8-channel based three-dimensional cubic may include four dynamic speakers 511 to 514 corresponding to four up channels and four dynamic speakers corresponding to four down channels (515 to 518) may be a hexahedron structure.
- the positions of the eight dynamic speakers 511 to 518 can be changed, the range of the binaural effect caused by the three-dimensional cubic can also be changed dynamically.
- an immersive sound may be implemented with eight dynamic speakers by generating three-dimensional cubic using a conventional binaural Vbap (Vector base amplitude panning) scheme. That is, a position value for X, Y, and Z is given to each of the eight dynamic speakers, and a vector-based virtual track point based on the center of cubic cubic can be expressed. At this time, the virtual track point can be represented corresponding to the parameter value included in the head tracking information.
- Vbap Vector base amplitude panning
- the 3D cubic can be generated by changing the position of the eight dynamic speakers corresponding to the vertices of the 3D cubic by changing the size parameter for the 3D binaural layer. That is, it is possible to efficiently generate three-dimensional cubic by changing the position of dynamic speakers of the variable system rather than the fixed system freely according to the size parameter.
- the 3D cubic bins 610, 620, and 630 having various ranges as shown in FIG. 6 can be generated.
- the three-dimensional vector is included in the three-dimensional cubic and can be generated based on the reference listening point corresponding to the center of the two-dimensional plane corresponding to the surround layer.
- a reference listening point 700 which virtually expresses the position of a user or a listener listening to binaural stereo audio, is composed of a three-dimensional cubic 710 having eight dynamic speakers as vertices, But may be located at a center portion on the surround layer 720.
- the binaural point 730 is located on the upper surface of the cubic 710, as shown in FIG. 7, a three-dimensional vector 740 corresponding to the three-dimensional layer binaural output 7 in the direction from the reference listening point 700 shown in FIG. 7 to the binaural point 730.
- the surround layer 720 corresponds to an element for creating a surround image corresponding to a surround effect. In FIG. 7, But it may not be limited to a planar shape.
- the binaural point 730 when the binaural point 730 is located on the 3D cubic 710 higher than the surround layer 720 where the reference listening point 700 is located, Can be formed at the top of the. Also, when the binaural point 730 is located on the three-dimensional cubic 710 lower than the surround layer 720 where the reference listening point 700 is located, the output sound may be formed at the bottom of the listener.
- the direction information of the three-dimensional vector can be applied to the rotated cubic by corresponding to the head tracking information to generate the three-dimensional layer binaural output. That is, since the binaural point is set based on the listener's head corresponding to the reference listening point, the position of the binaural point on the cubic bicycle can also be changed if the listener's head position or angle is changed.
- the 3D cubic 710 shown in FIG. 7 is rotated as shown in FIG. 8 in accordance with the head tracking information.
- the direction information of the three-dimensional vector 740 shown in FIG. 7 can be directly applied to the three-dimensional cubic as shown in FIG. 8 to detect the position of the changed binaural point according to the rotation.
- the head tracking information corresponds to data obtained by tracking the head movement of the user or the listener, and may be input through a separate head tracking module or a user interface.
- the head tracking module can measure the distance or angle of movement of the user's head and generate and transmit the head tracking information.
- head tracking information may be artificially assigned by a user or a listener. That is, the user or the listener may input the head tracking information based on the user interface irrespective of whether the head tracking information is received by the head tracking module to artificially rotate the spatial image. At this time, the user or the listener may input and modify the head tracking information while listening to the mixing process of generating the binaural stereo output or the binaural stereo output varying according to the inputted information.
- the cubic cubic can be rotated corresponding to the rotation parameter of at least one of pan, tilt, and roll.
- the listener rotates the head corresponding to at least one of pan, tilt, and roll as shown in FIG. 9, the value is obtained as a rotation parameter, .
- the effect of rotating the three-dimensional cubic according to the head tracking information or moving it in the up, down, left, and right directions can be mixed with the flat layer audio output in the future to generate the binaural stereo output. Therefore, it is possible to produce an immersive effect based on head tracking more efficiently than a conventional method of rotating or moving a surround layer, a proximity stereo layer, a sub-woofer layer, or the like corresponding to a flat layer.
- audio processing corresponding to a plane layer is performed to generate a plane layer audio output (S2220).
- the planar layer corresponds to a layer having a structure different from that of the three-dimensional binaural layer, and may correspond to an element that produces an image corresponding to a surround effect or a stereo effect.
- the plane layer performs a surround layer binaural encoding to generate a surround layer binaural output, a surround layer to provide the generated surround layer binaural output as a plane layer audio output, and a surround layer to provide a stereo signal Or a proximity stereo layer that produces a corresponding flat layer audio output.
- a binaural encoder 1020 may be used to perform surround layer binaural encoding corresponding to a 5-channel or 7-channel 1010 surround layer.
- two channels corresponding to a proximity stereo layer may be included in the surround layer to perform 7-channel based surround layer binaural encoding.
- the surround layer may correspond to a structure including five speakers 1111 to 1115, for example, as shown in Fig.
- the surround layer binaural output 1030 may correspond to a binaural point located on the surround layer. If the listener is listening to a sound at a reference listening point located at the center of the surround layer, the surround layer binaural output 1030 is binaurally encoded as if it were sounding at a binaural point on the surround layer, Can be generated.
- the surround layer binaural output 1030 may be output corresponding to two channels as shown in FIG.
- the two channels corresponding to the surround layer binaural output 1030 may correspond to the left channel and the right channel, respectively.
- the channel of the surround layer is not limited to five channels or seven channels (1010).
- the surround layer is shown in a rectangular plane shape, but it is not limited thereto and can be expressed in various forms such as a line thickness, a planar shape, and a distance from a reference listening point.
- a stereo signal 1230 corresponding to a plane layer audio output may correspond to an output produced by processing 2-channel 1210 based stereo audio, and may be output corresponding to two channels.
- the proximity stereo layer corresponds to an element for producing a stereo image corresponding to the stereo effect, and may be included as a part of the surround layer.
- a surround stereo layer corresponding to two speakers 1311, 1312, 1411 and 1412 on a surround layer based on five speakers is included so that a total of seven Or a layer structure including speakers.
- the proximity stereo layer may be disposed at a distance from the reference listening point 1300 located on the surround layer.
- a proximity stereo layer may be used as the left and right side speakers of the reference listening point 1400.
- the stereo signal output corresponding to the proximity stereo layer can provide a damping feeling that is difficult to produce with spatial parameters used in binaural encoding.
- the binaural stereo output according to an embodiment of the present invention may provide a damping feeling while providing an immersive effect by binaural encoding.
- a planar layer audio output corresponding to a surround layer binaural output or a planar layer audio output corresponding to a stereo signal can be used for output corresponding to an output containing only a different sound effect Lt; / RTI > That is, the plane layer audio output may include various values other than the output corresponding to the three-dimensional layer, rather than the three-dimensional layer binaural output.
- the planar layer may be located between four up channels and four down channels corresponding to cubic cubic.
- the planar layers 1510 to 1530 include four up-channels included in a cubic cubic corresponding to a three-dimensional binaural layer and four down- May be located between the channels.
- the four up channels may correspond to four speakers located at the top of the cubic cubic
- the four down channels may correspond to the four speakers located at the bottom of the cubic cubic.
- the flat layers 1510 to 1530 may be located within a height range of a hexahedron corresponding to cubic cubic.
- each of the speakers included in the surround layer or the adjacent stereo layer corresponding to the flat layers 1510 to 1530 may be located between the four up channels included in the 3D cubic and the four down channels .
- the flat layers 1510 to 1530 are shown in the form of planes for convenience of explanation in FIG. 15, but the shape of the planar layers according to an embodiment of the present invention may not be limited to the planar form.
- FIGS. 16 and 17 show a structure in which the three-dimensional cubic and the flat layer 1610 corresponding to the three-dimensional binaural layer are viewed from above, respectively.
- the speaker of the proximity stereo layer included in the flat layer 1610 1622 and 1622 are also located between the up channel and the down channel of the cubic cubic.
- the binaural stereo audio generation method combines the 3D layer binaural output and the plane layer audio output to generate a binaural stereo output (S2230). That is, by mixing an immersive element by a three-dimensional layer binaural output and a near-playback element and an object element by a flat layer audio output, a binaural stereo output capable of generating a binaural effect can be generated have.
- a binaural stereo output may be generated using only a three-dimensional layer binaural output.
- the subwoofer output corresponding to the subwoofer layer can be added together with the three-dimensional layer binaural output and the planar layer audio output to generate a binaural stereo output.
- the subwoofer outputs it is possible to maximize the immersive effect corresponding to the binaural stereo output, and to produce a dynamic bass reproduction element.
- a single channel or two-channel 1810 signal included in a sub-woofer layer may be processed based on an LFE bus (Low Frequency Effects Bus) 1820. That is, the subwoofer output 1830 may correspond to an output produced by processing a single channel or two channel (1810) based audio, and may correspond to a single channel or two channels as shown in FIG.
- LFE bus Low Frequency Effects Bus
- the subwoofer layer may correspond to a single channel, such as 5.1 channel, 7.1 channel and 11.1 channel, or may correspond to two channels, such as 10.2 channel and 22.2 channel.
- the sub-woofer layer can be located separately from the three-dimensional cubic or planar layer corresponding to the three-dimensional binaural layer.
- the sub-woofer layer 1940 is located at a distance from the three-dimensional cubic 1910, the surround layer 1920, and the proximity stereo layer 1930 corresponding to the three-dimensional binaural layer Can be located.
- the structure shown in FIG. 19 corresponds to an embodiment, and is not limited to a structure in which respective layers are combined.
- the three-dimensional weight can be applied to the three-dimensional layer binaural output
- the plane weight can be applied to the plane layer audio output
- the three-dimensional weight and the plane weight can be set independently of each other. That is, it is possible to generate a more dramatic binaural stereo output by adjusting the size of the layer-by-layer output and then performing the mixing, thereby maximizing the binaural effect.
- the present invention can support natural upmix and downmix functions based on the above-described functions, compatibility between contents supporting various kinds of sounds can be improved. For example, you can downmix a surround image represented by three-dimensional cubic into a surround layer. Also, the surround layer may be downmixed back to the adjacent stereo layer. As described above, by downmixing based on the area, the sound quality of the sound can be preserved more effectively.
- the method for generating binaural stereo audio can transmit and receive information necessary for generating binaural stereo audio through a communication network such as a network. Particularly, it is possible to receive head tracking information, information related to a user input or contents to be applied with a binaural effect, and provide a binaural stereo output according to an embodiment of the present invention.
- the method of generating binaural stereo audio according to an embodiment of the present invention may include generating binaural stereo audio according to an embodiment of the present invention, Various information is stored.
- embodiments of the invention may be embodied in a computer-implemented method or in a non-volatile computer readable medium having recorded thereon instructions executable by the computer.
- instructions readable by a computer are executed by a processor, the instructions readable by the computer are capable of performing at least one aspect of the invention.
- the method and apparatus for generating binaural stereo audio according to the present invention are not limited to the above-described embodiments, and various modifications may be made to the embodiments. All or some of the embodiments may be selectively combined.
- the present invention relates to a binaural stereo audio generating method and apparatus therefor, and it is possible to generate binaural stereo audio capable of maximizing a binaural effect by mixing various sound elements,
- the present invention provides a binaural engine that can easily adjust or adjust a sound element for generating a sound, and can improve compatibility with various kinds of contents based on natural upmix and downmix, thereby contributing to the development of industry.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
Abstract
Description
Claims (20)
- 3차원 바이노럴 레이어에 상응하는 3차원 레이어 바이노럴 인코딩을 수행하여 3차원 레이어 바이노럴 출력을 생성하는 단계;Performing a three-dimensional layer binaural encoding corresponding to a three-dimensional binaural layer to generate a three-dimensional layer binaural output;평면 레이어에 상응하는 오디오 프로세싱을 수행하여 평면 레이어 오디오 출력을 생성하는 단계; 및Performing audio processing corresponding to a planar layer to produce a planar layer audio output; And상기 3차원 레이어 바이노럴 출력 및 평면 레이어 오디오 출력을 합하여 바이노럴 스테레오 출력을 생성하는 단계Generating a binaural stereo output by summing the three-dimensional layer binaural output and the planar layer audio output를 포함하는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.And generating a binaural stereo audio signal.
- 청구항 1에 있어서,The method according to claim 1,상기 평면 레이어는The planar layer서라운드 레이어 바이노럴 인코딩을 수행하여 서라운드 레이어 바이노럴 출력을 생성하고, 생성된 상기 서라운드 레이어 바이노럴 출력을 상기 평면 레이어 오디오 출력으로 제공하는 서라운드 레이어 및 A surround layer for performing surround layer binaural encoding to generate a surround layer binaural output and providing the generated surround layer binaural output as the plane layer audio output,스테레오 신호를 입력 받아서 상기 스테레오 신호에 상응하는 상기 평면 레이어 오디오 출력을 생성하는 근접용 스테레오 레이어 중 어느 하나인 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.And a proximity stereo layer for receiving the stereo signal and generating the plane layer audio output corresponding to the stereo signal.
- 청구항 2에 있어서,The method of claim 2,상기 3차원 레이어 바이노럴 출력은The three-dimensional layer binaural output4개의 업 채널들과 4개의 다운채널들로 구성된 8채널 기반의 3차원 큐빅(Cubic) 상에 위치하는 바이노럴 포인트에 대한 3차원 벡터에 상응하게 생성되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Dimensional binaural point on the 8-channel based three-dimensional Cubic consisting of 4 up channels and 4 down channels. The binaural stereo audio Generation method.
- 청구항 2에 있어서,The method of claim 2,상기 바이노럴 스테레오 출력을 생성하는 단계는The step of generating the binaural stereo output3차원 가중치를 상기 3차원 레이어 바이노럴 출력에 적용하고, 평면 가중치를 상기 평면 레이어 오디오 출력에 적용하고, 상기 3차원 가중치 및 상기 평면 가중치는 서로 독립적으로 설정되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Applying a three-dimensional weight to the three-dimensional layer binaural output, applying a plane weight to the planar layer audio output, and wherein the three-dimensional weight and the plane weight are set independently of each other. How to create audio.
- 청구항 1에 있어서,The method according to claim 1,상기 바이노럴 스테레오 출력을 생성하는 단계는The step of generating the binaural stereo output서브우퍼 레이어에 상응하는 서브우퍼 출력을 상기 3차원 레이어 바이노럴 출력 및 평면 레이어 오디오 출력과 함께 합산하여 상기 바이노럴 스테레오 출력을 생성하는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Wherein the binaural stereo output is generated by summing the sub-woofer output corresponding to the sub-woofer layer together with the three-dimensional layer binaural output and the planar layer audio output.
- 청구항 3에 있어서,The method of claim 3,상기 3차원 큐빅은The three-상기 3차원 큐빅의 꼭지점에 해당하는 8개의 동적 스피커들의 위치를 상기 3차원 바이노럴 레이어에 대한 크기 파라미터에 상응하게 변경하여 생성되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Wherein the positions of the eight dynamic speakers corresponding to the vertexes of the cubic cubic are changed according to size parameters of the 3D binaural layer.
- 청구항 3에 있어서,The method of claim 3,상기 3차원 벡터는The three-dimensional vector상기 3차원 큐빅의 내부에 포함되고, 상기 서라운드 레이어에 상응하는 2차원 평면의 중심에 해당하는 기준 청취점을 기준으로 생성되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Wherein the binaural stereo audio is generated based on a reference listening point included in the 3D cubic and corresponding to a center of a two-dimensional plane corresponding to the surround layer.
- 청구항 3에 있어서,The method of claim 3,상기 3차원 레이어 바이노럴 출력을 생성하는 단계는Wherein generating the three-dimensional layer binaural output comprises:상기 3차원 벡터의 방향 정보를 헤드 트래킹 정보에 상응하게 회전된 상기 3차원 큐빅에 적용하여 상기 3차원 레이어 바이노럴 출력을 생성하되, 상기 헤드 트래킹 정보는 헤드 트래킹 모듈에 기반한 트래킹 입력 및 사용자 인터페이스에 기반한 사용자 입력 중 적어도 하나에 상응하게 획득되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Dimensional brain binaural output by applying the direction information of the three-dimensional vector to the rotated cubic bikes corresponding to the head tracking information, wherein the head tracking information includes a tracking input based on a head tracking module and a user interface And a user input based on at least one of the user input and the user input.
- 청구항 8에 있어서,The method of claim 8,상기 3차원 큐빅은 The three-팬(Pan), 틸트(tilt) 및 롤(roll) 중 적어도 하나의 회전 파라미터에 상응하게 회전되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Wherein the at least one audio signal is rotated in accordance with a rotation parameter of at least one of a pan, a tilt, and a roll.
- 청구항 3에 있어서,The method of claim 3,상기 평면 레이어는 상기 4개의 업 채널들과 상기 4개의 다운채널들 사이에 위치하는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 방법.Wherein the flat layer is located between the four up channels and the four down channels.
- 3차원 바이노럴 레이어에 상응하는 3차원 레이어 바이노럴 인코딩을 수행하여 3차원 레이어 바이노럴 출력을 생성하고, 평면 레이어에 상응하는 오디오 프로세싱을 수행하여 평면 레이어 오디오 출력을 생성하고, 상기 3차원 레이어 바이노럴 출력 및 평면 레이어 오디오 출력을 합하여 바이노럴 스테레오 출력을 생성하는 프로세서; 및Dimensional layer binaural encoding by performing a three-dimensional layer binaural encoding corresponding to a three-dimensional binaural layer, performing audio processing corresponding to a plane layer to generate a plane layer audio output, A processor for summing the dimension layer binaural output and the planar layer audio output to produce a binaural stereo output; And상기 3차원 레이어 바이노럴 출력 및 평면 레이어 오디오 출력을 저장하는 메모리A memory for storing the three-dimensional layer binaural output and the plane layer audio output를 포함하는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Wherein the binaural stereo audio generating device comprises:
- 청구항 11에 있어서,The method of claim 11,상기 평면 레이어는The planar layer서라운드 레이어 바이노럴 인코딩을 수행하여 서라운드 레이어 바이노럴 출력을 생성하고, 생성된 상기 서라운드 레이어 바이노럴 출력을 상기 평면 레이어 오디오 출력으로 제공하는 서라운드 레이어 및 A surround layer for performing surround layer binaural encoding to generate a surround layer binaural output and providing the generated surround layer binaural output as the plane layer audio output,스테레오 신호를 입력 받아서 상기 스테레오 신호에 상응하는 상기 평면 레이어 오디오 출력을 생성하는 근접용 스테레오 레이어 중 어느 하나인 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.And a proximity stereo layer for receiving the stereo signal and generating the plane layer audio output corresponding to the stereo signal.
- 청구항 12에 있어서,The method of claim 12,상기 3차원 레이어 바이노럴 출력은The three-dimensional layer binaural output4개의 업 채널들과 4개의 다운채널들로 구성된 8채널 기반의 3차원 큐빅(Cubic) 상에 위치하는 바이노럴 포인트에 대한 3차원 벡터에 상응하게 생성되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Dimensional binaural point on the 8-channel based three-dimensional Cubic consisting of 4 up channels and 4 down channels. The binaural stereo audio Generating device.
- 청구항 12에 있어서,The method of claim 12,상기 프로세서는The processor3차원 가중치를 상기 3차원 레이어 바이노럴 출력에 적용하고, 평면 가중치를 상기 평면 레이어 오디오 출력에 적용하고, Applying a three-dimensional weight to the three-dimensional layer binaural output, applying a plane weight to the planar layer audio output,상기 3차원 가중치 및 상기 평면 가중치는 서로 독립적으로 설정되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Wherein the three-dimensional weight and the plane weight are set independently of each other.
- 청구항 11에 있어서,The method of claim 11,상기 프로세서는The processor서브우퍼 레이어에 상응하는 서브우퍼 출력을 상기 3차원 레이어 바이노럴 출력 및 평면 레이어 오디오 출력과 함께 합산하여 상기 바이노럴 스테레오 출력을 생성하는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Wherein the binaural stereo output is generated by summing the sub-woofer output corresponding to the sub-woofer layer together with the three-dimensional layer binaural output and the planar layer audio output.
- 청구항 13에 있어서,14. The method of claim 13,상기 3차원 큐빅은The three-상기 3차원 큐빅의 꼭지점에 해당하는 8개의 동적 스피커들의 위치를 상기 3차원 바이노럴 레이어에 대한 크기 파라미터에 상응하게 변경하여 생성되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Wherein the positions of the eight dynamic speakers corresponding to the vertexes of the cubic cubic are generated by changing the positions of the dynamic speakers corresponding to the size parameters for the 3D binaural layer.
- 청구항 13에 있어서,14. The method of claim 13,상기 3차원 벡터는The three-dimensional vector상기 3차원 큐빅의 내부에 포함되고, 상기 서라운드 레이어에 상응하는 2차원 평면의 중심에 해당하는 기준 청취점을 기준으로 생성되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Wherein the binaural stereo audio is generated based on a reference listening point included in the 3D cubic and corresponding to a center of a two-dimensional plane corresponding to the surround layer.
- 청구항 13에 있어서,14. The method of claim 13,상기 프로세서는The processor상기 3차원 벡터의 방향 정보를 헤드 트래킹 정보에 상응하게 회전된 상기 3차원 큐빅에 적용하여 상기 3차원 레이어 바이노럴 출력을 생성하되, 상기 헤드 트래킹 정보는 헤드 트래킹 모듈에 기반한 트래킹 입력 및 사용자 인터페이스에 기반한 사용자 입력 중 적어도 하나에 상응하게 획득되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Dimensional brain binaural output by applying the direction information of the three-dimensional vector to the rotated cubic bikes corresponding to the head tracking information, wherein the head tracking information includes a tracking input based on a head tracking module and a user interface Wherein the binaural stereo audio is generated corresponding to at least one of the user input based on the input of the binaural stereo audio.
- 청구항 18에 있어서,19. The method of claim 18,상기 3차원 큐빅은 The three-팬(Pan), 틸트(tilt) 및 롤(roll) 중 적어도 하나의 회전 파라미터에 상응하게 회전되는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.And wherein the binaural stereo audio is rotated in accordance with at least one rotation parameter of a pan, a tilt, and a roll.
- 청구항 13에 있어서,14. The method of claim 13,상기 평면 레이어는 상기 4개의 업 채널들과 상기 4개의 다운채널들 사이에 위치하는 것을 특징으로 하는 바이노럴 스테레오 오디오 생성 장치.Wherein the flat layer is located between the four up channels and the four down channels.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR10-2018-0010874 | 2018-01-29 | ||
KR1020180010874A KR102119239B1 (en) | 2018-01-29 | 2018-01-29 | Method for creating binaural stereo audio and apparatus using the same |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019147041A1 true WO2019147041A1 (en) | 2019-08-01 |
Family
ID=67394773
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2019/001019 WO2019147041A1 (en) | 2018-01-29 | 2019-01-24 | Method for generating binaural stereo audio and apparatus therefor |
Country Status (2)
Country | Link |
---|---|
KR (1) | KR102119239B1 (en) |
WO (1) | WO2019147041A1 (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20140125745A (en) * | 2013-04-19 | 2014-10-29 | 한국전자통신연구원 | Processing appratus mulit-channel and method for audio signals |
JP2016523045A (en) * | 2013-05-07 | 2016-08-04 | ボーズ・コーポレーションBose Corporation | Signal processing for headrest-based audio systems |
WO2017051079A1 (en) * | 2015-09-25 | 2017-03-30 | Nokia Technologies Oy | Differential headtracking apparatus |
WO2017072118A1 (en) * | 2015-10-26 | 2017-05-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating a filtered audio signal realizing elevation rendering |
WO2017097324A1 (en) * | 2015-12-07 | 2017-06-15 | Huawei Technologies Co., Ltd. | An audio signal processing apparatus and method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102007991B1 (en) | 2013-07-25 | 2019-08-06 | 한국전자통신연구원 | Binaural rendering method and apparatus for decoding multi channel audio |
-
2018
- 2018-01-29 KR KR1020180010874A patent/KR102119239B1/en active IP Right Grant
-
2019
- 2019-01-24 WO PCT/KR2019/001019 patent/WO2019147041A1/en active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20140125745A (en) * | 2013-04-19 | 2014-10-29 | 한국전자통신연구원 | Processing appratus mulit-channel and method for audio signals |
JP2016523045A (en) * | 2013-05-07 | 2016-08-04 | ボーズ・コーポレーションBose Corporation | Signal processing for headrest-based audio systems |
WO2017051079A1 (en) * | 2015-09-25 | 2017-03-30 | Nokia Technologies Oy | Differential headtracking apparatus |
WO2017072118A1 (en) * | 2015-10-26 | 2017-05-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating a filtered audio signal realizing elevation rendering |
WO2017097324A1 (en) * | 2015-12-07 | 2017-06-15 | Huawei Technologies Co., Ltd. | An audio signal processing apparatus and method |
Also Published As
Publication number | Publication date |
---|---|
KR102119239B1 (en) | 2020-06-04 |
KR20190091824A (en) | 2019-08-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2018182274A1 (en) | Audio signal processing method and device | |
WO2018056780A1 (en) | Binaural audio signal processing method and apparatus | |
WO2015147530A1 (en) | Method and apparatus for rendering acoustic signal, and computer-readable recording medium | |
WO2017191970A2 (en) | Audio signal processing method and apparatus for binaural rendering | |
WO2018147701A1 (en) | Method and apparatus for processing audio signal | |
WO2014175669A1 (en) | Audio signal processing method for sound image localization | |
WO2019147040A1 (en) | Method for upmixing stereo audio as binaural audio and apparatus therefor | |
WO2015142073A1 (en) | Audio signal processing method and apparatus | |
WO2014088328A1 (en) | Audio providing apparatus and audio providing method | |
WO2015156654A1 (en) | Method and apparatus for rendering sound signal, and computer-readable recording medium | |
WO2014157975A1 (en) | Audio apparatus and audio providing method thereof | |
WO2016089180A1 (en) | Audio signal processing apparatus and method for binaural rendering | |
WO2015147619A1 (en) | Method and apparatus for rendering acoustic signal, and computer-readable recording medium | |
WO2019004524A1 (en) | Audio playback method and audio playback apparatus in six degrees of freedom environment | |
WO2015152665A1 (en) | Audio signal processing method and device | |
WO2019107868A1 (en) | Apparatus and method for outputting audio signal, and display apparatus using the same | |
WO2012005507A2 (en) | 3d sound reproducing method and apparatus | |
WO2014021588A1 (en) | Method and device for processing audio signal | |
WO2019147064A1 (en) | Method for transmitting and receiving audio data and apparatus therefor | |
WO2011139090A2 (en) | Method and apparatus for reproducing stereophonic sound | |
WO2019031652A1 (en) | Three-dimensional audio playing method and playing apparatus | |
WO2019035622A1 (en) | Audio signal processing method and apparatus using ambisonics signal | |
WO2019066348A1 (en) | Audio signal processing method and device | |
WO2016190460A1 (en) | Method and device for 3d sound playback | |
JP2018110366A (en) | 3d sound video audio apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19743311 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19743311 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 05/02/2021) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19743311 Country of ref document: EP Kind code of ref document: A1 |