WO2015002517A1

WO2015002517A1 - Virtual sound image localization method for two dimensional and three dimensional spaces

Info

Publication number: WO2015002517A1
Application number: PCT/KR2014/006053
Authority: WO
Inventors: 유재현; 이용주; 서정일; 강경옥; 최근우; 방희석
Original assignee: 한국전자통신연구원
Priority date: 2013-07-05
Filing date: 2014-07-07
Publication date: 2015-01-08

Abstract

Disclosed is a virtual sound image localization method for two dimensional and three dimensional spaces. The virtual sound image localization method may comprise the steps of: setting up a playback area consisting of at least one loudspeaker which is usable in an output channel; dividing the playback area into a plurality of sub-areas; identifying a sub-area, among the divided sub-areas, in which a desired virtual sound source to be played is located; determining a Fanning coefficient for playing the virtual sound source on the basis of the identified sub-area; and rendering an input signal on the basis of the Fanning coefficient.

Description

Virtual Phonetic Positioning Method in 2D and 3D Spaces

The following embodiments relate to a virtual sound image positioning method using a plurality of loudspeakers corresponding to an output channel.

The panning method is a method of reproducing a virtual sound source by allocating power to a loudspeaker located near the virtual sound source in consideration of the position of the virtual sound source to be reproduced. As described above, determining the position of the virtual sound source in the virtual space by allocating power to the loudspeaker to determine the output size of the loudspeaker is called a virtual sound image positioning method.

At this time, the playback of the virtual sound source using two loudspeakers is defined as power panning, and the playback of the virtual sound source using three loudspeakers is defined as vector based amplitude panning (VBAP). . These techniques are widely used as a virtual sound image location method.

The methods described above use an operation that distributes power to the loudspeakers to map the position of the virtual sound source between two or three loudspeakers. According to this operation, sophisticated angle division is possible, but it is difficult for the listener to distinguish the virtual sound source located at the divided angle and the amount of calculation increases. In addition, when the input channel panned to the loudspeaker corresponding to the output channel increases, a sound quality degradation may occur. Therefore, there is a need for a panning technique to solve the problem caused by the angle division.

On the other hand, the loudspeakers generally arranged in the playback space have a symmetrical arrangement such as left, right, or middle with respect to the listener. However, this symmetrical batching represents an ideal situation in real life. In other words, in practice, loudspeakers are often asymmetrical in arrangement of front, rear, left and right. Accordingly, there is also a need for a panning technique for asymmetrically arranged loudspeakers.

The following embodiments provide a virtual sound image positioning method using loudspeakers existing in two-dimensional and three-dimensional space, and a loudspeaker renderer performing the method.

The following exemplary embodiments divide virtual playback regions composed of loudspeakers into sub-regions, and determine a panning coefficient based on the sub-region in which the virtual sound source to be played is located to reduce the amount of computation for determining the panning coefficient. A method and a loudspeaker renderer performing this method are provided.

The following embodiments provide a virtual sound image positioning method that can effectively reproduce a virtual sound source by determining a panning coefficient in consideration of whether the loudspeakers are located in a two-dimensional space or a three-dimensional space, and a loudspeaker renderer performing the method. do.

According to one or more exemplary embodiments, a virtual sound image positioning method includes determining reproduction information of at least one loudspeaker available in an output channel to reproduce a virtual sound source corresponding to an input channel; And rendering an input signal using the reproduction information.

The loudspeakers may be in two-dimensional or three-dimensional space.

The determining of the playback information of the loudspeakers may include: dividing a playback area composed of the loudspeakers into a plurality of detailed areas; The method may include determining a detailed area in which the virtual sound source to be reproduced from among the divided detailed areas is located and determining a panning coefficient of the loudspeakers based on the determined detailed area.

The dividing may include dividing a reproduction region corresponding to a circumference connecting the two loudspeakers into a plurality of subregions when the two loudspeakers are two, and determining the divided subregions. The detailed region in which the virtual sound source is located may be determined.

In the dividing step, when the loudspeakers are K pieces (K> 3), the dividing area consisting of the loudspeakers is divided into X sub-regions (X≥K), and the determining is performed. The detailed area in which the virtual sound source is located may be determined among the divided detailed areas.

According to another embodiment of the present invention, a virtual sound image positioning method may include: setting a reproduction area including at least one loudspeaker usable in an output channel; Dividing the playback area into a plurality of detailed areas; Determining a detailed area in which the virtual sound source to be reproduced is located among the divided detailed areas; Determining a panning coefficient for reproducing the virtual sound source based on the determined subregion; And rendering an input signal based on the panning coefficient.

The loudspeakers may be in two-dimensional or three-dimensional space.

According to another exemplary embodiment, a virtual sound image positioning method may include determining whether a panning coefficient for a virtual sound source may be determined using loudspeakers positioned on a plane; The method may include determining a panning coefficient for the virtual sound source based on the determination result.

The determining of the panning coefficient may include determining a panning coefficient for the virtual sound source based on a horizontal angle when the panning coefficient may be determined using a loudspeaker located on the plane.

The determining of the panning coefficient may include determining a panning coefficient for the virtual sound source based on a vertical angle when the panning coefficient cannot be determined using a loudspeaker positioned on the plane.

According to another embodiment of the present invention, a virtual sound image positioning method includes determining whether loudspeakers are located in a two-dimensional space or a three-dimensional space; And determining a panning coefficient for the virtual sound source based on the determination result.

In the determining of the panning coefficient, when the loudspeakers are located in a two-dimensional space, the panning coefficient for the virtual sound source may be determined based on a horizontal angle.

The determining of the panning coefficient may include determining a panning coefficient for the virtual sound source based on a vertical angle when the loudspeakers are located in a three-dimensional space.

According to an embodiment, a loudspeaker renderer may include: a determiner configured to determine reproduction information of at least one loudspeaker usable in an output channel to reproduce a virtual sound source corresponding to an input channel; And a rendering unit that renders an input signal using the reproduction information.

According to another exemplary embodiment, a loudspeaker renderer may include: a determiner configured to determine a panning coefficient for reproducing a virtual sound source based on a subdivided region of a reproduction region composed of at least one loudspeaker usable in an output channel; And a rendering unit that renders an input signal based on the panning coefficient.

The loudspeaker renderer according to another embodiment determines whether or not the panning coefficient for the virtual sound source can be determined using the loudspeakers located on a plane, and determines a panning coefficient for the virtual sound source based on the determination result. ; And a rendering unit that renders an input signal based on the panning coefficient.

According to another embodiment of the present invention, a loudspeaker renderer may include: a determiner configured to determine whether loudspeakers are positioned in a two-dimensional space or a three-dimensional space, and determine a panning coefficient for a virtual sound source based on a determination result; And a rendering unit that renders an input signal based on the panning coefficient.

The determining unit may determine a panning coefficient for the virtual sound source based on a horizontal angle when the loudspeakers are located in a two-dimensional space, and determine the panning coefficient for the virtual sound source based on a vertical angle when the loudspeakers are located in a three-dimensional space. Can determine the panning coefficient.

According to the following embodiments, it is possible to reduce the amount of calculation for determining the panning coefficient by dividing the playback region that is configured by the loudspeakers into subregions and determining the panning coefficient based on the subregion where the virtual sound source to be played is located. ..

The following embodiments can effectively reproduce the virtual sound source by determining the panning coefficient in consideration of whether the loudspeakers are located in a two-dimensional space or a three-dimensional space.

FIG. 1 is a diagram illustrating a loudspeaker renderer performing a virtual sound image positioning method according to an embodiment.

2 is a diagram illustrating a virtual sound image positioning method according to an embodiment.

3 is a diagram illustrating a virtual sound image positioning method according to another embodiment.

4 is a diagram illustrating a spatial grouping-based panning technique according to an embodiment.

FIG. 5 is a diagram illustrating a spatial grouping-based panning technique when K is 3 in FIG. 4.

6 is a diagram illustrating a spatial grouping-based panning technique according to another embodiment.

FIG. 7 is a diagram illustrating a spatial grouping-based panning technique when K is 4 in FIG. 6.

Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

Referring to FIG. 1, the loudspeaker renderer 102 may include a determiner 103 and a renderer 104.

The determiner 103 may receive a mixer output layout from the decoder 101. Here, the mixer output layout may refer to a format of the mixer output signal output by the decoder 101 by decoding the bitstream. For the loudspeaker renderer 102, the mixer output signal may be an input signal and the corresponding mixer output layout means an input format.

The determination unit 103 may determine the reproduction information of the plurality of loudspeakers in consideration of the mixer output layout and the reproduction layout. Here, the reproduction information means information used when converting an input format representing the mixer output layout into an output format representing the reproduction layout. Thus, the loudspeaker renderer 102 may be represented as a format converter.

Specifically, when the number of channels of the input format is larger than the number of channels of the output format, the reproduction information may mean a downmix matrix for mapping the input signal to the output signal. That is, the loudspeaker renderer 102 may convert the output signal of the N-channel corresponding to the reproduction layout to be considered when reproducing the input signal of the M-channel. The determination unit 103 may determine the reproduction information for format conversion.

In this case, the input signal corresponding to one channel may be mapped to an output signal corresponding to one channel or a plurality of channels according to the loudspeaker. In other words, the input signals may be mapped to output signals corresponding to one channel. Alternatively, the input signal may be panned into an output signal corresponding to two channels. In addition, the input signal may be divided into output signals corresponding to three or more channels.

Thus, the determination unit 103 may determine the reproduction information for mapping the input signal to an output signal corresponding to one channel or a plurality of channels. In this case, the reproduction information may include a downmix matrix composed of a plurality of panning coefficients.

In the following embodiment, a process of determining the reproduction information so that a sound source corresponding to the input signal can be reproduced in the loudspeaker when the input signal is mapped to the output signal will be described. In particular, the determination unit 103 controls the power input to the loudspeakers to provide the listener with the effect that the virtual sound source is reproduced in the virtual space between the loudspeakers, not the actual sound source. A panning coefficient may be determined for sound image localization. A process of determining the panning coefficient will be described with reference to FIGS. 2 and 3, respectively.

The rendering unit 104 may render the mixer output signal by mapping the mixer output signal received from the decoder 101 to the loudspeaker signal based on the reproduction information. In other words, the rendering unit 104 may render an input signal by mapping an input signal corresponding to the input format to an output signal corresponding to the output format. In detail, the renderer 104 may render the input signal by mapping the input signal to the output signal using the panning coefficient determined by the determiner 103.

In step 201, the loudspeaker renderer 102 may set up a playback area composed of a plurality of loudspeakers. Here, the reproduction region may mean a line connecting two loudspeakers or a plane including three or more loudspeakers. In this case, the line may include a straight line or a curve (circumference).

At this time, it is assumed that the virtual sound source corresponding to the input signal is reproduced in the reproduction region rather than the position where the loudspeaker is present. In other words, the reproduction region is a virtual two-dimensional or three-dimensional space composed of a plurality of loudspeakers, and may mean a position at which the virtual sound source is reproduced.

In step 202, the loudspeaker renderer 102 may divide the playback area into a plurality of subregions. At this time, the reproduction region may be divided into K subregions. The subdivided subdivisions may or may not be the same as each other.

In operation 203, the loudspeaker renderer 102 may determine a detailed area in which the virtual sound source is located. As described above, since the playback region refers to a position at which the virtual sound source is played, the loudspeaker renderer 102 may determine in which detail region of the plurality of detailed regions constituting the playback region.

In step 204, the loudspeaker renderer 102 may determine a panning coefficient for playing the virtual sound source based on the detail region. At this time, the panning coefficient for the loudspeaker may be determined to be -1 to 1.

In step 205, the loudspeaker renderer 102 may render the input signal according to the panning coefficient.

Since the virtual sound image positioning method illustrated in FIG. 2 uses a result of grouping a playback region composed of loudspeakers into a plurality of detailed regions, it may be defined as a grouping-based panning technique.

A process of converting a format of an input signal having multiple channels will be described based on the virtual sound image positioning method described with reference to FIG. 2. That is, the process of converting the format of the input signal represents the process of rendering the input signal by mapping the input signal to the output signal.

In order to reproduce (M> 2, N> 2) sound sources representing M input signals with N-channel loudspeakers, an M-channel input signal is converted into an N-channel output signal. May be performed based on Equation 1 below.

Here, Y denotes an output signal reproduced through a loudspeaker corresponding to n channels (n = 1 to N), and may be expressed according to Equation 2 below.

X denotes an input signal corresponding to m channels (m = 1 to M) and may be expressed according to Equation 3 below.

In addition, A is an N × M matrix and may be configured with the panning coefficient described in FIG. 2. In this case, A may be expressed according to Equation 4 below.

Then, Equation 1 is expressed again as Equation 5.

Equation 5 may be simply expressed by Equation 6.

Assuming that the input signals of the M channel are input signals of 22.2 channels, 14.0 channels, 11.1 channels, and 9.0 channels, only channels marked with x may be actually included as shown in Table 1 below.

In addition, assuming that the output signal of the N channel is an output signal of 5.1, 8.1, and 10.1 channels, only channels marked with x may be actually included as shown in Table 2 below.

Hereinafter, a process of rendering an input signal by mapping an input signal of M channel to an output signal of N channel will be described. That is, a process of converting an input format into an output format will be described. In the following Equations 7 to 24, the left side of the equal sign indicates the channel number of the output signal, and the right side of the equal sign indicates the combination of the panning coefficient and the channel number of the input signal.

(1) 22.2 channel to 5.1 channel conversion

(2) 22.2 channel to 8.1 channel conversion

(3) Conversion from 22.2 channels to 10.1 channels

(4) Conversion from 14.0 channels to 5.1 channels

(5) Conversion from 14.0 channel to 8.1 channel

(6) Conversion from 14.0 channels to 10.1 channels

(7) 11.1 channel to 5.1 channel conversion

(8) Conversion from 11.1 channel to 8.1 channel

(9) Conversion from 11.1 channel to 10.1 channel

(10) Conversion from 9.0 channel to 5.1 channel

(11) Conversion from 9.0 channel to 8.1 channel

(12) Conversion from 9.0 channel to 10.1 channel

On the other hand, the virtual sound image positioning method proposed in FIG. Can be applied. On the other hand, even if the mapping relationship between the same input signal and the output signal, different panning coefficients may be applied for each region according to the frequency band of the input signal.

In step 301, the loudspeaker renderer 102 may determine whether the panning coefficient can be determined by two or less loudspeakers present on the plane. If it is determined that the panning coefficient can be determined, in step 304 the loudspeaker renderer 102 may determine the panning coefficient for the virtual sound source using the horizontal angle based on the two loudspeakers. That is, the panning coefficient can be determined to pan two loudspeakers located on the plane.

Here, the panning coefficient for the virtual sound source may be determined based on Equation 25 below.

Here, the angle between the reference line facing the front of the listener and the right loudspeaker is

The angle between the baseline facing the listener's front and the left loudspeaker is 360-

It can be expressed as. Meanwhile,

Denotes an angle formed by the virtual sound source and the reference line facing the front of the listener.

The cos is the gain applied to the left and right loudspeakers respectively.

And sin

It is expressed as Then cos

And sin

The sum of the squares of equals to 1, indicating that the sum of powers allocated to the left and right loudspeakers is always constant.

If it is determined in step 301 that the panning coefficient cannot be determined, in step 302 the loudspeaker renderer 102 may determine whether the panning coefficient can be determined by three loudspeakers on the plane. . If it is determined that the panning coefficient can be determined, in step 304 the loudspeaker renderer 102 may determine the panning coefficient for the virtual sound source using the horizontal angle based on the three loudspeakers. That is, the panning coefficient can be determined to pan three loudspeakers located on the plane.

If it is determined in step 302 that the panning coefficient cannot be determined, in step 303, the loudspeaker renderer 102 may determine the panning coefficient for the virtual sound source using the vertical angle. In the case of step 303, this means a case where the virtual sound source is located on a plane where two or three loudspeakers exist. In this case, the loudspeaker renderer 102 may select the loudspeaker whose position is the closest to the virtual sound source, and determine a panning coefficient for the virtual sound source existing at the position where two or three loudspeakers are projected at the same vertical angle. have.

A process of converting a format of an input signal having multiple channels will be described based on the virtual sound image positioning method described with reference to FIG. 3. That is, the process of converting the format of the input signal represents the process of rendering the input signal by mapping the input signal to the output signal. The rendering process of FIG. 3 may determine the same process as that of Equations 1 to 6 described with reference to FIG. 2.

Assuming that the input signal of the M channel is an input signal of 22.2 channels, 14.0 channels, 11.1 channels, and 9.0 channels, only the channels marked with x may be actually included as shown in Table 1 above.

In addition, assuming that an output signal of N channels is an output signal of 5.1 channels and 10.1 channels, only channels marked with x may be actually included as shown in Table 3 below.

Hereinafter, a process of rendering an input signal by mapping an input signal of M channel to an output signal of N channel will be described. That is, a process of converting an input format into an output format will be described. In the following Equations 26 to 33, the left side of the equal sign indicates the channel number of the output signal, and the right side of the equal sign indicates the combination of the panning coefficient and the channel number of the input signal.

(1) 22.2 channel to 5.1 channel conversion

(2) Conversion from 22.2 channels to 10.1 channels

(3) Conversion from 14.0 channels to 5.1 channels

(4) Conversion from 14.0 channels to 10.1 channels

(5) 11.1 channel to 5.1 channel conversion

(6) Conversion from 11.1 channel to 10.1 channel

(7) Conversion from 9.0 channel to 5.1 channel

(8) Conversion from 9.0 channel to 10.1 channel

Panning when the vertical angles of the input channel corresponding to the input signal and the output channel corresponding to the output signal are different, such as when the input signal representing the upstream channel is reproduced by a loudspeaker located on a horizontal plane in Equations 27 to 33 Some of the coefficients can be used as negative numbers. As a result, the virtual sound source having a vertical angle different from that of the loudspeaker can be reproduced more effectively.

On the other hand, the proposed method is applied not only in the time domain but also in the frequency domain according to transform using fft (fast Fourier transform), or the subband domain according to transform using quadrature mirror filter (QMF) and / or Hybrid filter. can do. In this case, even when the same input / output channel is connected, different panning coefficients may be applied for each region according to a frequency band.

According to FIG. 3, even if the loudspeaker is not present at the position defined by the standardized output format, the panning coefficient can be determined by providing a horizontal angle and a vertical angle with respect to the loudspeaker. In addition, a distance variation between loudspeakers from which output signals converted from an input signal are reproduced may also be used when determining a panning coefficient.

The equations described in FIGS. 2 and 3 may be applied differently for each sample or frame through a flag. Here, Equation relates to a virtual sound image positioning method for reproducing a virtual sound source, the input signal of the M channel can be converted to the output signal of the N channel by different methods for each sample or frame.

Referring to FIG. 4, there are two

loudspeakers

401, 402. At this time, the left loudspeaker 401 and the right loudspeaker 402 are positioned around the listener 403. Here, it is assumed that the

loudspeakers

401 and 402 exist in a two-dimensional space (line or plane).

The playback area may be set based on the left loudspeaker 401 and the right loudspeaker 402 around the listener 403. Then, the reproduction region may be divided into K subregions (region 1, region 2, region K). These reproduction regions are grouped into sub-regions, and the panning coefficient may be determined based on which sub-region in which the virtual sound source to be reproduced is located.

The left loudspeaker 501 and the right loudspeaker 502 are positioned around the listener 504. At this time, the virtual sound source 503 may be reproduced by being located at the circumference connecting the left loudspeaker 501 and the right loudspeaker 502.

On the other hand, the circumference may be divided into detailed regions constituting the reproduction region. FIG. 5 illustrates a case in which a virtual sound source is reproduced by dividing three sub-regions into a reproduction region constituting the left loudspeaker 501 and the right loudspeaker 502. FIG. However, according to one embodiment it is not necessary to divide evenly.

At this time, when the angle formed by the left loudspeaker 501 and the right loudspeaker 502 is θ, and the angle corresponding to the detail region is θd, the process of determining the panning coefficient according to the virtual sound image positioning method is as follows.

For example, when the virtual sound source 503 is reproduced on the circumference corresponding to the subregion region 1, all the power is allocated to the left loudspeaker 501 in order to reproduce the virtual sound source 503. For example, when θ is 60 degrees, when the virtual sound source is reproduced at 0 degrees to 20 degrees when θ _d is 20 degrees, the virtual sound source may be reproduced by the left loudspeaker 501 at 0 degrees.

As another example, when the virtual sound source 503 is reproduced on the circumference corresponding to the detail region region 2, the same power is distributed to the left loudspeaker 501 and the right loudspeaker 502 to reproduce the virtual sound source 503. Can be. For example, when θ is 60 degrees, when the virtual sound source is reproduced at 20 degrees to 40 degrees when θd is 20 degrees, the input signal is input to the left loudspeaker 501 and the right loudspeaker 502.

The virtual sound source can be reproduced by distributing the power of.

As another example, when the virtual sound source 503 is reproduced on the circumference corresponding to the subregion region 3, all the power is allocated to the right loudspeaker 502 to reproduce the virtual sound source 503. For example, when θ is 60 degrees, when θd is 20 degrees, when the virtual sound source is reproduced at 40 degrees to 60 degrees, the virtual sound source may be reproduced by the right loudspeaker 502 at 60 degrees.

In the case of Fig. 5, the case where the reproduction region is divided into three subregions is explained. In contrast, when the reproduction region is divided into two sub-regions, the loudspeaker may be selected according to the position of the virtual sound source to be reproduced.

FIG. 6 illustrates a case where the

loudspeakers

601, 602, and 603 exist in a three-dimensional space unlike FIG. 5. For example, at least one of the

loudspeakers

601, 602, 603 is present in a plane, and the rest is arranged in a three-dimensional space rather than a plane. In other words, FIG. 6 means a case where the loudspeaker is present not only in the horizontal direction where the listener is located but also in the vertical direction (upward or downward).

In FIG. 6, a reproduction area composed of three

loudspeakers

601, 602, and 603 may be divided into K subregions. The reproduction region may be divided evenly or not evenly. Then, the panning coefficient may be determined so that power may be allocated to a loudspeaker associated with the detailed region corresponding to the position where the virtual sound source is reproduced among the K detailed regions. The panning coefficient may have a value between -1 and 1.

Referring to FIG. 7, a case in which a reproduction area composed of

loudspeakers

701, 702, and 703 existing in a three-dimensional space is divided into four subregions is illustrated. That is, four detailed regions may be determined by three

loudspeakers

701, 702, and 703. Then, the panning coefficient for the virtual sound source may be determined according to which of the four subregions the virtual sound source to be reproduced exists in.

The apparatus described above may be implemented as a hardware component, a software component, and / or a combination of hardware components and software components. For example, the devices and components described in the embodiments may be, for example, processors, controllers, arithmetic logic units (ALUs), digital signal processors, microcomputers, field programmable arrays (FPAs), It may be implemented using one or more general purpose or special purpose computers, such as a programmable logic unit (PLU), microprocessor, or any other device capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the operating system. The processing device may also access, store, manipulate, process, and generate data in response to the execution of the software. For convenience of explanation, one processing device may be described as being used, but one of ordinary skill in the art will appreciate that the processing device includes a plurality of processing elements and / or a plurality of types of processing elements. It can be seen that it may include. For example, the processing device may include a plurality of processors or one processor and one controller. In addition, other processing configurations are possible, such as parallel processors.

The software may include a computer program, code, instructions, or a combination of one or more of the above, and configure the processing device to operate as desired, or process it independently or collectively. You can command the device. Software and / or data may be any type of machine, component, physical device, virtual equipment, computer storage medium or device in order to be interpreted by or to provide instructions or data to the processing device. Or may be permanently or temporarily embodied in a signal wave to be transmitted. The software may be distributed over networked computer systems so that they may be stored or executed in a distributed manner. Software and data may be stored on one or more computer readable recording media.

The method according to the embodiment may be embodied in the form of program instructions that can be executed by various computer means and recorded in a computer readable medium. The computer readable medium may include program instructions, data files, data structures, etc. alone or in combination. The program instructions recorded on the media may be those specially designed and constructed for the purposes of the embodiments, or they may be of the kind well-known and available to those having skill in the computer software arts. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape, optical media such as CD-ROMs, DVDs, and magnetic disks, such as floppy disks. Magneto-optical media, and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like. Examples of program instructions include not only machine code generated by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like. The hardware device described above may be configured to operate as one or more software modules to perform the operations of the embodiments, and vice versa.

Although the embodiments have been described by the limited embodiments and the drawings as described above, various modifications and variations are possible to those skilled in the art from the above description. For example, the described techniques may be performed in a different order than the described method, and / or components of the described systems, structures, devices, circuits, etc. may be combined or combined in a different form than the described method, or other components. Or even if replaced or substituted by equivalents, an appropriate result can be achieved. Therefore, other implementations, other embodiments, and equivalents to the claims are within the scope of the claims that follow.

Claims

Determining reproduction information of at least one loudspeaker usable in the output channel to reproduce the virtual sound source corresponding to the input channel;

Rendering an input signal using the reproduction information;

Virtual phonetic stereotactic method comprising a.
The method of claim 1,

The loudspeakers are located in a two-dimensional space or three-dimensional space.
The method of claim 1,

Determining the reproduction information of the loudspeakers,

Dividing a playback area composed of the loudspeakers into a plurality of detailed areas;

Determining a detailed area in which the virtual sound source to be reproduced is located among the divided detailed areas;

Determining a panning coefficient of the loudspeakers based on the determined subregion.

Virtual phonetic stereotactic method comprising a.
The method of claim 3,

The dividing step,

When the two loudspeakers are two, the reproduction region corresponding to the circumference connecting the two loudspeakers is divided into a plurality of detailed regions,

The determining step,

The virtual sound image positioning method of determining the detailed region in which the virtual sound source is located among the divided sub-regions.
The method of claim 3,

The dividing step,

If the loudspeakers are K (K> 3), the playback area composed of the loudspeakers is divided into X (X≥K) sub-regions,

The determining step,

The virtual sound image positioning method of determining the detailed region in which the virtual sound source is located among the divided sub-regions.
Establishing a playback region comprised of at least one loudspeakers available in the output channel;

Dividing the playback area into a plurality of detailed areas;

Determining a detailed area in which the virtual sound source to be reproduced is located among the divided detailed areas;

Determining a panning coefficient for reproducing the virtual sound source based on the determined subregion; And

Rendering an input signal based on the panning coefficient

Virtual phonetic stereotactic method comprising a.
The method of claim 6,

The loudspeakers are located in a two-dimensional space or three-dimensional space.
The method of claim 6,

The dividing step,

When the two loudspeakers are two, the reproduction region corresponding to the circumference connecting the two loudspeakers is divided into a plurality of detailed regions,

The determining step,

The virtual sound image positioning method of determining the detailed region in which the virtual sound source is located among the divided sub-regions.
The method of claim 6,

The dividing step,

If the loudspeakers are K (K> 3), the playback area composed of the loudspeakers is divided into X (X≥K) sub-regions,

The determining step,

The virtual sound image positioning method of determining the detailed region in which the virtual sound source is located among the divided sub-regions.
Determining whether or not a panning coefficient for the virtual sound source can be determined using the loudspeakers located on the plane;

Determining a panning coefficient for the virtual sound source based on the determination result

Virtual phonetic stereotactic method comprising a.
The method of claim 10,

Determining the panning coefficient,

And determining a panning coefficient for the virtual sound source based on a horizontal angle when the panning coefficient can be determined using a loudspeaker located on the plane.
The method of claim 10,

Determining the panning coefficient,

And when a panning coefficient cannot be determined using a loudspeaker located on the plane, determining a panning coefficient for the virtual sound source based on a vertical angle.
Determining whether loudspeakers are located in a two-dimensional space or a three-dimensional space;

Determining a panning coefficient for the virtual sound source based on the determination result

Virtual phonetic stereotactic method comprising a.
The method of claim 13,

Determining the panning coefficient,

And determining the panning coefficient for the virtual sound source based on a horizontal angle when the loudspeakers are located in a two-dimensional space.
The method of claim 13,

Determining the panning coefficient,

And determining the panning coefficient for the virtual sound source based on a vertical angle when the loudspeakers are located in a three-dimensional space.
A determination unit that determines reproduction information of at least one loudspeaker usable in the output channel to reproduce the virtual sound source corresponding to the input channel; And

Rendering unit for rendering the input signal using the reproduction information

Loudspeaker renderer comprising.
A determination unit for determining a panning coefficient for reproducing the virtual sound source based on the subdivided region of the reproduction region composed of at least one loudspeaker available in the output channel; And

A rendering unit that renders an input signal based on the panning coefficient

Loudspeaker renderer comprising.
A determination unit that determines whether a panning coefficient for the virtual sound source can be determined using the loudspeakers located on the plane, and determines a panning coefficient for the virtual sound source based on the determination result; And

A rendering unit that renders an input signal based on the panning coefficient

Loudspeaker renderer comprising.
A determination unit that determines whether the loudspeakers are located in a two-dimensional space or a three-dimensional space, and determines a panning coefficient for the virtual sound source based on the determination result; And

A rendering unit that renders an input signal based on the panning coefficient

Loudspeaker renderer comprising.
The method of claim 19,

The determining unit,

When the loudspeakers are located in a two-dimensional space, the panning coefficient for the virtual sound source is determined based on a horizontal angle,

A loudspeaker renderer that determines a panning coefficient for the virtual sound source based on a vertical angle when the loudspeakers are located in three-dimensional space.