KR20170059135A - High-Quality 3D Audio Generation Method and System - Google Patents

High-Quality 3D Audio Generation Method and System Download PDF

Info

Publication number
KR20170059135A
KR20170059135A KR1020150163039A KR20150163039A KR20170059135A KR 20170059135 A KR20170059135 A KR 20170059135A KR 1020150163039 A KR1020150163039 A KR 1020150163039A KR 20150163039 A KR20150163039 A KR 20150163039A KR 20170059135 A KR20170059135 A KR 20170059135A
Authority
KR
South Korea
Prior art keywords
audio
channel
sound
original sound
output
Prior art date
Application number
KR1020150163039A
Other languages
Korean (ko)
Inventor
이영한
조충상
김제우
김용환
Original Assignee
전자부품연구원
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 전자부품연구원 filed Critical 전자부품연구원
Priority to KR1020150163039A priority Critical patent/KR20170059135A/en
Publication of KR20170059135A publication Critical patent/KR20170059135A/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S5/00Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation 
    • H04S5/005Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation  of the pseudo five- or more-channel type, e.g. virtual surround
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/13Aspects of volume control, not necessarily automatic, in stereophonic sound systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)

Abstract

A method and system for producing high quality 3D audio is provided. A 3D audio generating method according to an embodiment of the present invention processes an audio original sound through one of a first channel and a second channel and processes the filtered audio original sound through the other channel. Thus, filtering is not performed in both channels, and sound quality improvement can be expected in providing 3D audio.

Description

TECHNICAL FIELD [0001] The present invention relates to a high-quality 3D audio generation method and system,

The present invention relates to audio technology, and more particularly, to a method and system for generating high quality 3D audio.

The HRTF (Head-Related Transfer Function) used in 3D audio is to measure the difference between the left and right audio signals to create an impulse response and apply it to the audio signal to generate a 3D audio effect.

FIG. 1 is a view showing the principle of conventional 3D audio. As shown in Fig. 1, in the 3D audio, what is transmitted from the sound source S to the ears is defined as filters.

That is, the source S can be seen that each converted to audio signals S L × H, S H × R of the left and right channel via the left and right filters H L, H R. As a result, the user can feel the difference between the channels, and the difference ultimately becomes the directionality.

Therefore, if H L and H R are known according to the angle of the sound source S, the direction of the sound source S can be added through the filtering process as shown in FIG. FIG. 2 is a diagram illustrating a 3D audio generation method based on HRTF.

However, since filtering is performed on both of the left and right channels, sound quality deterioration occurs in both the left and right channels. In particular, since the characteristics of the HRTF measurement space are applied to the audio generation, it is significantly different from the original sound.

SUMMARY OF THE INVENTION It is an object of the present invention to provide a high quality 3D audio generating method and system capable of minimizing sound quality deterioration.

According to an aspect of the present invention, there is provided a 3D audio generating method including: a first processing step of processing audio original sound through one of a first channel and a second channel; And a second processing step of causing the filtered audio original sound to be output through another one of the first channel and the second channel.

In the first processing step, the audio source sound is output through a channel having a short sound source distance among the first channel and the second channel.

In addition, the first processing step may not filter the audio original sound.

According to another aspect of the present invention, there is provided a 3D audio generating method comprising: changing a channel on which audio original sound is output and a channel on which a filtered audio original sound is to be output, when a channel having a short sound source distance is changed due to movement of a sound source; As shown in FIG.

Further, the filtering may use HRTF components made up of the HRTF component for the first channel and the HRTF component for the second channel.

According to another aspect of the present invention, there is provided a 3D audio generation system including: And an audio processor for processing the audio original sound to be output through one of the first channel and the second channel and processing the audio original sound filtered through the other of the first channel and the second channel to be output.

As described above, according to the embodiments of the present invention, filtering is not performed in both channels, and sound quality improvement can be expected in providing 3D audio.

In addition, according to the embodiments of the present invention, since only one channel is subjected to filtering, directional audio can be provided using a minimum amount of calculation and a memory.

FIG. 1 is a diagram illustrating the principle of conventional 3D audio,
2 is a diagram illustrating a 3D audio generation method based on HRTF,
FIG. 3 is a block diagram illustrating the principle of 3D audio generation according to an embodiment of the present invention.
FIG. 4 is a block diagram illustrating a method of generating 3D audio according to an exemplary embodiment of the present invention.
5 is a block diagram of a 3D audio generation system according to an embodiment of the present invention;
6 is a flowchart provided for explanation of the 3D audio generation process by the audio processor 120,
FIG. 7 shows the original sound for the case where the sound source is located on the right side,
FIG. 8 is a diagram illustrating 3D audio generated according to a conventional method,
9 is a spectrogram of 3D audio generated by a method according to an embodiment of the present invention.

Hereinafter, the present invention will be described in detail with reference to the drawings.

3D audio basically recognizes directions, distances, etc. by using difference of signals acquired from both ears.

It is a method to improve the disadvantage that the sound quality of 3D audio is deteriorated compared to the original sound due to the characteristics of the HRTF measurement space as well as the direction which can be felt by the ear when HRTF (Head-Related Transfer Function) is applied In the embodiment of the present invention, 3D audio is generated so that original sound is output from one ear.

That is, in the embodiment of the present invention, the difference between the filtering of one channel, that is, the difference of the impulse response between left and right channels is applied to only one channel, rather than filtering all channels.

FIG. 3 is a diagram provided in the explanation of the principle of 3D audio generation according to the embodiment of the present invention. In Fig. 3, it is assumed that the sound source is located close to the left ear. That is, it is assumed that the sound source distance of the left channel is shorter than the sound source distance of the right channel.

For high quality 3D audio generation, as shown in FIG. 3,

Figure pat00001
. Then, the audio signals of the left and right channels through H L , H R are S,
Figure pat00002
. That is,
Figure pat00003
Is positioned at a specific position.

FIG. 4 is a diagram illustrating a 3D audio generation method according to an embodiment of the present invention. 4, in the 3D audio generation method according to the embodiment of the present invention, the original sound S is provided to the left channel, and the ratio of the left HRTF component to the right HRTF component

Figure pat00004
).

This can be seen as a 3D audio generation method that does not perform filtering on both channels by composing a set of Impulse responses in the right and left and generating one impulse response.

The 3D audio generated according to the method shown in FIG. 4 has similar directionality to the 3D audio generated according to the conventional method, and the sound quality is improved. This is because the original sound is output from the channel close to the sound source.

Also, since the filtering process is reduced to one channel unlike the conventional method in which the filtering is performed for two channels, the amount of calculation for the memory storing the filter coefficients and the filtering process is reduced to half.

5 is a block diagram of a 3D audio generation system according to an embodiment of the present invention. The 3D audio generating system according to the embodiment of the present invention includes an audio providing unit 110, an audio processor 120, and audio output units 130-L, R, as shown in FIG.

The audio data providing unit 110 may be implemented by a microphone for acquiring an audio source sound, a storage medium for storing an audio source sound, and a communication unit for receiving an audio source sound from an external device or an external network.

The audio processor 120 generates 3D audio from the audio sound provided by the audio data providing unit 110. [ The 3D audio generation process by the audio processor 120 will be described later in detail with reference to FIG.

The audio output units 130-L and 130-L are means for outputting the 3D audio generated by the audio processor 120. In the audio output unit-L 130-L, And audio of the right channel is output in the audio channel 130-R.

FIG. 6 is a flowchart provided for explanation of the 3D audio generation process by the audio processor 120. FIG.

6, when an audio original sound is input from the audio data providing unit 110 to the audio processor 120 in step S210, the audio processor 120 determines a sound source distance in step S220.

In step S220, the source distance can be grasped not only by analyzing the sound source, but also by the input / setting contents of the 3D audio producer.

Next, the audio processor 120 generates 3D audio (S230, S240) such that the original sound is output for the channel whose sound source distance is short, and the filtered original sound is output for the channel whose sound source distance is long.

Steps S210 to S240 are repeated until 3D audio generation is completed (S250). In this process, when a change occurs in a channel having a short sound source distance due to the movement of the sound source, the channel to which the audio original sound is output and the channel to which the filtered audio original sound is output are changed.

The 3D audio quality enhancement according to the 3D audio generation method according to the embodiment of the present invention will be verified with reference to FIGS. 7 to 9. FIG.

FIG. 7 is an original sound when a sound source is located on the right side, FIG. 8 is 3D audio generated according to a conventional method, and FIG. 9 is a spectrogram of 3D audio generated by the method according to an embodiment of the present invention.

As can be seen from a comparison with the spectrogram of the original sound shown in FIG. 7, the conventional method shown in FIG. 8 shows that both channels of the original sound are modified, whereas the method according to the embodiment of FIG. Is the same as the original sound, and it can be confirmed that only one channel is modified.

Up to now, preferred embodiments have been described in detail for high quality 3D audio generation methods and systems.

In order to improve the disadvantage that the sound quality of the 3D audio is deteriorated compared to the original sound due to the characteristic of the HRTF measurement space as well as the direction that the ear can feel when the HRTF is applied, In one ear, 3D audio was generated to output the original sound.

That is, in the embodiment of the present invention, the difference between the filtering of one channel, that is, the difference of the impulse response between left and right channels is applied to only one channel, not the filtering of both channels.

While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, but, on the contrary, It will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the present invention.

110: Audio Offering
120: audio processor
130-L: Audio output section-L
130-R: Audio output section -R

Claims (6)

A first processing step of processing an audio original sound through one of a first channel and a second channel so as to be output; And
And a second processing step of processing the filtered audio original sound through another one of the first channel and the second channel to output the filtered audio original sound.
The method according to claim 1,
In the first processing step,
Wherein processing is performed such that audio original sound is output through a channel having a short sound source distance among the first channel and the second channel.
The method of claim 2,
In the first processing step,
Wherein the audio source sound is not filtered.
The method of claim 2,
And changing a channel on which audio original sound is to be output and a channel on which filtered audio original sound is to be output, when the channel having a short sound source distance is changed due to the movement of the sound source.
The method according to claim 1,
The filtering,
Wherein an HRTF component consisting of a ratio of an HRTF component to a first channel and an HRTF component to a second channel is used.
A providing unit for providing an audio original sound;
And an audio processor for processing the audio original sound to be outputted through one of the first channel and the second channel and for processing the audio original sound filtered through the other one of the first channel and the second channel to be outputted, 3D audio production system.
KR1020150163039A 2015-11-20 2015-11-20 High-Quality 3D Audio Generation Method and System KR20170059135A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020150163039A KR20170059135A (en) 2015-11-20 2015-11-20 High-Quality 3D Audio Generation Method and System

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020150163039A KR20170059135A (en) 2015-11-20 2015-11-20 High-Quality 3D Audio Generation Method and System

Publications (1)

Publication Number Publication Date
KR20170059135A true KR20170059135A (en) 2017-05-30

Family

ID=59052847

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020150163039A KR20170059135A (en) 2015-11-20 2015-11-20 High-Quality 3D Audio Generation Method and System

Country Status (1)

Country Link
KR (1) KR20170059135A (en)

Similar Documents

Publication Publication Date Title
CN110771182B (en) Audio processor, system, method and computer program for audio rendering
US10492017B2 (en) Audio signal processing apparatus and method
KR20180135973A (en) Method and apparatus for audio signal processing for binaural rendering
EP3222059B1 (en) An audio signal processing apparatus and method for filtering an audio signal
CN107431871B (en) audio signal processing apparatus and method for filtering audio signal
JP2008522483A (en) Apparatus and method for reproducing multi-channel audio input signal with 2-channel output, and recording medium on which a program for doing so is recorded
KR20180075610A (en) Apparatus and method for sound stage enhancement
US10524081B2 (en) Sound processing device, sound processing method, and sound processing program
MX2023005646A (en) Audio apparatus and method of audio processing.
JP2020502562A (en) Method and apparatus for adaptive control of a correlation separation filter
CN107017000B (en) Apparatus, method and computer program for encoding and decoding an audio signal
US11632643B2 (en) Recording and rendering audio signals
CN114402631A (en) Separating and rendering a voice signal and a surrounding environment signal
DE102021103210A1 (en) Surround sound playback based on room acoustics
US20170272881A1 (en) Audio signal processing apparatus and method for modifying a stereo image of a stereo signal
US11012774B2 (en) Spatially biased sound pickup for binaural video recording
CN108966110B (en) Sound signal processing method, device and system, terminal and storage medium
KR20170059135A (en) High-Quality 3D Audio Generation Method and System
EP4264963A1 (en) Binaural signal post-processing
US11595754B1 (en) Personalized headphone EQ based on headphone properties and user geometry
JP2016039568A5 (en)
US11546687B1 (en) Head-tracked spatial audio
CN112653985B (en) Method and apparatus for processing audio signal using 2-channel stereo speaker
KR100673288B1 (en) System for providing audio data and providing method thereof
WO2024036113A1 (en) Spatial enhancement for user-generated content