CN109036456B - Method for extracting source component environment component for stereo - Google Patents
Method for extracting source component environment component for stereo Download PDFInfo
- Publication number
- CN109036456B CN109036456B CN201811094388.0A CN201811094388A CN109036456B CN 109036456 B CN109036456 B CN 109036456B CN 201811094388 A CN201811094388 A CN 201811094388A CN 109036456 B CN109036456 B CN 109036456B
- Authority
- CN
- China
- Prior art keywords
- component
- channel
- source
- environment
- circle
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 22
- 238000000605 extraction Methods 0.000 claims abstract description 33
- 230000007613 environmental effect Effects 0.000 abstract description 18
- 238000000513 principal component analysis Methods 0.000 abstract description 12
- 238000000354 decomposition reaction Methods 0.000 abstract description 5
- 230000005236 sound signal Effects 0.000 description 20
- 230000000875 corresponding effect Effects 0.000 description 9
- 238000002156 mixing Methods 0.000 description 6
- 238000001914 filtration Methods 0.000 description 4
- 238000003672 processing method Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 241001272567 Hominoidea Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000004091 panning Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
Abstract
The invention discloses a method for extracting a source component environment component for stereo, which aims to solve the technical problem of correlation between left and right channel environment components obtained by decomposition by adopting a principal component analysis algorithm. The invention adopts a new method for extracting the source environment components, ensures that the left channel source component is vertical to the environment components by defining the amplitude ratio beta between the two channel environment components, but does not limit the right channel source component to be vertical to the environment components, and only requires no correlation between the environment components. So that a unique solution can be derived by source environment component extraction. The method can be used for multi-audio processing of expanding stereo audio into multi-channel audio, and can adapt to various different sound scenes by adjusting the amplitude ratio beta between the environmental components.
Description
Technical Field
The invention belongs to the technical field of audio processing, and particularly relates to a technology for extracting a source component environment component of stereo.
Background
In pursuit of a more immersive listening experience, multichannel playback devices are becoming more popular, such as loudspeaker systems, home cinema systems, and even gaming headsets, which are provided with physical multichannel, i.e. several independently controlled sound units. The audio source format that is widely used today is dominated by stereo audio. Stereo audio has advantages such as easy transmission and save, and the market reserves are huge. It is apparent that there is a mismatch between a multi-channel playing device and a stereo audio, and thus there is a practical problem in how to extend a stereo audio signal into a multi-channel audio signal, thereby maximizing the use of the multi-channel device to restore the stereoscopic impression and the presence of an audio recording site. This extension of a stereo audio signal into a multi-channel audio signal is known in the art as Upmix (Upmix) technology.
The earliest upmixing methods were only used to generate stereo audio from mono audio. Due to the lack of sound field information in the mono audio, the design goal of this up-mixing method is to keep the perceived loudness of the produced stereo audio consistent with the original mono audio. On this basis, carlos avenano and Jean-Marc joint have conducted extensive research on stereo to multi-channel audio upmixing methods. The upmixing concept based on source environment component Extraction (PAE) was published by Michael m.goodwin and Jean-Marc j on the international acoustic, speech and signal processing conference of 2007. They consider that from the perspective of an acoustic scene, both foreground and background sounds components remain in a stereo audio signal, and assume that only one foreground sound is present at a time. In the left and right channels of a stereo audio signal, foreground sounds have significant directionality, such as the sound of a cannonball breaking through the night sky in a film. Therefore, the foreground sound is also referred to as a source Component (Primary Component). Meanwhile, background sound provides atmosphere information that set off foreground sound, and is called Ambient Component (Ambient Component).
In patent application publication No. CN102037507A, an enhanced parametric stereo up-mixing device for generating a left signal and a right signal from a mono down-mix signal is disclosed which does not suffer from instability caused by interaural phase difference synthesis by improving the audio quality of the generated left and right signals without additional bit rate increase.
When the left and right channels of the stereo audio signal are respectively changed from x L (t) and x R (t) source components p of the left and right channels, respectively L (t)、p R (t) and an ambient component a L (t)、a R (t) satisfies: x is the number of L (t)=p L (t)+a L (t),x R (t)=p R (t)+a R (t);
And set p R (t)=kp L (t), where k is a Primary sound phase Factor (Primary Panning Factor), providing spatial location information of a sound source using Interaural Level Difference (ILD). Ideally, the ambient components of the left and right channelsThere is no correlation, nor is there a correlation between the ambient component and the source component.
In patent application publication No. CN101902679A, a processing method is disclosed for simulating a 5.1 channel audio signal with a stereo audio signal. In the scheme, a left channel signal and a right channel signal of an original signal are input firstly, and the left channel signal and the right channel signal are processed to generate a front channel signal; extracting low-frequency components of the stereo signal to generate a low-frequency effect channel signal; and performing difference operation on the left and right sound channel signals to obtain stereo left and right sound channel difference, performing time delay and amplitude attenuation, and generating simulated left and right surround channel signals by time-varying all-pass filtering to finally realize the simulation of 5.1 sound channels. A
In the patent application with publication number CN1791285A, a signal processing method for simulating a 5.1-channel surround sound by using a dual-channel stereo is disclosed, in which three front-channel signals are generated according to original left and right channel input signals, a difference is made between the left and right channel signals to obtain environmental acoustic information, a pair of left and right surround sound signals which are decorrelated are obtained after delay, band-pass filtering and random delay processing of six sub-bands, and the signals are finally fed back to a 5.1-channel surround sound system for retransmission.
The schemes disclosed in both of the above patent applications are directed to a processing method for stereo audio up-mixing to 5.1 channel audio. In contrast, the up-mixing method based on the source environment component extraction has no particular limitation on the number of playback channels. The multi-channel playback step is shown in fig. 1, and specifically includes:
(1) Extracting source environment components of an input stereo audio signal, and respectively extracting a source component and an environment component of a left channel and a source component and an environment component of a right channel;
(2) Analyzing and processing the source component and the environment component respectively;
(3) Obtaining multi-channel source components from the analyzed and processed source components to more accurately present the sound source position;
(4) Obtaining a multi-channel environment component from the analyzed and processed environment component to increase the sound presence;
(5) And synthesizing the multi-channel source component and the environment component into multi-channel audio.
(6) And playing back the multichannel audio by using multichannel equipment.
While Michael m, goodwin and Jean-Marc job propose a concept of extracting a source environment Component, a classical method of extracting a source environment Component is also proposed, that is, a Principal Component Analysis (PCA) algorithm is used. The principal component analysis algorithm can find the principal component direction of the data, then a new coordinate system is formed by the principal components, and the data is projected to the new coordinate system from the original coordinate system. In the process of extracting the source environment component, the component projected to the principal component direction is the source component, and the component projected to the direction orthogonal to the principal component direction is the environment component.
The audio signal is analyzed in time-frequency domain, so that only one complex value is provided at a corresponding frequency point at any time, and the complex value can be represented by a vector. The geometric relationship between the left and right channels can be obtained by rendering a vector diagram.
By X L 、X R Representing left and right channel components of a stereo audio signal in the frequency domain, the corresponding source and ambient components being denoted by A L 、A R Is represented by, and P L 、P R Referring to the drawings, FIG. 2 shows a geometric representation of the extraction of the source environment components based on a principal component analysis algorithm. The left and right channel source components and the environment component obtained by using the principal component analysis algorithm are respectively as follows:
as can be seen from the above extraction results, the principal component analysis algorithm decomposes the stereo audio signal into uncorrelated source and ambient components. An acoustic phase is introduced between the source components in the two channels. However, the introduction of the acoustic phase between the two in-channel ambient components also results in the two in-channel ambient components being correlated, which does not meet the ideal condition of no correlation between the two in-channel ambient components, so it is necessary to improve the manner of extracting the ambient components to solve the related technical problem.
Disclosure of Invention
The invention aims to: in view of the above existing problems, a new source environment component extraction method is provided.
The technical scheme of the invention is as follows:
a source component ambient component extraction method for stereo sound, comprising the steps of:
step S1: inputting an adjustable parameter β, wherein the parameter represents a magnitude ratio between left and right channel ambient components;
step S2: with left channel frequency domain signal X L Draw a circle for the diameter, marked as circle O 1 ;
And step S3: acquiring a channel source component and an environment component:
at the circle O 1 Taking a point Q at last to obtain a vectorAnd the vector perpendicular theretoWhere point 0 represents the origin and point D is on circle O 1 C, removing;
will vectorAs a left channel source component P L Vector ofAs a left channel ambient component A L ;
Will be the component P L Elongation to kP L Constructing the right source component P R Wherein k represents a source dephasing factor;
based on the right channel source component P R And right channel frequency domain signal X R Constructing a right channel ambient component A R =X R -P R ;
And step S4: with a left channel ambient component A L The division of the model (A) by the adjustable parameter beta is taken as the radius, the origin 0 is taken as the center of a circle, and the circle is recorded as a circle O 2 ;
Determining a right channel ambient component A R Whether or not it falls on the circle O 2 If yes, the currently obtained component A is added L 、A R 、P L And P R As a result of the extraction; otherwise, continue to step S3.
In the invention, the purpose of ensuring that the left sound channel source component is vertical to the environment component is achieved by adjusting the value of the parameter beta. The implementation means is that the user adjusts based on the scene requirement. In the invention, the optimal extraction of the environmental component of the source component is achieved by adjusting the value of beta. The implementation means is that the value of beta is adjusted by a user based on scene requirements.
Further, the extraction method of the invention can be realized by adopting the following steps:
inputting an adjustable parameter beta;
calculating the frequency domain signal X of the left channel L Phase angle theta of 0 Frequency domain signal X of right channel R Phase angle theta of 1 ;
According to the formulaDetermining the left source component P L And X L The included angle theta; further obtain a left source component P L ;
According to P R =kP L Obtaining a right channel source component P R Then according to X L =P L +A L 、X R =P R +A R Obtaining a left and a right channel environment component A L 、A R 。
In summary, due to the adoption of the technical scheme, the invention has the beneficial effects that: the invention can effectively solve the technical problem related to the left and right channel environment components obtained by decomposing the principal component analysis algorithm by introducing the adjustable parameter beta (the amplitude ratio between the left and right channel environment components) to extract the source components and the environment components, and can adapt to various different sound scenes, and the decomposition performance of the stereo audio signal is more excellent.
Drawings
FIG. 1 is a block diagram of multi-channel audio playback based on source ambient component extraction
FIG. 2 geometric representation of source environment component extraction based on principal component analysis algorithm
FIG. 3 is a geometric representation of the source environment component extraction for the new algorithm
FIG. 4 is a time domain waveform of the original source component of the left channel;
FIG. 5 is a time domain waveform of a left channel original ambient component;
FIG. 6 is a time domain waveform of a right channel original ambient component;
FIG. 7 is a time domain waveform of the left channel source component extracted by the new source environment component extraction method according to the present invention when the value of β is 2;
FIG. 8 is a time domain waveform of the left channel environmental component extracted by the new source environmental component extraction method according to the present invention when β is 2;
FIG. 9 is a time domain waveform of the right channel environmental component extracted by the new source environmental component extraction method according to the present invention when β is 2;
FIG. 10 is a time domain waveform of the left channel source component extracted by the new source environment component extraction method according to the present invention when β is 1;
FIG. 11 is a time domain waveform of the left channel environmental component extracted by the new source environmental component extraction method according to the present invention when β is 1;
fig. 12 is a time domain waveform of the right channel environmental component extracted by the new source environmental component extraction method according to the present invention when the value of β is 1.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the embodiments and the accompanying drawings.
The invention provides a new method for extracting source environment components, which ensures that a left channel source component is vertical to the environment components by defining an amplitude ratio beta between two channel environment components and adjusting the value of the beta, but does not limit the right channel source component to be vertical to the environment components and only requires no correlation between the environment components. In this way, the source environment component extraction can be derived with a unique solution. The amplitude ratio beta between the environmental components is adjusted, and the source environmental component extraction method can adapt to various different sound scenes.
Referring to fig. 3, the following operations are performed on the frequency points corresponding to the left and right channels at any time:
(1) With left channel frequency domain signal X L Drawing a circle for the diameter;
(2) At any point on the circle, the mutually vertical P can be separated L And A L ;
(3) Will P L Elongation to kP L Structure P R ;
(4) According to X R =P R +A R Structure A of R =X R -P R ;
(5) And is further represented by A L The modulus of (A) is divided by the amplitude coefficient beta as radius, and the origin is taken as the center of a circle if A R Falling on a circle, the group P L And A L 、P R And A R Is the solution of the source environment component extraction; otherwise, returning to the step (2) and re-selecting a point.
In FIG. 3, θ is P L And X L Angle between them, theta 0 Is X L Phase angle of (a) 1 Is X R Phase angle of (P) R =kP L ,P L And A L Perpendicular to each other, X R =P R +A R ,X L =P L +A L 。
Because of P L 、A L Is X L For a point on a circle of a diameter, the knowledge that the vector and the circumference angle subtended by the diameter are right angles shows that the amplitudes of the left channel source component and the environment component are:
|P L |=|X L |cosθ (1)
|A L |=|X L ||sinθ| (2)
based on the signal model it can be further deduced that:
|P R |=k|P L |=k|X L |cosθ (3)
because of X R =P R +A R Therefore, the following are:
A R =X R -P R (4)
squaring both sides of equation (4) to obtain the square of the right channel source component amplitude, i.e.
|A R | 2 =|X R -P R | 2 =|X R | 2 -2X R ·P R +|P R | 2 (5)
From FIG. 3, X is shown R And P R The included angle between is theta 0 +θ-θ 1 Therefore, the following are:
X R ·P R =|X R ||P R |cos(θ 0 +θ-θ 1 ) (6)
substituting the formulas (3) and (6) into the formula (5)
|A R | 2 =|X R | 2 -2k|X R ||X L |cosθcos(θ 0 +θ-θ 1 )+k 2 |X L | 2 cos 2 θ (7)
From the definition of β:
the one-to-one correspondence between beta and the angle theta can be obtained by taking the square of both sides of the formula (8) and substituting the square of the formula (2) and the formula (7), namely
Wherein the input signal X L And X R Is calculable, k is also predictable, so when the value of β is determined, the value of θ can be determined according to equation (9), whereby P L Then, P can be further determined R 、A L And A R Thereby extracting the source and ambient components of the two channels.
Conversely, if the value of θ is given, the value of β can be obtained from the expression (9), and P can be determined L And further determine P R 、A L And A R . The steps of extracting the source component and the environment component by using the algorithm are briefly described as follows:
(1) Inputting a time domain signal x L 、x R Conversion to frequency domain to obtain X L 、X R ;
(2) Finding X L 、X R Phase angle of (i.e. theta) 0 And theta 1 ;
(3) From the given beta value, the value of theta is obtained by the expression (9), and further P is obtained L ;
(4) According to P R =kP L Finding P R ;
(5) In the determination of P L 、P R Then according to X L =P L +A L 、X R =P R +A R Separately determine A L 、A R 。
In specific implementation, the value of beta can be adjusted by a user, and the P is calculated in real time by the extraction processing end of the source component environment component based on the currently adjusted beta value of the user L 、P R 、A L And A R For use in up-mixing processing. Through analysis, it can be found that when β is adjusted so that the right channel source component is perpendicular to the environment component, i.e. k, the extraction scheme of the present invention degenerates to a principal component analysis algorithm, so that the principal component analysis algorithm is a special case of the new method provided by the present invention.
Examples
First, stereo sound is produced as follows:
(1) Taking a single-channel voice audio signal as a source component of a left channel, multiplying the single-channel voice audio signal by a source sound phase factor k =2 to be used as a source component of a right channel, taking a left-channel audio signal with a sound wave as a left-channel environment component, and carrying out Hilbert transform on the environment component of the left channel to obtain an environment component of the right channel.
(2) And respectively calculating the power of the source component and the power of the environment component, and processing the source components of the two sound channels to enable the ratio of the total source component power to the total power to be 0.8.
(3) And respectively mixing the source component and the environment component of the left channel and the right channel to obtain output signals of the left channel and the right channel, namely obtaining a stereo audio signal to be used for extracting a source environment by the new method.
The output signals of the left and right channels are then framed, each frame containing 4096 sound samples with a 50% overlap between adjacent frames, i.e. 2048 coincident samples between adjacent frames. The purpose of such partitioning is to have the effect of a smooth transition from frame to frame.
And then, each frame after the left and right sound channels are divided is subjected to Hanning window filtering processing, so that frequency spectrum leakage possibly caused in the subsequent time domain-frequency domain conversion process can be effectively reduced.
Then, 4096-point Fast Fourier Transform (FFT) is performed on each frame of audio signal after the windowing filtering processing, so as to obtain the frequency spectrums of the left and right channel output signals.
The new source environment component extraction method provided by the invention is adopted to extract and process the source environment components based on the frequency domain components of the left and right sound channel output signals, and the method specifically comprises the following processing steps:
traversing all the frames, and traversing all the frequency points for each frame;
calculating the phase angle theta of corresponding frequency point pairs in corresponding frames of the left and right channels 0 、θ 1 And amplitude | X of the output signal L |、|X R |;
Substituting a given value of beta intoDuring programming, the corresponding theta value is calculated, and because the expression of beta relative to theta is known and is converted into the expression of theta relative to beta, the theta is excessively complicated, so that the theta is in the value range of the thetaInner discretizationThen take outThe minimum theta is used as a final solving result;
after theta is determined, determining the left channel source component of the corresponding frequency point according to the graph 3, and further determining the left channel source component of the corresponding frequency point according to A L And P L Perpendicular, and A L Is X L Determining a left channel environment component of a corresponding frequency point for a point on a circle of the diameter;
obtaining the right channel source component according to the relation of the left channel source component and the right channel source component, and further obtaining the right channel source component according to X R =P R +A R Determining a right channel ambient component;
and finally, the obtained frequency domain signals of all frames of the source components and the environment components of the left and right channels are inversely transformed into time domain signals, and are connected, and the extracted components are utilized according to actual requirements.
In this embodiment, the source component, the environment component, and the environment component of the left channel are extracted, written into a file, played, compared with the original speech, and if the result does not meet the requirement, the value of β is changed, and the extraction process is performed again to obtain a new decomposition result until the requirement is met. The decomposition results for different values of β are shown in fig. 7 to 12.
As can be seen from the decomposition results shown in fig. 4 to 12, in this embodiment, the time domain waveforms of the left channel source component, the ambience component, and the ambience component of the right channel extracted by the new source ambience component extraction method of the present invention are compared with the time domain waveforms of the originally recorded left channel source component, ambience component, and output signal (fig. 4 to 6), and it can be seen that the extracted signal waveforms are substantially consistent with the original signal waveforms, that is, the new source ambience component extraction method of the present invention is effective. The invention can effectively solve the technical problem of correlation between the left channel environmental component and the right channel environmental component obtained by decomposing the principal component analysis algorithm based on the extraction mode of extracting the source component and the environmental component based on the amplitude ratio beta between the left environmental component and the right environmental component. In addition, by comparing the source components obtained by the method with the source components obtained by the existing Sparsity-constrained environment Phase angle Estimation (APES) algorithm, the source components obtained by the extraction method of the invention contain fewer environment components. In conclusion, the new method for extracting the source environment component provided by the invention has superior performance.
Where mentioned above are merely embodiments of the invention, any feature disclosed in this specification may, unless stated otherwise, be replaced by alternative features serving equivalent or similar purposes; all of the disclosed features, or all of the method or process steps, may be combined in any combination, except mutually exclusive features and/or steps.
Claims (3)
1. A source component ambient component extraction method for stereo, comprising the steps of:
step S1: inputting an adjustable parameter β, wherein the parameter β represents a magnitude ratio between left and right channel ambient components;
step S2: with left channel frequency domain signal X L Draw a circle for the diameter, marked as circle O 1 ;
And step S3: obtaining a channel source component and an environment component:
at the circle O 1 Taking a point Q at last to obtain a vectorAnd the vector perpendicular theretoWherein point O represents the origin and point D is on circle O 1 The above step (1);
will vectorAs a left channel source component P L Vector ofAs a left channel ambient component A L ;
Will be the component P L Elongation to kP L Constructing the right source component P R Wherein k represents a source dephasing factor;
based on the right channel source component P R And right channel frequency domain signal X R Constructing a right channel ambient component A R =X R -P R ;
And step S4: to be provided withThe radius, the origin O is the center of the circle, the circle is marked as the circle O 2 ;
Determining a right channel ambient component A R Whether or not it falls on the circle O 2 If yes, the currently obtained component A is added L 、A R 、P L And P R As a result of the extraction; otherwise, continue to step S3.
2. A source component ambient component extraction method for stereo, comprising the steps of:
inputting an adjustable parameter β, wherein the parameter β represents a magnitude ratio between left and right channel ambient components;
calculating the frequency domain signal X of the left channel L Phase angle theta of 0 Frequency domain signal X of right channel R Phase angle theta of 1 ;
According to the formulaDetermining a left channel source component P L And X L The included angle theta; further obtain a left channel source component P L ;
According to P R =kP L Obtaining a right channel source component P R Wherein k represents a source dephasing factor;
then according to X L =P L +A L 、X R =P R +A R Obtaining a left and a right channel environment component A L 、A R 。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811094388.0A CN109036456B (en) | 2018-09-19 | 2018-09-19 | Method for extracting source component environment component for stereo |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811094388.0A CN109036456B (en) | 2018-09-19 | 2018-09-19 | Method for extracting source component environment component for stereo |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109036456A CN109036456A (en) | 2018-12-18 |
CN109036456B true CN109036456B (en) | 2022-10-14 |
Family
ID=64617286
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811094388.0A Expired - Fee Related CN109036456B (en) | 2018-09-19 | 2018-09-19 | Method for extracting source component environment component for stereo |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109036456B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113518299B (en) * | 2021-04-30 | 2022-06-03 | 电子科技大学 | Improved method, equipment and computer readable storage medium for extracting source component and environment component |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101009952A (en) * | 2005-12-19 | 2007-08-01 | 三星电子株式会社 | Method and apparatus to provide active audio matrix decoding based on the positions of speakers and a listener |
CN101604524A (en) * | 2008-06-11 | 2009-12-16 | 北京天籁传音数字技术有限公司 | Stereo encoding method and device thereof, stereo decoding method and device thereof |
CN101836249A (en) * | 2007-09-06 | 2010-09-15 | Lg电子株式会社 | The method and apparatus of decoded audio signal |
CN101889307A (en) * | 2007-10-04 | 2010-11-17 | 创新科技有限公司 | Phase-amplitude 3-D stereo encoder and demoder |
CN106233755A (en) * | 2014-03-21 | 2016-12-14 | 杜比国际公司 | For the method that high-order Ambisonics (HOA) signal is compressed, for the method that compressed HOA signal is decompressed, for the device that HOA signal is compressed and for the device that compressed HOA signal is decompressed |
CN107146630A (en) * | 2017-04-27 | 2017-09-08 | 同济大学 | A kind of binary channels language separation method based on STFT |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8379868B2 (en) * | 2006-05-17 | 2013-02-19 | Creative Technology Ltd | Spatial audio coding based on universal spatial cues |
US9014377B2 (en) * | 2006-05-17 | 2015-04-21 | Creative Technology Ltd | Multichannel surround format conversion and generalized upmix |
EP3297298B1 (en) * | 2016-09-19 | 2020-05-06 | A-Volute | Method for reproducing spatially distributed sounds |
-
2018
- 2018-09-19 CN CN201811094388.0A patent/CN109036456B/en not_active Expired - Fee Related
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101009952A (en) * | 2005-12-19 | 2007-08-01 | 三星电子株式会社 | Method and apparatus to provide active audio matrix decoding based on the positions of speakers and a listener |
CN101836249A (en) * | 2007-09-06 | 2010-09-15 | Lg电子株式会社 | The method and apparatus of decoded audio signal |
CN101889307A (en) * | 2007-10-04 | 2010-11-17 | 创新科技有限公司 | Phase-amplitude 3-D stereo encoder and demoder |
CN101604524A (en) * | 2008-06-11 | 2009-12-16 | 北京天籁传音数字技术有限公司 | Stereo encoding method and device thereof, stereo decoding method and device thereof |
CN106233755A (en) * | 2014-03-21 | 2016-12-14 | 杜比国际公司 | For the method that high-order Ambisonics (HOA) signal is compressed, for the method that compressed HOA signal is decompressed, for the device that HOA signal is compressed and for the device that compressed HOA signal is decompressed |
CN107146630A (en) * | 2017-04-27 | 2017-09-08 | 同济大学 | A kind of binary channels language separation method based on STFT |
Non-Patent Citations (2)
Title |
---|
Blind depth estimation based on primary-to-ambient energy ratio for 3-D acoustic depth rendering;Se-Woon Jeon;《proceedings of the 2012 Aisa Pacific Signal and Information Processing Association Annual Summit and Conference》;20121231;全文 * |
强噪声干扰源下的多分量LFM信号检测方法;万建等;《哈尔滨工业大学学报》;20111130(第11期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN109036456A (en) | 2018-12-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10674262B2 (en) | Merging audio signals with spatial metadata | |
TWI770059B (en) | Method for reproducing spatially distributed sounds | |
EP3320692B1 (en) | Spatial audio processing apparatus | |
CN109068263B (en) | Binaural rendering of headphones using metadata processing | |
US7567845B1 (en) | Ambience generation for stereo signals | |
TWI797417B (en) | Method and apparatus for rendering ambisonics format audio signal to 2d loudspeaker setup and computer readable storage medium | |
CN112205006B (en) | Adaptive remixing of audio content | |
US20130315402A1 (en) | Three-dimensional sound compression and over-the-air transmission during a call | |
EP1761110A1 (en) | Method to generate multi-channel audio signals from stereo signals | |
CN113170271B (en) | Method and apparatus for processing stereo signals | |
CN107005778A (en) | The audio signal processing apparatus and method rendered for ears | |
CN106797526B (en) | Apparatus for processing audio, method and computer readable recording medium | |
CN109036456B (en) | Method for extracting source component environment component for stereo | |
US20240056760A1 (en) | Binaural signal post-processing | |
WO2020057050A1 (en) | Method for extracting direct sound and background sound, and loudspeaker system and sound reproduction method therefor | |
KR100849030B1 (en) | 3D sound Reproduction Apparatus using Virtual Speaker Technique under Plural Channel Speaker Environments | |
KR100802339B1 (en) | 3D sound Reproduction Apparatus and Method using Virtual Speaker Technique under Stereo Speaker Environments | |
WO2024216494A1 (en) | Method for multichannel audio reconstruction and speaker system using the method | |
CN116615919A (en) | Post-processing of binaural signals | |
US20210136507A1 (en) | Detection of audio panning and synthesis of 3D audio from limited-channel surround sound | |
Kan et al. | Psychoacoustic evaluation of different methods for creating individualized, headphone-presented virtual auditory space from B-format room impulse responses |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20221014 |
|
CF01 | Termination of patent right due to non-payment of annual fee |