CN105120421B - Method and apparatus for generating virtual surround sound
- Publication number: CN105120421B (application CN201510519948.2A)
- Authority: CN (China)
- Prior art keywords: audio signal, adjustment parameter, signal, audio, surround sound
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abstract
The invention discloses a method and apparatus for generating virtual surround sound, belonging to the field of signal processing. The method includes: obtaining a first audio signal of an audio file and the rotation angle of the user's head; generating a rotation matrix according to the rotation angle; obtaining an adjustment parameter of the first audio signal according to the first audio signal; adjusting the first audio signal according to the adjustment parameter to obtain a second audio signal; and generating virtual surround sound according to the second audio signal and the rotation matrix. The apparatus includes a first acquisition module, a first generation module, a second acquisition module, an adjustment module and a second generation module. The invention rotates the virtual surround sound according to the rotation angle of the user's head, which improves the realism of the virtual surround sound.
Description
Technical field
The present invention relates to the field of signal processing, and in particular to a method and apparatus for generating virtual surround sound.
Background
At present, when a user listens to music on a terminal such as a mobile phone or computer and wants to experience the surround sound of a live concert, the terminal has to be connected to multiple loudspeakers that play the music. For reasons of price and space, however, most users do not have enough loudspeakers, so the terminal needs to generate virtual surround sound to give the user the impression of being at the concert.
The prior art provides a method for generating virtual surround sound, which may be as follows: the terminal obtains the B-format signal included in an audio file, converts the B-format signal into a virtual speaker array signal, and filters the virtual speaker array signal through an HRTF (Head Related Transfer Function) filter to obtain virtual surround sound.
In the course of realizing the present invention, the inventor found that the prior art has at least the following problem:
When a user wearing headphones listens to virtual surround sound and turns their head, the virtual surround sound in the headphones rotates along with the head. This differs from what a person experiences when listening to music at a live venue; that is, the generated virtual surround sound is not realistic enough.
Summary of the invention
In order to solve the problem of the prior art, the present invention provides a method and apparatus for generating virtual surround sound. The technical solution is as follows:
A method for generating virtual surround sound, the method comprising:
obtaining a first audio signal of an audio file and a rotation angle of the user's head;
generating a rotation matrix according to the rotation angle;
obtaining an adjustment parameter of the first audio signal according to the first audio signal;
adjusting the first audio signal according to the adjustment parameter to obtain a second audio signal;
generating virtual surround sound according to the second audio signal and the rotation matrix.
Further, generating virtual surround sound according to the second audio signal and the rotation matrix comprises:
rotating the second audio signal according to the rotation matrix to obtain a third audio signal;
converting the third audio signal into a virtual speaker array signal;
filtering the virtual speaker array signal through a head related transfer function filter to obtain virtual surround sound.
Further, obtaining the adjustment parameter of the first audio signal according to the first audio signal comprises:
obtaining a recording scene of the first audio signal according to the first audio signal, and obtaining the adjustment parameter of the first audio signal, according to the recording scene, from a correspondence between recording scenes and adjustment parameters; or
obtaining the adjustment parameter of the first audio signal, according to the first audio signal, from a correspondence between audio signals and adjustment parameters.
Further, obtaining the recording scene of the first audio signal according to the first audio signal comprises:
analyzing the first audio signal to obtain the content of the first audio signal, and determining the recording scene of the first audio signal according to the content; or
obtaining the recording scene of the first audio signal, according to the first audio signal, from a correspondence between audio signals and recording scenes.
Further, the adjustment parameter includes an upmix order and a topology of virtual speakers;
adjusting the first audio signal according to the adjustment parameter to obtain the second audio signal comprises:
upmixing the first audio signal according to the upmix order to obtain a fourth audio signal;
performing surround processing on the fourth audio signal according to the topology of the virtual speakers to obtain the second audio signal.
An apparatus for generating virtual surround sound, the apparatus comprising:
a first acquisition module, configured to obtain a first audio signal of an audio file and a rotation angle of the user's head;
a first generation module, configured to generate a rotation matrix according to the rotation angle;
a second acquisition module, configured to obtain an adjustment parameter of the first audio signal according to the first audio signal;
an adjustment module, configured to adjust the first audio signal according to the adjustment parameter to obtain a second audio signal;
a second generation module, configured to generate virtual surround sound according to the second audio signal and the rotation matrix.
Further, the second generation module comprises:
a rotation unit, configured to rotate the second audio signal according to the rotation matrix to obtain a third audio signal;
a conversion unit, configured to convert the third audio signal into a virtual speaker array signal;
a filtering unit, configured to filter the virtual speaker array signal through a head related transfer function filter to obtain virtual surround sound.
Further, the second acquisition module comprises:
a first acquisition unit, configured to obtain the recording scene of the first audio signal according to the first audio signal;
a second acquisition unit, configured to obtain the adjustment parameter of the first audio signal, according to the recording scene, from a correspondence between recording scenes and adjustment parameters;
or, the second acquisition module comprises:
a third acquisition unit, configured to obtain the adjustment parameter of the first audio signal, according to the first audio signal, from a correspondence between audio signals and adjustment parameters.
Further, the first acquisition unit comprises:
an analysis subunit, configured to analyze the first audio signal to obtain the content of the first audio signal;
a determination subunit, configured to determine the recording scene of the first audio signal according to the content;
or, the first acquisition unit comprises:
an acquisition subunit, configured to obtain the recording scene of the first audio signal, according to the first audio signal, from a correspondence between audio signals and recording scenes.
Further, the adjustment parameter includes an upmix order and a topology of virtual speakers;
the adjustment module comprises:
a first processing unit, configured to upmix the first audio signal according to the upmix order to obtain a fourth audio signal;
a second processing unit, configured to perform surround processing on the fourth audio signal according to the topology of the virtual speakers to obtain the second audio signal.
In the embodiments of the present invention, the rotation angle of the user's head is obtained by a head tracker; a rotation matrix is generated according to the rotation angle; the adjustment parameter of the first audio signal is obtained according to the first audio signal; the first audio signal is adjusted according to the adjustment parameter to obtain the second audio signal; and virtual surround sound is generated according to the second audio signal and the rotation matrix, which improves the realism of the virtual surround sound.
Brief description of the drawings
Fig. 1 is a flowchart of a method for generating virtual surround sound provided by Embodiment 1 of the present invention;
Fig. 2-1 is a flowchart of a method for generating virtual surround sound provided by Embodiment 2 of the present invention;
Fig. 2-2 is a schematic diagram of a virtual speaker topology provided by Embodiment 2 of the present invention;
Fig. 2-3 is a schematic diagram of another virtual speaker topology provided by Embodiment 2 of the present invention;
Fig. 3 is a schematic structural diagram of an apparatus for generating virtual surround sound provided by Embodiment 3 of the present invention.
Detailed description of the embodiments
To make the objectives, technical solutions and advantages of the present invention clearer, the embodiments of the present invention are described in further detail below with reference to the accompanying drawings.
Embodiment 1
This embodiment of the present invention provides a method for generating virtual surround sound. The method may be executed by a terminal. Referring to Fig. 1, the method includes:
Step 101: Obtain a first audio signal of an audio file and the rotation angle of the user's head;
Step 102: Generate a rotation matrix according to the rotation angle;
Step 103: Obtain the adjustment parameter of the first audio signal according to the first audio signal;
Step 104: Adjust the first audio signal according to the adjustment parameter to obtain a second audio signal;
Step 105: Generate virtual surround sound according to the second audio signal and the rotation matrix.
Further, generating virtual surround sound according to the second audio signal and the rotation matrix includes:
rotating the second audio signal according to the rotation matrix to obtain a third audio signal;
converting the third audio signal into a virtual speaker array signal;
filtering the virtual speaker array signal through a head related transfer function filter to obtain virtual surround sound.
Further, obtaining the adjustment parameter of the first audio signal according to the first audio signal includes:
obtaining the recording scene of the first audio signal according to the first audio signal, and obtaining the adjustment parameter of the first audio signal, according to the recording scene, from a correspondence between recording scenes and adjustment parameters; or
obtaining the adjustment parameter of the first audio signal, according to the first audio signal, from a correspondence between audio signals and adjustment parameters.
Further, obtaining the recording scene of the first audio signal according to the first audio signal includes:
analyzing the first audio signal to obtain the content of the first audio signal, and determining the recording scene of the first audio signal according to the content; or
obtaining the recording scene of the first audio signal, according to the first audio signal, from a correspondence between audio signals and recording scenes.
Further, the adjustment parameter includes an upmix order and a topology of virtual speakers;
adjusting the first audio signal according to the adjustment parameter to obtain the second audio signal includes:
upmixing the first audio signal according to the upmix order to obtain a fourth audio signal;
performing surround processing on the fourth audio signal according to the topology of the virtual speakers to obtain the second audio signal.
In this embodiment of the present invention, the rotation angle of the user's head is obtained by a head tracker; a rotation matrix is generated according to the rotation angle; the adjustment parameter of the first audio signal is obtained according to the first audio signal; the first audio signal is adjusted according to the adjustment parameter to obtain the second audio signal; and virtual surround sound is generated according to the second audio signal and the rotation matrix, which improves the realism of the virtual surround sound.
Embodiment 2
This embodiment of the present invention provides a method for generating virtual surround sound. The method may be executed by a terminal. Referring to Fig. 2-1, the method includes:
Step 201: Obtain a first audio signal of an audio file and the rotation angle of the user's head.
When the user plays, through headphones, an audio file stored in the cloud or on a server, the terminal obtains the first audio signal of the audio file and the rotation angle of the user's head.
The terminal may obtain the rotation angle of the user's head as follows:
A head tracker is provided on the headphones, or the user wears a device equipped with a head tracker, such as a virtual reality display device. The head tracker detects in real time whether the user's head is rotating; if it is, the head tracker obtains the rotation angle of the user's head and sends it to the terminal, and the terminal receives the rotation angle sent by the head tracker.
The first audio signal may be a first-order B-format signal. The B-format signal may be a three-channel signal or a four-channel signal: a three-channel B-format signal includes W, X and Y, and a four-channel B-format signal includes W, X, Y and Z. The terminal may be a mobile phone, a tablet computer, a PC (personal computer) terminal, or the like.
The W channel signal represents the omnidirectional sound wave, and the X, Y and Z channel signals represent sound waves along three mutually perpendicular directions: relative to the listener, the X channel is oriented horizontally from back to front, the Y channel horizontally from right to left, and the Z channel vertically upward.
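The channel layout above can be illustrated with a minimal sketch that encodes one mono sample into first-order B-format. The function name `encode_b_format`, the angle convention (azimuth measured counterclockwise from the front) and the 1/sqrt(2) gain on W are assumptions taken from the traditional B-format convention; the patent text does not specify them.

```python
import math

def encode_b_format(sample, azimuth_deg, elevation_deg=0.0):
    """Encode one mono sample into first-order B-format (W, X, Y, Z).

    Assumed convention (not stated in the patent): X points to the front,
    Y to the left, Z up, and W carries the traditional 1/sqrt(2) gain.
    """
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    w = sample / math.sqrt(2.0)                 # omnidirectional component
    x = sample * math.cos(az) * math.cos(el)    # back-to-front component
    y = sample * math.sin(az) * math.cos(el)    # right-to-left component
    z = sample * math.sin(el)                   # bottom-to-top component
    return w, x, y, z

front = encode_b_format(1.0, 0.0)    # source straight ahead
left = encode_b_format(1.0, 90.0)    # source directly to the left
```

A source straight ahead contributes only to W and X, and a source directly to the left contributes only to W and Y, which matches the channel orientations described above.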
Step 202: Generate a rotation matrix according to the rotation angle.
The rotation matrix is used to rotate the virtual surround sound so that, when the user's head rotates, the virtual surround sound does not rotate with it, reproducing the effect of listening to music in real life.
For example, suppose the virtual surround sound comes from straight ahead. When the user's head turns 30 degrees to the left, the virtual surround sound is rotated 30 degrees to the right relative to the rotated head position, so that it still comes from its original direction.
If the B-format signal is a three-channel signal, the rotation matrix is [equation not available]; if the B-format signal is a four-channel signal, the rotation matrix is [equation not available], where θ is the rotation angle.
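Since the matrices themselves are not reproduced in this text, the following is only a sketch of a conventional yaw rotation for first-order B-format, not the patent's own matrix. It assumes channel order W, X, Y[, Z], X pointing to the front, Y to the left, and a positive angle for a head turn to the left; the names `head_rotation_matrix` and `apply_matrix` are hypothetical.

```python
import math

def head_rotation_matrix(theta_deg, channels=3):
    """Return a matrix that counter-rotates a B-format frame by a head yaw.

    Assumptions: channel order W, X, Y[, Z]; X front, Y left; a positive
    angle means the head turned to the left. W and Z are left unchanged.
    """
    if channels not in (3, 4):
        raise ValueError("channels must be 3 or 4")
    t = math.radians(theta_deg)
    c, s = math.cos(t), math.sin(t)
    m = [[1.0, 0.0, 0.0],
         [0.0, c, s],
         [0.0, -s, c]]
    if channels == 4:
        m = [row + [0.0] for row in m] + [[0.0, 0.0, 0.0, 1.0]]
    return m

def apply_matrix(matrix, frame):
    """Multiply a matrix by one frame (a list of channel samples)."""
    return [sum(a * b for a, b in zip(row, frame)) for row in matrix]

# A source straight ahead, then the head turns 30 degrees to the left.
front_source = [1.0 / math.sqrt(2.0), 1.0, 0.0]
rotated = apply_matrix(head_rotation_matrix(30.0), front_source)
perceived_azimuth = math.degrees(math.atan2(rotated[2], rotated[1]))
```

Under these assumptions `perceived_azimuth` is -30 degrees, that is, 30 degrees to the right of the rotated head, which agrees with the worked example in the text.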
Step 203: Obtain the adjustment parameter of the first audio signal according to the first audio signal.
The adjustment parameter includes an upmix order and the topology of the virtual speakers; the topology includes the number of virtual speakers, the position of each virtual speaker, and so on.
This step can be implemented in either of two ways. In the first way, it is implemented by the following steps (1) and (2):
(1): Obtain the recording scene of the first audio signal according to the first audio signal.
The recording scene may be a concert scene, a business meeting scene, a natural environment scene, or the like.
This step can be implemented by the following steps (1-1) and (1-2):
(1-1): Analyze the first audio signal to obtain the content of the first audio signal.
The content of the first audio signal includes at least the proportion of directional signal, and may further include attribute information and/or the direction of the main sound source. The attribute information includes the objects contained in the first audio signal, the instrument category, the sound category, and so on.
The proportion of directional signal in the first audio signal may be obtained as follows:
The first audio signal is analyzed with a direct-ambience signal decomposition algorithm to obtain the proportion of directional signal in the first audio signal; the proportion of non-directional signal can be obtained as well.
For example, if the first audio signal contains only a person speaking, it sounds strongly directional, and the direct-ambience signal decomposition yields a large proportion of directional signal. If instead the first audio signal contains noise or a large amount of reverberation, it sounds weakly directional, and the decomposition yields a small proportion of directional signal.
The proportion of directional signal in the first audio signal may also be obtained through the following steps (A) to (C):
(A): Build a covariance matrix for the first audio signal.
Covariance matrix: $\mathrm{cov}(\omega_i, n) = \alpha\,\mathrm{cov}(\omega_i, n-1) + (1-\alpha)\,S(\omega_i, n)\,S^H(\omega_i, n)$.
If the first audio signal includes W, X and Y, then $S(\omega_i, n) = [W(\omega_i, n)\; X(\omega_i, n)\; Y(\omega_i, n)]^T$; if the first audio signal includes W, X, Y and Z, then $S(\omega_i, n) = [W(\omega_i, n)\; X(\omega_i, n)\; Y(\omega_i, n)\; Z(\omega_i, n)]^T$.
Here $\omega_i$ is a frequency of the first audio signal, $n$ is the frame index on the time axis, and $[\cdot]^H$ denotes the conjugate transpose of a vector. $\alpha$ is a smoothing factor that can be set in advance or adjusted dynamically according to the characteristics of the first audio signal; for example, $\alpha$ can be 0.92. $\omega_i$ ranges over all frequencies of interest and can be configured and changed as needed; for example, $\omega_i$ may cover 100 to 16000 Hz.
In addition, in this embodiment of the present invention a weight can be set for each $\omega_i$. When the covariance matrix is built, the covariance matrix of the first audio signal can then be calculated according to the weight of each $\omega_i$; in that case the covariance matrix is [equation not available], where $\rho$ is the weight of $\omega_i$.
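The recursive update in step (A) can be sketched as follows. The per-frequency recursion follows the formula given in the text; the function `combine_bins`, which forms a weighted sum over frequency bins, is an assumed form of the combined matrix, because the corresponding equation is not reproduced here. All function names are hypothetical.

```python
def outer_conj(s):
    """Return S * S^H for a complex column vector given as a list."""
    return [[a * b.conjugate() for b in s] for a in s]

def smooth_covariance(prev_cov, s, alpha=0.92):
    """cov(w_i, n) = alpha * cov(w_i, n-1) + (1 - alpha) * S * S^H."""
    inst = outer_conj(s)
    size = len(s)
    return [[alpha * prev_cov[r][c] + (1.0 - alpha) * inst[r][c]
             for c in range(size)] for r in range(size)]

def combine_bins(covs, weights):
    """Weighted sum over frequency bins (assumed form, see lead-in)."""
    size = len(covs[0])
    return [[sum(w * cov[r][c] for w, cov in zip(weights, covs))
             for c in range(size)] for r in range(size)]

# One frame of a three-channel signal (W, X, Y) at a single frequency bin.
zero = [[0.0] * 3 for _ in range(3)]
frame = [1 + 0j, 1j, 0j]
cov_bin = smooth_covariance(zero, frame)
cov_all = combine_bins([cov_bin, cov_bin], [0.5, 0.5])
```

With the smoothing factor 0.92 from the text, a new frame contributes 8 percent of its instantaneous outer product to the running estimate, so the estimate follows slow changes in the sound field while averaging out frame-to-frame fluctuation.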
(B): Perform eigenanalysis on the covariance matrix to obtain the eigenvalues.
Eigenanalysis of the covariance matrix with the Matlab function eigs gives [V, Λ] = eigs(cov(n)).
Here V is a 3×3 or 4×4 matrix whose columns are the eigenvectors of cov(n), and Λ contains the corresponding eigenvalues in descending order.
(C): Calculate the proportion of directional signal from the eigenvalues.
The largest eigenvalue is selected as the first eigenvalue, and the largest of the remaining eigenvalues is selected as the second eigenvalue. The proportion of directional signal is then calculated from the first eigenvalue and the second eigenvalue by the following Formula (1).
[Formula (1): equation not available]
Here DRR is the proportion of directional signal; $\lambda_1$ is the first eigenvalue and corresponds to the energy of the direct sound; $\lambda_2$ is the second eigenvalue and corresponds to reflections, echoes, ambient sound and the like. DRR takes values in [0, 1]. A smaller DRR means a lower proportion of direct sound and a weaker sound field directivity, that is, a lower proportion of directional signal; a larger DRR means a higher proportion of direct sound and a stronger sound field directivity, that is, a higher proportion of directional signal.
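Steps (B) and (C) can be sketched without Matlab. The code below finds the two largest eigenvalues of a Hermitian, positive semidefinite matrix by power iteration with deflation, which is a stand-in for `eigs`. Because Formula (1) is not reproduced in this text, `directional_ratio` uses a hypothetical expression, (λ1 − λ2) / λ1, chosen only because it lies in [0, 1] and grows with the dominance of the first eigenvalue; it should not be read as the patent's formula.

```python
import math

def dominant_eigenpair(matrix, iterations=500):
    """Power iteration for a Hermitian, positive semidefinite matrix.

    The fixed start vector is a simplification: it fails if it happens to
    be orthogonal to the dominant eigenvector.
    """
    size = len(matrix)
    vec = [1.0 / math.sqrt(size)] * size
    for _ in range(iterations):
        nxt = [sum(matrix[r][c] * vec[c] for c in range(size))
               for r in range(size)]
        norm = math.sqrt(sum(abs(v) ** 2 for v in nxt))
        if norm == 0.0:
            return 0.0, vec
        vec = [v / norm for v in nxt]
    # Rayleigh quotient v^H A v gives the eigenvalue estimate.
    value = sum(vec[r].conjugate() * matrix[r][c] * vec[c]
                for r in range(size) for c in range(size)).real
    return value, vec

def two_largest_eigenvalues(matrix):
    """Return (lambda1, lambda2) using deflation after the first pair."""
    lam1, v1 = dominant_eigenpair(matrix)
    size = len(matrix)
    deflated = [[matrix[r][c] - lam1 * v1[r] * v1[c].conjugate()
                 for c in range(size)] for r in range(size)]
    lam2, _ = dominant_eigenpair(deflated)
    return lam1, lam2

def directional_ratio(lam1, lam2):
    """Hypothetical stand-in for Formula (1), NOT the patent's formula."""
    if lam1 <= 0.0:
        return 0.0
    return (lam1 - lam2) / lam1

cov = [[4.0, 0.0, 0.0],
       [0.0, 1.0, 0.0],
       [0.0, 0.0, 0.5]]
lam1, lam2 = two_largest_eigenvalues(cov)
drr = directional_ratio(lam1, lam2)
```

For the diagonal example the two largest eigenvalues are 4 and 1, so the assumed ratio evaluates to 0.75, a strongly directional sound field under this stand-in definition.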
Further, the first audio signal is analyzed with a direction-of-arrival algorithm to obtain the direction of the main sound source of the first audio signal.
Further, the first audio signal is analyzed with an instrument classification algorithm to obtain the instrument category of the first audio signal, and with a speech/music classification algorithm to obtain the sound category of the first audio signal.
Further, the objects contained in the first audio signal are extracted with an object extraction algorithm.
For example, if the first audio signal is a segment of speech, the object extracted by the object extraction algorithm is speech; if the first audio signal is a segment of thunder, the extracted object is thunder; if the first audio signal is a segment of music, the extracted object is music.
Further, the server may analyze the first audio signal in advance to obtain its content and store the first audio signal and its content in a correspondence between audio signals and content. This step may then be:
Obtain the content of the first audio signal, according to the first audio signal, from the correspondence between audio signals and content stored on the server.
The correspondence between audio signals and content may be stored on the server as metadata. The server may embed the content of the first audio signal directly in the first audio signal, or store the content separately: create a content folder, store the content of the first audio signal in the content folder, and establish a correspondence between the first audio signal and the content folder.
The terminal may obtain the content of the first audio signal when it obtains the first audio signal of the audio file, or it may obtain the content in this step. Obtaining the content from the correspondence stored on the server reduces the computational load on the terminal and improves the efficiency with which the terminal generates virtual surround sound.
(1-2): Determine the recording scene of the first audio signal according to the content of the first audio signal.
The server stores a correspondence between content and recording scenes. Accordingly, this step may be:
The terminal obtains the recording scene of the first audio signal, according to the content of the first audio signal, from the correspondence between content and recording scenes stored on the server.
Alternatively, the terminal may obtain the correspondence between content and recording scenes from the server and store it locally. Accordingly, this step may be:
Obtain the recording scene of the first audio signal, according to the content of the first audio signal, from the correspondence between content and recording scenes stored on the terminal.
The correspondence between content and recording scenes may be stored on the terminal or the server as metadata. The terminal or server may embed the recording scene of the first audio signal directly in the content of the first audio signal, or store the recording scene separately: create a recording scene folder, store the recording scene of the first audio signal in it, and establish a correspondence between the content of the first audio signal and the recording scene.
Further, the terminal stores the first audio signal and its recording scene in a correspondence between audio signals and recording scenes. When the terminal plays the first audio signal again, it does not need to determine the recording scene by the above method; it obtains the recording scene directly from the correspondence between audio signals and recording scenes.
For example, under the correspondence stored on the terminal, when the proportion of directional signal is greater than 0.5, the recording scene of the first audio signal is determined to be a business meeting; when the proportion of directional signal is less than 0.5, the recording scene is determined to be a concert.
(2): Obtain the adjustment parameter of the first audio signal, according to the recording scene, from the correspondence between recording scenes and adjustment parameters.
The correspondence between recording scenes and adjustment parameters is stored on the server, so this step may be:
Obtain the adjustment parameter of the first audio signal, according to the recording scene, from the correspondence between recording scenes and adjustment parameters stored on the server.
Alternatively, the terminal may obtain the correspondence between recording scenes and adjustment parameters from the server and store it locally. Accordingly, this step may be:
Obtain the adjustment parameter of the first audio signal, according to the recording scene, from the correspondence between recording scenes and adjustment parameters stored on the terminal.
Further, the terminal stores the first audio signal and its adjustment parameter in a correspondence between audio signals and adjustment parameters. When the terminal plays the first audio signal again, it does not need to determine the recording scene by the above method and then obtain the adjustment parameter from the recording scene; it obtains the adjustment parameter directly from the correspondence between audio signals and adjustment parameters, which shortens the time needed to obtain the adjustment parameter and improves efficiency.
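The reuse described above amounts to a cache keyed by the audio signal. A minimal sketch follows; the class name `AdjustmentCache`, the string identifier `audio_id` and the injected `analyse` callable are hypothetical, and the patent does not say how an audio signal is identified.

```python
class AdjustmentCache:
    """Remember the adjustment parameter found for each audio signal."""

    def __init__(self, analyse):
        self._analyse = analyse      # slow path: scene analysis plus lookup
        self._by_audio_id = {}       # audio signal -> adjustment parameter
        self.analysis_runs = 0       # counts how often the slow path ran

    def get(self, audio_id, signal):
        """Return the stored parameter, analysing only on first playback."""
        if audio_id not in self._by_audio_id:
            self.analysis_runs += 1
            self._by_audio_id[audio_id] = self._analyse(signal)
        return self._by_audio_id[audio_id]

# Stand-in analysis that returns a fixed (upmix order, speaker count) pair.
cache = AdjustmentCache(lambda signal: (1, 4))
first = cache.get("song-001", [0.0, 0.1, 0.2])
second = cache.get("song-001", [0.0, 0.1, 0.2])
```

The second call returns the stored value without running the analysis again, which is the time saving the paragraph above describes.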
Further, in the second way, this step may be:
The correspondence between audio signals and adjustment parameters is stored on the server, and the terminal obtains the adjustment parameter of the first audio signal, according to the first audio signal, from the correspondence stored on the server.
Further, the correspondence between audio signals and adjustment parameters may be stored on the server as metadata. The server may embed the adjustment parameter of the first audio signal directly in the first audio signal, or store the adjustment parameter separately: create an adjustment parameter file, store the adjustment parameter of the first audio signal in it, and establish a correspondence between the first audio signal and the adjustment parameter file.
For example, if the proportion of directional signal in the first audio signal is greater than 0.5, the upmix order is 3 and the virtual speaker topology includes 6 virtual speakers (see Fig. 2-2); if the proportion of directional signal in the first audio signal is less than 0.5, the upmix order is 1 and the virtual speaker topology includes 4 virtual speakers (see Fig. 2-3).
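The example values above can be written as a small selection function. This is a sketch only: the names `AdjustmentParameters` and `select_adjustment` are hypothetical, speaker positions are omitted because Fig. 2-2 and Fig. 2-3 are not available here, and the behaviour at exactly 0.5 is an assumption since the text leaves that case open.

```python
from typing import NamedTuple

class AdjustmentParameters(NamedTuple):
    upmix_order: int      # order used by the upmix processing
    speaker_count: int    # number of virtual speakers in the topology

def select_adjustment(directional_ratio, threshold=0.5):
    """Pick the example parameters from the proportion of directional signal."""
    if directional_ratio > threshold:
        # Strongly directional content: higher order, six virtual speakers.
        return AdjustmentParameters(upmix_order=3, speaker_count=6)
    # Assumption: a ratio of exactly 0.5 is treated like the diffuse case.
    return AdjustmentParameters(upmix_order=1, speaker_count=4)

speech_like = select_adjustment(0.8)
reverberant = select_adjustment(0.2)
```

A fuller implementation would presumably return the position of each virtual speaker as well, since the topology is said to include both the count and the positions.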
For example, if the directional sound sources of the first audio signal are distributed in one direction, for instance the sound field content is a concert and the sound field is concentrated toward the stage, that is, in front, then the distance between the left-front and right-front speakers in the virtual speaker topology is increased.
Further, in this embodiment of the present invention, different terminal types may correspond to different adjustment parameters. This step may then be:
Obtain the terminal type of the terminal, and obtain the adjustment parameter of the first audio signal according to the terminal type and the first audio signal.
A correspondence among terminal types, audio signals and adjustment parameters is stored on the server in advance. Accordingly, the adjustment parameter of the first audio signal may be obtained according to the terminal type and the first audio signal as follows:
Obtain the adjustment parameter of the first audio signal, according to the terminal type and the first audio signal, from the correspondence among terminal types, audio signals and adjustment parameters stored on the server.
The topology of the virtual speakers has a great influence on the quality of the virtual surround sound, and that influence varies with the content of the first audio signal. For example, if most of the first audio signal comes from the front, the virtual speaker topology can be a rectangle rather than a square. The embodiments of the present invention can therefore adapt the adjustment parameter to the content of the first audio signal, which helps ensure the playback quality of the virtual surround sound. Moreover, different terminals have different computing capabilities and power consumption, so obtaining the adjustment parameter according to both the terminal type and the first audio signal can save power on the terminal.
Step 204: Adjust the first audio signal according to the adjustment parameter to obtain the second audio signal.
The adjustment parameter includes the upmix order and the topology of the virtual speakers, so this step can be implemented through the following steps (1) and (2):
(1): Upmix the first audio signal according to the upmix order to obtain a fourth audio signal.
This step is prior art and is not described in detail here.
(2): Perform surround processing on the fourth audio signal according to the topology of the virtual speakers to obtain the second audio signal.
The fourth audio signal is passed in sequence through the virtual speakers included in the topology, which performs the surround processing on the fourth audio signal and yields the second audio signal.
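For example, if the first audio signal is [formula image not reproduced], the adjusted second audio signal is [formula image not reproduced]. As another example, if the first audio signal is [formula image not reproduced], the adjusted second audio signal is [formula image not reproduced].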
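Step 205: Rotate the second audio signal according to the rotation matrix to obtain a third audio signal.
The rotation matrix is multiplied by the second audio signal to obtain the third audio signal.
For example, if the second audio signal is [formula image not reproduced] and the rotation matrix is [formula image not reproduced], the third audio signal is [formula image not reproduced]. As another example, if the second audio signal is [formula image not reproduced] and the rotation matrix is [formula image not reproduced], the third audio signal is [formula image not reproduced].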
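The multiplication in Step 205 can be sketched as follows. This is a sketch under assumptions rather than the patent's formulas, which are not reproduced in this text: the signal is assumed to have three channel rows ordered `[W, X, Y]` (one omnidirectional channel and two horizontal directional channels), the rotation is assumed to be a yaw about the vertical axis, and the sign convention of the angle is an assumption.

```python
import math
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Multiply an (m x k) matrix by a (k x n) matrix using plain lists."""
    inner = len(b)
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions do not match")
    cols = len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(len(a))
    ]


def yaw_rotation_matrix(angle_deg: float) -> Matrix:
    """Rotation matrix for the assumed [W, X, Y] channel layout.

    W is left unchanged; X and Y are rotated in the horizontal plane.
    """
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    return [
        [1.0, 0.0, 0.0],
        [0.0, c, -s],
        [0.0, s, c],
    ]


def rotate_signal(rotation: Matrix, second_signal: Matrix) -> Matrix:
    """Third audio signal = rotation matrix x second audio signal."""
    return matmul(rotation, second_signal)
```

Each column of `second_signal` is one sample instant, so a signal with T samples is a 3 x T matrix and the result has the same shape.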
Step 206: Convert the third audio signal into a virtual speaker array signal.
A virtual speaker matrix is obtained, and the virtual speaker matrix is multiplied by the third audio signal to obtain the virtual speaker array signal.
For example, if the virtual speaker matrix is [formula image not reproduced], the virtual speaker array signal is [formula image not reproduced]. As another example, if the virtual speaker matrix is [formula image not reproduced], the virtual speaker array signal is [formula image not reproduced].
Here, N is the number of virtual speakers included in the virtual speaker topology.
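The conversion in Step 206 can be sketched as an N x C matrix applied to a C x T signal. The weights below are illustrative assumptions, not the patent's virtual speaker matrix, which is not reproduced in this text: each speaker row is built from a hypothetical azimuth as `[1/N, cos(az)/N, sin(az)/N]` for an assumed `[W, X, Y]` channel layout.

```python
import math
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Multiply an (m x k) matrix by a (k x n) matrix using plain lists."""
    inner = len(b)
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions do not match")
    cols = len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(len(a))
    ]


def virtual_speaker_matrix(azimuths_deg: List[float]) -> Matrix:
    """Build an N x 3 matrix with one row per virtual speaker (illustrative weights)."""
    n = len(azimuths_deg)
    if n == 0:
        raise ValueError("at least one virtual speaker is required")
    rows = []
    for az in azimuths_deg:
        theta = math.radians(az)
        rows.append([1.0 / n, math.cos(theta) / n, math.sin(theta) / n])
    return rows


def to_speaker_array(speaker_matrix: Matrix, third_signal: Matrix) -> Matrix:
    """Virtual speaker array signal = virtual speaker matrix x third audio signal."""
    return matmul(speaker_matrix, third_signal)
```

With a square topology assumed at azimuths 45, 135, 225, and 315 degrees, a source encoded toward the front (positive X) produces larger feeds on the two front speakers than on the two rear ones.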
Step 207: Filter the virtual speaker array signal through a head-related transfer function (HRTF) filter to obtain the virtual surround sound.
The HRTF filter converts the virtual speaker array signal into a two-channel stereo signal, that is, a binaural signal. This step can therefore be:
Obtain the two-channel stereo matrix corresponding to the HRTF filter, and multiply the two-channel stereo matrix by the virtual speaker array signal to obtain the virtual surround sound.
For example, if the two-channel stereo matrix is [formula image not reproduced], the virtual surround sound is [formula image not reproduced], or the virtual surround sound is [formula image not reproduced].
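The multiplication in Step 207 can be sketched as a 2 x N matrix applied to the N x T virtual speaker array signal. This follows the matrix form described above and makes simplifying assumptions: each entry is a single scalar gain, whereas practical HRTF processing is usually frequency dependent, and all numeric values are invented for illustration.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Multiply an (m x k) matrix by a (k x n) matrix using plain lists."""
    inner = len(b)
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions do not match")
    cols = len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(len(a))
    ]


def binauralize(stereo_matrix: Matrix, speaker_signals: Matrix) -> Matrix:
    """Virtual surround sound (2 x T) = two-channel matrix (2 x N) x speaker signals (N x T)."""
    if len(stereo_matrix) != 2:
        raise ValueError("the two-channel stereo matrix must have exactly two rows")
    return matmul(stereo_matrix, speaker_signals)


# Hypothetical gains: row 0 feeds the left ear, row 1 the right ear.
EXAMPLE_STEREO_MATRIX: Matrix = [
    [0.8, 0.8, 0.2, 0.2],
    [0.2, 0.2, 0.8, 0.8],
]
```

In this example the first two virtual speakers are assumed to sit on the listener's left, so they contribute more to the left-ear row than to the right-ear row.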
In the embodiment of the present invention, the rotation angle of the user's head is obtained by a head tracker; a rotation matrix is generated according to the rotation angle; the adjustment parameter of the first audio signal is obtained according to the first audio signal; the first audio signal is adjusted according to the adjustment parameter to obtain the second audio signal; and virtual surround sound is generated according to the second audio signal and the rotation matrix. Because the virtual surround sound is rotated according to the rotation of the user's head, its realism can be improved.
Embodiment 3
An embodiment of the present invention provides a device for generating virtual surround sound. The device may be a terminal. Referring to Fig. 3, the device includes:
A first acquisition module 301, configured to obtain the first audio signal of an audio file and the rotation angle of the user's head;
A first generation module 302, configured to generate a rotation matrix according to the rotation angle;
A second acquisition module 303, configured to obtain the adjustment parameter of the first audio signal according to the first audio signal;
An adjusting module 304, configured to adjust the first audio signal according to the adjustment parameter to obtain the second audio signal;
A second generation module 305, configured to generate virtual surround sound according to the second audio signal and the rotation matrix.
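The way the five modules hand data to one another can be sketched as a small pipeline. This is only a structural illustration under assumptions: the class name `SurroundDevice`, its field names, and the callable signatures are hypothetical, and the behavior of each module is supplied by the caller rather than taken from the patent.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Tuple

Matrix = List[List[float]]


@dataclass
class SurroundDevice:
    """Hypothetical wiring of modules 301 to 305; each field is one module."""
    acquire: Callable[[], Tuple[Matrix, float]]    # 301: first signal and rotation angle
    make_rotation: Callable[[float], Matrix]       # 302: rotation angle to rotation matrix
    get_parameter: Callable[[Matrix], Any]         # 303: signal to adjustment parameter
    adjust: Callable[[Matrix, Any], Matrix]        # 304: first signal to second signal
    render: Callable[[Matrix, Matrix], Matrix]     # 305: second signal to surround sound

    def run(self) -> Matrix:
        """Run the modules in the order given in the description."""
        first_signal, angle = self.acquire()
        rotation = self.make_rotation(angle)
        parameter = self.get_parameter(first_signal)
        second_signal = self.adjust(first_signal, parameter)
        return self.render(second_signal, rotation)
```

Passing the modules in as callables keeps the sketch independent of any particular upmixing or rendering method, in line with the later remark that the division into functional modules is only an example.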
Further, the second generation module 305 includes:
A rotation unit, configured to rotate the second audio signal according to the rotation matrix to obtain a third audio signal;
A conversion unit, configured to convert the third audio signal into a virtual speaker array signal;
A filter unit, configured to filter the virtual speaker array signal through a head-related transfer function (HRTF) filter to obtain the virtual surround sound.
Further, the second acquisition module 303 includes:
A first acquisition unit, configured to obtain the recording scene of the first audio signal according to the first audio signal;
A second acquisition unit, configured to obtain the adjustment parameter of the first audio signal from the correspondence between recording scenes and adjustment parameters according to the recording scene;
Or, the second acquisition module 303 includes:
A third acquisition unit, configured to obtain the adjustment parameter of the first audio signal from the correspondence between audio signals and adjustment parameters according to the first audio signal.
Further, the first acquisition unit includes:
An analysis subunit, configured to analyze the first audio signal to obtain the content of the first audio signal;
A determination subunit, configured to determine the recording scene of the first audio signal according to the content;
Or, the first acquisition unit includes:
An acquisition subunit, configured to obtain the recording scene of the first audio signal from the correspondence between audio signals and recording scenes according to the first audio signal.
Further, the adjustment parameter includes the upmix order and the topology of the virtual speakers;
The adjusting module 304 includes:
A first processing unit, configured to upmix the first audio signal according to the upmix order to obtain a fourth audio signal;
A second processing unit, configured to perform surround processing on the fourth audio signal according to the topology of the virtual speakers to obtain the second audio signal.
In the embodiment of the present invention, the rotation angle of the user's head is obtained by a head tracker; a rotation matrix is generated according to the rotation angle; the adjustment parameter of the first audio signal is obtained according to the first audio signal; the first audio signal is adjusted according to the adjustment parameter to obtain the second audio signal; and virtual surround sound is generated according to the second audio signal and the rotation matrix. Because the virtual surround sound is rotated according to the rotation of the user's head, its realism can be improved.
It should be noted that when the device provided by the above embodiment generates virtual surround sound, the division into the functional modules described above is only an example. In practical applications, the functions may be allocated to different functional modules as needed; that is, the internal structure of the device may be divided into different functional modules to complete all or part of the functions described above. In addition, the device embodiment and the method embodiment for generating virtual surround sound belong to the same concept. For the specific implementation process, refer to the method embodiment; it is not repeated here.
A person of ordinary skill in the art will understand that all or part of the steps of the above embodiments may be implemented by hardware, or by a program instructing the related hardware. The program may be stored in a computer-readable storage medium, and the storage medium may be a read-only memory, a magnetic disk, an optical disc, or the like.
The foregoing describes only preferred embodiments of the present invention and is not intended to limit the invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.
Claims (8)
1. A method for generating virtual surround sound, characterized in that the method comprises:
Obtaining a first audio signal of an audio file and a rotation angle of a user's head;
Generating a rotation matrix according to the rotation angle;
Obtaining an adjustment parameter of the first audio signal according to the first audio signal, the adjustment parameter including an upmix order and a topology of virtual speakers;
Adjusting the first audio signal according to the adjustment parameter to obtain a second audio signal;
Generating virtual surround sound according to the second audio signal and the rotation matrix;
Wherein obtaining the adjustment parameter of the first audio signal according to the first audio signal comprises:
Obtaining a recording scene of the first audio signal according to the first audio signal, and obtaining the adjustment parameter of the first audio signal from a correspondence between recording scenes and adjustment parameters according to the recording scene; or,
Obtaining the adjustment parameter of the first audio signal from a correspondence between audio signals and adjustment parameters according to the first audio signal; or,
Obtaining a terminal type of a terminal, and obtaining the adjustment parameter of the first audio signal according to the terminal type and the first audio signal.
2. The method of claim 1, characterized in that generating virtual surround sound according to the second audio signal and the rotation matrix comprises:
Rotating the second audio signal according to the rotation matrix to obtain a third audio signal;
Converting the third audio signal into a virtual speaker array signal;
Filtering the virtual speaker array signal through a head-related transfer function (HRTF) filter to obtain the virtual surround sound.
3. The method of claim 1, characterized in that obtaining the recording scene of the first audio signal according to the first audio signal comprises:
Analyzing the first audio signal to obtain the content of the first audio signal, and determining the recording scene of the first audio signal according to the content; or,
Obtaining the recording scene of the first audio signal from a correspondence between audio signals and recording scenes according to the first audio signal.
4. The method of claim 1, characterized in that adjusting the first audio signal according to the adjustment parameter to obtain the second audio signal comprises:
Upmixing the first audio signal according to the upmix order to obtain a fourth audio signal;
Performing surround processing on the fourth audio signal according to the topology of the virtual speakers to obtain the second audio signal.
5. A device for generating virtual surround sound, characterized in that the device comprises:
A first acquisition module, configured to obtain a first audio signal of an audio file and a rotation angle of a user's head;
A first generation module, configured to generate a rotation matrix according to the rotation angle;
A second acquisition module, configured to obtain an adjustment parameter of the first audio signal according to the first audio signal, the adjustment parameter including an upmix order and a topology of virtual speakers;
An adjusting module, configured to adjust the first audio signal according to the adjustment parameter to obtain a second audio signal;
A second generation module, configured to generate virtual surround sound according to the second audio signal and the rotation matrix;
The second acquisition module comprises:
A first acquisition unit, configured to obtain a recording scene of the first audio signal according to the first audio signal;
A second acquisition unit, configured to obtain the adjustment parameter of the first audio signal from a correspondence between recording scenes and adjustment parameters according to the recording scene;
Or, the second acquisition module comprises:
A third acquisition unit, configured to obtain the adjustment parameter of the first audio signal from a correspondence between audio signals and adjustment parameters according to the first audio signal;
Or, the second acquisition module is further configured to obtain a terminal type of a terminal, and to obtain the adjustment parameter of the first audio signal according to the terminal type and the first audio signal.
6. The device of claim 5, characterized in that the second generation module comprises:
A rotation unit, configured to rotate the second audio signal according to the rotation matrix to obtain a third audio signal;
A conversion unit, configured to convert the third audio signal into a virtual speaker array signal;
A filter unit, configured to filter the virtual speaker array signal through a head-related transfer function (HRTF) filter to obtain the virtual surround sound.
7. The device of claim 5, characterized in that the first acquisition unit comprises:
An analysis subunit, configured to analyze the first audio signal to obtain the content of the first audio signal;
A determination subunit, configured to determine the recording scene of the first audio signal according to the content;
Or, the first acquisition unit comprises:
An acquisition subunit, configured to obtain the recording scene of the first audio signal from a correspondence between audio signals and recording scenes according to the first audio signal.
8. The device of claim 5, characterized in that the adjustment parameter includes the upmix order and the topology of the virtual speakers;
The adjusting module comprises:
A first processing unit, configured to upmix the first audio signal according to the upmix order to obtain a fourth audio signal;
A second processing unit, configured to perform surround processing on the fourth audio signal according to the topology of the virtual speakers to obtain the second audio signal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510519948.2A CN105120421B (en) | 2015-08-21 | 2015-08-21 | A kind of method and apparatus for generating virtual surround sound |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105120421A (en) | 2015-12-02 |
CN105120421B (en) | 2017-06-30 |
Family
ID=54668260
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510519948.2A Active CN105120421B (en) | 2015-08-21 | 2015-08-21 | A kind of method and apparatus for generating virtual surround sound |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105120421B (en) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105682000B (en) * | 2016-01-11 | 2017-11-07 | 北京时代拓灵科技有限公司 | A kind of audio-frequency processing method and system |
CN105959905B (en) * | 2016-04-27 | 2017-10-24 | 北京时代拓灵科技有限公司 | Mixed mode spatial sound generates System and method for |
CN105872940B (en) * | 2016-06-08 | 2017-11-17 | 北京时代拓灵科技有限公司 | A kind of virtual reality sound field generation method and system |
CN106210990B (en) * | 2016-07-13 | 2018-08-10 | 北京时代拓灵科技有限公司 | A kind of panorama sound audio processing method |
CN107566936B (en) * | 2017-07-12 | 2020-07-10 | 捷开通讯(深圳)有限公司 | Earphone capable of automatically adjusting audio data, method thereof and storage medium |
CN108520756B (en) * | 2018-03-20 | 2020-09-01 | 北京时代拓灵科技有限公司 | Method and device for separating speaker voice |
CN108921000B (en) * | 2018-04-16 | 2024-02-06 | 深圳市深网视界科技有限公司 | Head angle labeling, prediction model training, prediction method, device and medium |
CN108966113A (en) * | 2018-07-13 | 2018-12-07 | 武汉轻工大学 | Sound field rebuilding method, audio frequency apparatus, storage medium and device based on angle |
TWI698132B (en) | 2018-07-16 | 2020-07-01 | 宏碁股份有限公司 | Sound outputting device, processing device and sound controlling method thereof |
CN110740415B (en) * | 2018-07-20 | 2022-04-26 | 宏碁股份有限公司 | Sound effect output device, arithmetic device and sound effect control method thereof |
GB2580360A (en) * | 2019-01-04 | 2020-07-22 | Nokia Technologies Oy | An audio capturing arrangement |
CN117597945A (en) * | 2022-06-14 | 2024-02-23 | 北京小米移动软件有限公司 | Audio playing method, device and storage medium |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1158047A (en) * | 1995-09-28 | 1997-08-27 | 索尼公司 | image/audio reproducing system |
CN101133679A (en) * | 2004-09-01 | 2008-02-27 | 史密斯研究公司 | Personalized headphone virtualization |
CN102318374A (en) * | 2009-02-13 | 2012-01-11 | 皇家飞利浦电子股份有限公司 | Head tracking |
CN103262159A (en) * | 2010-10-05 | 2013-08-21 | 华为技术有限公司 | Method and apparatus for encoding/decoding multichannel audio signal |
CN103559876A (en) * | 2013-11-07 | 2014-02-05 | 安徽科大讯飞信息科技股份有限公司 | Sound effect processing method and sound effect processing system |
CN104244164A (en) * | 2013-06-18 | 2014-12-24 | 杜比实验室特许公司 | Method, device and computer program product for generating surround sound field |
CN104284291A (en) * | 2014-08-07 | 2015-01-14 | 华南理工大学 | Headphone dynamic virtual replaying method based on 5.1 channel surround sound and implementation device thereof |
CN104464739A (en) * | 2013-09-18 | 2015-03-25 | 华为技术有限公司 | Audio signal processing method and device and difference beam forming method and device |
Also Published As
Publication number | Publication date |
---|---|
CN105120421A (en) | 2015-12-02 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||