CN105120421B - A kind of method and apparatus for generating virtual surround sound - Google Patents

A kind of method and apparatus for generating virtual surround sound Download PDF

Info

Publication number
CN105120421B
CN105120421B CN201510519948.2A CN201510519948A CN105120421B CN 105120421 B CN105120421 B CN 105120421B CN 201510519948 A CN201510519948 A CN 201510519948A CN 105120421 B CN105120421 B CN 105120421B
Authority
CN
China
Prior art keywords
audio signal
adjusting parameter
signal
audio
surround sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510519948.2A
Other languages
Chinese (zh)
Other versions
CN105120421A (en
Inventor
孙学京
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Tuoling Inc
Original Assignee
Beijing Tuoling Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Tuoling Inc filed Critical Beijing Tuoling Inc
Priority to CN201510519948.2A priority Critical patent/CN105120421B/en
Publication of CN105120421A publication Critical patent/CN105120421A/en
Application granted granted Critical
Publication of CN105120421B publication Critical patent/CN105120421B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses a kind of method and apparatus for generating virtual surround sound, belong to field of signal processing.Method includes:Obtain the anglec of rotation of the first audio signal and the user's head rotation of audio file;According to the anglec of rotation, spin matrix is generated;According to first audio signal, the adjusting parameter of first audio signal is obtained;According to the adjusting parameter, first audio signal is adjusted and obtains the second audio signal;According to second audio signal and the spin matrix, virtual surround sound is generated.Device includes:First acquisition module, the first generation module, the second acquisition module, adjusting module and the second generation module.The anglec of rotation that the present invention rotates according to user's head rotates virtual surround sound, such that it is able to improve the authenticity of virtual surround sound.

Description

A kind of method and apparatus for generating virtual surround sound
Technical field
The present invention relates to field of signal processing, more particularly to a kind of method and apparatus for generating virtual surround sound.
Background technology
At present, when user listens to music using terminals such as mobile phone or computers, if it is desired to experience the void at concert scene When intending the effect of surround sound, it is necessary to the multiple audio amplifiers of terminal connection, the music is played by multiple audio amplifiers;But due to price and sky Between aspect the reason for, all without enough audio amplifiers, at this moment terminal needs to produce virtual surround sound general user, allows user to experience In the effect at concert scene.
Prior art provides a kind of method for generating virtual surround sound, Ke Yiwei:Terminal obtains audio file to be included B format signals, the B format signals are converted into virtual speaker array signal, virtual speaker array signal is passed through HRTF (Head Related Transfer Function, head related transfer function) wave filter is filtered, and obtains virtual ring Around sound.
Realize it is of the invention during, inventor find prior art at least there is problems with:
When user has on earphone and listens virtual surround sound, when user's head rotates, the virtual surround sound in earphone can be and then The end rotation of user, so causes people to listen the sensation of music different at the scene, namely the virtual surround sound of generation is not true enough.
The content of the invention
In order to solve problem of the prior art, the invention provides a kind of method and apparatus for generating virtual surround sound.Skill Art scheme is as follows:
A kind of method for generating virtual surround sound, methods described includes:
Obtain the anglec of rotation of the first audio signal and the user's head rotation of audio file;
According to the anglec of rotation, spin matrix is generated;
According to first audio signal, the adjusting parameter of first audio signal is obtained;
According to the adjusting parameter, first audio signal is adjusted and obtains the second audio signal;
According to second audio signal and the spin matrix, virtual surround sound is generated.
Further, it is described according to second audio signal and the spin matrix, virtual surround sound is generated, including:
According to the spin matrix, second audio signal rotate obtain the 3rd audio signal;
According to the 3rd audio signal, the 3rd audio signal is converted into virtual speaker array signal;
The virtual speaker array signal is filtered by head related transfer function wave filter, obtain virtual ring around Sound.
Further, it is described according to first audio signal, obtain the adjusting parameter of first audio signal, bag Include:
According to first audio signal, the recording scene of first audio signal is obtained, according to the recording scene, The adjusting parameter of first audio signal is obtained from the corresponding relation for recording scene and adjusting parameter;Or,
According to first audio signal, first audio is obtained from the corresponding relation of audio signal and adjusting parameter The adjusting parameter of signal.
Further, it is described according to first audio signal, obtain the recording scene of first audio signal, bag Include:
First audio signal is analyzed, the content of first audio signal is obtained, according to the content, it is determined that described The recording scene of the first audio signal;Or,
According to first audio signal, first audio is obtained from the corresponding relation of audio signal and recording scene The recording scene of signal.
Further, the adjusting parameter includes the topological structure of upper mixed exponent number and virtual speaker;
It is described first audio signal to be adjusted and obtains the second audio signal according to the adjusting parameter, including:
According to the upper mixed exponent number, first audio signal is carried out into upper mixed treatment and obtains the 4th audio signal;
According to the topological structure of the virtual speaker, the 4th audio signal is carried out surrounding treatment obtaining the second sound Frequency signal.
A kind of device for generating virtual surround sound, described device includes:
First acquisition module, the anglec of rotation of the first audio signal and the user's head rotation for obtaining audio file;
First generation module, for according to the anglec of rotation, generating spin matrix;
Second acquisition module, for according to first audio signal, obtaining the adjusting parameter of first audio signal;
Adjusting module, for according to the adjusting parameter, being adjusted to first audio signal and obtaining the second audio Signal;
Second generation module, for according to second audio signal and the spin matrix, generating virtual surround sound.
Further, second generation module, including:
Rotary unit, the 3rd audio is obtained for according to the spin matrix, second audio signal rotate Signal;
Converting unit, for according to the 3rd audio signal, the 3rd audio signal being converted into virtual speaker Array signal;
Filter unit, for the virtual speaker array signal to be filtered by head related transfer function wave filter Ripple, obtains virtual surround sound.
Further, second acquisition module, including:
First acquisition unit, for according to first audio signal, obtaining the recording scene of first audio signal;
Second acquisition unit, for according to the recording scene, being obtained from the corresponding relation for recording scene and adjusting parameter Take the adjusting parameter of first audio signal;
Or, second acquisition module, including:
3rd acquiring unit, for according to first audio signal, from audio signal and the corresponding relation of adjusting parameter The middle adjusting parameter for obtaining first audio signal.
Further, the first acquisition unit, including:
Analysis subelement, for analyzing first audio signal, obtains the content of first audio signal;
Determination subelement, for according to the content, determining the recording scene of first audio signal;
Or, the first acquisition unit, including:
Subelement is obtained, for according to first audio signal, from the corresponding relation of audio signal and recording scene Obtain the recording scene of first audio signal.
Further, the adjusting parameter includes the topological structure of upper mixed exponent number and virtual speaker;
The adjusting module, including:
First processing units, for according to the upper mixed exponent number, first audio signal being carried out into upper mixed treatment and being obtained 4th audio signal;
Second processing unit, for the topological structure according to the virtual speaker, the 4th audio signal is carried out The second audio signal is obtained around treatment.
In embodiments of the present invention, the anglec of rotation that user's head rotates is obtained by head-tracker, according to the rotation Angle, generates spin matrix, according to the first audio signal, obtains the adjusting parameter of the first audio signal, according to the adjusting parameter, First audio signal is adjusted and obtains the second audio signal, according to the second audio signal and the spin matrix, generation is virtual Surround sound, such that it is able to improve the authenticity of virtual surround sound.
Brief description of the drawings
Fig. 1 is a kind of method flow diagram of the virtual surround sound of generation that the embodiment of the present invention 1 is provided;
Fig. 2-1 is a kind of method flow diagram of the virtual surround sound of generation that the embodiment of the present invention 2 is provided;
Fig. 2-2 is a kind of schematic diagram of the topological structure of virtual speaker that the embodiment of the present invention 2 is provided;
Fig. 2-3 is the schematic diagram of the topological structure of another virtual speaker that the embodiment of the present invention 2 is provided;
Fig. 3 is a kind of apparatus structure schematic diagram of the virtual surround sound of generation that the embodiment of the present invention 3 is provided.
Specific embodiment
To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with accompanying drawing to embodiment party of the present invention Formula is described in further detail.
Embodiment 1
A kind of method for generating virtual surround sound is the embodiment of the invention provides, the executive agent of the method can be for eventually End, referring to Fig. 1, wherein, the method includes:
Step 101:Obtain the anglec of rotation of the first audio signal and the user's head rotation of audio file;
Step 102:According to the anglec of rotation, spin matrix is generated;
Step 103:According to the first audio signal, the adjusting parameter of the first audio signal is obtained;
Step 104:According to the adjusting parameter, the first audio signal is adjusted and obtains the second audio signal;
Step 105:According to the second audio signal and the spin matrix, virtual surround sound is generated.
Further, according to the second audio signal and the spin matrix, virtual surround sound is generated, including:
According to the spin matrix, the second audio signal rotate obtain the 3rd audio signal;
According to the 3rd audio signal, the 3rd audio signal is converted into virtual speaker array signal;
Virtual speaker array signal is filtered by head related transfer function wave filter, virtual surround sound is obtained.
Further, according to the first audio signal, the adjusting parameter of the first audio signal is obtained, including:
According to the first audio signal, obtain the recording scene of the first audio signal, according to recording scene, from record scene and The adjusting parameter of the first audio signal is obtained in the corresponding relation of adjusting parameter;Or,
According to the first audio signal, the tune of the first audio signal is obtained from the corresponding relation of audio signal and adjusting parameter Whole parameter.
Further, according to the first audio signal, the recording scene of the first audio signal is obtained, including:
The first audio signal is analyzed, the content of the first audio signal is obtained, according to content, the record of the first audio signal is determined Scene processed;Or,
According to the first audio signal, the record of the first audio signal is obtained from the corresponding relation of audio signal and recording scene Scene processed.
Further, adjusting parameter includes the topological structure of upper mixed exponent number and virtual speaker;
According to adjusting parameter, the first audio signal is adjusted and obtains the second audio signal, including:
According to upper mixed exponent number, the first audio signal is carried out into upper mixed treatment and obtains the 4th audio signal;
According to the topological structure of virtual speaker, the 4th audio signal is carried out surrounding treatment obtaining the second audio signal.
In embodiments of the present invention, the anglec of rotation that user's head rotates is obtained by head-tracker, according to the rotation Angle, generates spin matrix, according to the first audio signal, obtains the adjusting parameter of the first audio signal, according to the adjusting parameter, First audio signal is adjusted and obtains the second audio signal, according to the second audio signal and the spin matrix, generation is virtual Surround sound, such that it is able to improve the authenticity of virtual surround sound.
Embodiment 2
A kind of method for generating virtual surround sound is the embodiment of the invention provides, the executive agent of the method can be for eventually End, referring to Fig. 2-1, wherein, the method includes:
Step 201:Obtain the anglec of rotation of the first audio signal and the user's head rotation of audio file;
When user plays the audio file that high in the clouds or server end are stored by earphone, terminal obtains audio file First audio signal and the anglec of rotation of user's head rotation.
Wherein, the step of anglec of rotation of terminal acquisition user's head rotation can be:
Head-tracker is set on earphone or in equipment of the user's head with head-tracker, such as virtual reality Whether display device, rotated by head-tracker real-time detection user's head, if user's head is rotated, is obtained The anglec of rotation of user's head rotation is taken, the anglec of rotation is sent to terminal;Terminal receives the anglec of rotation that head-tracker sends Degree.
Wherein, the first audio signal can be single order B format signals, and B format signals can be triple-track signal, it is also possible to It is quadraphonic signal;If B format signals are triple-track signal, B format signals include W, X and Y;If B format signals are Quadraphonic signal, then B format signals include W, X, Y and Z.Terminal can be mobile phone, panel computer or PC (personal Computer, PC) terminal etc..
W sound channel signals represent omnirange sound wave, and X sound channel signals, Y sound channel signals and Z sound channel signals are represented along three mutually Vertical orientated sound wave;X sound channel signals are represented to be listened from rear to preceding horizontally disposed, and Y sound channel signals are represented listens water from right to left Plain cloth is put, and Z sound channel signals are represented to listen and are arranged vertically upwards.
Step 202:According to the anglec of rotation, spin matrix is generated;
Spin matrix is used to rotate virtual surround sound, so that when user's head rotates, virtual surround sound not basis The rotation of user's head and rotate, realize the pleasant to the ear music of actual life effect.
For example, the direction of virtual surround sound is in front, when user's head is to 30 degree of anticlockwise, then by the virtual surround sound from To 30 degree of right rotation at the postrotational position of user's head, so as to realize the direction of virtual surround sound still in original direction On.
If B format signals are triple-track signal, spin matrix isIf B format signals It is quadraphonic signal, then spin matrix isθ is the anglec of rotation.
Step 203:According to the first audio signal, the adjusting parameter of the first audio signal is obtained;
Adjusting parameter includes the topological structure of upper mixed exponent number and virtual speaker, and the topological structure of virtual speaker includes void Intend the number of loudspeaker and the position of each virtual speaker etc..
This step can be realized by following first way or the second way, for the first implementation, this Step can be realized by following steps (1) and (2), including:
(1):According to the first audio signal, the recording scene of the first audio signal is obtained;
Recording scene is including concert scene, business meetings scene or natural environment scene etc..
This step can be realized by following steps (1-1) and (1-2), including:
(1-1):The first audio signal is analyzed, the content of the first audio signal is obtained;
The content of the first audio signal at least includes directional signal proportion, can also include attribute information and/or main Direction of sound source etc.;Attribute information including the first audio signal including object, etc. Instrument categories and sound class.
Wherein, the first audio signal is analyzed, can be with the step of obtain the directional signal proportion that the first audio signal includes For:
By Direct-ambience signal decomposition (analysis of sensing-ambient signal) Algorithm Analysis One audio signal, obtains the proportion of the directional signal that the first audio signal includes, it is also possible to which obtaining the first audio signal includes Non- directional signal proportion.
For example, only including one's voice in speech in the first audio signal, then the first audio signal is sounded and just have very strong Directionality, then by Direct-ambience signal decomposition the first audio signals of Algorithm Analysis, obtain The proportion of the directional signal in one audio signal is larger;For another example, noise or a large amount of reverberation are included in the first audio signal, then First audio signal sounds that directionality is not just strong, then by Direct-ambience signal decomposition algorithms The first audio signal is analyzed, the proportion for obtaining directional signal in the first audio signal is smaller.
Wherein, the first audio signal is analyzed, may be used also the step of obtain the directional signal proportion that the first audio signal includes Realized with by following steps (A) to (C), including:
(A):Covariance matrix is set up to the first audio signal;
Covariance matrixcov(ωi, n)=α cov (ωi, n-1) and+(1- α) * S (ωi, n)*SHi,n)。
If the first audio signal includes W, X and Y, S (ωi, n)=[W (ωi,n)X(ωi,n)Y(ωi,n)]TIf, First audio signal includes W, X, Y and Z, then S (ωi, n)=[W (ωi,n)X(ωi,n)Y(ωi,n)Z(ωi,n)]T
Wherein, ωiIt is the frequency of the first audio signal, n is the index to frame number on time shaft, []HRepresentation vector is conjugated Transposition;α is smoothing factor, and α can be previously set or dynamically be adjusted according to the characteristics of signals of the first audio signal, for example, α Can be with 0.92.ωiInclude all frequencies interested and ωiCan as needed be configured and change;For example, ωiFor 100-16000HZ。
Also, in embodiments of the present invention, each ω can be setiWeight, then to the first audio signal set up association side During difference matrix, can be according to each ωiThe audio signal of weight calculation first covariance matrix, then covariance matrixρ is ωiWeight.
(B):Signature analysis is carried out to covariance matrix, characteristic value is obtained;
Signature analysis is carried out by Matlab function pairs covariance matrix, [V, Λ]=eigs (cov (n)) is obtained.
Wherein, V is the matrix of 3*3 or the matrix of 4*4, and each column of the matrix represents the characteristic vector of cov (n);Λ bags The individual features value being arranged in decreasing order is contained.
(C):According to characteristic value, the proportion of directional signal is calculated.
Select eigenvalue of maximum as the First Eigenvalue from characteristic value, selected from the characteristic value in addition to eigenvalue of maximum Eigenvalue of maximum is selected as Second Eigenvalue, according to the First Eigenvalue and Second Eigenvalue, is calculated by below equation (1) and pointed to The proportion of property signal.
Formula (1);
Wherein, DRR is the proportion of directional signal;λ1It is the First Eigenvalue, and λ1Corresponding to through acoustic energy;λ2It is Two characteristic values, and λ2Correspondence reflection, echo, ambient sound etc..Between [0,1], the value of DRR is smaller to represent direct sound wave to the value of DRR Proportion is lower, and sound field directivity is weaker, namely directional signal proportion is lower;The value of DRR is bigger, and the direct sound wave proportion that represents is higher, Sound field directivity is stronger, namely directional signal proportion is higher.
Further, by direction of arrival (sound source arrival direction) audio signal of Algorithm Analysis first, Obtain the direction of the first audio signal main sound source.
Further, by Instrument classification (musical instrument classification) audio signal of Algorithm Analysis first, Obtain the Instrument categories of the first audio signal;By Speech music classification (voice music classification) algorithm The first audio signal is analyzed, the sound class of the first audio signal is obtained.
Further, by Object extraction (object extraction) algorithm extract the first audio signal include it is right As.
For example, the first audio signal is one section of voice, then the first audio is extracted by Object extraction algorithms and believed Number object be voice;For another example, the first audio signal is one section of thunder, then extract the by Object extraction algorithms The object of one audio signal is thunder etc.;For another example, the first audio signal is one section of music, then by Object extraction The object that algorithm extracts the first audio signal is music etc..
Further, server can obtain the content of the first audio signal, by first with the audio signal of ex ante analysis first The content of audio signal and the first audio signal is stored in the corresponding relation of audio signal and content, then this step can be:
According to the first audio signal, the first sound is obtained in the audio signal and the corresponding relation of content that are stored from server The content of frequency signal.
Wherein, the audio signal and the corresponding relation of content for being stored in server can be stored in clothes in the form of metadata In business device, and directly can be embedded in the content of the first audio signal in the first audio signal by server, it is also possible to by first The content of audio signal is individually deposited, and sets up content file folder, by the content storage of the first audio signal in content file folder In, and set up the corresponding relation of the first audio signal and content file folder.
Terminal when the first audio of audio file is obtained, can obtain the content of the first audio signal, it is also possible at this The content of the first audio file is obtained in step.Also, audio signal and the correspondence pass of content that terminal is stored from server The content of the first audio signal is obtained in system, the computational burden of terminal can be mitigated, and improve terminal and generate virtual surround sound Efficiency.
(1-2):According to the content of the first audio signal, the recording scene of the first audio signal is determined.
The corresponding relation of storage content and recording scene in server, accordingly, this step can be:
Terminal according to the content of the first audio signal, in the content stored from server and the corresponding relation for recording scene Obtain the recording scene of the first audio signal.
In this step, terminal can also obtain content from server and record the corresponding relation of scene, storage content With the corresponding relation for recording scene;Accordingly, this step can be:
According to the content of the first audio signal, the content stored from terminal and the is obtained in the corresponding relation for recording scene The recording scene of one audio signal.
Wherein, the corresponding relation of content and recording scene can be stored in terminal or server in the form of metadata, And directly can be embedded in the recording scene of the first audio signal in the content of the first audio signal by terminal or server, also may be used Individually deposited with by the recording scene of the first audio signal, set up and record document scene folder, by the recording of the first audio signal Scape storage sets up the content of the first audio signal and the corresponding relation of the recording scene in recording document scene folder.
Further, terminal stores in audio signal and record the recording scene of the first audio signal and the first audio signal In the corresponding relation of scene processed;When playing the first audio signal again again so as to terminal, first is determined without by above method The recording scene of audio signal, directly obtains the recording of the first audio signal from the corresponding relation of audio signal and recording scene Scene.
For example, when the proportion of the directional signal of terminal storage is more than 0.5, the recording scene for determining the first audio signal is Business meetings;When the proportion of directional signal is less than 0.5, the recording scene for determining the first audio signal is concert.
(2):According to scene is recorded, the tune of the first audio signal is obtained from the corresponding relation for recording scene and adjusting parameter Whole parameter.
The corresponding relation for recording scene and adjusting parameter is stored in server, then this step can be:
According to scene is recorded, the first sound is obtained in the recording scene and the corresponding relation of adjusting parameter that are stored from server The adjusting parameter of frequency signal.
In this step, terminal can also obtain the corresponding relation for recording scene and adjusting parameter, storage from server Record the corresponding relation of scene and adjusting parameter;Accordingly, this step can be:
According to scene is recorded, the first audio is obtained in the recording scene and the corresponding relation of adjusting parameter that are stored from terminal The adjusting parameter of signal.
Further, terminal stores in audio signal and tune the adjusting parameter of the first audio signal and the first audio signal In the corresponding relation of whole parameter, so that when terminal plays the first audio signal again, the first sound is determined without by above method The recording scene of frequency signal, obtaining adjusting parameter according to recording scene, but directly from the right of audio signal and adjusting parameter The adjusting parameter of the first audio signal of middle acquisition should be related to, so as to shorten the acquisition of the adjusting parameter for obtaining the first audio signal Time, improve acquisition efficiency.
Further, for second implementation, this step can be:
The corresponding relation of audio signal and adjusting parameter is stored in server, terminal according to the first audio signal, from service The adjusting parameter of the first audio signal is obtained in the audio signal and the corresponding relation of adjusting parameter that are stored in device.
Further, the corresponding relation of audio signal and adjusting parameter can be stored in server in the form of metadata In, and server can directly by adjusting parameter first audio signal of insertion of the first audio signal, it is also possible to by first The adjusting parameter of audio signal is individually deposited, and sets up adjusting parameter file, and the adjusting parameter storage of the first audio signal is existed In adjusting parameter file, and set up the corresponding relation of the first audio signal and the adjusting parameter file.
For example, the proportion of directional signal is more than 0.5 in the first audio signal, then upper mixed exponent number is 3, and virtual speaker Topological structure include 6 virtual speakers, referring to Fig. 2-2;For another example, the proportion of directional signal is less than in the first audio signal 0.5, then upper mixed exponent number is 1, and the topological structure of virtual speaker includes 4 virtual speakers, referring to Fig. 2-3.
For example, the directionality sound source of the first audio signal is distributed in a direction, such as sound field content is concert, sound field Concentrate on stage direction namely front, then by the left front in the topological structure of virtual speaker and right speakers away from It is big from becoming.
Further, the different adjusting parameter of different terminal type correspondences can also be set in embodiments of the present invention, Then this step can be:
The terminal type of terminal is obtained, according to terminal type and the first audio signal, the adjustment of the first audio signal is obtained Parameter.
The corresponding relation of terminal type, audio signal and adjusting parameter is previously stored in server;Accordingly, according to terminal Type and the first audio signal, obtain the first audio signal adjusting parameter the step of can be:
According to terminal type and the first audio signal, the terminal type stored from server, audio signal and adjustment ginseng The adjusting parameter of the first audio signal is obtained in several corresponding relations.
The topological structure of virtual speaker has great influence to the quality of virtual surround sound, and to virtual surround sound Influence is different and different according to the content of the first audio signal;For example, the first audio signal major part both is from front, then it is empty The topological structure for intending loudspeaker can select rectangular configuration, rather than square structure.Therefore, in the embodiment of the present invention, can realize Content-adaptive adjustment adjusting parameter according to the first audio signal such that it is able to ensure the broadcasting tonequality of virtual surround sound.And And, different terminals have different operational capabilities and power consumption, according to terminal type and the first audio signal, obtain the first audio letter Number adjusting parameter can save the power consumption of terminal.
Step 204:According to the adjusting parameter, the first audio signal is adjusted and obtains the second audio signal;
Adjusting parameter includes the topological structure of upper mixed exponent number and virtual speaker, then this step can be by following steps (1) realized with (2), including:
(1):According to upper mixed exponent number, the first audio signal is carried out into upper mixed treatment and obtains the 4th audio signal;
Wherein, this step is prior art, is no longer described in detail herein.
(2):According to the topological structure of virtual speaker, the 4th audio signal is carried out surrounding treatment obtaining the second audio letter Number.
4th audio signal is sequentially passed through into the virtual speaker that the topological structure of virtual speaker includes, so as to realize 4th audio signal is carried out surrounding treatment, the second audio signal is obtained.
For example, the first audio signal isThen the second audio signal after adjustment isFor another example, the first audio Signal isThen the second audio signal after adjustment is
Step 205:According to spin matrix, the second audio signal rotate obtain the 3rd audio signal;
Spin matrix and the second audio signal are carried out into multiplying, the 3rd audio signal is obtained.
For example, the second audio signal isSpin matrix isThen the 3rd audio signal isFor another example, the second audio signal isSpin matrix is Then the 3rd audio signal is
Step 206:According to the 3rd audio signal, the 3rd audio signal is converted into virtual speaker array signal;
Virtual speaker matrix is obtained, virtual speaker matrix and the 3rd audio signal are carried out into matrix multiplication, obtain void Intend loudspeaker array signal.
For example, virtual speaker array isThen virtual speaker array signal is For another example, virtual speaker array isThen virtual speaker battle array Column signal is
Wherein, N is the number of the virtual speaker that virtual speaker topological structure includes.
Step 207:Virtual speaker array signal is filtered by head related transfer function wave filter, obtains virtual Surround sound.
Head related transfer function wave filter is used to for virtual speaker array signal being converted to two roads stereo, namely ears Signal, then this step can be:
Obtain the corresponding stereo matrix in two roads of head correlation function transforming function transformation function wave filter, the road stereoscopic matrixes of Jiang Gai bis- and void Intending loudspeaker array signal carries out matrix multiplication, obtains virtual surround sound.
For example, the stereo matrix in two roads isThen virtual surround sound is Or virtual surround sound is
In embodiments of the present invention, the anglec of rotation that user's head rotates is obtained by head-tracker, according to the rotation Angle, generates spin matrix, according to the first audio signal, obtains the adjusting parameter of the first audio signal, according to the adjusting parameter, First audio signal is adjusted and obtains the second audio signal, according to the second audio signal and the spin matrix, generation is virtual Surround sound, such that it is able to improve the authenticity of virtual surround sound.
Embodiment 3
A kind of device for generating virtual surround sound is the embodiment of the invention provides, the device can be terminal, referring to Fig. 3, Device includes:
First acquisition module 301, the anglec of rotation of the first audio signal and the user's head rotation for obtaining audio file Degree;
First generation module 302, for according to the anglec of rotation, generating spin matrix;
Second acquisition module 303, for according to the first audio signal, obtaining the adjusting parameter of the first audio signal;
Adjusting module 304, for according to adjusting parameter, being adjusted to the first audio signal and obtaining the second audio signal;
Second generation module 305, for according to the second audio signal and spin matrix, generating virtual surround sound.
Further, the second generation module 305, including:
Rotary unit, the 3rd audio signal is obtained for according to spin matrix, the second audio signal rotate;
Converting unit, for according to the 3rd audio signal, the 3rd audio signal being converted into virtual speaker array signal;
Filter unit, for virtual speaker array signal to be filtered by head related transfer function wave filter, obtains To virtual surround sound.
Further, the second acquisition module 303, including:
First acquisition unit, for according to the first audio signal, obtaining the recording scene of the first audio signal;
Second acquisition unit, for according to scene is recorded, obtaining the from the corresponding relation of scene and adjusting parameter is recorded The adjusting parameter of one audio signal;
Or, the second acquisition module 303, including:
3rd acquiring unit, for according to the first audio signal, being obtained from the corresponding relation of audio signal and adjusting parameter Take the adjusting parameter of the first audio signal.
Further, first acquisition unit, including:
Analysis subelement, for analyzing the first audio signal, obtains the content of the first audio signal;
Determination subelement, for according to content, determining the recording scene of the first audio signal;
Or, first acquisition unit, including:
Subelement is obtained, for according to the first audio signal, being obtained from the corresponding relation of audio signal and recording scene The recording scene of the first audio signal.
Further, adjusting parameter includes the topological structure of upper mixed exponent number and virtual speaker;
Adjusting module 304, including:
First processing units, for according to upper mixed exponent number, the first audio signal being carried out into upper mixed treatment and obtaining the 4th audio Signal;
Second processing unit, for the topological structure according to virtual speaker, the 4th audio signal is carried out surrounding treatment Obtain the second audio signal.
In embodiments of the present invention, the anglec of rotation that user's head rotates is obtained by head-tracker, according to the rotation Angle, generates spin matrix, according to the first audio signal, obtains the adjusting parameter of the first audio signal, according to the adjusting parameter, First audio signal is adjusted and obtains the second audio signal, according to the second audio signal and the spin matrix, generation is virtual Surround sound, such that it is able to improve the authenticity of virtual surround sound.
It should be noted that:The device of the virtual surround sound of generation that above-described embodiment is provided when virtual surround sound is generated, Only carried out with the division of above-mentioned each functional module for example, in practical application, as needed can distribute above-mentioned functions Completed by different functional module, will the internal structure of device be divided into different functional modules, it is described above to complete All or part of function.In addition, the device and the virtual surround sound of generation of the virtual surround sound of generation that above-described embodiment is provided Embodiment of the method belongs to same design, and it implements process and refers to embodiment of the method, repeats no more here.
One of ordinary skill in the art will appreciate that realizing that all or part of step of above-described embodiment can be by hardware To complete, it is also possible to instruct the hardware of correlation to complete by program, described program can be stored in a kind of computer-readable In storage medium, storage medium mentioned above can be read-only storage, disk or CD etc..
The foregoing is only presently preferred embodiments of the present invention, be not intended to limit the invention, it is all it is of the invention spirit and Within principle, any modification, equivalent substitution and improvements made etc. should be included within the scope of the present invention.

Claims (8)

1. a kind of method for generating virtual surround sound, it is characterised in that methods described includes:
Obtain the anglec of rotation of the first audio signal and the user's head rotation of audio file;
According to the anglec of rotation, spin matrix is generated;
According to first audio signal, the adjusting parameter of first audio signal is obtained, the adjusting parameter includes upper mixed The topological structure of exponent number and virtual speaker;
According to the adjusting parameter, first audio signal is adjusted and obtains the second audio signal;
According to second audio signal and the spin matrix, virtual surround sound is generated;
Wherein, it is described according to first audio signal, the adjusting parameter of first audio signal is obtained, including:
According to first audio signal, the recording scene of first audio signal is obtained, according to the recording scene, from record The adjusting parameter of first audio signal is obtained in the corresponding relation of scene processed and adjusting parameter;Or,
According to first audio signal, first audio signal is obtained from the corresponding relation of audio signal and adjusting parameter Adjusting parameter;Or,
The terminal type of terminal is obtained, according to the terminal type and first audio signal, the first audio letter is obtained Number adjusting parameter.
2. the method for claim 1, it is characterised in that described according to second audio signal and the spin moment Battle array, generates virtual surround sound, including:
According to the spin matrix, second audio signal rotate obtain the 3rd audio signal;
According to the 3rd audio signal, the 3rd audio signal is converted into virtual speaker array signal;
The virtual speaker array signal is filtered by head related transfer function wave filter, virtual surround sound is obtained.
3. the method for claim 1, it is characterised in that described according to first audio signal, obtains described first The recording scene of audio signal, including:
First audio signal is analyzed, the content of first audio signal is obtained, according to the content, described first is determined The recording scene of audio signal;Or,
According to first audio signal, first audio signal is obtained from the corresponding relation of audio signal and recording scene Recording scene.
4. the method for claim 1, it is characterised in that described according to the adjusting parameter, believes first audio Number it is adjusted and obtains the second audio signal, including:
According to the upper mixed exponent number, first audio signal is carried out into upper mixed treatment and obtains the 4th audio signal;
According to the topological structure of the virtual speaker, the 4th audio signal is carried out surrounding treatment obtaining the second audio letter Number.
5. a kind of device for generating virtual surround sound, it is characterised in that described device includes:
First acquisition module, the anglec of rotation of the first audio signal and the user's head rotation for obtaining audio file;
First generation module, for according to the anglec of rotation, generating spin matrix;
Second acquisition module, it is described for according to first audio signal, obtaining the adjusting parameter of first audio signal Adjusting parameter includes the topological structure of upper mixed exponent number and virtual speaker;
Adjusting module, for according to the adjusting parameter, being adjusted to first audio signal and obtaining the second audio signal;
Second generation module, for according to second audio signal and the spin matrix, generating virtual surround sound;
Second acquisition module, including:
First acquisition unit, for according to first audio signal, obtaining the recording scene of first audio signal;
Second acquisition unit, for according to the recording scene, institute being obtained from the corresponding relation for recording scene and adjusting parameter State the adjusting parameter of the first audio signal;
Or, second acquisition module, including:
3rd acquiring unit, for according to first audio signal, being obtained from the corresponding relation of audio signal and adjusting parameter Take the adjusting parameter of first audio signal;
Or, second acquisition module is additionally operable to obtain the terminal type of terminal, according to the terminal type and described first Audio signal, obtains the adjusting parameter of first audio signal.
6. device as claimed in claim 5, it is characterised in that second generation module, including:
Rotary unit, the 3rd audio signal is obtained for according to the spin matrix, second audio signal rotate;
Converting unit, for according to the 3rd audio signal, the 3rd audio signal being converted into virtual speaker array Signal;
Filter unit, for the virtual speaker array signal to be filtered by head related transfer function wave filter, obtains To virtual surround sound.
7. device as claimed in claim 5, it is characterised in that the first acquisition unit, including:
Analysis subelement, for analyzing first audio signal, obtains the content of first audio signal;
Determination subelement, for according to the content, determining the recording scene of first audio signal;
Or, the first acquisition unit, including:
Subelement is obtained, for according to first audio signal, being obtained from the corresponding relation of audio signal and recording scene The recording scene of first audio signal.
8. device as claimed in claim 5, it is characterised in that the adjusting parameter includes upper mixed exponent number and virtual speaker Topological structure;
The adjusting module, including:
First processing units, for according to the upper mixed exponent number, first audio signal being carried out into upper mixed treatment and obtaining the 4th Audio signal;
Second processing unit, for the topological structure according to the virtual speaker, the 4th audio signal is surround Treatment obtains the second audio signal.
CN201510519948.2A 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound Active CN105120421B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510519948.2A CN105120421B (en) 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510519948.2A CN105120421B (en) 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound

Publications (2)

Publication Number Publication Date
CN105120421A CN105120421A (en) 2015-12-02
CN105120421B true CN105120421B (en) 2017-06-30

Family

ID=54668260

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510519948.2A Active CN105120421B (en) 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound

Country Status (1)

Country Link
CN (1) CN105120421B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105682000B (en) * 2016-01-11 2017-11-07 北京时代拓灵科技有限公司 A kind of audio-frequency processing method and system
CN105959905B (en) * 2016-04-27 2017-10-24 北京时代拓灵科技有限公司 Mixed mode spatial sound generates System and method for
CN105872940B (en) * 2016-06-08 2017-11-17 北京时代拓灵科技有限公司 A kind of virtual reality sound field generation method and system
CN106210990B (en) * 2016-07-13 2018-08-10 北京时代拓灵科技有限公司 A kind of panorama sound audio processing method
CN107566936B (en) * 2017-07-12 2020-07-10 捷开通讯(深圳)有限公司 Earphone capable of automatically adjusting audio data, method thereof and storage medium
CN108520756B (en) * 2018-03-20 2020-09-01 北京时代拓灵科技有限公司 Method and device for separating speaker voice
CN108921000B (en) * 2018-04-16 2024-02-06 深圳市深网视界科技有限公司 Head angle labeling, prediction model training, prediction method, device and medium
CN108966113A (en) * 2018-07-13 2018-12-07 武汉轻工大学 Sound field rebuilding method, audio frequency apparatus, storage medium and device based on angle
TWI698132B (en) 2018-07-16 2020-07-01 宏碁股份有限公司 Sound outputting device, processing device and sound controlling method thereof
CN110740415B (en) * 2018-07-20 2022-04-26 宏碁股份有限公司 Sound effect output device, arithmetic device and sound effect control method thereof
GB2580360A (en) * 2019-01-04 2020-07-22 Nokia Technologies Oy An audio capturing arrangement
CN117597945A (en) * 2022-06-14 2024-02-23 北京小米移动软件有限公司 Audio playing method, device and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1158047A (en) * 1995-09-28 1997-08-27 索尼公司 image/audio reproducing system
CN101133679A (en) * 2004-09-01 2008-02-27 史密斯研究公司 Personalized headphone virtualization
CN102318374A (en) * 2009-02-13 2012-01-11 皇家飞利浦电子股份有限公司 Head tracking
CN103262159A (en) * 2010-10-05 2013-08-21 华为技术有限公司 Method and apparatus for encoding/decoding multichannel audio signal
CN103559876A (en) * 2013-11-07 2014-02-05 安徽科大讯飞信息科技股份有限公司 Sound effect processing method and sound effect processing system
CN104244164A (en) * 2013-06-18 2014-12-24 杜比实验室特许公司 Method, device and computer program product for generating surround sound field
CN104284291A (en) * 2014-08-07 2015-01-14 华南理工大学 Headphone dynamic virtual replaying method based on 5.1 channel surround sound and implementation device thereof
CN104464739A (en) * 2013-09-18 2015-03-25 华为技术有限公司 Audio signal processing method and device and difference beam forming method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1158047A (en) * 1995-09-28 1997-08-27 索尼公司 image/audio reproducing system
CN101133679A (en) * 2004-09-01 2008-02-27 史密斯研究公司 Personalized headphone virtualization
CN102318374A (en) * 2009-02-13 2012-01-11 皇家飞利浦电子股份有限公司 Head tracking
CN103262159A (en) * 2010-10-05 2013-08-21 华为技术有限公司 Method and apparatus for encoding/decoding multichannel audio signal
CN104244164A (en) * 2013-06-18 2014-12-24 杜比实验室特许公司 Method, device and computer program product for generating surround sound field
CN104464739A (en) * 2013-09-18 2015-03-25 华为技术有限公司 Audio signal processing method and device and difference beam forming method and device
CN103559876A (en) * 2013-11-07 2014-02-05 安徽科大讯飞信息科技股份有限公司 Sound effect processing method and sound effect processing system
CN104284291A (en) * 2014-08-07 2015-01-14 华南理工大学 Headphone dynamic virtual replaying method based on 5.1 channel surround sound and implementation device thereof

Also Published As

Publication number Publication date
CN105120421A (en) 2015-12-02

Similar Documents

Publication Publication Date Title
CN105120421B (en) A kind of method and apparatus for generating virtual surround sound
EP2285139B1 (en) Device and method for converting spatial audio signal
CN105409243B (en) The pretreatment of channelizing music signal
TWI686794B (en) Method and apparatus for decoding encoded audio signal in ambisonics format for l loudspeakers at known positions and computer readable storage medium
CN108305603A (en) Sound effect treatment method and its equipment, storage medium, server, sound terminal
CN107637095A (en) The loudspeaker of reservation privacy, energy efficient for personal voice
CN105284133B (en) Scaled and stereo enhanced apparatus and method based on being mixed under signal than carrying out center signal
CN102907120A (en) System and method for sound processing
CN1762178A (en) Method for treating an electric sound signal
CN109618274B (en) Virtual sound playback method based on angle mapping table, electronic device and medium
US20050069143A1 (en) Filtering for spatial audio rendering
TW201238366A (en) Apparatus and method for a complete audio signal
CN107113496A (en) The surround sound record of mobile device
CN105509691B (en) The detection method of multisensor group fusion and the circular method for acoustic for supporting head tracking
CN109074813A (en) Handle high definition audio data
Hoffbauer et al. Four-directional ambisonic spatial decomposition method with reduced temporal artifacts
CN108040317A (en) A kind of hybrid sense of hearing sound field broadening method
Woszczyk et al. Streamlined 3D sound design: the capture and composition of a sound field
KR20080031709A (en) 3d sound reproduction apparatus using virtual speaker technique under plural channel speaker environments
US20230269552A1 (en) Electronic device, system, method and computer program
Ogg et al. Acoustic correlates of auditory object perception: Speakers, musical timbres, and environmental sounds
Zhou et al. The source and effects of binaural cue ambiguity in free-field stereo sound localization—Modeling simulation
Bai et al. An integrated analysis-synthesis array system for spatial sound fields
Kates et al. A speech quality metric based on a cochlear model.
Diedesch et al. Localization and externalization of speech through hearing aids with adaptive features

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant