CN105120421A - Method and apparatus of generating virtual surround sound - Google Patents

Method and apparatus of generating virtual surround sound Download PDF

Info

Publication number
CN105120421A
CN105120421A CN201510519948.2A CN201510519948A CN105120421A CN 105120421 A CN105120421 A CN 105120421A CN 201510519948 A CN201510519948 A CN 201510519948A CN 105120421 A CN105120421 A CN 105120421A
Authority
CN
China
Prior art keywords
audio signal
adjustment parameter
obtains
signal
surround sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510519948.2A
Other languages
Chinese (zh)
Other versions
CN105120421B (en
Inventor
孙学京
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Tuoling Inc
Original Assignee
Beijing Tuoling Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Tuoling Inc filed Critical Beijing Tuoling Inc
Priority to CN201510519948.2A priority Critical patent/CN105120421B/en
Publication of CN105120421A publication Critical patent/CN105120421A/en
Application granted granted Critical
Publication of CN105120421B publication Critical patent/CN105120421B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses a method and apparatus of generating virtual surround sound, belonging to the signal processing field. The method comprises: obtaining the first audio signal of an audio file, and rotation angles of heads of users; generating a rotation matrix according to the rotation angles of heads of users; obtaining adjusting parameters of the first audio signal according to the first audio signal; adjusting the first audio signal according to the adjusting parameters to obtain a second audio signal; and generating virtual surround sound according to the second audio signal and the rotation matrix. The apparatus comprises a first obtaining module, a first generation module, a second obtaining module, an adjusting module and a second generation module. The method and apparatus can rotate virtual surround sound according to the rotation angles of heads of users, thereby improving the reality of the virtual surround sound.

Description

A kind of method and apparatus of generating virtual surround sound
Technical field
The present invention relates to signal transacting field, particularly a kind of method and apparatus of generating virtual surround sound.
Background technology
At present, when user uses the terminals listen such as mobile phone or computer music, if when wanting the effect of the virtual surround sound experiencing concert scene, just need terminal to connect multiple audio amplifier, play this music by multiple audio amplifier; But due to price and aspect, space, general user does not have enough audio amplifiers, at this moment terminal needs to produce virtual surround sound, allows user experience effect at concert scene.
Prior art provides a kind of method of generating virtual surround sound, can be: terminal obtains the B format signal that audio file comprises, this B format signal is converted to virtual speaker array signal, by virtual speaker array signal by HRTF (HeadRelatedTransferFunction, head related transfer function) filter carries out filtering, obtains virtual surround sound.
Realizing in process of the present invention, inventor finds that prior art at least exists following problem:
User has on earphone when listening virtual surround sound, and when user's end rotation, the virtual surround sound in earphone can follow the end rotation of user, and the sensation causing people to listen to the music at the scene is like this different, and the virtual surround sound also namely generated is true not.
Summary of the invention
In order to solve the problem of prior art, the invention provides a kind of method and apparatus of generating virtual surround sound.Technical scheme is as follows:
A method for generating virtual surround sound, described method comprises:
Obtain the first audio signal of audio file and the anglec of rotation of user's end rotation;
According to the described anglec of rotation, generate spin matrix;
According to described first audio signal, obtain the adjustment parameter of described first audio signal;
According to described adjustment parameter, adjustment is carried out to described first audio signal and obtains the second audio signal;
According to described second audio signal and described spin matrix, generating virtual surround sound.
Further, described according to described second audio signal and described spin matrix, generating virtual surround sound, comprising:
According to described spin matrix, described second audio signal is carried out rotation and obtains the 3rd audio signal;
According to described 3rd audio signal, described 3rd audio signal is converted to virtual speaker array signal;
Described virtual speaker array signal is carried out filtering by head related transfer function filter, obtains virtual surround sound.
Further, described according to described first audio signal, obtain the adjustment parameter of described first audio signal, comprising:
According to described first audio signal, obtain the recording scene of described first audio signal, according to described recording scene, from the corresponding relation recording scene and adjustment parameter, obtain the adjustment parameter of described first audio signal; Or,
According to described first audio signal, from the corresponding relation of audio signal and adjustment parameter, obtain the adjustment parameter of described first audio signal.
Further, described according to described first audio signal, obtain the recording scene of described first audio signal, comprising:
Analyze described first audio signal, obtain the content of described first audio signal, according to described content, determine the recording scene of described first audio signal; Or,
According to described first audio signal, from audio signal and record scene corresponding relation obtain the recording scene of described first audio signal.
Further, described adjustment parameter comprises the topological structure of mixed exponent number and virtual speaker;
Described according to described adjustment parameter, adjustment is carried out to described first audio signal and obtains the second audio signal, comprising:
According to described mixed exponent number, described first audio signal is carried out upper mixed process and obtains the 4th audio signal;
According to the topological structure of described virtual speaker, described 4th audio signal is carried out obtaining the second audio signal around process.
A device for generating virtual surround sound, described device comprises:
First acquisition module, for the anglec of rotation of the first audio signal and user's end rotation that obtain audio file;
First generation module, for according to the described anglec of rotation, generates spin matrix;
Second acquisition module, for according to described first audio signal, obtains the adjustment parameter of described first audio signal;
Adjusting module, for according to described adjustment parameter, carries out adjustment to described first audio signal and obtains the second audio signal;
Second generation module, for according to described second audio signal and described spin matrix, generating virtual surround sound.
Further, described second generation module, comprising:
Rotary unit, for according to described spin matrix, carries out rotation by described second audio signal and obtains the 3rd audio signal;
Converting unit, for according to described 3rd audio signal, is converted to virtual speaker array signal by described 3rd audio signal;
Filter unit, for described virtual speaker array signal is carried out filtering by head related transfer function filter, obtains virtual surround sound.
Further, described second acquisition module, comprising:
First acquiring unit, for according to described first audio signal, obtains the recording scene of described first audio signal;
Second acquisition unit, for according to described recording scene, obtains the adjustment parameter of described first audio signal from the corresponding relation recording scene and adjustment parameter;
Or described second acquisition module, comprising:
3rd acquiring unit, for according to described first audio signal, obtains the adjustment parameter of described first audio signal from the corresponding relation of audio signal and adjustment parameter.
Further, described first acquiring unit, comprising:
Analyzing subelement, for analyzing described first audio signal, obtaining the content of described first audio signal;
Determine subelement, for according to described content, determine the recording scene of described first audio signal;
Or described first acquiring unit, comprising:
Obtain subelement, for according to described first audio signal, from audio signal and record scene corresponding relation obtain the recording scene of described first audio signal.
Further, described adjustment parameter comprises the topological structure of mixed exponent number and virtual speaker;
Described adjusting module, comprising:
First processing unit, for according to described mixed exponent number, carries out upper mixed process by described first audio signal and obtains the 4th audio signal;
Second processing unit, for the topological structure according to described virtual speaker, is undertaken obtaining the second audio signal around process by described 4th audio signal.
In embodiments of the present invention, the anglec of rotation of user's end rotation is obtained by head-tracker, according to this anglec of rotation, generate spin matrix, according to the first audio signal, obtain the adjustment parameter of the first audio signal, according to this adjustment parameter, adjustment is carried out to the first audio signal and obtains the second audio signal, according to the second audio signal and this spin matrix, generating virtual surround sound, thus the authenticity that can improve virtual surround sound.
Accompanying drawing explanation
Fig. 1 is the method flow diagram of a kind of generating virtual surround sound that the embodiment of the present invention 1 provides;
Fig. 2-1 is the method flow diagram of a kind of generating virtual surround sound that the embodiment of the present invention 2 provides;
Fig. 2-2 is schematic diagrames of the topological structure of a kind of virtual speaker that the embodiment of the present invention 2 provides;
Fig. 2-3 is schematic diagrames of the topological structure of the another kind of virtual speaker that the embodiment of the present invention 2 provides;
Fig. 3 is the apparatus structure schematic diagram of a kind of generating virtual surround sound that the embodiment of the present invention 3 provides.
Embodiment
For making the object, technical solutions and advantages of the present invention clearly, below in conjunction with accompanying drawing, embodiment of the present invention is described further in detail.
Embodiment 1
Embodiments provide a kind of method of generating virtual surround sound, the executive agent of the method can be terminal, and see Fig. 1, wherein, the method comprises:
Step 101: obtain the first audio signal of audio file and the anglec of rotation of user's end rotation;
Step 102: according to this anglec of rotation, generates spin matrix;
Step 103: according to the first audio signal, obtains the adjustment parameter of the first audio signal;
Step 104: according to this adjustment parameter, carries out adjustment to the first audio signal and obtains the second audio signal;
Step 105: according to the second audio signal and this spin matrix, generating virtual surround sound.
Further, according to the second audio signal and this spin matrix, generating virtual surround sound, comprising:
According to this spin matrix, the second audio signal is carried out rotation and obtain the 3rd audio signal;
According to the 3rd audio signal, the 3rd audio signal is converted to virtual speaker array signal;
Virtual speaker array signal is carried out filtering by head related transfer function filter, obtains virtual surround sound.
Further, according to the first audio signal, obtain the adjustment parameter of the first audio signal, comprising:
According to the first audio signal, obtain the recording scene of the first audio signal, according to recording scene, from the corresponding relation recording scene and adjustment parameter, obtain the adjustment parameter of the first audio signal; Or,
According to the first audio signal, from the corresponding relation of audio signal and adjustment parameter, obtain the adjustment parameter of the first audio signal.
Further, according to the first audio signal, obtain the recording scene of the first audio signal, comprising:
Analyze the first audio signal, obtain the content of the first audio signal, according to content, determine the recording scene of the first audio signal; Or,
According to the first audio signal, from audio signal and record scene corresponding relation obtain the recording scene of the first audio signal.
Further, the topological structure that parameter comprises mixed exponent number and virtual speaker is adjusted;
According to adjustment parameter, adjustment is carried out to the first audio signal and obtains the second audio signal, comprising:
According to upper mixed exponent number, the first audio signal is carried out upper mixed process and obtain the 4th audio signal;
According to the topological structure of virtual speaker, the 4th audio signal is carried out obtaining the second audio signal around process.
In embodiments of the present invention, the anglec of rotation of user's end rotation is obtained by head-tracker, according to this anglec of rotation, generate spin matrix, according to the first audio signal, obtain the adjustment parameter of the first audio signal, according to this adjustment parameter, adjustment is carried out to the first audio signal and obtains the second audio signal, according to the second audio signal and this spin matrix, generating virtual surround sound, thus the authenticity that can improve virtual surround sound.
Embodiment 2
Embodiments provide a kind of method of generating virtual surround sound, the executive agent of the method can be terminal, and see Fig. 2-1, wherein, the method comprises:
Step 201: obtain the first audio signal of audio file and the anglec of rotation of user's end rotation;
When user plays the audio file of high in the clouds or server end storage by earphone, terminal obtains the first audio signal of audio file and the anglec of rotation of user's end rotation.
Wherein, the step of the anglec of rotation of terminal acquisition user end rotation can be:
Earphone arranges head-tracker or has the equipment of head-tracker at user's head-mount, as virtual reality display device, detect user's head in real time by head-tracker whether to rotate, if user's head rotates, then obtain the anglec of rotation of user's end rotation, send this anglec of rotation to terminal; Terminal receives the anglec of rotation that head-tracker sends.
Wherein, the first audio signal can be single order B format signal, and B format signal can be triple-track signal, also can be quadraphony signal; If B format signal is triple-track signal, then B format signal comprises W, X and Y; If B format signal is quadraphony signal, then B format signal comprises W, X, Y and Z.Terminal can be mobile phone, panel computer or PC (personalcomputer, PC) terminal etc.
W sound channel signal represents omnirange sound wave, and X sound channel signal, Y sound channel signal and Z sound channel signal represent the sound wave along three orthogonal orientations; X sound channel signal represents to be listened from rear to front horizontal arrangement, and Y sound channel signal represents listens horizontal arrangement from right to left, and Z sound channel signal represents to listen and is upwards arranged vertically.
Step 202: according to this anglec of rotation, generates spin matrix;
Spin matrix for rotating virtual surround sound, thus makes when user's end rotation, and virtual surround sound does not rotate according to the rotation of user's head, realizes the effect of listening to the music in actual life.
Such as, the direction of virtual surround sound in front, when user's head is to anticlockwise 30 degree, then by this virtual surround sound from the position after user's end rotation to right rotation 30 degree, thus the direction realizing virtual surround sound is still on original direction.
If B format signal is triple-track signal, then spin matrix is 1 0 0 0 c o s ( θ ) - s i n ( θ ) 0 s i n ( θ ) cos ( θ ) ; If B format signal is quadraphony signal, then spin matrix is 1 0 0 0 0 c o s ( θ ) - s i n ( θ ) 0 0 s i n ( θ ) c o s ( θ ) 0 0 0 0 1 , θ is this anglec of rotation.
Step 203: according to the first audio signal, obtains the adjustment parameter of the first audio signal;
Adjustment parameter comprises the topological structure of mixed exponent number and virtual speaker, and the topological structure of virtual speaker comprises the number of virtual speaker and the position etc. of each virtual speaker.
This step can be realized by following first kind of way or the second way, and for the first implementation, this step can pass through following steps (1) and (2) realize, and comprising:
(1) the recording scene of the first audio signal: according to the first audio signal, is obtained;
Record scene and comprise concert scene, business meetings scene or natural environment scene etc.
This step can pass through following steps (1-1) and (1-2) realizes, and comprising:
(1-1): analyze the first audio signal, the content of the first audio signal is obtained;
The content of the first audio signal at least comprises directional signal proportion, can also comprise the direction etc. of attribute information and/or main sound source; Attribute information comprises object, Instrument categories and the sound class etc. that the first audio signal comprises.
Wherein, analyze the first audio signal, the step obtaining the directional signal proportion that the first audio signal comprises can be:
By Direct-ambiencesignaldecomposition (analysis of sensing-ambient signal) Algorithm Analysis first audio signal, obtain the proportion of the directional signal that the first audio signal comprises, also can obtain the proportion of the non-directional signal that the first audio signal comprises.
Such as, only one's voice in speech is comprised in first audio signal, then the first audio signal sounds and just has very strong directivity, then by Direct-ambiencesignaldecomposition Algorithm Analysis first audio signal, the proportion obtaining the directional signal in the first audio signal is larger; For another example, noise or a large amount of reverberation is comprised in first audio signal, then the first audio signal sounds that directivity is just not strong, then by Direct-ambiencesignaldecomposition Algorithm Analysis first audio signal, the proportion obtaining the directional signal in the first audio signal is less.
Wherein, analyze the first audio signal, the step obtaining the directional signal proportion that the first audio signal comprises can also be realized by following steps (A) to (C), comprising:
(A): covariance matrix is set up to the first audio signal;
Covariance matrix cov (ω i, n)=α cov (ω i, n-1) and+(1-α) * S (ω i, n) * S hi, n).
If the first audio signal comprises W, X and Y, then S (ω i, n)=[W (ω i, n) X (ω i, n) Y (ω i, n)] tif the first audio signal comprises W, X, Y and Z, then S (ω i, n)=[W (ω i, n) X (ω i, n) Y (ω i, n) Z (ω i, n)] t.
Wherein, ω ibe the frequency of the first audio signal, n is the index to frame number on time shaft, [] hrepresentation vector conjugate transpose; α is smoothing factor, and α can set in advance or according to the characteristics of signals dynamic conditioning of the first audio signal, and such as, α can 0.92.ω iinclude all interested frequencies and ω ican carry out as required arranging and changing; Such as, ω ifor 100-16000HZ.
Further, in embodiments of the present invention, each ω can be set iweight, then when covariance matrix being set up to the first audio signal, can according to each ω ithe covariance matrix of weight calculation first audio signal, then covariance matrix cov ( n ) = Σ cov ( ω i , n ) * ρ , ρ is ω iweight.
(B): signature analysis is carried out to covariance matrix, obtains characteristic value;
By Matlab function, signature analysis is carried out to covariance matrix, obtain [V, Λ]=eigs (cov (n)).
Wherein, V is the matrix of 3*3 or the matrix of 4*4, and the often row of this matrix represent the characteristic vector of cov (n); Λ contains the individual features value with descending.
(C) proportion of directional signal: according to characteristic value, is calculated.
From characteristic value, select eigenvalue of maximum as the First Eigenvalue, from the characteristic value except eigenvalue of maximum, select eigenvalue of maximum as Second Eigenvalue, according to the First Eigenvalue and Second Eigenvalue, calculated the proportion of directional signal by following formula (1).
D R R = 1 - λ 2 λ 1 Formula (1);
Wherein, DRR is the proportion of directional signal; λ 1for the First Eigenvalue, and λ 1corresponding to direct sound wave energy; λ 2for Second Eigenvalue, and λ 2corresponding reflection, echo, ambient sound etc.The value of DRR is between [0,1], and the value of DRR is less, and to represent direct sound wave proportion lower, and sound field directivity is more weak, and also namely directional signal proportion is lower; The value of DRR is larger, and to represent direct sound wave proportion higher, and sound field directivity is stronger, and also namely directional signal proportion is higher.
Further, by directionofarrival (sound source arrival direction) Algorithm Analysis first audio signal, the direction of the first audio signal main sound source is obtained.
Further, by Instrumentclassification (musical instrument classification) Algorithm Analysis first audio signal, the Instrument categories of the first audio signal is obtained; By Speechmusicclassification (voice music classification) Algorithm Analysis first audio signal, obtain the sound class of the first audio signal.
Further, extract by Objectextraction (object extraction) algorithm the object that the first audio signal comprises.
Such as, the first audio signal is one section of voice, then the object extracting the first audio signal by Objectextraction algorithm is voice; For another example, the first audio signal is one section of thunder, then the object extracting the first audio signal by Objectextraction algorithm is thunder etc.; For another example, the first audio signal is one section of music, then the object extracting the first audio signal by Objectextraction algorithm is music etc.
Further, server can ex ante analysis first audio signal, and obtain the content of the first audio signal, be stored in the corresponding relation of audio signal and content by the content of the first audio signal and the first audio signal, then this step can be:
According to the first audio signal, in the audio signal stored from server and the corresponding relation of content, obtain the content of the first audio signal.
Wherein, the audio signal stored in server and the corresponding relation of content can store in the server in the form of metadata, and the content of the first audio signal can directly embed in the first audio signal by server, also the content of the first audio signal can be deposited separately, set up content file folder, the content of the first audio signal is stored in this content file folder, and sets up the corresponding relation of the first audio signal and this content file folder.
Terminal when obtaining the first audio frequency of audio file, can obtain the content of the first audio signal, also can obtain the content of the first audio file in this step.Further, in the corresponding relation of the audio signal that terminal stores from server and content, obtain the content of the first audio signal, the computational burden of terminal can be alleviated, and improve the efficiency of terminal generating virtual surround sound.
(1-2) the recording scene of the first audio signal: according to the content of the first audio signal, is determined.
Store the corresponding relation of content and recording scene in server, accordingly, this step can be:
Terminal, according to the content of the first audio signal, obtains the recording scene of the first audio signal in the corresponding relation of the content stored from server and recording scene.
In this step, terminal also can obtain content and record the corresponding relation of scene from server, stores the corresponding relation of content and recording scene; Accordingly, this step can be:
According to the content of the first audio signal, in the corresponding relation of the content stored from terminal and recording scene, obtain the recording scene of the first audio signal.
Wherein, the corresponding relation of content and recording scene can be stored in terminal or server in the form of metadata, and the recording scene of the first audio signal can directly embed in the content of the first audio signal by terminal or server, also the recording scene of the first audio signal can be deposited separately, set up and record document scene folder, the recording scene of the first audio signal is stored in this recording document scene folder, and sets up the content of the first audio signal and the corresponding relation of this recording scene.
Further, the recording scene of the first audio signal and the first audio signal is stored in audio signal and records in the corresponding relation of scene by terminal; Thus terminal is when again playing the first audio signal again, the recording scene of the first audio signal need not be determined by above method, directly from audio signal and record scene corresponding relation obtain the recording scene of the first audio signal.
Such as, when the proportion of the directional signal of terminal storage is greater than 0.5, determine that the recording scene of the first audio signal is business meetings; When the proportion of directional signal is less than 0.5, determine that the recording scene of the first audio signal is concert.
(2) from the corresponding relation recording scene and adjustment parameter: according to recording scene, obtain the adjustment parameter of the first audio signal.
Store the corresponding relation recording scene and adjustment parameter in server, then this step can be:
According to recording scene, in the corresponding relation of the recording scene stored from server and adjustment parameter, obtain the adjustment parameter of the first audio signal.
In this step, terminal also can obtain the corresponding relation recording scene and adjustment parameter from server, stores the corresponding relation recording scene and adjustment parameter; Accordingly, this step can be:
According to recording scene, in the corresponding relation of the recording scene stored from terminal and adjustment parameter, obtain the adjustment parameter of the first audio signal.
Further, the adjustment parameter of the first audio signal and the first audio signal is stored in the corresponding relation of audio signal and adjustment parameter by terminal, thus terminal is when again playing the first audio signal, the recording scene of the first audio signal need not be determined by above method, adjustment parameter is being obtained according to recording scene, but from the corresponding relation of audio signal and adjustment parameter, directly obtain the adjustment parameter of the first audio signal, thus shorten the acquisition time of the adjustment parameter of acquisition first audio signal, improve acquisition efficiency.
Further, for the second implementation, this step can be:
The corresponding relation of stored audio signal and adjustment parameter in server, terminal, according to the first audio signal, obtains the adjustment parameter of the first audio signal in the corresponding relation of the audio signal stored from server and adjustment parameter.
Further, the corresponding relation of audio signal and adjustment parameter can store in the server in the form of metadata, and the adjustment parameter of the first audio signal can directly embed in the first audio signal by server, also the adjustment parameter of the first audio signal can be deposited separately, set up adjustment Parameter File folder, the adjustment parameter of the first audio signal is stored in adjustment Parameter File, and sets up the corresponding relation of the first audio signal and this adjustment Parameter File folder.
Such as, in the first audio signal, the proportion of directional signal is greater than 0.5, then upper mixed exponent number is 3, and the topological structure of virtual speaker comprises 6 virtual speakers, see Fig. 2-2; For another example, in the first audio signal, the proportion of directional signal is less than 0.5, then upper mixed exponent number is 1, and the topological structure of virtual speaker comprises 4 virtual speakers, see Fig. 2-3.
Such as, the directivity sound source of the first audio signal is distributed in a direction, and such as sound field content is concert, and sound field concentrates on Ye Ji dead ahead, stage direction, then the distance of the left front in the topological structure of virtual speaker and right speakers is become large.
Further, can also arrange the adjustment parameter that different terminal types is corresponding different in embodiments of the present invention, then this step can be:
Obtain the terminal type of terminal, according to terminal type and the first audio signal, obtain the adjustment parameter of the first audio signal.
The corresponding relation of prior storage terminal type, audio signal and adjustment parameter in server; Accordingly, according to terminal type and the first audio signal, the step obtaining the adjustment parameter of the first audio signal can be:
According to terminal type and the first audio signal, in the corresponding relation of the terminal type stored from server, audio signal and adjustment parameter, obtain the adjustment parameter of the first audio signal.
The quality of topological structure on virtual surround sound of virtual speaker has great impact, and different according to the difference of the content of the first audio signal on the impact of virtual surround sound; Such as, the first audio signal major part is all from front, then the topological structure of virtual speaker can select rectangular configuration, instead of square structure.Therefore, in the embodiment of the present invention, the content-adaptive adjustment adjustment parameter according to the first audio signal can be realized, thus the broadcasting tonequality of virtual surround sound can be ensured.Further, different terminals has different operational capabilities and power consumption, and according to terminal type and the first audio signal, the adjustment parameter obtaining the first audio signal can save the power consumption of terminal.
Step 204: according to this adjustment parameter, carries out adjustment to the first audio signal and obtains the second audio signal;
Adjustment parameter comprises the topological structure of mixed exponent number and virtual speaker, then this step can pass through following steps (1) and (2) realization, comprising:
(1): according to upper mixed exponent number, the first audio signal is carried out upper mixed process and obtain the 4th audio signal;
Wherein, this step is prior art, no longer describes in detail at this.
(2): according to the topological structure of virtual speaker, the 4th audio signal is carried out obtaining the second audio signal around process.
By the 4th audio signal successively through the virtual speaker that the topological structure of virtual speaker comprises, thus realize the 4th audio signal to carry out around process, obtain the second audio signal.
Such as, the first audio signal is W 1 X 1 Y 1 , The second audio signal then after adjustment is W 2 X 2 Y 2 ; For another example, the first audio signal is W 1 X 1 Y 1 Z 1 , The second audio signal then after adjustment is W 2 X 2 Y 2 Z 2 .
Step 205: according to spin matrix, carries out rotation and obtains the 3rd audio signal by the second audio signal;
Spin matrix and the second audio signal are carried out multiplying, obtains the 3rd audio signal.
Such as, the second audio signal is W 2 X 2 Y 2 , Spin matrix is 1 0 0 0 c o s ( θ ) - s i n ( θ ) 0 s i n ( θ ) cos ( θ ) , Then the 3rd audio signal is W 3 X 3 Y 3 = 1 0 0 0 cos ( θ ) - s i n ( θ ) 0 sin ( θ ) cos ( θ ) W 2 X 2 Y 2 ; For another example, the second audio signal is W 2 X 2 Y 2 Z 2 , Spin matrix is 1 0 0 0 0 c o s ( θ ) - s i n ( θ ) 0 0 s i n ( θ ) cos ( θ ) 0 0 0 0 1 , Then the 3rd audio signal is W 3 X 3 Y 3 Z 3 = 1 0 0 0 0 c o s ( θ ) - s i n ( θ ) 0 0 s i n ( θ ) cos ( θ ) 0 0 0 0 1 W 2 X 2 Y 2 Z 2 .
Step 206: according to the 3rd audio signal, is converted to virtual speaker array signal by the 3rd audio signal;
Obtain virtual speaker matrix, virtual speaker matrix and the 3rd audio signal are carried out matrix multiplication, obtains virtual speaker array signal.
Such as, virtual speaker array is G w 1 G x 1 G y 1 G w 2 G x 2 G y 2 . . . . . . G w N G x N G y N , Then virtual speaker array signal is L 1 L 2 .. L N = G w 1 G x 1 G y 1 G w 2 G x 2 G y 2 . . . . . . G w N G x N G y N W 3 X 3 Y 3 ; For another example, virtual speaker array is G w 1 G x 1 G y 1 G z 1 G w 2 G x 2 G y 2 G z 2 . . . . . . . . G w N G x N G y N G z N , Then virtual speaker array signal is L 1 L 2 .. L N = G w 1 G x 1 G y 1 G z 1 G w 2 G x 2 G y 2 G z 2 . . . . . . . . G w N G x N G y N G z N W 3 X 3 Y 3 Z 3 .
Wherein, N is the number of the virtual speaker that virtual speaker topological structure comprises.
Step 207: virtual speaker array signal is carried out filtering by head related transfer function filter, obtains virtual surround sound.
It is stereo that head related transfer function filter is used for that virtual speaker array signal is converted to two roads, and be also binaural signal, then this step can be:
Obtain the two stereo matrixes in road that head correlation function transforming function transformation function filter is corresponding, Jiang Gai bis-road stereoscopic matrix and virtual speaker array signal carry out matrix multiplication, obtain virtual surround sound.
Such as, the two stereo matrixes in road are H 1 L H 2 L .. H N L H 1 R H 2 R .. H N R Then virtual surround sound is L R = H 1 L H 2 L .. H 1 R H 2 R .. H N L H N R L 1 L 2 .. L N = F W L F X L F Y L F W R F X R F Y R W 1 X 1 Y 1 ; Or virtual surround sound is L R = H 1 L H 2 L .. H N L H 1 R H 2 R .. H N R L 1 L 2 .. L N = F W L F W R . F X L F Y L F Z L F X R F Y R F Z R W 1 X 1 Y 1 Z 1 .
In embodiments of the present invention, the anglec of rotation of user's end rotation is obtained by head-tracker, according to this anglec of rotation, generate spin matrix, according to the first audio signal, obtain the adjustment parameter of the first audio signal, according to this adjustment parameter, adjustment is carried out to the first audio signal and obtains the second audio signal, according to the second audio signal and this spin matrix, generating virtual surround sound, thus the authenticity that can improve virtual surround sound.
Embodiment 3
Embodiments provide a kind of device of generating virtual surround sound, this device can be terminal, and see Fig. 3, device comprises:
First acquisition module 301, for the anglec of rotation of the first audio signal and user's end rotation that obtain audio file;
First generation module 302, for according to the anglec of rotation, generates spin matrix;
Second acquisition module 303, for according to the first audio signal, obtains the adjustment parameter of the first audio signal;
Adjusting module 304, for according to adjustment parameter, carries out adjustment to the first audio signal and obtains the second audio signal;
Second generation module 305, for according to the second audio signal and spin matrix, generating virtual surround sound.
Further, the second generation module 305, comprising:
Rotary unit, for according to spin matrix, carries out rotation and obtains the 3rd audio signal by the second audio signal;
Converting unit, for according to the 3rd audio signal, is converted to virtual speaker array signal by the 3rd audio signal;
Filter unit, for virtual speaker array signal is carried out filtering by head related transfer function filter, obtains virtual surround sound.
Further, the second acquisition module 303, comprising:
First acquiring unit, for according to the first audio signal, obtains the recording scene of the first audio signal;
Second acquisition unit, for according to recording scene, obtains the adjustment parameter of the first audio signal from the corresponding relation recording scene and adjustment parameter;
Or the second acquisition module 303, comprising:
3rd acquiring unit, for according to the first audio signal, obtains the adjustment parameter of the first audio signal from the corresponding relation of audio signal and adjustment parameter.
Further, the first acquiring unit, comprising:
Analyzing subelement, for analyzing the first audio signal, obtaining the content of the first audio signal;
Determine subelement, for according to content, determine the recording scene of the first audio signal;
Or the first acquiring unit, comprising:
Obtain subelement, for according to the first audio signal, from audio signal and record scene corresponding relation obtain the recording scene of the first audio signal.
Further, the topological structure that parameter comprises mixed exponent number and virtual speaker is adjusted;
Adjusting module 304, comprising:
First processing unit, for according to upper mixed exponent number, carries out upper mixed process and obtains the 4th audio signal by the first audio signal;
Second processing unit, for the topological structure according to virtual speaker, is undertaken obtaining the second audio signal around process by the 4th audio signal.
In embodiments of the present invention, the anglec of rotation of user's end rotation is obtained by head-tracker, according to this anglec of rotation, generate spin matrix, according to the first audio signal, obtain the adjustment parameter of the first audio signal, according to this adjustment parameter, adjustment is carried out to the first audio signal and obtains the second audio signal, according to the second audio signal and this spin matrix, generating virtual surround sound, thus the authenticity that can improve virtual surround sound.
It should be noted that: the device of the generating virtual surround sound that above-described embodiment provides is when generating virtual surround sound, only be illustrated with the division of above-mentioned each functional module, in practical application, can distribute as required and by above-mentioned functions and be completed by different functional modules, internal structure by device is divided into different functional modules, to complete all or part of function described above.In addition, the device of the generating virtual surround sound that above-described embodiment provides and the embodiment of the method for generating virtual surround sound belong to same design, and its specific implementation process refers to embodiment of the method, repeats no more here.
One of ordinary skill in the art will appreciate that all or part of step realizing above-described embodiment can have been come by hardware, the hardware that also can carry out instruction relevant by program completes, described program can be stored in a kind of computer-readable recording medium, the above-mentioned storage medium mentioned can be read-only memory, disk or CD etc.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, within the spirit and principles in the present invention all, any amendment done, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (10)

1. a method for generating virtual surround sound, is characterized in that, described method comprises:
Obtain the first audio signal of audio file and the anglec of rotation of user's end rotation;
According to the described anglec of rotation, generate spin matrix;
According to described first audio signal, obtain the adjustment parameter of described first audio signal;
According to described adjustment parameter, adjustment is carried out to described first audio signal and obtains the second audio signal;
According to described second audio signal and described spin matrix, generating virtual surround sound.
2. the method for claim 1, is characterized in that, described according to described second audio signal and described spin matrix, generating virtual surround sound, comprising:
According to described spin matrix, described second audio signal is carried out rotation and obtains the 3rd audio signal;
According to described 3rd audio signal, described 3rd audio signal is converted to virtual speaker array signal;
Described virtual speaker array signal is carried out filtering by head related transfer function filter, obtains virtual surround sound.
3. the method for claim 1, is characterized in that, described according to described first audio signal, obtains the adjustment parameter of described first audio signal, comprising:
According to described first audio signal, obtain the recording scene of described first audio signal, according to described recording scene, from the corresponding relation recording scene and adjustment parameter, obtain the adjustment parameter of described first audio signal; Or,
According to described first audio signal, from the corresponding relation of audio signal and adjustment parameter, obtain the adjustment parameter of described first audio signal.
4. method as claimed in claim 3, is characterized in that, described according to described first audio signal, obtains the recording scene of described first audio signal, comprising:
Analyze described first audio signal, obtain the content of described first audio signal, according to described content, determine the recording scene of described first audio signal; Or,
According to described first audio signal, from audio signal and record scene corresponding relation obtain the recording scene of described first audio signal.
5. the method for claim 1, is characterized in that, described adjustment parameter comprises the topological structure of mixed exponent number and virtual speaker;
Described according to described adjustment parameter, adjustment is carried out to described first audio signal and obtains the second audio signal, comprising:
According to described mixed exponent number, described first audio signal is carried out upper mixed process and obtains the 4th audio signal;
According to the topological structure of described virtual speaker, described 4th audio signal is carried out obtaining the second audio signal around process.
6. a device for generating virtual surround sound, is characterized in that, described device comprises:
First acquisition module, for the anglec of rotation of the first audio signal and user's end rotation that obtain audio file;
First generation module, for according to the described anglec of rotation, generates spin matrix;
Second acquisition module, for according to described first audio signal, obtains the adjustment parameter of described first audio signal;
Adjusting module, for according to described adjustment parameter, carries out adjustment to described first audio signal and obtains the second audio signal;
Second generation module, for according to described second audio signal and described spin matrix, generating virtual surround sound.
7. device as claimed in claim 6, it is characterized in that, described second generation module, comprising:
Rotary unit, for according to described spin matrix, carries out rotation by described second audio signal and obtains the 3rd audio signal;
Converting unit, for according to described 3rd audio signal, is converted to virtual speaker array signal by described 3rd audio signal;
Filter unit, for described virtual speaker array signal is carried out filtering by head related transfer function filter, obtains virtual surround sound.
8. device as claimed in claim 6, it is characterized in that, described second acquisition module, comprising:
First acquiring unit, for according to described first audio signal, obtains the recording scene of described first audio signal;
Second acquisition unit, for according to described recording scene, obtains the adjustment parameter of described first audio signal from the corresponding relation recording scene and adjustment parameter;
Or described second acquisition module, comprising:
3rd acquiring unit, for according to described first audio signal, obtains the adjustment parameter of described first audio signal from the corresponding relation of audio signal and adjustment parameter.
9. device as claimed in claim 8, it is characterized in that, described first acquiring unit, comprising:
Analyzing subelement, for analyzing described first audio signal, obtaining the content of described first audio signal;
Determine subelement, for according to described content, determine the recording scene of described first audio signal;
Or described first acquiring unit, comprising:
Obtain subelement, for according to described first audio signal, from audio signal and record scene corresponding relation obtain the recording scene of described first audio signal.
10. device as claimed in claim 6, it is characterized in that, described adjustment parameter comprises the topological structure of mixed exponent number and virtual speaker;
Described adjusting module, comprising:
First processing unit, for according to described mixed exponent number, carries out upper mixed process by described first audio signal and obtains the 4th audio signal;
Second processing unit, for the topological structure according to described virtual speaker, is undertaken obtaining the second audio signal around process by described 4th audio signal.
CN201510519948.2A 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound Active CN105120421B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510519948.2A CN105120421B (en) 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510519948.2A CN105120421B (en) 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound

Publications (2)

Publication Number Publication Date
CN105120421A true CN105120421A (en) 2015-12-02
CN105120421B CN105120421B (en) 2017-06-30

Family

ID=54668260

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510519948.2A Active CN105120421B (en) 2015-08-21 2015-08-21 A kind of method and apparatus for generating virtual surround sound

Country Status (1)

Country Link
CN (1) CN105120421B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105682000A (en) * 2016-01-11 2016-06-15 北京时代拓灵科技有限公司 Audio processing method and system
CN105872940A (en) * 2016-06-08 2016-08-17 北京时代拓灵科技有限公司 Virtual reality sound field generating method and system
CN105959905A (en) * 2016-04-27 2016-09-21 北京时代拓灵科技有限公司 Mixing mode space sound generating system and method
CN106210990A (en) * 2016-07-13 2016-12-07 北京时代拓灵科技有限公司 A kind of panorama sound audio processing method
CN107566936A (en) * 2017-07-12 2018-01-09 捷开通讯(深圳)有限公司 Earphone and its method, the storage device of adjust automatically music data
CN108520756A (en) * 2018-03-20 2018-09-11 北京时代拓灵科技有限公司 A kind of method and device of speaker's speech Separation
CN108921000A (en) * 2018-04-16 2018-11-30 深圳市深网视界科技有限公司 Head angle mark, prediction model training, prediction technique, equipment and medium
CN108966113A (en) * 2018-07-13 2018-12-07 武汉轻工大学 Sound field rebuilding method, audio frequency apparatus, storage medium and device based on angle
CN110740415A (en) * 2018-07-20 2020-01-31 宏碁股份有限公司 Sound effect output device, arithmetic device and sound effect control method thereof
WO2020141261A1 (en) * 2019-01-04 2020-07-09 Nokia Technologies Oy An audio capturing arrangement
US11109175B2 (en) 2018-07-16 2021-08-31 Acer Incorporated Sound outputting device, processing device and sound controlling method thereof
WO2023240467A1 (en) * 2022-06-14 2023-12-21 北京小米移动软件有限公司 Audio playback method and apparatus, and storage medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1158047A (en) * 1995-09-28 1997-08-27 索尼公司 image/audio reproducing system
CN101133679A (en) * 2004-09-01 2008-02-27 史密斯研究公司 Personalized headphone virtualization
CN102318374A (en) * 2009-02-13 2012-01-11 皇家飞利浦电子股份有限公司 Head tracking
CN103262159A (en) * 2010-10-05 2013-08-21 华为技术有限公司 Method and apparatus for encoding/decoding multichannel audio signal
CN103559876A (en) * 2013-11-07 2014-02-05 安徽科大讯飞信息科技股份有限公司 Sound effect processing method and sound effect processing system
CN104244164A (en) * 2013-06-18 2014-12-24 杜比实验室特许公司 Method, device and computer program product for generating surround sound field
CN104284291A (en) * 2014-08-07 2015-01-14 华南理工大学 Headphone dynamic virtual replaying method based on 5.1 channel surround sound and implementation device thereof
CN104464739A (en) * 2013-09-18 2015-03-25 华为技术有限公司 Audio signal processing method and device and difference beam forming method and device

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1158047A (en) * 1995-09-28 1997-08-27 索尼公司 image/audio reproducing system
CN101133679A (en) * 2004-09-01 2008-02-27 史密斯研究公司 Personalized headphone virtualization
CN102318374A (en) * 2009-02-13 2012-01-11 皇家飞利浦电子股份有限公司 Head tracking
CN103262159A (en) * 2010-10-05 2013-08-21 华为技术有限公司 Method and apparatus for encoding/decoding multichannel audio signal
CN104244164A (en) * 2013-06-18 2014-12-24 杜比实验室特许公司 Method, device and computer program product for generating surround sound field
CN104464739A (en) * 2013-09-18 2015-03-25 华为技术有限公司 Audio signal processing method and device and difference beam forming method and device
CN103559876A (en) * 2013-11-07 2014-02-05 安徽科大讯飞信息科技股份有限公司 Sound effect processing method and sound effect processing system
CN104284291A (en) * 2014-08-07 2015-01-14 华南理工大学 Headphone dynamic virtual replaying method based on 5.1 channel surround sound and implementation device thereof

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105682000A (en) * 2016-01-11 2016-06-15 北京时代拓灵科技有限公司 Audio processing method and system
CN105959905A (en) * 2016-04-27 2016-09-21 北京时代拓灵科技有限公司 Mixing mode space sound generating system and method
CN105872940A (en) * 2016-06-08 2016-08-17 北京时代拓灵科技有限公司 Virtual reality sound field generating method and system
CN105872940B (en) * 2016-06-08 2017-11-17 北京时代拓灵科技有限公司 A kind of virtual reality sound field generation method and system
CN106210990A (en) * 2016-07-13 2016-12-07 北京时代拓灵科技有限公司 A kind of panorama sound audio processing method
CN107566936B (en) * 2017-07-12 2020-07-10 捷开通讯(深圳)有限公司 Earphone capable of automatically adjusting audio data, method thereof and storage medium
CN107566936A (en) * 2017-07-12 2018-01-09 捷开通讯(深圳)有限公司 Earphone and its method, the storage device of adjust automatically music data
CN108520756B (en) * 2018-03-20 2020-09-01 北京时代拓灵科技有限公司 Method and device for separating speaker voice
CN108520756A (en) * 2018-03-20 2018-09-11 北京时代拓灵科技有限公司 A kind of method and device of speaker's speech Separation
CN108921000A (en) * 2018-04-16 2018-11-30 深圳市深网视界科技有限公司 Head angle mark, prediction model training, prediction technique, equipment and medium
CN108921000B (en) * 2018-04-16 2024-02-06 深圳市深网视界科技有限公司 Head angle labeling, prediction model training, prediction method, device and medium
CN108966113A (en) * 2018-07-13 2018-12-07 武汉轻工大学 Sound field rebuilding method, audio frequency apparatus, storage medium and device based on angle
US11109175B2 (en) 2018-07-16 2021-08-31 Acer Incorporated Sound outputting device, processing device and sound controlling method thereof
CN110740415A (en) * 2018-07-20 2020-01-31 宏碁股份有限公司 Sound effect output device, arithmetic device and sound effect control method thereof
WO2020141261A1 (en) * 2019-01-04 2020-07-09 Nokia Technologies Oy An audio capturing arrangement
CN113287166A (en) * 2019-01-04 2021-08-20 诺基亚技术有限公司 Audio capture arrangement
WO2023240467A1 (en) * 2022-06-14 2023-12-21 北京小米移动软件有限公司 Audio playback method and apparatus, and storage medium

Also Published As

Publication number Publication date
CN105120421B (en) 2017-06-30

Similar Documents

Publication Publication Date Title
CN105120421A (en) Method and apparatus of generating virtual surround sound
EP2285139B1 (en) Device and method for converting spatial audio signal
Avni et al. Spatial perception of sound fields recorded by spherical microphone arrays with varying spatial resolution
CN106658343B (en) Method and apparatus for rendering the expression of audio sound field for audio playback
Zhang et al. Insights into head-related transfer function: Spatial dimensionality and continuous representation
CN1735922B (en) Method for processing audio data and sound acquisition device implementing this method
Murphy et al. Openair: An interactive auralization web resource and database
Farina et al. Ambiophonic principles for the recording and reproduction of surround sound for music
CN104699445A (en) Audio information processing method and device
CN104019885A (en) Sound field analysis system
CN101843114A (en) Focusing on a portion of an audio scene for an audio signal
CN106134223A (en) Reappear audio signal processing apparatus and the method for binaural signal
CN104349267A (en) Sound system
CN101518101A (en) Improved spatial resolution of the sound field for multi-channel audio playback systems by deriving signals with high order angular terms
CN107820158B (en) Three-dimensional audio generation device based on head-related impulse response
TW201426738A (en) Apparatus and method for generating a plurality of parametric audio streams and apparatus and method for generating a plurality of loudspeaker signals
US20050069143A1 (en) Filtering for spatial audio rendering
US20130044894A1 (en) System and method for efficient sound production using directional enhancement
CN104408040A (en) Head related function three-dimensional data compression method and system
Zhang et al. 2.5 D multizone reproduction using weighted mode matching: Performance analysis and experimental validation
Steffens et al. The role of early and late reflections on perception of source orientation
CN105509691A (en) Multi-sensor group integration type detection method and head tracking-enabled surround sound method
Hoffbauer et al. Four-directional ambisonic spatial decomposition method with reduced temporal artifacts
Gao et al. Sparse dnn model for frequency expanding of higher order ambisonics encoding process
US10659903B2 (en) Apparatus and method for weighting stereo audio signals

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant