CN103812462B

CN103812462B - Volume control method and device

Info

Publication number: CN103812462B
Application number: CN201210460201.0A
Authority: CN
Inventors: 王田; 吴文海; 张德军; 王凤玲
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2012-11-15
Filing date: 2012-11-15
Publication date: 2016-12-07
Anticipated expiration: 2032-11-15
Also published as: CN103812462A

Abstract

The embodiment of the present invention provides a kind of volume control method and device, and this volume control method includes: at least one road acoustical signal is carried out speech detection respectively；For each road acoustical signal, according to the voice signal detected, determine the speech loudness gain of voice signal, according to the mute signal detected, determine the gain of the largest tolerable of mute signal；Gain generating according to speech loudness gain and largest tolerable adjusts gain；According to adjusting gain, acoustical signal is carried out Gain tuning.The volume control method of embodiment of the present invention offer and device, it is achieved that under the application scenarios that noise is more or during real-time Communication for Power, the loudness to audio frequency controls, and improves the effect that loudness controls.

Description

Volume control method and device

Technical field

The present embodiments relate to audio signal processing technique, particularly relate to a kind of volume control method and device.

Background technology

Loudness is the index that the signal of telecommunication is converted to voice metric energy during acoustic vibration, is human auditory system Subjective feeling to sound intensity.Loudness controls (Loudness Control) mainly according to the loudness of signal Different to different frequency range perception with human ear, increase or deamplification so that acoustic playback is maintained at one In identical perception level or identical loudness.

The loudness of prior art controls to be mainly used in audio player, the audio frequency of audio player plays Voice quality generally relatively good.But, under the application scenarios that noise is more or real-time Communication for Power During, control to carry out Gain tuning by the loudness of prior art, can be by noise according to the adjustment of voice Gain is adjusted, poor effect.

Summary of the invention

The embodiment of the present invention provides a kind of volume control method and device, to realize in the more application of noise Loudness to audio frequency controls under scene or during real-time Communication for Power, improves the effect that loudness controls.

First aspect, the embodiment of the present invention provides a kind of volume control method, including:

At least one road acoustical signal is carried out speech detection respectively；

For each road acoustical signal, according to the voice signal detected, determine the language of described voice signal Sonority gain, according to the mute signal detected, determines the increasing of the largest tolerable of described mute signal Benefit；

Gain generating according to described speech loudness gain and described largest tolerable adjusts gain；

According to described adjustment gain, described acoustical signal is carried out Gain tuning.

In the implementation that the first is possible, described at least one road acoustical signal is carried out voice inspection respectively Survey, including:

For described each road acoustical signal, calculate the root-mean-square of described acoustical signal；

Signal envelope and noise envelope is generated respectively according to described root-mean-square；

Calculate described signal envelope and the ratio of described noise envelope, if described ratio presets threshold more than first Value, then detect described voice signal, otherwise, described mute signal detected.

In conjunction with the first possible implementation of first aspect, in the implementation that the second is possible, The mute signal that described basis detects, determines the gain of the largest tolerable of described mute signal, specifically For:

According to the level of mute signal described in described root mean square calculation, the level according to described mute signal is true The gain of fixed described largest tolerable.

In conjunction with the implementation that the second of first aspect is possible, in the implementation that the third is possible, The described level according to mute signal described in described root mean square calculation, particularly as follows:

The level Noise_Level of the application below equation described mute signal of calculating:

Noise_Level=0.99×Noise_Level+0.01×Ecur；

Wherein, Ecur is described root-mean-square.

In the 4th kind of possible implementation, the voice signal that described basis detects, determine institute's predicate The speech loudness gain of tone signal, particularly as follows:

Described voice signal is carried out loudness Filtering Processing, the voice signal after loudness Filtering Processing is carried out Signal level is added up, and determines the level of described voice signal according to statistical result, according to described voice signal Level determine described speech loudness gain.

In the 5th kind of possible implementation, described can according to described speech loudness gain and described maximum The Gain generating of tolerance adjusts gain, particularly as follows:

If the absolute value of described speech loudness gain is more than the second predetermined threshold value, then application below equation generates Described adjustment gain G ain:

Gain=LGain×(1.0-(LGain+NGain)/(LGain×2))；

Wherein, LGain is described speech loudness gain, and NGain is the gain of described largest tolerable.

In the 6th kind of possible implementation, described according to described adjustment gain, described acoustical signal is entered Row Gain tuning, particularly as follows:

Adjustment duration is determined according to described acoustical signal, true according to described adjustment gain and described adjustment duration Synchronizing of setting the tone is long, according to described adjustment gain and described adjustment step-length, described acoustical signal is carried out gain tune Whole.

In the 7th kind of possible implementation, if described acoustical signal is at least two-way, the most described basis The Gain generating of described speech loudness gain and described largest tolerable adjusts after increasing, described in described basis Before adjustment gain carries out Gain tuning to described acoustical signal, described method also includes:

Calculate described speech loudness gain and the difference of described adjustment gain on each road, determine in each road Big difference, described adjustment gain G ain on each road is adjusted by application below equation:

Gain=2×Gain–LDiffMax–LGain；

Wherein, LDiffMax is the difference of described maximum, and LGain is described speech loudness gain.

Second aspect, the embodiment of the present invention provides a kind of loudness to control device, including:

Detector unit, for carrying out speech detection respectively at least one road acoustical signal；

First processing unit, is connected with described detector unit, for for each road acoustical signal, according to The voice signal detected, determines the speech loudness gain of described voice signal, quiet according to detect Signal, determines the gain of the largest tolerable of described mute signal；

Second processing unit, is connected with described first processing unit, for according to described speech loudness gain Gain is adjusted with the Gain generating of described largest tolerable；

First adjustment unit, is connected with described second processing unit, is used for according to described adjustment gain institute State acoustical signal and carry out Gain tuning.

In the implementation that the first is possible, described detector unit includes:

First processes subelement, for for described each road acoustical signal, calculates described acoustical signal Root-mean-square；

Second processes subelement, processes subelement with described first and is connected, for dividing according to described root-mean-square Sheng Cheng signal envelope and noise envelope；

Judgment sub-unit, processes subelement with described second and is connected, be used for calculating described signal envelope and institute State the ratio of noise envelope, if described ratio is more than the first predetermined threshold value, then described voice signal detected, Otherwise, described mute signal is detected.

In conjunction with the first possible implementation of second aspect, in the implementation that the second is possible, Described first processing unit specifically for the level according to mute signal described in described root mean square calculation, according to The level of described mute signal determines the gain of described largest tolerable.

In conjunction with the implementation that the second of second aspect is possible, in the implementation that the third is possible, Described first processing unit calculates the level of described mute signal specifically for application below equation Noise_Level:

Noise_Level=0.99×Noise_Level+0.01×Ecur；

Wherein, Ecur is described root-mean-square.

In the 4th kind of possible implementation, described first processing unit is specifically for believing described voice Number carry out loudness Filtering Processing, the voice signal after loudness Filtering Processing is carried out signal level statistics, root Result determines the level of described voice signal according to statistics, determines institute's predicate according to the level of described voice signal Sonority gain.

In the 5th kind of possible implementation, if described second processing unit is specifically for institute's Voice Response The absolute value of degree gain is more than the second predetermined threshold value, then application below equation generates described adjustment gain G ain:

Gain=LGain×(1.0-(LGain+NGain)/(LGain×2))；

In the 6th kind of possible implementation, described first adjustment unit is specifically for according to described sound Signal determines adjustment duration, determines adjustment step-length according to described adjustment gain and described adjustment duration, according to Described adjustment gain and described adjustment step-length carry out Gain tuning to described acoustical signal.

In the 7th kind of possible implementation, if described acoustical signal is at least two-way, described loudness control Device processed also includes:

Second adjustment unit, is connected with described first adjustment unit, for calculating the described voice on each road Loudness gain and the difference of described adjustment gain, determine difference maximum in each road, applies below equation pair Described adjustment gain G ain on each road is adjusted:

Gain=2×Gain-LDiffMax-LGain；

As shown from the above technical solution, the embodiment of the present invention provide volume control method and device, loudness Control device and at least one road acoustical signal is carried out speech detection respectively, for each road acoustical signal, root According to the voice signal detected, determine the speech loudness gain of voice signal, according to the quiet letter detected Number, determine the gain of the largest tolerable of mute signal, according to speech loudness gain and largest tolerable Gain generating adjusts gain, according to adjusting gain, acoustical signal is carried out Gain tuning.By to quiet letter Number detection determine the gain of largest tolerable, and simultaneously according to speech loudness gain and largest tolerable Gain generating adjusts gain, is adjusted the gain of acoustical signal by this adjustment gain so that adjust After acoustical signal more conform to the perception level of human ear, greatly increase the effect that loudness controls.

Accompanying drawing explanation

In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to reality Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that under, Accompanying drawing during face describes is some embodiments of the present invention, for those of ordinary skill in the art, On the premise of not paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.

The first volume control method flow chart that Fig. 1 provides for the embodiment of the present invention；

The second volume control method flow chart that Fig. 2 provides for the embodiment of the present invention；

The first loudness that Fig. 3 provides for the embodiment of the present invention controls apparatus structure schematic diagram；

The second loudness that Fig. 4 provides for the embodiment of the present invention controls apparatus structure schematic diagram；

The third loudness that Fig. 5 provides for the embodiment of the present invention controls apparatus structure schematic diagram.

Detailed description of the invention

For making the purpose of the embodiment of the present invention, technical scheme and advantage clearer, below in conjunction with this Accompanying drawing in bright embodiment, is clearly and completely described the technical scheme in the embodiment of the present invention, Obviously, described embodiment is a part of embodiment of the present invention rather than whole embodiments.Based on Embodiment in the present invention, those of ordinary skill in the art are obtained under not making creative work premise The every other embodiment obtained, broadly falls into the scope of protection of the invention.

The first volume control method flow chart that Fig. 1 provides for the embodiment of the present invention.As it is shown in figure 1, The volume control method that the present embodiment provides specifically can apply to but is not limited under virtual meeting scene sound The process that controls in real time of the loudness of tone signal, the volume control method that the present embodiment provides can pass through loudness Controlling device to perform, this loudness controls device and can be integrated in audio processing equipment, it is also possible to individually Arrange.This audio processing equipment specifically can apply in conference system.

The volume control method that the present embodiment provides specifically includes:

Step 10, at least one road acoustical signal is carried out speech detection respectively；

Step 20, for each road acoustical signal, according to the voice signal detected, determine described voice The speech loudness gain of signal, according to the mute signal detected, determines that the maximum of described mute signal can The gain of tolerance；

Step 30, adjust gain according to the Gain generating of described speech loudness gain and described largest tolerable；

Step 40, according to described adjustment gain, described acoustical signal is carried out Gain tuning.

Specifically, at least one road acoustical signal is the signal needing to carry out loudness control, when in virtual meeting Under view application scenarios, acoustical signal is usually multichannel.Audio processing equipment receives the monophonic of each terminal Code stream, obtain a road acoustical signal after being decoded a monaural code stream processing, more right Each road acoustical signal carries out loudness control.

The acoustical signal received is probably voice signal and is also likely to be mute signal, when a user speaks, This acoustical signal is voice signal, and when user does not speak, this acoustical signal is mute signal.To sound Tone signal carries out speech detection, voice signal or mute signal during to judge this acoustical signal.This voice Detection process specifically can use Voice activity detector (Voice Active Detection is called for short VAD) Method realizes,

When voice signal being detected, determine the speech loudness gain of voice signal, when quiet letter being detected Number time, information entrained in mute signal is usually noise, determines the largest tolerable of mute signal Gain, the calculating of the gain of speech loudness gain and largest tolerable specifically can be according to playback gain (Replay Gain) standard realizes.Gain generating according to speech loudness gain and largest tolerable is adjusted Whole gain, is adjusted the gain of this road acoustical signal by adjusting gain, to realize loudness control.

The volume control method that the present embodiment provides, loudness controls device at least one road acoustical signal respectively Carry out speech detection, for each road acoustical signal, according to the voice signal detected, determine that voice is believed Number speech loudness gain, according to the mute signal detected, determine the largest tolerable of mute signal Gain, adjusts gain according to the Gain generating of speech loudness gain and largest tolerable, according to adjusting gain Acoustical signal is carried out Gain tuning.By the detection of mute signal being determined the gain of largest tolerable, And adjust gain according to the Gain generating of speech loudness gain and largest tolerable simultaneously, increased by this adjustment The gain of acoustical signal is adjusted by benefit so that the acoustical signal after adjustment more conforms to the perception of human ear Level, greatly increases the effect that loudness controls.

The second volume control method flow chart that Fig. 2 provides for the embodiment of the present invention.As in figure 2 it is shown, In the present embodiment, step 10, described at least one road acoustical signal is carried out speech detection respectively, specifically May include steps of:

Step 101, for described each road acoustical signal, calculate the root-mean-square of described acoustical signal；

Step 102, generate signal envelope and noise envelope respectively according to described root-mean-square；

Step 103, calculate described signal envelope and the ratio of described noise envelope, if described ratio is more than First predetermined threshold value, then detect described voice signal, otherwise, described mute signal detected.

Specifically, acoustical signal is digital signal, may determine that sound by calculating the root-mean-square of acoustical signal The energy of tone signal.Acoustical signal includes that multiple audio frame sequence, each audio frame include multiple sampled point, The detection process of acoustical signal is the detection process to audio frame.Such as, audio frame is s (n), n=0, 1 ..., N-1, N are the number of sampled point, when the frame length of s (n) is 20ms, can pass through below equation Calculate the root-mean-square Ecur of s (n).

Ecur = 20 \times \log 10 Σ_{n = 0}^{N - 1} s (n) \times s (n) / N;

The process generating signal envelope Senv according to root-mean-square Ecur is specifically as follows:

If Ecur is more than thread1, then Senv=0.9 × Senv+0.1 × Ecur；

If Ecur is not more than thread1, then Senv=0.998 × Senv+0.002 × Ecur；

The process generating noise envelope Sno according to root-mean-square Ecur is specifically as follows:

If Ecur is more than thread2, then Sno=0.998 × Sno+0.002 × Ecur；

If Ecur is not more than thread2, then Sno=0.9 × Sno+0.1 × Ecur；

The initial value of thread1 and thread2 can be arranged based on experience value, and thread1 >=thread2, During processing, further according to Senv and Sno, thread1 and thread2 is adjusted.As, thread2 =(Senv+Sno) × 0.5, thread1 could be arranged to equal with thread2, or slightly larger than thread2.

The ratio Senv/Sno of signal calculated envelope and noise envelope, it is judged that whether Senv/Sno is more than Thread3, the most then this acoustical signal is voice signal, and otherwise, this acoustical signal is noise signal.

It should be noted that the parameter in above-mentioned formula all can arrange according to actual treatment situation and adjust Whole.

In the present embodiment, in step 20, the mute signal that described basis detects, determine described quiet The gain of the largest tolerable of signal, is specifically as follows:

Specifically, can first determine the maximum noise level that human ear can be tolerated, for example, 30dB, then Difference according to this maximum noise level Yu the level of mute signal determines the gain of this largest tolerable, i.e. In the range of human ear can be tolerated, mute signal is adjusted.

In the present embodiment, the described level according to mute signal described in described root mean square calculation, specifically may be used Think:

Noise_Level=0.99×Noise_Level+0.01×Ecur；

Wherein, Ecur is described root-mean-square.

Assume that the maximum noise level that human ear can be tolerated is NoiseThread, then the gain of largest tolerable NGain=NoiseThread-Noise_Level。

In the present embodiment, in step 20, the voice signal that described basis detects, determine described voice The speech loudness gain of signal, is specifically as follows:

Specifically, can use loudness wave filter that voice signal is carried out loudness Filtering Processing, this loudness is filtered Ripple device can be with the high pass IIR yulewalk wave filter on 10 rank and the Butterworth high pass of 2 order frequency 150HZ Wave filter cascade approaches this loudness wave filter.High pass IIR yulewalk wave filter and Butterworth high pass The parameter of wave filter can need to arrange according to actual process, is not limited with the present embodiment.

Voice signal after loudness Filtering Processing carries out signal level statistics be specifically as follows: calculate voice The root-mean-square Level of each audio frame in signal, audio frame is s (n), n=0,1 ..., N-1, N are for adopting The number of sampling point, when the frame length of s (n) is 20ms, and sample rate is 16Hz, root-mean-square Level specifically may be used To be realized by below equation:

Level = 20 \times \log 10 Σ_{n = 0}^{N - 1} (s (n) \times s (n) / N / 2^{15});

The root-mean-square of multiple audio frames is carried out level statistics, if the level distribution of major part audio frame, certain Around individual level, then can determine the level Level1 of voice signal according to this level.Can be by following Formula calculating speech loudness gain LGain:

LGain=Lref–Level1；Wherein, Lref is specifically as follows an empirical value, for example,-14dB.

In the present embodiment, step 30, described according to described speech loudness gain and described largest tolerable Gain generating adjust gain, be specifically as follows:

Gain=LGain×(1.0-(LGain+NGain)/(LGain×2))；

Specifically, first absolute value and second predetermined threshold value of speech loudness gain being compared, this is years old Two predetermined threshold value are specifically as follows an empirical value, for example, 3dB..Absolute value when speech loudness gain During more than the second predetermined threshold value, being generated by above-mentioned formula and adjust gain, the parameter in above-mentioned formula also may be used To arrange according to actual needs, the present embodiment merely provides one preferred embodiment, the present invention It is not limited thereto.

In the present embodiment, step 40, described according to described adjustment gain, described acoustical signal is increased Benefit adjusts, particularly as follows:

Determine adjustment duration according to described acoustical signal, determine according to described adjustment gain and described adjustment duration Adjust step-length, according to described adjustment gain and described adjustment step-length, described acoustical signal is carried out Gain tuning.

Acoustical signal is carried out Gain tuning and can use automatic growth control (Automatic Gain Control, is called for short AGC) method, detailed process is:

Signal characteristic according to acoustical signal calculates and adjusts step-length, first can be according to the signal of acoustical signal Type calculates and adjusts duration decay, such as: decay=Ratio × MaxFrameNum × FrameLen+ FrameLen, wherein, Ratio is acoustical signal similarity when being to speak state, MaxFrameNum For largest frames long number, FrameLen is the length of every frame,.

Calculate adjustment step-length delt:delt=(the curGain-m_oldGain)/decay of each sampled point, CurGain is the gain of current sampling point, and m_oldGain is the gain of last sampled point, for first The gain of sampled point could be arranged to 1.

Each audio frame of acoustical signal is carried out Gain tuning, S ' (n)=S (n) × (m_oldGain+delt) And update oldGain.This adjustment process is specifically as follows:

The gain of last sampled point is m_oldGain, and curGain is the current sampling point after some frames Gain, decay is the duration of some frames.

The first step, application formula delt=(curGain-m_oldGain)/decay, calculate a pointwise The delt updated.

Second step, updates each sampled point:

m_curGain=m_oldGain+delt；

S’(n)=S(n)×m_curGain；

m_oldGain=m_curGain；

Here adding the gain of a current sample, above iteration is until this frame signal has processed. After coming Deng lower frame signal, can be according to the processing procedure more than characteristic cycle of new signal.

In the present embodiment, if described acoustical signal is at least two-way, then step 30, described in described basis The Gain generating of speech loudness gain and described largest tolerable adjusts after increasing, step 40, described basis Before described adjustment gain carries out Gain tuning to described acoustical signal, described method can also include:

Step 50, the described speech loudness gain calculating each road and the difference of described adjustment gain, determine Difference maximum in each road, described adjustment gain G ain on each road is adjusted by application below equation:

Gain=2×Gain–LDiffMax–LGain；

Specifically, under virtual meeting application scenarios, it is usually present multi-channel sound signal, for example, M road, The adjustment gain of m road acoustical signal is Gain (m), [m=1..M].Ring at the voice calculating each road During degree gain, it is identical with reference to Lref, if the most each Lu Douyong LGain (m) is carried out Adjust, then the energy after adjustment is Lref, when the noise level difference on every road when, finally calculates Obtain Gain (m), each road acoustical signal through Gain (m) adjust after energy be different, by with Under type adjusts the adjustment energy of every road acoustical signal that aligns automatically so that the level of output is identical.

First the acoustical signal difference to Lref on every road is calculated: LDiff (m)=LGain (m)-Gain (m), from In choose maximum LDiffMax, i.e. LDiffMax=Max (LDiff (m)), then by formula Gain=2 Described adjustment gain G ain on each road is adjusted by × Gain LDiffMax LGain.

In actual application, can also be according to presetting through Gain tuning Hou Ge road acoustical signal Scene add azimuth information to every road acoustical signal, synthesize 3d audio frequency.

The first loudness that Fig. 3 provides for the embodiment of the present invention controls apparatus structure schematic diagram.Such as Fig. 3 institute Showing, the loudness that the present embodiment provides controls device specifically can realize the sound that any embodiment of the present invention provides Each step of degree control method, the process of implementing does not repeats them here.The loudness that the present embodiment provides Control device can be integrated in audio processing equipment, it is also possible to be separately provided.This audio processing equipment has Body can apply in conference system.

The present embodiment provide loudness control device specifically include detector unit the 11, first processing unit 12, Second processing unit 13 and the first adjustment unit 14.Described detector unit 11 is at least one road sound Signal carries out speech detection respectively.Described first processing unit 12 is connected with described detector unit 11, uses In for each road acoustical signal, according to the voice signal detected, determine the voice of described voice signal Loudness gain, according to the mute signal detected, determines the gain of the largest tolerable of described mute signal. Described second processing unit 13 is connected with described first processing unit 12, for according to described speech loudness The Gain generating of gain and described largest tolerable adjusts gain.Described first adjustment unit 14 and described the Two processing units 13 are connected, for described acoustical signal being carried out Gain tuning according to described adjustment gain.

The loudness that the present embodiment provides controls device, and at least one road acoustical signal is entered by detector unit 11 respectively Row speech detection, the first processing unit 12 is for each road acoustical signal, according to the voice signal detected, Determine the speech loudness gain of voice signal, according to the mute signal detected, determine that mute signal is Big tolerable gain, the second processing unit 13 is raw according to the gain of speech loudness gain and largest tolerable Becoming to adjust gain, the first adjustment unit 14 carries out Gain tuning according to adjusting gain to acoustical signal.Pass through Detection to mute signal determines the gain of largest tolerable, and simultaneously according to speech loudness gain and maximum Tolerable Gain generating adjusts gain, is adjusted the gain of acoustical signal by this adjustment gain, Make the acoustical signal after adjusting more conform to the perception level of human ear, greatly increase what loudness controlled Effect.

The second loudness that Fig. 4 provides for the embodiment of the present invention controls apparatus structure schematic diagram.Such as Fig. 4 institute Show, in the present embodiment, described detector unit 11 specifically can include the first process subelement 21, Two process subelement 22 and judgment sub-unit 23.Described first process subelement 21 for for described often One road acoustical signal, calculates the root-mean-square of described acoustical signal.Described second processes subelement 22 with described First processes subelement 21, for generating signal envelope and noise envelope respectively according to described root-mean-square.Institute State judgment sub-unit 23 to process subelement 22 with described second and be connected, be used for calculating described signal envelope with The ratio of described noise envelope, if described ratio is more than the first predetermined threshold value, then detects that described voice is believed Number, otherwise, described mute signal detected.

Ecur = 20 \times \log 10 Σ_{n = 0}^{N - 1} s (n) \times s (n) / N;

If Ecur is more than thread1, then Senv=0.9 × Senv+0.1 × Ecur；

If Ecur is not more than thread1, then Senv=0.998 × Senv+0.002 × Ecur；

If Ecur is more than thread2, then Sno=0.998 × Sno+0.002 × Ecur；

If Ecur is not more than thread2, then Sno=0.9 × Sno+0.1 × Ecur；

In the present embodiment, described first processing unit 12 specifically may be used for according to described root mean square calculation The level of described mute signal, determines the gain of described largest tolerable according to the level of described mute signal.

In the present embodiment, described first processing unit 12 specifically can be specifically for application below equation meter Calculate the level Noise_Level of described mute signal:

Noise_Level=0.99×Noise_Level+0.01×Ecur；

Wherein, Ecur is described root-mean-square.

In the present embodiment, described first processing unit 12 specifically may be used for ringing described voice signal Degree Filtering Processing, carries out signal level statistics to the voice signal after loudness Filtering Processing, according to statistical result Determine the level of described voice signal, determine described speech loudness gain according to the level of described voice signal.

Specifically, the first processing unit 12 can arrange loudness wave filter, by loudness wave filter to language Tone signal carries out loudness Filtering Processing, and this loudness wave filter can filter with the high pass IIR yulewalk on 10 rank The butterworth high pass filter cascade of device and 2 order frequency 150HZ approaches this loudness wave filter.High pass The parameter of IIR yulewalk wave filter and butterworth high pass filter can be according to actual process need Arrange, be not limited with the present embodiment.

In the present embodiment, if exhausted specifically for described speech loudness gain of described second processing unit 13 To value more than the second predetermined threshold value, then described adjustment gain G ain of application below equation generation:

Gain=LGain×(1.0-(LGain+NGain)/(LGain×2))；

In the present embodiment, described first adjustment unit 14 is specifically for determining tune according to described acoustical signal Whole duration, determines adjustment step-length according to described adjustment gain and described adjustment duration, increases according to described adjustment Beneficial and described adjustment step-length carries out Gain tuning to described acoustical signal.

In the present embodiment, further, if described acoustical signal is at least two-way, the control of described loudness Device can also include that the second adjustment unit 15, described second adjustment unit 15 adjust list with described first Unit 14 is connected, for calculating described speech loudness gain and the difference of described adjustment gain on each road, really Difference maximum in Ding Ge road, described adjustment gain G ain on each road is adjusted by application below equation:

Gain=2×Gain-LDiffMax-LGain；

Under virtual meeting application scenarios, it is usually present multi-channel sound signal, for example, M road, m road The adjustment gain of acoustical signal is Gain (m), [m=1..M].Speech loudness gain on each road of calculating During, it is identical with reference to Lref, if the most each Lu Douyong LGain (m) is adjusted, that Energy after Tiao Zhenging is Lref, when the noise level difference on every road when, is finally calculated Gain (m), acoustical signal energy after Gain (m) adjusts in each road is different, by with lower section Formula adjusts the adjustment energy of every road acoustical signal that aligns automatically so that the level of output is identical.

The third loudness that Fig. 5 provides for the embodiment of the present invention controls apparatus structure schematic diagram.Such as Fig. 5 institute Showing, the loudness that the present embodiment provides controls device specifically can realize the sound that any embodiment of the present invention provides Each step of degree control method, the process of implementing does not repeats them here.The loudness that the present embodiment provides Controlling device and specifically include processor 31 and memorizer 32, described memorizer 32 is used for storing instruction.Institute State processor 31 to couple with described memorizer 32, described processor 31 be configured to execution be stored in described in deposit Instruction in reservoir 32, wherein, described processor 31 is configurable at least one road acoustical signal Carry out speech detection respectively, for each road acoustical signal, according to the voice signal detected, determine institute The speech loudness gain of predicate tone signal, according to the mute signal detected, determines described mute signal The gain of largest tolerable, adjusts according to the Gain generating of described speech loudness gain and described largest tolerable Whole gain, carries out Gain tuning according to described adjustment gain to described acoustical signal.

One of ordinary skill in the art will appreciate that: all or part of step realizing said method embodiment can Completing with the hardware relevant by programmed instruction, aforesaid program can be stored in an embodied on computer readable and deposit In storage media, this program upon execution, performs to include the step of said method embodiment；And aforesaid storage Medium includes: the various media that can store program code such as ROM, RAM, magnetic disc or CD.

Last it is noted that above example is only in order to illustrate technical scheme, rather than to it Limit；Although the present invention being described in detail with reference to previous embodiment, the ordinary skill of this area Personnel it is understood that the technical scheme described in foregoing embodiments still can be modified by it, or Person carries out equivalent to wherein portion of techniques feature；And these amendments or replacement, do not make corresponding skill The essence of art scheme departs from the scope of various embodiments of the present invention technical scheme.

Claims

1. a volume control method, it is characterised in that including:

For each road acoustical signal, according to the voice signal detected, determine the voice of described voice signal Loudness gain, according to the mute signal detected, determines the gain of the largest tolerable of described mute signal；

According to described adjustment gain, described acoustical signal is carried out Gain tuning；

Wherein, described at least one road acoustical signal is carried out speech detection respectively, including:

Volume control method the most according to claim 1, it is characterised in that described basis detects Mute signal, determine the gain of the largest tolerable of described mute signal, particularly as follows:

Volume control method the most according to claim 2, it is characterised in that described according to described all Root calculates the level of described mute signal, particularly as follows:

Noise_Level=0.99 × Noise_Level+0.01 × Ecur；

Wherein, Ecur is described root-mean-square.

Volume control method the most according to claim 1, it is characterised in that described basis detects Voice signal, determine the speech loudness gain of described voice signal, particularly as follows:

Volume control method the most according to claim 1, it is characterised in that described according to institute's predicate The Gain generating of sonority gain and described largest tolerable adjusts gain, particularly as follows:

Gain=LGain × (1.0-(LGain+NGain)/(LGain × 2))；

Volume control method the most according to claim 1, it is characterised in that described according to described tune Whole gain carries out Gain tuning to described acoustical signal, particularly as follows:

Volume control method the most according to claim 1, it is characterised in that if described acoustical signal Being at least two-way, the most described Gain generating according to described speech loudness gain and described largest tolerable is adjusted After whole increasing, described according to described adjustment gain, described acoustical signal is carried out Gain tuning before, described Method also includes:

Gain=2 × Gain LDiffMax LGain；

8. a loudness controls device, it is characterised in that including:

First adjustment unit, is connected with described second processing unit, is used for according to described adjustment gain institute State acoustical signal and carry out Gain tuning；

Wherein, described detector unit includes:

Loudness the most according to claim 8 controls device, it is characterised in that: described first processes list Unit is specifically for the level according to mute signal described in described root mean square calculation, according to described mute signal Level determines the gain of described largest tolerable.

Loudness the most according to claim 9 controls device, it is characterised in that: described first processes Unit is specifically for the level Noise_Level of the application below equation described mute signal of calculating:

Noise_Level=0.99 × Noise_Level+0.01 × Ecur；

Wherein, Ecur is described root-mean-square.

11. loudness according to claim 8 control device, it is characterised in that: described first processes Unit is specifically for carrying out loudness Filtering Processing to described voice signal, to the voice after loudness Filtering Processing Signal carries out signal level statistics, determines the level of described voice signal according to statistical result, according to described The level of voice signal determines described speech loudness gain.

12. loudness according to claim 8 control device, it is characterised in that: described second processes If unit is more than the second predetermined threshold value, then below application specifically for the absolute value of described speech loudness gain Formula described adjustment gain G ain of generation:

Gain=LGain × (1.0-(LGain+NGain)/(LGain × 2))；

13. loudness according to claim 8 control device, it is characterised in that: described first adjusts Unit is specifically for determining adjustment duration according to described acoustical signal, according to described adjustment gain and described tune Whole duration determines adjustment step-length, enters described acoustical signal according to described adjustment gain and described adjustment step-length Row Gain tuning.

14. loudness according to claim 8 control device, it is characterised in that if described sound letter Number being at least two-way, described loudness controls device and also includes:

Gain=2 × Gain LDiffMax LGain；