CN103812462B - Volume control method and device - Google Patents

Publication number
CN103812462B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210460201.0A
Other languages
Chinese (zh)
Other versions
CN103812462A (en)
Inventor
王田
吴文海
张德军
王凤玲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Application filed by Huawei Technologies Co Ltd
Priority to CN201210460201.0A
Publication of CN103812462A
Application granted
Publication of CN103812462B

Landscapes

  • Circuit For Audible Band Transducer (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)

Abstract

Embodiments of the present invention provide a volume control method and device. The volume control method includes: performing speech detection on each of at least one sound signal; for each sound signal, determining a speech loudness gain of a detected voice signal, and determining a maximum tolerable gain of a detected mute signal; generating an adjustment gain according to the speech loudness gain and the maximum tolerable gain; and adjusting the gain of the sound signal according to the adjustment gain. The method and device control the loudness of audio in noisy application scenarios and during real-time communication, and improve the effect of loudness control.

Description

Volume control method and device
Technical field
Embodiments of the present invention relate to audio signal processing technology, and in particular to a volume control method and device.
Background
Loudness is a measure of sound energy when an electrical signal is converted into acoustic vibration; it is the human auditory system's subjective perception of sound intensity. Loudness control amplifies or attenuates a signal according to the loudness of the signal and the human ear's differing sensitivity to different frequency bands, so that playback stays at the same perceived level, that is, the same loudness.
Prior-art loudness control is mainly used in audio players, where the audio being played is generally of good quality. In noisy application scenarios or during real-time communication, however, prior-art loudness control applies to noise the same adjustment gain that it applies to speech, and the result is poor.
Summary of the invention
Embodiments of the present invention provide a volume control method and device that control the loudness of audio in noisy application scenarios or during real-time communication, and improve the effect of loudness control.
In a first aspect, an embodiment of the present invention provides a volume control method, including:
performing speech detection on each of at least one sound signal;
for each sound signal, determining a speech loudness gain of the voice signal according to a detected voice signal, and determining a maximum tolerable gain of the mute signal according to a detected mute signal;
generating an adjustment gain according to the speech loudness gain and the maximum tolerable gain; and
adjusting the gain of the sound signal according to the adjustment gain.
In a first possible implementation, performing speech detection on each of the at least one sound signal includes:
for each sound signal, calculating the root mean square of the sound signal;
generating a signal envelope and a noise envelope from the root mean square; and
calculating the ratio of the signal envelope to the noise envelope; if the ratio is greater than a first preset threshold, the voice signal is detected; otherwise, the mute signal is detected.
With reference to the first possible implementation of the first aspect, in a second possible implementation, determining the maximum tolerable gain of the mute signal according to the detected mute signal is specifically:
calculating the level of the mute signal from the root mean square, and determining the maximum tolerable gain according to the level of the mute signal.
With reference to the second possible implementation of the first aspect, in a third possible implementation, calculating the level of the mute signal from the root mean square is specifically:
calculating the level Noise_Level of the mute signal with the following formula:
Noise_Level=0.99×Noise_Level+0.01×Ecur;
where Ecur is the root mean square.
In a fourth possible implementation, determining the speech loudness gain of the voice signal according to the detected voice signal is specifically:
performing loudness filtering on the voice signal, collecting signal level statistics on the loudness-filtered voice signal, determining the level of the voice signal from the statistical result, and determining the speech loudness gain according to the level of the voice signal.
In a fifth possible implementation, generating the adjustment gain according to the speech loudness gain and the maximum tolerable gain is specifically:
if the absolute value of the speech loudness gain is greater than a second preset threshold, generating the adjustment gain Gain with the following formula:
Gain=LGain×(1.0-(LGain+NGain)/(LGain×2));
where LGain is the speech loudness gain and NGain is the maximum tolerable gain.
In a sixth possible implementation, adjusting the gain of the sound signal according to the adjustment gain is specifically:
determining an adjustment duration according to the sound signal, determining an adjustment step according to the adjustment gain and the adjustment duration, and adjusting the gain of the sound signal according to the adjustment gain and the adjustment step.
In a seventh possible implementation, if there are at least two sound signals, then after the adjustment gain is generated according to the speech loudness gain and the maximum tolerable gain, and before the gain of the sound signal is adjusted according to the adjustment gain, the method further includes:
calculating, for each channel, the difference between the speech loudness gain and the adjustment gain, determining the maximum difference among the channels, and adjusting the adjustment gain Gain of each channel with the following formula:
Gain=2×Gain-LDiffMax-LGain;
where LDiffMax is the maximum difference and LGain is the speech loudness gain.
In a second aspect, an embodiment of the present invention provides a loudness control device, including:
a detection unit, configured to perform speech detection on each of at least one sound signal;
a first processing unit, connected to the detection unit and configured to, for each sound signal, determine a speech loudness gain of the voice signal according to a detected voice signal, and determine a maximum tolerable gain of the mute signal according to a detected mute signal;
a second processing unit, connected to the first processing unit and configured to generate an adjustment gain according to the speech loudness gain and the maximum tolerable gain; and
a first adjustment unit, connected to the second processing unit and configured to adjust the gain of the sound signal according to the adjustment gain.
In a first possible implementation, the detection unit includes:
a first processing subunit, configured to calculate, for each sound signal, the root mean square of the sound signal;
a second processing subunit, connected to the first processing subunit and configured to generate a signal envelope and a noise envelope from the root mean square; and
a judgment subunit, connected to the second processing subunit and configured to calculate the ratio of the signal envelope to the noise envelope; if the ratio is greater than a first preset threshold, the voice signal is detected; otherwise, the mute signal is detected.
With reference to the first possible implementation of the second aspect, in a second possible implementation, the first processing unit is specifically configured to calculate the level of the mute signal from the root mean square, and to determine the maximum tolerable gain according to the level of the mute signal.
With reference to the second possible implementation of the second aspect, in a third possible implementation, the first processing unit is specifically configured to calculate the level Noise_Level of the mute signal with the following formula:
Noise_Level=0.99×Noise_Level+0.01×Ecur;
where Ecur is the root mean square.
In a fourth possible implementation, the first processing unit is specifically configured to perform loudness filtering on the voice signal, collect signal level statistics on the loudness-filtered voice signal, determine the level of the voice signal from the statistical result, and determine the speech loudness gain according to the level of the voice signal.
In a fifth possible implementation, the second processing unit is specifically configured to, if the absolute value of the speech loudness gain is greater than a second preset threshold, generate the adjustment gain Gain with the following formula:
Gain=LGain×(1.0-(LGain+NGain)/(LGain×2));
where LGain is the speech loudness gain and NGain is the maximum tolerable gain.
In a sixth possible implementation, the first adjustment unit is specifically configured to determine an adjustment duration according to the sound signal, determine an adjustment step according to the adjustment gain and the adjustment duration, and adjust the gain of the sound signal according to the adjustment gain and the adjustment step.
In a seventh possible implementation, if there are at least two sound signals, the loudness control device further includes:
a second adjustment unit, connected to the first adjustment unit and configured to calculate, for each channel, the difference between the speech loudness gain and the adjustment gain, determine the maximum difference among the channels, and adjust the adjustment gain Gain of each channel with the following formula:
Gain=2×Gain-LDiffMax-LGain;
where LDiffMax is the maximum difference and LGain is the speech loudness gain.
As the above technical solutions show, in the volume control method and device provided by the embodiments of the present invention, the loudness control device performs speech detection on each of at least one sound signal; for each sound signal, it determines the speech loudness gain of a detected voice signal and the maximum tolerable gain of a detected mute signal, generates an adjustment gain according to the speech loudness gain and the maximum tolerable gain, and adjusts the gain of the sound signal according to the adjustment gain. Because the maximum tolerable gain is determined by detecting the mute signal, and the adjustment gain is generated from both the speech loudness gain and the maximum tolerable gain, the adjusted sound signal better matches the perceived level of the human ear, which greatly improves the effect of loudness control.
Brief description of the drawings
To describe the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings used in describing the embodiments or the prior art are briefly introduced below. The drawings described below show some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flow chart of a first volume control method provided by an embodiment of the present invention;
Fig. 2 is a flow chart of a second volume control method provided by an embodiment of the present invention;
Fig. 3 is a schematic structural diagram of a first loudness control device provided by an embodiment of the present invention;
Fig. 4 is a schematic structural diagram of a second loudness control device provided by an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of a third loudness control device provided by an embodiment of the present invention.
Detailed description of the invention
To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are some, not all, of the embodiments of the present invention. All other embodiments that a person of ordinary skill in the art obtains from these embodiments without creative effort fall within the scope of protection of the present invention.
Fig. 1 is a flow chart of the first volume control method provided by an embodiment of the present invention. As shown in Fig. 1, the volume control method of this embodiment can be applied to, but is not limited to, real-time loudness control of sound signals in a virtual conference scenario. The method can be performed by a loudness control device, which can be integrated into an audio processing device or provided separately. The audio processing device can be used in a conference system.
The volume control method provided by this embodiment specifically includes:
Step 10: perform speech detection on each of at least one sound signal;
Step 20: for each sound signal, determine the speech loudness gain of the voice signal according to a detected voice signal, and determine the maximum tolerable gain of the mute signal according to a detected mute signal;
Step 30: generate an adjustment gain according to the speech loudness gain and the maximum tolerable gain;
Step 40: adjust the gain of the sound signal according to the adjustment gain.
Specifically, the at least one sound signal is the signal that needs loudness control; in a virtual conference scenario there are usually multiple sound signals. The audio processing device receives a mono bitstream from each terminal, decodes each mono bitstream to obtain one sound signal, and then performs loudness control on each sound signal.
A received sound signal may be a voice signal or a mute signal: when the user is speaking, the sound signal is a voice signal, and when the user is not speaking, it is a mute signal. Speech detection is performed on the sound signal to judge whether it is a voice signal or a mute signal. Speech detection can be implemented with a voice activity detection (VAD) method.
When a voice signal is detected, the speech loudness gain of the voice signal is determined. When a mute signal is detected, the information it carries is usually noise, and the maximum tolerable gain of the mute signal is determined. The speech loudness gain and the maximum tolerable gain can be calculated according to the Replay Gain standard. An adjustment gain is generated from the speech loudness gain and the maximum tolerable gain, and the gain of the sound signal is adjusted by the adjustment gain to achieve loudness control.
In the volume control method provided by this embodiment, the loudness control device performs speech detection on each of at least one sound signal; for each sound signal, it determines the speech loudness gain of a detected voice signal and the maximum tolerable gain of a detected mute signal, generates an adjustment gain according to the speech loudness gain and the maximum tolerable gain, and adjusts the gain of the sound signal according to the adjustment gain. Because the maximum tolerable gain is determined by detecting the mute signal, and the adjustment gain is generated from both the speech loudness gain and the maximum tolerable gain, the adjusted sound signal better matches the perceived level of the human ear, which greatly improves the effect of loudness control.
Fig. 2 is a flow chart of the second volume control method provided by an embodiment of the present invention. As shown in Fig. 2, in this embodiment, step 10, performing speech detection on each of the at least one sound signal, may specifically include the following steps:
Step 101: for each sound signal, calculate the root mean square of the sound signal;
Step 102: generate a signal envelope and a noise envelope from the root mean square;
Step 103: calculate the ratio of the signal envelope to the noise envelope; if the ratio is greater than the first preset threshold, the voice signal is detected; otherwise, the mute signal is detected.
Specifically, the sound signal is a digital signal, and its energy can be determined by calculating its root mean square. The sound signal consists of a sequence of audio frames, each containing multiple sampling points; detecting the sound signal means detecting its audio frames. For example, let an audio frame be s(n), n=0, 1, ..., N-1, where N is the number of sampling points. When the frame length of s(n) is 20 ms, the root mean square Ecur of s(n) can be calculated with the following formula:
$$\mathrm{Ecur} = 20 \times \log_{10} \sqrt{\sum_{n=0}^{N-1} s(n) \times s(n) / N}$$
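The per-frame level calculation above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `frame_rms_db` and the `floor` guard against taking the logarithm of zero are assumptions that do not appear in the source, and the square root reflects the term "root mean square" used in the text.

```python
import math


def frame_rms_db(frame, floor=1e-12):
    """Return Ecur: 20*log10 of the root mean square of one audio frame.

    `floor` is a hypothetical guard (not in the source) that avoids
    log10(0) when the frame contains only zeros.
    """
    n = len(frame)
    mean_square = sum(x * x for x in frame) / n
    return 20.0 * math.log10(max(math.sqrt(mean_square), floor))


# A 20 ms frame holds 320 samples if a 16 kHz sample rate is assumed.
ecur = frame_rms_db([1.0] * 320)
```

A frame of constant amplitude 1.0 gives a level of 0 dB, and each tenfold increase in amplitude adds 20 dB.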
The signal envelope Senv can be generated from the root mean square Ecur as follows:
If Ecur is greater than thread1, then Senv=0.9×Senv+0.1×Ecur;
If Ecur is not greater than thread1, then Senv=0.998×Senv+0.002×Ecur.
The noise envelope Sno can be generated from the root mean square Ecur as follows:
If Ecur is greater than thread2, then Sno=0.998×Sno+0.002×Ecur;
If Ecur is not greater than thread2, then Sno=0.9×Sno+0.1×Ecur.
The initial values of thread1 and thread2 can be set empirically, with thread1 >= thread2, and during processing thread1 and thread2 are further adjusted according to Senv and Sno. For example, thread2=(Senv+Sno)×0.5, and thread1 can be set equal to thread2 or slightly larger than thread2.
The ratio Senv/Sno of the signal envelope to the noise envelope is calculated, and it is judged whether Senv/Sno is greater than thread3, the first preset threshold. If so, the sound signal is a voice signal; otherwise, the sound signal is a noise signal, that is, a mute signal.
Note that the parameters in the above formulas can all be set and adjusted according to actual processing conditions.
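The envelope tracking and the ratio test can be sketched as below. The smoothing coefficients and the threshold update come from the text; the class name `EnvelopeVad`, the initial envelope values, and the value 1.2 for thread3 are assumptions chosen only for illustration, since the source leaves them to experience. The sketch also assumes positive frame levels, so that the ratio Senv/Sno is well behaved.

```python
class EnvelopeVad:
    """Frame-level speech detector based on signal and noise envelopes."""

    def __init__(self, senv=30.0, sno=30.0, thread3=1.2):
        # Initial envelopes and thread3 are hypothetical empirical values.
        self.senv = senv
        self.sno = sno
        self.thread3 = thread3
        self.thread2 = (senv + sno) * 0.5
        self.thread1 = self.thread2  # set equal to thread2

    def update(self, ecur):
        """Feed one frame level Ecur; return True if voice is detected."""
        # The signal envelope rises quickly and falls slowly.
        if ecur > self.thread1:
            self.senv = 0.9 * self.senv + 0.1 * ecur
        else:
            self.senv = 0.998 * self.senv + 0.002 * ecur
        # The noise envelope rises slowly and falls quickly.
        if ecur > self.thread2:
            self.sno = 0.998 * self.sno + 0.002 * ecur
        else:
            self.sno = 0.9 * self.sno + 0.1 * ecur
        # Thresholds follow the envelopes during processing.
        self.thread2 = (self.senv + self.sno) * 0.5
        self.thread1 = self.thread2
        return self.senv / self.sno > self.thread3


vad = EnvelopeVad()
noise_flags = [vad.update(30.0) for _ in range(20)]   # steady background
speech_flags = [vad.update(70.0) for _ in range(20)]  # louder frames
```

With these assumed values, steady frames at the initial level keep the ratio near 1 and are classified as mute, while a run of much louder frames pushes the signal envelope up faster than the noise envelope and is classified as voice.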
In this embodiment, in step 20, determining the maximum tolerable gain of the mute signal according to the detected mute signal can specifically be:
calculating the level of the mute signal from the root mean square, and determining the maximum tolerable gain according to the level of the mute signal.
Specifically, the maximum noise level that the human ear can tolerate can be determined first, for example 30 dB, and the maximum tolerable gain is then determined from the difference between this maximum noise level and the level of the mute signal; that is, the mute signal is adjusted within the range that the human ear can tolerate.
In this embodiment, calculating the level of the mute signal from the root mean square can specifically be:
calculating the level Noise_Level of the mute signal with the following formula:
Noise_Level=0.99×Noise_Level+0.01×Ecur;
where Ecur is the root mean square.
Assuming that the maximum noise level that the human ear can tolerate is NoiseThread, the maximum tolerable gain is NGain=NoiseThread-Noise_Level.
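The two formulas above amount to a first-order recursive average followed by a subtraction, as the following minimal sketch shows. The function names are assumptions; the 30 dB default for NoiseThread is the example value given in the text.

```python
def update_noise_level(noise_level, ecur):
    """One smoothing step: Noise_Level = 0.99*Noise_Level + 0.01*Ecur."""
    return 0.99 * noise_level + 0.01 * ecur


def max_tolerable_gain(noise_level, noise_thread=30.0):
    """NGain = NoiseThread - Noise_Level, both in dB."""
    return noise_thread - noise_level


# Track the level over a few mute frames, then derive NGain.
level = 20.0
for ecur in (20.0, 20.0, 20.0):
    level = update_noise_level(level, ecur)
ngain = max_tolerable_gain(level)
```

For a noise floor that sits steadily at 20 dB, the sketch gives a maximum tolerable gain of about 10 dB, meaning the noise can be raised by that much before it reaches the assumed 30 dB limit.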
In this embodiment, in step 20, determining the speech loudness gain of the voice signal according to the detected voice signal can specifically be:
performing loudness filtering on the voice signal, collecting signal level statistics on the loudness-filtered voice signal, determining the level of the voice signal from the statistical result, and determining the speech loudness gain according to the level of the voice signal.
Specifically, a loudness filter can be used to perform loudness filtering on the voice signal. The loudness filter can be approximated by cascading a 10th-order high-pass IIR yulewalk filter with a 2nd-order Butterworth high-pass filter with a cutoff frequency of 150 Hz. The parameters of the high-pass IIR yulewalk filter and the Butterworth high-pass filter can be set as required by the actual processing and are not limited to this embodiment.
Collecting signal level statistics on the loudness-filtered voice signal can specifically be: calculating the root mean square Level of each audio frame in the voice signal. Let an audio frame be s(n), n=0, 1, ..., N-1, where N is the number of sampling points. When the frame length of s(n) is 20 ms and the sample rate is 16 kHz, the root mean square Level can be obtained with the following formula:
$$\mathrm{Level} = 20 \times \log_{10} \sqrt{\sum_{n=0}^{N-1} \left( s(n) \times s(n) / N / 2^{15} \right)}$$
Level statistics are collected over the root mean squares of multiple audio frames. If the levels of most audio frames are distributed around a certain level, the level Level1 of the voice signal can be determined from that level. The speech loudness gain LGain can then be calculated with the following formula:
LGain=Lref-Level1; where Lref can be an empirical value, for example -14 dB.
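One way to read "the level around which most frames are distributed" is as the most populated bin of a level histogram. The sketch below takes that reading, which is an assumption: the source does not specify the statistic, and the 1 dB bin width and the function names are hypothetical. The -14 dB default for Lref is the example value from the text.

```python
from collections import Counter


def speech_level(frame_levels_db, bin_width=1.0):
    """Return Level1 as the centre of the most populated histogram bin.

    Assumption: a histogram mode stands in for the unspecified level
    statistic; bin_width is a hypothetical parameter.
    """
    bins = Counter(round(level / bin_width) for level in frame_levels_db)
    most_common_bin, _count = bins.most_common(1)[0]
    return most_common_bin * bin_width


def speech_loudness_gain(level1, lref=-14.0):
    """LGain = Lref - Level1, in dB."""
    return lref - level1


levels = [-30.2, -29.8, -30.1, -25.0, -35.0]
level1 = speech_level(levels)         # -30.0: three frames fall in that bin
lgain = speech_loudness_gain(level1)  # 16.0
```

A voice signal whose frames cluster around -30 dB therefore needs about 16 dB of gain to reach the assumed reference level of -14 dB.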
In this embodiment, step 30, generating the adjustment gain according to the speech loudness gain and the maximum tolerable gain, can specifically be:
if the absolute value of the speech loudness gain is greater than the second preset threshold, generating the adjustment gain Gain with the following formula:
Gain=LGain×(1.0-(LGain+NGain)/(LGain×2));
where LGain is the speech loudness gain and NGain is the maximum tolerable gain.
Specifically, the absolute value of the speech loudness gain is first compared with the second preset threshold, which can be an empirical value, for example 3 dB. When the absolute value of the speech loudness gain is greater than the second preset threshold, the adjustment gain is generated with the above formula. The parameters in the above formula can also be set according to actual needs; this embodiment merely provides one preferred implementation, and the present invention is not limited to it.
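The formula can be evaluated directly, as in the minimal sketch below. The function name is an assumption, and so is the behaviour when the absolute value of LGain does not exceed the threshold: the source does not state what happens in that case, so the sketch returns 0 dB (no adjustment) as a placeholder. For nonzero LGain, the expression simplifies algebraically to (LGain - NGain)/2.

```python
def adjustment_gain(lgain, ngain, threshold=3.0):
    """Combine speech loudness gain and maximum tolerable gain (dB)."""
    if abs(lgain) > threshold:
        # Formula from the text; algebraically equal to (lgain - ngain) / 2.
        return lgain * (1.0 - (lgain + ngain) / (lgain * 2))
    # Assumption: the source does not cover this branch; leave the
    # signal unchanged.
    return 0.0


gain = adjustment_gain(16.0, 10.0)  # 3.0
```

Because the threshold check guarantees that LGain is nonzero whenever the formula is used, the division by LGain×2 is safe for any positive threshold.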
In this embodiment, step 40, adjusting the gain of the sound signal according to the adjustment gain, is specifically:
determining an adjustment duration according to the sound signal, determining an adjustment step according to the adjustment gain and the adjustment duration, and adjusting the gain of the sound signal according to the adjustment gain and the adjustment step.
The gain of the sound signal can be adjusted with an automatic gain control (AGC) method. The detailed process is as follows:
The adjustment step is calculated from the signal characteristics of the sound signal. First, the adjustment duration decay can be calculated from the signal type of the sound signal, for example: decay=Ratio×MaxFrameNum×FrameLen+FrameLen, where Ratio is the similarity of the sound signal to the speaking state, MaxFrameNum is the maximum number of frames, and FrameLen is the length of each frame.
The adjustment step delt of each sampling point is then calculated as delt=(curGain-m_oldGain)/decay, where curGain is the gain of the current sampling point and m_oldGain is the gain of the previous sampling point; for the first sampling point, the gain can be set to 1.
Gain adjustment is applied to each audio frame of the sound signal, S'(n)=S(n)×(m_oldGain+delt), and m_oldGain is updated. The adjustment process can specifically be as follows:
The gain of the previous sampling point is m_oldGain, curGain is the gain that the current sampling point should reach after several frames, and decay is the duration of those frames.
Step 1: apply the formula delt=(curGain-m_oldGain)/decay to calculate the per-sample update delt.
Step 2: update each sampling point:
m_curGain=m_oldGain+delt;
S'(n)=S(n)×m_curGain;
m_oldGain=m_curGain;
Here m_curGain is the gain of the current sample, and the above iteration repeats until the frame has been processed. When the next frame arrives, the above process can be repeated according to the characteristics of the new signal.
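The per-sample ramp can be sketched as follows. The function names are assumptions. The sketch treats gains as linear factors (the text initialises the gain to 1) and measures decay in samples; it also clamps the gain once the target is reached, which is an added safeguard not stated in the source.

```python
def adjust_duration(ratio, max_frame_num, frame_len):
    """decay = Ratio*MaxFrameNum*FrameLen + FrameLen (assumed in samples)."""
    return ratio * max_frame_num * frame_len + frame_len


def apply_gain_ramp(frame, old_gain, target_gain, decay):
    """Move the gain from old_gain toward target_gain one sample at a time.

    Returns the adjusted frame and the gain reached at its last sample,
    which becomes m_oldGain for the next frame.
    """
    delt = (target_gain - old_gain) / decay
    gain = old_gain
    out = []
    for sample in frame:
        gain += delt  # m_curGain = m_oldGain + delt
        # Assumption: stop at the target instead of overshooting it.
        if (delt > 0 and gain > target_gain) or (delt < 0 and gain < target_gain):
            gain = target_gain
        out.append(sample * gain)  # S'(n) = S(n) * m_curGain
    return out, gain


decay = adjust_duration(0.5, 10, 320)
adjusted, last_gain = apply_gain_ramp([1.0] * 4, 1.0, 2.0, 4)
```

Ramping the gain sample by sample avoids the audible discontinuity that an abrupt gain change at a frame boundary would cause, and carrying the last gain into the next frame keeps the ramp continuous across frames.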
In this embodiment, if there are at least two sound signals, then after step 30, generating the adjustment gain according to the speech loudness gain and the maximum tolerable gain, and before step 40, adjusting the gain of the sound signal according to the adjustment gain, the method may further include:
Step 50: calculate, for each channel, the difference between the speech loudness gain and the adjustment gain, determine the maximum difference among the channels, and adjust the adjustment gain Gain of each channel with the following formula:
Gain=2×Gain-LDiffMax-LGain;
where LDiffMax is the maximum difference and LGain is the speech loudness gain.
Specifically, in a virtual conference scenario there are usually multiple sound signals, for example M channels, and the adjustment gain of the m-th sound signal is Gain(m), m=1..M. When the speech loudness gain of each channel is calculated, the reference Lref is the same, so if every channel were adjusted by LGain(m), the energy after adjustment would be Lref. When the noise levels of the channels differ, however, the finally calculated Gain(m) differs from channel to channel, and the energies of the sound signals after adjustment by Gain(m) are different. The adjusted energy of each sound signal is aligned automatically in the following way, so that the output levels are the same.
First, the difference of each sound signal from Lref is calculated: LDiff(m)=LGain(m)-Gain(m). The maximum value LDiffMax is then selected, that is, LDiffMax=Max(LDiff(m)), and the adjustment gain Gain of each channel is adjusted with the formula Gain=2×Gain-LDiffMax-LGain.
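The channel alignment step can be written out literally as below. This is a minimal sketch that applies the formulas exactly as stated in the text; the function name and the example values are assumptions, and the sketch does not itself verify that the resulting output levels are equal.

```python
def align_channel_gains(lgains, gains):
    """Apply Gain(m) = 2*Gain(m) - LDiffMax - LGain(m) to every channel.

    lgains: speech loudness gains LGain(m), one per channel (dB)
    gains:  adjustment gains Gain(m), one per channel (dB)
    """
    ldiffs = [lg - g for lg, g in zip(lgains, gains)]  # LDiff(m)
    ldiff_max = max(ldiffs)                            # LDiffMax
    return [2 * g - ldiff_max - lg for lg, g in zip(lgains, gains)]


# Two hypothetical channels.
aligned = align_channel_gains([16.0, 10.0], [3.0, 4.0])  # [-23.0, -15.0]
```

Here LDiff is 13 dB for the first channel and 6 dB for the second, so LDiffMax is 13 dB and is subtracted from both channels.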
In practical applications, azimuth information can also be added to each gain-adjusted sound signal according to a preset scene, to synthesize 3D audio.
Fig. 3 is a schematic structural diagram of the first loudness control device provided by an embodiment of the present invention. As shown in Fig. 3, the loudness control device of this embodiment can implement each step of the volume control method provided by any embodiment of the present invention; the implementation process is not repeated here. The loudness control device can be integrated into an audio processing device or provided separately. The audio processing device can be used in a conference system.
The loudness control device provided by this embodiment specifically includes a detection unit 11, a first processing unit 12, a second processing unit 13, and a first adjustment unit 14. The detection unit 11 is configured to perform speech detection on each of at least one sound signal. The first processing unit 12 is connected to the detection unit 11 and is configured to, for each sound signal, determine the speech loudness gain of the voice signal according to a detected voice signal, and determine the maximum tolerable gain of the mute signal according to a detected mute signal. The second processing unit 13 is connected to the first processing unit 12 and is configured to generate an adjustment gain according to the speech loudness gain and the maximum tolerable gain. The first adjustment unit 14 is connected to the second processing unit 13 and is configured to adjust the gain of the sound signal according to the adjustment gain.
In the loudness control device provided by this embodiment, the detection unit 11 performs speech detection on each of at least one sound signal; for each sound signal, the first processing unit 12 determines the speech loudness gain of a detected voice signal and the maximum tolerable gain of a detected mute signal; the second processing unit 13 generates an adjustment gain according to the speech loudness gain and the maximum tolerable gain; and the first adjustment unit 14 adjusts the gain of the sound signal according to the adjustment gain. Because the maximum tolerable gain is determined by detecting the mute signal, and the adjustment gain is generated from both the speech loudness gain and the maximum tolerable gain, the adjusted sound signal better matches the perceived level of the human ear, which greatly improves the effect of loudness control.
Fig. 4 is a schematic structural diagram of the second loudness control device provided by an embodiment of the present invention. As shown in Fig. 4, in this embodiment, the detection unit 11 can specifically include a first processing subunit 21, a second processing subunit 22, and a judgment subunit 23. The first processing subunit 21 is configured to calculate, for each sound signal, the root mean square of the sound signal. The second processing subunit 22 is connected to the first processing subunit 21 and is configured to generate a signal envelope and a noise envelope from the root mean square. The judgment subunit 23 is connected to the second processing subunit 22 and is configured to calculate the ratio of the signal envelope to the noise envelope; if the ratio is greater than the first preset threshold, the voice signal is detected; otherwise, the mute signal is detected.
Specifically, the sound signal is a digital signal, and its energy can be determined by calculating its root mean square. The sound signal consists of a sequence of audio frames, each containing multiple sampling points; detecting the sound signal means detecting its audio frames. For example, let an audio frame be s(n), n=0, 1, ..., N-1, where N is the number of sampling points. When the frame length of s(n) is 20 ms, the root mean square Ecur of s(n) can be calculated with the following formula:
$$Ecur = 20 \times \log_{10} \sqrt{\frac{\sum_{n=0}^{N-1} s(n) \times s(n)}{N}}$$
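The frame-level calculation above can be sketched as follows. This is a minimal illustration rather than the patented implementation: the function name `frame_level_db` and the small floor that guards against taking the logarithm of zero are assumptions that do not appear in the source.

```python
import math


def frame_level_db(frame):
    """Return Ecur = 20 * log10(sqrt(sum(s(n) * s(n)) / N)) for one audio frame."""
    n = len(frame)
    if n == 0:
        raise ValueError("frame must contain at least one sample")
    mean_square = sum(x * x for x in frame) / n
    # Assumption: floor the RMS so that an all-zero frame does not hit log10(0).
    rms = max(math.sqrt(mean_square), 1e-10)
    return 20.0 * math.log10(rms)
```

For a 20 ms frame at an 8 kHz sampling rate, `frame` would hold N = 160 samples; the sampling rate is likewise only an example.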
The signal envelope Senv may specifically be generated from the root mean square Ecur as follows:
If Ecur is greater than thread1, then Senv=0.9 × Senv+0.1 × Ecur;
If Ecur is not greater than thread1, then Senv=0.998 × Senv+0.002 × Ecur;
The noise envelope Sno may specifically be generated from the root mean square Ecur as follows:
If Ecur is greater than thread2, then Sno=0.998 × Sno+0.002 × Ecur;
If Ecur is not greater than thread2, then Sno=0.9 × Sno+0.1 × Ecur;
The initial values of thread1 and thread2 can be set empirically, with thread1 >= thread2. During processing, thread1 and thread2 are further adjusted according to Senv and Sno. For example, thread2 = (Senv+Sno) × 0.5, and thread1 can be set equal to thread2 or slightly greater than thread2.
The ratio Senv/Sno of the signal envelope to the noise envelope is calculated, and it is judged whether Senv/Sno is greater than thread3. If so, the sound signal is a voice signal; otherwise, the sound signal is a noise signal.
It should be noted that the parameters in the above formulas can all be set and adjusted according to the actual processing situation.
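The envelope tracking and the ratio test described above can be sketched together as follows. This is a sketch under stated assumptions, not the patent's implementation: the class name `EnvelopeVad`, the default `thread3=1.5` and the initial level of 30 are hypothetical, and frame levels are assumed to be positive (for example, computed from integer PCM samples) so that the ratio Senv/Sno is well defined.

```python
class EnvelopeVad:
    """Frame-wise speech detector based on the Senv/Sno envelope ratio."""

    def __init__(self, thread3=1.5, init_level=30.0):
        # Assumption: both envelopes and both thresholds start at the same level.
        self.senv = init_level
        self.sno = init_level
        self.thread1 = init_level
        self.thread2 = init_level
        self.thread3 = thread3

    def update(self, ecur):
        """Consume one frame level Ecur and return True if speech is detected."""
        # Signal envelope: follows loud frames quickly, quiet frames slowly.
        if ecur > self.thread1:
            self.senv = 0.9 * self.senv + 0.1 * ecur
        else:
            self.senv = 0.998 * self.senv + 0.002 * ecur
        # Noise envelope: follows quiet frames quickly, loud frames slowly.
        if ecur > self.thread2:
            self.sno = 0.998 * self.sno + 0.002 * ecur
        else:
            self.sno = 0.9 * self.sno + 0.1 * ecur
        # Threshold update from the text: thread2 = (Senv + Sno) * 0.5,
        # with thread1 set equal to thread2.
        self.thread2 = (self.senv + self.sno) * 0.5
        self.thread1 = self.thread2
        return self.senv / self.sno > self.thread3
```

A caller would feed one Ecur value per frame and treat a `True` result as a voice frame and a `False` result as a mute (noise) frame.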
In this embodiment, the first processing unit 12 may specifically be configured to calculate the level of the mute signal according to the root mean square, and to determine the maximum tolerable gain according to the level of the mute signal.
In this embodiment, the first processing unit 12 may specifically be configured to calculate the level Noise_Level of the mute signal by applying the following formula:
Noise_Level=0.99×Noise_Level+0.01×Ecur;
where Ecur is the root mean square.
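The recursive smoothing of the mute-signal level can be sketched as below. Two points are assumptions and not taken from the source: the level is updated only on frames classified as mute, and the mapping from Noise_Level to a maximum tolerable gain (`max_tolerable_gain`, with a hypothetical `noise_ceiling_db`) is purely illustrative, since this passage does not give that mapping.

```python
def update_noise_level(noise_level, ecur, is_speech):
    """Apply Noise_Level = 0.99 * Noise_Level + 0.01 * Ecur on mute frames."""
    if is_speech:
        # Assumption: speech frames leave the mute-signal level unchanged.
        return noise_level
    return 0.99 * noise_level + 0.01 * ecur


def max_tolerable_gain(noise_level, noise_ceiling_db=40.0):
    """Hypothetical mapping: the gain that lifts the noise up to a ceiling."""
    # Assumption: noise may be amplified until it reaches noise_ceiling_db,
    # and is never attenuated by this term.
    return max(0.0, noise_ceiling_db - noise_level)
```

The smoothing factor 0.99 gives the level a long memory, so a single loud mute frame moves the estimate by only 1 % of the difference.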
In this embodiment, the first processing unit 12 may specifically be configured to perform loudness filtering on the voice signal, collect signal level statistics on the loudness-filtered voice signal, determine the level of the voice signal according to the statistical result, and determine the speech loudness gain according to the level of the voice signal.
Specifically, a loudness filter can be provided in the first processing unit 12, and loudness filtering is performed on the voice signal by this loudness filter. The loudness filter can be approximated by cascading a 10th-order high-pass IIR yulewalk filter with a 2nd-order Butterworth high-pass filter at 150 Hz. The parameters of the high-pass IIR yulewalk filter and the Butterworth high-pass filter can be set according to actual processing needs and are not limited to this embodiment.
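The Butterworth stage of the cascade can be sketched in plain Python as below. This is a partial illustration only: it takes 150 Hz as the cutoff frequency (an assumption), derives the coefficients with the standard bilinear-transform biquad formulas using Q = 1/sqrt(2), and omits the 10th-order yulewalk stage, whose coefficients are not given in the text. The function names are hypothetical.

```python
import math


def butterworth_highpass_biquad(cutoff_hz, sample_rate_hz):
    """Return (b, a) coefficients of a 2nd-order Butterworth high-pass filter."""
    w0 = 2.0 * math.pi * cutoff_hz / sample_rate_hz
    cos_w0 = math.cos(w0)
    # alpha = sin(w0) / (2 * Q) with Q = 1 / sqrt(2) for a Butterworth response.
    alpha = math.sin(w0) / math.sqrt(2.0)
    a0 = 1.0 + alpha
    b = [(1.0 + cos_w0) / (2.0 * a0),
         -(1.0 + cos_w0) / a0,
         (1.0 + cos_w0) / (2.0 * a0)]
    a = [1.0, -2.0 * cos_w0 / a0, (1.0 - alpha) / a0]
    return b, a


def filter_samples(b, a, samples):
    """Run a direct-form I biquad over a list of samples."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x in samples:
        y = b[0] * x + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        x2, x1 = x1, x
        y2, y1 = y1, y
        out.append(y)
    return out
```

In a full loudness filter the output of this stage would be cascaded with the yulewalk stage before the signal level statistics are collected.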
In this embodiment, the second processing unit 13 is specifically configured to generate the adjustment gain Gain by applying the following formula if the absolute value of the speech loudness gain is greater than a second preset threshold:
Gain=LGain×(1.0-(LGain+NGain)/(LGain×2));
where LGain is the speech loudness gain and NGain is the maximum tolerable gain.
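As a worked check of the formula above, it can be expanded algebraically. The expansion assumes LGain is nonzero, which holds whenever its absolute value exceeds a positive second preset threshold; the numeric values in the example are illustrative only.

$$Gain = LGain \times \left(1.0 - \frac{LGain + NGain}{LGain \times 2}\right) = LGain - \frac{LGain + NGain}{2} = \frac{LGain - NGain}{2}$$

For example, with LGain = 12 and NGain = 4, the original form gives 12 × (1.0 - 16/24) = 4, and the simplified form gives (12 - 4)/2 = 4.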
In this embodiment, the first adjustment unit 14 is specifically configured to determine an adjustment duration according to the sound signal, determine an adjustment step according to the adjustment gain and the adjustment duration, and perform gain adjustment on the sound signal according to the adjustment gain and the adjustment step.
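One way to read the step-wise adjustment is as a linear ramp toward the adjustment gain. The sketch below makes several assumptions not stated in the source: the adjustment duration is expressed as a number of frames, the step is the gain difference divided by that number, and gains are in decibels applied as a linear scale factor. The function names are hypothetical.

```python
def gain_steps(current_gain_db, target_gain_db, adjust_frames):
    """Return the per-frame gains of a linear ramp that ends at the target."""
    if adjust_frames < 1:
        raise ValueError("adjust_frames must be at least 1")
    # Adjustment step derived from the adjustment gain and adjustment duration.
    step = (target_gain_db - current_gain_db) / adjust_frames
    return [current_gain_db + step * (i + 1) for i in range(adjust_frames)]


def apply_gain_db(frame, gain_db):
    """Scale every sample of a frame by a gain given in decibels."""
    scale = 10.0 ** (gain_db / 20.0)
    return [x * scale for x in frame]
```

Ramping the gain over several frames avoids an audible jump that a single-frame change of the full adjustment gain could cause.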
In this embodiment, further, if there are at least two sound signals, the loudness control device may also include a second adjustment unit 15. The second adjustment unit 15 is connected to the first adjustment unit 14 and is configured to calculate, for each signal, the difference between the speech loudness gain and the adjustment gain, determine the maximum difference among the signals, and adjust the adjustment gain Gain of each signal by applying the following formula:
Gain=2×Gain-LDiffMax-LGain;
where LDiffMax is the maximum difference and LGain is the speech loudness gain.
In a virtual conference scenario there are usually multiple sound signals, for example M signals, and the adjustment gain of the m-th sound signal is Gain(m), m = 1..M. When the speech loudness gain of each signal is calculated, the same reference level Lref is used, so if each signal were adjusted by LGain(m) alone, the energy after adjustment would be Lref for every signal. When the noise levels of the signals differ, however, the finally calculated Gain(m) differ, and the energies of the sound signals after adjustment by Gain(m) differ as well. The adjustment gains of the signals are therefore aligned automatically in the following manner so that the output levels are the same.
First, the difference of each sound signal from Lref is calculated: LDiff(m) = LGain(m) - Gain(m). The maximum value LDiffMax is then selected, that is, LDiffMax = Max(LDiff(m)). Finally, the adjustment gain Gain of each signal is adjusted by the formula Gain = 2 × Gain - LDiffMax - LGain.
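The multi-signal alignment step can be transcribed as below. The sketch applies the formula exactly as printed in the text and does not independently verify that it equalizes the output levels; the function name `align_channel_gains` and the list-based interface are assumptions.

```python
def align_channel_gains(lgain, gain):
    """Apply Gain(m) = 2 * Gain(m) - LDiffMax - LGain(m) to every signal."""
    if not lgain or len(lgain) != len(gain):
        raise ValueError("lgain and gain must be non-empty and of equal length")
    # LDiff(m) = LGain(m) - Gain(m) for each of the M signals.
    ldiff = [lg - g for lg, g in zip(lgain, gain)]
    # LDiffMax = Max(LDiff(m)) over all signals.
    ldiff_max = max(ldiff)
    return [2.0 * g - ldiff_max - lg for lg, g in zip(lgain, gain)]
```

Here `lgain[m]` and `gain[m]` hold the speech loudness gain and the adjustment gain of the m-th signal, and the returned list holds the aligned adjustment gains in the same order.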
In practical applications, azimuth information may also be added to each gain-adjusted sound signal according to a preset scene, so as to synthesize 3D audio.
Fig. 5 is a schematic structural diagram of a third loudness control device provided by an embodiment of the present invention. As shown in Fig. 5, the loudness control device provided by this embodiment can specifically implement each step of the loudness control method provided by any embodiment of the present invention; the specific implementation process is not repeated here. The loudness control device provided by this embodiment specifically includes a processor 31 and a memory 32, and the memory 32 is used for storing instructions. The processor 31 is coupled to the memory 32 and is configured to execute the instructions stored in the memory 32. The processor 31 may be configured to: perform speech detection on each of at least one sound signal; for each sound signal, determine the speech loudness gain of the voice signal according to the detected voice signal, and determine the maximum tolerable gain of the mute signal according to the detected mute signal; generate an adjustment gain according to the speech loudness gain and the maximum tolerable gain; and perform gain adjustment on the sound signal according to the adjustment gain.
A person of ordinary skill in the art will appreciate that all or part of the steps of the above method embodiments can be completed by hardware under the control of program instructions. The program can be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes various media that can store program code, such as a ROM, a RAM, a magnetic disk or an optical disc.
Finally, it should be noted that the above embodiments are only intended to illustrate the technical solutions of the present invention and not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, or some of their technical features may be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (14)

1. A volume control method, characterized by comprising:
performing speech detection on each of at least one sound signal;
for each sound signal, determining a speech loudness gain of the voice signal according to the detected voice signal, and determining a maximum tolerable gain of the mute signal according to the detected mute signal;
generating an adjustment gain according to the speech loudness gain and the maximum tolerable gain;
performing gain adjustment on the sound signal according to the adjustment gain;
wherein the performing speech detection on each of at least one sound signal comprises:
for each sound signal, calculating the root mean square of the sound signal;
generating a signal envelope and a noise envelope respectively according to the root mean square;
calculating the ratio of the signal envelope to the noise envelope; if the ratio is greater than a first preset threshold, the voice signal is detected; otherwise, the mute signal is detected.
2. The volume control method according to claim 1, characterized in that the determining a maximum tolerable gain of the mute signal according to the detected mute signal is specifically:
calculating the level of the mute signal according to the root mean square, and determining the maximum tolerable gain according to the level of the mute signal.
3. The volume control method according to claim 2, characterized in that the calculating the level of the mute signal according to the root mean square is specifically:
calculating the level Noise_Level of the mute signal by applying the following formula:
Noise_Level=0.99 × Noise_Level+0.01 × Ecur;
where Ecur is the root mean square.
4. The volume control method according to claim 1, characterized in that the determining a speech loudness gain of the voice signal according to the detected voice signal is specifically:
performing loudness filtering on the voice signal, collecting signal level statistics on the loudness-filtered voice signal, determining the level of the voice signal according to the statistical result, and determining the speech loudness gain according to the level of the voice signal.
5. The volume control method according to claim 1, characterized in that the generating an adjustment gain according to the speech loudness gain and the maximum tolerable gain is specifically:
if the absolute value of the speech loudness gain is greater than a second preset threshold, generating the adjustment gain Gain by applying the following formula:
Gain=LGain × (1.0-(LGain+NGain)/(LGain × 2));
where LGain is the speech loudness gain and NGain is the maximum tolerable gain.
6. The volume control method according to claim 1, characterized in that the performing gain adjustment on the sound signal according to the adjustment gain is specifically:
determining an adjustment duration according to the sound signal, determining an adjustment step according to the adjustment gain and the adjustment duration, and performing gain adjustment on the sound signal according to the adjustment gain and the adjustment step.
7. The volume control method according to claim 1, characterized in that, if there are at least two sound signals, after the generating an adjustment gain according to the speech loudness gain and the maximum tolerable gain and before the performing gain adjustment on the sound signal according to the adjustment gain, the method further comprises:
calculating, for each signal, the difference between the speech loudness gain and the adjustment gain, determining the maximum difference among the signals, and adjusting the adjustment gain Gain of each signal by applying the following formula:
Gain=2 × Gain-LDiffMax-LGain;
where LDiffMax is the maximum difference and LGain is the speech loudness gain.
8. A loudness control device, characterized by comprising:
a detection unit, configured to perform speech detection on each of at least one sound signal;
a first processing unit, connected to the detection unit and configured to, for each sound signal, determine a speech loudness gain of the voice signal according to the detected voice signal, and determine a maximum tolerable gain of the mute signal according to the detected mute signal;
a second processing unit, connected to the first processing unit and configured to generate an adjustment gain according to the speech loudness gain and the maximum tolerable gain;
a first adjustment unit, connected to the second processing unit and configured to perform gain adjustment on the sound signal according to the adjustment gain;
wherein the detection unit comprises:
a first processing subunit, configured to calculate, for each sound signal, the root mean square of the sound signal;
a second processing subunit, connected to the first processing subunit and configured to generate a signal envelope and a noise envelope respectively according to the root mean square;
a judgment subunit, connected to the second processing subunit and configured to calculate the ratio of the signal envelope to the noise envelope; if the ratio is greater than a first preset threshold, the voice signal is detected; otherwise, the mute signal is detected.
9. The loudness control device according to claim 8, characterized in that the first processing unit is specifically configured to calculate the level of the mute signal according to the root mean square, and to determine the maximum tolerable gain according to the level of the mute signal.
10. The loudness control device according to claim 9, characterized in that the first processing unit is specifically configured to calculate the level Noise_Level of the mute signal by applying the following formula:
Noise_Level=0.99 × Noise_Level+0.01 × Ecur;
where Ecur is the root mean square.
11. The loudness control device according to claim 8, characterized in that the first processing unit is specifically configured to perform loudness filtering on the voice signal, collect signal level statistics on the loudness-filtered voice signal, determine the level of the voice signal according to the statistical result, and determine the speech loudness gain according to the level of the voice signal.
12. The loudness control device according to claim 8, characterized in that the second processing unit is specifically configured to generate the adjustment gain Gain by applying the following formula if the absolute value of the speech loudness gain is greater than a second preset threshold:
Gain=LGain × (1.0-(LGain+NGain)/(LGain × 2));
where LGain is the speech loudness gain and NGain is the maximum tolerable gain.
13. The loudness control device according to claim 8, characterized in that the first adjustment unit is specifically configured to determine an adjustment duration according to the sound signal, determine an adjustment step according to the adjustment gain and the adjustment duration, and perform gain adjustment on the sound signal according to the adjustment gain and the adjustment step.
14. The loudness control device according to claim 8, characterized in that, if there are at least two sound signals, the loudness control device further comprises:
a second adjustment unit, connected to the first adjustment unit and configured to calculate, for each signal, the difference between the speech loudness gain and the adjustment gain, determine the maximum difference among the signals, and adjust the adjustment gain Gain of each signal by applying the following formula:
Gain=2 × Gain-LDiffMax-LGain;
where LDiffMax is the maximum difference and LGain is the speech loudness gain.
CN201210460201.0A 2012-11-15 2012-11-15 Volume control method and device Active CN103812462B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210460201.0A CN103812462B (en) 2012-11-15 2012-11-15 Volume control method and device

Publications (2)

Publication Number Publication Date
CN103812462A CN103812462A (en) 2014-05-21
CN103812462B true CN103812462B (en) 2016-12-07

Family

ID=50708755

Country Status (1)

Country Link
CN (1) CN103812462B (en)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant