WO2011134415A1 - Audio signal switching method and device - Google Patents
Audio signal switching method and device Download PDFInfo
- Publication number
- WO2011134415A1 WO2011134415A1 PCT/CN2011/073479 CN2011073479W WO2011134415A1 WO 2011134415 A1 WO2011134415 A1 WO 2011134415A1 CN 2011073479 W CN2011073479 W CN 2011073479W WO 2011134415 A1 WO2011134415 A1 WO 2011134415A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- weight
- signal
- frequency band
- band signal
- speech
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
Definitions
- the embodiments of the present invention relate to the field of communications technologies, and in particular, to a voice and audio signal switching method and apparatus. Background technique
- the network will cut off the code stream of the speech and audio signals transmitted from the encoding end to the network, so that the decoding end will intercept the data.
- the subsequent code stream decodes speech and audio signals of different bandwidths.
- the narrowband signal mentioned in the present invention is a wideband signal which is switched to a low band component only and a high band component is empty by upsampling and low pass filtering, and the wideband speech and audio signal has both a low band signal component and a High frequency band signal component.
- the inventors have found that at least the following problems exist in the prior art: Since the narrowband speech audio signal and the wideband speech audio signal are different from each other in the high frequency band signal, the speech and audio signals of different bandwidths are switched. When the audio signal energy is excited, the user may feel uncomfortable and cause the quality of the user's audio signal to deteriorate.
- Embodiments of the present invention provide a method and device for switching voice and audio signals, which are implemented smoothly.
- the audio and video signals of different bandwidths are switched to improve the quality of the audio signals received by the user.
- the embodiment of the invention provides a method for switching voice and audio signals, including:
- the first high frequency band signal of the current frame audio signal and the second high frequency band signal of the previous M frame voice signal are weighted to obtain the processed first high frequency band signal.
- M is greater than or equal to 1;
- An embodiment of the present invention provides an audio signal switching apparatus, including:
- a processing module configured to perform weighting processing on the first high frequency band signal of the current frame speech audio signal and the second high frequency band signal of the previous M frame speech and audio signal when the audio signal is switched, to obtain the processed a high frequency band signal; wherein, M is greater than or equal to 1;
- a first synthesizing module configured to synthesize the processed first high frequency band signal with the first low frequency band signal of the current frame audio signal into a broadband signal.
- the voice-audio signal switching method and apparatus processes the first high-band signal of the current frame voice-audio signal according to the second high-band signal of the voice-audio signal of the previous M frame, so that the first M-frame
- the second high-band signal of the speech audio signal can smoothly transition to the processed first high-band signal, and combine the processed first high-band signal with the first low-band signal into a wide-band signal, thereby switching
- the speech/audio signal switching of different bandwidths can be smoothly performed, the influence of the subjective auditory quality difference of the speech and audio signals caused by the energy mutation is reduced, and the quality of the audio signal of the user is improved.
- Embodiment 1 is a flowchart of Embodiment 1 of a method for switching a voice signal according to the present invention
- 2 is a flowchart of Embodiment 2 of a method for switching a voice signal according to the present invention
- Figure 3 is a flow chart of the step 201 - an embodiment of Figure 2;
- step 302 in FIG. 3 is a flow chart of step 302 in FIG. 3;
- FIG. 5 is a flow chart 2 of another embodiment of step 302 in Figure 3;
- Figure 6 is a flow chart of the step 202 of Figure 2;
- FIG. 7 is a second flowchart of another embodiment of step 201 in Figure 2;
- Figure 8 is a third flowchart of another embodiment of step 201 in Figure 2;
- Embodiment 9 is a schematic structural diagram of Embodiment 1 of a speech audio signal switching apparatus according to the present invention.
- Embodiment 2 of a speech audio signal switching apparatus according to the present invention
- FIG 11 is a schematic structural diagram of a processing module in the second embodiment of the audio signal switching device of the present invention.
- FIG. 12 is a schematic structural diagram of a first module in Embodiment 2 of a speech audio signal switching apparatus according to the present invention.
- Figure 13a is a schematic diagram showing the structure of a processing module in the second embodiment of the voice-audio signal switching device of the present invention
- Figure 13b is a schematic structural view of the processing module in the second embodiment of the voice-audio signal switching device of the present invention.
- FIG. 1 is a flowchart of Embodiment 1 of a method for switching a voice signal according to the present invention.
- the voice-audio signal switching method of this embodiment when the voice-audio signal appears to be switched, after switching the frame
- Each frame is processed as follows:
- Step 101 When the speech signal is switched, the first high frequency band signal of the current frame speech audio signal and the second high frequency band signal of the previous M frame speech and audio signal are weighted to obtain the first high after processing. a band signal; wherein, M is greater than or equal to 1.
- Step 102 Synthesize the processed first high frequency band signal with the first low frequency band signal of the current frame speech audio signal into a wide frequency band signal.
- the pre-M frame speech and audio signal in the embodiment provided by the present invention refers to the M frame speech frequency signal before the current frame.
- the L frame speech and audio signal before switching refers to the L frame speech and audio signal before the frame is switched when the audio signal is switched.
- the current speech frame is a broadband signal and the previous frame is a narrowband signal; or the current speech frame is a narrowband signal and the previous frame is a wideband signal, then the speech and audio signals are switched, and the current speech frame is a switching frame. .
- the voice-audio signal switching method of the embodiment of the present invention processes the first high-band signal in the current frame voice-audio signal according to the second high-band signal in the pre-M frame voice-audio signal, so that the first M-frame is obtained.
- the second high-band signal in the speech audio signal can smoothly transition to the processed first high-band signal, thereby enabling high-band signals of different bandwidth speech and audio signals in switching between different bandwidth speech and audio signals.
- the smooth transition can be smoothly performed.
- the processed first high frequency band signal and the first low frequency band signal are combined into a wideband signal, and the wideband signal is transmitted to the user terminal, so that the user enjoys a high quality speech and audio signal.
- the speech and audio signal switching of different bandwidths can be smoothly performed, the influence of the subjective auditory quality difference of the speech and audio signals caused by the energy excitation is reduced, and the quality of the audio signal of the user is improved.
- FIG. 2 is a flowchart of Embodiment 2 of a method for switching a voice signal according to the present invention. As shown in FIG. 2, the voice and audio signal switching method in this embodiment includes:
- Step 200 Synthesize the first high frequency band signal of the current frame speech audio signal with the first low frequency band signal into a broadband signal when no switching occurs.
- the first frequency band audio signal in this embodiment may be a wideband speech audio signal or a narrowband speech audio signal.
- the first frequency band when the first frequency band is voiced
- the frequency signal does not switch, it is processed in the following two cases: 1. If the first band speech and audio signal is a wideband speech and audio signal, the low frequency band signal and the high frequency band signal in the wideband speech and audio signal are synthesized. Broadband signal; 2. If the first frequency band audio signal is a narrowband speech and audio signal, the low frequency band signal and the high frequency band signal in the narrow band speech and audio signal are combined into a wideband signal, at this time, although it is a wide band Signal, but the high band is empty, no information.
- Step 201 When the speech signal is switched, the first high frequency band signal of the current frame speech audio signal and the second high frequency band signal of the previous M frame speech and audio signal are weighted to obtain the first high after processing. Frequency band signal. Where M is greater than or equal to 1.
- the first high frequency band signal of the current frame voice signal is processed according to the second high frequency band signal of the previous M frame voice signal, so that the front M
- the second high-band signal of the frame speech audio signal can smoothly transition to the processed first high-band signal, for example, when the wide-band speech audio signal is switched to the narrow-band speech audio signal, due to the narrow-band speech audio signal corresponding to The high-band signal is empty, so in order to smoothly switch the wideband speech and audio signal to the narrow-band speech and audio signal, it is necessary to recover the components of the high-band signal corresponding to the narrow-band speech and audio signal, and when the narrow-band speech and audio signal is switched to Wideband speech and audio signals, since the high-band signal in the wideband speech and audio signal is not empty, in order to smoothly switch the narrow-band speech and audio signals to the wide-band speech and audio signals, it is necessary to reduce the continuous multi-frame wide-band speech and audio after switching.
- the energy of the high-band signal in the signal causing the high-band signal of the wideband speech and audio signal to gradually transition to the true high-band signal.
- the current frame speech and audio signal is processed in step 201, so that the high-band signal in the speech/audio signals of different bandwidths can smoothly transition, and when the switching between the wide-band speech audio signal and the narrow-band speech audio signal is solved, The user's hearing is uncomfortable due to the energy stimuli, so that the user receives a high quality audio signal.
- the first high-band signal and the second high-band signal of the pre-M frame speech and audio signal may be directly weighted, and the result obtained after the processing is The first high frequency band signal after processing.
- Step 202 The first high frequency band signal after processing and the first low frequency of the current frame voice signal Signaling a wideband signal with a signal.
- the second high frequency band signal of the pre-M frame speech and audio signal can be smoothly transitioned to the processed first high frequency band signal of the current frame, and then step 202 is performed.
- step 202 is performed.
- the smooth switching of the signal is beneficial to improve the quality of the audio signal received by the user.
- the voice-audio signal switching method of the embodiment of the present invention processes the first high-band signal in the current frame voice-audio signal according to the second high-band signal of the previous M-frame voice signal, so that the first M-frame language
- the second high-band signal of the audio signal can smoothly transition to the processed first high-band signal, thereby smoothing the high-band signal of the speech and audio signals of different bandwidths during the switching of the speech and audio signals of different bandwidths. Transition switching;
- the processed first high frequency band signal and the first low frequency band signal are combined into a wideband signal, and the wideband signal is transmitted to the user terminal, so that the user enjoys a high quality voice signal.
- the speech and audio signal switching method can smoothly switch the speech and audio signals of different bandwidths, reduce the influence of the subjective auditory quality difference of the audio signal caused by the energy excitation, and improve the quality of the audio signal received by the user. Further, by synthesizing the first high band signal of the current frame speech audio signal and the first low band signal into a wide band signal when the speech/audio signal switching of the different bandwidth does not occur, the user is allowed to obtain a high quality audio signal.
- step 201 in this embodiment includes:
- Step 301 Predict the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal.
- the speech and audio signals can be decomposed into two parts: fine structure information and envelope information, so that the speech and audio signals can be restored according to the fine structure information and the envelope information.
- fine structure information and envelope information In the process of switching from a wideband speech and audio signal to a narrowband speech and audio signal, since only a low frequency band signal is present in the narrowband speech and audio signal, the corresponding high frequency band signal is empty, in order to smooth the wideband speech and audio signal.
- Switching to the narrowband speech and audio signal requires recovery of the high frequency band signal required by the current narrowband speech and audio signal to achieve smooth switching of the speech and audio signals.
- Step 301 in this embodiment predicts the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal in the narrowband speech and audio signal.
- step 301 may further perform signal classification on the first low frequency band signal of the current frame speech audio signal; and then according to the first low frequency band.
- the signal type corresponding to the signal predicts predicted fine structure information and predicted envelope information corresponding to the first high frequency band signal.
- the narrowband speech and audio signal of the current frame may be a harmonic signal, or a non-harmonic signal or a transient signal, and the like, according to the information type corresponding to the narrowband speech and audio signal, the signal of the type should be known.
- the fine structure information and the envelope information are provided to more accurately predict the fine structure information and the envelope information corresponding to the high frequency band signal of the current frame.
- the speech audio signal switching method of the present invention does not limit the signal type of the narrowband speech and audio signal.
- Step 302 Perform weighting processing on the predicted envelope signal and the first M frame envelope information corresponding to the second high frequency band signal of the pre-M frame speech and audio signal to obtain the first information about the first high frequency band signal.
- the prediction may be based on the predicted envelope information and the second high frequency of the pre-M frame speech and audio signals.
- the first M-frame envelope information corresponding to the signal is generated, and the first packet information corresponding to the first high-band signal is generated.
- the process of generating the first envelope information corresponding to the first high-band signal in step 302 can be implemented in the following two manners, as follows:
- an embodiment of obtaining the first envelope information by step 302 may include:
- Step 401 Calculate a correlation between the first low-band signal and the low-band signal of the audio signal of the first N frame according to the first low-band signal and the low-band signal of the audio signal of the first N frame. Number; where N is greater than or equal to 1.
- the speech signal of the first N frames may be a mixed signal composed of a narrowband speech audio signal, a wideband speech audio signal or a narrowband speech audio signal and a wideband speech audio signal.
- Step 402 Determine whether the correlation coefficient is within a given first threshold range.
- the correlation coefficient is calculated in step 401, it is determined whether the correlation coefficient is within a given threshold range.
- the function of calculating the correlation coefficient is to know whether the current frame speech audio signal is fading from the speech signal of the previous N frame or is abrupt, that is to say, whether their characteristics are the same, and then judging the prediction of the current frame speech audio signal.
- the high frequency corresponding to the speech and audio signal of the previous frame Envelope information with envelope information or transitions occupies a larger weight; otherwise, if the first low-band signal of the current frame-audio signal differs greatly from the low-band signal energy of the speech signal of the previous frame, and the type is different , indicating that the speech signal of the previous frame has a lower correlation with the current frame speech audio signal. Therefore, in order to accurately restore the first envelope information corresponding to the current frame speech audio signal, the corresponding corresponding frame audio signal is restored.
- the high-band envelope information or the transition envelope information corresponding to the audio-video signal of the previous frame occupies a smaller weight;
- Step 403 if the correlation coefficient is not within the given first threshold range, according to the set
- the first first weight 1 and the first weight 2 are weighted to calculate the first envelope information.
- the first weight 1 is a weight value of the previous frame envelope information corresponding to the high frequency band signal of the previous frame speech and audio signal
- the first weight 2 is a weight value of the envelope information.
- step 402 concludes that the correlation coefficient is not within the given first threshold range, it may be known that the current frame speech audio signal has a small correlation with the previous N frame speech and audio signals, so the front M
- the first M frame envelope information or the transition envelope information corresponding to the first band speech and audio signal of the frame or the high band envelope information corresponding to the speech signal of the previous frame has little influence on the first envelope information, and is restored.
- the first envelope information corresponding to the current frame audio signal is the first M frame envelope information corresponding to the first band of the first M frame, or the transition envelope information or the high frequency corresponding to the previous frame of the speech signal Envelope information has a smaller weight.
- the first envelope information of the current frame can be calculated.
- the first weight one is a weight value of the envelope information corresponding to the high frequency band signal of the previous frame speech and audio signal, and the previous frame speech and audio signal may be a wideband speech audio signal or a processed narrow frequency band language.
- the audio signal when switched for the first time, the speech audio signal of the previous frame is the wideband speech and audio signal; and the first weight 2 is the weight value of the predicted envelope information.
- the product of the predicted envelope information and the first weight two is added, and the sum of the envelope information of the previous frame and the first weight one is added, and the sum of the weights obtained is the first envelope information of the current frame.
- the speech and audio signals transmitted later are restored in this manner and weights, and the first envelope information corresponding to the audio signal is restored until the speech and audio signals are switched again.
- Step 404 If the correlation coefficient is within the first threshold range, perform weighting processing according to the set second weight one and the second weight two to calculate transition envelope information.
- the second weight 1 is a weight value of the envelope information before the handover
- the second weight 2 is a weight value of the envelope information of the previous M frame; where L is greater than or equal to 1.
- step 402 finds that the correlation coefficient is within a given threshold range, it can be known that the current frame speech audio signal has similar characteristics to the speech signal of the previous consecutive N frames, and the current frame speech audio signal corresponds to the first
- An envelope information is greatly affected by the envelope information of the speech signal of the previous consecutive N frames, and considering the authenticity of the envelope of the previous M frame, therefore, the envelope information according to the previous M frame needs to be
- the envelope information before the handover is used to solve the transition envelope information corresponding to the current frame speech and audio signal, and the envelope information of the previous M frame and the envelope information of the pre-switch L frame are restored when the first envelope information of the current frame speech and audio signal is restored. Takes a larger weight; then solves the first envelope information through the transition envelope information.
- the second weight 1 is a weight value of the envelope information before the handover
- the second weight 2 is a weight value of the envelope information of the previous M frame. Then, the product of the envelope information before the switching and the second weight one, plus the sum of the product of the previous M frame envelope information and the second weight 2, the obtained weighted value is the transition envelope information.
- Step 405 Decrease the second weight one by one in the first weight step, and increase the second weight two by using the first weight step.
- the subsequent narrow-band speech and audio signals are gradually reduced by the influence of the wide-band speech and audio signals before switching, and in order to make the calculated first envelope information more accurate, the Two weights one and two weights two adjust the applicability. Since the subsequent audio signal is gradually reduced by the influence of the wideband speech and audio signal of the L frame before switching, the value of the second weight one gradually becomes smaller, and the value of the second weight two gradually increases, thereby weakening the pre-switching The effect of the envelope information on the first envelope information.
- the step 405 may modify the second weight one and the second weight two by using the following method: the new second weight one is equal to the old second weight one minus the first weight step, and the new second weight two is equal to the old The second weight 2 is added to the first weight step; wherein the first weight step is a set value.
- Step 406 Determine whether the third weight 1 that has been set is greater than the first weight one.
- the third weight 1 is a weight value of the transition envelope information.
- the third weight 1 By comparing the magnitudes of the third weight one with the second weight one, it can be known that the first envelope information of the current frame is affected by the transition envelope information.
- the transition envelope information is calculated from the envelope information of the first M frame and the envelope information before the handover. Therefore, the third weight 1 actually represents the degree of influence of the envelope information before the first envelope information is switched. .
- Step 407 If the third weight is not greater than the first weight one, perform weighting processing according to the first weight 1 and the first weight 2 that have been set to calculate the first envelope information.
- the third weight one is less than or equal to the first weight
- the current frame speech audio signal is far from the speech signal of the L frame before the handover, and the first envelope information is mainly affected by the envelope information of the previous M frame, and therefore, according to the first weight 1 and the first weight that have been set.
- the first envelope information of the current frame can be calculated.
- Step 408 If the third weight one is greater than the first weight one, perform weighting processing according to the set third weight one and third weight two to calculate the first envelope information.
- the third weight 1 is a weight value of the transition envelope information
- the third weight 2 is a weight value of the predicted envelope information.
- the third weight 1 is a weight value of the transition envelope information
- the third weight 2 is a weight value of the predicted envelope information. Then, the product of the transition envelope information and the third weight one, and the weighted value obtained by adding the sum of the predicted envelope information and the third weight two is the first envelope information.
- Step 409 Decrease the third weight one by the second weight step, and increase the third weight two by the second weight step, until the third weight one is equal to zero.
- the purpose of modifying the third weight one and the third weight two in step 409 is the same as the purpose of modifying the second weight one and the second weight two in step 405, both for the purpose of the subsequent transmission of the audio and video signals before being switched.
- the applicability is adjusted for the third weight one and the third weight two. Since the subsequent audio signal is gradually reduced by the influence of the L frame speech and audio signal before the switching, the value of the third weight one gradually becomes smaller, and the value of the third weight two gradually increases, thereby also achieving the weakening before the switching. The effect of the envelope information on the first envelope information.
- the step 409 may modify the third weight one and the third weight two by the following method: the new third weight one is equal to the old third weight one minus the second weight step, and the new third weight two is equal to the old The third weight 2 is added to the second weight step; wherein the second weight step is a set value.
- the sum of the first weight one and the first weight two is one, the sum of the second weight one and the second weight two is one, the sum of the third weight one and the third weight two is one; the initial value of the third weight one is greater than First right
- the initial value of the first one; the first weight one and the first weight two are fixed constants.
- the weight one and the weight two in the embodiment actually represent the percentage of the envelope information before the handover and the first M-frame envelope information composing the first envelope information of the current frame. For the current frame audio and video signal, the closer to the speech and audio signal of the L frame before the handover and the greater the correlation, the higher the percentage of the envelope information before the handover, and the opposite of the pre-M frame envelope information. The lower the percentage.
- the current frame speech audio signal When the current frame speech audio signal is far away from the speech signal of the L frame before switching, it indicates that the speech/audio signal has been stably transmitted in the network, or when the correlation between the current frame speech audio signal and the pre-switching L frame speech audio signal is low, It indicates that the current frame audio signal characteristics have changed. Therefore, the current frame speech and audio signals are less affected by the speech and audio signals of the L frames before switching, and the percentage of the envelope information before switching is lower.
- step 404 and step 405 in this embodiment may be interchanged, that is, the second weight one and the second weight two may be modified first, and then the transition is calculated according to the second weight one and the second weight two.
- Envelope information may be interchanged.
- step 408 and step 409 in this embodiment may be interchanged, that is, the third weight 1 and the third weight 2 may be modified first, and then the third weight 1 and the third weight 2 may be modified according to the modification.
- First envelope information may be interchanged, that is, the second weight one and the second weight two may be modified first, and then the transition is calculated according to the second weight one and the second weight two.
- step 408 and step 409 in this embodiment may be interchanged, that is, the third weight 1 and the third weight 2 may be modified first, and then the third weight 1 and the third weight 2 may be modified according to the modification.
- Manner 2 as shown in FIG. 5, another embodiment of obtaining the first envelope information by using step 302 may further include:
- Step 501 Calculate a correlation coefficient between the first low frequency band signal and the low frequency signal of the speech signal of the previous frame according to the first low frequency band signal of the current frame speech audio signal and the low frequency signal of the speech signal of the previous frame.
- the correlation coefficient may be represented by "corr”
- the current frame speech audio signal is obtained by the energy relationship between the first low frequency band signal of the current frame speech audio signal and the low frequency band signal of the speech signal of the previous frame.
- Correlation coefficient corr between the first low-band signal and the low-band signal of the speech signal of the previous frame the smaller the energy difference, the larger the corr, otherwise, corr The smaller.
- Step 502 Determine whether the correlation coefficient is within a given second threshold range.
- this embodiment can represent the second threshold range as cl ⁇ c2.
- Step 503 If the correlation coefficient is not within the second threshold range, perform weighting processing according to the first weight 1 and the first weight 2 that have been set to calculate the first envelope information.
- the first weight one is the weight value of the previous frame envelope information corresponding to the high frequency band signal of the speech audio signal of the previous frame, and the first weight 2 is the weight value of the predicted envelope information;
- the first weight two is a fixed constant.
- the step 502 is that the corr is smaller than cl or greater than c2
- the first envelope information corresponding to the current frame audio signal is less affected by the envelope information of the previous frame of the voice signal, so
- the first weight one and the first weight two the first envelope information of the current frame can be calculated.
- the product of the predicted envelope information and the first weight two, plus the sum of the product of the previous frame envelope information and the first weight one, the weighted sum obtained is the first envelope information of the current frame.
- the narrowband audio signal transmitted later recovers the first envelope information corresponding to the narrowband audio signal in this manner and the weight until the speech and audio signals of different bandwidths are switched again.
- the first weight 1 in this embodiment may be represented by al
- the first weight 2 may be represented by bl
- the previous frame envelope information may be represented by pre_fenv
- the predicted envelope information may be represented by fenv
- Information can be represented by cur_fenv.
- Step 504 If the correlation coefficient is within the second threshold range, determine whether the second weight 1 that has been set is greater than the first weight one.
- the second weight 1 is a weight value of the envelope information before the handover corresponding to the high frequency band signal of the previous frame of the audio signal.
- Step 505 If the second weight is not greater than the first weight one, calculate the first envelope information according to the first weight one and the first weight two that have been set.
- Step 506 If the second weight one is greater than the first weight one, perform weighting processing according to the second weight one and the second weight 2 that has been set to calculate the first envelope information.
- the second weight 2 is a weight value of the predicted envelope information.
- the second weight one can be represented by a2, and the second weight two can be represented by b2.
- step 504 when it is determined in step 504 that the second weight 1 is greater than the first weight, the current frame speech audio signal is closer to the first frequency band audio signal of the previous frame, and the first envelope information is switched by the previous frame.
- the envelope information before the switching of the speech and audio signals has a large influence. Therefore, based on the second weight one and the second weight two that have been set, the first envelope information of the current frame can be calculated. Then, the product of the predicted envelope information and the second weight 2, plus the sum of the product of the envelope information before the switching and the second weight, the obtained weighted sum is the first envelope information of the current frame.
- the envelope information before the handover can be represented by con_fenv
- Step 507 Decrease the second weight one by the second weight step, and add the second weight two by the second weight step.
- the subsequent current frame speech and audio signals are gradually reduced by the influence of the pre-switched audio signal, and in order to make the calculated first envelope information more accurate, the second weight is required.
- One and the second weight 2 are adjusted for applicability. Since the subsequent audio signal is gradually reduced by the influence of the previous frame of the speech and audio signal; and close to the current frame speech and audio signal The influence of a frame of audio and video signals becomes larger. Therefore, the value of the second weight one gradually becomes smaller, and the value of the second weight two gradually increases, thereby weakening the influence of the envelope information before the switching on the first envelope information, and enhancing the envelope information of the prediction to the first The impact of envelope information.
- the step 507 may modify the second weight one and the second weight two by using the following method: the new second weight one is equal to the old second weight one minus the first weight step, and the new second weight two is equal to the old The second weight 2 is added to the first weight step; wherein the first weight step is a set value.
- the sum of the first weight one and the first weight two is one, and the sum of the second weight one and the second weight two is one; the initial value of the second weight one is greater than the initial value of the first weight one.
- Step 303 Generate a processed first high frequency band signal according to the first envelope information and the predicted fine structure information.
- the required processed first high-band signal may be generated according to the first envelope information and the predicted fine structure information, so that the second The high frequency band signal can smoothly transition to the processed first high frequency band signal.
- the processed current frame is obtained by the predicted fine structure information and the first envelope information. a first high-band signal, so that the second high-band signal of the wide-band speech and audio signal before the switching can be smoothly transitioned to the processed first high-band signal corresponding to the narrow-band speech and audio signal, and more It is beneficial to improve the quality of the audio signal received by the user.
- step 202 in this embodiment includes:
- Step 601 Determine whether the processed first high frequency band signal needs to be attenuated according to the current frame speech audio signal and the switching of the speech and audio signals of the previous frame.
- the processed first high-frequency is processed.
- the energy with the signal is attenuated frame by frame until the attenuation coefficient reaches a given threshold.
- the interval between the current frame speech audio signal and the speech/audio signal of the previous frame can be known by the current frame audio signal and the audio signal of the previous frame.
- the narrowband speech and audio signal can be recorded by the counter.
- the number of frames transmitted. This frame number can be a value that is predetermined to be greater than or equal to zero.
- Step 602 Combine the processed first high frequency band signal with the first low frequency band signal into a wideband signal if attenuation is not required.
- step 601 if it is determined in step 601 that the processed first high-band signal does not need to be attenuated, the processed first high-band signal and the first low-band signal are directly combined into a wide-band signal.
- Step 603 If attenuation is required, determine whether the attenuation factor corresponding to the processed first high frequency band signal is greater than a threshold.
- the initial value of the attenuation factor is one; the threshold is less than one and greater than or equal to zero. If it is determined in step 601 that the processed first high frequency band signal needs to be attenuated, it is determined in step 603 whether the attenuation factor corresponding to the processed first high frequency band signal is greater than a given threshold.
- Step 604 If the attenuation factor is not greater than a given threshold, multiply the processed first high-band signal by a threshold, and then synthesize the broadband signal with the first low-band signal.
- the value of the attenuation factor is not greater than a given threshold in step 603, it indicates that the energy of the processed first high-band signal has been attenuated to a certain extent, and the processed first high-band signal has been It will not bring bad effects, and you can maintain this attenuation ratio in the future. Then, the processed first high frequency band signal is multiplied by a threshold value, and then the wideband signal is synthesized with the first low frequency band signal.
- Step 605 If the attenuation factor is greater than a given threshold, multiplying the processed first high-band signal by an attenuation factor, and then synthesizing the broadband signal with the first low-band signal.
- step 603 finds that the value of the attenuation factor is greater than a given threshold, it indicates that the first high-band signal after processing may cause a bad hearing effect at the attenuation factor, and further Attenuate until a given threshold. Then, the processed first high frequency band signal is multiplied by the attenuation factor, and then the wide frequency band signal is synthesized with the first low frequency band signal.
- Step 606 Modify the attenuation factor to reduce the attenuation factor. Specifically, with the transmission of the speech and audio signals, the subsequent narrowband audio signals are gradually reduced by the influence of the speech and audio signals before switching, and correspondingly, the attenuation factor should also be gradually reduced.
- an implementation of the processed first high frequency band signal is obtained through step 201 in this embodiment. Examples include:
- Step 701 Perform weighting processing according to the set fourth weight one and fourth weight two to calculate the processed first high frequency band signal.
- the fourth weight 1 is a weight value of the second high frequency band signal
- the fourth weight 2 is a weight value of the first high frequency band signal of the current frame audio signal.
- the high frequency band signal corresponding to the narrowband speech and audio signal is Empty or processed high-band signals, in order to enable smooth switching of narrow-band speech and audio signals to wide-band speech and audio signals, energy attenuation of high-band signals in wide-band speech and audio signals is required to implement speech and audio signals. Smooth switching.
- the obtained weighted value is the processed first high frequency band signal.
- Step 702 Decrease the fourth weight one by the third weight step, and increase the fourth weight two by the third weight step, until the fourth weight one is equal to zero. The sum of the fourth weight one and the fourth weight two is one.
- the fourth weight gradually becomes smaller, and the fourth weight 2 gradually increases until the fourth weight 1 becomes zero, and the fourth weight 2 becomes one, that is, the transmitted speech and audio signal is always a broadband audio signal.
- another embodiment of obtaining the processed first high-band signal by step 201 in this embodiment may further include:
- Step 801 Perform weighting processing according to the set fifth weight one and fifth weight two to calculate the processed first high frequency band signal.
- the fifth weight one is a fixed parameter that has been set
- the fifth weight 2 is the weight value of the first high frequency band signal of the current frame speech audio signal.
- a fixed parameter may be set instead of the high frequency band signal of the narrowband speech and audio signal, wherein the fixed parameter is one greater than or equal to zero less than the first A constant of the energy of a high frequency band signal.
- the weighted value obtained by the product of the fixed parameter and the fifth weight one plus the product of the first high frequency band signal and the fifth weight two is the processed first high frequency band signal.
- Step 802 Decrease the fifth weight one in units of the fourth weight step, and increase the fifth weight two in units of the fourth weight step, until the fifth weight one is equal to zero; wherein, the fifth weight one and the fifth weight two The sum of one.
- the subsequent wideband speech and audio signals are gradually reduced by the influence of the narrowband speech and audio signals before switching. Therefore, the fifth weight gradually becomes smaller, and the fifth weight 2 gradually increases until the fifth weight becomes zero, and the fifth weight 2 becomes one, that is, the transmitted speech and audio signals are always true broadband words. audio signal.
- the high frequency band signal of the wideband speech and audio signal is attenuated and processed.
- the high-band signal enables the high-band signal corresponding to the narrow-band speech and audio signal before the switching to smoothly transition to the processed high-band signal corresponding to the wide-band speech and audio signal, which is more conducive to improving the user's listening audio.
- the quality of the signal is more conducive to improving the user's listening audio.
- the envelope information in this embodiment may also be replaced by other parameters capable of representing a high-band signal, such as: Linear Predictive Coding (LPC) parameters, or amplitude parameters.
- LPC Linear Predictive Coding
- FIG. 9 is a schematic structural diagram of Embodiment 1 of a speech audio signal switching apparatus according to the present invention.
- the audio signal switching apparatus of this embodiment includes: a processing module 91 and a first synthesizing module 92.
- the processing module 91 is configured to perform weighting processing on the first high frequency band signal of the current frame speech audio signal and the second high frequency band signal of the previous M frame speech and audio signal when the speech signal is switched, to obtain the processed A high frequency band signal.
- M is greater than or equal to 1.
- the first synthesis module 92 is configured to synthesize the processed first high frequency band signal with the first low frequency band signal of the current frame speech audio signal into a wide frequency band signal.
- the speech/audio signal switching device of the embodiment of the present invention processes, by the processing module, the first high-band signal in the current frame speech and audio signal according to the second high-band signal in the pre-M frame speech and audio signal, so that The two high-band signals can smoothly transition to the processed first high-band signal, so that the high-band signals of the different bandwidth speech and audio signals can be smoothly switched during the process of switching the speech and audio signals of different bandwidths; Finally, the processed first high frequency band signal and the first low frequency band signal are combined by the first synthesis module to synthesize a wideband signal, and the wideband signal is transmitted to the user terminal, so that the user enjoys a high quality speech and audio signal.
- the method for switching the speech and audio signals of the embodiment can smoothly perform the switching of the speech and audio signals of different bandwidths, reduce the influence of the subjective auditory quality difference of the speech and audio signals caused by the energy excitation, and improve the quality of the audio signal of the user.
- FIG. 10 is a schematic structural diagram of Embodiment 2 of a speech audio signal switching apparatus according to the present invention. As shown in FIG. 10, the audio signal switching apparatus of this embodiment is based on the first embodiment of the audio signal switching apparatus. The difference is that the audio signal switching apparatus of this embodiment further includes: a second combining module 103.
- the second synthesizing module 103 is configured to synthesize the first high band signal and the first low band signal into a wide band signal when the switching of the speech signal does not occur.
- the first low frequency in the first frequency band audio signal of the current frame may be used by the second combining module without switching the voice and audio signals of different bandwidths.
- the band signal and the first high frequency band signal are combined to form a wideband signal, thereby facilitating the improvement of the quality of the user's audio and video signals.
- the processing module 101 in this embodiment includes:
- the prediction module 1011 is configured to predict predicted fine structure information and predicted envelope information corresponding to the first high-band signal.
- the first generation module 1012 is configured to perform weighting processing according to the predicted envelope information and the pre-M frame envelope information corresponding to the second high-band signal of the pre-M frame speech and audio signal, to obtain a first corresponding to the first high-band signal.
- An envelope information is configured to perform weighting processing according to the predicted envelope information and the pre-M frame envelope information corresponding to the second high-band signal of the pre-M frame speech and audio signal, to obtain a first corresponding to the first high-band signal.
- the second generation module 1013 is configured to generate the processed first high frequency band signal according to the first envelope information and the predicted fine structure information.
- the audio signal switching apparatus of this embodiment may further include: a classification module 1010, configured to perform signal classification on the first low frequency band signal of the current frame speech audio signal; and the prediction module 1011 is further configured to use the first low frequency band signal according to the signal The corresponding signal type predicts predicted fine structure information and predicted envelope information corresponding to the first low frequency band signal of the current frame speech audio signal.
- the predicted fine structure information and the predicted envelope information corresponding to the first high frequency band signal are predicted by the prediction module, so that the first generation module and the second generation module can be accurately generated and processed.
- the first high-band signal so that the first high-band signal can be smoothly transitioned to the processed first high-band signal, which is more advantageous for improving the quality of the user's audio and video signals.
- the first low-band signal of the current frame speech and audio signal is classified by the classification module, and then the prediction module obtains the predicted fine structure information and the predicted envelope information according to the signal type, thereby making the predicted fine structure information and prediction.
- the envelope information is more accurate, and the quality of the speech and audio signals received by the user is higher.
- the first synthesizing module 102 in this embodiment includes:
- the first judging module 1021 is configured to judge whether the processed first high-band signal needs to be attenuated according to the current frame speech audio signal and the speech/audio signal of the previous frame.
- the third synthesizing module 1022 is configured to: if the first judging module 1021 obtains that the processed first high frequency band signal does not need to be attenuated, synthesize the processed first high frequency band signal and the first low frequency band signal into a wide width Frequency band signal.
- the second determining module 1023 is configured to determine whether the processed first high frequency band signal needs to be attenuated if the first determining module 1021 determines that the processed first high frequency band signal has an attenuation factor greater than a given threshold.
- the fourth synthesizing module 1024 is configured to: if the second judging module 1023 finds that the attenuation factor is not greater than a given threshold, multiply the processed first high-band signal by a threshold, and then synthesize the broadband with the first low-band signal. With signal.
- the fifth synthesizing module 1025 is configured to: if the second judging module 1023 obtains that the attenuation factor is greater than a given threshold, multiply the processed first high-band signal by an attenuation factor, and then synthesize the broadband with the first low-band signal. With signal.
- the first modification module 1026 is for modifying the attenuation factor to reduce the attenuation factor.
- the initial value of the attenuation factor is one; the threshold is less than one and greater than or equal to zero.
- the wide-band signal obtained by processing the current frame-audio signal can be more accurate, which is more advantageous for improving the user's listening to the audio signal. quality.
- the processing module 101 in this embodiment includes:
- the first calculating module 1011a is configured to perform weighting processing according to the set fourth weight one and fourth weight two to calculate the processed first high frequency band signal; wherein, the fourth weight one is the second high frequency a weight value with a signal, and a fourth weight 2 is a weight value of the first high frequency band signal;
- the second modification module 1012a is configured to reduce the fourth weight one by a third weight step, and add the fourth weight two by a third weight step, until the fourth weight one is equal to zero; wherein, the fourth weight is one The sum of the fourth weight two is one.
- the processing module 101 in this embodiment may further include:
- the second calculation module 101 lb is configured to perform the second weight and the fifth weight according to the set fifth weight Performing a weighting process to calculate a processed first high-band signal; wherein, the fifth weight one is a weight value of the fixed parameter that has been set, and the fifth weight two is a weight value of the first high-band signal;
- the third modification module 1012b is configured to reduce the fifth weight one by the fourth weight step, and add the fifth weight two by the fourth weight step, until the fifth weight one is equal to zero; wherein, the fifth weight one The sum of the fifth weight and the second weight is one; wherein, the fixed parameter is a constant greater than or equal to zero and less than the energy value of the first high frequency band signal.
- the speech/audio signal switching device is processed by attenuating the high-band signal of the wide-band speech and audio signal.
- the high-band signal enables the high-band signal corresponding to the narrow-band speech and audio signal before the switching to smoothly transition to the processed high-band signal corresponding to the wide-band speech and audio signal, which is more conducive to improving the user's listening audio.
- the quality of the signal is more conducive to improving the user's listening audio.
Abstract
Description
Claims
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BR112012013306A BR112012013306B8 (en) | 2010-04-28 | 2011-04-28 | method and apparatus for switching speech or audio signals |
EP17151713.9A EP3249648B1 (en) | 2010-04-28 | 2011-04-28 | Method and apparatus for switching speech or audio signals |
EP11774406.0A EP2485029B1 (en) | 2010-04-28 | 2011-04-28 | Audio signal switching method and device |
AU2011247719A AU2011247719B2 (en) | 2010-04-28 | 2011-04-28 | Method and apparatus for switching speech or audio signals |
KR1020127012328A KR101377547B1 (en) | 2010-04-28 | 2011-04-28 | Method and apparatus for switching speech or audio signals |
JP2012541316A JP5667202B2 (en) | 2010-04-28 | 2011-04-28 | Audio signal switching method and device |
ES11774406.0T ES2635212T3 (en) | 2010-04-28 | 2011-04-28 | Procedure and device for switching audio signals |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010163406.3 | 2010-04-28 | ||
CN2010101634063A CN101964189B (en) | 2010-04-28 | 2010-04-28 | Audio signal switching method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2011134415A1 true WO2011134415A1 (en) | 2011-11-03 |
Family
ID=43517042
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2011/073479 WO2011134415A1 (en) | 2010-04-28 | 2011-04-28 | Audio signal switching method and device |
Country Status (8)
Country | Link |
---|---|
EP (2) | EP3249648B1 (en) |
JP (3) | JP5667202B2 (en) |
KR (1) | KR101377547B1 (en) |
CN (1) | CN101964189B (en) |
AU (1) | AU2011247719B2 (en) |
BR (1) | BR112012013306B8 (en) |
ES (2) | ES2635212T3 (en) |
WO (1) | WO2011134415A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8214218B2 (en) | 2010-04-28 | 2012-07-03 | Huawei Technologies Co., Ltd. | Method and apparatus for switching speech or audio signals |
US10720170B2 (en) | 2016-02-17 | 2020-07-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Post-processor, pre-processor, audio encoder, audio decoder and related methods for enhancing transient processing |
US11929084B2 (en) | 2014-07-28 | 2024-03-12 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and decoder using a frequency domain processor with full-band gap filling and a time domain processor |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101110800B1 (en) * | 2003-05-28 | 2012-07-06 | 도꾸리쯔교세이호진 상교기쥬쯔 소고겡뀨죠 | Process for producing hydroxyl group-containing compound |
US11431312B2 (en) | 2004-08-10 | 2022-08-30 | Bongiovi Acoustics Llc | System and method for digital signal processing |
US8284955B2 (en) | 2006-02-07 | 2012-10-09 | Bongiovi Acoustics Llc | System and method for digital signal processing |
US10158337B2 (en) | 2004-08-10 | 2018-12-18 | Bongiovi Acoustics Llc | System and method for digital signal processing |
US10848118B2 (en) | 2004-08-10 | 2020-11-24 | Bongiovi Acoustics Llc | System and method for digital signal processing |
US10848867B2 (en) | 2006-02-07 | 2020-11-24 | Bongiovi Acoustics Llc | System and method for digital signal processing |
US10701505B2 (en) | 2006-02-07 | 2020-06-30 | Bongiovi Acoustics Llc. | System, method, and apparatus for generating and digitally processing a head related audio transfer function |
CN101964189B (en) * | 2010-04-28 | 2012-08-08 | 华为技术有限公司 | Audio signal switching method and device |
CN105469805B (en) | 2012-03-01 | 2018-01-12 | 华为技术有限公司 | A kind of voice frequency signal treating method and apparatus |
CN105761724B (en) * | 2012-03-01 | 2021-02-09 | 华为技术有限公司 | Voice frequency signal processing method and device |
CN103516440B (en) * | 2012-06-29 | 2015-07-08 | 华为技术有限公司 | Audio signal processing method and encoding device |
CN103971693B (en) * | 2013-01-29 | 2017-02-22 | 华为技术有限公司 | Forecasting method for high-frequency band signal, encoding device and decoding device |
US9883318B2 (en) | 2013-06-12 | 2018-01-30 | Bongiovi Acoustics Llc | System and method for stereo field enhancement in two-channel audio systems |
US9906858B2 (en) | 2013-10-22 | 2018-02-27 | Bongiovi Acoustics Llc | System and method for digital signal processing |
US9397629B2 (en) * | 2013-10-22 | 2016-07-19 | Bongiovi Acoustics Llc | System and method for digital signal processing |
US9524720B2 (en) * | 2013-12-15 | 2016-12-20 | Qualcomm Incorporated | Systems and methods of blind bandwidth extension |
CN103714822B (en) * | 2013-12-27 | 2017-01-11 | 广州华多网络科技有限公司 | Sub-band coding and decoding method and device based on SILK coder decoder |
KR101864122B1 (en) * | 2014-02-20 | 2018-06-05 | 삼성전자주식회사 | Electronic apparatus and controlling method thereof |
US10639000B2 (en) | 2014-04-16 | 2020-05-05 | Bongiovi Acoustics Llc | Device for wide-band auscultation |
US10820883B2 (en) | 2014-04-16 | 2020-11-03 | Bongiovi Acoustics Llc | Noise reduction assembly for auscultation of a body |
AU2019252524A1 (en) | 2018-04-11 | 2020-11-05 | Bongiovi Acoustics Llc | Audio enhanced hearing protection system |
CN110556116B (en) * | 2018-05-31 | 2021-10-22 | 华为技术有限公司 | Method and apparatus for calculating downmix signal and residual signal |
US10959035B2 (en) | 2018-08-02 | 2021-03-23 | Bongiovi Acoustics Llc | System, method, and apparatus for generating and digitally processing a head related audio transfer function |
CN112002333B (en) * | 2019-05-07 | 2023-07-18 | 海能达通信股份有限公司 | Voice synchronization method and device and communication terminal |
CN117373465B (en) * | 2023-12-08 | 2024-04-09 | 富迪科技(南京)有限公司 | Voice frequency signal switching system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050027516A1 (en) * | 2003-07-16 | 2005-02-03 | Samsung Electronics Co., Ltd. | Wide-band speech signal compression and decompression apparatus, and method thereof |
CN101335002A (en) * | 2007-11-02 | 2008-12-31 | 华为技术有限公司 | Method and apparatus for audio decoding |
CN101425292A (en) * | 2007-11-02 | 2009-05-06 | 华为技术有限公司 | Decoding method and device for audio signal |
CN101964189A (en) * | 2010-04-28 | 2011-02-02 | 华为技术有限公司 | Audio signal switching method and device |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4330689A (en) * | 1980-01-28 | 1982-05-18 | The United States Of America As Represented By The Secretary Of The Navy | Multirate digital voice communication processor |
US4769833A (en) * | 1986-03-31 | 1988-09-06 | American Telephone And Telegraph Company | Wideband switching system |
US5019910A (en) * | 1987-01-29 | 1991-05-28 | Norsat International Inc. | Apparatus for adapting computer for satellite communications |
FI115329B (en) * | 2000-05-08 | 2005-04-15 | Nokia Corp | Method and arrangement for switching the source signal bandwidth in a communication connection equipped for many bandwidths |
US7113522B2 (en) * | 2001-01-24 | 2006-09-26 | Qualcomm, Incorporated | Enhanced conversion of wideband signals to narrowband signals |
JP2005080079A (en) * | 2003-09-02 | 2005-03-24 | Sony Corp | Sound reproduction device and its method |
FI119533B (en) * | 2004-04-15 | 2008-12-15 | Nokia Corp | Coding of audio signals |
JPWO2005106848A1 (en) * | 2004-04-30 | 2007-12-13 | 松下電器産業株式会社 | Scalable decoding apparatus and enhancement layer erasure concealment method |
WO2006011444A1 (en) * | 2004-07-28 | 2006-02-02 | Matsushita Electric Industrial Co., Ltd. | Relay device and signal decoding device |
JP4989971B2 (en) * | 2004-09-06 | 2012-08-01 | パナソニック株式会社 | Scalable decoding apparatus and signal loss compensation method |
WO2006075663A1 (en) * | 2005-01-14 | 2006-07-20 | Matsushita Electric Industrial Co., Ltd. | Audio switching device and audio switching method |
US8249861B2 (en) * | 2005-04-20 | 2012-08-21 | Qnx Software Systems Limited | High frequency compression integration |
EP1898397B1 (en) * | 2005-06-29 | 2009-10-21 | Panasonic Corporation | Scalable decoder and disappeared data interpolating method |
US8194865B2 (en) * | 2007-02-22 | 2012-06-05 | Personics Holdings Inc. | Method and device for sound detection and audio control |
JP5547081B2 (en) * | 2007-11-02 | 2014-07-09 | 華為技術有限公司 | Speech decoding method and apparatus |
-
2010
- 2010-04-28 CN CN2010101634063A patent/CN101964189B/en active Active
-
2011
- 2011-04-28 AU AU2011247719A patent/AU2011247719B2/en active Active
- 2011-04-28 BR BR112012013306A patent/BR112012013306B8/en active IP Right Grant
- 2011-04-28 ES ES11774406.0T patent/ES2635212T3/en active Active
- 2011-04-28 JP JP2012541316A patent/JP5667202B2/en active Active
- 2011-04-28 EP EP17151713.9A patent/EP3249648B1/en active Active
- 2011-04-28 EP EP11774406.0A patent/EP2485029B1/en active Active
- 2011-04-28 ES ES17151713T patent/ES2718947T3/en active Active
- 2011-04-28 KR KR1020127012328A patent/KR101377547B1/en active IP Right Grant
- 2011-04-28 WO PCT/CN2011/073479 patent/WO2011134415A1/en active Application Filing
-
2014
- 2014-12-11 JP JP2014250689A patent/JP6027081B2/en active Active
-
2016
- 2016-10-13 JP JP2016201593A patent/JP6410777B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050027516A1 (en) * | 2003-07-16 | 2005-02-03 | Samsung Electronics Co., Ltd. | Wide-band speech signal compression and decompression apparatus, and method thereof |
CN101335002A (en) * | 2007-11-02 | 2008-12-31 | 华为技术有限公司 | Method and apparatus for audio decoding |
CN101425292A (en) * | 2007-11-02 | 2009-05-06 | 华为技术有限公司 | Decoding method and device for audio signal |
CN101964189A (en) * | 2010-04-28 | 2011-02-02 | 华为技术有限公司 | Audio signal switching method and device |
Non-Patent Citations (1)
Title |
---|
See also references of EP2485029A4 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8214218B2 (en) | 2010-04-28 | 2012-07-03 | Huawei Technologies Co., Ltd. | Method and apparatus for switching speech or audio signals |
US11929084B2 (en) | 2014-07-28 | 2024-03-12 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and decoder using a frequency domain processor with full-band gap filling and a time domain processor |
US10720170B2 (en) | 2016-02-17 | 2020-07-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Post-processor, pre-processor, audio encoder, audio decoder and related methods for enhancing transient processing |
US11094331B2 (en) | 2016-02-17 | 2021-08-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Post-processor, pre-processor, audio encoder, audio decoder and related methods for enhancing transient processing |
Also Published As
Publication number | Publication date |
---|---|
CN101964189A (en) | 2011-02-02 |
ES2635212T3 (en) | 2017-10-02 |
JP2017033015A (en) | 2017-02-09 |
EP3249648A1 (en) | 2017-11-29 |
JP2015045888A (en) | 2015-03-12 |
BR112012013306A2 (en) | 2016-03-01 |
JP5667202B2 (en) | 2015-02-12 |
CN101964189B (en) | 2012-08-08 |
AU2011247719A1 (en) | 2012-06-07 |
BR112012013306B1 (en) | 2020-11-10 |
BR112012013306B8 (en) | 2021-02-17 |
KR101377547B1 (en) | 2014-03-25 |
AU2011247719B2 (en) | 2013-07-11 |
ES2718947T3 (en) | 2019-07-05 |
EP2485029A1 (en) | 2012-08-08 |
JP2013512468A (en) | 2013-04-11 |
EP2485029A4 (en) | 2013-01-02 |
EP3249648B1 (en) | 2019-01-09 |
JP6410777B2 (en) | 2018-10-24 |
JP6027081B2 (en) | 2016-11-16 |
EP2485029B1 (en) | 2017-06-14 |
KR20120074303A (en) | 2012-07-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2011134415A1 (en) | Audio signal switching method and device | |
US10559313B2 (en) | Speech/audio signal processing method and apparatus | |
US9779749B2 (en) | Audio signal coding method and apparatus | |
US8214218B2 (en) | Method and apparatus for switching speech or audio signals | |
WO2021052285A1 (en) | Frequency band expansion method and apparatus, electronic device, and computer readable storage medium | |
KR101924767B1 (en) | Voice frequency code stream decoding method and device | |
JP2022526403A (en) | Frequency band expansion methods, devices, electronic devices and computer programs | |
JP2022548299A (en) | Audio encoding method and apparatus | |
WO2014000559A1 (en) | Processing method for speech or audio signals and encoding apparatus thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11774406 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 3907/CHENP/2012 Country of ref document: IN |
|
REEP | Request for entry into the european phase |
Ref document number: 2011774406 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011774406 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 20127012328 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011247719 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012541316 Country of ref document: JP |
|
ENP | Entry into the national phase |
Ref document number: 2011247719 Country of ref document: AU Date of ref document: 20110428 Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112012013306 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 112012013306 Country of ref document: BR Kind code of ref document: A2 Effective date: 20120601 |