WO2014000559A1 - 语音频信号处理方法和编码装置 - Google Patents

语音频信号处理方法和编码装置 Download PDF

Info

Publication number
WO2014000559A1
WO2014000559A1 PCT/CN2013/076862 CN2013076862W WO2014000559A1 WO 2014000559 A1 WO2014000559 A1 WO 2014000559A1 CN 2013076862 W CN2013076862 W CN 2013076862W WO 2014000559 A1 WO2014000559 A1 WO 2014000559A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
harmonic
bandwidth
audio signal
speech
Prior art date
Application number
PCT/CN2013/076862
Other languages
English (en)
French (fr)
Inventor
胡晨
刘泽新
苗磊
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to KR1020187028697A priority Critical patent/KR102005967B1/ko
Priority to KR1020197021968A priority patent/KR102165827B1/ko
Priority to EP20150138.4A priority patent/EP3748634B1/en
Priority to JP2015518805A priority patent/JP6359529B2/ja
Priority to EP13810131.6A priority patent/EP2851897B1/en
Priority to KR1020177030314A priority patent/KR101907494B1/ko
Priority to EP17195365.6A priority patent/EP3376499B1/en
Priority to ES13810131.6T priority patent/ES2654488T3/es
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to KR1020207028813A priority patent/KR102331531B1/ko
Priority to KR1020167035415A priority patent/KR101790680B1/ko
Priority to KR1020157000174A priority patent/KR101689138B1/ko
Publication of WO2014000559A1 publication Critical patent/WO2014000559A1/zh
Priority to US14/562,494 priority patent/US10056090B2/en
Priority to US16/051,139 priority patent/US11107486B2/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • G10L19/265Pre-filtering, e.g. high frequency emphasis prior to encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters

Definitions

  • Embodiments of the present invention relate to communication technologies, and in particular, to a speech and audio signal processing method and an encoding device. Background technique
  • voice, image, audio, and video transmissions have a wide range of applications, such as cell phone calls, audio and video conferencing, broadcast television, and multimedia entertainment.
  • the audio and video signals are digitized and transmitted from one terminal to another through a communication network, where the terminals can be mobile phones, digital telephone terminals or any other type of audio and audio terminals, such as VOIP phones or ISDN phones, computers, Cable communication phone.
  • the voice and audio signals are compressed and processed at the transmitting end and transmitted to the receiving end, and the receiving end recovers the voice and audio signals by the decompression process and plays them.
  • the bandwidth of the speech and audio signals changes from time to time.
  • the reason for the change of the bandwidth of the audio signal of the language may be the change of the state of the network, or the change of the speech/audio signal itself in the bandwidth, or other factors that can cause the speech and audio signals to be in the high frequency band signal and the low frequency band signal.
  • the factor of switching between. The process of switching such speech audio signals between high and low frequency bands is called bandwidth switching.
  • the network status changes frequently, and the network bandwidth narrows as the network status deteriorates. Accordingly, the speech and audio signals also need to switch between the high band signal and the low band signal as the network bandwidth changes.
  • the speech and audio signals need to be reduced from the high frequency band signal to the low frequency band signal, and when the network condition is restored, the speech and audio signal needs to be restored from the low frequency band signal to the high frequency band signal.
  • the bandwidth of the high-band signal and the low-band signal is a relative concept. For example, the bandwidth of the high-band signal is 0 to 16 kHz, the bandwidth of the low-band signal is 0 to 8 kHz, or the bandwidth of the high-band signal.
  • Embodiments of the present invention provide a voice and audio signal processing method and an encoding apparatus based on bandwidth switching.
  • An embodiment of the present invention provides a voice and audio signal processing method based on bandwidth switching, including: if a first bandwidth speech and audio signal is a harmonic signal, adjusting a second bandwidth speech and audio signal to be determined as a harmonic signal a decision condition to improve the probability that the second bandwidth speech and audio signal is determined to be a harmonic signal; the first bandwidth speech signal is a signal before bandwidth switching, and the second bandwidth speech audio signal is after bandwidth switching signal of;
  • An embodiment of the present invention further provides an encoding apparatus, including:
  • a decision condition adjustment module configured to: if the first bandwidth speech and audio signal is a harmonic signal, adjust a second bandwidth speech and audio signal to be determined as a harmonic signal, and obtain a first decision condition to improve the second bandwidth language
  • the audio signal is determined as a harmonic signal;
  • the first bandwidth voice signal is a signal before bandwidth switching, and the second bandwidth voice signal is a bandwidth switched signal;
  • a signal type determining module configured to determine, according to the first decision condition, whether the second bandwidth speech and audio signal is a harmonic signal.
  • the encoding apparatus may determine whether the first bandwidth speech and audio signal before the bandwidth switching is a harmonic signal, and in the case of determining that the harmonic signal is, adopting the second bandwidth speech and audio signal after adjusting the bandwidth switching.
  • the manner in which the harmonic signal is judged by the condition, the decision condition of whether the second bandwidth speech and audio signal after the bandwidth switching is a harmonic signal is relaxed, and the possibility that the second bandwidth speech and audio signal is judged as a harmonic signal is increased as much as possible, therefore,
  • the signal type of the speech and audio signal is kept as consistent as possible before and after the switching, so that the decoded speech and audio signals of the decoding end device are as continuous as possible, thereby improving the quality of the voice communication service.
  • Embodiment 1 is a flowchart of Embodiment 1 of a method for processing a speech audio signal according to the present invention
  • Embodiment 2 is a flowchart of Embodiment 2 of a method for processing a speech audio signal according to the present invention
  • Embodiment 3 is a flowchart of Embodiment 3 of a method for processing a speech audio signal according to the present invention
  • Embodiment 4 is a flowchart of Embodiment 4 of a method for processing a speech audio signal according to the present invention
  • Embodiment 5 is a flowchart of Embodiment 5 of a method for processing a speech audio signal according to the present invention
  • FIG. 6 is a schematic structural diagram of a coding end device provided with the coding apparatus of the present invention
  • FIG. 7 is a schematic structural diagram of Embodiment 1 of the coding apparatus according to the present invention
  • FIG. 8 is a schematic structural diagram of Embodiment 2 of an encoding apparatus according to the present invention.
  • the technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention.
  • the embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
  • the speech audio signal processing method of the present invention can be applied to an audio encoder.
  • audio codecs are widely used in various electronic devices, such as: mobile phones, wireless devices, personal data assistants (PDAs), handheld or portable computers, GPS receivers/navigators, cameras, audio/ Video player, video camera, video recorder, monitoring equipment, etc.
  • PDAs personal data assistants
  • Such an electronic device includes an audio encoder or an audio decoder, and the audio encoder or decoder may be directly implemented by a digital circuit or a chip such as a DSP (digital signal processor), or may be executed by a software code driven processor in the software code. The process is implemented.
  • DSP digital signal processor
  • Embodiment 1 is a flowchart of Embodiment 1 of a method for processing a voice signal according to the present invention, as shown in FIG.
  • the method of this embodiment may include:
  • Step 101 If the first bandwidth speech and audio signal is a harmonic signal, adjusting a second bandwidth speech and audio signal to be determined as a judgment condition of the harmonic signal to obtain a first determination condition, to improve the second bandwidth speech and audio signal is The decision is the possibility of a harmonic signal.
  • the first bandwidth speech and audio signal is a speech and audio signal before bandwidth switching
  • the second bandwidth speech and audio signal is a speech and audio signal after bandwidth switching.
  • Step 102 Determine, according to the first decision condition, whether the second bandwidth speech and audio signal is a harmonic signal.
  • the high-band signal may be an ultra-wideband signal
  • the low-band signal may be a wide-band signal.
  • a person skilled in the art may set a signal having a bandwidth range above a certain range as an ultra-wideband signal according to requirements, otherwise, a broadband signal, for example,
  • a signal with a bandwidth range of 0 to 8 kHz or more can be set as an ultra-wideband signal
  • a signal having a bandwidth of 0 to 8 kHz or less is a wideband signal.
  • the ultra-wideband signal can be divided into a harmonic signal, a normal signal, a transient signal, and a noise signal
  • the wideband signal can be divided into a harmonic signal and a normal signal.
  • the first bandwidth speech and audio signal in this embodiment may be an ultra-wideband signal
  • the switched second bandwidth speech and audio signal may be a wideband signal
  • the first bandwidth speech and audio signal may be a wideband signal
  • the switched second bandwidth language The audio signal can be an ultra wideband signal.
  • the signal type can be one of a harmonic signal, a normal signal, a transient signal, and a noise signal.
  • the signal type can be one of a harmonic signal and a normal signal.
  • the encoding device can determine the signal type of the ultra-wideband signal by using a harmonic signal decision condition corresponding to the ultra-wideband signal.
  • the encoding device can use the harmonic signal decision corresponding to the wideband signal.
  • the condition determines the signal type of the wideband signal.
  • the harmonic signal decision condition corresponding to the ultra-wideband signal and the harmonic signal decision condition corresponding to the wideband signal require the information of the previous frame signal as the reference information when performing the decision of the harmonic signal.
  • the encoding device still uses the signal before the bandwidth switching as the signal type after the bandwidth switching.
  • the reference information may switch the signal type during bandwidth switching. For example, the speech signal before the bandwidth switching is a harmonic signal, but the speech signal after the bandwidth switching may be judged as a transient signal.
  • the encoding end can encode the harmonic signal before the bandwidth switching by using the encoding method for the harmonic signal, and encode the non-harmonic signal after the bandwidth switching by using the encoding method for the non-harmonic signal, after which the encoding end can
  • the encoded signal is sent to the decoding end, and after receiving the encoded signal, the decoding end can decode the encoded signal by using a corresponding decoding method, so that the harmonic signal and the non-harmonic signal can be recovered. Since the harmonic signal and the non-harmonic signal are far apart in signal characteristics, the output of the two signals will make the speech and audio signals of the user at the decoding end intermittent. For the switching between the three signal types of the noise signal, the transient signal and the ordinary signal in the non-harmonic signal, the decoded audio signal does not have much influence on the decoding device.
  • the encoding device may determine whether the first bandwidth speech and audio signal before the bandwidth switching is a harmonic signal. If the first bandwidth speech and audio signal before the bandwidth switching is a harmonic signal, the encoding device may adjust the harmonic signal decision condition to improve the possibility that the second bandwidth speech and audio signal after the bandwidth switching is determined to be a harmonic signal. Therefore, in the case of bandwidth switching, when the speech/audio signal after the bandwidth switching is judged, the signal type of the speech and audio signal is not changed as much as possible, so that the signal type of the speech and audio signal received by the decoding end device is before and after the bandwidth switching.
  • the same decoding method can be used for decoding, so as to ensure the continuity of the speech and audio signals as much as possible, and only when the switched second bandwidth speech and audio signal does not satisfy the relaxed harmonic signal decision condition, That is, when the harmonic component in the second bandwidth speech and audio signal is really small, the signal type of the second bandwidth speech and audio signal is changed.
  • the second bandwidth speech and audio signal is a wideband signal
  • the first bandwidth speech and audio signal is a wideband signal
  • the second bandwidth speech and audio signal is Ultra-wideband signals.
  • the first bandwidth speech and audio signal is an ultra-wideband signal
  • the second bandwidth speech and audio signal is a broadband signal.
  • the encoding device may determine whether the ultra-wideband signal before the bandwidth switching is a harmonic signal or a non-harmonic signal by using a harmonic signal decision condition corresponding to the ultra-wideband signal, and the non-harmonic signal is a transient signal and a noise. One of a signal and a normal signal. If the result of the decision is a harmonic signal, the encoding device may relax the harmonic signal decision condition corresponding to the wideband signal to obtain a first decision condition, and according to the first decision condition, whether the wideband signal after the bandwidth switching is a harmonic signal Judge.
  • the harmonic signal decision condition corresponding to the wideband signal is relaxed, the possibility that the switched wideband signal is judged as a harmonic signal is improved, so that the signal type before and after the bandwidth switching is not changed as much as possible, thereby enabling the decoding end
  • the speech and audio signals decoded by the device are as continuous as possible.
  • the encoding apparatus may determine whether the first bandwidth speech and audio signal before the bandwidth switching is a harmonic signal, and if the harmonic signal is determined, adjust the harmonic of the second bandwidth speech and audio signal after the bandwidth switching.
  • the embodiment can make the signal type of the speech and audio signal consistent as before and after the switching, so that the decoded speech and audio signals of the decoding end device are as continuous as possible, thereby improving the quality of the voice communication service.
  • the encoding device may further include: before performing the step 102 of the method embodiment shown in FIG.
  • the encoding device may adjust the harmonic signal decision condition to improve the second bandwidth speech and audio signal to be determined to be harmonic.
  • the wave signal determines the threshold, thereby reducing the likelihood that the second bandwidth speech and audio signal is judged to be a harmonic signal. That is, if the first bandwidth before bandwidth switching If the speech signal is a non-harmonic signal, such as a noise signal, a transient signal or a normal signal, the second bandwidth speech and audio signal after the bandwidth switching can be judged as a noise signal as much as possible by raising the threshold of the harmonic signal. Transient signals or ordinary signals, and as far as possible not judged as harmonic signals.
  • the encoding end does not change the signal type of the speech and audio signals as much as possible during the bandwidth switching. For the decoding end, the decoded speech and audio signals are also as continuous as possible.
  • the second bandwidth speech and audio signal is a wideband signal
  • the first bandwidth speech and audio signal is a wideband signal
  • the second bandwidth speech and audio signal is ultra-wideband. signal.
  • the harmonic signal decision condition and the non-harmonic signal decision condition corresponding to the ultra-wideband signal used in the following embodiments and the harmonic signal decision condition and the non-harmonic signal decision condition corresponding to the wideband signal are detailed. Introduction. It should be noted that the following embodiments use the signal type decision condition specified in the standard as an example to determine whether the speech signal is a harmonic signal or a non-harmonic signal. Those skilled in the art can understand that these decision conditions can be It is modified according to the processing method of the speech and audio signals.
  • the signal type of the UWB signal can be determined in the following manner:
  • each time domain signal is respectively multiplied by a proportional coefficient as a time domain signal for determining the time domain envelope parameter value.
  • Step 1) ⁇ Step 3) Determine the condition for the transient signal.
  • the frequency domain signal of the current speech and audio signal is divided into multiple frequency bands, and one frequency domain amplitude peak is calculated in each frequency band, and then according to these frequencies
  • the peak value of the domain amplitude, the average of the frequency domain amplitude peaks of the frequency band, and the bandwidth of the frequency band calculate the harmonic characteristic value of each frequency band;
  • step 6 determining whether the harmonic characteristic value of each frequency band is greater than a given threshold and whether the frequency domain amplitude peak of the frequency band is greater than a given threshold T2, and if both are greater than, determining that the frequency band is a harmonic frequency band, and performing step 6), Otherwise, it is determined whether the harmonic characteristic value is less than a given threshold T3, and if it is less, the frequency band is determined to be a noise band, otherwise it is an ordinary frequency band;
  • the harmonic mode counter is updated, for example, the count value is decremented by 1, and then it is judged whether the harmonic mode count value is greater than a given threshold ⁇ 8, and if so, the current speech signal is judged as Harmonic signal
  • Step 4) ⁇ Step 8) is the harmonic signal decision condition.
  • the harmonic mode counter is an optional function.
  • the maximum peak parameter value of the current speech and audio signal is less than or equal to a given threshold ⁇ 4
  • the number of harmonic bands is less than or equal to a given threshold ⁇ 5 and the global energy ratio is not given.
  • the threshold value ( ⁇ 6, ⁇ 7) is within the range
  • the harmonic mode counter can be used as a reference for determining whether the current speech signal is a harmonic signal. If the number of previously accumulated harmonic signals exceeds a given threshold ⁇ 8, then the It is highly probable that the continuous speech signal is a harmonic signal, and even if it does not satisfy the first three conditions, the current speech signal can be judged as a harmonic signal.
  • the decision principle of the ultra-wideband signals is similar to the harmonic signal decision conditions. Specifically: When the encoding device determines whether the current speech signal is a harmonic signal, it is only necessary to determine whether the harmonic band number and the maximum peak parameter value are greater than a given threshold T5, ⁇ 4, respectively, and if so, determining that the current speech signal is a harmonic signal, and Increase the value of the harmonic mode counter, for example, increase its count value by one; if the two cannot be satisfied at the same time, reduce the value of the harmonic mode counter, for example, the count value minus 1, and then judge whether the count value of the harmonic mode counter at this time is It is greater than the given threshold ⁇ 8. If yes, it determines that the current speech and audio signal is a harmonic signal, otherwise it is a normal signal.
  • the first bandwidth speech and audio signal is an ultra-wideband signal
  • the second bandwidth speech and audio signal is a wideband signal
  • the bandwidth is switched from ultra-wideband.
  • the signal is switched to the broadband signal.
  • the method in this embodiment may include:
  • Step 201 Calculate a harmonic band number and a maximum peak parameter of the wideband signal after the bandwidth switching. For example, the foregoing step 6) can be implemented, and details are not described herein again.
  • Step 202 Update the harmonic mode count value according to the harmonic band number, the maximum peak parameter, and the harmonic signal judgment condition of the wideband signal.
  • the step can be implemented, for example, by using the above step 7). It should be noted that, for the wideband signal, it is not necessary to calculate the global energy ratio, but only the harmonic signal number and the maximum in the harmonic signal decision condition of the wideband signal.
  • the decision about whether the wideband signal after the bandwidth switching is a harmonic signal or a non-harmonic signal is based on the objective signal type of the wideband signal, and the updated harmonic mode counter is used for the subsequent speech and audio signals.
  • Step 203 determining whether the ultra-wideband signal before the bandwidth switching is a harmonic signal, and if so, executing step 204, otherwise performing step 206;
  • step 203 may be performed before step 204, and It is not limited that it must be executed after steps 201 and 202. In the actual processing, it can be executed before the bandwidth switching.
  • Step 204 Decrease at least one of a harmonic band number threshold and a maximum peak parameter threshold in a harmonic signal decision condition of the wideband signal.
  • the step 204 needs to relax the condition of determining the bandwidth-switched wideband signal as a harmonic signal.
  • the harmonic band number threshold T5 in the harmonic signal decision condition of the wideband signal and at least one of the maximum peak parameter thresholds ⁇ 4 can be reduced.
  • the adjustment mode of the reduction of ⁇ 4 and ⁇ 5 is relatively large in terms of the adjustment method of the harmonic signal decision condition with respect to the adjustment mode of reducing only ⁇ 4 or only ⁇ 5.
  • the threshold value of the reduced harmonic frequency band can be recorded as T51, and ⁇ 51 ⁇ ⁇ 5, and the reduced maximum peak parameter threshold is recorded as ⁇ 41, and ⁇ 41 ⁇ ⁇ 4.
  • the T51 can be half of the ⁇ 5 and the T41 is half of the ⁇ 4.
  • T51 and T41 can be set according to the harmonic signal decision requirements. For example, if a wideband signal with certain harmonic characteristics needs to be judged as a harmonic signal as much as possible, T51 and T41 can be adjusted to be lower, thereby making the harmonic signal decision condition wider.
  • Step 205 If the harmonic band number is greater than the reduced harmonic band number threshold and/or the maximum peak parameter is greater than the reduced maximum peak parameter threshold, the wideband signal is a harmonic signal.
  • the bandwidth-switched wideband signal can be judged as a harmonic signal.
  • the threshold value and, if one of the two conditions that the harmonic band number is greater than T51 and the maximum peak parameter is greater than T41, the signal after the bandwidth switching is determined to be a harmonic signal, thereby further relaxing the harmonic signal. Judgment conditions.
  • the embodiment may further add a value of the harmonic mode counter to determine, if the harmonic mode count value If it is greater than the preset value ⁇ 8, the wideband signal after the bandwidth switching is the harmonic signal.
  • Step 206 Increase at least one of a harmonic band number threshold and a maximum peak parameter threshold in a harmonic signal decision condition of the wideband signal.
  • the step 206 needs to improve the condition of determining the wideband signal after the bandwidth switching as a harmonic signal.
  • the harmonic band number threshold T5 in the harmonic signal decision condition of the wideband signal and at least one of the maximum peak parameter thresholds T4 can be increased. It can be understood that the adjustment manner of T4 and T5 is improved compared with the adjustment method of only increasing T4 or only increasing T5, and the degree of improvement of the harmonic signal decision condition is large.
  • the threshold value of the increased harmonic frequency band can be recorded as T52, and T52>T5, and the increased maximum peak parameter threshold is recorded as ⁇ 42, and ⁇ 42> ⁇ 4.
  • the ⁇ 52 can be doubled to ⁇ 5, and ⁇ 42 is twice as large as ⁇ 4.
  • ⁇ 52 and ⁇ 42 can also be set according to the harmonic signal decision requirements. For example, if it is desired to determine a wideband signal having more harmonic characteristics as a harmonic signal, the ⁇ 52 and ⁇ 42 can be adjusted to be higher, so that a wideband signal having a very strong harmonic characteristic can be judged as a harmonic signal.
  • Step 207 If the harmonic band number is greater than the increased harmonic band number threshold and/or the maximum peak parameter is greater than the increased maximum peak parameter threshold, the wideband signal is a harmonic signal.
  • the wideband signal after the bandwidth switching can be judged as a harmonic signal. .
  • the present embodiment may also add a value of the harmonic mode counter for determination, if the harmonic mode count value If it is greater than the preset value ⁇ 8, the wideband signal after bandwidth switching can also be judged as a harmonic signal.
  • the encoding apparatus may determine whether the ultra-wideband signal before the bandwidth switching is a harmonic signal or a non-harmonic signal, and if it is a harmonic signal, it may be used to reduce the signal for characterization.
  • the harmonic frequency band of the middle harmonic component and/or the decision threshold of the maximum peak parameter so as to try to determine the wideband signal after the bandwidth switching as a harmonic signal, and if it is a non-harmonic signal, it can be improved by the number of harmonic bands.
  • the harmonic mode counter can be assisted to make the decision. Therefore, in the embodiment, when the bandwidth is switched, the signal type is not changed as much as possible, so that the continuity of the speech and audio signals received by the decoding end can be ensured as much as possible.
  • a first bandwidth speech and audio signal is a wideband signal
  • a second bandwidth speech and audio signal is an ultra-wideband signal
  • the bandwidth is switched to a wideband signal.
  • Switching to the ultra-wideband signal, as shown in FIG. 3, the method in this embodiment may include:
  • Step 301 Calculate a harmonic band number and a maximum peak parameter of the ultra-wideband signal after the bandwidth switching, and update the harmonic mode count value according to the harmonic band number, the maximum peak parameter, and the harmonic signal decision condition of the ultra-wideband signal.
  • Step 301 can refer to the foregoing implementation of the signal type decision process for the ultra-wideband signal, and details are not described herein again.
  • Step 302 The default ultra-wideband signal is not a transient signal and the ratio of the global energy of the ultra-wideband signal to the global energy of the wideband signal before the bandwidth switching is within a preset range.
  • the ultra-wideband signal since the bandwidth switching is switching from the wideband signal to the ultra-wideband signal, the ultra-wideband signal includes four signal types, and the harmonic signal decision condition is increased compared to the harmonic signal decision condition of the wideband signal.
  • step 302 may not perform the above steps 1) to 3) and the ultra-wideband signal after the default bandwidth change is not a transient signal, and may also default to step 7)
  • the ratio of the global energy of the ultra-wideband signal after the medium bandwidth switching to the global energy of the wideband signal before the bandwidth switching is within a preset range (T6, T7).
  • Step 303 determining whether the wideband signal before the bandwidth switching is a harmonic signal, and if so, executing step 304, otherwise executing step 306;
  • Step 304 Reduce at least one of a harmonic band number threshold and a maximum peak parameter threshold in a harmonic signal decision condition of the ultra-wideband signal.
  • the step 304 needs to relax the condition for determining the bandwidth-switched ultra-wideband signal as a harmonic signal.
  • the harmonic band number threshold T5 and the maximum peak in the harmonic signal decision condition of the ultra-wideband signal can be reduced.
  • the threshold value of the reduced harmonic band number is also denoted as T51, and the reduced maximum peak parameter threshold is also denoted as ⁇ 41.
  • Step 305 If the harmonic frequency band is greater than the reduced harmonic frequency band threshold and/or the maximum peak parameter is greater than the reduced maximum peak parameter threshold, the ultra-wideband signal is a harmonic signal.
  • the harmonic band number is greater than the reduced harmonic band number threshold, or the maximum peak parameter is greater than the reduced maximum peak parameter threshold, only one of the two conditions satisfies
  • the switched UWB signal can be judged as a harmonic signal.
  • the embodiment may further add a value of the harmonic mode counter to determine, if the harmonic mode count value If it is greater than the preset value ⁇ 8, the ultra-wideband signal after the bandwidth switching is the harmonic signal.
  • Step 306 Improve at least one of a harmonic band number threshold and a maximum peak parameter threshold in a harmonic signal decision condition of the ultra-wideband signal.
  • Step 307 If the harmonic band number is greater than the increased harmonic band number threshold and/or the maximum peak parameter is greater than the increased maximum peak parameter threshold, the UWB signal is a harmonic signal.
  • the ultra-wideband signal after the bandwidth switching can be judged as a harmonic signal.
  • the present embodiment may also add a value of the harmonic mode counter for determination, if the harmonic mode count value If it is greater than the preset value ⁇ 8, the UWB signal after the bandwidth switching can also be judged as a harmonic signal.
  • the embodiment may also perform steps 1) to 3) to determine whether the ultra-wideband signal after the bandwidth switching is a transient signal. Moreover, in order to ensure the continuity of the signal, the present embodiment can improve the decision condition of the transient signal, so that the UWB signal having a large transient characteristic can be judged as a transient signal.
  • the encoding device may use the foregoing step 1) to calculate a time domain envelope parameter of the ultra-wideband signal, and improve the timing envelope threshold T1 in the step 2), and may improve the envelope threshold
  • the value is recorded as Ti l ; if the time domain envelope parameter is greater than Tl l , the UWB signal can be judged as a transient signal. For example, if the wideband signal before the bandwidth switching is a harmonic signal, the envelope threshold can be increased by 3 times; if the wideband signal before the bandwidth switching is a non-harmonic signal, the envelope threshold can be increased by 2 times.
  • the encoding apparatus may determine whether the wideband signal before the bandwidth switching is a harmonic signal or a non-harmonic signal, and if it is a harmonic signal, it may be used to reduce the signal.
  • the harmonic band number of the harmonic component and/or the decision threshold of the maximum peak parameter try to determine the UWB signal after the bandwidth switching as a harmonic signal, and if it is a non-harmonic signal, it can be improved by the number of harmonic bands and / or the decision threshold of the maximum peak parameter, try to determine the ultra-wideband signal after the bandwidth switching as a non-harmonic signal, and, after the adjustment of the harmonic signal decision condition, even if the ultra-wideband signal after the bandwidth switching does not satisfy the foregoing condition, It is also possible to assist the harmonic mode counter in making decisions. Therefore, in the embodiment, when the bandwidth is switched, the signal type is not changed as much as possible, so that the continuity of the speech and audio signals received by the decoding end can be ensured as much as possible.
  • the first bandwidth speech and audio signal is an ultra-wideband signal
  • the second bandwidth speech and audio signal is a wideband signal
  • the bandwidth is switched to
  • the UWB signal is switched to the broadband signal.
  • the method in this embodiment may include:
  • Step 401 Calculate a harmonic band number and a maximum peak parameter of the broadband signal after the bandwidth switching.
  • Step 402 Update the harmonic mode count value according to the harmonic band number, the maximum peak parameter, and the harmonic signal judgment condition of the wideband signal.
  • Step 403 determining whether the ultra-wideband signal before the bandwidth switching is a harmonic signal, and if so, executing step 404, otherwise executing step 405;
  • Steps 401 to 403 refer to the execution of steps 201 to 203 of the foregoing embodiment shown in FIG. 2, and details are not described herein again.
  • Step 404 Determine the wideband signal after the bandwidth switching is a harmonic signal.
  • Step 405 Determine that the wideband signal after the bandwidth switching is a non-harmonic signal.
  • the method embodiment shown in FIG. 2 determines whether the wideband signal after the bandwidth switching is a harmonic signal by adjusting the decision threshold in the harmonic signal decision condition.
  • the harmonic signal decision condition is adjusted to: The UWB signal before the bandwidth switching is a harmonic signal, and the wideband signal after the bandwidth switching is also forcibly determined as a harmonic signal.
  • the UWB signal before the bandwidth switching is a non-harmonic signal
  • the broadband signal after the bandwidth switching is also mandatory.
  • the decision is a non-harmonic signal.
  • the encoding apparatus may determine whether the ultra-wideband signal before the bandwidth switching is a harmonic signal or a non-harmonic signal, and if it is a harmonic signal, forcibly determine the bandwidth after the bandwidth switching.
  • the signal is a harmonic signal. If it is a non-harmonic signal, the wideband signal after the bandwidth switching is forcibly determined to be a non-harmonic signal. Therefore, in the embodiment, when the bandwidth is switched, the signal type is not changed, so that the continuity of the speech and audio signals can be ensured as much as possible for the speech and audio signals received by the decoding end.
  • the first bandwidth speech and audio signal is a wideband signal
  • the second bandwidth speech and audio signal is an ultra-wideband signal
  • the bandwidth is switched to
  • the broadband signal is switched to the ultra-wideband signal.
  • the method in this embodiment may include:
  • Step 501 Calculate the harmonic band number and the maximum peak parameter of the ultra-wideband signal after the bandwidth switching, and update the harmonic mode count value according to the harmonic band number, the maximum peak parameter, and the harmonic signal decision condition of the ultra-wideband signal.
  • Step 502 The UWB signal is not a transient signal by default and the ratio of the global energy of the ultra-wideband signal to the global energy of the wideband signal before the bandwidth switching is within a preset range.
  • Step 503 Determine whether the wideband signal before the bandwidth switching is a harmonic signal, and if yes, execute step 504, otherwise execute step 505;
  • Step 504 Determine that the ultra-wideband signal after the bandwidth switching is a harmonic signal.
  • Step 505 Determine that the ultra-wideband signal after the bandwidth switching is a non-harmonic signal.
  • the method embodiment shown in FIG. 3 determines whether the ultra-wideband signal after the bandwidth switching is a harmonic signal by adjusting the decision threshold in the harmonic signal decision condition.
  • the harmonic signal decision condition is adjusted as follows: As long as the wideband signal before the bandwidth switching is a harmonic signal, the ultra-wideband signal after the bandwidth switching is also forcibly determined as a harmonic signal, as long as the wideband signal before the bandwidth switching is For non-harmonic signals, the UWB signal after bandwidth switching is also forced to be judged as a non-harmonic signal.
  • the encoding apparatus may determine whether the wideband signal before the bandwidth switching is a harmonic signal or a non-harmonic signal, and if it is a harmonic signal, forcibly determine the ultra-wideband after the bandwidth switching.
  • the signal is a harmonic signal. If it is a non-harmonic signal, the ultra-wideband signal after the bandwidth switching is forcibly determined to be a non-harmonic signal. Therefore, in the embodiment, when the bandwidth is switched, the signal type is not changed, so that the continuity of the speech and audio signals can be ensured as much as possible for the speech and audio signals received by the decoding end.
  • the present invention also provides an encoding device that can be located in a terminal device, a network device, or a test device.
  • the encoding device may be implemented by a hardware circuit or by software in conjunction with hardware.
  • a processor is invoked by an encoding device to implement speech and audio signal processing.
  • the encoding apparatus can perform various methods and processes in the above method embodiments.
  • the encoding device can include a decision condition adjustment module and a signal type decision module.
  • FIG. 7 is a schematic structural diagram of Embodiment 1 of an encoding apparatus according to the present invention.
  • the encoding apparatus of this embodiment includes: a decision condition adjustment module 11 and a signal type decision module 12, wherein the decision condition adjustment module 11 is configured to If the first bandwidth speech and audio signal is a harmonic signal, adjusting the second bandwidth speech and audio signal to be determined as a judgment condition of the harmonic signal to obtain a first decision condition, so as to improve the second bandwidth speech and audio signal is determined to be a harmonic a probability of the signal; the first bandwidth speech and audio signal is a speech and audio signal before the bandwidth switching, the second bandwidth speech and audio signal is a bandwidth-switched speech and audio signal; and the signal type decision module 12 is configured to The first decision condition determines whether the second bandwidth speech and audio signal is a harmonic signal.
  • the decision condition adjustment module 11 is configured to relax the decision condition that the second bandwidth speech and audio signal is determined as a harmonic signal, and the relaxed decision condition is used as the first decision condition.
  • Embodiment 8 is a schematic structural diagram of Embodiment 2 of the encoding apparatus of the present invention. As shown in FIG. 8, the apparatus of this embodiment further includes: a harmonic mode updating module 13 on the basis of the apparatus shown in FIG. 7;
  • the decision condition adjustment module 11 is specifically configured to reduce at least one of a harmonic band number threshold and a maximum peak parameter threshold in the decision condition that the second bandwidth speech and audio signal is determined to be a harmonic signal.
  • the signal type decision module 12 may include: a calculating unit 121 and a processing unit 122, where the calculating unit 121 is configured to calculate a harmonic frequency band number and a maximum peak parameter of the second bandwidth speech and audio signal; For use in The second bandwidth speech and audio signal is a harmonic signal, wherein the harmonic frequency band number is greater than the reduced harmonic frequency band threshold and/or the maximum peak parameter is greater than the reduced maximum peak parameter threshold.
  • the harmonic mode update module 13 is configured to update the harmonic mode count according to a relationship between the harmonic frequency band number, the maximum peak parameter, and a decision condition that the second bandwidth speech and audio signal is determined to be a harmonic signal.
  • the signal type decision module 12 is further configured to: if the number of the harmonic bands is less than or equal to the reduced harmonic band number threshold and the maximum peak parameter is less than or equal to the reduced maximum peak parameter threshold And the harmonic mode count value is greater than a preset value, and then determining that the second bandwidth speech and audio signal is a harmonic signal.
  • the harmonic mode updating module 13 is specifically configured to increase the harmonic if the harmonic band number is greater than the harmonic band number threshold and the maximum peak parameter is greater than the maximum peak parameter threshold a mode count value; if the harmonic band number is less than or equal to the harmonic band number threshold and/or the maximum peak parameter is less than or equal to the maximum peak parameter threshold, the harmonic mode count value is decreased.
  • the decision condition adjustment module 1 1 is also used for the calculation Deriving a time domain envelope parameter of the ultra-wideband signal and increasing an envelope threshold in the transient signal decision condition; if the time domain envelope parameter is greater than or equal to the increased envelope threshold, the ultra-wideband signal is transient a signal, if the time domain envelope parameter is less than an increased envelope threshold, the default ultra-wideband signal is not a transient signal and the ratio of the global energy of the ultra-wideband signal to the global energy of the broadband signal is Within the preset range.
  • the decision condition adjustment module 1 1 is specifically configured to: if the broadband signal is a harmonic signal, increase the envelope threshold by a factor of three, and if the wideband signal is a non-harmonic signal, The envelope threshold is increased by a factor of two.
  • the signal type determining module 12 is specifically configured to determine the second wideband audio and audio signal as a harmonic according to the first decision condition.
  • the signal type decision module 12 is further configured to determine the second wideband audio signal as a non-harmonic signal if the first bandwidth speech and audio signal is not a harmonic signal.
  • the decision condition adjusting module 11 is further configured to: if the first bandwidth speech and audio signal is not a harmonic signal, Adjusting the harmonic signal decision condition to obtain a second decision condition to reduce the possibility that the second bandwidth speech and audio signal is determined to be a harmonic signal; correspondingly, the signal type decision module 12 is further configured to The second decision condition determines whether the second bandwidth speech audio signal is a harmonic signal.
  • the decision condition adjustment module 11 is configured to: at least one of a harmonic band number threshold and a maximum peak parameter threshold in the decision condition that the second bandwidth speech and audio signal is determined to be a harmonic signal; correspondingly, the signal
  • the type decision module 12 is specifically configured to: if the harmonic band number is greater than the increased harmonic band number threshold and/or the maximum peak parameter is greater than the increased maximum peak parameter threshold, the second bandwidth speech signal For harmonic signals.
  • the coding device of the foregoing embodiment of the present invention can perform the technical solution of the embodiment of the method shown in FIG. 1 to FIG. 5, and the implementation principle and technical effects thereof are similar, and details are not described herein again.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

一种基于带宽切换的语音频信号处理方法和编码装置。该方法包括:若第一带宽语音频信号是谐波信号,则调整第二带宽语音频信号被判决为谐波信号的判决条件得到第一判决条件,以提高所述第二带宽语音频信号被判决为谐波信号的可能性;所述第一带宽语音频信号为带宽切换前的信号,所述第二带宽语音频信号为带宽切换后的信号;根据所述第一判决条件判决所述第二带宽语音频信号是否为谐波信号。上述基于带宽切换的语音频信号处理方法和编码装置可以在带宽切换的情况下,尽可能使得语音频信号的信号类型在切换前后保持一致,从而使得解码端设备解码后的语音频信号尽可能连续,进而提高语音通信服务质量。

Description

语音频信号处理方法和编码装置
本申请要求于 2012 年 6 月 29 日提交中国专利局、 申请号为 CN201210223014.0,发明名称为"语音频信号处理方法和编码装置"的中国 专利申请的优先权, 其全部内容通过引用结合在本申请中。
技术领域
本发明实施例涉及通信技术, 尤其涉及一种语音频信号处理方法和编 码装置。 背景技术
在数字通信领域, 语音、 图像、 音频、 视频的传输有着非常广泛的应 用需求, 如手机通话、 音视频会议、 广播电视、 多媒体娱乐等。 语音频信 号被数字化处理, 通过通信网络从一个终端传递到另一个终端, 这里的终 端可以是手机、 数字电话终端或其他任何类型的语音频终端, 数字电话终 端例如 VOIP电话或 ISDN电话、 计算机、 电缆通信电话。 为了降低语音 频信号存储或者传输过程中占用的资源, 语音频信号在发送端进行压縮处 理后传输到接收端, 接收端通过解压縮处理恢复语音频信号并进行播放。
在实际语音通信过程中, 语音频信号的带宽会时常发生变化。 导致该 语音频信号的带宽发生变化的原因既可能是网络状态的变化, 也可能是语 音频信号本身在带宽上即发生变化, 或者是其它能够导致语音频信号在高 频带信号和低频带信号间切换的因素。这种语音频信号在高低频带间切换 的过程称为带宽切换。
具体来说, 网络状态时常会发生变化, 而网络带宽会随着网络状态变 差而变窄。 相应地, 语音频信号也需要随着网络带宽的变化而在高频带信 号和低频带信号之间切换。 在网络带宽变窄时, 语音频信号就需要从高频 带信号降低到低频带信号, 而当网络状况恢复时, 语音频信号则需要从低 频带信号恢复成高频带信号。 其中, 高频带信号和低频带信号的带宽大小 是相对概念, 举例来说, 该高频带信号的带宽为 0〜16kHz, 低频带信号 的带宽为 0〜8kHz, 或者高频带信号的带宽为 0〜8kHz, 低频带信号的带 宽为 0〜4kHz,该高频带信号也即超宽带信号,低频带信号也即宽带信号。 但是, 在编码端采用现有技术进行带宽切换后, 解码端时常出现语音 频信号不连续的问题, 从而导致语音通信服务质量降低。 发明内容 本发明实施例提供一种基于带宽切换的语音频信号处理方法和编码 装置。
本发明实施例提供一种基于带宽切换的语音频信号处理方法, 包括: 若第一带宽语音频信号是谐波信号, 则调整第二带宽语音频信号被判 决为谐波信号的判决条件得到第一判决条件, 以提高所述第二带宽语音频 信号被判决为谐波信号的可能性; 所述第一带宽语音信号为带宽切换前的 信号, 所述第二带宽语音频信号为带宽切换后的信号;
根据所述第一判决条件, 判决所述第二带宽语音频信号是否为谐波信 号。
本发明实施例还提供一种编码装置, 包括:
判决条件调整模块, 用于若第一带宽语音频信号是谐波信号, 则调整 第二带宽语音频信号被判决为谐波信号的判决条件得到第一判决条件, 以 提高所述第二带宽语音频信号被判决为谐波信号的可能性; 所述第一带宽 语音信号为带宽切换前的信号, 所述第二带宽语音频信号为带宽切换后的 信号;
信号类型判决模块, 用于根据所述第一判决条件, 判决所述第二带宽 语音频信号是否为谐波信号。
本发明实施例中, 编码装置可以确定带宽切换前的第一带宽语音频信 号是否是谐波信号, 并在确定是谐波信号的情况下, 采用调整带宽切换后 的第二带宽语音频信号的谐波信号判决条件的方式, 放宽对带宽切换后的 第二带宽语音频信号是否是谐波信号的判决条件, 尽可能提高该第二带宽 语音频信号判决为谐波信号的可能性, 因此, 本发明实施例可以在带宽切 换的情况下, 尽可能使得语音频信号的信号类型在切换前后保持一致, 从 而使得解码端设备解码后的语音频信号尽可能连续, 进而提高语音通信服 务质量。 附图说明 为了更清楚地说明本发明实施例或现有技术中的技术方案, 下面将对 实施例或现有技术描述中所需要使用的附图作一简单地介绍, 显而易见 地, 下面描述中的附图是本发明的一些实施例, 对于本领域普通技术人员 来讲, 在不付出创造性劳动性的前提下, 还可以根据这些附图获得其他的 附图。
图 1为本发明语音频信号处理方法实施例一的流程图;
图 2为本发明语音频信号处理方法实施例二的流程图;
图 3为本发明语音频信号处理方法实施例三的流程图;
图 4为本发明语音频信号处理方法实施例四的流程图;
图 5为本发明语音频信号处理方法实施例五的流程图;
图 6为设有本发明编码装置的编码端设备的一种结构示意图; 图 7为本发明编码装置实施例一的结构示意图;
图 8为本发明编码装置实施例二的结构示意图。 具体实施方式 为使本发明实施例的目的、 技术方案和优点更加清楚, 下面将结合本 发明实施例中的附图, 对本发明实施例中的技术方案进行清楚、 完整地描 述, 显然,所描述的实施例是本发明一部分实施例, 而不是全部的实施例。 基于本发明中的实施例, 本领域普通技术人员在没有作出创造性劳动前提 下所获得的所有其他实施例, 都属于本发明保护的范围。
本发明语音频信号处理方法可以应用于音频编码器中。数字信号处理 领域, 音频编解码器广泛应用于各种电子设备中, 例如: 移动电话, 无线 装置, 个人数据助理(PDA ) , 手持式或便携式计算机, GPS接收机 /导航 器, 照相机, 音频 /视频播放器, 摄像机, 录像机, 监控设备等。 通常, 这 类电子设备中包括音频编码器或音频解码器, 音频编码器或者解码器可以 直接由数字电路或芯片例如 DSP ( digital signal processor) 实现, 或者由 软件代码驱动处理器执行软件代码中的流程而实现。
图 1为本发明语音频信号处理方法实施例一的流程图, 如图 1所示, 本实施例的方法可以包括:
歩骤 101、 若第一带宽语音频信号是谐波信号, 则调整第二带宽语音 频信号被判决为谐波信号的判决条件得到第一判决条件, 以提高所述第二 带宽语音频信号被判决为谐波信号的可能性。
其中, 该第一带宽语音频信号为带宽切换前的语音频信号, 所述第二 带宽语音频信号为带宽切换后的语音频信号。
歩骤 102、 根据所述第一判决条件, 判决所述第二带宽语音频信号是 否为谐波信号。
具体地,高频带信号可以为超宽带信号,低频带信号可以为宽带信号, 本领域技术人员可以根据需求自行设定带宽范围为一定范围以上的信号 为超宽带信号, 否则为宽带信号, 举例来说, 可以设定带宽范围为 0〜8kHz 以上的信号为超宽带信号,带宽范围为 0〜8kHz及以下的信号为宽带信号。 在编码端进行编码时, 超宽带信号可以被分为谐波信号、 普通信号、 瞬态 信号以及噪声信号, 而宽带信号可以被分为谐波信号和普通信号。
本实施例中的第一带宽语音频信号可以是超宽带信号, 切换后的第二 带宽语音频信号可以是宽带信号, 或者第一带宽语音频信号可以是宽带信 号, 切换后的第二带宽语音频信号可以是超宽带信号。 对于超宽带信号来 说, 其信号类型可以是谐波信号、 普通信号、 瞬态信号以及噪声信号中的 一种, 对于宽带信号来说, 其信号类型可以是谐波信号和普通信号中的一 种。 针对超宽带信号来说, 编码装置可以采用与超宽带信号对应的谐波信 号判决条件判断该超宽带信号的信号类型, 对于宽带信号来说, 编码装置 可以采用与宽带信号对应的谐波信号判决条件判断该宽带信号的信号类 型。 在现有技术中, 与超宽带信号对应的谐波信号判决条件以及与宽带信 号对应的谐波信号判决条件在进行谐波信号的判决时均需要采用前一帧 信号的信息作为参考信息。
发明人在实践现有技术的过程中发现, 在编码端进行带宽切换的情况 下, 如果带宽切换前的语音频信号是超宽带信号中的谐波信号或者宽带信 号中的谐波信号, 则在解码端时常出现语音断断续续的情况, 从而影响了 用户的正常通信, 降低了语音通信服务质量。
发明人经过仔细研究发现, 出现上述问题的比较主要的原因在于, 不 管是与超宽带信号对应的谐波信号判决条件还是与宽带信号对应的谐波 信号判决条件, 其在进行谐波信号的判决时均需要采用前一帧信号的信息 作为参考信息, 而在带宽切换的情况发生时, 由于信号带宽发生变化, 导 致带宽切换前后的信号能量和频带差别较大, 基于这种变化, 编码装置如 果仍然采用带宽切换前的信号作为对带宽切换后的信号类型进行判断的 参考信息, 则可能在带宽切换时进行信号类型的切换, 例如, 带宽切换前 的语音频信号为谐波信号, 但是在带宽切换后的语音频信号则有可能被判 决为瞬态信号。编码端可以采用针对谐波信号的编码方法对带宽切换前的 谐波信号进行编码, 采用针对非谐波信号的编码方法对带宽切换后的非谐 波信号进行编码, 之后, 编码端即可将编码信号发送给解码端, 解码端在 接收到该编码信号后, 即可采用对应的解码方法对编码信号进行解码, 从 而可以恢复出谐波信号和非谐波信号。 由于谐波信号和非谐波信号在信号 特性上相差较远, 因此两种信号的输出, 将使得解码端的用户听起来的语 音频信号是断断续续的。 而对于非谐波信号中的噪声信号、 瞬态信号以及 普通信号这三种信号类型之间的切换, 对于解码端设备来说, 其解码后的 语音频信号并无太大的影响。
为此, 本实施例中, 编码装置可以确定带宽切换前的第一带宽语音频 信号是否是谐波信号。 如果带宽切换前的第一带宽语音频信号是谐波信 号, 则编码装置可以采用调整谐波信号判决条件的方式, 来提高带宽切换 后的第二带宽语音频信号被确定为谐波信号的可能性, 从而在带宽切换的 情况下, 在对带宽切换后的语音频信号进行判决时, 尽量不改变语音频信 号的信号类型, 从而使得解码端设备接收的语音频信号的信号类型在带宽 切换前后是一致的, 即可采用相同的解码方式进行解码, 从而尽量保证语 音频信号的连续性, 而只有在切换后的第二带宽语音频信号也不满足放宽 后的谐波信号判决条件时, 也即该第二带宽语音频信号中的谐波成分确实 很少时, 才改变该第二带宽语音频信号的信号类型。 在本实施例中, 若第 一带宽语音频信号为超宽带信号, 则第二带宽语音频信号即为宽带信号, 若第一带宽语音频信号为宽带信号, 则第二带宽语音频信号即为超宽带信 号。
以第一带宽语音频信号为超宽带信号, 第二带宽语音频信号为宽带信 号举例来说, 编码装置可以采用与超宽带信号对应的谐波信号判决条件确 定带宽切换前的超宽带信号是谐波信号还是非谐波信号, 该非谐波信号即 为瞬态信号、噪声信号以及普通信号中的一种。如果判决结果是谐波信号, 则编码装置可以放宽与宽带信号对应的谐波信号判决条件, 得到第一判决 条件, 并根据该第一判决条件对带宽切换后的宽带信号是否是谐波信号进 行判断。 由于放宽了与宽带信号对应的谐波信号判决条件, 因此, 切换后 的宽带信号被判决为谐波信号的可能性提升, 从而使得带宽切换前后的信 号类型尽可能不发生变化, 进而使得解码端设备解码后的语音频信号尽可 能连续。
需要说明的是, 本领域技术人员可以根据语音频信号的处理方法自行 设计与超宽带信号对应的谐波信号判决条件以及与宽带信号对应的谐波 信号判决条件或者采用标准中规定的谐波信号判决条件, 本实施例不作限 定。
本实施例中, 编码装置可以确定带宽切换前的第一带宽语音频信号是 否是谐波信号, 并在确定是谐波信号的情况下, 采用调整带宽切换后的第 二带宽语音频信号的谐波信号判决条件的方式, 放宽对带宽切换后的第二 带宽语音频信号是否是谐波信号的判决条件, 尽可能提高该第二带宽语音 频信号判决为谐波信号的可能性, 因此, 本实施例可以在带宽切换的情况 下, 尽可能使得语音频信号的信号类型在切换前后保持一致, 从而使得解 码端设备解码后的语音频信号尽可能连续, 进而提高语音通信服务质量。
在图 1所示方法实施例的基础上, 若编码装置确定第一带宽语音频信 号不是谐波信号,则编码装置在执行图 1所示方法实施例的歩骤 102之前, 还可以包括:
调整谐波信号判决条件得到第二判决条件, 以降低第二带宽语音频信 号被判决为谐波信号的可能性, 进而根据所述第二判决条件, 判决所述第 二带宽语音频信号是否为谐波信号。
具体来说, 如果编码装置在确定带宽切换前的第一带宽语音频信号并 非谐波信号, 则编码装置可以采用调整谐波信号判决条件的方式, 提高该 第二带宽语音频信号被判决为谐波信号判决门槛, 从而降低该第二带宽语 音频信号被判决为谐波信号的可能性。 也即, 如果带宽切换前的第一带宽 语音频信号为非谐波信号, 例如为噪声信号、 瞬态信号或者普通信号, 则 带宽切换后的第二带宽语音频信号通过提高谐波信号的判决门槛, 可以尽 可能被判决为噪声信号、 瞬态信号或者普通信号, 而尽可能不被判决为谐 波信号。 编码端在带宽切换时尽可能不改变语音频信号的信号类型, 对于 解码端来说, 其解码后的语音频信号也会尽可能连续。
如上所述, 若第一带宽语音频信号为超宽带信号, 则第二带宽语音频 信号即为宽带信号, 若第一带宽语音频信号为宽带信号, 则第二带宽语音 频信号即为超宽带信号。 下面针对不同的带宽切换情况, 采用不同的实施 例对本发明的技术方案进行详细说明。
首先, 对下述实施例中所采用的与超宽带信号对应的谐波信号判决条 件和非谐波信号判决条件, 以及与宽带信号对应的谐波信号判决条件和非 谐波信号判决条件进行详细介绍。 需要说明的是, 下述实施例以标准中规 定的信号类型判决条件为例对语音频信号是谐波信号还是非谐波信号进 行判决, 本领域技术人员可以理解的是, 这些判决条件, 可以根据语音频 信号的处理方法自行修改。
对于超宽带信号来说, 可以采用下述方式来判断该超宽带信号的信号 类型:
1 ) 将当前语音频信号分成多个信号段,获得多段时域信号并确定每段 时域信号的时域包络参数值, 可选地, 在确定每段时域信号的时域 包络参数值之前, 也可以根据各段时域信号在整个语音频信号中的 重要程度, 给各段时域信号分别乘一个比例系数作为用于确定时域 包络参数值的时域信号。
2) 判断多个时域信号的时域包络参数值中是否有一个时域包络参数 值大于给定的包络阈值 Tl, 该包络阈值 T1是由前若干个语音频信号 的包络值加权求和, 然后乘以预设值计算得到的。
3 ) 若至少有一个时域包络阈值大于 T1,则判断当前语音频信号为瞬态 信号;
歩骤 1 ) 〜歩骤 3 ) 为瞬态信号判决条件。
4) 若没有一个时域包络阈值大于 T1,则将当前语音频信号的频域信号 分为多个频带, 每个频带中计算 1个频域幅度峰值, 然后根据这些频 域幅度峰值、 该频带的频域幅度峰值的平均值以及频带宽度计算每 个频带的谐波特性值;
5 ) 判断每个频带的谐波特性值是否大于给定阈值以及该频带的频域 幅度峰值是否大于给定阈值 T2,若均大于则判断该频带为谐波频带, 并执行歩骤 6) , 否则进一歩判断谐波特性值是否小于给定阈值 T3 , 若小于则判断该频带为噪声频带, 否则为普通频带;
6) 确定最大峰值参数值, 即所有频带的幅度峰值的最大值, 并且统计 谐波频带数和噪声频带数, 计算当前语音频信号的全局能量与前一 语音频信号的全局能量的比值;
7 ) 判断最大峰值参数值是否大于给定阈值 T4,谐波频带数是否大于给 定阈值 T5, 全局能量比值是否在给定阈值(Τ6, Τ7 ) 范围内, 如果 都判断为是, 则判决当前语音频信号为谐波信号, 并更新谐波模式 计数器, 例如其计数值加 1 ;
8 ) 如果 3个判断条件不全部为是, 则更新谐波模式计数器, 例如其计 数值减 1,然后判断此时谐波模式计数值是否大于给定阈值 Τ8,若是 则判断当前语音频信号为谐波信号;
歩骤 4 ) 〜歩骤 8 ) 即为谐波信号判决条件。
其中, 需要说明的是, 该谐波模式计数器是可选功能, 在当前语音 频信号的最大峰值参数值小于等于给定阈值 Τ4, 谐波频带数小于等 于给定阈值 Τ5且全局能量比值不在给定阈值 (Τ6, Τ7 ) 范围内时, 该谐波模式计数器可以作为判决当前语音频信号是否为谐波信号的 参考, 如果之前累计谐波信号的个数超过了给定阈值 Τ8, 则说明该 连续的语音频信号是谐波信号的可能性较大, 即使其不满足前三个 条件, 此时也可以当前语音频信号判决为谐波信号。
9) 若不是则进一歩判断噪声频带数和其它噪声相关参数是否满足条 件, 若是则判断当前语音频信号为噪声信号, 否则当前语音频信号 为普通信号。
对于宽带信号来说, 其只需要区分谐波信号和普通信号, 而带宽切换 过程中对于谐波信号判决条件来说, 其与超宽带信号的判决原理类似, 具 体来说: 在编码装置判断当前语音频信号是否为谐波信号时, 只需判断谐波频 带数和最大峰值参数值是否分别大于给定阈值 T5、 Τ4, 若是则判断当前 语音频信号为谐波信号, 并增加谐波模式计数器的值, 例如其计数值加 1 ; 若两者不能同时满足, 则减少谐波模式计数器的值, 例如其计数值减 1, 然后判断此时谐波模式计数器的计数值是否大于给定阈值 Τ8 , 若是则判 断当前语音频信号为谐波信号, 否则为普通信号。
基于上述关于宽带信号和超宽带信号的信号类型判决的介绍, 下面对 本发明的技术方案进行详细说明。
图 2为本发明与音频信号处理方法实施例二的流程图,在本实施例中, 第一带宽语音频信号为超宽带信号, 第二带宽语音频信号为宽带信号, 带 宽切换为从超宽带信号向宽带信号切换, 如图 2所示, 本实施例的方法可 以包括:
歩骤 201、计算带宽切换后的宽带信号的谐波频带数和最大峰值参数。 该歩骤例如可以采用上述歩骤 6 ) 实现, 此处不再赘述。
歩骤 202、 根据谐波频带数、 最大峰值参数和宽带信号的谐波信号判 决条件, 更新谐波模式计数值。
该歩骤例如可以采用上述歩骤 7 ) 实现, 需要说明的是, 对于宽带信 号来说, 可以无需计算全局能量比值, 而只采用宽带信号的谐波信号判决 条件中对谐波频带数和最大峰值参数的判决, 从而可以更新谐波模式计数 器, 如果谐波频带数大于给定阈值 Τ5且最大峰值参数大于给定阈值 Τ4 , 则可以确定带宽切换后的宽带信号为谐波信号, 此时可以将谐波模式计数 器的值加 1, 如果谐波频带数小于等于给定阈值 Τ5和 /或最大峰值参数小 于等于给定阈值 Τ4, 则可以确定带宽切换后的宽带信号为非谐波信号, 此时可以将谐波模式计数器的值减 1。 由此可知, 此处关于带宽切换后的 宽带信号是谐波信号还是非谐波信号的判决是基于该宽带信号的客观信 号类型, 以此更新的谐波模式计数器是为了对后续语音频信号进行判决 时, 可以参考之前的语音频信号的客观信息。
歩骤 203、 确定带宽切换前的超宽带信号是否是谐波信号, 若是, 则 执行歩骤 204, 否则执行歩骤 206;
所需说明的是, 歩骤 203的执行顺序只要在歩骤 204之前即可, 而并 不限定其一定在歩骤 201和歩骤 202之后执行, 在实际处理过程中, 其在 带宽切换前就可以执行。
歩骤 204、 降低宽带信号的谐波信号判决条件中的谐波频带数阈值和 最大峰值参数阈值中的至少一个阈值。
由于带宽切换前的超宽带信号为谐波信号, 因此, 歩骤 204需要放宽 将带宽切换后的宽带信号判决为谐波信号的条件。 在本实施例中, 可以降 低宽带信号的谐波信号判决条件中的谐波频带数阈值 T5, 以及最大峰值 参数阈值 Τ4中的至少一个阈值。 可以理解的是, Τ4和 Τ5均降低的调整 方式相对于只降低 Τ4或者只降低 Τ5的调整方式来说,对谐波信号判决条 件的放宽程度是较大的。 本实施例中可以将降低后的谐波频带数阈值记为 T51 , 该 Τ51 < Τ5, 将降低后的最大峰值参数阈值记为 Τ41, 该 Τ41 < Τ4。 举例来说, 该 T51可以为 Τ5的一半, T41为 Τ4的一半。
本领域技术人员可以理解的是, T51和 T41的具体取值可以根据谐波 信号判决需求自行设定。 举例来说, 如果需要使具有一定谐波特性的宽带 信号可以尽量被判决为谐波信号, 则可以将 T51和 T41调整得低一些, 从 而将谐波信号判决条件放得较宽。
歩骤 205、若谐波频带数大于降低后的谐波频带数阈值和 /或最大峰值 参数大于降低后的最大峰值参数阈值, 则该宽带信号为谐波信号。
在谐波信号判决条件被放宽后, 如果谐波频带数大于 Τ51, 或者最大 峰值参数大于 T41这两个条件中只要有一个条件满足, 则带宽切换后的宽 带信号即可被判决为谐波信号。 需要说明的是, 现有技术在进行谐波信号 时,需要谐波频带数大于 Τ5和最大峰值参数大于 Τ4这两个条件同时满足, 而在本实施例中, 不仅降低了 Τ5和 Τ4的判决阈值, 而且, 在谐波频带数 大于 T51以及最大峰值参数大于 T41这两个条件中只要有一个条件满足 时, 就可以判决带宽切换后的信号为谐波信号, 从而进一歩放宽了谐波信 号的判决条件。
对于谐波频带数小于等于 T51且最大峰值参数小于等于 T41的情况, 也即上述两个条件都不满足, 则本实施例还可以附加谐波模式计数器的值 进行判决, 若谐波模式计数值大于预设值 Τ8 , 则带宽切换后的宽带信号 即为谐波信号。 歩骤 206、 提高宽带信号的谐波信号判决条件中的谐波频带数阈值和 最大峰值参数阈值中的至少一个阈值。
由于带宽切换前的超宽带信号为非谐波信号,例如为瞬态信号,因此, 歩骤 206需要提高将带宽切换后的宽带信号判决为谐波信号的条件。在本 实施例中, 可以提高宽带信号的谐波信号判决条件中的谐波频带数阈值 T5 , 以及最大峰值参数阈值 T4中的至少一个阈值。 可以理解的是, T4和 T5均提高的调整方式相对于只提高 T4或者只提高 T5的调整方式来说, 对谐波信号判决条件的提高程度是较大的。 本实施例中可以将提高后的谐 波频带数阈值记为 T52 , 该 T52〉T5, 将提高后的最大峰值参数阈值记为 Τ42 , 该 Τ42〉Τ4。 举例来说, 该 Τ52可以为 Τ5的一倍, Τ42为 Τ4的一 倍。
本领域技术人员可以理解的是, Τ52和 Τ42的具体取值也可以根据谐 波信号判决需求自行设定。 举例来说, 如果需要将存在较多谐波特性的宽 带信号判决为谐波信号, 则可以将 Τ52和 Τ42调整地较高, 从而可以将谐 波特性非常明显的宽带信号判决为谐波信号。
歩骤 207、若谐波频带数大于提高后的谐波频带数阈值和 /或最大峰值 参数大于提高后的最大峰值参数阈值, 则宽带信号为谐波信号。
在谐波信号判决条件被提高后, 如果谐波频带数大于 Τ52, 或者最大 峰值参数大于 Τ42这两个条件中只要有一个条件满足, 则带宽切换后的宽 带信号即可被判决为谐波信号。
对于谐波频带数小于等于 Τ52且最大峰值参数小于等于 Τ42的情况, 也即上述两个条件都不满足, 则本实施例也可以附加谐波模式计数器的值 进行判决, 若谐波模式计数值大于预设值 Τ8 , 则带宽切换后的宽带信号 也可以被判决为谐波信号。
本实施例中, 当编码端出现带宽切换的情况时, 编码装置可以确定带 宽切换前的超宽带信号是谐波信号还是非谐波信号, 如果是谐波信号, 则 可以通过降低用于表征信号中谐波成分的谐波频带数和 /或最大峰值参数 的判决阈值, 从而尽量将带宽切换后的宽带信号判决为谐波信号, 如果是 非谐波信号,则可以通过提高用于谐波频带数和 /或最大峰值参数的判决阈 值, 从而尽量将带宽切换后的宽带信号也判决为非谐波信号, 而且, 在谐 波信号判决条件调整后, 即使带宽切换后的宽带信号不满足前述条件, 还 可以辅助谐波模式计数器进行判决。 因此, 本实施例在带宽切换时, 尽可 能不改变信号类型, 从而对于解码端接收的语音频信号来说, 可以尽量保 证其连续性。
图 3为本发明语音频信号处理方法实施例三的流程图,在本实施例中, 第一带宽语音频信号为宽带信号, 第二带宽语音频信号为超宽带信号, 带 宽切换为从宽带信号向超宽带信号切换, 如图 3所示, 本实施例的方法可 以包括:
歩骤 301、 计算带宽切换后的超宽带信号的谐波频带数和最大峰值参 数, 根据谐波频带数、 最大峰值参数和超宽带信号的谐波信号判决条件, 更新谐波模式计数值。
歩骤 301可以参见前述关于超宽带信号的信号类型判决过程实现, 此 处不再赘述。
歩骤 302、 默认该超宽带信号不是瞬态信号且默认该超宽带信号的全 局能量与带宽切换前的宽带信号的全局能量的比值在预设范围内。
本实施例中, 由于带宽切换是从宽带信号向超宽带信号切换, 而超宽 带信号则包括了四种信号类型, 而且, 其谐波信号判决条件相比宽带信号 的谐波信号判决条件增加了带宽切换后的超宽带信号的全局能量与带宽 切换前的宽带信号的全局能量的比值的判定条件。 在本实施例中, 为了简 化判决条件, 歩骤 302可以不执行上述歩骤 1 )〜歩骤 3 ) 而默认带宽前换 后的超宽带信号不是瞬态信号, 并且还可以默认歩骤 7 ) 中带宽切换后的 超宽带信号的全局能量与带宽切换前的宽带信号的全局能量的比值在预 设范围 (T6, T7 ) 内。
歩骤 303、 确定带宽切换前的宽带信号是否是谐波信号, 若是, 则执 行歩骤 304, 否则执行歩骤 306;
歩骤 304、 降低超宽带信号的谐波信号判决条件中的谐波频带数阈值 和最大峰值参数阈值中的至少一个阈值。
由于带宽切换前的宽带信号为谐波信号, 因此, 歩骤 304需要放宽将 带宽切换后的超宽带信号判决为谐波信号的条件。 在本实施例中, 可以降 低超宽带信号的谐波信号判决条件中的谐波频带数阈值 T5, 以及最大峰 值参数阈值 T4中的至少一个阈值。 降低后的谐波频带数阈值同样记为 T51 , 降低后的最大峰值参数阈值同样记为 Τ41。
歩骤 305、若谐波频带数大于降低后的谐波频带数阈值和 /或最大峰值 参数大于降低后的最大峰值参数阈值, 则该超宽带信号为谐波信号。
在谐波信号判决条件被放宽后, 如果谐波频带数大于降低后的谐波频 带数阈值, 或者最大峰值参数大于降低后的最大峰值参数阈值这两个条件 中只要有一个条件满足, 则带宽切换后的超宽带信号即可被判决为谐波信 号。
对于谐波频带数小于等于 T51且最大峰值参数小于等于 T41的情况, 也即上述两个条件都不满足, 则本实施例还可以附加谐波模式计数器的值 进行判决, 若谐波模式计数值大于预设值 Τ8 , 则带宽切换后的超宽带信 号即为谐波信号。
歩骤 306、 提高超宽带信号的谐波信号判决条件中的谐波频带数阈值 和最大峰值参数阈值中的至少一个阈值。
歩骤 307、若谐波频带数大于提高后的谐波频带数阈值和 /或最大峰值 参数大于提高后的最大峰值参数阈值, 则该超宽带信号为谐波信号。
在谐波信号判决条件被提高后, 如果谐波频带数大于提高后的谐波频 带数阈值 Τ52 , 或者最大峰值参数大于提高后的最大峰值参数阈值 Τ42这 两个条件中只要有一个条件满足, 则带宽切换后的超宽带信号即可被判决 为谐波信号。
对于谐波频带数小于等于 Τ52且最大峰值参数小于等于 Τ42的情况, 也即上述两个条件都不满足, 则本实施例也可以附加谐波模式计数器的值 进行判决, 若谐波模式计数值大于预设值 Τ8 , 则带宽切换后的超宽带信 号也可以被判决为谐波信号。
可替代地, 本实施例也可以执行歩骤 1 ) 〜3 ) 对带宽切换后的超宽带 信号是否是瞬态信号进行判决。 而且, 为了保证信号的连续性, 本实施例 可以提高瞬态信号的判决条件, 从而可以将确实具有较大的瞬态特性的超 宽带信号判决为瞬态信号。
具体实现时, 编码装置可以采用上述歩骤 1 ) 计算超宽带信号的时域 包络参数, 并提高歩骤 2 ) 中的时序包络阈值 T1 , 可以将提高后的包络阈 值记为 Ti l ; 若时域包络参数大于 Tl l, 则该超宽带信号可以被判决为瞬 态信号。 举例来说, 若带宽切换前的宽带信号为谐波信号, 则可以将包络 阈值提高 3倍; 若带宽切换前的宽带信号为非谐波信号, 则可以将包络阈 值提高 2倍。
本实施例中, 当编码端出现带宽切换的情况时, 编码装置可以确定带 宽切换前的宽带信号是谐波信号还是非谐波信号, 如果是谐波信号, 则可 以通过降低用于表征信号中谐波成分的谐波频带数和 /或最大峰值参数的 判决阈值, 尽量将带宽切换后的超宽带信号判决为谐波信号, 如果是非谐 波信号, 则可以通过提高用于谐波频带数和 /或最大峰值参数的判决阈值, 尽量将带宽切换后的超宽带信号也判决为非谐波信号, 而且, 在谐波信号 判决条件调整后, 即使带宽切换后的超宽带信号不满足前述条件, 还可以 辅助谐波模式计数器进行判决。 因此, 本实施例在带宽切换时, 尽可能不 改变信号类型, 从而对于解码端接收的语音频信号来说, 可以尽量保证其 连续性。
图 4为本发明基于带宽切换的信号处理方法实施例四的流程图, 在本 实施例中, 第一带宽语音频信号为超宽带信号, 第二带宽语音频信号为宽 带信号, 带宽切换为从超宽带信号向宽带信号切换, 如图 4所示, 本实施 例的方法可以包括:
歩骤 401、计算带宽切换后的宽带信号的谐波频带数和最大峰值参数。 歩骤 402、 根据谐波频带数、 最大峰值参数和宽带信号的谐波信号判 决条件, 更新谐波模式计数值。
歩骤 403、 确定带宽切换前的超宽带信号是否是谐波信号, 若是, 则 执行歩骤 404, 否则执行歩骤 405;
歩骤 401〜歩骤 403可参考前述图 2所示实施例的歩骤 201〜歩骤 203 的执行过程, 此处不再赘述。
歩骤 404、 判决带宽切换后的宽带信号为谐波信号。
歩骤 405、 判决带宽切换后的宽带信号为非谐波信号。
本实施例与前述图 2所示方法实施例的区别在于, 图 2所示方法实施 例通过调整谐波信号判决条件中的判决阈值对带宽切换后的宽带信号是 否是谐波信号进行判决, 而本实施例则将谐波信号判决条件调整为: 只要 带宽切换前的超宽带信号是谐波信号, 则带宽切换后的宽带信号也强制判 决为谐波信号, 只要带宽切换前的超宽带信号为非谐波信号, 则带宽切换 后的宽带信号也强制判决为非谐波信号。
本实施例中, 当编码端出现带宽切换的情况时, 编码装置可以确定带 宽切换前的超宽带信号是谐波信号还是非谐波信号, 如果是谐波信号, 则 强制判决带宽切换后的宽带信号为谐波信号, 如果是非谐波信号, 则强制 判决带宽切换后的宽带信号为非谐波信号。因此,本实施例在带宽切换时, 不改变信号类型, 从而对于解码端接收的语音频信号来说, 可以尽量保证 语音频信号的连续性。
图 5为本发明基于带宽切换的信号处理方法实施例五的流程图, 在本 实施例中, 第一带宽语音频信号为宽带信号, 第二带宽语音频信号为超宽 带信号, 带宽切换为从宽带信号向超宽带信号切换, 如图 5所示, 本实施 例的方法可以包括:
歩骤 501、 计算带宽切换后的超宽带信号的谐波频带数和最大峰值参 数, 根据谐波频带数、 最大峰值参数和超宽带信号的谐波信号判决条件, 更新谐波模式计数值。
歩骤 502、 默认该超宽带信号不是瞬态信号且默认该超宽带信号的全 局能量与带宽切换前的宽带信号的全局能量的比值在预设范围内。
歩骤 503、 确定带宽切换前的宽带信号是否是谐波信号, 若是, 则执 行歩骤 504, 否则执行歩骤 505 ;
歩骤 501〜歩骤 503可参考前述图 3所示实施例的歩骤 301〜歩骤 303 的执行过程, 此处不再赘述。
歩骤 504、 判决带宽切换后的超宽带信号为谐波信号。
歩骤 505、 判决带宽切换后的超宽带信号为非谐波信号。
本实施例与前述图 3所示方法实施例的区别在于, 图 3所示方法实施 例通过调整谐波信号判决条件中的判决阈值对带宽切换后的超宽带信号 是否是谐波信号进行判决, 而本实施例则将谐波信号判决条件调整为: 只 要带宽切换前的宽带信号是谐波信号, 则带宽切换后的超宽带信号也强制 判决为谐波信号, 只要带宽切换前的宽带信号为非谐波信号, 则带宽切换 后的超宽带信号也强制判决为非谐波信号。 本实施例中, 当编码端出现带宽切换的情况时, 编码装置可以确定带 宽切换前的宽带信号是谐波信号还是非谐波信号, 如果是谐波信号, 则强 制判决带宽切换后的超宽带信号为谐波信号, 如果是非谐波信号, 则强制 判决带宽切换后的超宽带信号为非谐波信号。 因此, 本实施例在带宽切换 时, 不改变信号类型, 从而对于解码端接收的语音频信号来说, 可以尽量 保证语音频信号的连续性。
与上述方法实施例相关联, 本发明还提供一种编码装置, 该装置可以 位于终端设备, 网络设备, 或测试设备中。 所述编码装置可以由硬件电路 来实现, 或者由软件配合硬件来实现。 例如, 参考图 6, 由一个处理器调 用编码装置来实现语音频信号处理。 该编码装置可以执行上述方法实施例 中的各种方法和流程。 该编码装置可以包括判决条件调整模块以及信号类 型判决模块。
图 7为本发明编码装置实施例一的结构示意图, 如图 7所示, 本实施 例的编码装置包括: 判决条件调整模块 11以及信号类型判决模块 12, 其 中, 判决条件调整模块 11, 用于若第一带宽语音频信号是谐波信号, 则调 整第二带宽语音频信号被判决为谐波信号的判决条件得到第一判决条件, 以提高所述第二带宽语音频信号被判决为谐波信号的可能性; 所述第一带 宽语音频信号为带宽切换前的语音频信号, 所述第二带宽语音频信号为带 宽切换后的语音频信号; 信号类型判决模块 12, 用于根据所述第一判决条 件, 判决所述第二带宽语音频信号是否为谐波信号。
具体地, 判决条件调整模块 11用于放宽所述第二带宽语音频信号被 判决为谐波信号的判决条件, 放宽后的判决条件作为所述第一判决条件。
图 8为本发明编码装置实施例二的结构示意图, 如图 8所示, 本实施 例的装置在图 7所示装置的基础上, 进一歩地, 还包括: 谐波模式更新模 块 13 ;
在本实施例中, 判决条件调整模块 11, 具体用于降低所述第二带宽语 音频信号被判决为谐波信号的判决条件中的谐波频带数阈值和最大峰值 参数阈值中的至少一个阈值; 相应地, 信号类型判决模块 12可以包括: 计算单元 121和处理单元 122, 其中, 计算单元 121, 用于计算所述第二 带宽语音频信号的谐波频带数和最大峰值参数; 处理单元 122, 用于若所 述谐波频带数大于降低后的谐波频带数阈值和 /或所述最大峰值参数大于 降低后的最大峰值参数阈值, 则所述第二带宽语音频信号为谐波信号。
谐波模式更新模块 13, 用于根据所述谐波频带数、所述最大峰值参数 和所述第二带宽语音频信号被判决为谐波信号的判决条件之间的关系, 更 新谐波模式计数值; 相应地, 信号类型判决模块 12, 还用于若所述谐波频 带数小于等于所述降低后的谐波频带数阈值且所述最大峰值参数小于等 于所述降低后的最大峰值参数阈值且所述谐波模式计数值大于预设值, 则 确定所述第二带宽语音频信号为谐波信号。
进一歩地, 该谐波模式更新模块 13, 具体用于若所述谐波频带数大于 所述谐波频带数阈值且所述最大峰值参数大于所述最大峰值参数阈值, 则 增加所述谐波模式计数值; 若所述谐波频带数小于等于所述谐波频带数阈 值和 /或所述最大峰值参数小于等于所述最大峰值参数阈值,则减少所述谐 波模式计数值。
针对带宽切换为从宽带信号向超宽带信号切换, 即第一带宽语音频信 号为宽带信号, 所述第二带宽语音频信号为超宽带信号的情况, 判决条件 调整模块 1 1还用于计算所述超宽带信号的时域包络参数, 并提高瞬态信 号判决条件中的包络阈值; 若所述时域包络参数大于等于提高后的包络阈 值, 则所述超宽带信号为瞬态信号, 若所述时域包络参数小于提高后的包 络阈值, 则默认所述超宽带信号不是瞬态信号且默认所述超宽带信号的全 局能量与所述宽带信号的全局能量的比值在预设范围内。 在具体实现时, 判决条件调整模块 1 1具体用于若所述宽带信号为谐波信号, 则将所述包 络阈值提高 3倍, 若所述宽带信号为非谐波信号, 则将所述包络阈值提高 2倍。
本发明编码装置另一个实施例在图 7所示编码装置实施例的基础上, 信号类型判决模块 12可以具体用于根据所述第一判决条件将所述第二宽 带语音频信号判决为谐波信号; 或者, 信号类型判决模块 12还用于若所 述第一带宽语音频信号不是谐波信号, 则将所述第二宽带语音频信号判决 为非谐波信号。
本发明编码装置再一个实施例在图 7所示编码装置实施例的基础上, 判决条件调整模块 1 1还用于若所述第一带宽语音频信号不是谐波信号, 则调整所述谐波信号判决条件得到第二判决条件, 以降低所述第二带宽语 音频信号被判决为谐波信号的可能性; 相应地, 信号类型判决模块 12, 还 用于根据所述第二判决条件, 判决所述第二带宽语音频信号是否为谐波信 号。 具体地, 判决条件调整模块 11用于提高所述第二带宽语音频信号被 判决为谐波信号的判决条件中的谐波频带数阈值和最大峰值参数阈值中 的至少一个阈值; 相应地, 信号类型判决模块 12, 具体用于若所述谐波频 带数大于提高后的谐波频带数阈值和 /或所述最大峰值参数大于提高后的 最大峰值参数阈值, 则所述第二带宽语音频信号为谐波信号。
本发明上述实施例的编码装置可以对应地执行图 1〜图 5所示方法实 施例的技术方案, 其实现原理和技术效果类似, 此处不再赘述。
本领域普通技术人员可以理解: 实现上述方法实施例的全部或部分歩 骤可以通过程序指令相关的硬件来完成, 前述的程序可以存储于一计算机 可读取存储介质中, 该程序在执行时, 执行包括上述方法实施例的歩骤; 而前述的存储介质包括: ROM、 RAM, 磁碟或者光盘等各种可以存储程 序代码的介质。
最后应说明的是: 以上实施例仅用以说明本发明的技术方案, 而非对 其限制; 尽管参照前述实施例对本发明进行了详细的说明, 本领域的普通 技术人员应当理解: 其依然可以对前述各实施例所记载的技术方案进行修 改, 或者对其中部分技术特征进行等同替换; 而这些修改或者替换, 并不 使相应技术方案的本质脱离本发明各实施例技术方案的范围。

Claims

权 利 要 求 书
1、 一种语音频信号处理方法, 其特征在于, 包括:
若第一带宽语音频信号是谐波信号, 则调整第二带宽语音频信号被判 决为谐波信号的判决条件得到第一判决条件, 以提高所述第二带宽语音频 信号被判决为谐波信号的可能性; 所述第一带宽语音频信号为带宽切换前 的语音频信号, 所述第二带宽语音频信号为带宽切换后的语音频信号; 根据所述第一判决条件, 判决所述第二带宽语音频信号是否为谐波信 号。
2、 根据权利要求 1所述的方法, 其特征在于, 所述调整第二带宽语 音频信号被判决为谐波信号的判决条件得到第一判决条件, 以提高所述第 二带宽语音频信号被判决为谐波信号的可能性, 包括:
放宽所述第二带宽语音频信号被判决为谐波信号的判决条件, 放宽后 的判决条件作为所述第一判决条件。
3、 根据权利要求 2所述的方法, 其特征在于, 所述放宽所述第二带 宽语音频信号被判决为谐波信号的判决条件, 包括:
降低所述第二带宽语音频信号被判决为谐波信号的判决条件中的谐 波频带数阈值和最大峰值参数阈值中的至少一个阈值;
所述根据所述第一判决条件, 判决所述第二带宽语音频信号是否为谐 波信号, 包括:
计算所述第二带宽语音频信号的谐波频带数和最大峰值参数; 若所述谐波频带数大于降低后的谐波频带数阈值和 /或所述最大峰值 参数大于降低后的最大峰值参数阈值, 则所述第二带宽语音频信号为谐波 信号。
4、 根据权利要求 3所述的方法, 其特征在于, 还包括:
根据所述谐波频带数、所述最大峰值参数和所述第二带宽语音频信号 被判决为谐波信号的判决条件之间的关系, 更新谐波模式计数值;
若所述谐波频带数小于等于所述降低后的谐波频带数阈值且所述最 大峰值参数小于等于所述降低后的最大峰值参数阈值, 则所述方法还包 括:
若所述谐波模式计数值大于预设值, 则确定所述第二带宽语音频信号 为谐波信号。
5、 根据权利要求 4所述的方法, 其特征在于, 所述根据所述谐波频 带数、所述最大峰值参数和所述第二带宽语音频信号被判决为谐波信号的 判决条件之间的关系, 更新谐波模式计数值, 包括:
若所述谐波频带数大于所述谐波频带数阈值且所述最大峰值参数大 于所述最大峰值参数阈值, 则增加所述谐波模式计数值;
若所述谐波频带数小于等于所述谐波频带数阈值和 /或所述最大峰值 参数小于等于所述最大峰值参数阈值, 则减少所述谐波模式计数值。
6、 根据权利要求 4或 5所述的方法, 其特征在于, 所述第一带宽语 音频信号为宽带信号, 所述第二带宽语音频信号为超宽带信号, 所述根据 所述第一判决条件, 判决所述第二带宽语音频信号是否为谐波信号之前, 还包括:
计算所述超宽带信号的时域包络参数, 并提高瞬态信号判决条件中的 包络阈值;
若所述时域包络参数大于等于提高后的包络阈值, 则所述超宽带信号 为瞬态信号;
若所述时域包络参数小于提高后的包络阈值, 则默认所述超宽带信号 不是瞬态信号且默认所述超宽带信号的全局能量与所述宽带信号的全局 能量的比值在预设范围内。
7、 根据权利要求 6所述的方法, 其特征在于, 所述提高瞬态信号判 决条件中的包络阈值, 具体为:
若所述宽带信号为谐波信号, 则将所述包络阈值提高 3倍;
若所述宽带信号为非谐波信号, 则将所述包络阈值提高 2倍。
8、 根据权利要求 1或 2所述的方法, 其特征在于, 所述根据所述第 —判决条件, 判决所述第二带宽语音频信号是否为谐波信号, 包括:
根据所述第一判决条件将所述第二宽带语音频信号判决为谐波信号。
9、 根据权利要求 1-8所述的任一方法, 其特征在于, 还包括: 若所述第一带宽语音频信号不是谐波信号, 则调整所述谐波信号判决 条件得到第二判决条件, 以降低所述第二带宽语音频信号被判决为谐波信 号的可能性; 根据所述第二判决条件, 判决所述第二带宽语音频信号是否为谐波信 号。
10、 根据权利要求 9所述的方法, 其特征在于, 所述调整所述谐波信 号判决条件得到第二判决条件, 以降低所述第二带宽语音频信号被判决为 谐波信号的可能性, 包括:
提高所述第二带宽语音频信号被判决为谐波信号的判决条件中的谐 波频带数阈值和最大峰值参数阈值中的至少一个阈值;
所述根据所述第二判决条件, 判决所述第二带宽语音频信号是否为谐 波信号, 包括:
若所述谐波频带数大于提高后的谐波频带数阈值和 /或所述最大峰值 参数大于提高后的最大峰值参数阈值, 则所述第二带宽语音频信号为谐波 信号。
1 1、 根据权利要求 1-8所述的任一方法, 其特征在于, 还包括: 若所述第一带宽语音频信号不是谐波信号, 则将所述第二宽带语音频 信号判决为非谐波信号。
12、 一种编码装置, 其特征在于, 包括:
判决条件调整模块, 若第一带宽语音频信号是谐波信号, 则用于调整 第二带宽语音频信号被判决为谐波信号的判决条件得到第一判决条件, 以 提高所述第二带宽语音频信号被判决为谐波信号的可能性; 所述第一带宽 语音频信号为带宽切换前的语音频信号, 所述第二带宽语音频信号为带宽 切换后的语音频信号;
信号类型判决模块, 用于根据所述第一判决条件, 判决所述第二带宽 语音频信号是否为谐波信号。
13、 根据权利要求 12所述的装置, 其特征在于, 所述判决条件调整 模块具体用于放宽所述第二带宽语音频信号被判决为谐波信号的判决条 件, 放宽后的判决条件作为所述第一判决条件。
14、 根据权利要求 13所述的装置, 其特征在于, 所述判决条件调整 模块, 具体用于降低所述第二带宽语音频信号被判决为谐波信号的判决条 件中的谐波频带数阈值和最大峰值参数阈值中的至少一个阈值;
所述信号类型判决模块, 包括: 计算单元, 用于计算所述第二带宽语音频信号的谐波频带数和最大峰 值参数;
处理单元, 用于若所述谐波频带数大于降低后的谐波频带数阈值和 / 或所述最大峰值参数大于降低后的最大峰值参数阈值, 则所述第二带宽语 音频信号为谐波信号。
15、 根据权利要求 14所述的装置, 其特征在于, 还包括:
谐波模式更新模块, 用于根据所述谐波频带数、 所述最大峰值参数和 所述第二带宽语音频信号被判决为谐波信号的判决条件之间的关系, 更新 谐波模式计数值;
所述信号类型判决模块, 还用于若所述谐波频带数小于等于所述降低 后的谐波频带数阈值且所述最大峰值参数小于等于所述降低后的最大峰 值参数阈值且所述谐波模式计数值大于预设值, 则确定所述第二带宽语音 频信号为谐波信号。
16、 根据权利要求 15所述的装置, 其特征在于, 所述谐波模式更新 模块, 具体用于若所述谐波频带数大于所述谐波频带数阈值且所述最大峰 值参数大于所述最大峰值参数阈值, 则增加所述谐波模式计数值; 若所述 谐波频带数小于等于所述谐波频带数阈值和 /或所述最大峰值参数小于等 于所述最大峰值参数阈值, 则减少所述谐波模式计数值。
17、 根据权利要求 14或 15所述的装置, 其特征在于, 所述第一带宽 语音频信号为宽带信号, 所述第二带宽语音频信号为超宽带信号, 所述判 决条件调整模块还用于计算所述超宽带信号的时域包络参数, 并提高瞬态 信号判决条件中的包络阈值; 若所述时域包络参数大于等于提高后的包络 阈值, 则所述超宽带信号为瞬态信号, 若所述时域包络参数小于提高后的 包络阈值, 则默认所述超宽带信号不是瞬态信号且默认所述超宽带信号的 全局能量与所述宽带信号的全局能量的比值在预设范围内。
18、 根据权利要求 17所述的装置, 其特征在于, 所述判决条件调整 模块具体用于若所述宽带信号为谐波信号, 则将所述包络阈值提高 3倍, 若所述宽带信号为非谐波信号, 则将所述包络阈值提高 2倍。
19、 根据权利要求 12或 13所述的装置, 其特征在于, 所述信号类型 判决模块具体用于根据所述第一判决条件将所述第二宽带语音频信号判 决为谐波信号。
20、 根据权利要求 12-19所述的任一装置, 其特征在于, 所述判决条 件调整模块还用于若所述第一带宽语音频信号不是谐波信号, 则调整所述 谐波信号判决条件得到第二判决条件, 以降低所述第二带宽语音频信号被 判决为谐波信号的可能性;
所述信号类型判决模块, 还用于根据所述第二判决条件, 判决所述第 二带宽语音频信号是否为谐波信号。
21、 根据权利要求 20所述的装置, 其特征在于, 所述判决条件调整 模块, 具体用于提高所述第二带宽语音频信号被判决为谐波信号的判决条 件中的谐波频带数阈值和最大峰值参数阈值中的至少一个阈值;
所述信号类型判决模块, 具体用于若所述谐波频带数大于提高后的谐 波频带数阈值和 /或所述最大峰值参数大于提高后的最大峰值参数阈值,则 所述第二带宽语音频信号为谐波信号。
22、 根据权利要求 12-19所述的任一装置, 其特征在于, 所述信号类 型判决模块还用于若所述第一带宽语音频信号不是谐波信号, 则将所述第 二宽带语音频信号判决为非谐波信号。
PCT/CN2013/076862 2012-06-29 2013-06-06 语音频信号处理方法和编码装置 WO2014000559A1 (zh)

Priority Applications (13)

Application Number Priority Date Filing Date Title
EP17195365.6A EP3376499B1 (en) 2012-06-29 2013-06-06 Speech/audio signal processing method and coding apparatus
EP20150138.4A EP3748634B1 (en) 2012-06-29 2013-06-06 Speech/audio signal processing method and coding apparatus
JP2015518805A JP6359529B2 (ja) 2012-06-29 2013-06-06 会話/音声信号処理方法および符号化装置
EP13810131.6A EP2851897B1 (en) 2012-06-29 2013-06-06 Processing method for speech or audio signals and encoding apparatus thereof
KR1020177030314A KR101907494B1 (ko) 2012-06-29 2013-06-06 음성 또는 오디오 신호 처리 방법 및 인코딩 장치
KR1020187028697A KR102005967B1 (ko) 2012-06-29 2013-06-06 음성 또는 오디오 신호 처리 방법 및 인코딩 장치
ES13810131.6T ES2654488T3 (es) 2012-06-29 2013-06-06 Método de procesamiento para señales de voz o audio y aparato de codificación de las mismas
KR1020197021968A KR102165827B1 (ko) 2012-06-29 2013-06-06 음성 또는 오디오 신호 처리 방법 및 인코딩 장치
KR1020207028813A KR102331531B1 (ko) 2012-06-29 2013-06-06 음성 또는 오디오 신호 처리 방법 및 인코딩 장치
KR1020167035415A KR101790680B1 (ko) 2012-06-29 2013-06-06 음성 또는 오디오 신호 처리 방법 및 인코딩 장치
KR1020157000174A KR101689138B1 (ko) 2012-06-29 2013-06-06 음성 또는 오디오 신호 처리 방법 및 인코딩 장치
US14/562,494 US10056090B2 (en) 2012-06-29 2014-12-05 Speech/audio signal processing method and coding apparatus
US16/051,139 US11107486B2 (en) 2012-06-29 2018-07-31 Speech/audio signal processing method and coding apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201210223014.0A CN103516440B (zh) 2012-06-29 2012-06-29 语音频信号处理方法和编码装置
CN201210223014.0 2012-06-29

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US14/562,494 Continuation US10056090B2 (en) 2012-06-29 2014-12-05 Speech/audio signal processing method and coding apparatus

Publications (1)

Publication Number Publication Date
WO2014000559A1 true WO2014000559A1 (zh) 2014-01-03

Family

ID=49782211

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/076862 WO2014000559A1 (zh) 2012-06-29 2013-06-06 语音频信号处理方法和编码装置

Country Status (7)

Country Link
US (2) US10056090B2 (zh)
EP (3) EP3748634B1 (zh)
JP (3) JP6359529B2 (zh)
KR (6) KR101790680B1 (zh)
CN (1) CN103516440B (zh)
ES (3) ES2779857T3 (zh)
WO (1) WO2014000559A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103516440B (zh) * 2012-06-29 2015-07-08 华为技术有限公司 语音频信号处理方法和编码装置
BR112016020988B1 (pt) * 2014-03-14 2022-08-30 Telefonaktiebolaget Lm Ericsson (Publ) Método e codificador para codificação de um sinal de áudio, e, dispositivo de comunicação
CN106303878A (zh) * 2015-05-22 2017-01-04 成都鼎桥通信技术有限公司 一种啸叫检测和抑制方法
US10431242B1 (en) * 2017-11-02 2019-10-01 Gopro, Inc. Systems and methods for identifying speech based on spectral features

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060149538A1 (en) * 2004-12-31 2006-07-06 Samsung Electronics Co., Ltd. High-band speech coding apparatus and high-band speech decoding apparatus in wide-band speech coding/decoding system and high-band speech coding and decoding method performed by the apparatuses
CN101662288A (zh) * 2008-08-28 2010-03-03 华为技术有限公司 音频编码、解码方法及装置、系统
CN101964189A (zh) * 2010-04-28 2011-02-02 华为技术有限公司 语音频信号切换方法及装置

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0356736B2 (zh) * 1979-05-28 1991-08-29
US5574724A (en) * 1995-05-26 1996-11-12 Lucent Technologies Inc. Adjustment of call bandwidth during a communication call
US20050065786A1 (en) * 2003-09-23 2005-03-24 Jacek Stachurski Hybrid speech coding and system
FI115329B (fi) 2000-05-08 2005-04-15 Nokia Corp Menetelmä ja järjestely lähdesignaalin kaistanleveyden vaihtamiseksi tietoliikenneyhteydessä, jossa on valmiudet useisiin kaistanleveyksiin
KR100462611B1 (ko) * 2002-06-27 2004-12-20 삼성전자주식회사 하모닉 성분을 이용한 오디오 코딩방법 및 장치
FI119533B (fi) * 2004-04-15 2008-12-15 Nokia Corp Audiosignaalien koodaus
WO2006030865A1 (ja) * 2004-09-17 2006-03-23 Matsushita Electric Industrial Co., Ltd. スケーラブル符号化装置、スケーラブル復号化装置、スケーラブル符号化方法、スケーラブル復号化方法、通信端末装置および基地局装置
US8311840B2 (en) * 2005-06-28 2012-11-13 Qnx Software Systems Limited Frequency extension of harmonic signals
CN101263554B (zh) 2005-07-22 2011-12-28 法国电信公司 在比特率分级和带宽分级的音频解码中的比特率切换方法
CA2558595C (en) * 2005-09-02 2015-05-26 Nortel Networks Limited Method and apparatus for extending the bandwidth of a speech signal
KR101131880B1 (ko) * 2007-03-23 2012-04-03 삼성전자주식회사 오디오 신호의 인코딩 방법 및 장치, 그리고 오디오 신호의디코딩 방법 및 장치
BRPI0818927A2 (pt) * 2007-11-02 2015-06-16 Huawei Tech Co Ltd Método e aparelho para a decodificação de áudio
EP2224432B1 (en) * 2007-12-21 2017-03-15 Panasonic Intellectual Property Corporation of America Encoder, decoder, and encoding method
WO2010028301A1 (en) * 2008-09-06 2010-03-11 GH Innovation, Inc. Spectrum harmonic/noise sharpness control
CN101763856B (zh) * 2008-12-23 2011-11-02 华为技术有限公司 信号分类处理方法、分类处理装置及编码系统
JP4945586B2 (ja) * 2009-02-02 2012-06-06 株式会社東芝 信号帯域拡張装置
WO2011156905A2 (en) * 2010-06-17 2011-12-22 Voiceage Corporation Multi-rate algebraic vector quantization with supplemental coding of missing spectrum sub-bands
US9236063B2 (en) * 2010-07-30 2016-01-12 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for dynamic bit allocation
EP2770503B1 (en) * 2011-10-21 2019-05-29 Samsung Electronics Co., Ltd. Method and apparatus for concealing frame errors and method and apparatus for audio decoding
US9390722B2 (en) * 2011-10-24 2016-07-12 Lg Electronics Inc. Method and device for quantizing voice signals in a band-selective manner
GB2502800B (en) * 2012-06-07 2015-05-20 Jaguar Land Rover Ltd Crane and related method of operation
CN103516440B (zh) * 2012-06-29 2015-07-08 华为技术有限公司 语音频信号处理方法和编码装置
BR122020016403B1 (pt) * 2013-06-11 2022-09-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V Aparelho de decodificação de sinal de áudio, aparelho de codificação de sinal de áudio, método de decodificação de sinal de áudio e método de codificação de sinal de áudio
US9564141B2 (en) * 2014-02-13 2017-02-07 Qualcomm Incorporated Harmonic bandwidth extension of audio signals
US9697843B2 (en) * 2014-04-30 2017-07-04 Qualcomm Incorporated High band excitation signal generation

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060149538A1 (en) * 2004-12-31 2006-07-06 Samsung Electronics Co., Ltd. High-band speech coding apparatus and high-band speech decoding apparatus in wide-band speech coding/decoding system and high-band speech coding and decoding method performed by the apparatuses
CN101662288A (zh) * 2008-08-28 2010-03-03 华为技术有限公司 音频编码、解码方法及装置、系统
CN101964189A (zh) * 2010-04-28 2011-02-02 华为技术有限公司 语音频信号切换方法及装置

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2851897A4 *

Also Published As

Publication number Publication date
KR20160150107A (ko) 2016-12-28
EP3376499B1 (en) 2020-01-08
CN103516440A (zh) 2014-01-15
US20150095038A1 (en) 2015-04-02
KR101790680B1 (ko) 2017-10-26
ES2654488T3 (es) 2018-02-13
KR20170120209A (ko) 2017-10-30
US20180336910A1 (en) 2018-11-22
US10056090B2 (en) 2018-08-21
KR20200118252A (ko) 2020-10-14
JP2017134412A (ja) 2017-08-03
JP6892491B2 (ja) 2021-06-23
JP6359529B2 (ja) 2018-07-18
JP6612808B2 (ja) 2019-11-27
EP3748634B1 (en) 2022-08-10
EP2851897B1 (en) 2017-11-15
JP2015526754A (ja) 2015-09-10
KR101689138B1 (ko) 2016-12-23
KR20180112121A (ko) 2018-10-11
US11107486B2 (en) 2021-08-31
KR20190091374A (ko) 2019-08-05
JP2020024461A (ja) 2020-02-13
CN103516440B (zh) 2015-07-08
ES2930240T3 (es) 2022-12-09
ES2779857T3 (es) 2020-08-20
KR102331531B1 (ko) 2021-12-01
KR101907494B1 (ko) 2018-10-12
KR102165827B1 (ko) 2020-10-14
EP3748634A1 (en) 2020-12-09
KR20150021100A (ko) 2015-02-27
KR102005967B1 (ko) 2019-07-31
EP2851897A1 (en) 2015-03-25
EP3376499A1 (en) 2018-09-19
EP2851897A4 (en) 2015-06-24

Similar Documents

Publication Publication Date Title
JP6378274B2 (ja) 音声/オーディオ信号処理方法および装置
TWI661422B (zh) 用於音訊帶寬選擇之器件及裝置、操作一解碼器之方法及電腦可讀儲存器件
JP6462653B2 (ja) オーディオ・データを処理するための方法、装置、及びシステム
JP6892491B2 (ja) 会話/音声信号処理方法および符号化装置
US8805695B2 (en) Bandwidth expansion method and apparatus
CN105761724B (zh) 一种语音频信号处理方法和装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13810131

Country of ref document: EP

Kind code of ref document: A1

REEP Request for entry into the european phase

Ref document number: 2013810131

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2013810131

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2015518805

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20157000174

Country of ref document: KR

Kind code of ref document: A