WO2009089728A1 - Method for high frequency band replication, coder and decoder thereof - Google Patents

Method for high frequency band replication, coder and decoder thereof Download PDF

Info

Publication number
WO2009089728A1
WO2009089728A1 PCT/CN2008/073728 CN2008073728W WO2009089728A1 WO 2009089728 A1 WO2009089728 A1 WO 2009089728A1 CN 2008073728 W CN2008073728 W CN 2008073728W WO 2009089728 A1 WO2009089728 A1 WO 2009089728A1
Authority
WO
WIPO (PCT)
Prior art keywords
band
frequency
low
sub
high frequency
Prior art date
Application number
PCT/CN2008/073728
Other languages
French (fr)
Chinese (zh)
Inventor
Hongfei Ma
Qingwei Guo
Haibo Zhang
Bo Zhang
Lijing Xu
Qing Zhang
Jianfeng Xu
Wei Li
Zhengzhong Du
Chen Hu
Yi Yang
Lei Miao
Fengyan Qi
Original Assignee
Huawei Technologies Co., Ltd.
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd., Xidian University filed Critical Huawei Technologies Co., Ltd.
Publication of WO2009089728A1 publication Critical patent/WO2009089728A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a high frequency reconstruction method, an encoding module, and a decoding module.
  • High frequency reconstruction is a relatively critical technology.
  • High-frequency compression and recovery technology represented by SBR Spectrum Band Replication
  • SBR Spectrum Band Replication
  • the low frequency signal of audio or speech is passed through a digital filtering group to obtain a set of low frequency subband signals; and the low frequency subband group is used as a whole block signal to perform high frequency signal reproduction.
  • the method for replicating the entire high frequency band signal is to divide the high frequency band into segments according to the frequency from low to high, each segment having substantially the same bandwidth as the whole low frequency signal; then continuously copying the entire low frequency subband group to the high frequency Each segment of the band. In this way, the entire low-frequency sub-band group will be periodically used in the high-frequency band several times until the entire high-frequency band that needs to be recovered is copied.
  • FIG. 1 which is a schematic diagram of the overall translational replication of a low frequency sub-band in the prior art; 2) folding the whole low frequency sub-band group first, that is, inverting the sub-band arrangement order, and then The whole low frequency sub-band group is translated to the corresponding high frequency band.
  • FIG. 2 which is a schematic diagram of the overall folding translation of the prior art low frequency sub-band.
  • modes 1) and 2) may be used interchangeably.
  • the entire low frequency subband group is used periodically until the entire high frequency band that needs to be recovered is copied.
  • FIG. 4 is an energy waveform diagram of the prior art original audio and its sub-band signals (for straight In comparison, only the waveforms of the first 29 sub-bands are drawn in the figure).
  • Figure 5 is a three dimensional view of the energy waveforms of the various subbands of the prior art original audio.
  • FIG. 6 is an energy waveform diagram of each sub-band signal obtained by performing high-frequency reconstruction in the mode 1 of the prior art
  • FIG. 7 is a corresponding three-dimensional diagram of each sub-band energy.
  • 8 is an energy waveform of each sub-band signal obtained by performing high-frequency reconstruction in the mode 2 of the prior art
  • FIG. 9 is a corresponding three-dimensional diagram of each sub-band energy.
  • the structure of the waveform shown therein is: The lowest waveform is the original audio waveform; the 0th to 8th curves are low frequency subband waveforms, which will be used to replicate the high frequency subband; Between 8 and 9 is the dividing line between high frequency and low frequency; all sub-bands represented upward from the 9th curve are the range of high frequency reconstruction and processing.
  • the audio parameters described in the figure are the energy amplitude, the number of audio frames (30 frames) and the number of sub-bands (29). Corresponding to the processed 29 sub-bands shown in the energy waveform diagram.
  • the ninth sub-band or more is a high-frequency processing part.
  • the low frequency subband is passed through the low pass filter bank to obtain a set of low frequency subbands.
  • the selected low-frequency sub-band group is taken as a whole, and the high-frequency part to be recovered is continuously copied in the whole segment, and the sub-bands in the low-frequency sub-band group are respectively used to recover correspondingly.
  • the frequency of the harmonic component is often an integer multiple of its corresponding fundamental frequency. Accordingly, the prior art 2 proposes that if the subband numbers of some subbands of the high frequency portion are integer multiples of natural numbers such as 2, 3, 4, 5, in other words, some high frequency subbands and low frequency subbands There is a correspondence between multiples, then these sub-bands are likely to have rich harmonic components, which need to be restored.
  • FIG. 3 is a schematic diagram of discrete replication of the prior art two low frequency sub-bands.
  • the entire frequency band of the audio signal is divided into thirty sub-bands by sub-band filtering (sub-band numbers are 0, 1, 2 in order)
  • the low frequency sub-band group includes eight sub-bands such as 0, 1, and 27, and the high-frequency sub-band group to be recovered includes twenty-five sub-bands such as 8, 9, 10, 31 and 32. Among them, the low frequency subband group provides four consecutive sub-bands each time to complete the copy.
  • the first thing to start is the replication process II. Because the serial numbers of the 4th, 10th, 12th, and 14th subbands in the high frequency subband group are all integer multiples of 2, the 4th, 5th, 6th, and 7th subbands are selected from the low frequency subband group, and sequentially correspond to Restore the four high frequency sub-bands of 8, 10, 12, and 14. Then there is replication process III. The number of the five sub-bands of the 9th, 12th, 15th, 18th, and 21th bands in the high-frequency sub-band group is an integer multiple of 3, but the 12th sub-band has just been copied, and its position also affects the continuity of these sub-bands. Then, the third sub-bands of the third, fifth, sixth, and seventh sub-bands are selected from the low-frequency sub-band group, and the four high-frequency sub-bands of the 9, 15, 18, and 21 are sequentially restored.
  • the serial numbers of the 8th, 12th, 16th, 20th, 24th, 28th, etc. subbands in the high frequency subband group are all integer multiples of 4, but the 8th and 12th subbands have been copied, and then from the low frequency subband group.
  • Four sub-bands of 4, 5, 6, and 7 are selected, and the four high-frequency sub-bands of the 16, 20, 24, and 28 are sequentially restored.
  • serial numbers of the 10th, 15th, 20th, 25th, 30th, etc. subbands in the high frequency subband group are all integer multiples of 5, but the 10th, 15th, and 20th subbands have been copied, and only the low frequency subbands are needed.
  • the sixth and seventh sub-bands are selected in the group, and the two high-frequency sub-bands of the 25th and the 30th are sequentially restored.
  • the process of recovering the discretely distributed high frequency sub-bands with successive low frequency sub-band sets is completed.
  • the low-frequency sub-bands with similar waveforms are also selected to recover the missing high-frequency sub-bands, thereby completing the replication of all high-frequency sub-bands.
  • a low frequency sub-band according to the prior art is used as a whole block for periodic translational copying or folding replication.
  • double-frequency reproduction according to the prior art 2 both mechanically recovering harmonics, without considering the diversity and variability of the audio voice signal, and additionally copying and copying according to the sub-band number in sequence, due to the low frequency sub-band and high
  • the waveform of the frequency subband is originally different, so the reproduced high frequency sub-band may have a larger waveform difference or peak difference than the original high frequency sub-band, so the reconstructed high-frequency signal is not too high.
  • the technical problem to be solved by the embodiments of the present invention is to provide a high frequency reconstruction method, an encoding module, and a decoding module, which can perform high frequency reconstruction more accurately.
  • An embodiment of the present invention provides a high frequency reconstruction method, including: filtering an audio or voice signal to obtain a low frequency subband and a high frequency subband; determining a frequency band replication strategy; and replicating according to the determined frequency band
  • the strategy acquires the correlation between the low frequency subband and the high frequency subband, selects a low frequency subband as a copy frequency band for the high frequency subband with reference to the determined correlation, and outputs high frequency reconstruction parameter information including a correspondence relationship of the selected frequency band.
  • the embodiment of the present invention provides a high-frequency reconstruction method, including: receiving high-frequency reconstruction parameter information including a correspondence relationship of selected frequency bands, where the correspondence relationship of the selected frequency bands is specifically corresponding to a high-frequency sub-band determined by reference correlation. The relationship of the low frequency sub-bands; the low frequency sub-band is copied as a high frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands in the high frequency band.
  • An embodiment of the present invention provides an encoding module, including: an analysis filter module, configured to filter audio or voice signals to obtain a low frequency subband and a high frequency subband; and a frequency band selection module, configured to determine a frequency band replication strategy, Determining a correlation between the low frequency sub-band and the high frequency sub-band by using the determined frequency band replication strategy, selecting a low frequency sub-band as a copy frequency band for the high frequency sub-band with reference to the determined correlation, and outputting a correspondence relationship including the selected frequency band High frequency reconstruction parameter information.
  • An embodiment of the present invention provides a decoding module, including a high frequency generator module, where the high frequency generator module includes: a receiving unit, configured to receive high frequency reconstruction parameter information including a correspondence relationship of selected frequency bands, where the selected frequency band is The correspondence relationship is specifically a relationship between the high frequency sub-band determined by the reference correlation and the corresponding low frequency sub-band; the reconstruction unit is configured to, in the high frequency band, the low frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency band Copy as a high frequency sub-band.
  • the scheme of the embodiment of the present invention fully considers the correlation between the low frequency subband and the high frequency subband, and obtains the correlation between the low frequency subband and the high frequency subband according to the determined frequency band replication strategy.
  • the determined correlation is that the high frequency sub-band selects the low frequency sub-band as the copy frequency band, and outputs the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands, so that the high frequency reconstruction can be performed based on the high frequency reconstruction parameter information.
  • the energy waveform and the energy three-dimensional image of each sub-band signal obtained after the high-frequency reconstruction can be clearly found that the reconstructed high-frequency portion of the embodiment of the present invention is closer to the original audio signal, and the effect is better than the prior art solution. Therefore, the technical solution of the embodiment of the present invention can perform high frequency reconstruction more accurately.
  • FIG. 1 is a schematic diagram of an overall translational replication of a low frequency sub-band of the prior art
  • FIG. 2 is a schematic diagram of the overall folding translation of a low frequency sub-band of the prior art
  • 3 is a schematic diagram of discrete replication of a prior art two low frequency sub-band
  • 4 is an energy waveform diagram of a prior art original audio and its respective sub-band signals
  • 5 is a three-dimensional energy diagram of energy waveforms of each sub-band of the prior art original audio
  • 6 is an energy waveform diagram of each sub-band signal obtained by performing high-frequency reconstruction in the manner of the prior art 1;
  • FIG. 10 is a block diagram showing the principle and structure of a high frequency reconstruction according to an embodiment of the present invention.
  • FIG. 11 is a schematic diagram of a high frequency segmentation low frequency matching replication strategy according to an embodiment of the present invention.
  • FIG. 12 is a schematic diagram of a high frequency matching replication strategy for a low frequency dominant frequency band according to an embodiment of the present invention
  • FIG. 13 is a schematic diagram of a high frequency matching replication strategy for a low frequency band according to an embodiment of the present invention
  • FIG. 14 is a schematic diagram of an extended replication policy according to an embodiment of the present invention.
  • Figure 15 (a) is a structural block diagram of an adaptive band copying mode of an encoding end according to an embodiment of the present invention
  • Figure 15 (b) is a structural block diagram of a fixed-band copying mode of an encoding end of the embodiment of the present invention
  • Figure 16 is an adaptive embodiment of the present invention
  • FIG. 17 is a flow chart of selecting a frequency band replication strategy according to an embodiment of the present invention
  • 19 is a flowchart of an algorithm for detecting time-varying characteristics according to an embodiment of the present invention.
  • 20 is a schematic diagram of a high frequency reconstruction performed by a decoding end according to parameter information of an encoding end according to an embodiment of the present invention
  • 21 is a flowchart of an algorithm for decoding a high frequency generator according to an embodiment of the present invention.
  • FIG. 22 is an energy waveform diagram of the original audio signal after the method of the embodiment of the present invention is restored;
  • FIG. 23 is a three-dimensional diagram of the energy of the original audio signal recovered by the method of the embodiment of the present invention;
  • FIG. 24 is a schematic structural diagram of the encoding module according to the embodiment of the present invention; ;
  • 25 is a schematic structural diagram of an encoding module 2 according to an embodiment of the present invention.
  • FIG. 26 is a schematic structural diagram of a decoding module according to an embodiment of the present invention. detailed description
  • the embodiment of the invention provides a high frequency reconstruction method, which can perform high frequency reconstruction more accurately.
  • FIG. 10 is a schematic diagram and a structural block diagram of a high frequency reconstruction according to an embodiment of the present invention.
  • the upper part is the module related to high frequency processing in the encoding end
  • the lower part is the module related to high frequency processing in the decoding end.
  • the original audio signal is converted by the analysis filter module into subband signals distributed in different frequency bands, including low frequency subbands and high frequency subbands, and the low frequency subbands are encoded by the core encoder and transmitted to the decoding end.
  • the low frequency sub-bands are also processed to obtain parameter information that guides the high frequency reconstruction.
  • the low frequency sub-band passes through the analysis and detection module, and the obtained detection result is input to the frequency band selection module for guiding the analysis strategy of the frequency band selection module; the frequency band selection module selects according to the guidance information of the analysis and detection module.
  • a suitable replication strategy, and using the maximum correlation criterion to select a matching low frequency subband for each or each high frequency subband, and the envelope parameter extraction module performs envelope parameter extraction, and finally outputs the parameter information of the high frequency reconstruction.
  • the core decoder decodes and recovers the low-frequency sub-band signal by using the encoded information of the received low-frequency sub-band, and then copies the high-frequency sub-band according to the parameter information of the high-frequency reconstruction from the encoding end, and then performs envelope adjustment to obtain the reconstructed high.
  • the frequency subband; the last low frequency subband and high frequency subband signals are recovered by the synthesis filter to recover the full band audio and speech signals.
  • the frequency band replication strategy proposed by the embodiment of the present invention fully considers the correlation between the low frequency subband and the high frequency subband, and is also suitable for short-term characteristics and time-varying characteristics of audio and voice signals, and has flexible frequency band selection. The function ensures that the frequency bands used for copying and copying have an optimal correlation.
  • the following embodiments of the present invention propose the following three frequency band replication strategies and their extension strategies:
  • This strategy divides the high frequency band components into multiple copy bands, and the division of the copy bands can be performed in different ways.
  • the copy band can be divided according to the Bark scale band, and the sub-bands are grouped into a copy band and the like according to different resolutions in the Bark proportional band.
  • the resolution indicates the number of subbands included in the copy band. The smaller the number, the higher the resolution, and the lower the number, the lower the resolution.
  • the copy band division can also equally divide the high frequency component into a plurality of copy bands at the same resolution, and then select the most relevant low band for each high band for copying.
  • FIG. 11 it is a schematic diagram of a high frequency segmentation low frequency matching replication strategy according to an embodiment of the present invention.
  • B0 is the end subband of the low frequency subband signal
  • B1 is the end subband of the high frequency processed signal
  • bl, b2, b3 are the boundaries of the complex band division.
  • the strategy selects the most relevant low frequency sub-band for each copy band for copying, and the low-frequency sub-band can be reused as long as it has the greatest correlation with the copied copy band.
  • This replication strategy makes full use of the correlation between high frequency and low frequency, and is suitable for the case where the spectral envelope is relatively stable. Both high and low frequencies have good spectral envelope energy, because the high frequency components at this time have important audio components. Especially in the high-frequency band near the low-frequency part, if the high-frequency reproduction produces distortion, noise will be introduced and the sound quality will be affected.
  • the segment-selective copying strategy will select the most relevant low-frequency band for each high-band to copy, ensuring the inter-band. Correspondence can be used to avoid distortion caused by misuse of the frequency band.
  • the copying strategy has the difference that: the prior art continuously and repeatedly copies the high frequency signal by using the whole low frequency signal, and copies the low frequency subband with a large difference when the correlation between the high and low frequency signals is poor.
  • the high-frequency sub-band introduces a large distortion
  • the copying strategy uses the original high-frequency sub-band signal as a reference to select the low-frequency sub-band with the greatest correlation to copy, which can effectively avoid the band misuse problem.
  • the strategy first selects the frequency band in which the energy is concentrated in the low frequency signal as the dominant frequency band, and then selects a high frequency signal segment that is more correlated with the low frequency signal of the segment, and performs the frequency band in which the energy is concentrated in the selected low frequency signal segment in the high frequency signal segment.
  • the copy is used as a high frequency sub-band; for the high frequency signal segments, the remaining small frequency bands that have not been copied are respectively selected for the closest low frequency band to be copied.
  • the method first has good harmonics The frequency band is processed, and then the nearest low-frequency sub-band is selected for the scattered band between them. In the frequency band selection, the highest-frequency harmonics with the best correlation are selected first, and then the most relevant low-frequency sub-bands are selected by the non-harmonic high-frequency sub-bands.
  • FIG. 12 it is a schematic diagram of a high frequency matching replication strategy for a low frequency dominant frequency band according to an embodiment of the present invention.
  • the low-frequency signal of the energy concentration is selected, and then the high-frequency part with good correlation is selected according to the selected low-frequency signal, and the part is copied, for example, the high-band signals i and j are the copy frequency bands selected by the low-frequency signal, and then the i and The scattered high frequency band other than j selects the appropriate low frequency band for copying.
  • the high frequency matching replication strategy of the low frequency dominant frequency band utilizes the frequency domain harmonic characteristics of the signal to select high frequency harmonics of different orders for the fundamental frequency signals in the low frequency, and is suitable for speech and audio signals with good harmonic characteristics.
  • the interval of harmonics appearing in the high frequency part will gradually decrease, and the frequency range of harmonic coverage will gradually increase. Therefore, the conventional harmonic copying by the frequency doubling replication method will be generated.
  • High-frequency distortion, and the embodiment of the present invention can accurately determine the position of the high-frequency harmonic using the maximum correlation determination method, and also select the low-frequency sub-band for copying using the maximum correlation method for the scattered sub-band between the high-frequency harmonics. Ensure the integrity of the harmonics and compensate for the phenomenon of high frequency harmonic widening.
  • the copying strategy has the difference that the prior art adopts the frequency doubling copying method, and uses the low frequency subband signal to copy to the high frequency in multiples, and the formed high frequency harmonics contain harmonics of different low frequency signals.
  • the wave destroys the continuity of the harmonics, and the copying strategy continuously replaces the fundamental frequency signal with the high frequency harmonics to ensure harmonic continuity without causing high frequency distortion.
  • This strategy treats the entire low-band as a reference-band signal and then selectively replicates the harmonics in the high-band according to the optimal matching principle.
  • FIG. 13 it is a schematic diagram of a low frequency band high frequency matching replication strategy according to an embodiment of the present invention.
  • the high frequency bands i and j are selected harmonic components more related to the low frequency signal, and the entire low frequency band is copied at this position, and the scattered subband between them still uses the maximum correlation copy band selection method to select the low frequency subband copy. .
  • Such a harmonic selection replication strategy is suitable for an audio signal with a relatively smooth spectral envelope and a high-frequency energy drop, and a low-frequency energy.
  • Such an audio signal generally has an index of high-frequency harmonic energy that increases with an order of magnitude. Decrease, because the high-frequency energy is small, it can be regarded as a mixture of harmonics and noise, so the entire low-frequency signal can be selectively copied to the high-frequency, but the high-frequency sub-band replication accuracy near the low frequency band at a low code rate Very important, you should make detailed selection of relevant bands.
  • the copying strategy has the following differences: the prior art uses a low-frequency signal to continuously copy high-frequency signals, and the copying strategy considers the low-frequency as a whole and selects and low-frequency signals from the high-frequency components.
  • This method of expansion strategy is to use the lower frequency high frequency subband band previously obtained by high frequency copying for higher frequency band reproduction.
  • the band selection range is extended, and the low frequency signal and a small number of high frequency subbands adjacent to the low frequency signal are regarded as an integral part, and then strategy 1 and strategy 2 are used. Or the method in Strategy 3 determines the replication band. The most relevant low frequency sub-band replica is selected for the small number of high frequency sub-bands added.
  • the range of the extended band ie, the high frequency sub-band as the copy source at the time of recovery
  • each extended band is selected for copying using the maximum correlation criterion.
  • the band selection uses the highest band resolution (ie, a single subband is the copy band); then the extended band and the low frequency subband are combined as a copy source, and then Strategy 1, Strategy 2 or Strategy 3 is used to select the replication band for the high frequency subband.
  • FIG. 14 it is a schematic diagram of an extended replication policy according to an embodiment of the present invention.
  • the lower frequency bands of the low frequency band and the copy frequency band 1 are combined into a continuous frequency band, and used for signal reconstruction of the complex frequency band 2 and above.
  • This copying strategy is suitable for use at low bitrates because the low frequency band handled by the core codec is shorter at low bitrates, may not cover all fundamental frequency overtones, and is more harmonic in the mid-range than low-frequency overtones. Close to the high frequency overtone characteristics, the reconstructed signal can be used for higher frequency band reproduction after ensuring that the lower frequency high frequency signal is reconstructed with higher resolution. Through this method of copying, harmonics can be more completely depicted, which is conducive to expanding the range of high-frequency reconstruction.
  • Adaptive band copying mode As shown in FIG. 15(a), it is a structural block diagram of an encoding end adaptive band copying mode according to an embodiment of the present invention. In this way, the characteristics of the audio signal are detected by the energy language analysis and estimation method, and the guidance information is output according to the detection result for guiding the selection of the replication strategy, thereby guiding the optimal frequency band selection.
  • the characteristics of the speech and audio signals are usually the same in a certain period of time, that is, the quasi-stationary characteristic, so the introduction of the time-varying characteristic detection only needs to re-band the band when the time-varying characteristic variable is greater than the tolerance. select.
  • FIG. 15 (b) it is a structural block diagram of the fixed-band copying mode of the encoding end of the embodiment of the present invention.
  • the fixed copy mode is selected in advance according to actual needs, that is, one of the above-mentioned proposed band copy policies (such as policy 1, policy 2, policy 3 or their extended policies) is determined in the entire audio. It remains unchanged during processing and is implemented in conjunction with appropriate optimal frequency band selection.
  • the fixed-band copy mode does not need to guide the selection of the band copy strategy according to the result of the short-term characteristic analysis module, the copy policy is specified by setting parameters, so the short-term characteristic analysis module is not needed.
  • the high frequency reconstruction method of the adaptive band copy method will be described below.
  • FIG. 16 is a flowchart of a high frequency reconstruction method in an adaptive band replication mode according to an embodiment of the present invention, including the steps:
  • Step 1601 Perform short-term characteristic analysis on the sub-band signal obtained by the analysis filter module;
  • Step 1602 perform a frequency band replication strategy selection according to the result of the short-term characteristic analysis;
  • Step 1603 Perform optimal frequency band selection according to the selected frequency band replication policy.
  • Step 1604 Perform frequency band replication according to the optimal frequency band.
  • Step 1601 Perform short-term characteristic analysis on the sub-band signal obtained by the analysis filter module. For the original audio signal, convert the sub-band signal distributed in different frequency bands by the analysis filter module, and perform short-term characteristic analysis on the sub-band signal. .
  • Short-term characterization is a preparation for selecting the appropriate band replication strategy.
  • the audio or speech signal is first subjected to time-frequency transform, and then the energy distribution of the harmonic condition, the low frequency part and the high frequency part is analyzed, and the analyzed parameter result determines the band replication strategy.
  • the low frequency sub-band sample point be. ⁇ , /).
  • the high-frequency sub-band sample be ⁇ ( W), where: represents the high frequency subband, k 0 ⁇ k ⁇ k e , which is the end subband of the high frequency processing.
  • Step 1602 Perform a frequency band replication policy selection according to a result of short-term characteristic analysis
  • FIG. 17 is a flowchart of selecting a frequency band replication policy according to an embodiment of the present invention.
  • the judgment process consisting of Strategy 2, Strategy 3 and Strategy 1 is the main body of policy selection, and the band replication expansion strategy is used as an auxiliary strategy, mainly in the process of recovering high frequency, for the case where the low frequency band is relatively narrow, and the expansion is used for copying.
  • the width of the low frequency band improves the integrity of the fundamental frequency.
  • the band replica expansion strategy can make the selection of the low frequency subbands of the high frequency portion not too restrictive. It bundles several mid-high frequency sub-bands and low-frequency parts that would otherwise require band replication to form a new low-frequency part for most of the high-frequency sub-bands to choose and extract their corresponding copy parameters.
  • the best-matched low-frequency sub-bands are selected from the original low-frequency parts, and the copy parameters are extracted in turn.
  • the Band Replication Extension Policy is an extension of Policy 2, Policy 3, and Policy 1.
  • the band copy extension policy is used. Therefore, when there is an extend_flag output, the selected policy 2, policy 3 or policy 1 will become an extended policy 2, an extended strategy 3 or an extended strategy 1 accordingly.
  • Step 1701 The time-frequency transform is completed, and a QMF (Quadature Mirror Filter) sub-band is input;
  • Step 1702 determining whether the input sub-band is in a low coding rate mode, if yes, proceeding to step 1703, if no, proceeding to step 1705;
  • Step 1703 expanding the range of the low frequency part participating in the copy, forming a new low frequency part, proceeding to step 1704;
  • Step 1704 the output flag extend_flag, for the band copy expansion strategy, proceeds to step 1705;
  • Step 1705 determines whether there is a low frequency subband with low energy, if not, proceeds to step 1708, and if yes, proceeds to step 1706;
  • Step 1706 searching for a low frequency subband with higher energy and continuous distribution and determining a selection strategy 2; This step mainly looks for a subband interval in which the energy distribution of the low frequency portion is continuous as the fundamental frequency portion of the strategy 2.
  • the judgment algorithm is as follows:
  • Step 1707 Output the flag flag corresponding to the policy 2, and output the subband sequence number n and the subband interval number q. It should be noted that if the extend_flag is output at the same time, the current policy is the extended policy 2.
  • Step 1708 determining whether the high frequency energy average is too low compared with the low frequency energy average, if not, proceeding to step 1709, and if yes, proceeding to step 1710;
  • step 1705 If the low-frequency sub-band energy and the low-frequency energy mean ⁇ are compared by step 1705, the following formula is satisfied: £(")> * ⁇ , the emphasis of the analysis is shifted to the energy relationship between the low-frequency and high-frequency parts.
  • Step 1709 Output the corresponding flag of the policy 1 Flag; It should be noted that if the extend_flag is output at the same time, the current policy is the extended policy 1.
  • Step 1710 Output the corresponding flag of the policy 3 Flag; It should be noted that if the extend_flag is output at the same time, the current policy is the extended policy 3.
  • Step 1603 Perform optimal frequency band selection according to the selected frequency band replication policy.
  • the optimal frequency band selection module uses the maximum correlation as a standard to flexibly search for the optimal matching frequency band for copying of a reference frequency band, ensuring the correlation of frequency band replication, so that the copied high frequency signal does not need excessive adjustment, and approaches the original signal.
  • the optimal high-low frequency signal correspondence is selected.
  • the band replication strategy guides the optimal band selection, determines whether the band selection is to select a low frequency signal with a high frequency signal, or select a high frequency harmonic with a low frequency signal, for example, In the first round, the optimal frequency band is selected to select the best low frequency signal for copying for each high frequency copy band signal, and in strategy 2, the optimal frequency band selection first selects the high frequency that can be copied for the base frequency signal. harmonic.
  • the initialization band table guides the estimated bandwidth of the optimal band selection, as well as the selected band range.
  • the optimal frequency band selection is to compare the high and low frequency signal correlation and the similarity degree of the high and low frequency signal envelope characteristics, and then determine the optimal high and low frequency signal matching relationship based on the obtained two parameters.
  • the signal is normalized according to the initial band table range before the estimation, so that the matching signal is selected to analyze the similarity of the signal characteristics, and the energy difference can be Make adjustments when the signal is reconstructed.
  • the algorithms used for optimal band selection are the same.
  • the following strategy 1 is taken as an example, and a single subrepresentative representing the maximum frequency resolution is taken.
  • the band is the copy band, indicating the general algorithm for optimal band selection.
  • Step 1801 divide a copy frequency band and an alternate frequency band according to an initialized frequency band table in the frequency band copy policy information, and input an input sub-band signal according to an initialized frequency band table. Perform high and low frequency division;
  • nb Let the number of copy bands be represented by nb, and set the low-frequency sub-band samples with ⁇ (", where 1 ⁇ " ⁇ refers to the low-frequency sub-band number, which is the first sub-band number of high-frequency processing, 0 ⁇ / ⁇ 32 Indicates the sample points in the sub-band, and sets the high-frequency sub-band samples as ⁇ ⁇ , ⁇ ), where ⁇ indicates the high-frequency sub-band, which is the end sub-band of the high-frequency processing.
  • Step 1802 Perform normalization processing on the high and low frequency signals in the length of the copy frequency band according to the division of the copy frequency band in the initialization band table.
  • the copy frequency band is equal to the transform subband
  • the normalization process is as follows: ⁇ (0) X m k !, step 1803, calculated for each frequency band or a high frequency band with the correlation function may be low frequency or low frequency band; offset considering the case of sample, in order to obtain the best approximation of the waveform of the high-band low frequency band of the low frequency band After the sample is offset, the correlation function is calculated.
  • r k ["] Represents the correlation function value of the high and low frequency bands after the sample offset. If the time-frequency transform uses a complex transform form, the sub-band samples are complex-valued, and the real part can be analyzed. In the offset correlation function with a certain high frequency band, the correlation value is selected for each low frequency band. The above calculation is performed on all the high frequency bands, and the maximum correlation matrix ax WW composed of ["] is obtained, and R max WW records the maximum correlation values of all the high and low frequency bands. Step 1804, estimating the variation characteristics of the high and low frequency band envelopes, and calculating the high frequency Band and low band envelope variation characteristics;
  • the envelope characteristic estimation method is to treat the sample point in the length of the replication frequency band as a sample, calculate its autocorrelation function within the second order, and then obtain the high and low frequency envelope by comparing the mean square error of the autocorrelation function of the high and low frequency bands. Difference in characteristics.
  • the reference band is not only the low frequency sub-band, but may be all the bands before the detection band.
  • a band selection table W K - +1] is generated indicating the optimum frequency band required for copying.
  • Step 1603 After the optimal frequency band selection is performed according to the selected frequency band replication policy, the optimal frequency band may be used all the time, and the time-varying characteristic detection may be further performed, and the frequency band is reselected according to the detection result.
  • Audio and speech signals generally have the same characteristics (ie quasi-stationary characteristics) over a period of time, so the same high-frequency duplication strategy may be used in successive frames, with the same copy strategy, based on the audio signal Time periodicity, the same copy band selection table can be used in consecutive frames, and it is not necessary to perform optimal band selection every frame.
  • the band selection table Once the band selection table is determined, it can be used continuously for multiple frames, which can save calculation amount and transmission bit rate. It can also guarantee the continuity between frames.
  • a time-varying characteristic detecting module is introduced. The function of time-varying characteristic detection is to determine whether the current frame can use the band selection table of the previous frame.
  • the time-varying characteristic detecting method is to estimate the audio characteristic change of the low-frequency signal of the current frame and the previous frame, and the envelope difference comparison method may be specifically used. If the envelope difference is small, according to the correlation between high and low frequencies, the difference of the high frequency signal will be 4 ⁇ , and the band selection table generated by the previous frame can be used; if the current low frequency signal and the previous frame low frequency signal The envelope difference is within the tolerance range, but there is a frequency offset.
  • the optimal frequency band needs to be re-selected, and the band selection table is refreshed because the two sets of overtones are in accordance with the pitch shift theory.
  • the frequency difference within the defined critical band is between 5% and 50%, and the two sets of overtones are dysfunctional, producing auditory perceptible differences.
  • Step 1904 determining whether the offset frequency ⁇ / is less than 5% of the current critical band bandwidth, and if so, proceeds to step 1905, and if not, proceeds to step 1906;
  • Step 1905 A band selection table generated by using a previous frame is used
  • Step 1906 Perform optimal band selection again.
  • the short-term characteristic analysis of the sub-band signals obtained by the analysis filter module is not required, and the short-term characteristics are not required.
  • the result of the analysis is selected for the band replication strategy, and the flow of the optimal band selection process and the time-varying characteristic detection is the same.
  • Step 1604 Perform frequency band replication according to the optimal frequency band. After the decoding end obtains the optimal frequency band, the frequency band copying can be performed according to the optimal frequency band.
  • FIG. 20 it is a schematic diagram of the high frequency reconstruction performed by the decoding end according to the parameter information of the encoding end according to the embodiment of the present invention.
  • the decoding end of the embodiment of the present invention does not change the function and cooperation relationship of most modules, but only the "high frequency sub-band copying strategy" of the high frequency generator module is modified.
  • three parameters are added according to the high frequency reconstruction guide parameter information, namely "new algorithm use flag", "band selection table replacement flag” and "band selection table".
  • the "band selection table” is a relatively important parameter, which records the copy correspondence between the high frequency sub-band and the low frequency sub-band when the high frequency sub-band is recovered for each frame signal.
  • the "new algorithm uses flags" determines whether high-frequency reconstruction is guided by a new algorithm at the encoding end or high-frequency reconstruction using a standard SBR method.
  • the new algorithm referred to herein refers to the algorithm used in the process of finally determining the high frequency reconstruction parameters at the encoding end by the embodiment of the present invention described above. If the flag is "1”, the high frequency is reconstructed according to the new algorithm; if the flag is "0", the high frequency is reconstructed according to the standard SBR method. With this setup, the interface can be reserved for compatibility between the new algorithm and the standard SBR method in subsequent studies.
  • the "band selection table replacement flag” determines how the current signal obtains the "band selection table” when the high frequency sub-band is restored. If the flag bit is "0", the current signal directly extends the high-low frequency sub-band correspondence of the previous frame signal to guide the high-frequency copy; if the flag bit is "1", the refresh is read according to the read in the SBR code stream.
  • the "band selection table” parameter completes the high frequency copy.
  • the main role of the "band selection table replacement flag” is to reduce the amount of data transmitted to the decoder. It should be noted that when the "band selection table replacement flag" is 0, the "band selection table” parameter will not be included in the transmitted SBR code stream.
  • the code stream information received by the decoder is described in detail below.
  • Sbr_start frequency SBR field start QMF subband selection pointer, select corresponding element in the band selection Int (integer) table StartTable
  • Sbr_ stop—frequency SBR field terminates QMF subband selection pointer, selects Int in frequency band Select the corresponding element in the table StopTable
  • Header extra— 1 Indicates if there is additional information, value 0 or 1 Int
  • FreqScale SBR band contains the maximum number of QMF subbands used to calculate Int
  • Sbr_limiter band The number of bands that are limited when calculating the gain, used to calculate the limit frequency Int
  • ENV—DATA is the data structure that describes the SBR information for each frame.
  • the parameters "New Algorithm Usage Flag”, "Band Selection Table Replacement Flag” and “Band Selection Table” are added to the ENV DATA structure describing the SBR information.
  • the "band selection table” stores the corresponding serial numbers of the high frequency sub-band and the low frequency sub-band that need to be restored, and is stored.
  • FreTable[28] In the array FreTable[28]. Among them, different encoding modes have different numbers of high frequency sub-bands to be recovered. For the highest bit rate encoding mode, the number of high frequency sub-bands that need to be recovered is 28. As the encoding bit rate decreases, the number of high frequency sub-bands that need to be recovered also decreases accordingly.
  • FIG. 21 it is a flow chart of the decoder high frequency generator module algorithm according to the embodiment of the present invention, including the steps:
  • Step 2101 Receive "new algorithm use flag", “band selection table replacement flag”, and "band selection” Table";
  • Step 2102 judging "new algorithm use flag", if it is 0, proceeds to step 2103, if it is 1, proceeds to step 2104;
  • Step 2103 decoding according to a standard SBR method
  • Step 2104 judging "band selection table replacement flag", if it is 0, proceeds to step 2105, if it is 1, proceeds to step 2106;
  • Step 2105 If the flag bit is "0", the current signal directly extends the high-low frequency sub-band correspondence of the signal of the previous frame, and guides the high-frequency copy;
  • the band selection table of the current frame is backed up in the buffer.
  • the next frame signal is called on the band selection table in the buffer, if needed.
  • Step 2106 If the flag bit is "1", the high frequency copy is guided according to the "band selection table" parameter read in the SBR code stream;
  • Step 2107 Perform preliminary high frequency copying.
  • the initial replication of the high-frequency sub-band After the initial replication of the high-frequency sub-band, it will enter the module of envelope adjustment and harmonic component addition to complete the high-frequency replication.
  • the high frequency reconstruction method of the embodiment of the invention more accurately realizes the reconstruction of the high frequency signal.
  • FIG. 22 is an energy waveform diagram of the original audio signal after the method of the embodiment of the present invention is restored;
  • FIG. 23 is a three-dimensional diagram of the energy of the original audio signal after the method of the embodiment of the present invention is restored.
  • the method of the embodiment of the present invention can reconstruct the high frequency signal more accurately by using the information of a few low frequency subbands, and is also beneficial for compressing the audio information, can greatly improve the compression efficiency of the audio and speech encoder, and improve the audio quality; Reduce distortion and noise caused by bit rate audio and speech signal compression coding. And for different audio characteristics, a variety of corresponding frequency band replication strategies are proposed, which can provide adaptive high-frequency reconstruction methods for various audio and voice signals, and improve the flexibility of audio and voice signal processing.
  • the embodiment of the present invention provides an encoding module and a decoding module.
  • FIG. 24 it is a schematic structural diagram of an encoding module according to an embodiment of the present invention.
  • the encoding module includes: an analysis filter module 241 and a band selection module 242.
  • the analysis filter module 241 is configured to filter the audio or voice signal to obtain a low frequency sub-band and a high frequency sub-band.
  • a frequency band selection module 242 configured to determine a frequency band replication policy, obtain a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-frequency for the high frequency sub-band with reference to the determined correlation
  • a low frequency sub-band having a high correlation is selected as a high frequency sub-band as an optimum copy band, and high-frequency reconstruction parameter information including a correspondence relationship of the selected band is output.
  • the encoding module further includes: a short-term characteristic analysis module 243 for performing short-term characteristic analysis on the audio or voice signal.
  • the band selection module 242 includes: a replication policy selection module 2421, an optimal band selection module 2422.
  • the replication policy selection module 2421 is configured to select a different frequency band replication policy according to the analysis result of the short-term characteristic analysis module 243;
  • An optimal frequency band selection module 2422 configured to acquire a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-band as a high frequency sub-band as a reference with reference to the determined correlation
  • the frequency band for example, a high frequency sub-band selects a low frequency sub-band having a large correlation as an optimal copy frequency band, and outputs high frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands.
  • the encoding module further includes: a time-varying characteristic detecting module 244, configured to perform time-varying characteristic detection on the filtered audio or voice signal; correspondingly, the optimal frequency band selecting module 2422 is further detected according to the time-varying characteristic detecting module 244. As a result, the low frequency sub-band is selected, for example, the optimal copy band is selected.
  • the short-term characteristic analysis module 243 performs short-term characteristic analysis on the audio or voice signal, specifically: calculating a low-frequency partial energy average of the audio or voice signal, a high-frequency partial energy average, a low-frequency partial sub-band energy value, and a high frequency.
  • the replication strategy selection module 2421 correspondingly selects different frequency band replication strategies according to the analysis result of the short-term characteristic analysis module 243, specifically: comparing the energy values of the sub-bands in the low-frequency part with the average value of the low-frequency parts; The sub-band energy value of the partial sub-band of the low-frequency part is less than or equal to the first weighting value of the low-frequency part energy average, and the selected strategy is: selecting the low-frequency sub-band of the energy concentration, and selecting the selected low-frequency sub-band of the energy concentration The high frequency band of the large portion replicates the low frequency sub-band; if the sub-band energy value of the low frequency partial sub-band is greater than the first weighting value of the low-frequency partial energy mean, the second weighting of the high-frequency partial energy mean and the low-frequency partial energy mean is further performed.
  • the value is compared; if the high-frequency partial energy mean is less than or equal to the second weight of the low-frequency partial energy mean Election
  • the selected strategy is: selecting the entire low frequency sub-band, and copying the low frequency sub-band in the selected high frequency band having a large correlation with the entire low frequency sub-band; if the high-frequency partial energy average is greater than the second weighting value of the low-frequency partial energy average, selecting The strategy is as follows: The high frequency is divided into multiple copy frequency bands, and the low frequency sub-bands with large correlation are selected for each copy frequency band for copying.
  • the strategy selected by the replication policy selection module 2421 further includes: further copying, by copying, a high frequency sub-band adjacent to the low frequency sub-band together with the selected low frequency sub-band as a copy source, and the high frequency sub-band of the adjacent low frequency sub-band Select the low-frequency sub-band copy with high correlation, which is equivalent to the expansion strategy of each of the above strategies.
  • FIG. 25 it is a schematic structural diagram of an encoding module 2 according to an embodiment of the present invention.
  • the encoding module includes: an analysis filter module 241, and a band selection module 242.
  • the analysis filter module 241 is configured to receive an audio or voice signal and perform filtering processing to obtain a low frequency subband and a high frequency subband.
  • a frequency band selection module 242 configured to determine a frequency band replication policy, obtain a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-frequency for the high frequency sub-band with reference to the determined correlation
  • a low frequency sub-band having a high correlation is selected as a high frequency sub-band as an optimum frequency band, and high-frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands is output.
  • the band selection module 242 includes: a copy policy setting module 2423, an optimal band selection module 2422.
  • the copy policy setting module 2423 is configured to determine a unique band copying policy according to the preset parameters.
  • the band replication policy is one of the policies described in connection with FIG. 24 or its corresponding extension policy.
  • An optimal frequency band selection module 2422 configured to acquire a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-band as a high frequency sub-band as a reference with reference to the determined correlation
  • the frequency band for example, a high frequency sub-band selects a low frequency sub-band having a large correlation as an optimum frequency band, and outputs high frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands.
  • the encoding module further includes: a time-varying characteristic detecting module 244, configured to perform time-varying characteristic detection on the filtered audio or voice signal; correspondingly, the optimal frequency band selecting module further selects according to a result detected by the time-varying characteristic detecting module The low frequency subband, for example, selects the optimal copy band.
  • FIG. 26 it is a schematic structural diagram of a decoding module according to an embodiment of the present invention.
  • the decoding module includes a high frequency generator module 261, and the high frequency generator module 261 includes: The receiving unit 2611 and the reconstructing unit 2612.
  • the receiving unit 2611 is configured to receive high-frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands, where the correspondence relationship of the selected frequency bands is specifically a relationship between the high-frequency sub-band determined by the reference correlation and the corresponding low-frequency sub-band, for example, correlation
  • the large low frequency sub-band corresponds to the high frequency sub-band.
  • the reconstruction unit 2612 is configured to copy the low frequency sub-band as a high frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands in the high frequency band.
  • the parameter information received by the receiving unit 2611 further includes a value of the new algorithm using the flag and a band selection table replacement flag; the reconstruction unit 2612 determines an algorithm used by the copying process according to the value of the new algorithm using the flag, according to the The band selection table replacement flag determines a band selection table used in the copying process, and the low frequency sub-band in the correspondence is copied as a high frequency sub-band in the high band according to the determined algorithm and the band selection table.
  • the solution of the embodiment of the present invention fully considers the correlation between the low frequency subband and the high frequency subband, and obtains the correlation between the low frequency subband and the high frequency subband according to the determined frequency band replication strategy, and refers to the determination.
  • the correlation is that the high frequency sub-band selects the low frequency sub-band as the copy frequency band, and outputs the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands, so that the high frequency reconstruction can be performed based on the high frequency reconstruction parameter information.
  • the energy waveform and the energy three-dimensional image of each sub-band signal obtained after the high-frequency reconstruction can be clearly found that the reconstructed high-frequency portion of the embodiment of the present invention is closer to the original audio signal, and the effect is better than the prior art solution. Therefore, the technical solution of the embodiment of the present invention can perform high frequency reconstruction more accurately.
  • the solution of the embodiment of the present invention may include an adaptive band copy mode and a fixed band copy mode, and has a flexible band selection function.
  • the technical solution of the embodiment of the present invention may further increase the time-varying characteristic detection of the audio or voice signal, and perform adjustment according to the detection result.

Abstract

A method for high frequency band replication, which comprises: to filter the audio or voice signal to gain the low frequency sub-band and the high frequency sub-band; to determine the strategy of spectral band replication; to gain the correlation between said low frequency sub-band and high frequency sub-band according to said determined strategy of spectral band replication, and to select the low frequency sub-band as the replication frequency band for the high frequency sub-band according to said correlation, and to output the information of high frequency band replication parameters which includes the correspondence relation of the selected frequency band. The present invention else provides a method for high frequency band replication, which comprises: to receive the information of high frequency band replication parameters which includes the correspondence relation of the selected frequency band, and said correspondence relation of the selected frequency band is the relation between the high frequency sub-band and the corresponding low frequency sub-band thereof, which is determined by the correlation; In the high band, to replicate the high frequency sub-band by the low frequency sub-band according to said information of high frequency band replication parameters which includes the correspondence relation of the selected frequency band. Accordingly, the present invention provides a coder and decoder. The solution of the embodiment of the present invention can fulfill the high frequency replication more exactly.

Description

高频重建方法、 编码模块和解码模块  High frequency reconstruction method, coding module and decoding module
本申请要求于 2007 年 12 月 27 日提交中国专利局、 申请号为 200710305087.3、 发明名称为 "高频重建方法、 编码模块和解码模块" 的中国 专利申请的优先权, 其全部内容通过引用结合在本申请中。  This application claims priority to Chinese Patent Application No. 200710305087.3, entitled "High Frequency Reconstruction Method, Encoding Module and Decoding Module", filed on December 27, 2007, the entire contents of which are incorporated by reference. In this application.
技术领域 Technical field
本发明涉及通信技术领域,具体涉及一种高频重建方法、编码模块和解码 模块。  The present invention relates to the field of communications technologies, and in particular, to a high frequency reconstruction method, an encoding module, and a decoding module.
背景技术 Background technique
在音频和语音处理技术中,进行高频重建是一项比较关键的技术。 以频带 复制 ( SBR, Spectral Band Replication )为代表的高频压缩和恢复技术是目前 较好的一种高频重建方法, 它把低频段的波形复制到高频段部分,再利用编码 时已提取的能量调整参数和谐波调整参数对复制的高频段进行修复,从而达到 高频重建的目的。  In audio and speech processing, high frequency reconstruction is a relatively critical technology. High-frequency compression and recovery technology represented by SBR (Spectral Band Replication) is a better high-frequency reconstruction method. It copies the waveform of the low-frequency band to the high-frequency section and then extracts the extracted frequency. The energy adjustment parameters and the harmonic adjustment parameters repair the copied high frequency band to achieve the purpose of high frequency reconstruction.
现有技术利用低频频段信号进行高频重建的方法主要有两种,请参阅以下 介绍:  There are two main methods for performing high frequency reconstruction using low frequency band signals in the prior art, please refer to the following introduction:
现有技术一:  Prior art one:
把音频或语音的低频信号通过一个数字滤波组 , 得到一组低频子带信号; 再把该低频子带组作为一个整块信号来进行高频信号的复制。整个高频频段信 号的复制方法是将高频频段按照频率由低到高划分成若干段,每段与上述整块 低频信号的带宽大致相同;然后将整块低频子带组连续复制到高频频段的每一 段。 这样, 整块低频子带组会被周期性地在高频频段使用若干次, 直到整个需 要恢复的高频频段都被复制完成为止, 具体方式有两种: 1 )把整块低频子带 组平移到对应的高频频段, 该方式可参阅图 1 , 是现有技术一低频子带的整体 平移复制示意图; 2 )把整块低频子带组先折叠, 即颠倒子带排列顺序, 再把 整块低频子带组平移到对应的高频频段, 该方式可参阅图 2, 是现有技术一低 频子带的整体折叠平移示意图。 复制过程中, 方式 1 )和 2 )可能会交叉使用。 这样,整块低频子带组会被周期性地使用, 直到整个需要恢复的高频频段都被 复制完成为止。  The low frequency signal of audio or speech is passed through a digital filtering group to obtain a set of low frequency subband signals; and the low frequency subband group is used as a whole block signal to perform high frequency signal reproduction. The method for replicating the entire high frequency band signal is to divide the high frequency band into segments according to the frequency from low to high, each segment having substantially the same bandwidth as the whole low frequency signal; then continuously copying the entire low frequency subband group to the high frequency Each segment of the band. In this way, the entire low-frequency sub-band group will be periodically used in the high-frequency band several times until the entire high-frequency band that needs to be recovered is copied. There are two specific ways: 1) The whole low-frequency sub-band group Translating to the corresponding high frequency band, the method can be referred to FIG. 1 , which is a schematic diagram of the overall translational replication of a low frequency sub-band in the prior art; 2) folding the whole low frequency sub-band group first, that is, inverting the sub-band arrangement order, and then The whole low frequency sub-band group is translated to the corresponding high frequency band. This method can be seen in FIG. 2 , which is a schematic diagram of the overall folding translation of the prior art low frequency sub-band. During the copying process, modes 1) and 2) may be used interchangeably. Thus, the entire low frequency subband group is used periodically until the entire high frequency band that needs to be recovered is copied.
请参阅图 4, 是现有技术原始音频及其各子带信号的能量波形图 (为了直 观比较, 图中只画出前 29个子带的波形)。 图 5是现有技术原始音频各子带能 量波形的三维图。 图 6是采用现有技术一中的方式 1 )进行高频重建后得到的 各子带信号的能量波形图, 图 7是对应的各子带能量三维图。 图 8是采用现有 技术一中的方式 2 )进行高频重建后得到的各子带信号的能量波形, 图 9是对 应的各子带能量三维图。 对于能量波形图, 其中所示波形的结构均是: 最下面 的波形是原始音频波形; 标号第 0到第 8条曲线是低频子带波形, 它们将被用 来复制高频子带; 标号第 8和第 9之间是高频和低频的分界线; 从第 9条曲线 向上代表的所有子带, 都是高频重建和处理的范围。 对于能量三维图, 图中描 述的音频参数分别是能量幅度, 音频帧数( 30帧)和子带数( 29个)。 对应着 能量波形图所示的处理过的 29个子带。 其中, 第 9个子带以上是高频处理部 分。 Please refer to FIG. 4, which is an energy waveform diagram of the prior art original audio and its sub-band signals (for straight In comparison, only the waveforms of the first 29 sub-bands are drawn in the figure). Figure 5 is a three dimensional view of the energy waveforms of the various subbands of the prior art original audio. FIG. 6 is an energy waveform diagram of each sub-band signal obtained by performing high-frequency reconstruction in the mode 1 of the prior art, and FIG. 7 is a corresponding three-dimensional diagram of each sub-band energy. 8 is an energy waveform of each sub-band signal obtained by performing high-frequency reconstruction in the mode 2 of the prior art, and FIG. 9 is a corresponding three-dimensional diagram of each sub-band energy. For the energy waveform diagram, the structure of the waveform shown therein is: The lowest waveform is the original audio waveform; the 0th to 8th curves are low frequency subband waveforms, which will be used to replicate the high frequency subband; Between 8 and 9 is the dividing line between high frequency and low frequency; all sub-bands represented upward from the 9th curve are the range of high frequency reconstruction and processing. For the energy three-dimensional map, the audio parameters described in the figure are the energy amplitude, the number of audio frames (30 frames) and the number of sub-bands (29). Corresponding to the processed 29 sub-bands shown in the energy waveform diagram. Among them, the ninth sub-band or more is a high-frequency processing part.
现有技术二:  Prior art 2:
把低频子带通过低通滤波器组,得到一组低频子带。在此不再像现有技术 一那样,把选取的低频子带组作为一个整体,整段地连续复制需要恢复的高频 部分, 而是利用低频子带组中的子带,分别对应地恢复一些离散分布的高频子 带。  The low frequency subband is passed through the low pass filter bank to obtain a set of low frequency subbands. Here, as in the prior art 1, the selected low-frequency sub-band group is taken as a whole, and the high-frequency part to be recovered is continuously copied in the whole segment, and the sub-bands in the low-frequency sub-band group are respectively used to recover correspondingly. Some discretely distributed high frequency sub-bands.
在高频部分,如果有非常丰富的谐波分量, 那么谐波分量的频率很多都是 其相应基频的整数倍。 据此, 现有技术二提出, 如果高频部分某些子带的子带 序号是 2、 3、 4、 5等自然数的整数倍, 换句话说, 某些高频子带和低频子带 之间存在倍数的对应关系, 那么这些子带很可能存在丰富的谐波成分, 需要重 点恢复。  In the high frequency part, if there is a very rich harmonic component, the frequency of the harmonic component is often an integer multiple of its corresponding fundamental frequency. Accordingly, the prior art 2 proposes that if the subband numbers of some subbands of the high frequency portion are integer multiples of natural numbers such as 2, 3, 4, 5, in other words, some high frequency subbands and low frequency subbands There is a correspondence between multiples, then these sub-bands are likely to have rich harmonic components, which need to be restored.
请参阅图 3 , 是现有技术二低频子带的离散复制示意图。 音频信号的整个 频段通过子带滤波会被划分三十三个子带 (子带序号依次为 0、 1、 2  Please refer to FIG. 3 , which is a schematic diagram of discrete replication of the prior art two low frequency sub-bands. The entire frequency band of the audio signal is divided into thirty sub-bands by sub-band filtering (sub-band numbers are 0, 1, 2 in order)
31、 32 )。 其中, 低频子带组包括第 0、 1、 2 7等八个子带, 需要恢复 的高频子带组包括第 8、 9、 10 31、 32等二十五个子带。 其中, 低频 子带组每次尽量提供四个连续子带来完成复制。  31, 32). The low frequency sub-band group includes eight sub-bands such as 0, 1, and 27, and the high-frequency sub-band group to be recovered includes twenty-five sub-bands such as 8, 9, 10, 31 and 32. Among them, the low frequency subband group provides four consecutive sub-bands each time to complete the copy.
首先开始的是复制过程 II。 因为高频子带组中的第 8、 10、 12、 14四个子 带的序号都是 2的整数倍, 则从低频子带组中选取第 4、 5、 6、 7四个子带, 依次对应恢复第 8、 10、 12、 14四个高频子带。 然后是复制过程 III。 高频子带组中的第 9、 12、 15、 18、 21五个子带的序 号都是 3的整数倍, 但第 12子带刚刚被复制过, 其位置还影响了这些子带的 连续性, 则从低频子带组中选取第 3、 5、 6、 7四个子带, 依次对应恢复第 9、 15、 18、 21四个高频子带。 The first thing to start is the replication process II. Because the serial numbers of the 4th, 10th, 12th, and 14th subbands in the high frequency subband group are all integer multiples of 2, the 4th, 5th, 6th, and 7th subbands are selected from the low frequency subband group, and sequentially correspond to Restore the four high frequency sub-bands of 8, 10, 12, and 14. Then there is replication process III. The number of the five sub-bands of the 9th, 12th, 15th, 18th, and 21th bands in the high-frequency sub-band group is an integer multiple of 3, but the 12th sub-band has just been copied, and its position also affects the continuity of these sub-bands. Then, the third sub-bands of the third, fifth, sixth, and seventh sub-bands are selected from the low-frequency sub-band group, and the four high-frequency sub-bands of the 9, 15, 18, and 21 are sequentially restored.
接着是复制过程 IV。 高频子带组中的第 8、 12、 16、 20、 24、 28等子带的 序号都是 4的整数倍, 但第 8、 12子带已被复制过, 则从低频子带组中选取第 4、 5、 6、 7四个子带, 依次对应恢复第 16、 20、 24、 28四个高频子带。  This is followed by the replication process IV. The serial numbers of the 8th, 12th, 16th, 20th, 24th, 28th, etc. subbands in the high frequency subband group are all integer multiples of 4, but the 8th and 12th subbands have been copied, and then from the low frequency subband group. Four sub-bands of 4, 5, 6, and 7 are selected, and the four high-frequency sub-bands of the 16, 20, 24, and 28 are sequentially restored.
最后是复制过程 。 高频子带组中的第 10、 15、 20、 25、 30等子带的序 号都是 5的整数倍, 但第 10、 15、 20子带已被复制过, 则只需从低频子带组 中选取第 6、 7两个子带, 依次对应恢复第 25、 30两个高频子带。  Finally, the copy process. The serial numbers of the 10th, 15th, 20th, 25th, 30th, etc. subbands in the high frequency subband group are all integer multiples of 5, but the 10th, 15th, and 20th subbands have been copied, and only the low frequency subbands are needed. The sixth and seventh sub-bands are selected in the group, and the two high-frequency sub-bands of the 25th and the 30th are sequentially restored.
这样,用连续的低频子带组恢复离散分布的高频子带的过程就完成。最后, 对于上述方法遗漏的高频子带,还要选取波形与之相近的低频子带,对遗漏的 高频子带进行恢复, 从而完成所有高频子带的复制。  Thus, the process of recovering the discretely distributed high frequency sub-bands with successive low frequency sub-band sets is completed. Finally, for the high-frequency sub-bands missed by the above method, the low-frequency sub-bands with similar waveforms are also selected to recover the missing high-frequency sub-bands, thereby completing the replication of all high-frequency sub-bands.
在对现有技术的研究和实践过程中 , 发明人发现现有技术存在以下问题: 现有技术中无论按现有技术一把低频子带作为一个整块进行周期性地平 移复制或折叠复制 ,还是按现有技术二进行倍频复制 ,都是机械性地恢复谐波, 没有考虑音频语音信号的多样性和变化性,另外复制时候是按照子带序号依次 提取复制, 由于低频子带和高频子带的波形本来就不同, 所以被复制的高频子 带和原始的高频子带相比,可能会存在较大波形差别或峰值差异, 因此重建的 高频信号准确性不是太高。 如观察上述提到的波形图, 对比可发现, 现有技术 方法重建后的波形和原始波形差异较大; 观察上述提到的能量波形图,对比的 结果表明, 现有技术方法重建后失去了许多高频谐波。  In the research and practice of the prior art, the inventors have found that the prior art has the following problems: In the prior art, a low frequency sub-band according to the prior art is used as a whole block for periodic translational copying or folding replication. Or double-frequency reproduction according to the prior art 2, both mechanically recovering harmonics, without considering the diversity and variability of the audio voice signal, and additionally copying and copying according to the sub-band number in sequence, due to the low frequency sub-band and high The waveform of the frequency subband is originally different, so the reproduced high frequency sub-band may have a larger waveform difference or peak difference than the original high frequency sub-band, so the reconstructed high-frequency signal is not too high. If the waveform diagram mentioned above is observed, it can be found that the waveform reconstructed by the prior art method has a large difference between the original waveform and the original waveform; the energy waveform diagram mentioned above is observed, and the comparison result shows that the prior art method is lost after reconstruction. Many high frequency harmonics.
发明内容 Summary of the invention
本发明实施例要解决的技术问题是提供一种高频重建方法、编码模块和解 码模块, 能够更准确的进行高频重建。  The technical problem to be solved by the embodiments of the present invention is to provide a high frequency reconstruction method, an encoding module, and a decoding module, which can perform high frequency reconstruction more accurately.
为解决上述技术问题 , 本发明所提供的实施例是通过以下技术方案实现 的:  In order to solve the above technical problem, the embodiment provided by the present invention is implemented by the following technical solutions:
本发明实施例提供一种高频重建方法, 包括: 对音频或语音信号进行滤波 处理得到低频子带和高频子带; 确定频带复制策略;根据所述确定的频带复制 策略获取所述低频子带与高频子带的相关性,参考所述确定的相关性为高频子 带选择低频子带作为复制频带,并输出包括选择频带的对应关系的高频重建参 数信息。 An embodiment of the present invention provides a high frequency reconstruction method, including: filtering an audio or voice signal to obtain a low frequency subband and a high frequency subband; determining a frequency band replication strategy; and replicating according to the determined frequency band The strategy acquires the correlation between the low frequency subband and the high frequency subband, selects a low frequency subband as a copy frequency band for the high frequency subband with reference to the determined correlation, and outputs high frequency reconstruction parameter information including a correspondence relationship of the selected frequency band. .
本发明实施例提供一种高频重建方法, 包括:接收包括选择频带的对应关 系的高频重建参数信息,所述选择频带的对应关系具体为参考相关性确定的高 频子带与其相对应的低频子带的关系;在高频段根据所述包括选择频带的对应 关系的高频重建参数信息将低频子带复制作为高频子带。  The embodiment of the present invention provides a high-frequency reconstruction method, including: receiving high-frequency reconstruction parameter information including a correspondence relationship of selected frequency bands, where the correspondence relationship of the selected frequency bands is specifically corresponding to a high-frequency sub-band determined by reference correlation. The relationship of the low frequency sub-bands; the low frequency sub-band is copied as a high frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands in the high frequency band.
本发明实施例提供一种编码模块, 包括: 分析滤波器模块, 用于对音频或 语音信号进行滤波处理得到低频子带和高频子带; 频带选择模块, 用于确定频 带复制策略,根据所述确定的频带复制策略获取所述低频子带与高频子带的相 关性, 参考所述确定的相关性为高频子带选择低频子带作为复制频带, 并输出 包括选择频带的对应关系的高频重建参数信息。  An embodiment of the present invention provides an encoding module, including: an analysis filter module, configured to filter audio or voice signals to obtain a low frequency subband and a high frequency subband; and a frequency band selection module, configured to determine a frequency band replication strategy, Determining a correlation between the low frequency sub-band and the high frequency sub-band by using the determined frequency band replication strategy, selecting a low frequency sub-band as a copy frequency band for the high frequency sub-band with reference to the determined correlation, and outputting a correspondence relationship including the selected frequency band High frequency reconstruction parameter information.
本发明实施例提供一种解码模块,含有高频生成器模块, 所述高频生成器 模块包括:接收单元,用于接收包括选择频带的对应关系的高频重建参数信息 , 所述选择频带的对应关系具体为参考相关性确定的高频子带与其相对应的低 频子带的关系; 重建单元, 用于在高频段根据所述包括选择频带的对应关系的 高频重建参数信息将低频子带复制作为高频子带。  An embodiment of the present invention provides a decoding module, including a high frequency generator module, where the high frequency generator module includes: a receiving unit, configured to receive high frequency reconstruction parameter information including a correspondence relationship of selected frequency bands, where the selected frequency band is The correspondence relationship is specifically a relationship between the high frequency sub-band determined by the reference correlation and the corresponding low frequency sub-band; the reconstruction unit is configured to, in the high frequency band, the low frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency band Copy as a high frequency sub-band.
上述技术方案可以看出,本发明实施例方案充分考虑低频子带和高频子带 的相关性,通过根据确定的频带复制策略获取所述低频子带与高频子带的相关 性, 参考所述确定的相关性为高频子带选择低频子带作为复制频带, 并输出包 括选择频带的对应关系的高频重建参数信息 ,从而可以根据这些高频重建参数 信息进行高频重建。从高频重建后得到的各子带信号的能量波形图和能量三维 图可以明显的发现, 本发明实施例重建后的高频部分与原始音频信号较为接 近,效果比现有技术方案更好, 因此本发明实施例技术方案能够更准确的进行 高频重建。  The foregoing technical solution can be seen that the scheme of the embodiment of the present invention fully considers the correlation between the low frequency subband and the high frequency subband, and obtains the correlation between the low frequency subband and the high frequency subband according to the determined frequency band replication strategy. The determined correlation is that the high frequency sub-band selects the low frequency sub-band as the copy frequency band, and outputs the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands, so that the high frequency reconstruction can be performed based on the high frequency reconstruction parameter information. The energy waveform and the energy three-dimensional image of each sub-band signal obtained after the high-frequency reconstruction can be clearly found that the reconstructed high-frequency portion of the embodiment of the present invention is closer to the original audio signal, and the effect is better than the prior art solution. Therefore, the technical solution of the embodiment of the present invention can perform high frequency reconstruction more accurately.
附图说明 DRAWINGS
图 1是现有技术一低频子带的整体平移复制示意图;  1 is a schematic diagram of an overall translational replication of a low frequency sub-band of the prior art;
图 2是现有技术一低频子带的整体折叠平移示意图;  2 is a schematic diagram of the overall folding translation of a low frequency sub-band of the prior art;
图 3是现有技术二低频子带的离散复制示意图; 图 4是现有技术原始音频及其各子带信号的能量波形图; 3 is a schematic diagram of discrete replication of a prior art two low frequency sub-band; 4 is an energy waveform diagram of a prior art original audio and its respective sub-band signals;
图 5是现有技术原始音频各子带能量波形的能量三维图;  5 is a three-dimensional energy diagram of energy waveforms of each sub-band of the prior art original audio;
图 6是采用现有技术一中的方式 1 )进行高频重建后得到的各子带信号的 能量波形图;  6 is an energy waveform diagram of each sub-band signal obtained by performing high-frequency reconstruction in the manner of the prior art 1;
图 7是采用现有技术一中的方式 1 )进行高频重建后得到的各子带能量三 维图;  7 is a three-dimensional diagram of energy of each sub-band obtained by performing high-frequency reconstruction in the manner of the prior art;
图 8是采用现有技术一中的方式 2 )进行高频重建后得到的各子带信号的 能量波形图;  8 is an energy waveform diagram of each sub-band signal obtained by performing high-frequency reconstruction in the manner of the prior art;
图 9是采用现有技术一中的方式 2 )进行高频重建后得到的各子带能量三 维图;  9 is a three-dimensional diagram of energy of each sub-band obtained by performing high-frequency reconstruction in the manner of the prior art;
图 10是本发明实施例高频重建的原理和结构框图;  10 is a block diagram showing the principle and structure of a high frequency reconstruction according to an embodiment of the present invention;
图 11是本发明实施例高频分段低频匹配复制策略示意图;  11 is a schematic diagram of a high frequency segmentation low frequency matching replication strategy according to an embodiment of the present invention;
图 12是本发明实施例低频主导频段高频匹配复制策略示意图; 图 13是本发明实施例低频频段高频匹配复制策略示意图;  12 is a schematic diagram of a high frequency matching replication strategy for a low frequency dominant frequency band according to an embodiment of the present invention; FIG. 13 is a schematic diagram of a high frequency matching replication strategy for a low frequency band according to an embodiment of the present invention;
图 14是本发明实施例扩展的复制策略示意图;  14 is a schematic diagram of an extended replication policy according to an embodiment of the present invention;
图 15 ( a )是本发明实施例编码端自适应频带复制方式的结构框图; 图 15 ( b )是本发明实施例编码端固定频带复制方式的结构框图; 图 16是本发明实施例自适应频带复制方式的高频重建方法流程图; 图 17是本发明实施例频带复制策略选择流程图;  Figure 15 (a) is a structural block diagram of an adaptive band copying mode of an encoding end according to an embodiment of the present invention; Figure 15 (b) is a structural block diagram of a fixed-band copying mode of an encoding end of the embodiment of the present invention; Figure 16 is an adaptive embodiment of the present invention; FIG. 17 is a flow chart of selecting a frequency band replication strategy according to an embodiment of the present invention; FIG.
图 18是本发明实施例最优频带选择流程图;  18 is a flowchart of optimal frequency band selection in an embodiment of the present invention;
图 19是本发明实施例时变特性检测的算法流程图;  19 is a flowchart of an algorithm for detecting time-varying characteristics according to an embodiment of the present invention;
图 20是本发明实施例解码端根据编码端参数信息进行高频重建的示意 图;  20 is a schematic diagram of a high frequency reconstruction performed by a decoding end according to parameter information of an encoding end according to an embodiment of the present invention;
图 21是本发明实施例解码端高频生成器算法流程图;  21 is a flowchart of an algorithm for decoding a high frequency generator according to an embodiment of the present invention;
图 22是本发明实施例方法对原始音频信号恢复后的能量波形图; 图 23是本发明实施例方法对原始音频信号恢复后的能量三维图; 图 24是本发明实施例编码模块一结构示意图;  22 is an energy waveform diagram of the original audio signal after the method of the embodiment of the present invention is restored; FIG. 23 is a three-dimensional diagram of the energy of the original audio signal recovered by the method of the embodiment of the present invention; FIG. 24 is a schematic structural diagram of the encoding module according to the embodiment of the present invention; ;
图 25是本发明实施例编码模块二结构示意图;  25 is a schematic structural diagram of an encoding module 2 according to an embodiment of the present invention;
图 26是本发明实施例解码模块结构示意图。 具体实施方式 FIG. 26 is a schematic structural diagram of a decoding module according to an embodiment of the present invention. detailed description
本发明实施例提供了一种高频重建方法, 能够更准确的进行高频重建。 请参阅图 10, 是本发明实施例高频重建的原理和结构框图。  The embodiment of the invention provides a high frequency reconstruction method, which can perform high frequency reconstruction more accurately. Please refer to FIG. 10, which is a schematic diagram and a structural block diagram of a high frequency reconstruction according to an embodiment of the present invention.
如图 10所示,其中上半部分是编码端中与高频处理相关的模块, 下半部分 是解码端中与高频处理相关的模块。  As shown in Fig. 10, the upper part is the module related to high frequency processing in the encoding end, and the lower part is the module related to high frequency processing in the decoding end.
在编码端 ,原始音频信号通过分析滤波器模块转换为分布在不同频段的子 带信号, 包括低频子带和高频子带,低频子带将被核心编码器编码并传输给解 码端, 另外, 还将对低频子带进行处理得到指导高频重建的参数信息。 具体来 说, 在编码端, 低频子带经过分析与检测模块, 得到的检测结果输入到频带选 择模块, 用于指导频带选择模块的分析策略; 频带选择模块根据分析与检测模 块的指导信息,选择适合的复制策略, 并用最大相关准则为每个或每段高频子 带选择匹配的低频子带, 而包络参数提取模块进行包络参数提取, 最后输出高 频重建的参数信息。  At the encoding end, the original audio signal is converted by the analysis filter module into subband signals distributed in different frequency bands, including low frequency subbands and high frequency subbands, and the low frequency subbands are encoded by the core encoder and transmitted to the decoding end. The low frequency sub-bands are also processed to obtain parameter information that guides the high frequency reconstruction. Specifically, at the encoding end, the low frequency sub-band passes through the analysis and detection module, and the obtained detection result is input to the frequency band selection module for guiding the analysis strategy of the frequency band selection module; the frequency band selection module selects according to the guidance information of the analysis and detection module. A suitable replication strategy, and using the maximum correlation criterion to select a matching low frequency subband for each or each high frequency subband, and the envelope parameter extraction module performs envelope parameter extraction, and finally outputs the parameter information of the high frequency reconstruction.
在解码端,核心解码器利用接收到的低频子带的编码信息解码恢复低频子 带信号,再按照来自编码端的高频重建的参数信息复制高频子带, 然后进行包 络调整得到重建的高频子带;最后低频子带和高频子带的信号通过综合滤波器 处理恢复出全频带的音频和语音信号。  At the decoding end, the core decoder decodes and recovers the low-frequency sub-band signal by using the encoded information of the received low-frequency sub-band, and then copies the high-frequency sub-band according to the parameter information of the high-frequency reconstruction from the encoding end, and then performs envelope adjustment to obtain the reconstructed high. The frequency subband; the last low frequency subband and high frequency subband signals are recovered by the synthesis filter to recover the full band audio and speech signals.
以下先介绍本发明实施例的频带复制策略:  The frequency band replication strategy of the embodiment of the present invention is first described below:
现有技术传统的频带复制方法通常选定一定范围的低频段作为基频,然后 运用某种固定的规则向高频部分复制。例如平移复制、倍频关系复制和折叠复 制等,这样的复制方法限制了高频重建的准确性。 因为虽然高频与低频成分具 有一定的相关性,但是平移和折叠复制不能确保这种相关性的对应, 用于复制 的低频子带与被复制的高频子带相关性可能不是很好,甚至相关性较差,这样 就会引入噪声或改变音质; 倍频复制方法利用了谐波周期性,但是并非每个子 带都是谐波成分, 相关性也不好, 这样也会引入噪声或改变音质; 加之语音和 音频信号中声源的多样性, 用某种固定的复制方法更容易引起复制频带的错 用。  Conventional conventional band replication methods typically select a range of low frequency bands as the fundamental frequency and then replicate to the high frequency portion using some fixed rule. For example, translational replication, multiplier relationship replication, and folding replication, such replication methods limit the accuracy of high frequency reconstruction. Because although the high frequency and low frequency components have a certain correlation, the translation and folding replication cannot ensure the correspondence of this correlation, and the correlation between the low frequency subbands for copying and the copied high frequency subbands may not be very good, even Poor correlation, which will introduce noise or change the sound quality; the frequency doubling method utilizes harmonic periodicity, but not every sub-band is a harmonic component, and the correlation is not good, which will introduce noise or change the sound quality. In addition to the diversity of sound sources in speech and audio signals, it is more likely to cause misuse of the copying band by some fixed copying method.
本发明实施例提出的频带复制策略,充分考虑低频子带和高频子带的相关 性, 另外还适合音频和语音信号的短时特性和时变特性,具有灵活的频带选择 功能,可以保证用于复制和被复制的频带具有最优的相关性。本发明实施例提 出以下三种频带复制策略及其它们的扩展策略: The frequency band replication strategy proposed by the embodiment of the present invention fully considers the correlation between the low frequency subband and the high frequency subband, and is also suitable for short-term characteristics and time-varying characteristics of audio and voice signals, and has flexible frequency band selection. The function ensures that the frequency bands used for copying and copying have an optimal correlation. The following embodiments of the present invention propose the following three frequency band replication strategies and their extension strategies:
( 1 ) 高频分段低频匹配复制策略(策略 1 ):  (1) High-frequency segmentation low-frequency matching replication strategy (strategy 1):
此策略将高频频段成分划分成多个复制频带 ,复制频带的划分可以按照不 同的方法进行。 例如复制频带可根据 Bark (巴克标度) 比例带进行划分, 并 在 Bark比例带内按不同解析度将子带分组构成复制频带等。 解析度表示复制 频带包含的子带个数, 个数越少, 解析度越高, 个数越多, 解析度越低。 在 Bark带内划分复制频带时, 解析度随频率的升高而下降。 复制频带划分也可 以按相同解析度将高频成分等分为多个复制频带,然后为每个高频段选择最相 关的低频段进行复制。  This strategy divides the high frequency band components into multiple copy bands, and the division of the copy bands can be performed in different ways. For example, the copy band can be divided according to the Bark scale band, and the sub-bands are grouped into a copy band and the like according to different resolutions in the Bark proportional band. The resolution indicates the number of subbands included in the copy band. The smaller the number, the higher the resolution, and the lower the number, the lower the resolution. When the copy band is divided within the Bark band, the resolution decreases as the frequency increases. The copy band division can also equally divide the high frequency component into a plurality of copy bands at the same resolution, and then select the most relevant low band for each high band for copying.
如图 11所示, 是本发明实施例高频分段低频匹配复制策略示意图。 B0是 低频子带信号的结束子带, B1是高频处理信号的结束子带, bl , b2, b3是复 制频带划分边界。 划分复制频带后,本策略分别为每个复制频带选取最相关的 低频子带用于复制,低频子带可以重复使用, 只要保证其与被复制的复制频带 具有最大相关性。  As shown in FIG. 11, it is a schematic diagram of a high frequency segmentation low frequency matching replication strategy according to an embodiment of the present invention. B0 is the end subband of the low frequency subband signal, B1 is the end subband of the high frequency processed signal, and bl, b2, b3 are the boundaries of the complex band division. After dividing the copy frequency band, the strategy selects the most relevant low frequency sub-band for each copy band for copying, and the low-frequency sub-band can be reused as long as it has the greatest correlation with the copied copy band.
这种复制策略充分利用了高频与低频的相关性,适用于谱包络较平稳的情 况, 高低频都具有良好的谱包络能量, 因为此时的高频成分带有重要的音频成 分, 尤其是靠近低频部分的高频带信号, 如果高频复制产生失真, 将会引入噪 声、影响音质,分段选择复制策略为每个高频段选择最大相关的低频段进行复 制, 确保了频段间的相关性对应, 可以避免错用频带引起的失真。  This replication strategy makes full use of the correlation between high frequency and low frequency, and is suitable for the case where the spectral envelope is relatively stable. Both high and low frequencies have good spectral envelope energy, because the high frequency components at this time have important audio components. Especially in the high-frequency band near the low-frequency part, if the high-frequency reproduction produces distortion, noise will be introduced and the sound quality will be affected. The segment-selective copying strategy will select the most relevant low-frequency band for each high-band to copy, ensuring the inter-band. Correspondence can be used to avoid distortion caused by misuse of the frequency band.
本复制策略与现有技术相比, 不同之处是: 现有技术一用整块低频信号连 续和重复地复制高频信号,在高低频信号相关性差时会用差异较大的低频子带 复制高频子带, 引入较大的失真, 而本复制策略以原始高频子带信号为基准, 选择与其具有最大相关性的低频子带来复制, 可以有效避频带错用问题。  Compared with the prior art, the copying strategy has the difference that: the prior art continuously and repeatedly copies the high frequency signal by using the whole low frequency signal, and copies the low frequency subband with a large difference when the correlation between the high and low frequency signals is poor. The high-frequency sub-band introduces a large distortion, and the copying strategy uses the original high-frequency sub-band signal as a reference to select the low-frequency sub-band with the greatest correlation to copy, which can effectively avoid the band misuse problem.
( 2 )低频主导频段高频匹配复制策略(策略 2 ):  (2) Low frequency dominant frequency band high frequency matching replication strategy (strategy 2):
此策略首先选取低频信号中能量集中的频段作为主导频段,然后选出与此 段低频信号较相关的高频信号段,在该高频信号段将所述选取的低频信号中能 量集中的频段进行复制作为高频子带;对于高频信号段那些尚未被复制的剩余 的小频段则分别为其选择最接近的低频段来复制。该方法先对具有良好谐波特 性的频带进处理, 然后对其间的零散频带选择最接近的低频子带来复制。频带 选择时先是用低频选择相关性最好的高频谐波,然后再用非谐波的高频子带选 择最相关的低频子带。 The strategy first selects the frequency band in which the energy is concentrated in the low frequency signal as the dominant frequency band, and then selects a high frequency signal segment that is more correlated with the low frequency signal of the segment, and performs the frequency band in which the energy is concentrated in the selected low frequency signal segment in the high frequency signal segment. The copy is used as a high frequency sub-band; for the high frequency signal segments, the remaining small frequency bands that have not been copied are respectively selected for the closest low frequency band to be copied. The method first has good harmonics The frequency band is processed, and then the nearest low-frequency sub-band is selected for the scattered band between them. In the frequency band selection, the highest-frequency harmonics with the best correlation are selected first, and then the most relevant low-frequency sub-bands are selected by the non-harmonic high-frequency sub-bands.
如图 12所示, 是本发明实施例低频主导频段高频匹配复制策略示意图。 首先选取能量集中的低频信号,然后根据所选低频信号选择相关性较好的高频 部分, 在该部分进行复制, 如高频段信号 i和 j是低频信号选择的复制频带, 然后为除 i和 j以外的零散高频段选择合适的低频段进行复制。  As shown in FIG. 12, it is a schematic diagram of a high frequency matching replication strategy for a low frequency dominant frequency band according to an embodiment of the present invention. Firstly, the low-frequency signal of the energy concentration is selected, and then the high-frequency part with good correlation is selected according to the selected low-frequency signal, and the part is copied, for example, the high-band signals i and j are the copy frequency bands selected by the low-frequency signal, and then the i and The scattered high frequency band other than j selects the appropriate low frequency band for copying.
低频主导频段高频匹配复制策略利用信号的频域谐波特性 ,对低频中的类 基频信号选取不同阶次的高频谐波,适合具有良好谐波特性的语音和音频信号 使用。 根据谐波变换的一般性质, 谐波在高频部分出现的间隔会逐渐下降, 谐 波覆盖的频带范围会逐渐升高,所以现有技术传统的用倍频复制方法进行的谐 波复制会产生高频失真, 而本发明实施例使用最大相关判定方法,可以准确确 定高频谐波位置 ,并且对高频谐波间的零散子带也使用最大相关方法选取用于 复制的低频子带, 可以保证谐波的完整性, 弥补高频谐波变宽的现象。  The high frequency matching replication strategy of the low frequency dominant frequency band utilizes the frequency domain harmonic characteristics of the signal to select high frequency harmonics of different orders for the fundamental frequency signals in the low frequency, and is suitable for speech and audio signals with good harmonic characteristics. According to the general nature of harmonic transformation, the interval of harmonics appearing in the high frequency part will gradually decrease, and the frequency range of harmonic coverage will gradually increase. Therefore, the conventional harmonic copying by the frequency doubling replication method will be generated. High-frequency distortion, and the embodiment of the present invention can accurately determine the position of the high-frequency harmonic using the maximum correlation determination method, and also select the low-frequency sub-band for copying using the maximum correlation method for the scattered sub-band between the high-frequency harmonics. Ensure the integrity of the harmonics and compensate for the phenomenon of high frequency harmonic widening.
本复制策略与现有技术相比,不同之处是:现有技术采用倍频复制方法时, 使用低频子带信号向高频以倍数复制 ,形成的高频谐波包含了不同低频信号的 谐波,破坏了谐波的连续性, 而本复制策略对类基频信号连续的置换到高频谐 波, 保证了谐波连续性, 不会引起高频失真。  Compared with the prior art, the copying strategy has the difference that the prior art adopts the frequency doubling copying method, and uses the low frequency subband signal to copy to the high frequency in multiples, and the formed high frequency harmonics contain harmonics of different low frequency signals. The wave destroys the continuity of the harmonics, and the copying strategy continuously replaces the fundamental frequency signal with the high frequency harmonics to ensure harmonic continuity without causing high frequency distortion.
( 3 )低频频段高频匹配复制策略(策略 3 ):  (3) High frequency matching replication strategy in low frequency band (strategy 3):
该策略把整个低频段看成一个基准频段信号,然后按照最优匹配原则在高 频段的谐波进行选择性复制。  This strategy treats the entire low-band as a reference-band signal and then selectively replicates the harmonics in the high-band according to the optimal matching principle.
如图 13所示, 是本发明实施例低频频段高频匹配复制策略示意图。 高频 段 i和 j是选择出来的与低频信号较为相关的谐波成分, 在该位置将整个低频 段进行复制, 而其间的零散子带仍然用最大相关复制频带选择方法, 来选取低 频子带复制。  As shown in FIG. 13, it is a schematic diagram of a low frequency band high frequency matching replication strategy according to an embodiment of the present invention. The high frequency bands i and j are selected harmonic components more related to the low frequency signal, and the entire low frequency band is copied at this position, and the scattered subband between them still uses the maximum correlation copy band selection method to select the low frequency subband copy. .
这样的谐波选择复制策略适用于谱包络较平稳且高频能量平稳下降,高频 能量较低的音频信号,这样的音频信号一般高频谐波能量随着阶次的升高而成 指数下降, 由于高频能量较小, 可以看成谐波和噪声的混合, 所以可以用整个 低频信号向高频选择性复制,但在低码率时靠近低频带的高频子带复制准确性 非常重要, 应该做细致的相关频带选择。 Such a harmonic selection replication strategy is suitable for an audio signal with a relatively smooth spectral envelope and a high-frequency energy drop, and a low-frequency energy. Such an audio signal generally has an index of high-frequency harmonic energy that increases with an order of magnitude. Decrease, because the high-frequency energy is small, it can be regarded as a mixture of harmonics and noise, so the entire low-frequency signal can be selectively copied to the high-frequency, but the high-frequency sub-band replication accuracy near the low frequency band at a low code rate Very important, you should make detailed selection of relevant bands.
本复制策略与现有技术相比, 不同之处是: 现有技术使用低频信号整块连 续的复制高频信号,而本复制策略把低频看作一个整体后从高频成分中选择与 低频信号最大相关的高频谐波, 在该位置将整个低频段进行复制, 如图 13中 的 i和 j , 允许谐波之间存在过渡频带, 再用最优频带选择方法为过渡频带选 择合适的低频带来复制, 这样可以防止谐波的偏移。  Compared with the prior art, the copying strategy has the following differences: the prior art uses a low-frequency signal to continuously copy high-frequency signals, and the copying strategy considers the low-frequency as a whole and selects and low-frequency signals from the high-frequency components. The most relevant high-frequency harmonics, at which the entire low-frequency band is reproduced, as i and j in Figure 13, allowing a transition band between the harmonics, and then selecting the appropriate low-frequency for the transition band using the optimal band selection method. Brings a copy, which prevents harmonic shifts.
( 4 )频带复制策略(策略 1、 策略 2和策略 3 ) 的扩展策略:  (4) Extended strategies for band replication strategies (Policy 1, Strategy 2, and Strategy 3):
此种扩展策略方法是将之前进行高频复制得到的频率较低的高频子带频 段也用于更高频段的复制。  This method of expansion strategy is to use the lower frequency high frequency subband band previously obtained by high frequency copying for higher frequency band reproduction.
因为低码率情况下低频信号可能没有覆盖一个完整的谐波,所以扩大频带 选择范围,把低频信号与少量临近低频信号的高频子带看作一个整体部分, 然 后再用策略 1、 策略 2或策略 3中的方法确定复制频带。 对加入的少量高频子 带选择最大相关的低频子带复制。进行频带选择检测时, 首先根据编码比特率 和谐波完整性关系来确定扩展频带(即恢复时作为复制源的高频子带)的范围, 并用最大相关准则为每个扩展频带选择用于复制的低频子带,由于扩展频带需 要最高的重建准确性, 所以频带选择时采用最高频带解析度(即以单个子带为 复制频带); 然后把扩展频带与低频子带合起来作为复制源, 再采用策略 1、 策略 2或策略 3来为高频子带选择复制频带。  Because the low frequency signal may not cover a complete harmonic under low bit rate, the band selection range is extended, and the low frequency signal and a small number of high frequency subbands adjacent to the low frequency signal are regarded as an integral part, and then strategy 1 and strategy 2 are used. Or the method in Strategy 3 determines the replication band. The most relevant low frequency sub-band replica is selected for the small number of high frequency sub-bands added. When performing the band selection detection, first, the range of the extended band (ie, the high frequency sub-band as the copy source at the time of recovery) is determined according to the encoding bit rate and the harmonic integrity relationship, and each extended band is selected for copying using the maximum correlation criterion. The low frequency subband, since the extended band requires the highest reconstruction accuracy, the band selection uses the highest band resolution (ie, a single subband is the copy band); then the extended band and the low frequency subband are combined as a copy source, and then Strategy 1, Strategy 2 or Strategy 3 is used to select the replication band for the high frequency subband.
如图 14所示, 是本发明实施例扩展的复制策略示意图。 在复制频带 1重 建以后, 又利用低频带和复制频带 1中的较低频带组合成连续频段, 并用于复 制频带 2及其以上复制频带的信号重建。  As shown in FIG. 14, it is a schematic diagram of an extended replication policy according to an embodiment of the present invention. After the reproduction of the frequency band 1 is re-established, the lower frequency bands of the low frequency band and the copy frequency band 1 are combined into a continuous frequency band, and used for signal reconstruction of the complex frequency band 2 and above.
这种复制策略适合于在低码率时使用 ,因为低码率时由核心编解码器处理 的低频带较短,可能没有涵盖所有的基频泛音, 并且与低频泛音相比中频段的 泛音更接近于高频泛音特性,所以可以在保证用较高的解析度重建较低频率的 高频信号后,将重建后的信号用于更高频带的复制。通过这种复制方法可以更 完整的刻画谐波, 有利于扩大高频重建范围。  This copying strategy is suitable for use at low bitrates because the low frequency band handled by the core codec is shorter at low bitrates, may not cover all fundamental frequency overtones, and is more harmonic in the mid-range than low-frequency overtones. Close to the high frequency overtone characteristics, the reconstructed signal can be used for higher frequency band reproduction after ensuring that the lower frequency high frequency signal is reconstructed with higher resolution. Through this method of copying, harmonics can be more completely depicted, which is conducive to expanding the range of high-frequency reconstruction.
以下详细进一步详细介绍本发明实施例高频重建方法。  The high frequency reconstruction method of the embodiment of the present invention is described in further detail below.
本发明实施例高频重建方法可以有两种,即自适应频带复制方式和固定频 带复制方式: ( 1 ) 自适应频带复制方式: 如图 15 ( a )所示, 是本发明实施例编码端 自适应频带复制方式的结构框图。这种方式利用能量语分析和估计方法检测音 频信号的特性,根据检测结果输出指导信息用于指导选择复制策略,进而指导 最优频带选择。 因为在一定时长内语音和音频信号的特性通常是相同的, 即准 平稳特性, 没有必要重新进行频带选择, 所以引入时变特性检测, 只需要在时 变特性变量大于容限时, 才重新进行频带选择。 There are two types of high frequency reconstruction methods in the embodiments of the present invention, namely, an adaptive band copy mode and a fixed band copy mode: (1) Adaptive band copying mode: As shown in FIG. 15(a), it is a structural block diagram of an encoding end adaptive band copying mode according to an embodiment of the present invention. In this way, the characteristics of the audio signal are detected by the energy language analysis and estimation method, and the guidance information is output according to the detection result for guiding the selection of the replication strategy, thereby guiding the optimal frequency band selection. Since the characteristics of the speech and audio signals are usually the same in a certain period of time, that is, the quasi-stationary characteristic, there is no need to re-band the band selection, so the introduction of the time-varying characteristic detection only needs to re-band the band when the time-varying characteristic variable is greater than the tolerance. select.
( 2 ) 固定频带复制方式: 如图 15 ( b )所示, 是本发明实施例编码端固 定频带复制方式的结构框图。这种方式只要事先按照实际需要选择一种固定的 复制方式, 即确定本发明实施例上述提出的频带复制策略之一(如策略 1、 策 略 2、 策略 3或它们的扩展策略), 在整个音频处理过程中保持不变, 并结合 适当的最优频带选择来实现。 因为固定频带复制方式下, 不需要根据短时特性 分析模块的结果指导频带复制策略的选择, 而是通过设置参数指定复制策略, 所以不需要短时特性分析模块。  (2) Fixed-band copying mode: As shown in Fig. 15 (b), it is a structural block diagram of the fixed-band copying mode of the encoding end of the embodiment of the present invention. In this way, as long as the fixed copy mode is selected in advance according to actual needs, that is, one of the above-mentioned proposed band copy policies (such as policy 1, policy 2, policy 3 or their extended policies) is determined in the entire audio. It remains unchanged during processing and is implemented in conjunction with appropriate optimal frequency band selection. Because the fixed-band copy mode does not need to guide the selection of the band copy strategy according to the result of the short-term characteristic analysis module, the copy policy is specified by setting parameters, so the short-term characteristic analysis module is not needed.
还需要说明的是, 自适应频带复制方式和固定频带复制方式中, 时变特性 检测并不是必须的。  It should also be noted that in the adaptive band copy mode and the fixed band copy mode, time-varying characteristic detection is not necessary.
以下对自适应频带复制方式的高频重建方法进行介绍。  The high frequency reconstruction method of the adaptive band copy method will be described below.
请参阅图 16, 是本发明实施例自适应频带复制方式的高频重建方法流程 图, 包括步骤:  Please refer to FIG. 16, which is a flowchart of a high frequency reconstruction method in an adaptive band replication mode according to an embodiment of the present invention, including the steps:
步骤 1601、 对分析滤波器模块得到的子带信号进行短时特性分析; 步骤 1602、 根据短时特性分析的结果进行频带复制策略选择;  Step 1601: Perform short-term characteristic analysis on the sub-band signal obtained by the analysis filter module; Step 1602: perform a frequency band replication strategy selection according to the result of the short-term characteristic analysis;
步骤 1603、 根据选择的频带复制策略进行最优频带选择;  Step 1603: Perform optimal frequency band selection according to the selected frequency band replication policy.
步骤 1604、 根据最优频带进行频带复制。  Step 1604: Perform frequency band replication according to the optimal frequency band.
以下对各步骤进行具体介绍。  The steps are described in detail below.
步骤 1601、 对分析滤波器模块得到的子带信号进行短时特性分析; 对于原始音频信号 ,通过分析滤波器模块转换为分布在不同频段的子带信 号, 再对子带信号进行短时特性分析。  Step 1601: Perform short-term characteristic analysis on the sub-band signal obtained by the analysis filter module. For the original audio signal, convert the sub-band signal distributed in different frequency bands by the analysis filter module, and perform short-term characteristic analysis on the sub-band signal. .
短时特性分析是为了选择合适的频带复制策略进行的准备工作。音频或语 音信号先进行时频变换后, 然后针对谐波情况、低频部分和高频部分的能量分 布进行分析, 分析的参数结果决定频带复制策略。 短时特性分析的实现算法很多 ,本发明实施例使用其中一种算法但不局限 于此。 Short-term characterization is a preparation for selecting the appropriate band replication strategy. The audio or speech signal is first subjected to time-frequency transform, and then the energy distribution of the harmonic condition, the low frequency part and the high frequency part is analyzed, and the analyzed parameter result determines the band replication strategy. There are many implementation algorithms for short-term characteristic analysis, and embodiments of the present invention use one of the algorithms, but are not limited thereto.
设低频子带样点为 。^ ,/)。 其中"表示低频子带序号, ί≤η , 是高 频处理的第一个子带序号; /表示子带中的样点, 0≤/ < 32。 设高频子带样点 为 ^^ (W), 其中 :代表高频子带, k0≤k≤ke , 是高频处理的结束子带。 Let the low frequency sub-band sample point be. ^ , /). Where "represents the low frequency sub-band number, ί ≤ η, which is the first sub-band number of the high-frequency processing; / represents the sample point in the sub-band, 0 ≤ / < 32. Let the high-frequency sub-band sample be ^^ ( W), where: represents the high frequency subband, k 0 ≤ k ≤ k e , which is the end subband of the high frequency processing.
①、 计算低频部分各个子带的能量, 如下式所示:  1. Calculate the energy of each subband in the low frequency part, as shown in the following equation:
E(") = f 2w(",/) E(") = f 2 . w (", /)
/=0  /=0
②、 计算整个低频部分的能量均值, 如下式所示:
Figure imgf000013_0001
2. Calculate the energy mean of the entire low frequency part, as shown in the following equation:
Figure imgf000013_0001
③、 计算高频部分各个子带的能量, 如下式所示:  3. Calculate the energy of each sub-band in the high-frequency part, as shown in the following equation:
Ε{^ = ^ΧΗ 2 ίφ { Ι) Ε{^ = ^Χ Η 2 ίφ { Ι)
1=0  1=0
④、 计算整个高频部分的能量均值, 如下式所示:4. Calculate the energy average of the entire high frequency part, as shown in the following equation:
- 经过上述计算,短时特性分析结束,得到的分析参数将应用于频带复制策 略选择部分。  - After the above calculation, the short-term characteristic analysis ends and the obtained analysis parameters will be applied to the band copy strategy selection section.
步骤 1602、 根据短时特性分析的结果进行频带复制策略选择;  Step 1602: Perform a frequency band replication policy selection according to a result of short-term characteristic analysis;
上述已经提到四种频带复制策略,根据短时特性分析的结果,从中选择出 一种策略来进行复制。频带复制策略确定后,输出复制策略标志位和复制策略 信息, 来指导后面的最优频带选择。  Four frequency band replication strategies have been mentioned above, and based on the results of the short-term characteristic analysis, a strategy is selected for replication. After the band replication policy is determined, the replication policy flag and the replication policy information are output to guide the subsequent optimal band selection.
请参阅图 17 , 是本发明实施例频带复制策略选择流程图。  Please refer to FIG. 17, which is a flowchart of selecting a frequency band replication policy according to an embodiment of the present invention.
把短时特性分析得到的各低频子带能量值和低频部分的能量均值比较,再 把高频部分能量均值和低频部分能量均值比较。根据比较的情况, 当部分低频 子带能量值远低于均值,另有部分连续低频子带的能量很接近均值或在均值之 上, 则选择策略 2; 如果低频各个子带能量接近,低频部分能量曲线连续平緩, 且高频部分能量均值和低频部分能量均值差距较大, 则选择策略 3; 否则, 选 择策略 1。 Comparing the energy values of the low frequency sub-bands obtained by the short-term characteristic analysis with the energy average values of the low-frequency parts, and comparing the average values of the high-frequency partial energy with the low-frequency partial energy average. According to the comparison, when the energy value of the partial low frequency sub-band is much lower than the mean value, and the energy of some continuous low-frequency sub-bands is close to the mean or above the mean value, then strategy 2 is selected; if the energy of each sub-band of the low frequency is close, the low-frequency part The energy curve is continuous and gentle, and the difference between the average value of the high-frequency part energy and the low-frequency part energy is larger, then choose strategy 3; otherwise, select Choose strategy 1.
策略 2、 策略 3和策略 1所组成的判断流程是策略选择的主体, 而频带复 制扩展策略作为一个辅助策略, 主要是在恢复高频过程中,针对低频频段比较 窄的情况, 拓展用来复制的低频频段的宽度, 改善基频的完整性。 当编码速率 较低,低频子带数目有限,频带复制扩展策略可以使高频部分对低频子带的选 择不会过于受限。它将本来也需要频带复制的若干中高频子带和低频部分捆绑 到一起, 组成新的低频部分, 供大部分高频子带来选择, 提取其相应的复制参 数。 同时, 对于新低频部分中的若干中高频子带, 会从原低频部分中选出和它 们最匹配的低频子带, 对其依次完成复制参数的提取。  The judgment process consisting of Strategy 2, Strategy 3 and Strategy 1 is the main body of policy selection, and the band replication expansion strategy is used as an auxiliary strategy, mainly in the process of recovering high frequency, for the case where the low frequency band is relatively narrow, and the expansion is used for copying. The width of the low frequency band improves the integrity of the fundamental frequency. When the coding rate is low and the number of low frequency subbands is limited, the band replica expansion strategy can make the selection of the low frequency subbands of the high frequency portion not too restrictive. It bundles several mid-high frequency sub-bands and low-frequency parts that would otherwise require band replication to form a new low-frequency part for most of the high-frequency sub-bands to choose and extract their corresponding copy parameters. At the same time, for some of the high-frequency sub-bands in the new low-frequency part, the best-matched low-frequency sub-bands are selected from the original low-frequency parts, and the copy parameters are extracted in turn.
频带复制扩展策略是对策略 2、 策略 3和策略 1的扩展。 当频带复制策略 选择流程中输出 extend— flag标志时, 则使用频带复制扩展策略。 所以, 当有 extend— flag输出时, 选定的策略 2、 策略 3或者策略 1会相应地变成扩展型策 略 2、 扩展型策略 3或者扩展型策略 1。  The Band Replication Extension Policy is an extension of Policy 2, Policy 3, and Policy 1. When the extend-flag flag is output in the band copy policy selection process, the band copy extension policy is used. Therefore, when there is an extend_flag output, the selected policy 2, policy 3 or policy 1 will become an extended policy 2, an extended strategy 3 or an extended strategy 1 accordingly.
图 17中具体流程如下所示:  The specific process in Figure 17 is as follows:
步骤 1701、 时频变换完成, 输入 QMF ( Quadrature Mirror Filter, 正交镜 像滤波器)子带;  Step 1701: The time-frequency transform is completed, and a QMF (Quadature Mirror Filter) sub-band is input;
步骤 1702、 判断输入的子带是否为低编码速率模式, 如果是, 进入步骤 1703 , 如果否, 进入步骤 1705;  Step 1702, determining whether the input sub-band is in a low coding rate mode, if yes, proceeding to step 1703, if no, proceeding to step 1705;
步骤 1703、 扩展参与复制的低频部分的范围, 组成新的低频部分, 进入 步骤 1704;  Step 1703, expanding the range of the low frequency part participating in the copy, forming a new low frequency part, proceeding to step 1704;
步骤 1704、输出标志 extend— flag,用于频带复制扩展策略,进入步骤 1705; 步骤 1705、 判断是否存在能量过低的低频子带, 若否, 进入步骤 1708, 若是, 进入步骤 1706;  Step 1704, the output flag extend_flag, for the band copy expansion strategy, proceeds to step 1705; Step 1705, determines whether there is a low frequency subband with low energy, if not, proceeds to step 1708, and if yes, proceeds to step 1706;
把每个低频子带的能量 和低频部分的能量均值^^进行比较, 如果存 在某个低频子带的能量 满足下式: Ε^ δ Ε^ , 说明低频子带中存在子 带能量陡降的情况,基频能量分布不连续,进入步骤 1706,否则进入步骤 1708。 其中, 的取值范围是 0< 4 < 1 , 该取值范围是对该复制策略对应的波形情况 进行观察得到的经验值, 可以根据要求设置取值。  Comparing the energy of each low-frequency sub-band with the mean value of the low-frequency part, if there is a certain low-frequency sub-band energy that satisfies the following formula: Ε^ δ Ε^ , indicating that there is a sub-band energy drop in the low-frequency sub-band In the event that the fundamental frequency energy distribution is not continuous, proceed to step 1706, otherwise proceed to step 1708. The value range is 0< 4 < 1, and the value range is the empirical value obtained by observing the waveform corresponding to the copy strategy. The value can be set according to requirements.
步骤 1706、 搜索能量较高且连续分布的低频子带并确定选择策略 2; 该步骤主要寻找低频部分能量分布连续的子带区间,作为策略 2的基频部 分。 判断算法如下所示: Step 1706, searching for a low frequency subband with higher energy and continuous distribution and determining a selection strategy 2; This step mainly looks for a subband interval in which the energy distribution of the low frequency portion is continuous as the fundamental frequency portion of the strategy 2. The judgment algorithm is as follows:
如果满足 (")>(¾*^。„, E(n + l)>S2*ELow , ...... , E(n + q-l)>S2*ELow ,If (") >( 3⁄4*^.„, E(n + l)>S 2 *E Low , ...... , E(n + ql)>S 2 *E Low ,
E{n + q)<S2*ELow , (其中?≥1, l≤n<k。, 的取值范围是 0 < < < 1 , 该取值 范围是对该复制策略对应的波形情况进行观察得到的经验值,可以根据要求设 置取值), 则决定采用策略 2, 记录子带序号 n和子带区间数 q; E{n + q)<S 2 *E Low , (where ? ≥1 , l≤n<k,, the value range is 0 <<< 1, the value range is the waveform corresponding to the copy strategy If the empirical value obtained by observation can be set according to requirements, then it is decided to adopt strategy 2, and record the sub-band number n and the sub-band interval number q;
步骤 1707、 输出策略 2对应的标志位 Flag, 并且输出子带序号 n和子带 区间数 q; 需说明的是, 如果同时输出 extend— flag, 则当前策略为扩展型策略 2。  Step 1707: Output the flag flag corresponding to the policy 2, and output the subband sequence number n and the subband interval number q. It should be noted that if the extend_flag is output at the same time, the current policy is the extended policy 2.
步骤 1708、 通过与低频能量均值相比, 判断高频能量均值是否过低, 若 否, 进入步骤 1709, 若是, 进入步骤 1710;  Step 1708, determining whether the high frequency energy average is too low compared with the low frequency energy average, if not, proceeding to step 1709, and if yes, proceeding to step 1710;
如果通过步骤 1705 将每个低频子带能量 和低频能量均值^^比较, 均满足下式: £(")> *Ε^ , 则将分析的重点转移到低频和高频部分的能量关 系。  If the low-frequency sub-band energy and the low-frequency energy mean ^^ are compared by step 1705, the following formula is satisfied: £(")> *Ε^, the emphasis of the analysis is shifted to the energy relationship between the low-frequency and high-frequency parts.
£ 和 进行比较, 其中 的取值范围是 0< <1, 是对相关波形观 察得到的经验值, 可以根据要求设置取值。 Compare the £ and the value, where the value range is 0<<1, which is the empirical value observed for the relevant waveform. The value can be set as required.
当能够满足 £^≤/1* 。 则决定采用策略 3, 进入步骤 1710, 否则 £ >/1*£^时, 决定采用策略 1, 进入步骤 1709; When able to satisfy £ ^ ≤ /1 *. Then decide to adopt strategy 3, proceed to step 1710, otherwise, when £ >/1 * £ ^, decide to adopt strategy 1, and proceed to step 1709;
步骤 1709、 输出策略 1相应的标志位 Flag; 需说明的是, 如果同时输出 extend— flag, 则当前策略为扩展型策略 1。  Step 1709: Output the corresponding flag of the policy 1 Flag; It should be noted that if the extend_flag is output at the same time, the current policy is the extended policy 1.
步骤 1710、 输出策略 3相应的标志位 Flag; 需说明的是, 如果同时输出 extend— flag , 则当前策略为扩展型策略 3。  Step 1710: Output the corresponding flag of the policy 3 Flag; It should be noted that if the extend_flag is output at the same time, the current policy is the extended policy 3.
步骤 1603、 根据选择的频带复制策略进行最优频带选择;  Step 1603: Perform optimal frequency band selection according to the selected frequency band replication policy.
最优频带选择模块以最大相关为标准,灵活的查找用于某参考频带复制的 最优匹配频带,确保频带复制的相关性,使复制后的高频信号不需要过多的调 整, 就逼近原始信号。  The optimal frequency band selection module uses the maximum correlation as a standard to flexibly search for the optimal matching frequency band for copying of a reference frequency band, ensuring the correlation of frequency band replication, so that the copied high frequency signal does not need excessive adjustment, and approaches the original signal.
根据确定的复制策略和复制策略信息(包括相应的初始化频带表)的指导, 选择出最优的高低频信号对应关系。频带复制策略指导最优频带选择, 决定频 带选择是以高频信号选择低频信号,还是以低频信号选择高频谐波, 例如在策 略 1下,最优频带选择为每个高频复制频带信号选取最优的用于复制的低频信 号, 而在策略 2下, 最优频带选择首先为类基频信号选择出可以复制的高频谐 波。 初始化频带表指导最优频带选择的估计频宽, 以及被选频带范围。 According to the determination of the determined replication policy and replication policy information (including the corresponding initialization band table), the optimal high-low frequency signal correspondence is selected. The band replication strategy guides the optimal band selection, determines whether the band selection is to select a low frequency signal with a high frequency signal, or select a high frequency harmonic with a low frequency signal, for example, In the first round, the optimal frequency band is selected to select the best low frequency signal for copying for each high frequency copy band signal, and in strategy 2, the optimal frequency band selection first selects the high frequency that can be copied for the base frequency signal. harmonic. The initialization band table guides the estimated bandwidth of the optimal band selection, as well as the selected band range.
最优频带选择是比较高低频信号相关性,以及高低频信号包络特征相似程 度, 然后根据得到的两个参数综合确定最优的高低频信号匹配关系。 为了避免 比较相关性和包络特征时信号能量幅度带来的计算差异,估计前先对信号按照 初始化频带表范围进行归一化,这样选择匹配信号时着重分析信号特性的相似 程度, 能量差异可以在信号重建时进行调整。  The optimal frequency band selection is to compare the high and low frequency signal correlation and the similarity degree of the high and low frequency signal envelope characteristics, and then determine the optimal high and low frequency signal matching relationship based on the obtained two parameters. In order to avoid the calculation difference caused by the signal energy amplitude when comparing the correlation and the envelope characteristics, the signal is normalized according to the initial band table range before the estimation, so that the matching signal is selected to analyze the similarity of the signal characteristics, and the energy difference can be Make adjustments when the signal is reconstructed.
无论最优频带选择的参考频带是高频信号还是低频信号,最优频带选择所 使用的算法都是相同的, 为了便于说明, 下面以策略 1为例 , 并取代表最大频 率解析度的单个子带为复制频带, 说明最优频带选择的一般算法。  Regardless of whether the reference band selected by the optimal band is a high frequency signal or a low frequency signal, the algorithms used for optimal band selection are the same. For convenience of explanation, the following strategy 1 is taken as an example, and a single subrepresentative representing the maximum frequency resolution is taken. The band is the copy band, indicating the general algorithm for optimal band selection.
请参阅图 18 , 是本发明实施例最优频带选择流程图, 包括步骤: 步骤 1801、 根据频带复制策略信息中的初始化频带表划分复制频带和备 选频带, 按初始化频带表对输入子带信号进行高低频划分;  Referring to FIG. 18, it is a flowchart of an optimal frequency band selection according to an embodiment of the present invention, including the following steps: Step 1801: divide a copy frequency band and an alternate frequency band according to an initialized frequency band table in the frequency band copy policy information, and input an input sub-band signal according to an initialized frequency band table. Perform high and low frequency division;
设复制频带个数用 nb表示,设低频子带样点用 ^ (", 表示,其中 1≤ " < 指低频子带序号, 是高频处理的第一个子带序号, 0≤/ < 32表示子带中的样 点, 设高频子带样点用 ^ ^,Ζ)表示, 其中 指示高频子带, 是高频 处理的结束子带。 Let the number of copy bands be represented by nb, and set the low-frequency sub-band samples with ^ (", where 1 ≤ "< refers to the low-frequency sub-band number, which is the first sub-band number of high-frequency processing, 0 ≤ / < 32 Indicates the sample points in the sub-band, and sets the high-frequency sub-band samples as ^ ^, Ζ ), where indicates the high-frequency sub-band, which is the end sub-band of the high-frequency processing.
步骤 1802、按初始化频带表中复制频带的划分,对高低频信号进行复制频 带长度内的归一化处理, 这里假设复制频带等于变换子带, 归一化处理如下: 丽 (0 ) Xm k! 步骤 1803、计算各高频段或高频带与各可能的低频段或低频带的相关函数; 考虑到样点偏移情况, 为了得到与高频带波形最逼近的低频带, 对低频带 样点做偏移后再计算相关函数, 公式如下: rk m[n] = jXHigh (k,l)XLow (n - m) , m = 0,1,2 „ , A ^ „ , ^ „r ,Step 1802: Perform normalization processing on the high and low frequency signals in the length of the copy frequency band according to the division of the copy frequency band in the initialization band table. Here, it is assumed that the copy frequency band is equal to the transform subband, and the normalization process is as follows: 丽(0) X m k !, step 1803, calculated for each frequency band or a high frequency band with the correlation function may be low frequency or low frequency band; offset considering the case of sample, in order to obtain the best approximation of the waveform of the high-band low frequency band of the low frequency band After the sample is offset, the correlation function is calculated. The formula is as follows: r k m [n] = j X High (k,l)X Low (n - m) , m = 0,1,2 „ , A ^ „ , ^ „ r ,
^ ,其中, 是偏移样点数量, rk ["] 代表作样点偏移后高低频带的相关函数值。 如果时频变换采用复数变换形式, 子带样点是复数值的, 可以取 的实 部来分析, 有
Figure imgf000017_0001
在与某个高频带的偏移相关函数中, 为每个低频带选择相关值最大的
Figure imgf000017_0002
以上计算对所有的高频带 进行, 得到 ["]组成的最大相关矩阵 ax WW , Rmax WW记录了所有高低频带的最大相关值。 步骤 1804、 估计高低频带包络的变化特性, 计算高频带与低频带包络变化 特性差异;
^ , where is the number of offset samples, r k ["] Represents the correlation function value of the high and low frequency bands after the sample offset. If the time-frequency transform uses a complex transform form, the sub-band samples are complex-valued, and the real part can be analyzed.
Figure imgf000017_0001
In the offset correlation function with a certain high frequency band, the correlation value is selected for each low frequency band.
Figure imgf000017_0002
The above calculation is performed on all the high frequency bands, and the maximum correlation matrix ax WW composed of ["] is obtained, and R max WW records the maximum correlation values of all the high and low frequency bands. Step 1804, estimating the variation characteristics of the high and low frequency band envelopes, and calculating the high frequency Band and low band envelope variation characteristics;
包络特性估计方法是, 把复制频带长度内的样点看作一个样本, 计算它的 二阶以内的自相关函数, 然后通过比较高低频带的自相关函数的均方误差,得 到高低频包络特性差异。  The envelope characteristic estimation method is to treat the sample point in the length of the replication frequency band as a sample, calculate its autocorrelation function within the second order, and then obtain the high and low frequency envelope by comparing the mean square error of the autocorrelation function of the high and low frequency bands. Difference in characteristics.
首先, 按复制频带长度计算高低频成分的二阶自相关函数:  First, calculate the second-order autocorrelation function of the high and low frequency components according to the length of the replication band:
Figure imgf000017_0003
,其中, Ή和 分别表示高频 段和低频段的自相关函数, 代表自相关间隔。 然后, 计算高低频子带间的包络差异: e (k, n) = Pl∑(r [l] -
Figure imgf000017_0004
[/]) + A∑ ( [/] - rn 2 [l]
Figure imgf000017_0003
, where Ή and the autocorrelation function representing the high and low bands, respectively, represent the autocorrelation interval. Then, calculate the envelope difference between the high and low frequency subbands: e (k, n) = Pl ∑ (r [l] -
Figure imgf000017_0004
[/]) + A ∑ ( [/] - r n 2 [l]
, 其中 A和 A是两个系数, A + A = 1 , e ^, " )代表高频段 与低频段 "的包络差异。 步骤 1805、 综合比较^ 和 为每个复制频带选择出最优的低频 带复制; 显然 ΓΜ的值越大代表了高低频段的相关性越好, 而 ^,")的值越小, 代表高低频段的包络越相似,要找到最合适高低频段匹配关系, 必须形成统一 的参数进行比较, 所以有如下变换:
Figure imgf000018_0001
其中"和 是权系数, 是形成的频带选择系数。
Figure imgf000018_0002
是数 组 ^]和 W的均值, α = β。 步骤 1806、 生成频带选择表 指示复制所需的最优频带。
, where A and A are two coefficients, A + A = 1 , e ^, " ) represents the envelope difference between the high and low bands. Step 1805, comprehensively comparing ^ and selecting an optimal low frequency for each copy band With copying; obviously, the larger the value of ΓΜ, the better the correlation between high and low frequency bands, and the smaller the value of ^, "), the more similar the envelope representing the high and low frequency bands. To find the most suitable matching relationship between high and low frequency bands, uniformity must be formed. The parameters are compared, so there are the following transformations:
Figure imgf000018_0001
Where "sum is the weight coefficient, which is the formed band selection coefficient.
Figure imgf000018_0002
Is the mean of the array ^] and W, α = β. Step 1806, generating a band selection table indicating an optimal frequency band required for copying.
对每个高频子带 选择出使%(")值最大的", 当选用频带复制扩展策略 时, 参考频带不仅仅是低频子带, 可以是检测频带之前的所有频带。 生成频带 选择表 WK - +1] , 指示复制所需的最优频带。 For each high frequency sub-band, the value that maximizes the %(") value is selected. When the band copy expansion strategy is selected, the reference band is not only the low frequency sub-band, but may be all the bands before the detection band. A band selection table W K - +1] is generated indicating the optimum frequency band required for copying.
步骤 1603根据选择的频带复制策略进行最优频带选择后, 可以一直利用 所述最优频带,也可以进一步进行时变特性检测 ,根据检测结果重新选择频带。  Step 1603: After the optimal frequency band selection is performed according to the selected frequency band replication policy, the optimal frequency band may be used all the time, and the time-varying characteristic detection may be further performed, and the frequency band is reselected according to the detection result.
以下具体介绍时变特性检测:  The following describes the time-varying feature detection:
音频和语音信号一般在一段时长内具有相同的特性(即准平稳特性 ), 所 以在连续的若干帧内可能可以使用相同的高频复制策略,在复制策略不变的情 况下,根据音频信号的时间周期性,在连续几帧内可以使用相同的复制频带选 择表, 不必每帧都做最优频带选择, 频带选择表一旦确定可以连续多帧使用, 这样既可以节省计算量和传输比特率, 又可以保证帧间连续性。 为了判断是否 能沿用上一帧的频带选择表, 引入时变特性检测模块。 时变特性检测的作用是 用于判断当前帧是否可以沿用上一帧的频带选择表,如果检测到相邻帧间的音 频特性差异大于门限, 就刷新频带选择表, 重新选择频带; 否则, 频带选择表 保持不变。 时变特性检测方法是,估计当前帧与前一帧低频信号的音频特性变化, 具 体可以采用包络差异比较方法。 如果包络差异很小, 则根据高低频的相关性, 高频信号的差异也会 4艮小,这时可以沿用前一帧生成的频带选择表; 如果当前 低频信号与前一帧低频信号的包络差异在容限范围内,但是有频率偏移,若频 率偏移大于临界频带的 5%, 则需要重新选择最优频带, 刷新频带选择表, 因 为根据音调失调理论,两组泛音在其所限定的临界频带内的频差在 5% ~ 50%, 则这两组泛音是失调的, 会产生听觉可感知的差异。 Audio and speech signals generally have the same characteristics (ie quasi-stationary characteristics) over a period of time, so the same high-frequency duplication strategy may be used in successive frames, with the same copy strategy, based on the audio signal Time periodicity, the same copy band selection table can be used in consecutive frames, and it is not necessary to perform optimal band selection every frame. Once the band selection table is determined, it can be used continuously for multiple frames, which can save calculation amount and transmission bit rate. It can also guarantee the continuity between frames. In order to judge whether or not the band selection table of the previous frame can be used, a time-varying characteristic detecting module is introduced. The function of time-varying characteristic detection is to determine whether the current frame can use the band selection table of the previous frame. If it is detected that the difference in audio characteristics between adjacent frames is greater than the threshold, the band selection table is refreshed and the frequency band is reselected; otherwise, the frequency band The selection table remains unchanged. The time-varying characteristic detecting method is to estimate the audio characteristic change of the low-frequency signal of the current frame and the previous frame, and the envelope difference comparison method may be specifically used. If the envelope difference is small, according to the correlation between high and low frequencies, the difference of the high frequency signal will be 4艮, and the band selection table generated by the previous frame can be used; if the current low frequency signal and the previous frame low frequency signal The envelope difference is within the tolerance range, but there is a frequency offset. If the frequency offset is greater than 5% of the critical band, then the optimal frequency band needs to be re-selected, and the band selection table is refreshed because the two sets of overtones are in accordance with the pitch shift theory. The frequency difference within the defined critical band is between 5% and 50%, and the two sets of overtones are dysfunctional, producing auditory perceptible differences.
请参阅图 19, 是本发明实施例时变特性检测的算法流程图, 包括步骤: 步骤 1901、 计算当前帧与前一帧的低频子带能量均方误差 A™ ;
Figure imgf000019_0001
,其中 代表当前帧各低频子带能量, £»代 表前一帧低频子带能量, 步骤 1902、 判断低频子带能量均方误差 是否小于判决门限为 Thr,若 是, 进入步骤 1903 , 若否, 进入步骤 1906; 步骤 1903、 估计能量集中的低频频段的偏移频率 ; 选取能量最高的频带或频段, 其中心频率设为 ,
Figure imgf000019_0002
, 其中 和 Λ 分别代表最高能量频带的下界和上界, 计算频率偏移 Δ/ = _ ;
Referring to FIG. 19, it is a flowchart of an algorithm for time-varying characteristic detection according to an embodiment of the present invention, including the following steps: Step 1901: Calculate a low-frequency sub-band energy mean square error ATM of a current frame and a previous frame;
Figure imgf000019_0001
, which represents the low frequency subband energy of the current frame, £ » represents the low frequency subband energy of the previous frame, step 1902, determines whether the mean square error of the low frequency subband energy is less than the threshold of the decision is Thr, and if yes, proceeds to step 1903, and if not, enters Step 1906; Step 1903, estimating an offset frequency of the low frequency band in which the energy is concentrated; selecting a frequency band or a frequency band with the highest energy, and setting a center frequency thereof,
Figure imgf000019_0002
, where Λ represents the lower and upper bounds of the highest energy band, respectively, and calculates the frequency offset Δ / = _ ;
步骤 1904、 判断偏移频率 Δ /是否小于当前临界频带带宽的 5%, 若是, 进 入步骤 1905 , 若否, 进入步骤 1906; Step 1904, determining whether the offset frequency Δ / is less than 5% of the current critical band bandwidth, and if so, proceeds to step 1905, and if not, proceeds to step 1906;
步骤 1905、 沿用前一帧生成的频带选择表;  Step 1905: A band selection table generated by using a previous frame is used;
步骤 1906、 重新进行最优频带选择。  Step 1906: Perform optimal band selection again.
需要说明的是,如果是固定频带复制方式时,跟上述自适应频带复制方式 的过程相比, 不需要对分析滤波器模块得到的子带信号进行短时特性分析,也 不需要根据短时特性分析的结果进行频带复制策略选择,而最优频带选择流程 和时变特性检测的流程是相同的。  It should be noted that, in the case of the fixed-band copying mode, the short-term characteristic analysis of the sub-band signals obtained by the analysis filter module is not required, and the short-term characteristics are not required. The result of the analysis is selected for the band replication strategy, and the flow of the optimal band selection process and the time-varying characteristic detection is the same.
步骤 1604、 根据最优频带进行频带复制。 解码端得到最优频带后, 可根据最优频带进行频带复制。 请参阅图 20, 是 本发明实施例解码端根据编码端参数信息进行高频重建的示意图。 Step 1604: Perform frequency band replication according to the optimal frequency band. After the decoding end obtains the optimal frequency band, the frequency band copying can be performed according to the optimal frequency band. Referring to FIG. 20, it is a schematic diagram of the high frequency reconstruction performed by the decoding end according to the parameter information of the encoding end according to the embodiment of the present invention.
本发明实施例解码端与现有技术解码端相比,大部分模块的功能和协作关 系并没有改变, 只是对高频生成器模块的"高频子带复制策略"做了修改。 在输 入高频生成器的 SBR码流中, 根据高频重建指导参数信息添加了三个参数, 即"新算法使用标志"、 "频带选择表更替标志"和"频带选择表"。  Compared with the prior art decoding end, the decoding end of the embodiment of the present invention does not change the function and cooperation relationship of most modules, but only the "high frequency sub-band copying strategy" of the high frequency generator module is modified. In the SBR code stream input to the high frequency generator, three parameters are added according to the high frequency reconstruction guide parameter information, namely "new algorithm use flag", "band selection table replacement flag" and "band selection table".
"频带选择表"是比较关键的参数, 它记录每帧信号恢复高频子带时, 高频 子带和低频子带的复制对应关系。  The "band selection table" is a relatively important parameter, which records the copy correspondence between the high frequency sub-band and the low frequency sub-band when the high frequency sub-band is recovered for each frame signal.
"新算法使用标志"决定了在解码时, 使用编码端的新算法指导高频重建, 还是使用标准 SBR方法指导高频重建。 这里所说的新算法, 指的是前面描述 的本发明实施例在编码端最终确定高频重建参数的过程中所使用的算法。如果 标志位为" 1", 则按照新算法重建高频; 如果标志位为 "0", 则按照标准 SBR 方法重建高频。 通过这样设置, 可以为新算法和标准 SBR方法在后续研究中 的兼容预留接口。  The "new algorithm uses flags" determines whether high-frequency reconstruction is guided by a new algorithm at the encoding end or high-frequency reconstruction using a standard SBR method. The new algorithm referred to herein refers to the algorithm used in the process of finally determining the high frequency reconstruction parameters at the encoding end by the embodiment of the present invention described above. If the flag is "1", the high frequency is reconstructed according to the new algorithm; if the flag is "0", the high frequency is reconstructed according to the standard SBR method. With this setup, the interface can be reserved for compatibility between the new algorithm and the standard SBR method in subsequent studies.
"频带选择表更替标志"决定了恢复高频子带时, 当前信号如何获得"频带 选择表"。 如果标志位为 "0", 则当前信号直接延用前一帧信号的高低频子带对 应关系, 指导高频复制; 如果标志位为 "1", 则根据 SBR码流中读取的刷新后 的"频带选择表"参数, 完成高频复制。 "频带选择表更替标志"的主要作用是为 了减少向解码端传输的数据量。 需要说明的是, 当"频带选择表更替标志 "为 0 时, 传输的 SBR码流中将不包括"频带选择表"参数。  The "band selection table replacement flag" determines how the current signal obtains the "band selection table" when the high frequency sub-band is restored. If the flag bit is "0", the current signal directly extends the high-low frequency sub-band correspondence of the previous frame signal to guide the high-frequency copy; if the flag bit is "1", the refresh is read according to the read in the SBR code stream. The "band selection table" parameter completes the high frequency copy. The main role of the "band selection table replacement flag" is to reduce the amount of data transmitted to the decoder. It should be noted that when the "band selection table replacement flag" is 0, the "band selection table" parameter will not be included in the transmitted SBR code stream.
以下详细介绍解码端接收的码流信息。  The code stream information received by the decoder is described in detail below.
请参阅表 1 , 描述的是头文件 HeadFile数据结构, 用于在解码端开始工作 时候, 进行初始化设置。  See Table 1, which describes the header file HeadFile data structure, which is used to initialize the settings when the decoder starts working.
变量名 说明 数据类型 sbr_amp_res 量化阶梯, 不编码时作为指导信息 Char (字符 型)  Variable name Description Data type sbr_amp_res Quantization ladder, as a guide when not encoding Char (character type)
sbr— start— frequency SBR域起始 QMF子带选择指针, 在频带选择 Int (整型) 表 StartTable口中选择对应元素  Sbr_start—frequency SBR field start QMF subband selection pointer, select corresponding element in the band selection Int (integer) table StartTable
sbr— stop— frequency SBR域终止 QMF子带选择指针, 在频带选择 Int 表 StopTable口中选择对应元素 Sbr_ stop—frequency SBR field terminates QMF subband selection pointer, selects Int in frequency band Select the corresponding element in the table StopTable
xover— band 主频代表指针 Int  Xover—band main frequency representative pointer Int
header— extra— 1 指示是否有额外信息, 取值 0或 1 Int  Header— extra— 1 Indicates if there is additional information, value 0 or 1 Int
header— extra— 2 指示是否有额外信息, 取值 0或 1 Int  Header— extra— 2 Indicates if there is additional information, value 0 or 1 Int
freqScale SBR带包含的 QMF子带最大数量, 用于计算 Int  FreqScale SBR band contains the maximum number of QMF subbands used to calculate Int
主频带选择表  Main band selection table
alterScale 最高频 SBR带包含 QMF子带的可增数量, 用 Int  alterScale The highest frequency SBR band contains the number of QMF subbands that can be increased, using Int
于计算主频带选择表  Computing main band selection table
noise— bands 噪声带数量 Int  Noise—the number of noise bands Int
sbr— limiter— band 计算增益时限制频带的数量, 用于计算限制频 Int  Sbr_limiter— band The number of bands that are limited when calculating the gain, used to calculate the limit frequency Int
带选择表  Belt selection table
sbr—limiter— gains P艮制频带内增益的最大值 Int  Sbr_limiter—gains P The maximum gain in the band Int
sbr— interpol— freq 指示是否应用频带改写, 默认值 1 Int  Sbr — interpol — freq Indicates whether band rewriting is applied, default value 1 Int
sbr— smoothing— length 是否应用滤波 Int  Sbr— smoothing— length Whether to apply filtering Int
表 1 HeaderFile数据结构表  Table 1 HeaderFile data structure table
ENV— DATA是描述每帧 SBR信息的数据结构。 参数"新算法使用标志"、 "频带选择表更替标志"和"频带选择表"都被添加进描述 SBR 信息的 ENV DATA结构中。  ENV—DATA is the data structure that describes the SBR information for each frame. The parameters "New Algorithm Usage Flag", "Band Selection Table Replacement Flag" and "Band Selection Table" are added to the ENV DATA structure describing the SBR information.
定义一个结构体变量专门用来存储 "新算法使用标志"、 "频带选择表更替 标志"和"频带选择表,,三个参数信息, 具体如下:  Define a structure variable specifically for storing "new algorithm usage flag", "band selection table replacement flag" and "band selection table, three parameter information, as follows:
Struct INDVEC  Struct INDVEC
{ char flag— 1;  { char flag — 1;
char flag— 2;  Char flag— 2;
char FreTable[28];  Char FreTable[28];
} index Vector;  } index Vector;
"新算法使用标志"、 "频带选择表更替标志"两个参数的值均为" 0"或者 "1" , 所以设置两个字符型变量" flag— 1"、 "flag— 2"来分别描述"新算法使用标 志"、 "频带选择表更替标志"。  The values of the "new algorithm use flag" and "band selection table replacement flag" are both "0" or "1", so two character variables "flag-1" and "flag-2" are set to describe respectively. "New algorithm uses flag", "band selection table replacement flag".
"频带选择表"存储的是需要恢复的高频子带和低频子带的对应序号,存储 在数组 FreTable[28]中。 其中, 不同的编码模式, 需要恢复的高频子带数目各 不相同。 最高比特率的编码模式, 需要恢复的高频子带是 28个。 随着编码比 特率降低, 需要恢复的高频子带数目也相应减少。 The "band selection table" stores the corresponding serial numbers of the high frequency sub-band and the low frequency sub-band that need to be restored, and is stored. In the array FreTable[28]. Among them, different encoding modes have different numbers of high frequency sub-bands to be recovered. For the highest bit rate encoding mode, the number of high frequency sub-bands that need to be recovered is 28. As the encoding bit rate decreases, the number of high frequency sub-bands that need to be recovered also decreases accordingly.
结构体变量 indexVector在 ENV— DATA结构中的位置如表 2所示。  The position of the structure variable indexVector in the ENV-DATA structure is shown in Table 2.
Figure imgf000022_0001
Figure imgf000022_0001
表 2 ENV— DATA结构定义表  Table 2 ENV-DATA Structure Definition Table
以下介绍高频生成器模块的算法流程, 请参阅图 21 , 是本发明实施例解 码端高频生成器模块算法流程图, 包括步骤:  The algorithm flow of the high frequency generator module is described below. Referring to FIG. 21, it is a flow chart of the decoder high frequency generator module algorithm according to the embodiment of the present invention, including the steps:
步骤 2101、 接收"新算法使用标志"、 "频带选择表更替标志"和"频带选择 表"; Step 2101: Receive "new algorithm use flag", "band selection table replacement flag", and "band selection" Table";
步骤 2102、 判断"新算法使用标志", 如果为 0, 进入步骤 2103 , 如果为 1, 进入步骤 2104;  Step 2102, judging "new algorithm use flag", if it is 0, proceeds to step 2103, if it is 1, proceeds to step 2104;
步骤 2103、 按标准 SBR方法解码;  Step 2103, decoding according to a standard SBR method;
步骤 2104、 判断"频带选择表更替标志", 如果为 0, 进入步骤 2105 , 如果 为 1, 进入步骤 2106;  Step 2104, judging "band selection table replacement flag", if it is 0, proceeds to step 2105, if it is 1, proceeds to step 2106;
步骤 2105、 如果标志位为 "0", 则当前信号直接延用前一帧信号的高低频 子带对应关系, 指导高频复制;  Step 2105: If the flag bit is "0", the current signal directly extends the high-low frequency sub-band correspondence of the signal of the previous frame, and guides the high-frequency copy;
需要注意的是,每帧信号在确定了高低频子带对应关系后,会把当前帧的 频带选择表备份在緩存中。如果需要, 下一帧信号会对緩存中的频带选择表进 行调用。  It should be noted that after determining the high-low frequency sub-band correspondence of each frame signal, the band selection table of the current frame is backed up in the buffer. The next frame signal is called on the band selection table in the buffer, if needed.
步骤 2106、 如果标志位为 "1", 则根据 SBR码流中读取的 "频带选择表 "参 数, 指导高频复制;  Step 2106: If the flag bit is "1", the high frequency copy is guided according to the "band selection table" parameter read in the SBR code stream;
步骤 2107、 完成初步高频复制。  Step 2107: Perform preliminary high frequency copying.
经过初步复制的高频子带, 随后会进入包络调整、添加谐波成分等模块进 行处理, 最终完成高频复制。  After the initial replication of the high-frequency sub-band, it will enter the module of envelope adjustment and harmonic component addition to complete the high-frequency replication.
通过本发明实施例的高频重建方法, 更准确的实现高频信号的重建。请参 阅图 22, 是本发明实施例方法对原始音频信号恢复后的能量波形图; 图 23 , 是 本发明实施例方法对原始音频信号恢复后的能量三维图。通过这两个图与现有 技术的图进行对比, 可以发现本发明实施例的高频重建效果比现有技术更好。 因此,本发明实施例方法能够利用少数低频子带的信息就更准确的重建高频信 号, 也有利于压缩音频信息, 可以大幅提高音频和语音编码器的压缩效率, 同 时提高音频质量;有效地降低比特率音频和语音信号压缩编码带来的失真和噪 声。 并且针对不同音频特性, 提出了多种对应频带复制策略, 可以对各种音频 和语音信号提供适应的高频重建方法, 提高音频和语音信号处理的灵活性。  The high frequency reconstruction method of the embodiment of the invention more accurately realizes the reconstruction of the high frequency signal. Referring to FIG. 22, it is an energy waveform diagram of the original audio signal after the method of the embodiment of the present invention is restored; FIG. 23 is a three-dimensional diagram of the energy of the original audio signal after the method of the embodiment of the present invention is restored. By comparing these two figures with the prior art figures, it can be found that the high frequency reconstruction effect of the embodiment of the present invention is better than the prior art. Therefore, the method of the embodiment of the present invention can reconstruct the high frequency signal more accurately by using the information of a few low frequency subbands, and is also beneficial for compressing the audio information, can greatly improve the compression efficiency of the audio and speech encoder, and improve the audio quality; Reduce distortion and noise caused by bit rate audio and speech signal compression coding. And for different audio characteristics, a variety of corresponding frequency band replication strategies are proposed, which can provide adaptive high-frequency reconstruction methods for various audio and voice signals, and improve the flexibility of audio and voice signal processing.
上述内容伴细介绍了本发明实施例高频重建方法,相应的,本发明实施例 提供一种编码模块和解码模块。  The above description is accompanied by a detailed description of the high frequency reconstruction method of the embodiment of the present invention. Correspondingly, the embodiment of the present invention provides an encoding module and a decoding module.
请参阅图 24, 是本发明实施例编码模块一结构示意图。  Referring to FIG. 24, it is a schematic structural diagram of an encoding module according to an embodiment of the present invention.
编码模块包括: 分析滤波器模块 241、 频带选择模块 242。 分析滤波器模块 241 , 用于对音频或语音信号进行滤波处理得到低频子带 和高频子带。 The encoding module includes: an analysis filter module 241 and a band selection module 242. The analysis filter module 241 is configured to filter the audio or voice signal to obtain a low frequency sub-band and a high frequency sub-band.
频带选择模块 242, 用于确定频带复制策略, 根据所述确定的频带复制策 略获取所述低频子带与高频子带的相关性,参考所述确定的相关性为高频子带 选择低频子带作为复制频带,例如为高频子带选择相关性大的低频子带作为最 优复制频带, 并输出包括选择频带的对应关系的高频重建参数信息。  a frequency band selection module 242, configured to determine a frequency band replication policy, obtain a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-frequency for the high frequency sub-band with reference to the determined correlation As the copy frequency band, for example, a low frequency sub-band having a high correlation is selected as a high frequency sub-band as an optimum copy band, and high-frequency reconstruction parameter information including a correspondence relationship of the selected band is output.
所述编码模块进一步包括: 短时特性分析模块 243 , 对所述音频或语音信 号进行短时特性分析。  The encoding module further includes: a short-term characteristic analysis module 243 for performing short-term characteristic analysis on the audio or voice signal.
所述频带选择模块 242包括: 复制策略选择模块 2421、 最优频带选择模 块 2422。  The band selection module 242 includes: a replication policy selection module 2421, an optimal band selection module 2422.
复制策略选择模块 2421 , 用于根据短时特性分析模块 243分析的结果对 应选择不同频带复制策略;  The replication policy selection module 2421 is configured to select a different frequency band replication policy according to the analysis result of the short-term characteristic analysis module 243;
最优频带选择模块 2422, 用于根据所述确定的频带复制策略获取所述低 频子带与高频子带的相关性,参考所述确定的相关性为高频子带选择低频子带 作为复制频带, 例如为高频子带选择相关性大的低频子带作为最优复制频带, 并输出包括选择频带的对应关系的高频重建参数信息。  An optimal frequency band selection module 2422, configured to acquire a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-band as a high frequency sub-band as a reference with reference to the determined correlation The frequency band, for example, a high frequency sub-band selects a low frequency sub-band having a large correlation as an optimal copy frequency band, and outputs high frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands.
所述编码模块进一步包括: 时变特性检测模块 244, 用于对滤波处理后的 音频或语音信号进行时变特性检测; 相应的, 最优频带选择模块 2422还根据 时变特性检测模块 244检测的结果选择低频子带, 例如选择最优复制频带。  The encoding module further includes: a time-varying characteristic detecting module 244, configured to perform time-varying characteristic detection on the filtered audio or voice signal; correspondingly, the optimal frequency band selecting module 2422 is further detected according to the time-varying characteristic detecting module 244. As a result, the low frequency sub-band is selected, for example, the optimal copy band is selected.
所述短时特性分析模块 243 对所述音频或语音信号进行短时特性分析具 体为: 计算音频或语音信号的低频部分能量均值、 高频部分能量均值、低频部 分各子带能量值、 高频部分各子带能量值; 所述复制策略选择模块 2421根据 短时特性分析模块 243分析的结果对应选择不同频带复制策略具体为:将低频 部分各子带能量值与低频部分能量均值比较;若存在低频部分的部分子带的子 带能量值小于或等于低频部分能量均值的第一加权值,选择的策略为: 选取能 量集中的低频子带,在选取的与所述能量集中的低频子带相关性大的高频段复 制所述低频子带;若低频部分子带的子带能量值均大于低频部分能量均值的第 一加权值,进一步将高频部分能量均值与低频部分能量均值的第二加权值进行 比较; 如果高频部分能量均值小于或等于低频部分能量均值的第二加权值,选 择的策略为: 选取整个低频子带,在选取的与整个低频子带相关性大的高频段 复制所述低频子带;如果高频部分能量均值大于低频部分能量均值的第二加权 值, 选择的策略为: 将高频分为多个复制频带, 为每个复制频带选择相关性大 的低频子带进行复制。 The short-term characteristic analysis module 243 performs short-term characteristic analysis on the audio or voice signal, specifically: calculating a low-frequency partial energy average of the audio or voice signal, a high-frequency partial energy average, a low-frequency partial sub-band energy value, and a high frequency. Part of each sub-band energy value; the replication strategy selection module 2421 correspondingly selects different frequency band replication strategies according to the analysis result of the short-term characteristic analysis module 243, specifically: comparing the energy values of the sub-bands in the low-frequency part with the average value of the low-frequency parts; The sub-band energy value of the partial sub-band of the low-frequency part is less than or equal to the first weighting value of the low-frequency part energy average, and the selected strategy is: selecting the low-frequency sub-band of the energy concentration, and selecting the selected low-frequency sub-band of the energy concentration The high frequency band of the large portion replicates the low frequency sub-band; if the sub-band energy value of the low frequency partial sub-band is greater than the first weighting value of the low-frequency partial energy mean, the second weighting of the high-frequency partial energy mean and the low-frequency partial energy mean is further performed. The value is compared; if the high-frequency partial energy mean is less than or equal to the second weight of the low-frequency partial energy mean Election The selected strategy is: selecting the entire low frequency sub-band, and copying the low frequency sub-band in the selected high frequency band having a large correlation with the entire low frequency sub-band; if the high-frequency partial energy average is greater than the second weighting value of the low-frequency partial energy average, selecting The strategy is as follows: The high frequency is divided into multiple copy frequency bands, and the low frequency sub-bands with large correlation are selected for each copy frequency band for copying.
所述复制策略选择模块 2421选择的策略进一步包括: 复制时进一步将临 近低频子带的高频子带与所述选取的低频子带一起作为复制源 ,所述临近低频 子带的高频子带选取与其相关性大的低频子带复制 ,这样就相当于上述各策略 的扩展策略。  The strategy selected by the replication policy selection module 2421 further includes: further copying, by copying, a high frequency sub-band adjacent to the low frequency sub-band together with the selected low frequency sub-band as a copy source, and the high frequency sub-band of the adjacent low frequency sub-band Select the low-frequency sub-band copy with high correlation, which is equivalent to the expansion strategy of each of the above strategies.
请参阅图 25 , 是本发明实施例编码模块二结构示意图。  Referring to FIG. 25, it is a schematic structural diagram of an encoding module 2 according to an embodiment of the present invention.
编码模块包括: 分析滤波器模块 241、 频带选择模块 242。  The encoding module includes: an analysis filter module 241, and a band selection module 242.
分析滤波器模块 241 , 用于接收音频或语音信号后, 进行滤波处理得到低 频子带和高频子带。  The analysis filter module 241 is configured to receive an audio or voice signal and perform filtering processing to obtain a low frequency subband and a high frequency subband.
频带选择模块 242, 用于确定频带复制策略, 根据所述确定的频带复制策 略获取所述低频子带与高频子带的相关性,参考所述确定的相关性为高频子带 选择低频子带作为复制频带,例如为高频子带选择相关性大的低频子带作为最 优频带, 并输出包括选择频带的对应关系的高频重建参数信息。  a frequency band selection module 242, configured to determine a frequency band replication policy, obtain a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-frequency for the high frequency sub-band with reference to the determined correlation As the copy frequency band, for example, a low frequency sub-band having a high correlation is selected as a high frequency sub-band as an optimum frequency band, and high-frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands is output.
所述频带选择模块 242包括: 复制策略设定模块 2423、 最优频带选择模 块 2422。  The band selection module 242 includes: a copy policy setting module 2423, an optimal band selection module 2422.
复制策略设定模块 2423 , 用于根据预先设置的参数确定唯一频带复制策 略。 频带复制策略为对应图 24所描述的其中一种策略或其对应的扩展策略。  The copy policy setting module 2423 is configured to determine a unique band copying policy according to the preset parameters. The band replication policy is one of the policies described in connection with FIG. 24 or its corresponding extension policy.
最优频带选择模块 2422, 用于根据所述确定的频带复制策略获取所述低 频子带与高频子带的相关性,参考所述确定的相关性为高频子带选择低频子带 作为复制频带, 例如为高频子带选择相关性大的低频子带作为最优频带, 并输 出包括选择频带的对应关系的高频重建参数信息。  An optimal frequency band selection module 2422, configured to acquire a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-band as a high frequency sub-band as a reference with reference to the determined correlation The frequency band, for example, a high frequency sub-band selects a low frequency sub-band having a large correlation as an optimum frequency band, and outputs high frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands.
所述编码模块进一步包括: 时变特性检测模块 244, 用于对滤波处理后的 音频或语音信号进行时变特性检测;相应的, 最优频带选择模块还根据时变特 性检测模块检测的结果选择低频子带 , 例如选择最优复制频带。  The encoding module further includes: a time-varying characteristic detecting module 244, configured to perform time-varying characteristic detection on the filtered audio or voice signal; correspondingly, the optimal frequency band selecting module further selects according to a result detected by the time-varying characteristic detecting module The low frequency subband, for example, selects the optimal copy band.
请参阅图 26, 是本发明实施例解码模块结构示意图。  Referring to FIG. 26, it is a schematic structural diagram of a decoding module according to an embodiment of the present invention.
解码模块, 含有高频生成器模块 261 , 所述高频生成器模块 261包括: 接 收单元 2611、 重建单元 2612。 The decoding module includes a high frequency generator module 261, and the high frequency generator module 261 includes: The receiving unit 2611 and the reconstructing unit 2612.
接收单元 2611 , 用于接收包括选择频带的对应关系的高频重建参数信息, 所述选择频带的对应关系具体为参考相关性确定的高频子带与其相对应的低 频子带的关系 , 例如相关性大的低频子带和高频子带之间对应。  The receiving unit 2611 is configured to receive high-frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands, where the correspondence relationship of the selected frequency bands is specifically a relationship between the high-frequency sub-band determined by the reference correlation and the corresponding low-frequency sub-band, for example, correlation The large low frequency sub-band corresponds to the high frequency sub-band.
重建单元 2612, 用于在高频段根据所述包括选择频带的对应关系的高频 重建参数信息将低频子带复制作为高频子带。  The reconstruction unit 2612 is configured to copy the low frequency sub-band as a high frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands in the high frequency band.
所述接收单元 2611接收的所述参数信息还包括新算法使用标志的值和频 带选择表更替标志; 所述重建单元 2612根据所述新算法使用标志的值确定复 制过程使用的算法,根据所述频带选择表更替标志确定复制过程使用的频带选 择表,在高频段根据所述确定的算法和频带选择表将所述对应关系中的低频子 带复制作为高频子带。  The parameter information received by the receiving unit 2611 further includes a value of the new algorithm using the flag and a band selection table replacement flag; the reconstruction unit 2612 determines an algorithm used by the copying process according to the value of the new algorithm using the flag, according to the The band selection table replacement flag determines a band selection table used in the copying process, and the low frequency sub-band in the correspondence is copied as a high frequency sub-band in the high band according to the determined algorithm and the band selection table.
综上所述,本发明实施例方案充分考虑低频子带和高频子带的相关性,通 过根据确定的频带复制策略获取所述低频子带与高频子带的相关性,参考所述 确定的相关性为高频子带选择低频子带作为复制频带,并输出包括选择频带的 对应关系的高频重建参数信息,从而可以根据这些高频重建参数信息进行高频 重建。从高频重建后得到的各子带信号的能量波形图和能量三维图可以明显的 发现,本发明实施例重建后的高频部分与原始音频信号较为接近,效果比现有 技术方案更好, 因此本发明实施例技术方案能够更准确的进行高频重建。  In summary, the solution of the embodiment of the present invention fully considers the correlation between the low frequency subband and the high frequency subband, and obtains the correlation between the low frequency subband and the high frequency subband according to the determined frequency band replication strategy, and refers to the determination. The correlation is that the high frequency sub-band selects the low frequency sub-band as the copy frequency band, and outputs the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands, so that the high frequency reconstruction can be performed based on the high frequency reconstruction parameter information. The energy waveform and the energy three-dimensional image of each sub-band signal obtained after the high-frequency reconstruction can be clearly found that the reconstructed high-frequency portion of the embodiment of the present invention is closer to the original audio signal, and the effect is better than the prior art solution. Therefore, the technical solution of the embodiment of the present invention can perform high frequency reconstruction more accurately.
进一步的,本发明实施例方案可以包括自适应频带复制方式和固定频带复 制方式两种方式, 具有灵活的频带选择功能。  Further, the solution of the embodiment of the present invention may include an adaptive band copy mode and a fixed band copy mode, and has a flexible band selection function.
进一步的 ,本发明实施例技术方案还可以增加对音频或语音信号时变特性 检测, 根据检测结果进行调整。  Further, the technical solution of the embodiment of the present invention may further increase the time-varying characteristic detection of the audio or voice signal, and perform adjustment according to the detection result.
以上对本发明实施例所提供的一种高频重建方法、编码模块和解码模块进 行了详细介绍, 对于本领域的一般技术人员, 依据本发明实施例的思想, 在具 体实施方式及应用范围上均会有改变之处, 综上所述,本说明书内容不应理解 为对本发明的限制。  The high-frequency reconstruction method, the coding module, and the decoding module provided by the embodiments of the present invention are described in detail. For those skilled in the art, according to the embodiments of the present invention, the specific implementation manners and application ranges are There are variations, and the description should not be construed as limiting the invention.

Claims

权 利 要 求 Rights request
1、 一种高频重建方法, 其特征在于, 包括:  A high frequency reconstruction method, characterized in that it comprises:
对音频或语音信号进行滤波处理得到低频子带和高频子带;  Filtering the audio or speech signal to obtain a low frequency sub-band and a high frequency sub-band;
确定频带复制策略;  Determine the band replication strategy;
根据所述确定的频带复制策略获取所述低频子带与高频子带的相关性,参 考所述确定的相关性为高频子带选择低频子带作为复制频带,并输出包括选择 频带的对应关系的高频重建参数信息。  Acquiring a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, selecting a low frequency sub-band as a copy frequency band for the high frequency sub-band with reference to the determined correlation, and outputting a correspondence including the selected frequency band High frequency reconstruction parameter information of the relationship.
2、 根据权利要求 1所述的高频重建方法, 其特征在于:  2. The high frequency reconstruction method according to claim 1, wherein:
所述确定频带复制策略具体为:  The determining the frequency band replication strategy is specifically:
根据预先设置的参数确定唯一频带复制策略。  A unique band replication policy is determined based on pre-set parameters.
3、 根据权利要求 1所述的高频重建方法, 其特征在于:  3. The high frequency reconstruction method according to claim 1, wherein:
所述确定频带复制策略具体为:  The determining the frequency band replication strategy is specifically:
对所述音频或语音信号进行短时特性分析;  Performing short-term characteristic analysis on the audio or voice signal;
根据进行短时特性分析的结果对应选择不同频带复制策略。  The different frequency band replication strategies are selected according to the results of the short-term characteristic analysis.
4、 根据权利要求 2或 3所述的高频重建方法, 其特征在于:  4. The high frequency reconstruction method according to claim 2 or 3, characterized in that:
所述对音频或语音信号进行滤波处理后进一步包括:  After filtering the audio or voice signal, the method further includes:
对滤波处理后的音频或语音信号进行时变特性检测;所述为高频子带选择 低频子带作为复制频带时是进一步结合进行时变特性检测的结果选择低频子 带。  Time-varying characteristic detection is performed on the filtered audio or speech signal; when the low-frequency sub-band is selected as the replica frequency band for the high-frequency sub-band, the low-frequency sub-band is selected as a result of further combining the time-varying characteristic detection.
5、 根据权利要求 3所述的高频重建方法, 其特征在于:  5. The high frequency reconstruction method according to claim 3, wherein:
所述对所述音频或语音信号进行短时特性分析,根据进行短时特性分析的 结果对应选择不同频带复制策略具体为:  Performing short-term characteristic analysis on the audio or voice signal, and selecting a different frequency band replication strategy according to the result of performing short-term characteristic analysis is specifically:
确定音频或语音信号的低频部分能量均值、 高频部分能量均值、低频部分 各子带能量值、 高频部分各子带能量值;  Determining the average value of the low-frequency part of the audio or speech signal, the mean value of the high-frequency part, the energy value of each sub-band in the low-frequency part, and the energy value of each sub-band in the high-frequency part;
将低频部分各子带能量值与低频部分能量均值的第一加权值比较; 若存在低频部分的部分子带的子带能量值小于或等于低频部分能量均值 的第一加权值, 选择的策略为: 选取能量集中的低频子带, 在选取的与所述能 量集中的低频子带相关性大的高频段复制所述低频子带;  Comparing the energy values of the subbands in the low frequency portion with the first weighting values of the energy average of the low frequency portions; if the subband energy value of the partial subbands in the low frequency portion is less than or equal to the first weighting value of the energy average of the low frequency portion, the selected strategy is : selecting a low frequency sub-band of the energy concentration, and copying the low frequency sub-band in a selected high frequency band having a high correlation with the low frequency sub-band of the energy concentration;
若低频部分子带的子带能量值均大于低频部分能量均值的第一加权值 ,进 一步将高频部分能量均值与低频部分能量均值的第二加权值进行比较; 如果高频部分能量均值小于或等于低频部分能量均值的第二加权值,选择 的策略为: 选取整个低频子带,在选取的与整个低频子带相关性大的高频段复 制所述低频子带; If the subband energy value of the low frequency partial subband is greater than the first weighting value of the low frequency partial energy average, The first step is to compare the high-frequency partial energy average with the second weighted value of the low-frequency partial energy average; if the high-frequency partial energy average is less than or equal to the second weighted value of the low-frequency partial energy mean, the selected strategy is: selecting the entire low-frequency sub-band, Copying the low frequency sub-band at a selected high frequency band having a large correlation with the entire low frequency sub-band;
如果高频部分能量均值大于低频部分能量均值的第二加权值,选择的策略 为: 将高频分为多个复制频带, 为每个复制频带选择相关性大的低频子带进行 复制。  If the high-frequency partial energy average is greater than the second weighted value of the low-frequency partial energy mean, the selected strategy is: dividing the high-frequency into a plurality of replicated frequency bands, and selecting a highly correlated low-frequency sub-band for each replicated frequency band for copying.
6、 根据权利要求 5所述的高频重建方法, 其特征在于:  6. The high frequency reconstruction method according to claim 5, wherein:
所述选择的策略进一步包括:  The selected strategy further includes:
复制时进一步将临近低频子带的高频子带与所述选取的低频子带一起作 为复制源, 所述临近低频子带的高频子带选取与其相关性大的低频子带复制。  When copying, the high frequency sub-band adjacent to the low frequency sub-band is further used as a copy source together with the selected low frequency sub-band, and the high frequency sub-band adjacent to the low frequency sub-band is selected to be copied with the low frequency sub-band having a large correlation.
7、 根据权利要求 4所述的高频重建方法, 其特征在于:  7. The high frequency reconstruction method according to claim 4, wherein:
所述对滤波处理后的音频或语音信号进行时变特性检测 ,所述为高频子带 选择低频子带作为复制频带时是进一步结合进行时变特性检测的结果选择低 频子带具体为:  Performing time-varying characteristic detection on the filtered audio or speech signal, wherein the high-frequency sub-band selects the low-frequency sub-band as the copy frequency band is further combined with the time-varying characteristic detection result, and the low-frequency sub-band is selected as follows:
确定当前帧与前一帧的低频子带能量均方误差,若所述能量均方误差小于 判决门限, 并且能量集中的低频子带的偏移频率大于预设值, 则重新进行复制 频带选择, 否则沿用之前确定的复制频带。  Determining the mean square error of the low frequency subband energy of the current frame and the previous frame. If the mean square error of the energy is less than the decision threshold, and the offset frequency of the low frequency subband of the energy concentration is greater than a preset value, then the copy band selection is performed again. Otherwise the previously determined copy band is used.
8、 根据权利要求 1所述的高频重建方法, 其特征在于:  8. The high frequency reconstruction method according to claim 1, wherein:
所述根据确定的频带复制策略获取所述低频子带与高频子带的相关性,参 考所述确定的相关性为高频子带选择低频子带作为复制频带具体为:  And obtaining, according to the determined frequency band replication strategy, the correlation between the low frequency sub-band and the high frequency sub-band, and referring to the determined correlation, the high frequency sub-band selecting the low frequency sub-band as the replication frequency band is specifically:
确定高频子带和低频子带的相关函数值;  Determining a correlation function value of the high frequency sub-band and the low frequency sub-band;
根据高频子带和低频子带的自相关函数值确定高频子带和低频子带间的 包络差异值;  Determining an envelope difference value between the high frequency sub-band and the low frequency sub-band according to the autocorrelation function values of the high frequency sub-band and the low frequency sub-band;
根据所述相关函数值和所述包络差异值为高频子带选择相关性大的低频 子带作为复制频带。  The low frequency sub-band having a high correlation of the high frequency sub-band is selected as the copy frequency band according to the correlation function value and the envelope difference value.
9、 一种高频重建方法, 其特征在于:  9. A high frequency reconstruction method, characterized in that:
接收包括选择频带的对应关系的高频重建参数信息 ,所述选择频带的对应 关系具体为参考相关性确定的高频子带与其相对应的低频子带的关系; 在高频段根据所述包括选择频带的对应关系的高频重建参数信息将低频 子带复制作为高频子带。 Receiving high-frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands, where the correspondence relationship of the selected frequency bands is specifically a relationship between the high-frequency sub-bands determined by the reference correlation and the corresponding low-frequency sub-bands; The low frequency sub-band is copied as a high frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands at the high frequency band.
10、 根据权利要求 9所述的高频重建方法, 其特征在于:  10. The high frequency reconstruction method according to claim 9, wherein:
所述参数信息还包括新算法使用标志和频带选择表更替标志;  The parameter information further includes a new algorithm using the flag and a band selection table replacement flag;
所述在高频段根据所述包括选择频带的对应关系的高频重建参数信息将 低频子带复制作为高频子带具体为:  The replicating the low frequency subband as the high frequency subband according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands in the high frequency band is specifically:
根据所述新算法使用标志的值确定复制过程使用的算法,  Determining the algorithm used by the replication process based on the value of the flag using the new algorithm,
根据所述频带选择表更替标志的值确定复制过程使用的频带选择表; 在高频段根据所述确定的算法和频带选择表将所述对应关系中的低频子 带复制作为高频子带。  A band selection table used by the copying process is determined based on the value of the band selection table replacement flag; the low frequency sub-band in the correspondence is copied as a high frequency sub-band in the high band according to the determined algorithm and the band selection table.
11、 一种编码模块, 其特征在于, 包括:  11. An encoding module, comprising:
分析滤波器模块,用于对音频或语音信号进行滤波处理得到低频子带和高 频子带;  An analysis filter module for filtering an audio or voice signal to obtain a low frequency sub-band and a high frequency sub-band;
频带选择模块, 用于确定频带复制策略,根据所述确定的频带复制策略获 取所述低频子带与高频子带的相关性,参考所述确定的相关性为高频子带选择 低频子带作为复制频带, 并输出包括选择频带的对应关系的高频重建参数信 息。  a frequency band selection module, configured to determine a frequency band replication policy, obtain a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication policy, and select a low frequency sub-band for the high frequency sub-band with reference to the determined correlation As the copy band, the high frequency reconstruction parameter information including the correspondence of the selected bands is output.
12、 根据权利要求 11所述的编码模块, 其特征在于, 所述频带选择模块 包括:  The coding module according to claim 11, wherein the frequency band selection module comprises:
复制策略设定模块, 用于根据预先设置的参数确定唯一频带复制策略; 最优频带选择模块,用于根据所述确定的频带复制策略获取所述低频子带 与高频子带的相关性,参考所述确定的相关性为高频子带选择低频子带作为复 制频带, 并输出包括选择频带的对应关系的高频重建参数信息。  a replication policy setting module, configured to determine a unique frequency band replication policy according to a preset parameter; an optimal frequency band selection module, configured to acquire, according to the determined frequency band replication policy, a correlation between the low frequency subband and a high frequency subband, Referring to the determined correlation, a low frequency sub-band is selected as a copy frequency band for the high frequency sub-band, and high frequency reconstruction parameter information including a correspondence relationship of the selected frequency band is output.
13、 根据权利要求 11所述的编码模块, 其特征在于:  13. The encoding module of claim 11 wherein:
所述编码模块进一步包括:  The encoding module further includes:
短时特性分析模块, 对所述音频或语音信号进行短时特性分析; 所述频带选择模块包括:  a short-term characteristic analysis module, performing short-term characteristic analysis on the audio or voice signal; the frequency band selection module includes:
复制策略选择模块,用于根据短时特性分析模块分析的结果对应选择不同 频带复制策略; 最优频带选择模块,用于根据所述确定的频带复制策略获取所述低频子带 与高频子带的相关性,参考所述确定的相关性为高频子带选择相低频子带作为 复制频带, 并输出包括选择频带的对应关系的高频重建参数信息。 a replication policy selection module, configured to select a different frequency band replication strategy according to a result of the short-term characteristic analysis module analysis; An optimal frequency band selection module, configured to acquire a correlation between the low frequency sub-band and a high frequency sub-band according to the determined frequency band replication strategy, and refer to the determined correlation as a high frequency sub-band to select a phase low frequency sub-band as a replica a frequency band, and outputs high frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands.
14、根据权利要求 12或 13所述的编码模块, 其特征在于, 所述编码模块 进一步包括:  The encoding module according to claim 12 or 13, wherein the encoding module further comprises:
时变特性检测模块,用于对滤波处理后的音频或语音信号进行时变特性检 测;  a time-varying characteristic detecting module, configured to perform time-varying characteristic detection on the filtered audio or voice signal;
所述最优频带选择模块还根据时变特性检测模块检测的结果选择低频子 带。  The optimal band selection module also selects a low frequency sub-band based on the result of the time varying characteristic detection module detection.
15、 根据权利要求 13所述的编码模块, 其特征在于,  15. The encoding module of claim 13 wherein:
所述短时特性分析模块对所述音频或语音信号进行短时特性分析具体为: 确定音频或语音信号的低频部分能量均值、 高频部分能量均值、低频部分各子 带能量值、 高频部分各子带能量值;  The short-term characteristic analysis module performs short-term characteristic analysis on the audio or voice signal, specifically: determining an average energy value of a low-frequency part of the audio or voice signal, an average value of the energy of the high-frequency part, an energy value of each sub-band of the low-frequency part, and a high-frequency part. Energy value of each sub-band;
所述复制策略选择模块根据短时特性分析模块分析的结果对应选择不同 频带复制策略具体为:将低频部分各子带能量值与低频部分能量均值的第一加 权值比较;  The copying strategy selection module correspondingly selects different frequency band replication strategies according to the results of the short-term characteristic analysis module analysis: comparing the energy values of the sub-bands of the low-frequency part with the first weighting values of the energy average of the low-frequency parts;
若存在低频部分的部分子带的子带能量值小于或等于低频部分能量均值 的第一加权值, 选择的策略为: 选取能量集中的低频子带, 在选取的与所述能 量集中的低频子带相关性大的高频段复制所述低频子带;  If the subband energy value of the partial subband of the low frequency portion is less than or equal to the first weighting value of the low frequency partial energy mean, the selected strategy is: selecting the low frequency subband of the energy concentration, and selecting the low frequency subgroup with the energy concentration Copying the low frequency sub-band with a high frequency band with high correlation;
若低频部分子带的子带能量值均大于低频部分能量均值的第一加权值,进 一步将高频部分能量均值与低频部分能量均值的第二加权值进行比较;  If the subband energy value of the low frequency partial subband is greater than the first weighting value of the low frequency partial energy mean, the higher frequency partial energy mean is further compared with the second weighted value of the low frequency partial energy mean;
如果高频部分能量均值小于或等于低频部分能量均值的第二加权值,选择 的策略为: 选取整个低频子带,在选取的与整个低频子带相关性大的高频段复 制所述低频子带;  If the high-frequency partial energy mean is less than or equal to the second weighted value of the low-frequency partial energy mean, the selected strategy is: selecting the entire low-frequency sub-band, and copying the low-frequency sub-band in the selected high frequency band having a large correlation with the entire low-frequency sub-band ;
如果高频部分能量均值大于低频部分能量均值的第二加权值,选择的策略 为: 将高频分为多个复制频带, 为每个复制频带选择相关性大的低频子带进行 复制。  If the high-frequency partial energy average is greater than the second weighted value of the low-frequency partial energy mean, the selected strategy is: dividing the high-frequency into a plurality of replicated frequency bands, and selecting a highly correlated low-frequency sub-band for each replicated frequency band for copying.
16、 根据权利要求 14所述的编码模块, 其特征在于,  16. The encoding module of claim 14 wherein:
所述复制策略选择模块选择的策略进一步包括:复制时进一步将临近低频 子带的高频子带与所述选取的低频子带一起作为复制源,所述临近低频子带的 高频子带选取与其相关性大的低频子带复制。 The policy selected by the replication policy selection module further includes: further moving to the low frequency when copying The high frequency sub-band of the sub-band is used as a copy source together with the selected low-frequency sub-band, and the high-frequency sub-band of the adjacent low-frequency sub-band is selected to be copied with a low-frequency sub-band having a large correlation.
17、 一种解码模块, 含有高频生成器模块, 其特征在于, 所述高频生成器 模块包括:  17. A decoding module, comprising a high frequency generator module, wherein the high frequency generator module comprises:
接收单元, 用于接收包括选择频带的对应关系的高频重建参数信息, 所述 选择频带的对应关系具体为参考相关性确定的高频子带与其相对应的低频子 带的关系 ;  a receiving unit, configured to receive high frequency reconstruction parameter information including a correspondence relationship of the selected frequency bands, where the corresponding relationship of the selected frequency bands is specifically a relationship between the high frequency sub-band determined by the reference correlation and the corresponding low frequency sub-band;
重建单元,用于在高频段根据所述包括选择频带的对应关系的高频重建参 数信息将低频子带复制作为高频子带。  And a reconstruction unit configured to copy the low frequency sub-band as a high frequency sub-band according to the high frequency reconstruction parameter information including the correspondence relationship of the selected frequency bands in the high frequency band.
18、 根据权利要求 17所述的解码模块, 其特征在于:  18. The decoding module of claim 17, wherein:
所述接收单元接收的所述参数信息还包括新算法使用标志和频带选择表 更替标志;  The parameter information received by the receiving unit further includes a new algorithm using a flag and a band selection table replacement flag;
所述重建单元根据所述新算法使用标志的值确定复制过程使用的算法,根 据所述频带选择表更替标志的值确定复制过程使用的频带选择表,在高频段根 据所述确定的算法和频带选择表将所述对应关系中的低频子带复制作为高频 子带。  The reconstruction unit determines an algorithm used by the copying process according to the value of the new algorithm using the flag, and determines a frequency band selection table used by the copying process according to the value of the frequency band selection table replacement flag, according to the determined algorithm and frequency band in the high frequency band. The selection table copies the low frequency sub-bands in the correspondence as high frequency sub-bands.
PCT/CN2008/073728 2007-12-27 2008-12-25 Method for high frequency band replication, coder and decoder thereof WO2009089728A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200710305087.3 2007-12-27
CN 200710305087 CN101471072B (en) 2007-12-27 2007-12-27 High-frequency reconstruction method, encoding device and decoding module

Publications (1)

Publication Number Publication Date
WO2009089728A1 true WO2009089728A1 (en) 2009-07-23

Family

ID=40828487

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2008/073728 WO2009089728A1 (en) 2007-12-27 2008-12-25 Method for high frequency band replication, coder and decoder thereof

Country Status (2)

Country Link
CN (1) CN101471072B (en)
WO (1) WO2009089728A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9117440B2 (en) 2011-05-19 2015-08-25 Dolby International Ab Method, apparatus, and medium for detecting frequency extension coding in the coding history of an audio signal
CN113299313A (en) * 2021-01-28 2021-08-24 维沃移动通信有限公司 Audio processing method and device and electronic equipment

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2655085T3 (en) 2010-03-09 2018-02-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Magnitude response and improved time alignment in bandwidth extension based on a phase vocoder for audio signals
CN103038819B (en) 2010-03-09 2015-02-18 弗兰霍菲尔运输应用研究公司 Apparatus and method for processing an audio signal using patch border alignment
WO2011110496A1 (en) 2010-03-09 2011-09-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for handling transient sound events in audio signals when changing the replay speed or pitch
US9384749B2 (en) * 2011-09-09 2016-07-05 Panasonic Intellectual Property Corporation Of America Encoding device, decoding device, encoding method and decoding method
ES2568640T3 (en) * 2012-02-23 2016-05-03 Dolby International Ab Procedures and systems to efficiently recover high frequency audio content
ES2836194T3 (en) * 2013-06-11 2021-06-24 Fraunhofer Ges Forschung Device and procedure for bandwidth extension for acoustic signals
CN105513601A (en) * 2016-01-27 2016-04-20 武汉大学 Method and device for frequency band reproduction in audio coding bandwidth extension
CN107221334B (en) * 2016-11-01 2020-12-29 武汉大学深圳研究院 Audio bandwidth extension method and extension device
CN106507113B (en) * 2016-11-28 2019-03-29 河海大学 One kind three describes lattice vector quantization prediction wing coding/decoding method
TWI752166B (en) 2017-03-23 2022-01-11 瑞典商都比國際公司 Backward-compatible integration of harmonic transposer for high frequency reconstruction of audio signals
CN108489596B (en) * 2018-03-20 2020-04-21 南京凯奥思数据技术有限公司 Continuous scanning laser quick vibration measuring method and system thereof
CN108682413B (en) * 2018-04-24 2020-09-29 上海师范大学 Emotion persuasion system based on voice conversion
CN109036457B (en) * 2018-09-10 2021-10-08 广州酷狗计算机科技有限公司 Method and apparatus for restoring audio signal
CN110556122B (en) * 2019-09-18 2024-01-19 腾讯科技(深圳)有限公司 Band expansion method, device, electronic equipment and computer readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1377499A (en) * 1999-10-01 2002-10-30 编码技术瑞典股份公司 Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching
CN1527995A (en) * 2001-11-14 2004-09-08 ���µ�����ҵ��ʽ���� Encoding device and decoding device
CN1784020A (en) * 2004-12-01 2006-06-07 三星电子株式会社 Apparatus, method,and medium for processing audio signal using correlation between bands

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020128839A1 (en) * 2001-01-12 2002-09-12 Ulf Lindgren Speech bandwidth extension
DE60230856D1 (en) * 2001-07-13 2009-03-05 Panasonic Corp AUDIO SIGNAL DECODING DEVICE AND AUDIO SIGNAL CODING DEVICE

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1377499A (en) * 1999-10-01 2002-10-30 编码技术瑞典股份公司 Efficient spectral envelope coding using variable time/frequency resolution and time/frequency switching
CN1527995A (en) * 2001-11-14 2004-09-08 ���µ�����ҵ��ʽ���� Encoding device and decoding device
CN1784020A (en) * 2004-12-01 2006-06-07 三星电子株式会社 Apparatus, method,and medium for processing audio signal using correlation between bands

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9117440B2 (en) 2011-05-19 2015-08-25 Dolby International Ab Method, apparatus, and medium for detecting frequency extension coding in the coding history of an audio signal
CN113299313A (en) * 2021-01-28 2021-08-24 维沃移动通信有限公司 Audio processing method and device and electronic equipment
CN113299313B (en) * 2021-01-28 2024-03-26 维沃移动通信有限公司 Audio processing method and device and electronic equipment

Also Published As

Publication number Publication date
CN101471072A (en) 2009-07-01
CN101471072B (en) 2012-01-25

Similar Documents

Publication Publication Date Title
WO2009089728A1 (en) Method for high frequency band replication, coder and decoder thereof
US10522168B2 (en) Audio signal synthesizer and audio signal encoder
JP6363683B2 (en) Method and apparatus for high frequency domain encoding and decoding
US6708145B1 (en) Enhancing perceptual performance of sbr and related hfr coding methods by adaptive noise-floor addition and noise substitution limiting
US8639500B2 (en) Method, medium, and apparatus with bandwidth extension encoding and/or decoding
US8447621B2 (en) Methods for improving high frequency reconstruction
US20140372108A1 (en) Method and apparatus for encoding and decoding high frequency signal
KR102343135B1 (en) Improved subband block bas -ed harmonic transposition
KR101376098B1 (en) Method and apparatus for bandwidth extension decoding
US9280978B2 (en) Packet loss concealment for bandwidth extension of speech signals
JP7297368B2 (en) Frequency band extension method, apparatus, electronic device and computer program
US20090319278A1 (en) Efficient coding of overcomplete representations of audio using the modulated complex lapped transform (mclt)
CN109247069B (en) Encoding for reconstructing phase information by using structure tensor on audio spectrogram
CN113470667A (en) Voice signal coding and decoding method and device, electronic equipment and storage medium
US20210287687A1 (en) Audio processor and method for generating a frequency enhanced audio signal using pulse processing
RU2409874C2 (en) Audio signal compression
Ryu et al. Effective high frequency regeneration based on sinusoidal modeling for MPEG-4 HE-AAC
Lin et al. Adaptive bandwidth extension of low bitrate compressed audio based on spectral correlation
Liu et al. Blind bandwidth extension of audio signals based on harmonic mapping in phase space

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08870878

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08870878

Country of ref document: EP

Kind code of ref document: A1