WO2021139757A1 - 一种音频编解码方法和音频编解码设备 - Google Patents

一种音频编解码方法和音频编解码设备 Download PDF

Info

Publication number
WO2021139757A1
WO2021139757A1 PCT/CN2021/070831 CN2021070831W WO2021139757A1 WO 2021139757 A1 WO2021139757 A1 WO 2021139757A1 CN 2021070831 W CN2021070831 W CN 2021070831W WO 2021139757 A1 WO2021139757 A1 WO 2021139757A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
current frame
enhancement layer
frequency band
layer
Prior art date
Application number
PCT/CN2021/070831
Other languages
English (en)
French (fr)
Inventor
王宾
夏丙寅
王喆
周建同
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to KR1020227025669A priority Critical patent/KR20220117332A/ko
Priority to JP2022542238A priority patent/JP7481457B2/ja
Priority to EP21738625.9A priority patent/EP4071756B1/en
Publication of WO2021139757A1 publication Critical patent/WO2021139757A1/zh
Priority to US17/857,725 priority patent/US20220335962A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Definitions

  • This application relates to the technical field of audio signal coding and decoding, and in particular to an audio coding and decoding method and audio coding and decoding equipment.
  • the user's demand for audio services is getting higher and higher, which will require constant updating of audio codec equipment. While meeting users' demands for new audio services, it is also necessary to ensure that they are fully compatible with old audio codec equipment so that the old audio codec equipment can still provide audio services.
  • One of the more critical links is that the new audio codec equipment can be compatible with the old audio codec equipment.
  • the embodiments of the present application provide an audio codec method and audio codec device, which are used to implement compatibility between a new codec device and an old codec device, and can improve the coding and decoding efficiency of audio signals.
  • an embodiment of the present application provides an audio encoding method, the method includes: acquiring a current frame of an audio signal, the current frame including: a high-band signal and a low-band signal; according to the high-band signal and The low-band signal obtains the compatibility layer coding parameter of the current frame; the enhancement layer coding parameter of the current frame is obtained according to the high-band signal; the compatibility layer coding parameter and the enhancement layer coding parameter are coded Stream multiplexing to get the coded stream.
  • the entire frequency domain range of the audio signal can be encoded in the compatibility layer, while only the high frequency domain range of the audio signal is encoded in the enhancement layer.
  • the compatibility layer can be implemented using old audio coding equipment, while the enhancement layer and compatibility layer can be implemented using new audio coding equipment. Therefore, in this embodiment of the present application, the compatibility between the new audio coding equipment and the old audio coding equipment is realized. According to the device type of the audio coding device itself, you can choose to encode only in the compatibility layer, or to encode in the compatibility layer and the enhancement layer at the same time. The embodiment of this application does not need to add a transcoding module for the old audio coding device, so it saves The cost of upgrading audio coding equipment is eliminated, and the coding efficiency of audio signals can be improved.
  • the obtaining the enhancement layer coding parameters of the current frame according to the high-band signal includes: obtaining signal type information of the high-band signal of the current frame; When the signal type information of the high frequency band signal of the frame indicates the preset signal type, the high frequency band signal of the current frame is encoded to obtain the enhancement layer coding parameters of the current frame.
  • the signal type information of the high-band signal of the current frame is acquired, and the signal type information may include multiple signal classification results according to the divided signal type.
  • the signal type information of the high frequency band signal of the current frame indicates the preset signal type
  • the high frequency band signal of the current frame is encoded to obtain the enhancement layer coding parameters of the current frame.
  • the audio signal can be divided into N preset signal types, N coding modes can be set in the enhancement layer, and a corresponding enhancement layer coding mode can be executed for each preset signal type, so that it can be used for different signal types.
  • the corresponding enhancement layer coding mode improves the coding efficiency of the audio signal.
  • the preset signal type includes at least one of the following: a harmonic signal type, a tone signal type, a white noise-like signal type, a transient signal type, or a fricative signal type.
  • a harmonic signal type for the high-band signal of the current frame.
  • the signal type of the high-band signal of the current frame can be a harmonic signal type, that is, the high-band signal of the current frame is a harmonic signal. Therefore, the enhancement layer coding mode 1 can be used to encode the harmonic signal in the enhancement layer.
  • the enhancement layer coding mode 2 can be used in the enhancement layer to encode the tone signal.
  • the signal type of the high-band signal of the current frame can be a white-noise-like signal type, that is, the high-band signal of the current frame includes a white-noise-like signal, so the enhancement layer coding mode 3 can be used in the enhancement layer.
  • the signal is encoded.
  • the enhancement layer coding mode 4 can be used in the enhancement layer to encode the transient signal .
  • the signal type of the high-band signal of the current frame can be the fricative signal type, that is, the high-frequency signal of the current frame includes the fricative signal, so the enhanced layer coding mode 5 can be used in the enhancement layer to encode the fricative signal.
  • a corresponding enhancement layer coding mode can be executed for each of the foregoing preset signal types, so that corresponding enhancement layer coding modes are adopted for different signal types, thereby improving the coding efficiency of audio signals.
  • the enhancement layer coding parameter of the current frame further includes: signal type information of the high frequency band signal of the current frame.
  • the enhancement layer encoding parameters generated after encoding the high-band signal of the current frame in the enhancement layer also include the signal type information of the high-band signal of the current frame. Therefore, when the code stream is multiplexed, the generated code The code stream can carry the signal type information of the high-band signal of the current frame, so that the signal type information can also be used in the decoding component to decode according to different preset signal types in the enhancement layer, so that the enhancement layer signal can be used To process part of the spectrum processed by the compatibility layer to achieve the purpose of improving the performance of the final output signal.
  • the obtaining the enhancement layer coding parameters of the current frame according to the high-band signal includes: obtaining compatible layer coding frequency band information; and determining the current frame according to the compatible layer coding frequency band information
  • the frequency band signal to be encoded in the high frequency band signal of the frame; the frequency band signal to be encoded is encoded to obtain the enhancement layer encoding parameter.
  • the compatibility layer encoding frequency band information indicates the frequency band information of the audio signal encoded in the compatibility layer, that is, the compatibility layer encoding frequency band information can determine which frequency band or bands in the compatibility layer are encoded by the compatibility layer.
  • the compatibility layer encoding frequency band information output by the compatibility layer can be used to guide the encoding processing of the enhancement layer at the encoding end, so that the encoding in the enhancement layer and the encoding in the compatibility layer can complement each other and improve the audio in the enhancement layer. Signal coding efficiency.
  • an embodiment of the present application provides an audio decoding method, the method includes: obtaining an encoded bitstream; demultiplexing the encoded bitstream to obtain the compatible layer encoding parameters of the current frame of the audio signal And the enhancement layer coding parameters of the current frame; the compatibility layer signal of the current frame is obtained according to the compatibility layer coding parameter, and the compatibility layer signal includes: the first high-frequency band signal of the current frame and the current The first low-band signal of the frame; the enhancement layer signal of the current frame is obtained according to the enhancement layer coding parameter; the first high-frequency band of the current frame is obtained according to the enhancement layer coding parameter or the enhancement layer signal of the current frame The signal is adapted to obtain the second high frequency band signal of the current frame; according to the enhancement layer signal of the current frame, the second high frequency band signal of the current frame, and the first low frequency signal of the current frame With the signal, the audio output signal of the current frame is obtained.
  • the entire frequency domain range of the audio signal can be decoded in the compatibility layer, while only the high frequency domain range of the audio signal is decoded in the enhancement layer.
  • the compatibility layer can be implemented using old audio decoding equipment, and the enhancement layer and compatibility layer can be implemented using new audio decoding equipment. Therefore, in this embodiment of the application, the compatibility between the new audio decoding equipment and the old audio decoding equipment is realized. According to the device type of the audio decoding device itself, you can choose to decode only at the compatibility layer, or at the same time the compatibility layer and the enhancement layer.
  • the embodiment of this application does not need to add a transcoding module for the old audio decoding device, so it saves The cost of upgrading audio decoding equipment is eliminated, and the decoding efficiency of audio signals can be improved.
  • the obtaining the enhancement layer signal of the current frame according to the enhancement layer coding parameter includes: obtaining signal type information according to the enhancement layer coding parameter of the current frame; The preset signal type indicated by the information decodes the enhancement layer coding parameters of the current frame to obtain the enhancement layer signal of the current frame.
  • the encoded bitstream can carry the signal type information of the audio signal.
  • the signal type information of the enhancement layer encoding parameters of the current frame can be obtained.
  • the enhancement layer coding parameters of the current frame are decoded according to the preset signal type indicated by the signal type information to obtain the enhancement layer signal of the current frame.
  • the audio signal can be divided into N preset signal types, and the enhancement layer can be set with N kinds of decoding modes, a corresponding enhancement layer decoding mode can be executed for each preset signal type, so that corresponding enhancement layer decoding modes are adopted for different signal types, thereby improving the decoding efficiency of audio signals.
  • the decoding component uses the signal type information to select a suitable enhancement layer decoding process, so that the enhancement layer signal can be used to process part of the spectrum processed by the compatibility layer to achieve the purpose of improving the performance of the final output signal.
  • the first high-frequency band signal of the current frame is adapted to obtain the first high-frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame.
  • Two high frequency band signals including: obtaining compatibility layer high frequency band adjustment parameters according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame; using the compatibility layer high frequency
  • the band adjustment parameter performs adaptation processing on the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame.
  • the enhancement layer coding parameters or the enhancement layer signal and the first high frequency band signal of the compatibility layer can be used to obtain the compatibility layer high frequency band adjustment parameters, which may be referred to as the compatibility layer high frequency band adjustment parameters in the subsequent embodiments.
  • the adjustment parameter is an adjustment parameter used to adjust the high frequency part of the compatibility layer signal.
  • the compatibility layer high frequency band adjustment parameter can be obtained by using the enhancement layer signal of the current frame and the first high frequency band signal of the current frame, where the enhancement layer signal of the current frame and the first high frequency band signal of the current frame are both It is a high-frequency audio signal.
  • An adjustment parameter can be calculated from the enhancement layer signal of the current frame and the first high-frequency signal of the current frame, and the first high-frequency signal of the current frame is adapted through the adjustment parameter. , To get the second high frequency band signal of the current frame.
  • the obtaining the compatibility layer high-band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high-band signal of the current frame includes: obtaining all The enhancement layer coding parameter of the current frame or the envelope information corresponding to the enhancement layer signal, and the envelope information of the first high frequency band signal of the current frame is acquired; according to the enhancement layer coding parameter or the envelope information corresponding to the enhancement layer signal Obtaining the compatibility layer high-band adjustment parameter by using the envelope information and the envelope information of the first high-band signal.
  • the output information of the compatibility layer can be directly analyzed from the compatibility layer. This output information and the enhancement layer signal are jointly calculated to obtain the high-frequency spectrum adjustment parameters of the compatibility layer signal.
  • This adjustment parameter is used to improve the compatibility layer signal.
  • the frequency band signal is adjusted and combined with the output signal of the enhancement layer to obtain the final output signal.
  • the calculation of the adjustment parameter can be implemented in multiple ways.
  • the adjustment parameter can be calculated by using the enhancement layer coding parameter or the envelope information corresponding to the enhancement layer signal and the envelope information of the first high frequency band signal, where the enhancement layer coding parameter corresponds to
  • the envelope information of may be the envelope information of the high-band signal calculated according to the enhancement layer coding parameters, or the envelope information corresponding to the enhancement layer signal may be the amplitude of the enhancement layer signal, the envelope information of the first high-band signal
  • the information can be the amplitude of the high-band signal in the compatibility layer signal.
  • Using the enhancement layer coding parameters or the envelope information corresponding to the enhancement layer signal and the envelope information of the first high-band signal can calculate the compatibility layer high-band adjustment parameter.
  • the first high-frequency band signal of the current frame is adapted to obtain the first high-frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame.
  • Two high-band signals including: selecting the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signals of the current frame according to a preset high-band spectrum selection rule; The spectrum signal is combined with the first high-frequency signal of the current frame to obtain the second high-frequency signal of the current frame.
  • a high-band spectrum selection rule can be set in advance, and the high-band spectrum selection rule can be used to indicate the selection of a high-band spectrum signal from the enhancement layer signal. For example, the high-band spectrum selection rule specifies the selected one.
  • multiple frequency bands, or high-band spectrum selection rules indicate the frequency bands to be selected from the enhancement layer signal.
  • the spectrum signal is combined with the high-frequency spectrum signal of the enhancement layer and the first high-frequency signal of the current frame to obtain the second high-frequency signal of the current frame.
  • part of the high-frequency signal can be selected from the enhancement layer signal to be combined with the first high-frequency signal in the compatibility layer, which can be generated in the compatibility layer
  • the second high-band signal therefore, the embodiment of the present application can obtain a higher-band signal of a better compatibility layer, thereby realizing output of a better audio output signal, and improving the performance of the audio output signal.
  • the selecting the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to a preset high-band spectrum selection rule includes: acquiring the The compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the first high frequency band signal of the current frame; it is determined that the signal corresponding to the compatibility layer frequency band extension signal in the enhancement layer signal of the current frame is the signal of the current frame Enhancement layer high-band spectrum signal.
  • the compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the first high frequency band signal wherein the compatibility layer decoded signal is a signal obtained by decoding the compatibility layer encoding parameters in the compatibility layer by the decoding component
  • the compatibility layer frequency band extension signal is a signal obtained by the decoding component through frequency band extension in the compatibility layer. For example, the low frequency band signal is extended to the high frequency band to obtain the compatibility layer frequency band extension signal.
  • the decoding component can select the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to the compatibility layer frequency band extension signal, that is, the enhancement layer signal and the compatibility layer decoded signal in the compatibility layer The corresponding signal is not selected, so that the enhancement layer high-band spectrum signal is a part of the spectrum signal selected from the enhancement layer signal.
  • a higher frequency band signal of a better compatibility layer can be obtained, so as to output a better audio output signal, and improve the performance of the audio output signal.
  • the first high-frequency band signal of the current frame is adapted to obtain the first high-frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame.
  • the second high frequency band signal includes: replacing the first high frequency band signal of the current frame with the enhancement layer signal of the current frame to obtain the second high frequency band signal of the current frame.
  • one implementation of the adaptation process can be direct replacement.
  • the decoding component can use the enhancement layer signal of the current frame to replace the first high-band signal of the current frame, that is, the first low-band signal in the compatibility layer.
  • the embodiment of the present application can obtain a higher frequency band signal of a better compatibility layer, thereby realizing output of a better audio output signal, and improving the performance of the audio output signal.
  • the using the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame includes : Acquire the enhancement layer high-band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high-band signal of the current frame; use the enhancement layer high-band adjustment parameter to adjust the current
  • the enhancement layer signal of the frame is subjected to adaptation processing to obtain an adapted enhancement layer signal; the enhancement layer signal after the adaptation processing is used to replace the first high frequency band signal of the current frame to obtain the The second high-band signal of the current frame.
  • the enhancement layer signal and the first high-band signal of the compatibility layer can be used to obtain the enhancement layer high-band adjustment parameters.
  • the enhancement layer high-band adjustment parameters (which may be simply referred to as adjustment parameters in the subsequent embodiments) are used
  • the enhancement layer high frequency band adjustment parameter can be obtained by using the enhancement layer signal of the current frame and the first high frequency band signal of the current frame, where the enhancement layer signal of the current frame and the current frame
  • the first high-frequency signal of the frame is a high-frequency audio signal
  • an adjustment parameter can be calculated through the enhancement layer signal of the current frame and the first high-frequency signal of the current frame, and the enhancement of the current frame by the adjustment parameter
  • the layer signal undergoes adaptation processing to obtain an enhanced layer signal after the adaptation processing.
  • the using the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame includes : Obtain enhancement layer high frequency band adjustment parameters according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame; use the enhancement layer signal of the current frame to compare the current frame Replace the first high-band signal to obtain a replaced first high-band signal; use the enhancement layer high-band adjustment parameter to perform adaptation processing on the replaced first high-band signal to Obtain the second high frequency band signal of the current frame.
  • the enhancement layer signal and the first high-band signal of the compatibility layer can be used to obtain the enhancement layer high-band adjustment parameters.
  • the enhancement layer high-band adjustment parameters (which may be simply referred to as adjustment parameters in the subsequent embodiments) are used
  • the enhancement layer high frequency band adjustment parameter can be obtained by using the enhancement layer signal of the current frame and the first high frequency band signal of the current frame, where the enhancement layer signal of the current frame and the current frame
  • the first high-frequency signal of the frame is a high-frequency audio signal.
  • an adjustment parameter can be calculated. After the signal is taken, the first high-frequency signal after replacement is adapted through the adjustment parameter to obtain the second high-frequency signal of the current frame.
  • the using the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame includes : Perform spectral component comparison selection on the enhancement layer signal of the current frame and the first high frequency band signal of the current frame to select the first enhancement layer sub-signal from the enhancement layer signal of the current frame; use all The first enhancement layer sub-signal replaces a signal with the same frequency spectrum as the first enhancement layer sub-signal in the first high-frequency band signal of the current frame to obtain the second high-frequency band signal of the current frame .
  • the spectrum component corresponding to the enhancement layer signal can be compared with the spectrum component corresponding to the first high-band signal in the compatibility layer signal.
  • the first enhancement layer signal can be selected from the enhancement layer signal of the current frame.
  • the decoding component performs the above-mentioned spectral component comparison and selection, and uses a part of the spectral components in the enhancement layer signal to replace the corresponding spectral components in the compatibility layer signal according to the comparison result to obtain the spectral components in the final output signal.
  • another part of the spectrum component in the enhancement layer signal is discarded, and the replaced spectrum component in the compatibility layer signal is combined with other spectrum components in the compatibility layer signal to obtain all the spectrum components of the final output signal.
  • the obtaining the enhancement layer signal of the current frame according to the enhancement layer coding parameter includes: determining the enhancement layer coding according to the enhancement layer coding parameter and the compatible layer coding parameter The to-be-decoded enhancement layer high-frequency signal in the parameters; and the to-be-decoded enhancement layer high-frequency signal in the enhancement layer coding parameters is decoded to obtain the enhancement layer signal of the current frame.
  • the enhancement layer coding parameters and the compatibility layer coding parameters can be obtained, and the decoding component determines the high frequency signal (that is, to be decoded) in the enhancement layer coding parameters that need to be decoded in the enhancement layer according to the enhancement layer coding parameters and the compatibility layer coding parameters.
  • Enhancement layer high frequency signal After the high frequency signal that is not determined to be decoded in the enhancement layer coding parameters, it can be discarded, so only the high frequency of the enhancement layer to be decoded The signal is decoded without the need to decode the entire enhancement layer coding parameters, which improves the decoding efficiency of the audio signal in the enhancement layer.
  • the first high-frequency band signal of the current frame is adapted to obtain the first high-frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame.
  • Two high frequency band signals including: obtaining the compatibility layer decoded signal and the compatibility layer frequency band extension signal in the compatibility layer signal of the current frame; performing combined processing on the compatibility layer frequency band extension signal and the enhancement layer signal of the current frame , To obtain the second high frequency band signal of the current frame.
  • the compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the compatibility layer signal can be determined, where the compatibility layer decoded signal is the signal obtained by decoding the compatibility layer encoding parameters in the compatibility layer by the decoding component, and the compatibility layer
  • the frequency band extension signal is a signal obtained by the decoding component through frequency band extension in the compatibility layer. For example, the low frequency band signal is extended to the high frequency band to obtain the compatibility layer frequency band extension signal.
  • the decoding component can perform combined processing on the compatibility layer frequency band extension signal and the enhancement layer signal of the current frame, that is, the compatibility layer decoded signal in the first high frequency band signal is not used for the combined processing with the enhancement layer signal, and the decoding The component uses only the compatibility layer frequency band extension signal and the enhancement layer signal of the current frame for combined processing. After obtaining the second high frequency band signal of the current frame, it uses the second high frequency band signal, the enhancement layer signal and the first low frequency band signal for processing. After the combination, the final output signal is obtained. A higher frequency band signal of a better compatibility layer can be obtained, so as to output a better audio output signal, and improve the performance of the audio output signal.
  • the decoding component can obtain which spectrums of the compatible layer signal are obtained through coding and decoding, and which spectrums are obtained through frequency band expansion, and the final output signal includes the codec of the compatible layer signal.
  • the frequency spectrum of the processing part, and the frequency spectrum of the frequency band extension part can be obtained by combining the corresponding spectral components in the enhancement layer signal and the compatibility layer signal.
  • the audio of the current frame is obtained according to the enhancement layer signal of the current frame, the second high-frequency signal of the current frame, and the first low-frequency signal of the current frame
  • the method further includes: performing post-processing on the audio output signal of the current frame.
  • the audio output signal can also be post-processed, so that the gain of the post-processing can be obtained.
  • the audio of the current frame is obtained according to the enhancement layer signal of the current frame, the second high-frequency signal of the current frame, and the first low-frequency signal of the current frame
  • the method further includes: obtaining a post-processing parameter according to the compatibility layer signal; and using the post-processing parameter to post-process the enhancement layer signal to obtain an enhancement layer signal that has completed the post-processing.
  • post-processing parameters refer to the parameters required for post-processing.
  • the corresponding post-processing parameters need to be obtained according to different types of post-processing.
  • Processing parameters use the post-processing parameters to post-process the enhancement layer signal. After the post-processing is completed, you can combine the post-processed enhancement layer signal, the second high-band signal of the current frame, and the first low-band signal of the current frame After processing, the audio output signal is obtained.
  • post-processing can be performed on the enhancement layer signal, so that the gain of the post-processing can be obtained.
  • an embodiment of the present application further provides an audio encoding device, the audio encoding device includes at least one processor, and the at least one processor is configured to be coupled with a memory to read and execute instructions in the memory, To achieve the method as described in any one of the foregoing first aspects.
  • the audio encoding device further includes: the memory.
  • an embodiment of the present application also provides an audio decoding device, the audio decoding device includes at least one processor, and the at least one processor is configured to be coupled with a memory to read and execute instructions in the memory, To achieve the method as described in any one of the foregoing second aspects.
  • the audio decoding device further includes: the memory.
  • an embodiment of the present application also provides an audio encoding device, the audio encoding device includes: a compatible layer encoder, an enhancement layer encoder, and a code stream multiplexer, wherein the compatible layer encoder is used for Acquire the current frame of the audio signal, the current frame including: a high-band signal and a low-band signal; obtain the compatible layer coding parameters of the current frame according to the high-band signal and the low-band signal; the enhancement layer An encoder for obtaining a current frame of an audio signal, where the current frame includes: a high-band signal and a low-band signal; obtaining the enhancement layer coding parameters of the current frame according to the high-band signal; the code stream complex The user is used for code stream multiplexing the compatibility layer coding parameter and the enhancement layer coding parameter to obtain a coded code stream.
  • the enhancement layer encoder is used to obtain the signal type information of the high-band signal of the current frame; when the signal type information of the high-band signal of the current frame indicates the preset signal type At this time, the high frequency band signal of the current frame is encoded to obtain the enhancement layer encoding parameters of the current frame.
  • the preset signal type includes at least one of the following: a harmonic signal type, a tone signal type, a white noise-like signal type, a transient signal type, or a fricative signal type.
  • the enhancement layer coding parameters of the current frame further include: signal type information of the high frequency band signal of the current frame.
  • the enhancement layer encoder is used to obtain compatible layer coding frequency band information; determine the to-be-coded frequency band signal in the high frequency band signal of the current frame according to the compatible layer coding frequency band information; The frequency band signal to be coded is coded to obtain the enhancement layer coding parameters.
  • the components of the audio encoding device can also perform the steps described in the first aspect and various possible implementations.
  • the components of the audio encoding device can also perform the steps described in the first aspect and various possible implementations.
  • an embodiment of the present application also provides an audio decoding device, the audio decoding device includes: a code stream demultiplexer, a compatibility layer decoder, an enhancement layer decoder, an adaptation processor, and a combiner, wherein: The code stream demultiplexer is used to obtain a code stream; perform code stream demultiplexing on the code stream to obtain the compatible layer coding parameters of the current frame of the audio signal and the enhancement layer coding of the current frame Parameters; the compatibility layer decoder is used to obtain the compatibility layer signal of the current frame according to the compatibility layer encoding parameter, the compatibility layer signal including: the first high-frequency band signal of the current frame and the current The first low-band signal of the frame; the enhancement layer decoder is configured to obtain the enhancement layer signal of the current frame according to the enhancement layer coding parameters; the adaptation processor is configured to obtain the enhancement layer signal of the current frame according to the enhancement layer of the current frame Layer coding parameters or enhancement layer signals to perform adaptation processing on the first high-frequency band signal
  • the enhancement layer decoder is configured to obtain signal type information according to the enhancement layer coding parameters of the current frame; enhance the current frame according to the preset signal type indicated by the signal type information
  • the layer coding parameters are decoded to obtain the enhancement layer signal of the current frame.
  • the adaptation processor is configured to obtain the compatibility layer high frequency band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame ; Use the compatibility layer high-band adjustment parameter to perform adaptation processing on the first high-band signal of the current frame to obtain the second high-band signal of the current frame.
  • the adaptation processor is configured to obtain the enhancement layer coding parameters of the current frame or the envelope information corresponding to the enhancement layer signal, and obtain the information of the first high frequency band signal of the current frame. Envelope information; obtaining the compatibility layer high-band adjustment parameter according to the enhancement layer coding parameter or the envelope information corresponding to the enhancement layer signal and the envelope information of the first high-band signal.
  • the adaptation processor is configured to select the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to a preset high-band spectrum selection rule; Combining the high-frequency spectrum signal of the enhancement layer and the first high-frequency signal of the current frame is performed to obtain the second high-frequency signal of the current frame.
  • the adaptation processor is configured to obtain the compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the first high frequency band signal of the current frame; and determine the enhancement layer of the current frame
  • the signal corresponding to the compatibility layer frequency band extension signal in the signal is the enhancement layer high-band spectrum signal of the current frame.
  • the adaptation processor is configured to use the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame. Band signal.
  • the adaptation processor is configured to obtain the enhancement layer high frequency band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame ; Use the enhancement layer high-band adjustment parameters to perform adaptation processing on the enhancement layer signal of the current frame to obtain the enhancement layer signal after the adaptation processing; use the enhancement layer signal after the adaptation processing to the The first high frequency band signal of the current frame is replaced to obtain the second high frequency band signal of the current frame.
  • the adaptation processor is configured to obtain the enhancement layer high frequency band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame Use the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the replaced first high frequency band signal; use the enhancement layer high frequency band adjustment parameter to adjust the The replaced first high-frequency signal is subjected to adaptation processing to obtain the second high-frequency signal of the current frame.
  • the adaptation processor is configured to compare and select the spectral components of the enhancement layer signal of the current frame and the first high-frequency band signal of the current frame to select The first enhancement layer sub-signal is selected from the enhancement layer signal; the first enhancement layer sub-signal is used to perform processing on a signal with the same frequency spectrum as the first enhancement layer sub-signal in the first high-frequency band signal of the current frame Replaced to obtain the second high frequency band signal of the current frame.
  • the enhancement layer decoder is configured to determine the to-be-decoded enhancement layer high-frequency signal in the enhancement layer coding parameters according to the enhancement layer coding parameters and the compatible layer coding parameters; The enhancement layer high frequency signal to be decoded in the enhancement layer coding parameters is decoded to obtain the enhancement layer signal of the current frame.
  • the adaptation processor is configured to obtain the compatibility layer decoded signal and the compatibility layer frequency band extension signal in the compatibility layer signal of the current frame;
  • the enhancement layer signals of the frame are combined and processed to obtain the second high frequency band signal of the current frame.
  • the spectrum range of the compatibility layer signal is [0, FL], wherein the spectrum range of the compatibility layer decoded signal is [0, FT], and the frequency band extension signal of the compatibility layer is The frequency spectrum range is [FT, FL]; the frequency spectrum range of the enhancement layer signal is [FX, FY]; the frequency spectrum range of the audio output signal is [0, FY];
  • the FL ⁇ FY, the FX>FT determines that the audio output signal is determined in the following manner: a signal with a frequency spectrum range of [0, FX] in the audio output signal is obtained from the compatibility layer signal, and the audio A signal with a frequency spectrum range of [FX, FL] in the output signal is obtained from the compatibility layer signal and the enhancement layer signal.
  • the adaptation processor is also used for the combiner according to the enhancement layer signal of the current frame, the second high frequency band signal of the current frame, and the first low frequency band signal of the current frame After the signal obtains the audio output signal of the current frame, post-processing is performed on the audio output signal of the current frame.
  • the adaptation processor is also used for the combiner according to the enhancement layer signal of the current frame, the second high frequency band signal of the current frame, and the first low frequency band signal of the current frame Before the signal obtains the audio output signal of the current frame, obtain post-processing parameters according to the compatibility layer signal; use the post-processing parameters to post-process the enhancement layer signal to obtain the post-processed enhancement layer signal .
  • the components of the audio decoding device can also perform the steps described in the foregoing second aspect and various possible implementations.
  • steps described in the foregoing second aspect and various possible implementations please refer to the foregoing description of the second aspect and various possible implementations. instruction of.
  • an embodiment of the present application also provides an audio encoding device, which may include: an acquisition module for acquiring a current frame of an audio signal, the current frame including: a high-band signal and a low-band signal; a compatibility layer encoding module , Used to obtain the compatible layer coding parameters of the current frame according to the high-band signal and the low-band signal; an enhancement layer coding module, used to obtain the enhancement layer coding of the current frame according to the high-band signal Parameters; multiplexing module, used for code stream multiplexing the compatibility layer coding parameters and the enhancement layer coding parameters to obtain a coded code stream.
  • the enhancement layer coding module is configured to obtain the signal type information of the high-band signal of the current frame; when the signal type information of the high-band signal of the current frame indicates the preset signal type At this time, the high frequency band signal of the current frame is encoded to obtain the enhancement layer encoding parameters of the current frame.
  • the preset signal type includes at least one of the following: a harmonic signal type, a tone signal type, a white noise-like signal type, a transient signal type, or a fricative signal type.
  • the enhancement layer coding parameters of the current frame further include: signal type information of the high frequency band signal of the current frame.
  • the enhancement layer coding module is used to obtain compatible layer coding frequency band information; determine the to-be-coded frequency band signal in the high frequency band signal of the current frame according to the compatible layer coding frequency band information; The frequency band signal to be coded is coded to obtain the enhancement layer coding parameters.
  • an embodiment of the present application also provides an audio decoding device, which may include: an acquisition module, configured to acquire an encoded code stream; and a demultiplexing module, configured to perform code stream demultiplexing on the encoded code stream to Obtain the compatibility layer encoding parameters of the current frame of the audio signal and the enhancement layer encoding parameters of the current frame; the compatibility layer decoding module is configured to obtain the compatibility layer signal of the current frame according to the compatibility layer encoding parameters, the compatibility layer The signal includes: the first high-band signal of the current frame and the first low-band signal of the current frame; an enhancement layer decoding module for obtaining the enhancement layer signal of the current frame according to the enhancement layer coding parameters; The adaptation module is configured to perform adaptation processing on the first high-band signal of the current frame according to the enhancement layer coding parameters or the enhancement layer signal of the current frame to obtain the second high-band signal of the current frame A combination module for obtaining the audio output signal of the current frame according to the enhancement layer signal of the current frame, the second
  • the enhancement layer decoding module is configured to obtain signal type information according to the enhancement layer coding parameters of the current frame; enhance the current frame according to the preset signal type indicated by the signal type information
  • the layer coding parameters are decoded to obtain the enhancement layer signal of the current frame.
  • the adaptation module is configured to obtain the compatibility layer high frequency band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame; Use the compatibility layer high frequency band adjustment parameter to perform adaptation processing on the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame.
  • the adaptation module is configured to obtain the enhancement layer coding parameters of the current frame or the envelope information corresponding to the enhancement layer signal, and to obtain the packet of the first high frequency band signal of the current frame. Envelope information; obtaining the compatibility layer high-band adjustment parameter according to the enhancement layer coding parameter or the envelope information corresponding to the enhancement layer signal and the envelope information of the first high-band signal.
  • the adaptation module is configured to select the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to a preset high-band spectrum selection rule;
  • the enhancement layer high-band spectrum signal and the first high-band signal of the current frame are combined for processing to obtain the second high-band signal of the current frame.
  • the adaptation module is configured to obtain the compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the first high frequency band signal of the current frame; determine the enhancement layer signal of the current frame The signal corresponding to the compatibility layer frequency band extension signal is the enhancement layer high-band spectrum signal of the current frame.
  • the adaptation module is configured to use the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency signal of the current frame With signal.
  • the adaptation module is configured to obtain an enhancement layer high-band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high-band signal of the current frame; Use the enhancement layer high frequency band adjustment parameters to perform adaptation processing on the enhancement layer signal of the current frame to obtain an enhancement layer signal after the adaptation processing; use the enhancement layer signal after the adaptation processing to perform the adaptation process on the current frame The first high-frequency signal of the frame is replaced to obtain the second high-frequency signal of the current frame.
  • the adaptation module is configured to obtain an enhancement layer high-band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high-band signal of the current frame; Use the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the replaced first high frequency band signal; use the enhancement layer high frequency band adjustment parameter to perform the replacement The subsequent first high-frequency signal is subjected to adaptation processing to obtain the second high-frequency signal of the current frame.
  • the adaptation module is configured to compare and select the spectral components of the enhancement layer signal of the current frame and the first high-frequency band signal of the current frame to select from the enhancement of the current frame
  • the first enhancement layer sub-signal is selected from the layer signal; the first enhancement layer sub-signal is used to replace the signal with the same frequency spectrum as the first enhancement layer sub-signal in the first high-frequency band signal of the current frame , To obtain the second high frequency band signal of the current frame.
  • the enhancement layer decoding module is configured to determine the to-be-decoded enhancement layer high-frequency signal in the enhancement layer coding parameters according to the enhancement layer coding parameters and the compatible layer coding parameters; The enhancement layer high frequency signal to be decoded in the enhancement layer coding parameters is decoded to obtain the enhancement layer signal of the current frame.
  • the adaptation module is used to obtain the compatibility layer decoded signal and the compatibility layer frequency band extension signal in the compatibility layer signal of the current frame; and for the compatibility layer frequency band extension signal and the current frame
  • the enhancement layer signals of the s are combined and processed to obtain the second high-frequency band signal of the current frame.
  • the spectrum range of the compatibility layer signal is [0, FL], wherein the spectrum range of the compatibility layer decoded signal is [0, FT], and the frequency band extension signal of the compatibility layer is The frequency spectrum range is [FT, FL]; the frequency spectrum range of the enhancement layer signal is [FX, FY]; the frequency spectrum range of the audio output signal is [0, FY];
  • the FL ⁇ FY, the FX>FT determines that the audio output signal is determined in the following manner: a signal with a frequency spectrum range of [0, FX] in the audio output signal is obtained from the compatibility layer signal, and the audio A signal with a frequency spectrum range of [FX, FL] in the output signal is obtained from the compatibility layer signal and the enhancement layer signal.
  • the audio decoding device 1000 may further include: a post-processing module, configured to combine the module according to the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the After the first low-band signal of the current frame obtains the audio output signal of the current frame, post-processing is performed on the audio output signal of the current frame.
  • a post-processing module configured to combine the module according to the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the After the first low-band signal of the current frame obtains the audio output signal of the current frame, post-processing is performed on the audio output signal of the current frame.
  • the audio decoding device may further include: a post-processing module, configured to combine modules based on the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the current frame Before obtaining the audio output signal of the current frame from the first low-band signal, obtain post-processing parameters according to the compatibility layer signal; use the post-processing parameters to perform post-processing on the enhancement layer signal to obtain the completion of the post-processing parameter The processed enhancement layer signal.
  • a post-processing module configured to combine modules based on the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the current frame Before obtaining the audio output signal of the current frame from the first low-band signal, obtain post-processing parameters according to the compatibility layer signal; use the post-processing parameters to perform post-processing on the enhancement layer signal to obtain the completion of the post-processing parameter The processed enhancement layer signal.
  • an embodiment of the present application provides a computer-readable storage medium that stores instructions in the computer-readable storage medium, which when run on a computer, causes the computer to execute the above-mentioned first or second aspect. The method described.
  • the embodiments of the present application provide a computer program product containing instructions, which when run on a computer, cause the computer to execute the method described in the first aspect or the second aspect.
  • an embodiment of the present application provides a communication device.
  • the communication device may include an entity such as an audio codec device or a chip.
  • the communication device includes a processor and optionally a memory; In storing instructions; the processor is configured to execute the instructions in the memory, so that the communication device executes the method according to any one of the foregoing first aspect or second aspect.
  • this application provides a chip system that includes a processor for supporting audio codec devices to implement the functions involved in the above aspects, for example, sending or processing the data and data involved in the above methods. /Or information.
  • the chip system further includes a memory, and the memory is used to store necessary program instructions and data of the audio codec device.
  • the chip system can be composed of chips, and can also include chips and other discrete devices.
  • FIG. 1 is a schematic structural diagram of an audio codec system provided by an embodiment of the application
  • FIG. 2 is a schematic flowchart of an audio coding method provided by an embodiment of the application
  • FIG. 3 is a schematic flowchart of an audio decoding method provided by an embodiment of this application.
  • FIG. 4 is a schematic diagram of a mobile terminal according to an embodiment of the application.
  • Fig. 5 is a schematic diagram of a network element according to an embodiment of the application.
  • FIG. 6 is a schematic flowchart of an audio encoding method according to an embodiment of the application.
  • FIG. 7a is a schematic diagram of the original signal spectrum provided by an embodiment of this application.
  • FIG. 7b is a schematic diagram of a spectrum of a compatible layer coded signal provided by an embodiment of this application.
  • FIG. 7c is a schematic diagram of a frequency spectrum of an enhancement layer coded signal provided by an embodiment of the application.
  • FIG. 7d is a schematic diagram of a frequency spectrum of an audio output signal provided by an embodiment of the application.
  • FIG. 8 is a schematic diagram of an output spectrum after a combination of enhancement layer coding parameters and compatibility layer coding parameters provided by an embodiment of the application;
  • FIG. 9 is a schematic diagram of the composition structure of an audio coding device provided by an embodiment of the application.
  • FIG. 10 is a schematic diagram of the composition structure of an audio decoding device provided by an embodiment of the application.
  • FIG. 11 is a schematic diagram of the composition structure of another audio coding device provided by an embodiment of the application.
  • FIG. 12 is a schematic diagram of the composition structure of another audio decoding device provided by an embodiment of the application.
  • FIG. 13 is a schematic diagram of the composition structure of another audio coding device provided by an embodiment of the application.
  • FIG. 14 is a schematic diagram of the composition structure of another audio decoding device provided by an embodiment of the application.
  • the embodiments of the present application provide an audio codec method and audio codec device, which are used to implement compatibility between a new codec device and an old codec device, and can improve the coding and decoding efficiency of audio signals.
  • the audio signal in the embodiment of the present application refers to the input signal in the audio encoding device.
  • the audio signal may include multiple frames.
  • the current frame may specifically refer to a certain frame in the audio signal.
  • the current frame The audio signal coding and decoding are illustrated by examples.
  • the previous frame or the next frame of the current frame in the audio signal can be coded and decoded according to the coding and decoding mode of the current frame audio signal.
  • the audio signal in the embodiment of the present application may be a mono audio signal, or may also be a stereo signal.
  • the stereo signal can be the original stereo signal, it can also be a stereo signal composed of two signals (the left channel signal and the right channel signal) included in the multi-channel signal, or it can be a multi-channel signal.
  • Fig. 1 is a schematic structural diagram of an audio coding and decoding system according to an exemplary embodiment of the application.
  • the audio codec system includes an encoding component 110 and a decoding component 120.
  • the audio codec system in the embodiments of the present application may include a compatibility layer and an enhancement layer.
  • an encoding component and a decoding component may be set for the compatibility layer
  • an encoding component and a decoding component may be set for the enhancement layer
  • the compatibility layer and The enhancement layer refers to two layers divided according to the frequency spectrum range of the processed audio signal.
  • the compatibility layer can process the entire frequency domain range of the audio signal, while the enhancement layer only processes the high frequency frequency domain range of the audio signal.
  • the compatibility layer can be implemented using old codec components, and the enhancement layer and compatibility layer can be implemented using new codec components.
  • the new codec components are compared with the old ones.
  • the new codec component must be fully backward compatible with the old new codec component, that is, the compatible layer signal of the audio codec includes all the spectral components of the input signal.
  • the audio codec system provided by the embodiment of the present application includes a compatibility layer and an enhancement layer.
  • the compatibility layer can fully implement the audio codec function, and the generated code stream is fully compatible with the old codec system.
  • the input of the compatibility layer is the original audio signal input to the audio codec system, and the compatibility layer encodes and decodes all the spectral components of the input signal.
  • the enhancement layer can encode and decode part of the frequency spectrum (for example, the high frequency range) of the input audio signal.
  • the decoder decides whether to use the decoded audio signal output by the compatible layer as the final decoded output signal, or to combine the decoded output signal of the enhancement layer with the decoded output signal of the compatible layer first, and then use it as the final decoded output signal.
  • the encoding component 110 is used to encode the current frame (audio signal) in the frequency domain or the time domain.
  • the encoding component 110 can be implemented by software; alternatively, it can also be implemented by hardware; or, it can also be implemented by a combination of software and hardware, which is not limited in the embodiments of the present application.
  • the encoding component 110 encodes the current frame in the frequency domain or the time domain, in a possible implementation manner, the steps shown in FIG. 2 may be included.
  • 201 Acquire a current frame of an audio signal, where the current frame includes: a high-frequency band signal and a low-frequency band signal.
  • the current frame can be any frame in the audio signal, and the current frame can include a high-band signal and a low-band signal.
  • the division of the high-band signal and the low-band signal can be determined by the frequency band threshold, which is higher than the frequency band threshold.
  • the frequency band threshold signal is a high frequency band signal, and the signal below the frequency band threshold value is a low frequency band signal.
  • the frequency band threshold can be determined according to the transmission bandwidth, the data processing capability of the encoding component 110 and the decoding component 120, and it will not be done here. limited.
  • the encoding of the high-band signal and the low-band signal can be implemented in the compatibility layer.
  • the compatibility layer encoding parameters of the current frame can be obtained.
  • the compatibility layer coding parameters refer to coding parameters obtained by encoding all frequency band signals of the audio signal in the compatibility layer.
  • the encoding of the high-band signal can be implemented in the enhancement layer.
  • the enhancement layer encoding parameters of the current frame can be obtained.
  • the enhancement layer coding parameter refers to the coding parameter obtained by encoding the high-band signal of the audio signal in the enhancement layer.
  • step 203 obtains the enhancement layer coding parameters of the current frame according to the high-band signal, including:
  • the high frequency band signal of the current frame is encoded to obtain the enhancement layer coding parameters of the current frame.
  • a signal classifier may be provided in the encoding component 110, by which the audio signal input to the encoding component 110 can be classified.
  • the signal type information of the high-band signal of the current frame is obtained, and the signal type information
  • a variety of signal classification results can be included according to the classified signal type.
  • the signal type information of the high frequency band signal of the current frame indicates the preset signal type
  • the high frequency band signal of the current frame is encoded to obtain the enhancement layer coding parameters of the current frame.
  • the audio signal can be divided into N preset signal types, N coding modes can be set in the enhancement layer, and a corresponding enhancement layer coding mode can be executed for each preset signal type, so that it can be used for different signal types.
  • the corresponding enhancement layer coding mode improves the coding efficiency of the audio signal.
  • the encoding component in the embodiment of the present application is provided with a signal classifier, and the signal classifier can be used to detect a specific type of audio signal.
  • the high-band signal is coded in the enhancement layer, otherwise it is not coded.
  • the signal classification result is used for the code stream multiplexing in step 204.
  • the high-band signal encoding parameters are also used for the code stream multiplexing in step 204. Otherwise, the code stream will not be multiplexed.
  • the encoding component uses the signal classification result to select a suitable enhancement layer encoding process, so that the signal classification result can also be used in the decoding end to decode the enhancement layer according to different preset signal types, so that the enhancement layer can be decoded according to different preset signal types.
  • the layer signal is used to process part of the spectrum processed by the compatibility layer to achieve the purpose of improving the performance of the final output signal.
  • the preset signal type includes at least one of the following: a harmonic signal type, a tone signal type, a white noise-like signal type, a transient signal type, or a fricative signal type.
  • the preset signal type of the high-band signal of the current frame can be multiple.
  • the signal type of the high-band signal of the current frame can be a harmonic signal type, that is, the high-band signal of the current frame is a harmonic signal. Therefore, the enhancement layer coding mode 1 can be used in the enhancement layer to encode the harmonic signal.
  • the signal type of the high-band signal of the current frame can be a tone signal type, that is, the high-band signal of the current frame contains tonal components, so the enhancement layer coding mode 2 can be used in the enhancement layer to encode the tone signal.
  • the enhancement layer coding mode 3 can be used in the enhancement layer.
  • the signal is encoded. If the signal type of the high-band signal of the current frame can be a transient signal type, that is, the high-band signal of the current frame includes a transient signal, so the enhancement layer coding mode 4 can be used in the enhancement layer to encode the transient signal .
  • the signal type of the high-band signal of the current frame can be the fricative signal type, that is, the high-frequency signal of the current frame includes the fricative signal, so the enhanced layer coding mode 5 can be used in the enhancement layer to encode the fricative signal.
  • a corresponding enhancement layer coding mode can be executed for each of the foregoing preset signal types, so that corresponding enhancement layer coding modes are adopted for different signal types, thereby improving the coding efficiency of audio signals.
  • the high-band signal of the current frame is not of the above-mentioned preset signal type, the high-band signal may not be encoded in the enhancement layer here.
  • the enhancement layer coding parameters of the current frame further include: signal type information of the high frequency band signal of the current frame.
  • the encoding component 110 can identify the high-frequency signal of the current frame according to the preset signal type for the audio signal, and the encoding component 110 can generate the signal type information of the high-frequency signal of the current frame, and compare the high-frequency signal of the current frame in the enhancement layer.
  • the enhancement layer coding parameters generated after the frequency band signal is encoded also include the signal type information of the high-band signal of the current frame. Therefore, when the code stream is multiplexed, the generated code stream can carry the high-frequency signal of the current frame.
  • Signal type information so that the signal type information can also be used in the decoding component to decode according to different preset signal types in the enhancement layer, so that the enhancement layer signal can be used to process part of the spectrum processed by the compatibility layer to achieve correct The purpose of the final output signal performance improvement.
  • step 203 obtains the enhancement layer coding parameters of the current frame according to the high-band signal, including:
  • the band signal to be coded is coded to obtain the enhancement layer coding parameters.
  • the encoding component 110 can also obtain compatibility layer encoding frequency band information.
  • the compatibility layer encoding frequency band information indicates the frequency band information of the audio signal encoded in the compatibility layer. That is, the compatibility layer encoding frequency band information can determine which or Which frequency bands have been coded with compatibility layer. Determine the to-be-encoded frequency band signal in the high-band signal of the current frame according to the compatibility layer encoding frequency band information, and determine the high-frequency band signal that needs to be encoded in the enhancement layer through the compatibility layer encoding frequency band information.
  • the coded band signal to be coded is coded to obtain enhancement layer coding parameters.
  • the compatibility layer encoding frequency band information output by the compatibility layer can be used to guide the encoding processing of the enhancement layer at the encoding end, so that the encoding in the enhancement layer and the encoding in the compatibility layer can complement each other and improve the audio in the enhancement layer. Signal coding efficiency.
  • the enhancement layer according to the signal classification information of the enhancement layer and the compatible layer coding band information, it is determined which high-frequency spectrum components to perform enhancement layer coding processing.
  • the signal classification information indicates the 4 frequency domain subbands of the current frame.
  • the enhancement layer can have the remaining 3 frequency domain subbands Perform enhancement layer coding processing, and no longer perform enhancement layer frequency domain coding on a frequency domain subband that has been coded in the compatibility layer, thereby reducing the number of frequency domain subbands that need to be coded in the enhancement layer and improving the enhancement layer Audio signal coding efficiency.
  • code stream multiplexing can be performed. From this, the compatible layer coding parameters and the enhancement layer coding parameters can be multiplexed into one code stream, that is, the coding code
  • the stream may include compatibility layer coding parameters and enhancement layer coding parameters.
  • the encoding component 110 can generate an encoded bitstream after encoding is completed, and the encoding component 110 can send the encoded bitstream to the decoding component 120, so that the decoding component 120 can receive the encoded bitstream, and then decode the encoded bitstream.
  • the component 120 obtains the audio output signal from the coded stream.
  • the encoding method shown in FIG. 2 is only an example and not a limitation.
  • the embodiment of the present application does not limit the execution order of the steps in FIG. 2 and the encoding method shown in FIG. 2 may also include more Or fewer steps, which are not limited in the embodiments of the present application.
  • the current frame of the audio signal is acquired.
  • the current frame includes: a high-band signal and a low-band signal; the compatibility layer encoding of the current frame is obtained according to the high-band signal and the low-band signal.
  • Parameters Obtain the enhancement layer coding parameters of the current frame according to the high frequency band signal; perform code stream multiplexing on the compatible layer coding parameters and the enhancement layer coding parameters to obtain the code stream.
  • the entire frequency domain range of the audio signal can be encoded in the compatibility layer, while only the high frequency domain range of the audio signal is encoded in the enhancement layer.
  • the compatibility layer can be implemented using old audio coding equipment, while the enhancement layer and compatibility layer can be implemented using new audio coding equipment. Therefore, in this embodiment of the present application, the compatibility between the new audio coding equipment and the old audio coding equipment is realized. According to the device type of the audio coding device itself, you can choose to encode only in the compatibility layer, or to encode in the compatibility layer and the enhancement layer at the same time. The embodiment of this application does not need to add a transcoding module for the old audio coding device, so it saves The cost of upgrading audio coding equipment is eliminated, and the coding efficiency of audio signals can be improved.
  • the encoding component 110 and the decoding component 120 may be connected in a wired or wireless manner, and the decoding component 120 may obtain the encoded bitstream generated by the encoding component 110 through the connection between the encoding component 110 and the encoding component 110; or, the encoding component 110 may The generated code stream is stored in the memory, and the decoding component 120 reads the code stream in the memory.
  • the decoding component 120 can be implemented by software; alternatively, it can also be implemented by hardware; or, it can also be implemented by a combination of software and hardware, which is not limited in the embodiment of the present application.
  • the decoding component 120 decodes the current frame (audio signal) in the frequency domain or the time domain, in a possible implementation manner, the steps shown in FIG. 3 may be included.
  • the code stream is sent by the coding component 110 to the decoding component 120.
  • the code stream may include compatibility layer coding parameters and enhancement layer coding parameters.
  • the decoding component 120 after the decoding component 120 obtains the coded code stream, it performs code stream demultiplexing on the current frame of the audio signal in the coded code stream, thereby obtaining the compatible layer coding parameters of the current frame and the enhancement layer of the current frame. Encoding parameters.
  • the compatibility layer signal includes: the first high frequency band signal of the current frame and the first low frequency band signal of the current frame.
  • the compatibility layer encoding parameters can be decoded in the compatibility layer to obtain the compatibility layer signal of the current frame.
  • the compatibility layer is decoded for the entire frequency domain range of the audio signal, Therefore, the obtained compatibility layer signal includes: the first high frequency band signal of the current frame and the first low frequency band signal of the current frame, that is, the first high frequency band signal and the first low frequency band signal are decoded in the compatibility layer.
  • the enhancement layer coding parameters can be decoded in the enhancement layer to obtain the enhancement layer signal of the current frame.
  • the high frequency range of the audio signal is decoded in the enhancement layer, so
  • the obtained enhancement layer signal includes: the high frequency band signal of the current frame, that is, the high frequency band signal is decoded in the enhancement layer.
  • the decoding component 120 is an old decoding component, all the frequency domain signals of the audio signal can be obtained by performing step 303. If the decoding component 120 is a new decoding component, steps 303 and 304 need to be performed. , You can get the compatibility layer signal and the enhancement layer signal separately.
  • obtaining the enhancement layer signal of the current frame according to the enhancement layer coding parameter includes:
  • the enhancement layer coding parameters of the current frame are decoded according to the preset signal type indicated by the signal type information to obtain the enhancement layer signal of the current frame.
  • the coded code stream can carry the signal type information of the audio signal.
  • the signal type information of the enhancement layer coding parameter of the current frame can be obtained.
  • the enhancement layer coding parameters of the current frame are decoded according to the preset signal type indicated by the signal type information to obtain the enhancement layer signal of the current frame.
  • the audio signal can be divided into N preset signal types, and the enhancement layer can be set with N kinds of decoding modes, a corresponding enhancement layer decoding mode can be executed for each preset signal type, so that corresponding enhancement layer decoding modes are adopted for different signal types, thereby improving the decoding efficiency of audio signals.
  • the decoding component uses the signal type information to select a suitable enhancement layer decoding process, so that the enhancement layer signal can be used to process part of the spectrum processed by the compatibility layer to achieve the purpose of improving the performance of the final output signal.
  • step 304 obtains the enhancement layer signal of the current frame according to the enhancement layer coding parameters, including:
  • the enhancement layer high frequency signal to be decoded in the enhancement layer coding parameters is decoded to obtain the enhancement layer signal of the current frame.
  • the decoding component can obtain the enhancement layer coding parameters and the compatibility layer coding parameters, and the decoding component determines the high frequency signals in the enhancement layer coding parameters that need to be decoded in the enhancement layer according to the enhancement layer coding parameters and the compatibility layer coding parameters (that is, the enhancement layer to be decoded). Layer high-frequency signal), and then decode the high-frequency signal that needs to be decoded in the enhancement layer.
  • the high-frequency signal that is not determined to be decoded in the enhancement layer coding parameters can be discarded, so only the high-frequency signal of the enhancement layer to be decoded Decoding is performed without the need to decode the entire enhancement layer coding parameters, and the audio signal decoding efficiency in the enhancement layer is improved.
  • the enhancement layer coding parameters and the compatibility layer coding parameters can be used to guide the decoding processing of the enhancement layer at the decoding end, so that the decoding in the enhancement layer and the decoding in the compatibility layer can complement each other and improve the audio in the enhancement layer. Signal coding efficiency.
  • the to-be-decoded enhancement layer high-frequency signal in the enhancement layer coding parameters is determined according to the enhancement layer coding parameters and the compatible layer coding parameters, that is, it can be determined which high frequency spectrum components to perform enhancement layer decoding processing.
  • the signal classification information indicates that the 4 frequency domain subbands of the current frame need to undergo enhancement layer encoding processing, but the encoding frequency band information output by the compatibility layer indicates that there is 1 of the 4 frequency domain subbands.
  • Frequency domain subbands are coded in the compatibility layer coding, so the enhancement layer can perform enhancement layer coding processing on the remaining 3 frequency domain subbands, and no enhancement is performed on one frequency domain subband that has been coded in the compatibility layer.
  • the frequency domain coding of the layer, the decoding end processing process is, the enhancement layer decodes and outputs 3 frequency domain subband signals, the compatible layer decodes the corresponding 3 frequency domain subband signals in the output signal and the 3 frequency domain subband signals in the enhancement layer signal.
  • the 3 frequency domain sub-band spectrum components which are the final output signal, are combined with all other sub-band signals to obtain the final output signal.
  • the number of frequency domain subbands that need to be decoded in the enhancement layer can be reduced, and the audio signal decoding efficiency in the enhancement layer can be improved.
  • the enhancement layer coding parameters or enhancement layer signal of the current frame can be used for adaptation processing, so as to realize the first high-band signal in the compatibility layer.
  • the adaptation processing of the current frame obtains the second high-band signal in the compatibility layer.
  • the enhancement layer coding parameter or enhancement layer signal of the current frame can be used to compare the first high-band signal in the compatibility layer. The signal is adapted and processed to achieve the purpose of improving the performance of the final audio output signal.
  • the adaptation processing on the first high-band signal of the current frame can be realized by using the enhancement layer signal of the current frame, where the adaptation processing refers to performing the first high-band signal in the compatibility layer. Adjust to improve the performance of the high-band signal output by the compatibility layer decoding.
  • the adaptation processing will be described in detail in the following.
  • step 305 performs adaptation processing on the first high frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame to obtain the second high frequency band signal of the current frame.
  • the decoding component 120 may use the enhancement layer coding parameter or the enhancement layer signal and the first high frequency band signal of the compatibility layer to obtain the compatibility layer high frequency band adjustment parameter, and the compatibility layer high frequency band adjustment parameter (in the subsequent embodiments may be referred to as The adjustment parameter) is an adjustment parameter used to adjust the high frequency part of the compatibility layer signal.
  • the compatibility layer high frequency band adjustment parameter can be obtained by using the enhancement layer signal of the current frame and the first high frequency band signal of the current frame, where the enhancement layer signal of the current frame and the first high frequency band signal of the current frame are both It is a high-frequency audio signal.
  • An adjustment parameter can be calculated from the enhancement layer signal of the current frame and the first high-frequency signal of the current frame, and the first high-frequency signal of the current frame is adapted through the adjustment parameter. , To get the second high frequency band signal of the current frame.
  • the adjustment parameters can be obtained from the enhancement layer signal of the current frame and the first high-frequency band signal of the current frame.
  • the adjustment parameters are used to adapt the high-frequency spectrum components of the compatible layer signal to combine the enhancement layer signal and the first high-frequency band signal. After the compatible layer signals after the adaptation process are combined, the final output signal can be obtained.
  • obtaining the compatibility layer high frequency band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame includes:
  • the decoding component can directly parse the compatibility layer to obtain the output information of the compatibility layer.
  • This output information and the enhancement layer signal are jointly calculated to obtain the high-frequency spectrum adjustment parameters of the compatibility layer signal.
  • This adjustment parameter is used to adjust the high frequency of the compatibility layer signal.
  • the band signal is adjusted and combined with the output signal of the enhancement layer to obtain the final output signal.
  • the calculation of the adjustment parameter can be implemented in multiple ways.
  • the adjustment parameter can be calculated by using the enhancement layer coding parameter or the envelope information corresponding to the enhancement layer signal and the envelope information of the first high frequency band signal, where the enhancement layer coding parameter corresponds to
  • the envelope information of may be the envelope information of the high-band signal calculated according to the enhancement layer coding parameters, or the envelope information corresponding to the enhancement layer signal may be the amplitude of the enhancement layer signal, the envelope information of the first high-band signal
  • the information can be the amplitude of the high-band signal in the compatibility layer signal.
  • Using the enhancement layer coding parameters or the envelope information corresponding to the enhancement layer signal and the envelope information of the first high-band signal can calculate the compatibility layer high-band adjustment parameter. Among them, there may be multiple ways to calculate the high-band adjustment parameters of the compatibility layer.
  • the adjustment parameter para (Envelope-EnvTonal)/Envelope is calculated first, and the compatibility layer signal is The high frequency part of is multiplied by the adjustment parameter para to obtain the adjusted compatibility layer signal, and the enhancement layer signal and the adjusted compatibility layer signal are combined to obtain the final output signal.
  • the compatibility layer high frequency band adjustment parameters can be directly obtained from the compatibility layer in this embodiment, the compatibility layer high frequency band adjustment parameters are used to adjust the compatibility layer signal and combine with the enhancement layer output to obtain the final output signal.
  • a higher frequency band signal of a better compatibility layer can be obtained, so as to output a better audio output signal, and improve the performance of the audio output signal.
  • step 305 performs adaptation processing on the first high frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame to obtain the second high frequency band signal of the current frame.
  • the high-frequency spectrum selection rule can be preset in the decoding component, and the high-frequency spectrum selection rule can be used to instruct to select the high-frequency spectrum signal from the enhancement layer signal.
  • the high-frequency spectrum selection rule specifies the selected one.
  • multiple frequency bands, or high-band spectrum selection rules indicate the frequency bands to be selected from the enhancement layer signal.
  • Select the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to the preset high-band spectrum selection rule, and the enhancement layer high-band spectrum signal is the selected high-frequency band in the enhancement layer signal
  • the spectrum signal is combined with the high-frequency spectrum signal of the enhancement layer and the first high-frequency signal of the current frame to obtain the second high-frequency signal of the current frame.
  • part of the high-frequency signal can be selected from the enhancement layer signal to be combined with the first high-frequency signal in the compatibility layer, which can be generated in the compatibility layer
  • the second high-band signal therefore, the embodiment of the present application can obtain a higher-band signal of a better compatibility layer, thereby realizing output of a better audio output signal, and improving the performance of the audio output signal.
  • selecting the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to a preset high-band spectrum selection rule includes:
  • the signal corresponding to the compatible layer frequency band extension signal in the enhancement layer signal of the current frame is the enhancement layer high-band spectrum signal of the current frame.
  • the decoding component can determine the compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the first high-frequency band signal, where the compatibility layer decoded signal is a signal obtained by the decoding component decoding the compatibility layer encoding parameters in the compatibility layer,
  • the compatibility layer frequency band extension signal is a signal obtained by the decoding component through frequency band extension in the compatibility layer. For example, the low frequency band signal is extended to the high frequency band to obtain the compatibility layer frequency band extension signal.
  • the decoding component can select the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to the compatibility layer frequency band extension signal, that is, the enhancement layer signal and the compatibility layer decoded signal in the compatibility layer The corresponding signal is not selected, so that the enhancement layer high-band spectrum signal is a part of the spectrum signal selected from the enhancement layer signal.
  • a higher frequency band signal of a better compatibility layer can be obtained, so as to output a better audio output signal, and improve the performance of the audio output signal.
  • the enhancement layer signal is subjected to selection processing by analyzing the output signal of the compatibility layer, and then combined with the compatibility layer signal to obtain the final output signal.
  • the principle of selection processing may include: the compatibility layer signal includes the codec part and the frequency band extension part, the enhancement layer signal should be combined with the frequency band extension part in the compatibility layer signal to obtain the high frequency part of the final output signal, if the compatibility layer signal is Corresponding spectrum components in the enhancement layer signal are obtained by encoding and decoding, and the high-band part of the final output signal does not select this part of the spectrum component of the enhancement layer signal. Otherwise, the part of the spectrum component in the enhancement layer signal and the compatible layer signal are selected. This part of the spectrum is combined and processed to obtain this part of the spectrum component of the final output signal.
  • the difference between the second adaptation processing method and the aforementioned adaptation processing method one is that a part of the enhancement layer signal needs to be selected to be combined with the compatibility layer signal to obtain the final output signal, and part of the spectrum component of the enhancement layer signal is discarded.
  • a part of the enhancement layer signal needs to be selected to be combined with the compatibility layer signal to obtain the final output signal, and part of the spectrum component of the enhancement layer signal is discarded.
  • there is a tonal component at a certain frequency of the enhancement layer signal and it happens that there is a tonal component with equivalent energy near this frequency in the compatible layer signal.
  • the tonal component in the compatible layer signal is obtained by direct encoding and decoding.
  • So the tonal component output at this frequency point of the enhancement layer is discarded at this time, and the tonal component of this frequency point in the compatibility layer is directly output as the frequency spectrum of the final output signal at this frequency point.
  • this embodiment analyzes and compares the spectrum components of the enhancement layer signal with the spectrum components corresponding to the compatibility layer signal. The conclusion is that some of the spectrum components in the enhancement layer signal are discarded, and the other part of the spectrum components is combined with the compatibility layer signal. As the final output signal, that is, based on the enhancement layer signal and the compatibility layer signal, a better output signal can be obtained.
  • the enhancement layer signal may be a frequency domain signal
  • the compatibility layer signal may be a time domain signal.
  • the compatibility layer signal may be converted into a frequency domain signal first, and the frequency domain signal After the frequency domain coefficients of the enhancement layer signal are adapted and combined with the frequency domain coefficients of the compatibility layer signal, the frequency domain signal is converted into a time domain signal, so that the final output signal can be obtained.
  • step 305 performs adaptation processing on the first high frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame to obtain the second high frequency band signal of the current frame.
  • one implementation of the adaptation process can be direct replacement.
  • the decoding component can use the enhancement layer signal of the current frame to replace the first high-band signal of the current frame, that is, the first low-band signal in the compatibility layer is not reserved.
  • the first high frequency band signal in the compatibility layer can be replaced with the enhancement layer signal of the current frame, and the enhancement layer signal of the current frame can be used as the second high frequency band signal after the adaptation process. Therefore, the embodiment of the present application can obtain a higher frequency band signal of a better compatibility layer, thereby realizing output of a better audio output signal, and improving the performance of the audio output signal.
  • the third adaptation processing method is different from the foregoing adaptation processing methods 1 and 2 in that in the adaptation processing method 3, the enhancement layer signal is replaced with part of the spectrum component of the compatible layer signal.
  • the compatibility layer signal is Ylc(n)
  • the enhancement layer signal is Yel(n).
  • the high-frequency spectrum HF in the compatibility layer signal Ylc(n) is removed, and the signal HFe and Ylc(n) represented by Yel(n) are removed.
  • the low frequency spectrum LF in) is combined to form the final output signal Y(n).
  • the compatibility layer signal is the time-domain signal Ylc(t)
  • the enhancement layer signal is the time-domain signal Yel(t)
  • the time-domain signal Ylc(t) is first subjected to low-pass filtering, and the time-domain signal Ylc(t) is combined with the time-domain signal Yel(t).
  • the compatibility layer signal is the frequency domain signal Ylc(k)
  • the enhancement layer signal is the frequency domain signal Yel(k)
  • Obtain the final spectral coefficients, and convert the spectral coefficients into time-domain signals as the final output signal, that is, the output signal Y(t) is obtained by the following formula:
  • Y(k) is converted into time-domain signal Y(t) as the final output signal.
  • the enhancement layer output spectrum components By using the enhancement layer output spectrum components to replace part of the spectrum components in the compatible layer signal, an output signal with better coding and decoding performance than that of the compatible layer signal is obtained.
  • the compatibility layer of this embodiment is fully backward compatible with the old codec.
  • the enhancement layer of this embodiment encodes and decodes certain types of signals according to the signal classification information, and the output signal spectrum of the enhancement layer is used at the decoder according to the signal classification information. After the components replace part of the spectral components in the output signal of the compatibility layer, the final output signal is obtained.
  • using the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame includes:
  • the first high-frequency band signal of the current frame is replaced with the enhanced layer signal after the adaptation process to obtain the second high-frequency band signal of the current frame.
  • the decoding component 120 may use the enhancement layer signal and the first high frequency band signal of the compatibility layer to obtain the enhancement layer high frequency band adjustment parameter, and the enhancement layer high frequency band adjustment parameter (may be simply referred to as the adjustment parameter in the subsequent embodiments) is used
  • the enhancement layer high frequency band adjustment parameter can be obtained by using the enhancement layer signal of the current frame and the first high frequency band signal of the current frame, where the enhancement layer signal of the current frame and the current frame
  • the first high-frequency signal of the frame is a high-frequency audio signal
  • an adjustment parameter can be calculated through the enhancement layer signal of the current frame and the first high-frequency signal of the current frame, and the enhancement of the current frame by the adjustment parameter
  • the layer signal undergoes adaptation processing to obtain an enhanced layer signal after the adaptation processing.
  • using the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame includes:
  • the decoding component 120 may use the enhancement layer signal and the first high frequency band signal of the compatibility layer to obtain the enhancement layer high frequency band adjustment parameter, and the enhancement layer high frequency band adjustment parameter (may be simply referred to as the adjustment parameter in the subsequent embodiments) is used
  • the enhancement layer high frequency band adjustment parameter can be obtained by using the enhancement layer signal of the current frame and the first high frequency band signal of the current frame, where the enhancement layer signal of the current frame and the current frame
  • the first high-frequency signal of the frame is a high-frequency audio signal. According to the enhancement layer signal of the current frame and the first high-frequency signal of the current frame, an adjustment parameter can be calculated.
  • the first high-frequency signal after replacement is adapted through the adjustment parameter to obtain the second high-frequency signal of the current frame.
  • the parameters to perform adaptation processing on the replaced first high-frequency signal a higher-frequency signal of a better compatibility layer can be obtained, thereby achieving a better audio output signal and improving the performance of the audio output signal.
  • the enhancement layer signal is adapted to replace part of the spectrum components of the compatibility layer signal, and then combined with other spectrum components of the compatibility layer to obtain the final output signal.
  • the enhancement layer signal is replaced with part of the spectrum component of the compatibility layer signal before adaptation processing is performed, and the final output signal is obtained after combining with other spectrum components of the compatibility layer.
  • the spectrum component of the enhancement layer signal needs to be adapted before or after replacing the spectrum component corresponding to the compatibility layer, which is specifically as follows:
  • the compatibility layer signal is the time domain signal Ylc(t) and the enhancement layer signal is the time domain signal Yel(t)
  • the time domain signal Ylc(t) is first subjected to low-pass filtering and adaptation processing, and then the time domain signal Ylc(t)
  • the signal Yel(t) is superimposed to obtain the final output signal, that is, the output signal Y(t) is obtained by the following formula:
  • Y(t) LowFilter(Ylc(t))+Preprocessing(Yel(t)).
  • the compatibility layer signal is the frequency domain signal Ylc(k), and its corresponding high-frequency spectrum component energy is EnerLC
  • the enhancement layer signal is the frequency domain signal Yel(k), and its energy is EnerEL.
  • the frequency domain coefficients of the enhancement layer after the adaptation process are combined with the low-band frequency domain coefficients of the compatibility layer to obtain
  • the frequency domain coefficient of the output signal specifically, the output signal Y(t) is obtained by the following formula:
  • This embodiment achieves the purpose of improving the codec performance quality of the final output signal by replacing the enhanced layer signal after the adaptation process with the spectrum component corresponding to the compatible layer signal.
  • using the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame includes:
  • the first enhancement layer sub-signal is used to replace a signal with the same frequency spectrum as the first enhancement layer sub-signal in the first high-frequency band signal of the current frame to obtain the second high-frequency band signal of the current frame.
  • the decoding component can compare the spectral component corresponding to the enhancement layer signal with the spectral component corresponding to the first high-band signal in the compatible layer signal, and after completing the spectral component comparison, select the first enhancement layer from the enhancement layer signal of the current frame Sub-signal, and finally use the selected first enhancement layer sub-signal to replace the signal with the same spectrum as the first enhancement layer sub-signal in the first high-frequency band signal of the current frame to obtain the second high-frequency signal of the current frame With signal.
  • the decoding component performs the above-mentioned spectral component comparison and selection, and uses a part of the spectral components in the enhancement layer signal to replace the corresponding spectral components in the compatibility layer signal according to the comparison result to obtain the spectral components in the final output signal.
  • another part of the spectrum component in the enhancement layer signal is discarded, and the replaced spectrum component in the compatibility layer signal is combined with other spectrum components in the compatibility layer signal to obtain all the spectrum components of the final output signal.
  • the decoding component performs spectral component comparison and selection operation before combining the enhancement layer signal and the compatibility layer signal.
  • the process of comparison and selection is: suppose that there is a spectral component Wk in the enhancement layer signal, and the compatibility layer signal is near Wk If there is a spectrum component Zk with equivalent energy, the judgment conclusion is that the spectrum component Zk is obtained by the compatible layer codec processing. Compared with Wk, Zk is closer to the corresponding spectrum component in the original signal, so Zk is selected as the spectrum component of the final output signal.
  • the decoding component selects the optimal enhancement layer signal corresponding to the frequency spectrum component of the final signal output according to the enhancement layer signal and the compatibility layer signal.
  • the high frequency band of the compatibility layer signal contains high-quality codec spectrum components. Select the compatibility layer to output the new spectrum components as the spectrum components of the final output signal. Under the principle of introducing the enhancement layer codec to improve the overall codec performance, taking into account the special situation of the compatibility layer signal containing high-performance codec spectrum components, and finally achieve Optimal codec output signal.
  • step 305 performs adaptation processing on the first high frequency band signal of the current frame according to the enhancement layer coding parameters or enhancement layer signal of the current frame to obtain the second high frequency band signal of the current frame.
  • the decoding component can determine the compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the compatibility layer signal, where the compatibility layer decoded signal is the signal obtained by the decoding component decoding the compatibility layer encoding parameters in the compatibility layer, and the compatibility layer frequency band
  • the extension signal is a signal obtained by the decoding component through frequency band extension in the compatibility layer. For example, the low frequency band signal is extended to the high frequency band to obtain the compatibility layer frequency band extension signal.
  • the decoding component can perform combined processing on the compatibility layer frequency band extension signal and the enhancement layer signal of the current frame, that is, the compatibility layer decoded signal in the first high frequency band signal is not used for the combined processing with the enhancement layer signal, and the decoding The component only uses the compatibility layer frequency band extension signal and the enhancement layer signal of the current frame for combined processing. After obtaining the second high frequency band signal of the current frame, it uses the second high frequency band signal, the enhancement layer signal and the first low frequency band signal for processing. After the combination, the final output signal is obtained. A higher frequency band signal of a better compatibility layer can be obtained, so as to output a better audio output signal, and improve the performance of the audio output signal.
  • the spectrum range of the compatibility layer signal is [0, FL], where the spectrum range of the compatibility layer decoded signal is [0, FT], and the spectrum range of the compatibility layer band extension signal is [FT, FL];
  • the frequency spectrum range of the enhancement layer signal is [FX, FY];
  • the frequency spectrum range of the audio output signal is [0, FY];
  • the audio output signal is determined by the following method: the signal with the spectrum range of [0, FT] in the audio output signal is obtained from the compatible layer signal, and the signal with the spectrum range of [FT, FL] in the audio output signal The signal is obtained from the compatibility layer signal and the enhancement layer signal; or,
  • the audio output signal is determined by the following method: the signal with the spectrum range of [0, FX] in the audio output signal is obtained from the compatible layer signal, and the signal with the spectrum range of [FX, FL] in the audio output signal The signal is obtained from the compatibility layer signal and the enhancement layer signal; or,
  • the audio output signal is determined by the following method: the signal with the spectrum range of [0, FT] in the audio output signal is obtained from the compatible layer signal, and the spectrum range of the audio output signal is [FT, FL] The signal of is obtained from the compatibility layer signal and the enhancement layer signal; or,
  • the signal with the frequency range of [0, FX] in the audio output signal is obtained from the compatible layer signal, and the signal with the frequency range of [FX, FL] in the audio output signal The signal is obtained from the compatibility layer signal and the enhancement layer signal.
  • the compatibility layer signal may include a compatibility layer decoded signal and a compatibility layer frequency band extension signal
  • the decoding component can determine the boundary between the compatibility layer decoded signal and the compatibility layer frequency band extension signal in the compatibility layer signal, so that the spectrum of the compatibility layer decoded signal can be determined
  • the range is [0, FT]
  • the spectrum range of the compatible layer band extension signal is [FT, FL].
  • the decoding component can obtain which spectrums in the compatibility layer signal are obtained through codec processing and which spectrums are obtained through frequency band extension.
  • the final output signal includes the codec processing part of the compatibility layer signal.
  • the frequency spectrum, and the frequency spectrum of the frequency band extension part can be obtained by combining the corresponding spectral components in the enhancement layer signal and the compatibility layer signal.
  • the spectrum range of the compatibility layer signal is 0 to FL, where 0 to FT are directly encoded and decoded, and FT to FL
  • the spectrum range of the enhancement layer signal is FX to FY
  • the final output signal is Y.
  • the audio output signal is determined by the following method:
  • the spectrum range of the audio output signal The signal of [0, FT] is obtained through the compatibility layer signal, and the signal of the frequency spectrum range of [FT, FL] in the audio output signal is obtained through the compatibility layer signal and the enhancement layer signal.
  • FL FY, FX>FT, that is, the minimum spectrum range FX of the enhancement layer signal is greater than the maximum spectrum range of the compatible layer decoded signal.
  • the audio output signal is determined by the following method: the spectrum range of the audio output signal is [0, The signal of FX] is obtained through the compatibility layer signal, and the signal of the frequency spectrum range of [FX, FL] in the audio output signal is obtained through the compatibility layer signal and the enhancement layer signal.
  • the minimum spectrum range FX of the enhancement layer signal is smaller than the maximum spectrum range of the compatible layer decoded signal.
  • the audio output signal is determined by the following method:
  • the signal with the spectrum range of [0, FT] in the audio output signal is obtained through the compatibility layer signal, and the signal with the spectrum range of [FT, FL] in the audio output signal is enhanced by the compatibility layer signal
  • the layer signal is obtained.
  • FL ⁇ FY,FX>FT that is, the maximum spectrum range FY of the enhancement layer signal is greater than the spectrum range of the compatible layer frequency band extension signal, and the minimum spectrum range FX of the enhancement layer signal is greater than the maximum spectrum range of the compatible layer decoded signal.
  • the audio output signal is determined by the following method: the signal with the spectrum range of [0, FX] in the audio output signal is obtained through the compatibility layer signal, and the signal with the spectrum range of [FX, FL] in the audio output signal is obtained through the compatibility layer signal and the enhancement layer signal get
  • the compatibility layer is fully backward compatible with the old codec components, and the combined output adaptively generates a high-performance final output signal based on the output signal of the compatibility layer, the codec spectrum range, and the enhancement layer signal.
  • the upper limit of the spectrum range of the combined processing is the upper limit of the enhanced layer codec processing spectrum range, which is the cut-off frequency of the original signal
  • the lower limit of the spectrum range of the combined processing is compatible
  • Layer coding processes the larger value of the upper limit of the spectrum range and the lower limit of the enhancement layer signal spectrum range to ensure that the final output signal spectrum range includes the entire spectrum range of the input signal.
  • the output signal has both compatible layer signals and enhancements. Advantages of layer signal.
  • step 305 the adaptation process for the first high-band signal can be completed in the compatibility layer, and the second high-band signal in the compatibility layer is obtained.
  • the compatibility layer The first low-band signal output by decoding, the enhancement layer signal in the enhancement layer, and the second high-band signal in the compatibility layer are combined for processing, so that the audio output signal of the current frame can be obtained, and the audio output signal of the current frame can be used for Audio playback of the audio playback component.
  • the decoding method shown in FIG. 3 is only an example and not a limitation.
  • the embodiment of the present application does not limit the execution order of the steps in FIG. 3, and the decoding method shown in FIG. 3 may also include more Or fewer steps, which are not limited in the embodiments of the present application.
  • step 306 obtains the audio output signal of the current frame according to the enhancement layer signal of the current frame, the second high-frequency signal of the current frame, and the first low-frequency signal of the current frame
  • the decoding methods provided also include:
  • the decoding component after the decoding component obtains the audio output signal of the current frame, it can also perform post-processing on the audio output signal, so as to obtain the gain of the post-processing.
  • post-processing includes at least one of the following: dynamic range control, rendering, and sound mixing.
  • the decoding component may include a post-processor.
  • the post-processor is used to post-process high-frequency signals, such as the enhancement layer signal, the second high-frequency signal of the current frame, and the first low-frequency signal of the current frame.
  • the audio output signal is obtained after the combined processing of the band signal, the audio output signal is post-processed.
  • the functions of the post-processor may include dynamic range control (DRC), rendering, audio mixing, etc., and the post-processing method adopted in actual application scenarios is not limited.
  • step 306 obtains the audio output signal of the current frame according to the enhancement layer signal of the current frame, the second high-frequency signal of the current frame, and the first low-frequency signal of the current frame
  • the decoding methods provided also include:
  • the post-processing parameters are used to post-process the enhancement layer signal to obtain the post-processed enhancement layer signal.
  • the decoding component can also obtain post-processing parameters according to the compatibility layer signal before obtaining the audio output signal of the current frame.
  • the post-processing parameters refer to the parameters required for post-processing, and the corresponding post-processing needs to be obtained according to different types of post-processing. Parameters, use the post-processing parameters to post-process the enhancement layer signal. After the post-processing is completed, the post-processed enhancement layer signal, the second high-band signal of the current frame, and the first low-band signal of the current frame can be combined for processing , And then get the audio output signal.
  • post-processing can be performed on the enhancement layer signal, so that the gain of the post-processing can be obtained.
  • the enhancement layer signal is combined with the post-processed compatibility layer signal to obtain the final output signal.
  • the same post-processing as the compatibility layer is added to the enhancement layer.
  • post-processing such as dynamic range control, rendering, and mixing is performed, and then combined processing is performed. For example, if it is possible to obtain the directly decoded and processed signal of the compatibility layer, the enhancement layer signal and the compatibility layer signal are combined first, and then the above-mentioned post-processing is performed together. For another example, if the signal after the direct decoding process of the compatibility layer cannot be obtained, the above-mentioned post-processing is performed on the enhancement layer signal first, and then combined with the compatibility layer signal.
  • the post-processing parameter can be directly obtained from the compatible layer signal, and then the post-processing parameter can be used to perform the post-processing on the enhancement layer signal.
  • the post-processing parameter can be used to perform the post-processing on the enhancement layer signal.
  • through post-processing it can be ensured that the spectrum components before and after the combination processing have similar energy relationships between sub-bands in the frame according to the sub-bands, so as to ensure that the final audio output signal can be obtained through the combination processing.
  • the compatibility layer is fully compatible with the old codec components, and the combined processed signal includes the post-processing operations carried out when the compatibility layer is output, so that the old codec component can realize the full frequency range of the audio signal. Codec within.
  • the coded stream is obtained; the coded stream is demultiplexed to obtain the compatible layer coding parameters of the current frame of the audio signal and the enhancement layer coding parameters of the current frame ;
  • the compatibility layer signal includes: the first high frequency band signal of the current frame and the first low frequency band signal of the current frame; obtain the enhancement layer signal of the current frame according to the enhancement layer coding parameters ;
  • the enhancement layer coding parameters or enhancement layer signal of the current frame the first high frequency band signal of the current frame is adapted to obtain the second high frequency band signal of the current frame; according to the enhancement layer signal of the current frame, the current frame
  • the second high-band signal of the current frame and the first low-band signal of the current frame obtain the audio output signal of the current frame.
  • the entire frequency domain range of the audio signal can be decoded in the compatibility layer, while only the high frequency domain range of the audio signal is decoded in the enhancement layer.
  • the compatibility layer can be implemented using old audio decoding equipment, and the enhancement layer and compatibility layer can be implemented using new audio decoding equipment. Therefore, in this embodiment of the application, the compatibility between the new audio decoding equipment and the old audio decoding equipment is realized. According to the device type of the audio decoding device itself, you can choose to decode only at the compatibility layer, or at the same time the compatibility layer and the enhancement layer.
  • the embodiment of this application does not need to add a transcoding module for the old audio decoding device, so it saves The cost of upgrading audio decoding equipment is eliminated, and the decoding efficiency of audio signals can be improved.
  • the encoding component 110 and the decoding component 120 can be provided in the same device; or, they can also be provided in different devices.
  • the device can be a terminal with audio signal processing functions such as mobile phones, tablet computers, laptop computers and desktop computers, Bluetooth speakers, voice recorders, wearable devices, etc., or it can be a core network or wireless network with audio signal processing capabilities This embodiment does not limit this.
  • the encoding component 110 is installed in the mobile terminal 130
  • the decoding component 120 is installed in the mobile terminal 140.
  • the mobile terminal 130 and the mobile terminal 140 are independent of each other and have audio signal processing capabilities.
  • the electronic device may be a mobile phone, a wearable device, a virtual reality (VR) device, or an augmented reality (AR) device, etc., and the mobile terminal 130 and the mobile terminal 140 are connected wirelessly or wiredly. Take network connection as an example.
  • the mobile terminal 130 may include an acquisition component 131, an encoding component 110, and a channel encoding component 132, where the acquisition component 131 is connected to the encoding component 110, and the encoding component 110 is connected to the encoding component 132.
  • the mobile terminal 140 may include an audio playing component 141, a decoding component 120, and a channel decoding component 142.
  • the audio playing component 141 is connected to the decoding component 120
  • the decoding component 120 is connected to the channel decoding component 142.
  • the mobile terminal 130 After the mobile terminal 130 collects the audio signal through the collection component 131, it encodes the audio signal through the encoding component 110 to obtain a coded code stream; then, the channel coding component 132 encodes the coded code stream to obtain a transmission signal.
  • the mobile terminal 130 transmits the transmission signal to the mobile terminal 140 through a wireless or wired network.
  • the mobile terminal 140 After receiving the transmission signal, the mobile terminal 140 decodes the transmission signal through the channel decoding component 142 to obtain a code stream; decodes the code stream through the decoding component 110 to obtain an audio signal; and plays the audio signal through the audio playback component. It can be understood that the mobile terminal 130 may also include components included in the mobile terminal 140, and the mobile terminal 140 may also include components included in the mobile terminal 130.
  • the encoding component 110 and the decoding component 120 are provided in a network element 150 capable of processing audio signals in the same core network or wireless network as an example for description.
  • the network element 150 includes a channel decoding component 151, a decoding component 120, an encoding component 110, and a channel encoding component 152.
  • the channel decoding component 151 is connected to the decoding component 120
  • the decoding component 120 is connected to the encoding component 110
  • the encoding component 110 is connected to the channel encoding component 152.
  • the channel decoding component 151 After the channel decoding component 151 receives the transmission signal sent by other devices, it decodes the transmission signal to obtain the first coded code stream; the decoding component 120 decodes the coded code stream to obtain the audio signal; the coding component 110 performs the decoding on the audio signal Encode to obtain a second coded code stream; use the channel coding component 152 to encode the second coded code stream to obtain a transmission signal.
  • the other device may be a mobile terminal with audio signal processing capability; or, it may also be other network elements with audio signal processing capability, which is not limited in this embodiment.
  • the encoding component 110 and the decoding component 120 in the network element can transcode the encoded code stream sent by the mobile terminal.
  • the device installed with the encoding component 110 may be referred to as an audio encoding device.
  • the audio encoding device may also have an audio decoding function, which is not limited in the implementation of this application.
  • the device installed with the decoding component 120 may be referred to as an audio decoding device.
  • the audio decoding device may also have an audio encoding function, which is not limited in the implementation of this application.
  • FIG. 6 is a schematic diagram of an audio encoding and decoding process in an embodiment of this application.
  • the left side of the dotted line is the encoding end
  • the right side of the dotted line is the decoding end.
  • FIG. 7a it is a schematic diagram of the original signal frequency spectrum provided by the embodiment of this application.
  • the curve shown in Fig. 7a is the frequency spectrum of the original signal in each frequency band.
  • FIG. 7b it is a schematic diagram of the spectrum of the compatibility layer encoding signal provided by this embodiment of the application.
  • the compatibility layer encoding signal spectrum includes: high-band signals and For low-band signals, the left side of the vertical line in Figure 7b is the low-band signal, and the right side of the vertical line is the high-band signal.
  • the encoding end can also perform signal classification on the input signal, generate signal type parameters during signal classification, and perform enhancement layer encoding according to the signal type parameters to obtain an enhancement layer signal.
  • the enhanced layer encoding signal spectrum provided by this embodiment of the application
  • the dotted line shown in FIG. 7c is the frequency spectrum of the enhancement layer coded signal in the high frequency band.
  • the compatibility layer signal, the enhancement layer signal and the signal type parameter are coded stream multiplexed to obtain the coded code stream. As shown in FIG.
  • FIG. 7d a schematic diagram of the frequency spectrum of the audio output signal provided by this embodiment of the application, the compatibility layer signal, the enhancement layer signal, and the signal type parameter are coded stream multiplexed, that is, the compatibility layer coded signal spectrum and the signal type parameter shown in FIG. 7b can be combined with The enhanced layer coded signal spectra shown in Fig. 7c are combined to generate a coded code stream.
  • An example is as follows. First, input the input signal to the compatible layer encoder, and the compatible layer encoding parameters encoded by the compatible encoder are sent to the code stream multiplexer.
  • the input signal can also be input to the signal classifier, and the signal type parameters are sent to the code stream multiplexer. Enter the code stream multiplexer, select the corresponding enhancement layer mode 1 to N according to the signal type parameters to encode part of the spectral components of the input signal, and send the enhancement layer coding parameters encoded by the enhancement layer encoder to the code stream multiplexer
  • the code stream output by the code stream multiplexer is sent to the decoding end.
  • the compatible layer coding frequency band information may also be sent to the enhancement layer encoder, so that the enhancement layer encoder can determine which information in the enhancement layer is to be used in the enhancement layer according to the compatible layer coding frequency band information.
  • the frequency band is encoded.
  • the decoder first demultiplexes the coded stream, decodes the signal type parameter to obtain the signal type parameter, obtains the enhancement layer signal through the enhancement layer decoding, and obtains the compatible layer signal through the compatibility layer decoding, and then uses the signal type parameter and enhancement
  • the layer signal performs adaptation processing on the compatible layer signal, and then combines the adapted compatible layer signal, signal type parameter, and enhancement layer signal, and finally obtains the output signal.
  • the decoder uses the code stream demultiplexer to send the compatible layer encoding parameters to the compatible layer decoder to obtain the compatible layer signal.
  • the signal type parameter decoder decodes the signal type parameter, and the enhancement layer mode 1 to N decoder Decode and output the enhancement layer signal according to the input corresponding code stream and signal type parameters.
  • the enhancement layer signal is used through the adapter to adapt the compatibility layer signal.
  • the adapted compatibility layer signal, enhancement layer signal and signal type parameters will be adjusted The information is sent to the combiner, and the final output signal of the decoder is obtained from the combiner.
  • the compatibility layer codec in the embodiment of the application can be any codec.
  • the compatibility layer codec can be an MPEG-H 3D Audio codec.
  • This codec includes the time domain codec mode and the transform domain codec. Decoding mode, supports encoding and decoding of multi-channel input signals. The coding and decoding process of the compatibility layer codec will not be described in detail.
  • the compatibility layer signal may also be sent to the enhancement layer decoder, so that the enhancement layer decoder can determine which frequency bands in the enhancement layer to decode according to the compatibility layer signal,
  • the enhancement layer decoder can determine which frequency bands in the enhancement layer to decode according to the compatibility layer signal
  • the enhancement layer coding and decoding processing method will be illustrated by an example.
  • One processing method includes: the signal classifier divides the high-frequency signal into the following three preset signal types: harmonic signals, signals containing independent tonal components, and other signals. Perform different processing operations on the above three signals.
  • the encoder end can encode the fundamental frequency, number of harmonics, amplitude, and base energy of the harmonic signal, so that the enhancement layer coding parameters can be obtained.
  • the decoding end reconstructs a harmonic signal with energy equivalent to the original signal at the corresponding position according to the fundamental frequency, the number of harmonics, the amplitude and the base energy.
  • the tonal components are processed according to a sinusoidal trajectory curve at the encoding end, and the amplitude, phase, and the starting point and ending point of the trajectory line are encoded to obtain the enhancement layer coding parameters.
  • the enhancement layer coding The parameters are sent to the decoder, and the decoder reconstructs the signal containing tonal components according to the decoded amplitude, phase, and the starting point and end point of the trajectory.
  • the encoding end does not perform enhancement layer encoding processing, and directly uses the compatible layer signal as the final output signal.
  • Another processing method includes: the signal classifier divides the high-frequency signal into 4 types of signals, including: harmonic signals, signals containing independent tonal components, white-noise-like signals, and other signals.
  • the harmonic signal is a signal with independent tonal components.
  • the processing method of other signals is the same as the previous processing method.
  • the encoding end uses white noise as the excitation signal to calculate the envelope information of the enhancement layer together with the original high-band signal.
  • the envelope information of the enhancement layer is transmitted to the decoding end as an enhancement layer encoding parameter, and the decoding end Using the received envelope information, white noise is used as the excitation signal to reconstruct the enhancement layer signal.
  • the signal classifier can also divide the high-band signal into more types of signals and divide into N signal types.
  • the enhancement layer encoder has N coding modes, and each coding mode handles one type. signal.
  • the signal classifier divides the high-frequency signal into 6 types of signals, including: harmonic signals, signals containing independent tonal components, white noise-like signals, transient signals, fricative signals and other signals.
  • the harmonic signal the signal containing the independent tone component, the white noise-like signal, other signals are processed in the same way as the previous one.
  • the enhancement layer encodes the time-domain envelope more finely, so that the assignment difference between the time-domain envelopes of the subframes included in the transient signal is more obvious.
  • the enhancement layer finely encodes the spectrum envelope of the signal, so that the spectrum envelope of the restored signal at the decoding end is closer to the original signal, thereby achieving the purpose of improving the encoding performance.
  • FIG. 8 it is a schematic diagram of the output spectrum after the combination of the enhancement layer coding parameters and the compatibility layer coding parameters provided in this embodiment of the application.
  • Ylc(n) represents the compatibility layer coding parameters
  • Ylc(n) includes high frequency signal HF and low frequency signal LF
  • Yel(n) represents enhancement layer coding parameters
  • Yel(n) includes high frequency signal HFe
  • enhancement layer coding The final output signal after the combination of the parameters and the compatibility layer coding parameters is Y(n).
  • Y(n) includes the high-frequency signal HFnew and the low-frequency signal LF, where the high-frequency signal HFnew can be the enhancement layer signal and the compatibility layer signal adaptation After the high-frequency signal.
  • the x(n) signal passes through the signal classifier, and the generated signal classification parameters are put into the code stream. If the signal classification parameters indicate that the current frame contains harmonic signals, it will be coded through the enhancement layer.
  • the above Ylc(n) and Yel(n) are combined to obtain the output signal Y(n), the signal bandwidth of which is composed of two partial frequency bands LF and HFnew.
  • the codec performance quality of Y(n) is better than that of Ylc(n).
  • H1(.) and H2(.) are the adaptation processing function of the compatibility layer signal and the adaptation processing function of the enhancement layer signal, respectively.
  • the decoding end reconstructs the corresponding harmonic component according to the magnitude of the fundamental frequency, the number of harmonics and the amplitude and sets it to Yel(k).
  • the output signal Y(k) is:
  • Y(k) is transformed into time-domain signal Y(t), which is the final output signal.
  • an audio codec system includes a compatibility layer and an enhancement layer.
  • the compatibility layer can fully implement the audio codec function, and the generated code stream is fully compatible with the old codec system.
  • the compatibility layer of this embodiment is fully backward compatible with the old codec.
  • the enhancement layer of this embodiment encodes and decodes signals of the preset signal type according to the signal classification parameters, and the enhancement layer signal and the compatibility layer signal are processed according to the signal classification parameters at the decoding end. The final output signal is obtained after combined processing.
  • the enhancement layer can encode and decode part of the frequency spectrum of the input audio signal.
  • the decoder decides whether to use the decoded audio signal output by the compatible layer as the final decoded output signal, or to combine the decoded output of the enhancement layer and the compatible layer as the final decoded output signal.
  • the compatibility layer and the audio codec system have the same input signal, and the compatibility layer encodes and decodes all the spectral components of the input signal.
  • a signal classifier is used to perform enhanced encoding of signals of a preset signal type through an enhancement layer, and an enhancement layer signal and a compatible layer signal are combined to obtain the overall output signal of the decoder.
  • the overall output signal coding and decoding performance of the decoder is excellent.
  • an audio coding device 900 may include: an acquisition module 901, a compatibility layer coding module 902, an enhancement layer coding module 903, and a multiplexing module 904, where:
  • the acquisition module is used to acquire the current frame of the audio signal, where the current frame includes: a high frequency band signal and a low frequency band signal;
  • a compatible layer coding module configured to obtain the compatible layer coding parameters of the current frame according to the high frequency band signal and the low frequency band signal;
  • An enhancement layer coding module configured to obtain the enhancement layer coding parameters of the current frame according to the high frequency band signal
  • the multiplexing module is used for code stream multiplexing the compatibility layer coding parameter and the enhancement layer coding parameter to obtain a coded code stream.
  • the enhancement layer coding module is configured to obtain the signal type information of the high-band signal of the current frame; when the signal type information of the high-band signal of the current frame indicates the preset signal type At this time, the high frequency band signal of the current frame is encoded to obtain the enhancement layer encoding parameters of the current frame.
  • the preset signal type includes at least one of the following: a harmonic signal type, a tone signal type, a white noise-like signal type, a transient signal type, or a fricative signal type.
  • the enhancement layer coding parameters of the current frame further include: signal type information of the high frequency band signal of the current frame.
  • the enhancement layer coding module is used to obtain compatible layer coding frequency band information; determine the to-be-coded frequency band signal in the high frequency band signal of the current frame according to the compatible layer coding frequency band information; The frequency band signal to be coded is coded to obtain the enhancement layer coding parameters.
  • the current frame of the audio signal is acquired.
  • the current frame includes: a high-band signal and a low-band signal; the compatibility layer encoding of the current frame is obtained according to the high-band signal and the low-band signal.
  • Parameters Obtain the enhancement layer coding parameters of the current frame according to the high frequency band signal; perform code stream multiplexing on the compatible layer coding parameters and the enhancement layer coding parameters to obtain the code stream.
  • the entire frequency domain range of the audio signal can be encoded in the compatibility layer, while only the high frequency domain range of the audio signal is encoded in the enhancement layer.
  • the compatibility layer can be implemented using old audio coding equipment, while the enhancement layer and compatibility layer can be implemented using new audio coding equipment. Therefore, in this embodiment of the present application, the compatibility between the new audio coding equipment and the old audio coding equipment is realized. According to the device type of the audio coding device itself, you can choose to encode only in the compatibility layer, or to encode in the compatibility layer and the enhancement layer at the same time. The embodiment of this application does not need to add a transcoding module for the old audio coding device, so it saves The cost of upgrading audio coding equipment is eliminated, and the coding efficiency of audio signals can be improved.
  • an audio decoding device 1000 may include: an acquisition module 1001, a demultiplexing module 1002, a compatibility layer decoding module 1003, an enhancement layer decoding module 1004, an adaptation module 1005, and Combination module 1006, in which,
  • the acquisition module is used to acquire the code stream
  • a demultiplexing module configured to demultiplex the code stream to obtain the compatible layer coding parameters of the current frame of the audio signal and the enhancement layer coding parameters of the current frame;
  • the compatibility layer decoding module is configured to obtain the compatibility layer signal of the current frame according to the compatibility layer coding parameters, and the compatibility layer signal includes: the first high-frequency band signal of the current frame and the first high-frequency signal of the current frame Low-band signal;
  • An enhancement layer decoding module configured to obtain the enhancement layer signal of the current frame according to the enhancement layer coding parameter
  • the adaptation module is configured to perform adaptation processing on the first high-band signal of the current frame according to the enhancement layer coding parameters or the enhancement layer signal of the current frame to obtain the second high-band signal of the current frame ;
  • the combination module is configured to obtain the audio output signal of the current frame according to the enhancement layer signal of the current frame, the second high frequency band signal of the current frame, and the first low frequency band signal of the current frame.
  • the enhancement layer decoding module is configured to obtain signal type information according to the enhancement layer coding parameters of the current frame; enhance the current frame according to the preset signal type indicated by the signal type information
  • the layer coding parameters are decoded to obtain the enhancement layer signal of the current frame.
  • the adaptation module is configured to obtain the compatibility layer high frequency band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high frequency band signal of the current frame; Use the compatibility layer high frequency band adjustment parameter to perform adaptation processing on the first high frequency band signal of the current frame to obtain the second high frequency band signal of the current frame.
  • the adaptation module is configured to obtain the enhancement layer coding parameters of the current frame or the envelope information corresponding to the enhancement layer signal, and to obtain the packet of the first high frequency band signal of the current frame. Envelope information; obtaining the compatibility layer high-band adjustment parameter according to the enhancement layer coding parameter or the envelope information corresponding to the enhancement layer signal and the envelope information of the first high-band signal.
  • the adaptation module is configured to select the enhancement layer high-band spectrum signal of the current frame from the enhancement layer signal of the current frame according to a preset high-band spectrum selection rule;
  • the enhancement layer high-band spectrum signal and the first high-band signal of the current frame are combined for processing to obtain the second high-band signal of the current frame.
  • the adaptation module is configured to obtain the compatibility layer decoded signal and the compatibility layer frequency band extension signal included in the first high frequency band signal of the current frame; determine the enhancement layer signal of the current frame The signal corresponding to the compatibility layer frequency band extension signal is the enhancement layer high-band spectrum signal of the current frame.
  • the adaptation module is configured to use the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the second high frequency signal of the current frame With signal.
  • the adaptation module is configured to obtain an enhancement layer high-band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high-band signal of the current frame; Use the enhancement layer high frequency band adjustment parameters to perform adaptation processing on the enhancement layer signal of the current frame to obtain an enhancement layer signal after the adaptation processing; use the enhancement layer signal after the adaptation processing to perform the adaptation process on the current frame The first high-frequency signal of the frame is replaced to obtain the second high-frequency signal of the current frame.
  • the adaptation module is configured to obtain an enhancement layer high-band adjustment parameter according to the enhancement layer coding parameter or enhancement layer signal of the current frame and the first high-band signal of the current frame; Use the enhancement layer signal of the current frame to replace the first high frequency band signal of the current frame to obtain the replaced first high frequency band signal; use the enhancement layer high frequency band adjustment parameter to perform the replacement The subsequent first high-frequency signal is subjected to adaptation processing to obtain the second high-frequency signal of the current frame.
  • the adaptation module is configured to compare and select the spectral components of the enhancement layer signal of the current frame and the first high-frequency band signal of the current frame to select from the enhancement of the current frame
  • the first enhancement layer sub-signal is selected from the layer signal; the first enhancement layer sub-signal is used to replace the signal with the same frequency spectrum as the first enhancement layer sub-signal in the first high-frequency band signal of the current frame , To obtain the second high frequency band signal of the current frame.
  • the enhancement layer decoding module is configured to determine the to-be-decoded enhancement layer high-frequency signal in the enhancement layer coding parameters according to the enhancement layer coding parameters and the compatible layer coding parameters; The enhancement layer high frequency signal to be decoded in the enhancement layer coding parameters is decoded to obtain the enhancement layer signal of the current frame.
  • the adaptation module is used to obtain the compatibility layer decoded signal and the compatibility layer frequency band extension signal in the compatibility layer signal of the current frame; and for the compatibility layer frequency band extension signal and the current frame
  • the enhancement layer signals of the s are combined and processed to obtain the second high-frequency band signal of the current frame.
  • the spectrum range of the compatibility layer signal is [0, FL], wherein the spectrum range of the compatibility layer decoded signal is [0, FT], and the frequency band extension signal of the compatibility layer is The frequency spectrum range is [FT, FL]; the frequency spectrum range of the enhancement layer signal is [FX, FY]; the frequency spectrum range of the audio output signal is [0, FY];
  • the FL ⁇ FY, the FX>FT determines that the audio output signal is determined in the following manner: a signal with a frequency spectrum range of [0, FX] in the audio output signal is obtained from the compatibility layer signal, and the audio A signal with a frequency spectrum range of [FX, FL] in the output signal is obtained from the compatibility layer signal and the enhancement layer signal.
  • the audio decoding device 1000 may further include: a post-processing module, configured to combine the module according to the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the After the first low-band signal of the current frame obtains the audio output signal of the current frame, post-processing is performed on the audio output signal of the current frame.
  • a post-processing module configured to combine the module according to the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the After the first low-band signal of the current frame obtains the audio output signal of the current frame, post-processing is performed on the audio output signal of the current frame.
  • the audio decoding device 1000 may further include: a post-processing module, configured to combine the module according to the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the Before the first low-band signal of the current frame obtains the audio output signal of the current frame, obtain post-processing parameters according to the compatibility layer signal; use the post-processing parameters to perform post-processing on the enhancement layer signal to obtain the completion data
  • a post-processing module configured to combine the module according to the enhancement layer signal of the current frame, the second high-band signal of the current frame, and the Before the first low-band signal of the current frame obtains the audio output signal of the current frame, obtain post-processing parameters according to the compatibility layer signal; use the post-processing parameters to perform post-processing on the enhancement layer signal to obtain the completion data
  • the post-processed enhancement layer signal is described.
  • the coded stream is obtained; the coded stream is demultiplexed to obtain the compatible layer coding parameters of the current frame of the audio signal and the enhancement layer coding parameters of the current frame ;
  • the compatibility layer signal includes: the first high frequency band signal of the current frame and the first low frequency band signal of the current frame; obtain the enhancement layer signal of the current frame according to the enhancement layer coding parameters ;
  • the enhancement layer coding parameters or enhancement layer signal of the current frame the first high frequency band signal of the current frame is adapted to obtain the second high frequency band signal of the current frame; according to the enhancement layer signal of the current frame, the current frame
  • the second high-band signal of the current frame and the first low-band signal of the current frame obtain the audio output signal of the current frame.
  • the entire frequency domain range of the audio signal can be decoded in the compatibility layer, while only the high frequency domain range of the audio signal is decoded in the enhancement layer.
  • the compatibility layer can be implemented using old audio decoding equipment, and the enhancement layer and compatibility layer can be implemented using new audio decoding equipment. Therefore, in this embodiment of the application, the compatibility between the new audio decoding equipment and the old audio decoding equipment is realized. According to the device type of the audio decoding device itself, you can choose to decode only at the compatibility layer, or at the same time the compatibility layer and the enhancement layer.
  • the embodiment of this application does not need to add a transcoding module for the old audio decoding device, so it saves The cost of upgrading audio decoding equipment is eliminated, and the decoding efficiency of audio signals can be improved.
  • an embodiment of the present application further provides an audio encoding device.
  • the audio encoding device 1100 includes: a compatible layer encoder 1101, an enhancement layer encoder 1102, and a code stream multiplexer 1103, where:
  • the compatibility layer encoder is used to obtain a current frame of an audio signal, and the current frame includes: a high-band signal and a low-band signal; obtaining the current frame of the current frame according to the high-band signal and the low-band signal Compatibility layer coding parameters;
  • the enhancement layer encoder is configured to obtain a current frame of an audio signal, the current frame including: a high frequency band signal and a low frequency band signal; obtaining the enhancement layer coding parameters of the current frame according to the high frequency band signal;
  • the code stream multiplexer is configured to perform code stream multiplexing on the compatibility layer coding parameter and the enhancement layer coding parameter to obtain a coded code stream.
  • the audio encoding device may execute the audio encoding method shown in FIG. 2 above.
  • the audio encoding method shown in FIG. 2 above.
  • the audio decoding device 1200 includes: a code stream demultiplexer 1201, a compatible layer decoder 1202, an enhancement layer decoder 1203, and an adaptation processor 1204 and combiner 1205, of which,
  • the code stream demultiplexer is used to obtain a code stream; perform code stream demultiplexing on the code stream to obtain the compatible layer coding parameters of the current frame of the audio signal and the enhancement layer coding of the current frame parameter;
  • the compatibility layer decoder is configured to obtain the compatibility layer signal of the current frame according to the compatibility layer encoding parameter, and the compatibility layer signal includes: the first high frequency band signal of the current frame and the current frame The first low-band signal;
  • the enhancement layer decoder is configured to obtain the enhancement layer signal of the current frame according to the enhancement layer coding parameter
  • the adaptation processor is configured to perform adaptation processing on the first high frequency band signal of the current frame according to the enhancement layer coding parameter or enhancement layer signal of the current frame, so as to obtain the second high frequency band signal of the current frame.
  • Band signal
  • the combiner is configured to obtain the audio output signal of the current frame according to the enhancement layer signal of the current frame, the second high frequency band signal of the current frame, and the first low frequency band signal of the current frame.
  • the audio decoding device may execute the audio decoding method shown in FIG. 3, for details, please refer to the example of the audio decoding method in the foregoing embodiment, which will not be repeated here.
  • An embodiment of the present application also provides a computer storage medium, wherein the computer storage medium stores a program, and the program executes some or all of the steps recorded in the above method embodiments.
  • the audio coding device 1300 includes:
  • the receiver 1301, the transmitter 1302, the processor 1303, and the memory 1304 (the number of processors 1303 in the audio encoding device 1300 may be one or more, and one processor is taken as an example in FIG. 13).
  • the receiver 1301, the transmitter 1302, the processor 1303, and the memory 1304 may be connected by a bus or in other ways, wherein the bus connection is taken as an example in FIG. 13.
  • the memory 1304 may include a read-only memory and a random access memory, and provides instructions and data to the processor 1303. A part of the memory 1304 may also include a non-volatile random access memory (NVRAM).
  • NVRAM non-volatile random access memory
  • the memory 1304 stores an operating system and operating instructions, executable modules or data structures, or a subset of them, or an extended set of them.
  • the operating instructions may include various operating instructions for implementing various operations.
  • the operating system may include various system programs for implementing various basic services and processing hardware-based tasks.
  • the processor 1303 controls the operation of the audio encoding device.
  • the processor 1303 may also be referred to as a central processing unit (CPU).
  • CPU central processing unit
  • the various components of the audio encoding device are coupled together through a bus system.
  • the bus system may also include a power bus, a control bus, and a status signal bus.
  • various buses are referred to as bus systems in the figure.
  • the methods disclosed in the foregoing embodiments of the present application may be applied to the processor 1303 or implemented by the processor 1303.
  • the processor 1303 may be an integrated circuit chip with signal processing capability. In the implementation process, each step of the above method can be completed by an integrated logic circuit of hardware in the processor 1303 or instructions in the form of software.
  • the aforementioned processor 1303 may be a general-purpose processor, a digital signal processing (digital signal processing, DSP), an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or Other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components.
  • DSP digital signal processing
  • ASIC application specific integrated circuit
  • FPGA field-programmable gate array
  • the methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed.
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
  • the steps of the method disclosed in the embodiments of the present application may be directly embodied as being executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in the decoding processor.
  • the software module can be located in a mature storage medium in the field, such as random access memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers.
  • the storage medium is located in the memory 1304, and the processor 1303 reads the information in the memory 1304, and completes the steps of the foregoing method in combination with its hardware.
  • the receiver 1301 can be used to receive input digital or character information, and to generate signal input related to the related settings and function control of the audio coding device.
  • the transmitter 1302 can include display devices such as a display screen, and the transmitter 1302 can be used to output through an external interface Number or character information.
  • the processor 1303 is configured to execute the audio encoding method shown in FIG. 2 above.
  • the audio decoding device 1400 includes:
  • the receiver 1401, the transmitter 1402, the processor 1403, and the memory 1404 (the number of processors 1403 in the audio decoding device 1400 may be one or more, and one processor is taken as an example in FIG. 14).
  • the receiver 1401, the transmitter 1402, the processor 1403, and the memory 1404 may be connected by a bus or in other ways, where the bus connection is taken as an example in FIG. 14.
  • the memory 1404 may include a read-only memory and a random access memory, and provides instructions and data to the processor 1403. A part of the memory 1404 may also include NVRAM.
  • the memory 1404 stores an operating system and operating instructions, executable modules or data structures, or a subset of them, or an extended set of them.
  • the operating instructions may include various operating instructions for implementing various operations.
  • the operating system may include various system programs for implementing various basic services and processing hardware-based tasks.
  • the processor 1403 controls the operation of the audio decoding device, and the processor 1403 may also be referred to as a CPU.
  • the various components of the audio decoding device are coupled together through a bus system, where the bus system may include a power bus, a control bus, and a status signal bus in addition to a data bus.
  • bus system may include a power bus, a control bus, and a status signal bus in addition to a data bus.
  • various buses are referred to as bus systems in the figure.
  • the methods disclosed in the foregoing embodiments of the present application may be applied to the processor 1403 or implemented by the processor 1403.
  • the processor 1403 may be an integrated circuit chip with signal processing capability. In the implementation process, the steps of the foregoing method can be completed by an integrated logic circuit of hardware in the processor 1403 or instructions in the form of software.
  • the aforementioned processor 1403 may be a general-purpose processor, DSP, ASIC, FPGA or other programmable logic device, discrete gate or transistor logic device, or discrete hardware component.
  • the methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed.
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
  • the steps of the method disclosed in the embodiments of the present application may be directly embodied as being executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in the decoding processor.
  • the software module may be located in a mature storage medium in the field, such as random access memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers.
  • the storage medium is located in the memory 1404, and the processor 1403 reads the information in the memory 1404, and completes the steps of the foregoing method in combination with its hardware.
  • the processor 1403 is configured to execute the audio decoding method shown in FIG. 3.
  • the chip when the audio encoding device or the audio decoding device is a chip in the terminal, the chip includes: a processing unit and a communication unit.
  • the processing unit may be, for example, a processor, and the communication unit may be, for example, Input/output interface, pin or circuit, etc.
  • the processing unit can execute the computer-executable instructions stored in the storage unit, so that the chip in the terminal executes the method of any one of the above-mentioned first aspects.
  • the storage unit is a storage unit in the chip, such as a register, a cache, etc., and the storage unit may also be a storage unit in the terminal located outside the chip, such as a read-only memory (read-only memory). -only memory, ROM) or other types of static storage devices that can store static information and instructions, random access memory (RAM), etc.
  • processor mentioned in any of the foregoing may be a general-purpose central processing unit, a microprocessor, an ASIC, or one or more integrated circuits used to control the execution of the program of the method in the first aspect.
  • the device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physically separate.
  • the physical unit can be located in one place or distributed across multiple network units. Some or all of the modules can be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the connection relationship between the modules indicates that they have a communication connection between them, which may be specifically implemented as one or more communication buses or signal lines.
  • this application can be implemented by means of software plus necessary general hardware.
  • it can also be implemented by dedicated hardware including dedicated integrated circuits, dedicated CPUs, dedicated memory, Dedicated components and so on to achieve.
  • all functions completed by computer programs can be easily implemented with corresponding hardware.
  • the specific hardware structures used to achieve the same function can also be diverse, such as analog circuits, digital circuits or special-purpose circuits. Circuit etc.
  • software program implementation is a better implementation in more cases.
  • the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a readable storage medium, such as a computer floppy disk. , U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk, etc., including several instructions to make a computer device (which can be a personal computer, server, or network device, etc.) execute the methods described in each embodiment of this application .
  • a computer device which can be a personal computer, server, or network device, etc.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
  • the computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another computer-readable storage medium.
  • the computer instructions may be transmitted from a website, computer, server, or data center. Transmission to another website site, computer, server or data center via wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.).
  • wired such as coaxial cable, optical fiber, digital subscriber line (DSL)
  • wireless such as infrared, wireless, microwave, etc.
  • the computer-readable storage medium may be any available medium that can be stored by a computer or a data storage device such as a server or a data center integrated with one or more available media.
  • the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, and a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

一种音频编解码方法和音频编解码设备,该方法包括:获取音频信号的当前帧,当前帧包括:高频带信号和低频带信号(201);根据高频带信号和低频带信号得到当前帧的兼容层编码参数(202);根据高频带信号得到当前帧的增强层编码参数(203);对兼容层编码参数和增强层编码参数进行码流复用,以得到编码码流(204)。该方法能够实现新的编解码设备与旧的编解码设备的兼容,且能够提高音频信号的编解码效率。

Description

一种音频编解码方法和音频编解码设备
本申请要求于2020年1月10日提交中国专利局、申请号为202010028452.6、发明名称为“一种音频编解码方法和音频编解码设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及音频信号编解码技术领域,尤其涉及一种音频编解码方法和音频编解码设备。
背景技术
用户对音频服务的需求越来越高,这就会要求不断更新音频编解码设备。而在满足用户对新的音频服务的需求的同时,也要保证完全兼容旧的音频编解码设备,使得旧的音频编解码设备仍然能够提供音频服务。这其中一个比较关键的环节就是新的音频编解码设备能够兼容旧的音频编解码设备。
为使得新的编解码设备能够兼容旧的音频编解码设备,目前需要在旧的音频编解码设备中部署转码模块,通过该转码模块可实现旧的音频编解码设备与新的音频编解码设备的互通。但是在旧的音频编解码设备中新增转码模块,会增加对旧的音频编解码设备进行改造的成本,同时也增加了编解码设备的设备复杂度和能耗,降低了音频信号的编解码效率。
发明内容
本申请实施例提供了一种音频编解码方法和音频编解码设备,用于实现新的编解码设备与旧的编解码设备的兼容,且能够提高音频信号的编解码效率。
为解决上述技术问题,本申请实施例提供以下技术方案:
第一方面,本申请实施例提供一种音频编码方法,所述方法包括:获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;根据所述高频带信号和所述低频带信号得到所述当前帧的兼容层编码参数;根据所述高频带信号得到所述当前帧的增强层编码参数;对所述兼容层编码参数和所述增强层编码参数进行码流复用,以得到编码码流。本申请实施例中在兼容层中可以编码音频信号的全部频域范围,而在增强层中只编码音频信号的高频频域范围。兼容层可以使用旧的音频编码设备来实现,而增强层和兼容层可以使用新的音频编码设备来实现,因此在本申请实施例中,实现新的音频编码设备与旧的音频编码设备的兼容,根据音频编码设备自身的设备类型,可以选择只在兼容层进行编码,或者同时在兼容层和增强层进行编码,本申请实施例不需要针对旧的音频编码设备新增转码模块,因此省去了音频编码设备的升级成本,且能够提高音频信号的编码效率。
在一种可能的实现方式中,所述根据所述高频带信号得到所述当前帧的增强层编码参数,包括:获取所述当前帧的高频带信号的信号类型信息;当所述当前帧的高频带信号的信号类型信息指示预设信号类型时,对所述当前帧的高频带信号进行编码,以得到所述当前帧的增强层编码参数。在该方案中,获取当前帧的高频带信号的信号类型信息,该信号 类型信息根据所划分的信号类型可以包括多种信号分类结果。当该当前帧的高频带信号的信号类型信息指示预设信号类型时,对当前帧的高频带信号进行编码,以得到当前帧的增强层编码参数。例如可以将音频信号划分为N种预设信号类型,增强层中可以设置有N种编码模式,针对每种预设信号类型可以执行一种相应的增强层编码模式,因此实现针对不同信号类型采用相应的增强层编码模式,从而提高音频信号的编码效率。
在一种可能的实现方式中,所述预设信号类型包括如下至少一种:谐波信号类型,音调信号类型,类白噪声信号类型,瞬态信号类型,或摩擦音信号类型。在该方案中,当前帧的高频带信号的预设信号类型可以有多种,例如当前帧的高频带信号的信号类型可以是谐波信号类型,即当前帧的高频带信号是谐波信号,因此可以在增强层中采用增强层编码模式1对谐波信号进行编码。若当前帧的高频带信号的信号类型可以是音调信号类型,即当前帧的高频带信号中包含音调成分,因此可以在增强层中采用增强层编码模式2对音调信号进行编码。若当前帧的高频带信号的信号类型可以是类白噪声信号类型,即当前帧的高频带信号中包括类白噪声信号,因此可以在增强层中采用增强层编码模式3对类白噪声信号进行编码。若当前帧的高频带信号的信号类型可以是瞬态信号类型,即当前帧的高频带信号中包括瞬态信号,因此可以在增强层中采用增强层编码模式4对瞬态信号进行编码。若当前帧的高频带信号的信号类型可以是摩擦音信号类型,即当前帧的高频带信号中包括摩擦音信号,因此可以在增强层中采用增强层编码模式5对摩擦音信号进行编码。本申请实施例中,针对上述每种预设信号类型可以执行一种相应的增强层编码模式,因此实现针对不同信号类型采用相应的增强层编码模式,从而提高音频信号的编码效率。
在一种可能的实现方式中,所述当前帧的增强层编码参数还包括:所述当前帧的高频带信号的信号类型信息。在该方案中,增强层中对当前帧的高频带信号进行编码后生成的增强层编码参数还包括当前帧的高频带信号的信号类型信息,因此在码流复用时,生成的编码码流中可以携带当前帧的高频带信号的信号类型信息,以使得在解码组件中同样可以使用信号类型信息在增强层中按照不同的预设信号类型进行解码,从而可以将增强层信号用于对兼容层处理的部分频谱进行处理,达到对最终输出信号性能提升的目的。
在一种可能的实现方式中,所述根据所述高频带信号得到所述当前帧的增强层编码参数,包括:获取兼容层编码频带信息;根据所述兼容层编码频带信息确定所述当前帧的高频带信号中的待编码频带信号;对所述待编码频带信号进行编码,以得到所述增强层编码参数。在该方案中,兼容层编码频带信息指示在兼容层中编码的音频信号的频带信息,即通过该兼容层编码频带信息可以确定出在兼容层中对哪个或哪些频带进行了兼容层编码。根据兼容层编码频带信息确定当前帧的高频带信号中的待编码频带信号,通过兼容层编码频带信息可以确定出在增强层中需要进行编码的高频带信号,最后对需要在增强层中编码的待编码频带信号进行编码,以得到增强层编码参数。本申请实施例中,兼容层输出的兼容层编码频带信息可以用于指导增强层在编码端的编码处理,从而使得增强层中的编码能够与兼容层中的编码相互补充,提高增强层中的音频信号编码效率。
第二方面,本申请实施例提供一种音频解码方法,所述方法包括:获取编码码流;对所述编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和所述当前帧的增强层编码参数;根据所述兼容层编码参数得到所述当前帧的兼容层信号,所述兼容层 信号包括:所述当前帧的第一高频带信号和所述当前帧的第一低频带信号;根据所述增强层编码参数得到所述当前帧的增强层信号;根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号;根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号。本申请实施例中在兼容层中可以解码音频信号的全部频域范围,而在增强层中只解码音频信号的高频频域范围。兼容层可以使用旧的音频解码设备来实现,而增强层和兼容层可以使用新的音频解码设备来实现,因此在本申请实施例中,实现新的音频解码设备与旧的音频解码设备的兼容,根据音频解码设备自身的设备类型,可以选择只在兼容层进行解码,或者同时在兼容层和增强层进行解码,本申请实施例不需要针对旧的音频解码设备新增转码模块,因此省去了音频解码设备的升级成本,且能够提高音频信号的解码效率。
在一种可能的实现方式中,所述根据所述增强层编码参数得到所述当前帧的增强层信号,包括:根据所述当前帧的增强层编码参数获取信号类型信息;按照所述信号类型信息指示的预设信号类型对所述当前帧的增强层编码参数进行解码,以得到所述当前帧的增强层信号。在该方案中,编码码流中可以携带音频信号的信号类型信息,解码组件对编码码流进行码流解复用之后,可以得到当前帧的增强层编码参数的信号类型信息。按照信号类型信息指示的预设信号类型对当前帧的增强层编码参数进行解码,以得到当前帧的增强层信号,例如可以将音频信号划分为N种预设信号类型,增强层中可以设置有N种解码模式,针对每种预设信号类型可以执行一种相应的增强层解码模式,因此实现针对不同信号类型采用相应的增强层解码模式,从而提高音频信号的解码效率。本申请实施例中,解码组件使用信号类型信息选择合适的增强层解码处理,从而可以将增强层信号用于对兼容层处理的部分频谱进行处理,达到对最终输出信号性能提升的目的。
在一种可能的实现方式中,所述根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号,包括:根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取兼容层高频带调整参数;使用所述兼容层高频带调整参数对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。在该方案中,可以利用增强层编码参数或增强层信号和兼容层的第一高频带信号获取兼容层高频带调整参数,该兼容层高频带调整参数(后续实施例中可以简称为调整参数)是用于对兼容层信号中的高频部分进行调整的调整参数。例如,该兼容层高频带调整参数可以使用当前帧的增强层信号和当前帧的第一高频带信号来得到,其中,当前帧的增强层信号和当前帧的第一高频带信号都是高频带的音频信号,通过当前帧的增强层信号和当前帧的第一高频带信号可以计算出一个调整参数,通过该调整参数对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号。通过调整参数对第一高频带信号的适配处理,可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在一种可能的实现方式中,所述根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取兼容层高频带调整参数,包括:获取所述当前帧的增强层编码参数或增强层信号对应的包络信息,以及获取所述当前帧的第一高频带信号的包络信 息;根据所述增强层编码参数或增强层信号对应的包络信息和所述第一高频带信号的包络信息获取所述兼容层高频带调整参数。在该方案中,可以从兼容层直接解析获得兼容层输出信息,此输出信息和增强层信号进行联合计算,获得兼容层信号的高频带频谱调整参数,利用此调整参数对兼容层信号的高频带信号进行调整并与增强层的输出信号组合获得最终的输出信号。该调整参数的计算可以有多种实现方式,利用增强层编码参数或增强层信号对应的包络信息和第一高频带信号的包络信息可以计算出调整参数,其中,增强层编码参数对应的包络信息可以是根据增强层编码参数计算出的高频带信号的包络信息,或者增强层信号对应的包络信息可以是增强层信号的幅度大小,第一高频带信号的包络信息可以是兼容层信号中的高频带信号的幅度大小,使用增强层编码参数或增强层信号对应的包络信息和第一高频带信号的包络信息可以计算出兼容层高频带调整参数。
在一种可能的实现方式中,所述根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号,包括:根据预设高频带频谱选择规则从所述当前帧的增强层信号中选择出所述当前帧的增强层高频带频谱信号;对所述增强层高频带频谱信号与所述当前帧的第一高频带信号进行组合处理,以得到所述当前帧的第二高频带信号。在该方案中,可以预先设置高频带频谱选择规则,该高频带频谱选择规则可以用指示从增强层信号中选择高频带频谱信号,例如高频带频谱选择规则规定了所选择的一个或多个频带,或者高频带频谱选择规则指示了从增强层信号中需要选择的频带。根据预设高频带频谱选择规则从当前帧的增强层信号中选择出当前帧的增强层高频带频谱信号,该增强层高频带频谱信号是增强层信号中被选择出来的高频带频谱信号,使用该增强层高频带频谱信号与当前帧的第一高频带信号进行组合处理,以得到当前帧的第二高频带信号。本申请实施例中,通过设置高频带频谱选择规则,可以从增强层信号中选择出部分高频带信号用于与兼容层中的第一高频带信号进行组合,可以在兼容层中生成第二高频带信号,因此本申请实施例可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在一种可能的实现方式中,所述根据预设高频带频谱选择规则从所述当前帧的增强层信号中选择出所述当前帧的增强层高频带频谱信号,包括:获取所述当前帧的第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号;确定所述当前帧的增强层信号中与所述兼容层频带扩展信号对应的信号为所述当前帧的增强层高频带频谱信号。在该方案中,可以确定第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号,其中,兼容层解码信号是解码组件在兼容层中对兼容层编码参数进行解码得到的信号,兼容层频带扩展信号是解码组件在兼容层中通过频带扩展得到的信号,例如将低频带信号扩展至高频带从而可以得到兼容层频带扩展信号。本申请实施例中,解码组件可以根据兼容层频带扩展信号从当前帧的增强层信号中选择出当前帧的增强层高频带频谱信号,即增强层信号中与兼容层中的兼容层解码信号对应的信号没有被选择出,从而使得增强层高频带频谱信号是从增强层信号中选择出的部分频谱信号,使用增强层高频带频谱信号对兼容层信号进行调整后与增强层输出组合后,获得最终的输出信号。可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在一种可能的实现方式中,所述根据所述当前帧的增强层编码参数或增强层信号对所 述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号,包括:使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。在该方案中,适配处理的一种实现方式可以是直接替换,解码组件可以使用当前帧的增强层信号对当前帧的第一高频带信号进行替换,即兼容层中的第一低频带信号保留不变,对于兼容层中的第一高频带信号可以替换为当前帧的增强层信号,该当前帧的增强层信号可以作为适配处理后的第二高频带信号。因此本申请实施例可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在一种可能的实现方式中,所述使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号,包括:根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述增强层高频带调整参数对所述当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号;使用所述适配处理后的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。在该方案中,可以利用增强层信号和兼容层的第一高频带信号获取增强层高频带调整参数,该增强层高频带调整参数(后续实施例中可以简称为调整参数)是用于对增强层信号进行调整的调整参数,该增强层高频带调整参数可以使用当前帧的增强层信号和当前帧的第一高频带信号来得到,其中,当前帧的增强层信号和当前帧的第一高频带信号都是高频带的音频信号,通过当前帧的增强层信号和当前帧的第一高频带信号可以计算出一个调整参数,通过该调整参数对当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号。通过调整参数对当前帧的增强层信号的适配处理,再使用适配处理后的增强层信号对当前帧的第一高频带信号进行替换,可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在一种可能的实现方式中,所述使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号,包括:根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到替换后的第一高频带信号;使用所述增强层高频带调整参数对所述替换后的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。在该方案中,可以利用增强层信号和兼容层的第一高频带信号获取增强层高频带调整参数,该增强层高频带调整参数(后续实施例中可以简称为调整参数)是用于对增强层信号进行调整的调整参数,该增强层高频带调整参数可以使用当前帧的增强层信号和当前帧的第一高频带信号来得到,其中,当前帧的增强层信号和当前帧的第一高频带信号都是高频带的音频信号,通过当前帧的增强层信号和当前帧的第一高频带信号可以计算出一个调整参数,在得到替换后的第一高频带信号之后,通过该调整参数对替换后的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号。通过调整参数对替换后的第一高频带信号进行适配处理,可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在一种可能的实现方式中,所述使用所述当前帧的增强层信号对所述当前帧的第一高 频带信号进行替换,以得到所述当前帧的第二高频带信号,包括:对所述当前帧的增强层信号和所述当前帧的第一高频带信号进行频谱成分对比选择,以从所述当前帧的增强层信号中选择出第一增强层子信号;使用所述第一增强层子信号对所述当前帧的第一高频带信号中与所述第一增强层子信号的频谱相同的信号进行替换,以得到所述当前帧的第二高频带信号。在该方案中,可以比较增强层信号对应的频谱成分与兼容层信号中第一高频带信号对应的频谱成分,在完成频谱成分对比之后,从当前帧的增强层信号中选择出第一增强层子信号,最后再使用选择出的第一增强层子信号对当前帧的第一高频带信号中与第一增强层子信号的频谱相同的信号进行替换,以得到当前帧的第二高频带信号。例如,解码组件进行上述的频谱成分对比选择,根据比较结果将增强层信号中的一部分频谱成分用于和兼容层信号中对应的频谱成分进行替换处理,来得到最终的输出信号中的频谱成分,同时会舍弃掉增强层信号中的另一部分频谱成分,将兼容层信号中替换后的频谱成分与兼容层信号中的其它频谱成分组合后得到最终输出信号的全部频谱成分。
在一种可能的实现方式中,所述根据所述增强层编码参数得到所述当前帧的增强层信号,包括:根据所述增强层编码参数和所述兼容层编码参数确定所述增强层编码参数中的待解码增强层高频信号;对所述增强层编码参数中的待解码增强层高频信号进行解码,以得到所述当前帧的增强层信号。在该方案中,可以获取增强层编码参数和兼容层编码参数,解码组件根据增强层编码参数和兼容层编码参数确定增强层编码参数中需要在增强层中进行解码的高频信号(即待解码增强层高频信号),然后对需要在增强层中进行解码的高频信号进行解码,对于增强层编码参数中没有被确定需要解码的高频信号可以丢弃,因此只需要对待解码增强层高频信号进行解码,而不需要对整个增强层编码参数进行解码,提高增强层中的音频信号解码效率。
在一种可能的实现方式中,所述根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号,包括:获取所述当前帧的兼容层信号中的兼容层解码信号和兼容层频带扩展信号;对所述兼容层频带扩展信号和所述当前帧的增强层信号进行组合处理,以得到所述当前帧的第二高频带信号。在该方案中,可以确定兼容层信号中包括的兼容层解码信号和兼容层频带扩展信号,其中,兼容层解码信号是解码组件在兼容层中对兼容层编码参数进行解码得到的信号,兼容层频带扩展信号是解码组件在兼容层中通过频带扩展得到的信号,例如将低频带信号扩展至高频带从而可以得到兼容层频带扩展信号。本申请实施例中,解码组件可以对兼容层频带扩展信号和当前帧的增强层信号进行组合处理,即第一高频带信号中的兼容层解码信号不用于与增强层信号的组合处理,解码组件只使用兼容层频带扩展信号和当前帧的增强层信号进行组合处理,在得到当前帧的第二高频带信号之后,使用第二高频带信号、增强层信号和第一低频带信号进行组合之后,获得最终的输出信号。可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在一种可能的实现方式中,所述兼容层信号的频谱范围为[0,FL],其中,所述兼容层解码信号的频谱范围为[0,FT],所述兼容层频带扩展信号的频谱范围为[FT,FL];所述增强层信号的频谱范围为[FX,FY];所述音频输出信号的频谱范围为[0,FY];所述FL=FY,所述FX<=FT,所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0, FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,所述FL=FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,所述FL<FY,所述FX<=FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,所述FL<FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到。在该方案中,本实施例,解码组件可以获得兼容层信号中哪些频谱是通过编解码处理获得的,以及哪些频谱是通过频带扩展获得的,在最终的输出信号中包括兼容层信号中编解码处理部分的频谱,而频带扩展部分的频谱可以使用增强层信号和兼容层信号中对应频谱成分组合处理获得的。
在一种可能的实现方式中,所述根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之后,所述方法还包括:对所述当前帧的音频输出信号进行后处理。在该方案中,在得到当前帧的音频输出信号之后,还可以对音频输出信号进行后处理,从而可以取得后处理的增益。
在一种可能的实现方式中,所述根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之前,所述方法还包括:根据所述兼容层信号获取后处理参数;使用所述后处理参数对所述增强层信号进行后处理,以得到完成所述后处理的增强层信号。在该方案中,还可以在得到当前帧的音频输出信号之前,根据兼容层信号获取后处理参数,该后处理参数是指后处理所需要的参数,根据后处理的类型不同需要获取相应的后处理参数,使用后处理参数对增强层信号进行后处理,完成后处理后,可以对完成后处理的增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号进行组合处理,之后得到音频输出信号。本申请实施例中可以对增强层信号进行后处理,从而可以取得后处理的增益。
第三方面,本申请实施例还提供一种音频编码设备,所述音频编码设备,包括至少一个处理器,所述至少一个处理器用于与存储器耦合,读取并执行所述存储器中的指令,以实现如前述第一方面中任一项所述的方法。
在一种可能的实现方式中,所述音频编码设备还包括:所述存储器。
第四方面,本申请实施例还提供一种音频解码设备,所述音频解码设备,包括至少一个处理器,所述至少一个处理器用于与存储器耦合,读取并执行所述存储器中的指令,以实现如前述第二方面中任一项所述的方法。
在一种可能的实现方式中,所述音频解码设备还包括:所述存储器。
第五方面,本申请实施例还提供一种音频编码设备,所述音频编码设备包括:兼容层编码器、增强层编码器和码流复用器,其中,所述兼容层编码器,用于获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;根据所述高频带信号和所述低频带信号得到所述当前帧的兼容层编码参数;所述增强层编码器,用于获取音频信号的当前帧, 所述当前帧包括:高频带信号和低频带信号;根据所述高频带信号得到所述当前帧的增强层编码参数;所述码流复用器,用于对所述兼容层编码参数和所述增强层编码参数进行码流复用,以得到编码码流。
在本申请的一些实施例中,增强层编码器,用于获取所述当前帧的高频带信号的信号类型信息;当所述当前帧的高频带信号的信号类型信息指示预设信号类型时,对所述当前帧的高频带信号进行编码,以得到所述当前帧的增强层编码参数。
在本申请的一些实施例中,所述预设信号类型包括如下至少一种:谐波信号类型,音调信号类型,类白噪声信号类型,瞬态信号类型,或摩擦音信号类型。
在本申请的一些实施例中,所述当前帧的增强层编码参数还包括:所述当前帧的高频带信号的信号类型信息。
在本申请的一些实施例中,增强层编码器,用于获取兼容层编码频带信息;根据所述兼容层编码频带信息确定所述当前帧的高频带信号中的待编码频带信号;对所述待编码频带信号进行编码,以得到所述增强层编码参数。
在本申请的第五方面中,音频编码设备的组成部分还可以执行前述第一方面以及各种可能的实现方式中所描述的步骤,详见前述对第一方面以及各种可能的实现方式中的说明。
第六方面,本申请实施例还提供一种音频解码设备,所述音频解码设备包括:码流解复用器、兼容层解码器、增强层解码器、适配处理器和组合器,其中,所述码流解复用器,用于获取编码码流;对所述编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和所述当前帧的增强层编码参数;所述兼容层解码器,用于根据所述兼容层编码参数得到所述当前帧的兼容层信号,所述兼容层信号包括:所述当前帧的第一高频带信号和所述当前帧的第一低频带信号;所述增强层解码器,用于根据所述增强层编码参数得到所述当前帧的增强层信号;所述适配处理器,用于根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号;所述组合器,用于根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号。
在本申请的一些实施例中,增强层解码器,用于根据所述当前帧的增强层编码参数获取信号类型信息;按照所述信号类型信息指示的预设信号类型对所述当前帧的增强层编码参数进行解码,以得到所述当前帧的增强层信号。
在本申请的一些实施例中,适配处理器,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取兼容层高频带调整参数;使用所述兼容层高频带调整参数对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配处理器,用于获取所述当前帧的增强层编码参数或增强层信号对应的包络信息,以及获取所述当前帧的第一高频带信号的包络信息;根据所述增强层编码参数或增强层信号对应的包络信息和所述第一高频带信号的包络信息获取所述兼容层高频带调整参数。
在本申请的一些实施例中,适配处理器,用于根据预设高频带频谱选择规则从所述当前帧的增强层信号中选择出所述当前帧的增强层高频带频谱信号;对所述增强层高频带频 谱信号与所述当前帧的第一高频带信号进行组合处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配处理器,用于获取所述当前帧的第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号;确定所述当前帧的增强层信号中与所述兼容层频带扩展信号对应的信号为所述当前帧的增强层高频带频谱信号。
在本申请的一些实施例中,适配处理器,用于使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配处理器,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述增强层高频带调整参数对所述当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号;使用所述适配处理后的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配处理器,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到替换后的第一高频带信号;使用所述增强层高频带调整参数对所述替换后的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配处理器,用于对所述当前帧的增强层信号和所述当前帧的第一高频带信号进行频谱成分对比选择,以从所述当前帧的增强层信号中选择出第一增强层子信号;使用所述第一增强层子信号对所述当前帧的第一高频带信号中与所述第一增强层子信号的频谱相同的信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,增强层解码器,用于根据所述增强层编码参数和所述兼容层编码参数确定所述增强层编码参数中的待解码增强层高频信号;对所述增强层编码参数中的待解码增强层高频信号进行解码,以得到所述当前帧的增强层信号。
在本申请的一些实施例中,适配处理器,用于获取所述当前帧的兼容层信号中的兼容层解码信号和兼容层频带扩展信号;对所述兼容层频带扩展信号和所述当前帧的增强层信号进行组合处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,所述兼容层信号的频谱范围为[0,FL],其中,所述兼容层解码信号的频谱范围为[0,FT],所述兼容层频带扩展信号的频谱范围为[FT,FL];所述增强层信号的频谱范围为[FX,FY];所述音频输出信号的频谱范围为[0,FY];
所述FL=FY,所述FX<=FT,所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL=FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL<FY,所述FX<=FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围 为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL<FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到。
在本申请的一些实施例中,适配处理器,还用于组合器根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之后,对所述当前帧的音频输出信号进行后处理。
在本申请的一些实施例中,适配处理器,还用于组合器根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之前,根据所述兼容层信号获取后处理参数;使用所述后处理参数对所述增强层信号进行后处理,以得到完成所述后处理的增强层信号。
在本申请的第六方面中,音频解码设备的组成部分还可以执行前述第二方面以及各种可能的实现方式中所描述的步骤,详见前述对第二方面以及各种可能的实现方式中的说明。
第七方面,本申请实施例还提供一种音频编码设备,可以包括:获取模块,用于获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;兼容层编码模块,用于根据所述高频带信号和所述低频带信号得到所述当前帧的兼容层编码参数;增强层编码模块,用于根据所述高频带信号得到所述当前帧的增强层编码参数;复用模块,用于对所述兼容层编码参数和所述增强层编码参数进行码流复用,以得到编码码流。
在本申请的一些实施例中,增强层编码模块,用于获取所述当前帧的高频带信号的信号类型信息;当所述当前帧的高频带信号的信号类型信息指示预设信号类型时,对所述当前帧的高频带信号进行编码,以得到所述当前帧的增强层编码参数。
在本申请的一些实施例中,所述预设信号类型包括如下至少一种:谐波信号类型,音调信号类型,类白噪声信号类型,瞬态信号类型,或摩擦音信号类型。
在本申请的一些实施例中,所述当前帧的增强层编码参数还包括:所述当前帧的高频带信号的信号类型信息。
在本申请的一些实施例中,增强层编码模块,用于获取兼容层编码频带信息;根据所述兼容层编码频带信息确定所述当前帧的高频带信号中的待编码频带信号;对所述待编码频带信号进行编码,以得到所述增强层编码参数。
第八方面,本申请实施例还提供一种音频解码设备,可以包括:获取模块,用于获取编码码流;解复用模块,用于对所述编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和所述当前帧的增强层编码参数;兼容层解码模块,用于根据所述兼容层编码参数得到所述当前帧的兼容层信号,所述兼容层信号包括:所述当前帧的第一高频带信号和所述当前帧的第一低频带信号;增强层解码模块,用于根据所述增强层编码参数得到所述当前帧的增强层信号;适配模块,用于根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号;组合模块,用于根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号。
在本申请的一些实施例中,增强层解码模块,用于根据所述当前帧的增强层编码参数 获取信号类型信息;按照所述信号类型信息指示的预设信号类型对所述当前帧的增强层编码参数进行解码,以得到所述当前帧的增强层信号。
在本申请的一些实施例中,适配模块,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取兼容层高频带调整参数;使用所述兼容层高频带调整参数对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于获取所述当前帧的增强层编码参数或增强层信号对应的包络信息,以及获取所述当前帧的第一高频带信号的包络信息;根据所述增强层编码参数或增强层信号对应的包络信息和所述第一高频带信号的包络信息获取所述兼容层高频带调整参数。
在本申请的一些实施例中,适配模块,用于根据预设高频带频谱选择规则从所述当前帧的增强层信号中选择出所述当前帧的增强层高频带频谱信号;对所述增强层高频带频谱信号与所述当前帧的第一高频带信号进行组合处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于获取所述当前帧的第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号;确定所述当前帧的增强层信号中与所述兼容层频带扩展信号对应的信号为所述当前帧的增强层高频带频谱信号。
在本申请的一些实施例中,适配模块,用于使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述增强层高频带调整参数对所述当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号;使用所述适配处理后的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到替换后的第一高频带信号;使用所述增强层高频带调整参数对所述替换后的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于对所述当前帧的增强层信号和所述当前帧的第一高频带信号进行频谱成分对比选择,以从所述当前帧的增强层信号中选择出第一增强层子信号;使用所述第一增强层子信号对所述当前帧的第一高频带信号中与所述第一增强层子信号的频谱相同的信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,增强层解码模块,用于根据所述增强层编码参数和所述兼容层编码参数确定所述增强层编码参数中的待解码增强层高频信号;对所述增强层编码参数中的待解码增强层高频信号进行解码,以得到所述当前帧的增强层信号。
在本申请的一些实施例中,适配模块,用于获取所述当前帧的兼容层信号中的兼容层解码信号和兼容层频带扩展信号;对所述兼容层频带扩展信号和所述当前帧的增强层信号进行组合处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,所述兼容层信号的频谱范围为[0,FL],其中,所述兼容层解码信号的频谱范围为[0,FT],所述兼容层频带扩展信号的频谱范围为[FT,FL];所述增强层信号的频谱范围为[FX,FY];所述音频输出信号的频谱范围为[0,FY];
所述FL=FY,所述FX<=FT,所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL=FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL<FY,所述FX<=FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL<FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到。
在本申请的一些实施例中,音频解码设备1000,还可以包括:后处理模块,用于组合模块根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之后,对所述当前帧的音频输出信号进行后处理。
在本申请的一些实施例中,音频解码设备还可以包括:后处理模块,用于组合模块根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之前,根据所述兼容层信号获取后处理参数;使用所述后处理参数对所述增强层信号进行后处理,以得到完成所述后处理的增强层信号。
第九方面,本申请实施例提供了一种计算机可读存储介质,所述计算机可读存储介质中存储有指令,当其在计算机上运行时,使得计算机执行上述第一方面或第二方面所述的方法。
第十方面,本申请实施例提供了一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述第一方面或第二方面所述的方法。
第十一方面,本申请实施例提供一种通信装置,该通信装置可以包括音频编解码设备或者芯片等实体,所述通信装置包括:处理器,可选的,还包括存储器;所述存储器用于存储指令;所述处理器用于执行所述存储器中的所述指令,使得所述通信装置执行如前述第一方面或第二方面中任一项所述的方法。
第十二方面,本申请提供了一种芯片系统,该芯片系统包括处理器,用于支持音频编解码设备实现上述方面中所涉及的功能,例如,发送或处理上述方法中所涉及的数据和/或信息。在一种可能的设计中,所述芯片系统还包括存储器,所述存储器,用于保存音频编解码设备必要的程序指令和数据。该芯片系统,可以由芯片构成,也可以包括芯片和其他分立器件。
附图说明
图1为本申请实施例提供的一种音频编解码系统的结构示意图;
图2为本申请实施例提供的一种音频编码方法的示意性流程图;
图3为本申请实施例提供的一种音频解码方法的示意性流程图;
图4为本申请实施例的移动终端的示意图;
图5为本申请实施例的网元的示意图;
图6为本申请一个实施例的音频编码方法的示意性流程图;
图7a为本申请实施例提供的原始信号频谱示意图;
图7b为本申请实施例提供的兼容层编码信号频谱示意图;
图7c为本申请实施例提供的增强层编码信号频谱示意图;
图7d为本申请实施例提供的音频输出信号频谱示意图;
图8为本申请实施例提供的增强层编码参数和兼容层编码参数进行组合后的输出频谱示意图;
图9为本申请实施例提供的一种音频编码设备的组成结构示意图;
图10为本申请实施例提供的一种音频解码设备的组成结构示意图;
图11为本申请实施例提供的另一种音频编码设备的组成结构示意图;
图12为本申请实施例提供的另一种音频解码设备的组成结构示意图;
图13为本申请实施例提供的另一种音频编码设备的组成结构示意图;
图14为本申请实施例提供的另一种音频解码设备的组成结构示意图。
具体实施方式
本申请实施例提供了一种音频编解码方法和音频编解码设备,用于实现新的编解码设备与旧的编解码设备的兼容,且能够提高音频信号的编解码效率。
下面结合附图,对本申请的实施例进行描述。
本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的术语在适当情况下可以互换,这仅仅是描述本申请的实施例中对相同属性的对象在描述时所采用的区分方式。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,以便包含一系列单元的过程、方法、系统、产品或设备不必限于那些单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它单元。
本申请实施例中的音频信号是指音频编码设备中的输入信号,该音频信号中可以包括多个帧,例如当前帧可以特指音频信号中的某一个帧,本申请实施例中以当前帧音频信号的编解码进行示例说明,音频信号中当前帧的前一帧或者后一帧都可以根据该当前帧音频信号的编解码方式进行相应的编解码,对于音频信号中当前帧的前一帧或者后一帧的编解码过程不再逐一说明。另外,本申请实施例中的音频信号可以是单声道音频信号,或者,也可以为立体声信号。其中,立体声信号可以是原始的立体声信号,也可以是多声道信号中包括的两路信号(左声道信号和右声道信号)组成的立体声信号,还可以是由多声道信号中包含的至少三路信号产生的两路信号组成的立体声信号,本申请实施例中对此并不限 定。
图1为本申请一个示例性实施例的音频编解码系统的结构示意图。该音频编解码系统包括编码组件110和解码组件120。
本申请实施例中音频编解码系统可以包括兼容层和增强层,例如在音频编解码系统中可以针对兼容层设置编码组件和解码组件,针对增强层设置编码组件和解码组件,其中,兼容层和增强层是指根据处理音频信号的频谱范围而划分的两个层,具体的,在兼容层中可以处理音频信号的全部频域范围,而在增强层中只处理音频信号的高频频域范围。兼容层可以使用旧的编解码组件来实现,而增强层和兼容层可以使用新的编解码组件来实现,因此在本申请实施例提供的音频编解码系统中,实现新的编解码组件与旧的编解码组件的兼容,根据编解码组件自身的设备类型,可以选择只在兼容层进行编解码,或者同时在兼容层和增强层进行编解码,此处不做限定。
举例说明如下,本申请实施例中,新的编解码组件要与旧的新的编解码组件完全后向兼容,即音频编解码的兼容层信号包含输入信号的所有频谱成分。在本申请实施例提供的音频编解码系统中包含一个兼容层和一个增强层。兼容层能够完整的实现音频编解码功能,且生成的码流与旧的编解码系统完全兼容。兼容层的输入是输入到音频编解码系统中的原始音频信号,兼容层对输入信号的所有频谱成分进行编解码。增强层能够对输入音频信号的部分频谱(例如高频频域范围)进行编解码。解码端根据增强层的信息决定将兼容层输出的解码音频信号作为最终的解码输出信号,还是将增强层解码输出信号与兼容层解码输出信号先进行组合,再作为最终的解码输出信号。
编码组件110用于对当前帧(音频信号)在频域或时域上进行编码。可选地,编码组件110可以通过软件实现;或者,也可以通过硬件实现;或者,还可以通过软硬件结合的形式实现,本申请实施例中对此不作限定。
编码组件110对当前帧在频域或时域上进行编码时,在一种可能的实现方式中,可以包括如图2所示的步骤。
201、获取音频信号的当前帧,当前帧包括:高频带信号和低频带信号。
其中,当前帧可以是音频信号中的任意一个帧,在当前帧中可以包括高频带信号和低频带信号,其中,高频带信号和低频带信号的划分可以通过频带阈值确定,高于该频带阈值的信号为高频带信号,低于该频带阈值的信号为低频带信号,对于频带阈值的确定可以根据传输带宽、编码组件110和解码组件120的数据处理能力来确定,此处不做限定。
202、根据高频带信号和低频带信号得到当前帧的兼容层编码参数。
在本申请实施例中,高频带信号和低频带信号的编码可以在兼容层实现,以当前帧的高频带信号和低频带信号的编码为例,可以得到当前帧的兼容层编码参数。其中,兼容层编码参数是指在兼容层对音频信号的全部频带信号进行编码得到的编码参数。
203、根据高频带信号得到当前帧的增强层编码参数。
在本申请实施例中,高频带信号的编码可以在增强层实现,以当前帧的高频带信号的编码为例,可以得到当前帧的增强层编码参数。其中,增强层编码参数是指在增强层对音频信号的高频带信号进行编码得到的编码参数。
在本申请的一些实施例中,步骤203根据高频带信号得到当前帧的增强层编码参数, 包括:
获取当前帧的高频带信号的信号类型信息;
当该当前帧的高频带信号的信号类型信息指示预设信号类型时,对当前帧的高频带信号进行编码,以得到当前帧的增强层编码参数。
其中,在编码组件110中可以设置有信号分类器,通过该信号分类器可以实现对输入编码组件110的音频信号的分类,首先获取当前帧的高频带信号的信号类型信息,该信号类型信息根据所划分的信号类型可以包括多种信号分类结果。当该当前帧的高频带信号的信号类型信息指示预设信号类型时,对当前帧的高频带信号进行编码,以得到当前帧的增强层编码参数。例如可以将音频信号划分为N种预设信号类型,增强层中可以设置有N种编码模式,针对每种预设信号类型可以执行一种相应的增强层编码模式,因此实现针对不同信号类型采用相应的增强层编码模式,从而提高音频信号的编码效率。
举例说明如下,本申请实施例中编码组件中设置有信号分类器,该信号分类器可用于检测特定类型的音频信号。当检测到该类信号时,在增强层中对高频带信号进行编码,否则不编码。在增强层中编码之后,将信号分类结果用于步骤204中的码流复用,同时如果特定类型音频信号被检测到,则也将高频带信号编码参数用于步骤204中的码流复用,否则不进行码流复用。本申请实施例中,编码组件使用信号分类结果选择合适的增强层编码处理,以使得在解码端中同样可以使用信号分类结果在增强层中按照不同的预设信号类型进行解码,从而可以将增强层信号用于对兼容层处理的部分频谱进行处理,达到对最终输出信号性能提升的目的。
在本申请的一些实施例中,预设信号类型包括如下至少一种:谐波信号类型,音调信号类型,类白噪声信号类型,瞬态信号类型,或摩擦音信号类型。
其中,当前帧的高频带信号的预设信号类型可以有多种,例如当前帧的高频带信号的信号类型可以是谐波信号类型,即当前帧的高频带信号是谐波信号,因此可以在增强层中采用增强层编码模式1对谐波信号进行编码。若当前帧的高频带信号的信号类型可以是音调信号类型,即当前帧的高频带信号中包含音调成分,因此可以在增强层中采用增强层编码模式2对音调信号进行编码。若当前帧的高频带信号的信号类型可以是类白噪声信号类型,即当前帧的高频带信号中包括类白噪声信号,因此可以在增强层中采用增强层编码模式3对类白噪声信号进行编码。若当前帧的高频带信号的信号类型可以是瞬态信号类型,即当前帧的高频带信号中包括瞬态信号,因此可以在增强层中采用增强层编码模式4对瞬态信号进行编码。若当前帧的高频带信号的信号类型可以是摩擦音信号类型,即当前帧的高频带信号中包括摩擦音信号,因此可以在增强层中采用增强层编码模式5对摩擦音信号进行编码。本申请实施例中,针对上述每种预设信号类型可以执行一种相应的增强层编码模式,因此实现针对不同信号类型采用相应的增强层编码模式,从而提高音频信号的编码效率。
可以理解的是,本申请实施例中,若当前帧的高频带信号不是上述预设信号类型,此处可以不在增强层中对该高频带信号进行编码。
在本申请的一些实施例中,当前帧的增强层编码参数还包括:当前帧的高频带信号的信号类型信息。
其中,编码组件110可以对音频信号按照预设信号类型识别出当前帧的高频带信号,编码组件110可以生成当前帧的高频带信号的信号类型信息,在增强层中对当前帧的高频带信号进行编码后生成的增强层编码参数还包括当前帧的高频带信号的信号类型信息,因此在码流复用时,生成的编码码流中可以携带当前帧的高频带信号的信号类型信息,以使得在解码组件中同样可以使用信号类型信息在增强层中按照不同的预设信号类型进行解码,从而可以将增强层信号用于对兼容层处理的部分频谱进行处理,达到对最终输出信号性能提升的目的。
在本申请的一些实施例中,步骤203根据高频带信号得到当前帧的增强层编码参数,包括:
获取兼容层编码频带信息;
根据兼容层编码频带信息确定当前帧的高频带信号中的待编码频带信号;
对待编码频带信号进行编码,以得到增强层编码参数。
其中,编码组件110还可以获取兼容层编码频带信息,兼容层编码频带信息指示在兼容层中编码的音频信号的频带信息,即通过该兼容层编码频带信息可以确定出在兼容层中对哪个或哪些频带进行了兼容层编码。根据兼容层编码频带信息确定当前帧的高频带信号中的待编码频带信号,通过兼容层编码频带信息可以确定出在增强层中需要进行编码的高频带信号,最后对需要在增强层中编码的待编码频带信号进行编码,以得到增强层编码参数。本申请实施例中,兼容层输出的兼容层编码频带信息可以用于指导增强层在编码端的编码处理,从而使得增强层中的编码能够与兼容层中的编码相互补充,提高增强层中的音频信号编码效率。
举例说明如下,在增强层中,根据增强层的信号分类信息和兼容层编码频带信息决定对哪些高频带频谱成分进行增强层编码处理,例如信号分类信息指示当前帧的4个频域子带要进行增强层编码处理,但是兼容层输出的编码频带信息指示4个频域子带中有1个频域子带在兼容层编码中进行编码处理,所以增强层可以其余3个频域子带进行增强层编码处理,而对兼容层中已经编码的1个频域子带不再进行增强层的频域编码,从而减少增强层中需要编码的频域子带个数,提高增强层中的音频信号编码效率。
204、对兼容层编码参数和增强层编码参数进行码流复用,以得到编码码流。
在本申请实施例中,兼容层编码和增强层编码分别完成之后,可以进行码流复用,从可以将兼容层编码参数和增强层编码参数复用到一个编码码流中,即该编码码流可以包括兼容层编码参数和增强层编码参数。
205、向解码组件发送编码码流。
在本申请实施例中,编码组件110在完成编码之后,可以生成编码码流,编码组件110可以向解码组件120发送编码码流,从而使得解码组件120可以接收到该编码码流,再由解码组件120从编码码流中得到音频输出信号。
需要说明的是,图2中所示的编码方法仅为示例而非限定,本申请实施例对图2中各步骤的执行顺序并不限定,图2中所示的编码方法也可以包括更多或更少的步骤,本申请实施例中对此并不限定。
通过前述实施例对本申请中编码方法的举例说明可知,获取音频信号的当前帧,当前 帧包括:高频带信号和低频带信号;根据高频带信号和低频带信号得到当前帧的兼容层编码参数;根据高频带信号得到当前帧的增强层编码参数;对兼容层编码参数和增强层编码参数进行码流复用,以得到编码码流。本申请实施例中在兼容层中可以编码音频信号的全部频域范围,而在增强层中只编码音频信号的高频频域范围。兼容层可以使用旧的音频编码设备来实现,而增强层和兼容层可以使用新的音频编码设备来实现,因此在本申请实施例中,实现新的音频编码设备与旧的音频编码设备的兼容,根据音频编码设备自身的设备类型,可以选择只在兼容层进行编码,或者同时在兼容层和增强层进行编码,本申请实施例不需要针对旧的音频编码设备新增转码模块,因此省去了音频编码设备的升级成本,且能够提高音频信号的编码效率。
可选地,编码组件110与解码组件120可以通过有线或无线的方式相连,解码组件120可以通过其与编码组件110之间的连接获取编码组件110生成的编码码流;或者,编码组件110可以将生成的编码码流存储至存储器,解码组件120读取存储器中的编码码流。
可选地,解码组件120可以通过软件实现;或者,也可以通过硬件实现;或者,还可以通过软硬件结合的形式实现,本申请实施例中对此不作限定。
解码组件120对当前帧(音频信号)在频域或时域上进行解码时,在一种可能的实现方式中,可以包括如图3所示的步骤。
301、获取编码码流。
其中,编码码流由编码组件110发送给解码组件120。编码码流可以包括兼容层编码参数和增强层编码参数。
302、对编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和当前帧的增强层编码参数。
在本申请实施例中,解码组件120在获取到编码码流之后,针对编码码流中音频信号的当前帧进行码流解复用,从而得到当前帧的兼容层编码参数和当前帧的增强层编码参数。
303、根据兼容层编码参数得到当前帧的兼容层信号,兼容层信号包括:当前帧的第一高频带信号和当前帧的第一低频带信号。
在本申请实施例中,兼容层编码参数可以在兼容层中进行解码,得到当前帧的兼容层信号,由前述对兼容层的说明,在兼容层中针对音频信号的全部频域范围进行解码,因此得到的兼容层信号包括:当前帧的第一高频带信号和当前帧的第一低频带信号,即在兼容层中解码出了第一高频带信号和第一低频带信号。
304、根据增强层编码参数得到当前帧的增强层信号。
在本申请实施例中,增强层编码参数可以在增强层中进行解码,得到当前帧的增强层信号,由前述对增强层的说明,在增强层中针对音频信号的高频范围进行解码,因此得到的增强层信号包括:当前帧的高频带信号,即在增强层中解码出了高频带信号。
需要说明的是,若解码组件120为旧的解码组件,则只需要执行步骤303就可以得到音频信号的全部频域信号,若解码组件120为新的解码组件,则需执行步骤303和步骤304,可以分别得到兼容层信号和增强层信号。
在本申请的一些实施例中,根据增强层编码参数得到当前帧的增强层信号,包括:
根据当前帧的增强层编码参数获取信号类型信息;
按照信号类型信息指示的预设信号类型对当前帧的增强层编码参数进行解码,以得到当前帧的增强层信号。
其中,编码码流中可以携带音频信号的信号类型信息,解码组件对编码码流进行码流解复用之后,可以得到当前帧的增强层编码参数的信号类型信息。按照信号类型信息指示的预设信号类型对当前帧的增强层编码参数进行解码,以得到当前帧的增强层信号,例如可以将音频信号划分为N种预设信号类型,增强层中可以设置有N种解码模式,针对每种预设信号类型可以执行一种相应的增强层解码模式,因此实现针对不同信号类型采用相应的增强层解码模式,从而提高音频信号的解码效率。本申请实施例中,解码组件使用信号类型信息选择合适的增强层解码处理,从而可以将增强层信号用于对兼容层处理的部分频谱进行处理,达到对最终输出信号性能提升的目的。
在本申请的一些实施例中,步骤304根据增强层编码参数得到当前帧的增强层信号,包括:
根据增强层编码参数和兼容层编码参数确定增强层编码参数中的待解码增强层高频信号;
对增强层编码参数中的待解码增强层高频信号进行解码,以得到当前帧的增强层信号。
其中,解码组件可以获取增强层编码参数和兼容层编码参数,解码组件根据增强层编码参数和兼容层编码参数确定增强层编码参数中需要在增强层中进行解码的高频信号(即待解码增强层高频信号),然后对需要在增强层中进行解码的高频信号进行解码,对于增强层编码参数中没有被确定需要解码的高频信号可以丢弃,因此只需要对待解码增强层高频信号进行解码,而不需要对整个增强层编码参数进行解码,提高增强层中的音频信号解码效率。
本申请实施例中,通过增强层编码参数和兼容层编码参数可以确定出在增强层中对哪个或哪些频带进行增强层解码。本申请实施例中,增强层编码参数和兼容层编码参数可以用于指导增强层在解码端的解码处理,从而使得增强层中的解码能够与兼容层中的解码相互补充,提高增强层中的音频信号编码效率。
举例说明如下,在增强层中,根据增强层编码参数和兼容层编码参数确定增强层编码参数中的待解码增强层高频信号,即可以决定对哪些高频带频谱成分进行增强层解码处理。结合前述的增强层编码过程的举例说明可知,信号分类信息指示当前帧的4个频域子带要进行增强层编码处理,但是兼容层输出的编码频带信息指示4个频域子带中有1个频域子带在兼容层编码中进行编码处理,所以增强层可以对其余3个频域子带进行增强层编码处理,而对兼容层中已经编码的1个频域子带不再进行增强层的频域编码,解码端处理过程为,增强层解码输出3个频域子带信号,兼容层解码输出信号中与其对应的3个频域子带信号与增强层信号中3个频域子带信号组合后作为最终输出信号的3个频域子带频谱成分,与其它所有子带信号一起得到最终的输出信号。本申请实施例中,可以减少增强层中需要解码的频域子带个数,提高增强层中的音频信号解码效率。
305、根据当前帧的增强层编码参数或增强层信号对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号。
在本申请实施例中,针对兼容层中的第一高频带信号,可以使用当前帧的增强层编码 参数或者增强层信号进行适配处理,从而实现对兼容层中的第一高频带信号的适配处理,得到了当前帧在兼容层中的第二高频带信号,本申请实施例中当前帧的增强层编码参数或者增强层信号可以用于对兼容层中的第一高频带信号进行适配处理,达到对最终的音频输出信号性能提升的目的。
在本申请实施例中,对当前帧的第一高频带信号进行适配处理可以使用当前帧的增强层信号来实现,其中适配处理是指针对兼容层中的第一高频带信号进行调整,以提高兼容层解码输出的高频带信号的性能。其中,本申请实施例中的适配处理的方式有多种,接下来对适配处理进行详细的举例说明。
适配处理方式一:
在本申请的一些实施例中,步骤305根据当前帧的增强层编码参数或增强层信号对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号,包括:
根据当前帧的增强层编码参数或增强层信号和当前帧的第一高频带信号获取兼容层高频带调整参数;
使用兼容层高频带调整参数对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号。
其中,解码组件120可以利用增强层编码参数或增强层信号和兼容层的第一高频带信号获取兼容层高频带调整参数,该兼容层高频带调整参数(后续实施例中可以简称为调整参数)是用于对兼容层信号中的高频部分进行调整的调整参数。例如,该兼容层高频带调整参数可以使用当前帧的增强层信号和当前帧的第一高频带信号来得到,其中,当前帧的增强层信号和当前帧的第一高频带信号都是高频带的音频信号,通过当前帧的增强层信号和当前帧的第一高频带信号可以计算出一个调整参数,通过该调整参数对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号。通过调整参数对第一高频带信号的适配处理,可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
举例说明如下,通过当前帧的增强层信号和当前帧的第一高频带信号可以获得调整参数,利用此调整参数对兼容层信号的高频带频谱成分进行适配处理,将增强层信号和适配处理后的兼容层信号进行组合后,可以获得最终的输出信号。
在本申请的一些实施例中,根据当前帧的增强层编码参数或增强层信号和当前帧的第一高频带信号获取兼容层高频带调整参数,包括:
获取当前帧的增强层编码参数或增强层信号对应的包络信息,以及获取当前帧的第一高频带信号的包络信息;
根据增强层编码参数或增强层信号对应的包络信息和第一高频带信号的包络信息获取兼容层高频带调整参数。
其中,解码组件可以从兼容层直接解析获得兼容层输出信息,此输出信息和增强层信号进行联合计算,获得兼容层信号的高频带频谱调整参数,利用此调整参数对兼容层信号的高频带信号进行调整并与增强层的输出信号组合获得最终的输出信号。该调整参数的计算可以有多种实现方式,利用增强层编码参数或增强层信号对应的包络信息和第一高频带信号的包络信息可以计算出调整参数,其中,增强层编码参数对应的包络信息可以是根据 增强层编码参数计算出的高频带信号的包络信息,或者增强层信号对应的包络信息可以是增强层信号的幅度大小,第一高频带信号的包络信息可以是兼容层信号中的高频带信号的幅度大小,使用增强层编码参数或增强层信号对应的包络信息和第一高频带信号的包络信息可以计算出兼容层高频带调整参数。其中,计算兼容层高频带调整参数的方式可以有多种。
例如,兼容层解码器输出高频带信号的包络信息为Envelope,增强层输出音调成分的包络信息为EnvTonal,则首先计算调整参数para=(Envelope-EnvTonal)/Envelope,将兼容层信号中的高频带部分与调整参数para相乘,获得调整后的兼容层信号,将增强层信号和调整后的兼容层信号组合后获得最终的输出信号。
由于本实施例中可以从兼容层中直接获得兼容层高频带调整参数,使用兼容层高频带调整参数对兼容层信号进行调整后与增强层输出组合后,获得最终的输出信号。可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
适配处理方式二:
在本申请的一些实施例中,步骤305根据当前帧的增强层编码参数或增强层信号对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号,包括:
根据预设高频带频谱选择规则从当前帧的增强层信号中选择出当前帧的增强层高频带频谱信号;
对增强层高频带频谱信号与当前帧的第一高频带信号进行组合处理,以得到当前帧的第二高频带信号。
其中,解码组件中可以预先设置高频带频谱选择规则,该高频带频谱选择规则可以用指示从增强层信号中选择高频带频谱信号,例如高频带频谱选择规则规定了所选择的一个或多个频带,或者高频带频谱选择规则指示了从增强层信号中需要选择的频带。根据预设高频带频谱选择规则从当前帧的增强层信号中选择出当前帧的增强层高频带频谱信号,该增强层高频带频谱信号是增强层信号中被选择出来的高频带频谱信号,使用该增强层高频带频谱信号与当前帧的第一高频带信号进行组合处理,以得到当前帧的第二高频带信号。本申请实施例中,通过设置高频带频谱选择规则,可以从增强层信号中选择出部分高频带信号用于与兼容层中的第一高频带信号进行组合,可以在兼容层中生成第二高频带信号,因此本申请实施例可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在本申请的一些实施例中,根据预设高频带频谱选择规则从当前帧的增强层信号中选择出当前帧的增强层高频带频谱信号,包括:
获取当前帧的第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号;
确定当前帧的增强层信号中与兼容层频带扩展信号对应的信号为当前帧的增强层高频带频谱信号。
其中,解码组件可以确定第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号,其中,兼容层解码信号是解码组件在兼容层中对兼容层编码参数进行解码得到的信号,兼容层频带扩展信号是解码组件在兼容层中通过频带扩展得到的信号,例如将低频带 信号扩展至高频带从而可以得到兼容层频带扩展信号。本申请实施例中,解码组件可以根据兼容层频带扩展信号从当前帧的增强层信号中选择出当前帧的增强层高频带频谱信号,即增强层信号中与兼容层中的兼容层解码信号对应的信号没有被选择出,从而使得增强层高频带频谱信号是从增强层信号中选择出的部分频谱信号,使用增强层高频带频谱信号对兼容层信号进行调整后与增强层输出组合后,获得最终的输出信号。可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
举例说明如下,本申请实施例中,将增强层信号通过分析兼容层的输出信号进行选择处理后,再与兼容层信号组合后获得最终的输出信号。选择处理的原则可以包括:兼容层信号包括编解码部分和频带扩展部分,增强层信号要与兼容层信号中的频带扩展部分进行组合,获得最终输出信号的高频带部分,如果兼容层信号中与增强层信号中对应频谱成分是编解码得到的,则最终的输出信号高频带部分不选择增强层信号的此部分频谱成分,否则选择增强层信号中的此部分频谱成分与兼容层信号中的此部分频谱进行组合处理,获得最终输出信号的此部分频谱成分。
适配处理方式二和前述的适配处理方式一的区别是,需要选择出增强层信号的一部分成分来与兼容层信号进行组合用于得到最终的输出信号,舍弃掉增强层信号的一部分频谱成分,例如增强层信号的某一频点处有一个音调成分,而恰好兼容层信号中在此频点附近也有一个能量相当的音调成分,此时可以判断兼容层信号中音调成分为直接编解码获得,所以此时舍弃掉增强层此频点处输出的音调成分,直接将兼容层中此频点的音调成分作为最终输出信号此频点处的频谱输出。
基于上述举例说明可知,本实施例通过分析比较增强层信号的频谱成分与兼容层信号对应的频谱成分,结论是增强层信号中一部分频谱成分被舍弃掉,另一部分频谱成分与兼容层信号组合后作为最终的输出信号,即根据增强层信号和兼容层信号,可以得到更优的输出信号。
在本申请的一些实施例中,增强层信号可以为频域信号,兼容层信号可以为时域信号,在组合处理流程中,可以先将兼容层信号转换成频域信号,在频域中对增强层信号的频域系数与兼容层信号的频域系数适配和组合处理后,将频域信号转换成时域信号,从而可以获得最终的输出信号。
适配处理方式三:
在本申请的一些实施例中,步骤305根据当前帧的增强层编码参数或增强层信号对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号,包括:
使用当前帧的增强层信号对当前帧的第一高频带信号进行替换,以得到当前帧的第二高频带信号。
其中,适配处理的一种实现方式可以是直接替换,解码组件可以使用当前帧的增强层信号对当前帧的第一高频带信号进行替换,即兼容层中的第一低频带信号保留不变,对于兼容层中的第一高频带信号可以替换为当前帧的增强层信号,该当前帧的增强层信号可以作为适配处理后的第二高频带信号。因此本申请实施例可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
接下来对适配处理方式三进行举例说明,解码组件将增强层信号替换兼容层信号的部 分频谱成分后,获得最终的输出信号。
适配处理方式三与前述适配处理方式一、二的不同之处在于,适配处理方式三中将增强层信号替换兼容层信号的部分频谱成分。例如兼容层信号为Ylc(n),增强层信号为Yel(n),将兼容层信号Ylc(n)中的高频带频谱HF去掉,将Yel(n)所表示的信号HFe与Ylc(n)中的低频带频谱LF组合组成最终的输出信号Y(n)。
例如,兼容层信号为时域信号Ylc(t),增强层信号为时域信号Yel(t),则首先对时域信号Ylc(t)进行低通滤波处理后,与时域信号Yel(t)叠加后,获得最终的输出信号,即通过如下公式得到输出信号Y(t):Y(t)=LowFilter(Ylc(t))+Yel(t)。例如,兼容层信号为频域信号Ylc(k),增强层信号为频域信号Yel(k),则直接用增强层频域系数Yel(k)替换兼容层频域系数Ylc(k)后,得到最终的频谱系数,将频谱系数转换成时域信号作为最终的输出信号,即通过如下公式得到输出信号Y(t):
Y(k)=Ylc(k),k=0,1,2,….M-V,
Y(k)=Yel(k-M+V-1),k=M-V+1,M-V+2,…,M。
最后,再将Y(k)转换成时域信号Y(t),作为最终的输出信号。
通过利用增强层输出频谱成分替换兼容层信号中的部分频谱成分,获得编解码性能相比较兼容层信号编解码性能质量更优的输出信号。例如,本实施例兼容层完全后向兼容旧的编解码器,本实施例增强层根据信号分类信息对某些类型的信号进行编解码,在解码端根据信号分类信息用增强层的输出信号频谱成分替换掉兼容层的输出信号中的部分频谱成分后,获得最终的输出信号。
进一步的,在本申请的一些实施例中,使用当前帧的增强层信号对当前帧的第一高频带信号进行替换,以得到当前帧的第二高频带信号,包括:
根据当前帧的增强层编码参数或增强层信号和当前帧的第一高频带信号获取增强层高频带调整参数;
使用增强层高频带调整参数对当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号;
使用适配处理后的增强层信号对当前帧的第一高频带信号进行替换,以得到当前帧的第二高频带信号。
其中,解码组件120可以利用增强层信号和兼容层的第一高频带信号获取增强层高频带调整参数,该增强层高频带调整参数(后续实施例中可以简称为调整参数)是用于对增强层信号进行调整的调整参数,该增强层高频带调整参数可以使用当前帧的增强层信号和当前帧的第一高频带信号来得到,其中,当前帧的增强层信号和当前帧的第一高频带信号都是高频带的音频信号,通过当前帧的增强层信号和当前帧的第一高频带信号可以计算出一个调整参数,通过该调整参数对当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号。通过调整参数对当前帧的增强层信号的适配处理,再使用适配处理后的增强层信号对当前帧的第一高频带信号进行替换,可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
在本申请的另一些实施例中,使用当前帧的增强层信号对当前帧的第一高频带信号进行替换,以得到当前帧的第二高频带信号,包括:
根据当前帧的增强层编码参数或增强层信号和当前帧的第一高频带信号获取增强层高频带调整参数;
使用当前帧的增强层信号对当前帧的第一高频带信号进行替换,以得到替换后的第一高频带信号;
使用增强层高频带调整参数对替换后的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号。
其中,解码组件120可以利用增强层信号和兼容层的第一高频带信号获取增强层高频带调整参数,该增强层高频带调整参数(后续实施例中可以简称为调整参数)是用于对增强层信号进行调整的调整参数,该增强层高频带调整参数可以使用当前帧的增强层信号和当前帧的第一高频带信号来得到,其中,当前帧的增强层信号和当前帧的第一高频带信号都是高频带的音频信号,通过当前帧的增强层信号和当前帧的第一高频带信号可以计算出一个调整参数,在得到替换后的第一高频带信号之后,通过该调整参数对替换后的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号。通过调整参数对替换后的第一高频带信号进行适配处理,可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
举例说明如下,将增强层信号做适配处理后替换兼容层信号的部分频谱成分后,与兼容层其它频谱成分组合后,获得最终的输出信号。或者将增强层信号替换兼容层信号的部分频谱成分后再进行适配处理,与兼容层其它频谱成分组合后获得最终的输出信号。
本实施例中,增强层信号的频谱成分在替换兼容层对应的频谱成分之前或者之后需要进行适配处理,具体如下:
如果兼容层信号为时域信号Ylc(t),增强层信号为时域信号Yel(t),则首先对时域信号Ylc(t)进行低通滤波处理并进行适配处理后,与时域信号Yel(t)叠加后获得最终的输出信号,即通过如下公式得到输出信号Y(t):
Y(t)=LowFilter(Ylc(t))+Preprocessing(Yel(t))。
具体的,适配处理(Preprocessing)可以包括多种处理算法,例如假设增强层信号Yel(t)的总能量为EnerEL,兼容层信号对应的高频带频谱成分能量为EnerLC,通过如下方式计算调整参数para=sqrt(EnerLC/EnerEL)。然后使用调整参数para与增强层信号Yel(t)进行相乘,从而获得适配处理后的增强层信号,通过适配处理后的增强层信号与低通处理后的兼容层信号,可以获得最终的输出信号。
又如,兼容层信号为频域信号Ylc(k),其对应的高频带频谱成分能量为EnerLC,增强层信号为频域信号Yel(k),其能量为EnerEL,通过如下方式计算调整参数para=sqrt(EnerLC/EnerEL)。然后用调整参数para与增强层信号Yel(k)进行相乘后,获得适配处理后增强层的频域系数,适配处理后的增强层频域系数与兼容层低频带频域系数组合得到输出信号的频域系数,具体的,通过如下公式得到输出信号Y(t):
para=sqrt(EnerLC/EnerEL),
Y(k)=Ylc(k),k=0,1,2,…,M-V,
Y(k)=para*Yel(k-M+V-1),k=M-V+1,M-V+2,…M。
最后,将Y(k)进行频时变换,得到时域信号Y(t)作为最终的输出信号。
本实施例通过将适配处理后的增强层信号替换兼容层信号对应的频谱成分的方式,达到了提升最终输出信号的编解码性能质量的目的。
在本申请的另一些实施例中,使用当前帧的增强层信号对当前帧的第一高频带信号进行替换,以得到当前帧的第二高频带信号,包括:
对当前帧的增强层信号和当前帧的第一高频带信号进行频谱成分对比选择,以从当前帧的增强层信号中选择出第一增强层子信号;
使用第一增强层子信号对当前帧的第一高频带信号中与第一增强层子信号的频谱相同的信号进行替换,以得到当前帧的第二高频带信号。
其中,解码组件可以比较增强层信号对应的频谱成分与兼容层信号中第一高频带信号对应的频谱成分,在完成频谱成分对比之后,从当前帧的增强层信号中选择出第一增强层子信号,最后再使用选择出的第一增强层子信号对当前帧的第一高频带信号中与第一增强层子信号的频谱相同的信号进行替换,以得到当前帧的第二高频带信号。例如,解码组件进行上述的频谱成分对比选择,根据比较结果将增强层信号中的一部分频谱成分用于和兼容层信号中对应的频谱成分进行替换处理,来得到最终的输出信号中的频谱成分,同时会舍弃掉增强层信号中的另一部分频谱成分,将兼容层信号中替换后的频谱成分与兼容层信号中的其它频谱成分组合后得到最终输出信号的全部频谱成分。
举例说明如下,解码组件在增强层信号和兼容层信号组合之前,先进行频谱成分对比选择操作,对比选择的处理过程为:假设增强层信号中有频谱成分Wk,而兼容层信号中在Wk附近有能量相当的频谱成分Zk,则判决结论是频谱成分Zk为兼容层编解码处理获得,Zk相比较Wk更加接近于原始信号中的对应的频谱成分,所以选择Zk作为最终输出信号的频谱成分。而增强层信号中Wk附近在兼容层信号中没有对应的频谱成分,则选择Wk为基础进行适配处理后作为最终输出信号的频谱成分,再与兼容层信号中的其它频谱成分组合后,得到最终输出信号的全部频谱成分。
本实施例中,解码组件根据增强层信号和兼容层信号选择最优的增强层信号对应的最终信号输出的频谱成分,本实施例对于兼容层信号高频段中含有高质量编解码频谱成分的情况选择兼容层输出新的频谱成分作为最终输出信号的频谱成分,在引入增强层编解码提升整体编解码性能的原则下,兼顾了兼容层信号中含有高性能编解码频谱成分的特殊情况,最终达到了最优的编解码输出信号。
适配处理方式四:
在本申请的一些实施例中,步骤305根据当前帧的增强层编码参数或增强层信号对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号,包括:
获取当前帧的兼容层信号中的兼容层解码信号和兼容层频带扩展信号;
对兼容层频带扩展信号和当前帧的增强层信号进行组合处理,以得到当前帧的第二高频带信号。
其中,解码组件可以确定兼容层信号中包括的兼容层解码信号和兼容层频带扩展信号,其中,兼容层解码信号是解码组件在兼容层中对兼容层编码参数进行解码得到的信号,兼容层频带扩展信号是解码组件在兼容层中通过频带扩展得到的信号,例如将低频带信号扩展至高频带从而可以得到兼容层频带扩展信号。本申请实施例中,解码组件可以对兼容层 频带扩展信号和当前帧的增强层信号进行组合处理,即第一高频带信号中的兼容层解码信号不用于与增强层信号的组合处理,解码组件只使用兼容层频带扩展信号和当前帧的增强层信号进行组合处理,在得到当前帧的第二高频带信号之后,使用第二高频带信号、增强层信号和第一低频带信号进行组合之后,获得最终的输出信号。可以得到更优的兼容层的高频带信号,从而实现输出更优的音频输出信号,提升了音频输出信号的性能。
进一步的,在本申请的一些实施例中,兼容层信号的频谱范围为[0,FL],其中,兼容层解码信号的频谱范围为[0,FT],兼容层频带扩展信号的频谱范围为[FT,FL];增强层信号的频谱范围为[FX,FY];音频输出信号的频谱范围为[0,FY];
FL=FY,FX<=FT,音频输出信号通过如下方式确定:音频输出信号中频谱范围为[0,FT]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FT,FL]的信号通过兼容层信号和增强层信号得到;或者,
FL=FY,FX>FT,确定音频输出信号通过如下方式确定:音频输出信号中频谱范围为[0,FX]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FX,FL]的信号通过兼容层信号和增强层信号得到;或者,
FL<FY,FX<=FT,确定音频输出信号通过如下方式确定:音频输出信号中频谱范围为[0,FT]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FT,FL]的信号通过兼容层信号和增强层信号得到;或者,
FL<FY,FX>FT,确定音频输出信号通过如下方式确定:音频输出信号中频谱范围为[0,FX]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FX,FL]的信号通过兼容层信号和增强层信号得到。
具体的,兼容层信号可以包括兼容层解码信号和兼容层频带扩展信号,解码组件可以确定兼容层信号中兼容层解码信号和兼容层频带扩展信号的界限,从而可以确定出兼容层解码信号的频谱范围为[0,FT],兼容层频带扩展信号的频谱范围为[FT,FL]。例如,本实施例,解码组件可以获得兼容层信号中哪些频谱是通过编解码处理获得的,以及哪些频谱是通过频带扩展获得的,在最终的输出信号中包括兼容层信号中编解码处理部分的频谱,而频带扩展部分的频谱可以使用增强层信号和兼容层信号中对应频谱成分组合处理获得的。
例如,假设音频编解码器的原始输入信号采样频率是FS,频谱范围是0至FS/2,兼容层信号的频谱范围为0至FL,其中,0至FT直接编解码处理得到,FT至FL为频带扩展处理得到,增强层信号的频谱范围是FX至FY,最终的输出信号为Y。则根据各频谱范围边界值的大小关系可以得到前述的处理方式。例如,FL=FY=FS/2,FX<=FT,即增强层信号的最小频谱范围FX小于兼容层解码信号的最大频谱范围,此时音频输出信号通过如下方式确定:音频输出信号中频谱范围为[0,FT]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FT,FL]的信号通过兼容层信号和增强层信号得到。又如,FL=FY,FX>FT,即增强层信号的最小频谱范围FX大于兼容层解码信号的最大频谱范围,此时音频输出信号通过如下方式确定:音频输出信号中频谱范围为[0,FX]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FX,FL]的信号通过兼容层信号和增强层信号得到。又如,FL<FY,FX<=FT,即增强层信号的最大频谱范围FY大于兼容层频带扩展信号的频谱范围,且增强层信号的最小频谱范围FX小于兼容层解码信号的最大频谱范围,此时音频输出信号通过如下方式确定: 音频输出信号中频谱范围为[0,FT]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FT,FL]的信号通过兼容层信号和增强层信号得到。又如FL<FY,FX>FT,即增强层信号的最大频谱范围FY大于兼容层频带扩展信号的频谱范围,且增强层信号的最小频谱范围FX大于兼容层解码信号的最大频谱范围,此时音频输出信号通过如下方式确定:音频输出信号中频谱范围为[0,FX]的信号通过兼容层信号得到,音频输出信号中频谱范围为[FX,FL]的信号通过兼容层信号和增强层信号得到
本实施例中,兼容层完全后向兼容旧的编解码组件,组合输出自适应根据兼容层的输出信号、编解码频谱范围和增强层信号计算生成高性能的最终输出信号。在保证兼容层完全后向兼容旧的编码组件的基础上,组合处理的频谱范围的上限为增强层编解码处理频谱范围的上限即是原始信号的截止频率,组合处理的频谱范围的下限为兼容层编码处理频谱范围上限和增强层信号频谱范围下限中较大的值,保证了最终的输出信号频谱范围包含输入信号的整个频谱范围,在此种情况下输出信号兼具了兼容层信号和增强层信号的优点。
306、根据当前帧的增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号得到当前帧的音频输出信号。
在本申请实施例中,通过前述步骤305的描述可知,兼容层中可以完成对第一高频带信号的适配处理,得到了兼容层中的第二高频带信号,最后将兼容层中解码输出的第一低频带信号、增强层中的增强层信号以及兼容层中的第二高频带信号进行组合处理,从而可以得到当前帧的音频输出信号,当前帧的音频输出信号可以用于音频播放组件的音频播放。
需要说明的是,图3中所示的解码方法仅为示例而非限定,本申请实施例对图3中各步骤的执行顺序并不限定,图3中所示的解码方法也可以包括更多或更少的步骤,本申请实施例中对此并不限定。
在本申请的一些实施例中,步骤306根据当前帧的增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号得到当前帧的音频输出信号之后,本申请实施例提供的解码方法还包括:
对当前帧的音频输出信号进行后处理。
其中,解码组件在得到当前帧的音频输出信号之后,还可以对音频输出信号进行后处理,从而可以取得后处理的增益。
在本申请的一些实施例中,后处理包括如下至少一种:动态范围控制、渲染、混音。
举例说明如下,解码组件中可以包括后处理器,后处理器的作用是对高频带信号进行后处理,例如在增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号进行组合处理之后得到音频输出信号时,对该音频输出信号进行后处理。后处理器的功能可包含动态范围控制(dynamic range control,DRC)、渲染、混音等,对于实际应用场景中采取的后处理方式不做限定。
在本申请的一些实施例中,步骤306根据当前帧的增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号得到当前帧的音频输出信号之前,本申请实施例提供的解码方法还包括:
根据兼容层信号获取后处理参数;
使用后处理参数对增强层信号进行后处理,以得到完成后处理的增强层信号。
其中,解码组件还可以在得到当前帧的音频输出信号之前,根据兼容层信号获取后处理参数,该后处理参数是指后处理所需要的参数,根据后处理的类型不同需要获取相应的后处理参数,使用后处理参数对增强层信号进行后处理,完成后处理后,可以对完成后处理的增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号进行组合处理,之后得到音频输出信号。本申请实施例中可以对增强层信号进行后处理,从而可以取得后处理的增益。
举例说明如下,增强层信号与经过后处理后的兼容层信号进行组合处理,获得最终的输出信号。本实施例与前述实施例不同之处在于,在增强层增加与兼容层相同的后处理。在确定出兼容层信号之后,再进行例如动态范围控制、渲染、混音等后处理,然后再进行组合处理。例如,如果能够获得兼容层直接解码处理后的信号,则先将增强层信号与兼容层信号组合后再一起进行上述的后处理。又如,如果无法获得兼容层直接解码处理后的信号,则先对增强层信号进行上述后处理,再与兼容层信号做组合处理。
具体的,对增强层信号进行后处理的方式有多种,例如可以直接从兼容层信号中获取后处理参数,然后使用该后处理参数对增强层信号进行后处理。又如,通过后处理,可以保证组合处理前后的频谱成分按子带在帧内子带间有相似的能量关系,以保证通过组合处理可以得到最终的音频输出信号。
本申请实施例中,兼容层完全兼容旧的编解码组件,组合处理后的信号包含了兼容层输出时所带有的后处理操作,使得旧的编解码组件可以实现对音频信号的全频带范围内的编解码。
通过前述实施例对本申请中解码方法的举例说明可知,获取编码码流;对编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和当前帧的增强层编码参数;根据兼容层编码参数得到当前帧的兼容层信号,兼容层信号包括:当前帧的第一高频带信号和当前帧的第一低频带信号;根据增强层编码参数得到当前帧的增强层信号;根据当前帧的增强层编码参数或增强层信号对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号;根据当前帧的增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号得到当前帧的音频输出信号。本申请实施例中在兼容层中可以解码音频信号的全部频域范围,而在增强层中只解码音频信号的高频频域范围。兼容层可以使用旧的音频解码设备来实现,而增强层和兼容层可以使用新的音频解码设备来实现,因此在本申请实施例中,实现新的音频解码设备与旧的音频解码设备的兼容,根据音频解码设备自身的设备类型,可以选择只在兼容层进行解码,或者同时在兼容层和增强层进行解码,本申请实施例不需要针对旧的音频解码设备新增转码模块,因此省去了音频解码设备的升级成本,且能够提高音频信号的解码效率。
可选地,编码组件110和解码组件120可以设置在同一设备中;或者,也可以设置在不同设备中。设备可以为手机、平板电脑、膝上型便携计算机和台式计算机、蓝牙音箱、录音笔、可穿戴式设备等具有音频信号处理功能的终端,也可以是核心网、无线网中具有音频信号处理能力的网元,本实施例对此不作限定。
示意性地,如图4所示,本实施例以编码组件110设置于移动终端130中、解码组件120设置于移动终端140中,移动终端130与移动终端140是相互独立的具有音频信号处 理能力的电子设备,例如可以是手机,可穿戴设备,虚拟现实(virtual reality,VR)设备,或增强现实(augmented reality,AR)设备等等,且移动终端130与移动终端140之间通过无线或有线网络连接为例进行说明。
可选地,移动终端130可以包括采集组件131、编码组件110和信道编码组件132,其中,采集组件131与编码组件110相连,编码组件110与编码组件132相连。
可选地,移动终端140可以包括音频播放组件141、解码组件120和信道解码组件142,其中,音频播放组件141与解码组件120相连,解码组件120与信道解码组件142相连。
移动终端130通过采集组件131采集到音频信号后,通过编码组件110对该音频信号进行编码,得到编码码流;然后,通过信道编码组件132对编码码流进行编码,得到传输信号。
移动终端130通过无线或有线网络将该传输信号发送至移动终端140。
移动终端140接收到该传输信号后,通过信道解码组件142对传输信号进行解码得到码码流;通过解码组件110对编码码流进行解码得到音频信号;通过音频播放组件播放该音频信号。可以理解的是,移动终端130也可以包括移动终端140所包括的组件,移动终端140也可以包括移动终端130所包括的组件。
示意性地,如图5所示,以编码组件110和解码组件120设置于同一核心网或无线网中具有音频信号处理能力的网元150中为例进行说明。
可选地,网元150包括信道解码组件151、解码组件120、编码组件110和信道编码组件152。其中,信道解码组件151与解码组件120相连,解码组件120与编码组件110相连,编码组件110与信道编码组件152相连。
信道解码组件151接收到其它设备发送的传输信号后,对该传输信号进行解码得到第一编码码流;通过解码组件120对编码码流进行解码得到音频信号;通过编码组件110对该音频信号进行编码,得到第二编码码流;通过信道编码组件152对该第二编码码流进行编码得到传输信号。
其中,其它设备可以是具有音频信号处理能力的移动终端;或者,也可以是具有音频信号处理能力的其它网元,本实施例对此不作限定。
可选地,网元中的编码组件110和解码组件120可以对移动终端发送的编码码流进行转码。
可选地,本申请实施例中可以将安装有编码组件110的设备称为音频编码设备,在实际实现时,该音频编码设备也可以具有音频解码功能,本申请实施对此不作限定。
可选地,本申请实施例中可以将安装有解码组件120的设备称为音频解码设备,在实际实现时,该音频解码设备也可以具有音频编码功能,本申请实施对此不作限定。
为便于更好的理解和实施本申请实施例的上述方案,下面举例相应的应用场景来进行具体说明。
请参阅如图6所示,为本申请实施例中一种音频编解码流程的示意图,图6中虚线左侧为编码端,虚线右侧为解码端。将输入信号分别用增强层和兼容层进行编码,将增强层信号和兼容层信号组合后获得编解码器最终的输出。
如图7a所示,为本申请实施例提供的原始信号频谱示意图,图7a中所示的曲线为原 始信号在各个频带的频谱。在编码端,首先对输入信号进行兼容层编码得到兼容层信号,如图7b所示,为本申请实施例提供的兼容层编码信号频谱示意图,该兼容层编码信号频谱包括:高频带信号和低频带信号,图7b中竖线左侧为低频带信号,竖线右侧为高频带信号。编码端还可以对该输入信号进行信号分类,信号分类时生成信号类型参数,根据信号类型参数进行增强层编码得到增强层信号,图7c所示,为本申请实施例提供的增强层编码信号频谱示意图,图7c中所示的虚线为增强层编码信号在高频带的频谱。将兼容层信号、增强层信号和信号类型参数进行码流复用,得到编码码流。如图7d所示,为本申请实施例提供的音频输出信号频谱示意图,兼容层信号、增强层信号和信号类型参数进行码流复用,即可以将图7b所示的兼容层编码信号频谱和图7c所示的增强层编码信号频谱进行组合,以生成编码码流。
举例说明如下,首先将输入信号输入到兼容层编码器,兼容编码器编码后的兼容层编码参数送入到码流复用器,输入信号还可以输入到信号分类器中,将信号类型参数送入到码流复用器中,根据信号类型参数选择对应的增强层模式1至N对输入信号的部分频谱成分进行编码,增强层编码器编码后的增强层编码参数送入到码流复用器中,码流复用器输出的编码码流发送到解码端。
在本申请的一些实施例中,如图6所示,兼容层编码频带信息还可以发送至增强层编码器中,以使得增强层编码器可以按照该兼容层编码频带信息确定对增强层中哪些频带进行编码,详见前述实施例中的说明,此处不再展开说明。
解码端首先对该编码码流进行码流解复用,通过信号类型参数解码得到信号类型参数,通过增强层解码得到增强层信号,通过兼容层解码得到兼容层信号,然后使用信号类型参数和增强层信号对兼容层信号进行适配处理,然后将适配后的兼容层信号、信号类型参数和增强层信号进行组合处理,最后得到输出信号。
举例说明如下,解码端利用码流解复用器将兼容层编码参数送入到兼容层解码器中得到兼容层信号,信号类型参数解码器解码出信号类型参数,增强层模式1至N解码器根据输入的对应码流和信号类型参数解码输出得到增强层信号,通过适配器使用增强层信号对兼容层信号进行适配处理,最后将将适配后的兼容层信号、增强层信号和信号类型参数信息送入到组合器中,从组合器得到解码器最终的输出信号。
其中,本申请实施例中的兼容层编解码器可以是任意编解码器,例如兼容层编解码器可以是MPEG-H 3D Audio编解码器,此编解码包含时域编解码模式和变换域编解码模式,支持多声道输入信号的编解码。对于兼容层编解码器的编解码流程不再详细说明。
在本申请的一些实施例中,如图6所示,兼容层信号还可以发送至增强层解码器中,以使得增强层解码器可以按照该兼容层信号确定对增强层中哪些频带进行解码,详见前述实施例中的说明,此处不再展开说明。
接下来对增强层编解码处理方式进行举例说明。
一种处理方式包括:信号分类器将高频带信号分为如下三种预设信号类型:谐波信号、含有独立音调成分的信号、以及其它信号。对上述三种信号执行不同的处理操作,例如对谐波信号,编码端可以对谐波信号的编码基频,谐波个数,幅度以及基底能量进行编码,从而可以得到增强层编码参数,在解码端根据基频,谐波个数,幅度以及基底能量在对应 的位置重建出能量与原始信号相当得的谐波信号。又如,对含有独立音调成分的信号,在编码端将音调成分按正弦轨迹曲线进行处理,将幅度,相位以及轨迹线的起始点和结束点编码后可以得到增强层编码参数,该增强层编码参数被发送到解码端,解码端根据解码得到的幅度,相位以及轨迹线的起始点和结束点重建得到含有音调成分的信号,对于除谐波信号、含有独立音调成分的信号之外的其它信号,编码端不进行增强层编码处理,直接将兼容层信号作为最终的输出信号。
另一种处理方式包括:信号分类器将高频带信号分成4类信号,包括:谐波信号、含有独立音调成分的信号、类白噪声信号和其它信号。其中谐波信号,含有独立音调成分的信号,其它信号的处理方式和上一种处理方式相同。对于类白噪声信号,编码端使用白噪声作为激励信号与原始的高频带信号一起计算得到增强层的包络信息,该增强层的包络信息作为增强层编码参数传送到解码端,解码端利用接收到的包络信息使用白噪声作为激励信号重建得到增强层信号。
不限的是,信号分类器还可以将高频带信号分成更多类型的信号,划分出N种信号类型,则增强层编码器有N种编码模式,每一种编码模式处理一种类型的信号。例如信号分类器将高频带信号分成6类信号,包括:谐波信号、含有独立音调成分的信号、类白噪声信号,瞬态信号,摩擦音信号和其它信号。其中,谐波信号,含有独立音调成分的信号,类白噪声信号,其它信号的处理方式和上一种处理方式相同。对于瞬态信号,增强层对时域包络进行更精细的编码,从而使得瞬态信号包括的子帧的时域包络之间的赋值差异更加明显。对于摩擦音信号,增强层对信号的频谱包络进行精细编码,从而使得解码端的恢复信号的频谱包络与原始信号更接近,从而达到提升编码性能的目的。
如图8所示,为本申请实施例提供的增强层编码参数和兼容层编码参数进行组合后的输出频谱示意图。例如Ylc(n)表示兼容层编码参数,Ylc(n)中包括高频信号HF和低频信号LF,Yel(n)表示增强层编码参数,Yel(n)中包括高频信号HFe,增强层编码参数和兼容层编码参数组合后的最终输出信号为Y(n),Y(n)中包括高频信号HFnew和低频信号LF,其中,高频信号HFnew可以是增强层信号和兼容层信号适配后的高频信号。
例如,对谐波信号的具体的处理流程包括:编码器的输入信号信号为:x(n),n=0,1,2,3,…,x(n)的采样频率为Fs,频带宽度为Fs/2,x(n)信号经过兼容层编码后输出频带宽度为Fs/2的Ylc(n),n=0,1,2,3,…。x(n)信号经过信号分类器,产生的信号分类参数放入编码码流中,如果信号分类参数指示当前帧含有谐波信号,则对其通过增强层进行编码,编码信号解码后输出频带为HFe的信号Yel(n),n=0,1,2,3,…。
上述Ylc(n)和Yel(n)进行组合后获得输出信号Y(n),其信号带宽有两个部分频段LF和HFnew组成。Y(n)的编解码性能质量优于Ylc(n)的编解码性能质量。
接下来对增强层信号兼容层信号的组合过程进行说明,Ylc(n)信号的频域表达式为Ylc(k),k=0,1,2,3,….M;Yel(n)信号的频域表达式为Yel(k),k=0,1,2,3,…,V,则Y(n)信号的频域表达式为Y(k),k=0,1,2,3,…M;
Y(k)=Ylc(k),k=0,1,2,…,M-V;
Y(k)=Ylc(k)*H1(k-M+V-1)+Yel(k-M+V-1)*H2(k-M+V-1),k=M-V+1,M-V+2,…,M。
其中,上述的H1(.)和H2(.)分别为兼容层信号的适配处理函数和增强层信号的适配处 理函数。
以谐波信号的解码为例,解码端根据基频大小、谐波个数和幅度重建出对应的谐波分量成分设为Yel(k),假设增强层的基底能量EnerNF,兼容层输出的包络能量为EnerENV,则上述两个适配处理函数如下所示:H1(k)=EnerNF/EnerENV,H2(k)=1。
输出信号Y(k)为:
Y(k)=Ylc(k),k=0,1,2,…,M-V,
Y(k)=Ylc(k)*EnerNF/EnerENV+Yel(k-M+V-1),k=M-V+1,M-V+2,…,M。
最后,再将Y(k)转化成时域信号Y(t),即为最终的输出信号。
在本申请实施例提供的前述音频编解码流程中,一个音频编解码系统包含一个兼容层和一个增强层。兼容层能够完整的实现音频编解码功能,且生成的码流与旧的编解码系统完全兼容。本实施例兼容层完全后向兼容旧的编解码器,本实施例增强层根据信号分类参数对预设信号类型的信号进行编解码,在解码端根据信号分类参数对增强层信号和兼容层信号进行组合处理后获得最终的输出信号。增强层能够对输入音频信号的部分频谱进行编解码。解码端根据增强层的信息决定将兼容层输出的解码音频信号作为最终的解码输出信号,还是将增强层解码输出与兼容层解码输出先进行组合,再作为最终的解码输出信号。其中,兼容层与音频编解码系统有相同的输入信号,兼容层对输入信号的所有频谱成分进行编解码。
本实施例利用信号分类器将预设信号类型的信号通过增强层进行增强编码,利用增强层信号与兼容层信号进行组合后获得解码器整体的输出信号,解码器的整体输出信号编解码性能优于兼容层编解码直接输出信号的编解码性能。
需要说明的是,对于前述的各方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请并不受所描述的动作顺序的限制,因为依据本申请,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作和模块并不一定是本申请所必须的。
为便于更好的实施本申请实施例的上述方案,下面还提供用于实施上述方案的相关装置。
请参阅图9所示,本申请实施例提供的一种音频编码设备900,可以包括:获取模块901、兼容层编码模块902、增强层编码模块903、复用模块904,其中,
获取模块,用于获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;
兼容层编码模块,用于根据所述高频带信号和所述低频带信号得到所述当前帧的兼容层编码参数;
增强层编码模块,用于根据所述高频带信号得到所述当前帧的增强层编码参数;
复用模块,用于对所述兼容层编码参数和所述增强层编码参数进行码流复用,以得到编码码流。
在本申请的一些实施例中,增强层编码模块,用于获取所述当前帧的高频带信号的信号类型信息;当所述当前帧的高频带信号的信号类型信息指示预设信号类型时,对所述当前帧的高频带信号进行编码,以得到所述当前帧的增强层编码参数。
在本申请的一些实施例中,所述预设信号类型包括如下至少一种:谐波信号类型,音调信号类型,类白噪声信号类型,瞬态信号类型,或摩擦音信号类型。
在本申请的一些实施例中,所述当前帧的增强层编码参数还包括:所述当前帧的高频带信号的信号类型信息。
在本申请的一些实施例中,增强层编码模块,用于获取兼容层编码频带信息;根据所述兼容层编码频带信息确定所述当前帧的高频带信号中的待编码频带信号;对所述待编码频带信号进行编码,以得到所述增强层编码参数。
通过前述实施例对本申请中编码方法的举例说明可知,获取音频信号的当前帧,当前帧包括:高频带信号和低频带信号;根据高频带信号和低频带信号得到当前帧的兼容层编码参数;根据高频带信号得到当前帧的增强层编码参数;对兼容层编码参数和增强层编码参数进行码流复用,以得到编码码流。本申请实施例中在兼容层中可以编码音频信号的全部频域范围,而在增强层中只编码音频信号的高频频域范围。兼容层可以使用旧的音频编码设备来实现,而增强层和兼容层可以使用新的音频编码设备来实现,因此在本申请实施例中,实现新的音频编码设备与旧的音频编码设备的兼容,根据音频编码设备自身的设备类型,可以选择只在兼容层进行编码,或者同时在兼容层和增强层进行编码,本申请实施例不需要针对旧的音频编码设备新增转码模块,因此省去了音频编码设备的升级成本,且能够提高音频信号的编码效率。
请参阅图10所示,本申请实施例提供的一种音频解码设备1000,可以包括:获取模块1001、解复用模块1002、兼容层解码模块1003、增强层解码模块1004、适配模块1005和组合模块1006,其中,
获取模块,用于获取编码码流;
解复用模块,用于对所述编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和所述当前帧的增强层编码参数;
兼容层解码模块,用于根据所述兼容层编码参数得到所述当前帧的兼容层信号,所述兼容层信号包括:所述当前帧的第一高频带信号和所述当前帧的第一低频带信号;
增强层解码模块,用于根据所述增强层编码参数得到所述当前帧的增强层信号;
适配模块,用于根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号;
组合模块,用于根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号。
在本申请的一些实施例中,增强层解码模块,用于根据所述当前帧的增强层编码参数获取信号类型信息;按照所述信号类型信息指示的预设信号类型对所述当前帧的增强层编码参数进行解码,以得到所述当前帧的增强层信号。
在本申请的一些实施例中,适配模块,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取兼容层高频带调整参数;使用所述兼容层高频带调整参数对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于获取所述当前帧的增强层编码参数或增强 层信号对应的包络信息,以及获取所述当前帧的第一高频带信号的包络信息;根据所述增强层编码参数或增强层信号对应的包络信息和所述第一高频带信号的包络信息获取所述兼容层高频带调整参数。
在本申请的一些实施例中,适配模块,用于根据预设高频带频谱选择规则从所述当前帧的增强层信号中选择出所述当前帧的增强层高频带频谱信号;对所述增强层高频带频谱信号与所述当前帧的第一高频带信号进行组合处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于获取所述当前帧的第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号;确定所述当前帧的增强层信号中与所述兼容层频带扩展信号对应的信号为所述当前帧的增强层高频带频谱信号。
在本申请的一些实施例中,适配模块,用于使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述增强层高频带调整参数对所述当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号;使用所述适配处理后的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到替换后的第一高频带信号;使用所述增强层高频带调整参数对所述替换后的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,适配模块,用于对所述当前帧的增强层信号和所述当前帧的第一高频带信号进行频谱成分对比选择,以从所述当前帧的增强层信号中选择出第一增强层子信号;使用所述第一增强层子信号对所述当前帧的第一高频带信号中与所述第一增强层子信号的频谱相同的信号进行替换,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,增强层解码模块,用于根据所述增强层编码参数和所述兼容层编码参数确定所述增强层编码参数中的待解码增强层高频信号;对所述增强层编码参数中的待解码增强层高频信号进行解码,以得到所述当前帧的增强层信号。
在本申请的一些实施例中,适配模块,用于获取所述当前帧的兼容层信号中的兼容层解码信号和兼容层频带扩展信号;对所述兼容层频带扩展信号和所述当前帧的增强层信号进行组合处理,以得到所述当前帧的第二高频带信号。
在本申请的一些实施例中,所述兼容层信号的频谱范围为[0,FL],其中,所述兼容层解码信号的频谱范围为[0,FT],所述兼容层频带扩展信号的频谱范围为[FT,FL];所述增强层信号的频谱范围为[FX,FY];所述音频输出信号的频谱范围为[0,FY];
所述FL=FY,所述FX<=FT,所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL=FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信 号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL<FY,所述FX<=FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
所述FL<FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到。
在本申请的一些实施例中,音频解码设备1000,还可以包括:后处理模块,用于组合模块根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之后,对所述当前帧的音频输出信号进行后处理。
在本申请的一些实施例中,音频解码设备1000,还可以包括:后处理模块,用于组合模块根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之前,根据所述兼容层信号获取后处理参数;使用所述后处理参数对所述增强层信号进行后处理,以得到完成所述后处理的增强层信号。
通过前述实施例对本申请中解码方法的举例说明可知,获取编码码流;对编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和当前帧的增强层编码参数;根据兼容层编码参数得到当前帧的兼容层信号,兼容层信号包括:当前帧的第一高频带信号和当前帧的第一低频带信号;根据增强层编码参数得到当前帧的增强层信号;根据当前帧的增强层编码参数或增强层信号对当前帧的第一高频带信号进行适配处理,以得到当前帧的第二高频带信号;根据当前帧的增强层信号、当前帧的第二高频带信号和当前帧的第一低频带信号得到当前帧的音频输出信号。本申请实施例中在兼容层中可以解码音频信号的全部频域范围,而在增强层中只解码音频信号的高频频域范围。兼容层可以使用旧的音频解码设备来实现,而增强层和兼容层可以使用新的音频解码设备来实现,因此在本申请实施例中,实现新的音频解码设备与旧的音频解码设备的兼容,根据音频解码设备自身的设备类型,可以选择只在兼容层进行解码,或者同时在兼容层和增强层进行解码,本申请实施例不需要针对旧的音频解码设备新增转码模块,因此省去了音频解码设备的升级成本,且能够提高音频信号的解码效率。
如图11所示,本申请实施例还提供一种音频编码设备,所述音频编码设备1100包括:兼容层编码器1101、增强层编码器1102和码流复用器1103,其中,
所述兼容层编码器,用于获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;根据所述高频带信号和所述低频带信号得到所述当前帧的兼容层编码参数;
所述增强层编码器,用于获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;根据所述高频带信号得到所述当前帧的增强层编码参数;
所述码流复用器,用于对所述兼容层编码参数和所述增强层编码参数进行码流复用,以得到编码码流。
具体的,音频编码设备可以执行前述图2所示的音频编码方法,详见前述实施例中对 音频编码方法的举例说明,此处不再赘述。
如图12所示,本申请实施例还提供一种音频解码设备,所述音频解码设备1200包括:码流解复用器1201、兼容层解码器1202、增强层解码器1203、适配处理器1204和组合器1205,其中,
所述码流解复用器,用于获取编码码流;对所述编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和所述当前帧的增强层编码参数;
所述兼容层解码器,用于根据所述兼容层编码参数得到所述当前帧的兼容层信号,所述兼容层信号包括:所述当前帧的第一高频带信号和所述当前帧的第一低频带信号;
所述增强层解码器,用于根据所述增强层编码参数得到所述当前帧的增强层信号;
所述适配处理器,用于根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号;
所述组合器,用于根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号。
具体的,音频解码设备可以执行前述图3所示的音频解码方法,详见前述实施例中对音频解码方法的举例说明,此处不再赘述。
需要说明的是,上述装置各模块/单元之间的信息交互、执行过程等内容,由于与本申请方法实施例基于同一构思,其带来的技术效果与本申请方法实施例相同,具体内容可参见本申请前述所示的方法实施例中的叙述,此处不再赘述。
本申请实施例还提供一种计算机存储介质,其中,该计算机存储介质存储有程序,该程序执行包括上述方法实施例中记载的部分或全部步骤。
接下来介绍本申请实施例提供的另一种音频编码设备,请参阅图13所示,音频编码设备1300包括:
接收器1301、发射器1302、处理器1303和存储器1304(其中音频编码设备1300中的处理器1303的数量可以一个或多个,图13中以一个处理器为例)。在本申请的一些实施例中,接收器1301、发射器1302、处理器1303和存储器1304可通过总线或其它方式连接,其中,图13中以通过总线连接为例。
存储器1304可以包括只读存储器和随机存取存储器,并向处理器1303提供指令和数据。存储器1304的一部分还可以包括非易失性随机存取存储器(non-volatile random access memory,NVRAM)。存储器1304存储有操作系统和操作指令、可执行模块或者数据结构,或者它们的子集,或者它们的扩展集,其中,操作指令可包括各种操作指令,用于实现各种操作。操作系统可包括各种系统程序,用于实现各种基础业务以及处理基于硬件的任务。
处理器1303控制音频编码设备的操作,处理器1303还可以称为中央处理单元(central processing unit,CPU)。具体的应用中,音频编码设备的各个组件通过总线系统耦合在一起,其中总线系统除包括数据总线之外,还可以包括电源总线、控制总线和状态信号总线等。但是为了清楚说明起见,在图中将各种总线都称为总线系统。
上述本申请实施例揭示的方法可以应用于处理器1303中,或者由处理器1303实现。处理器1303可以是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的 各步骤可以通过处理器1303中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器1303可以是通用处理器、数字信号处理器(digital signal processing,DSP)、专用集成电路(application specific integrated circuit,ASIC)、现场可编程门阵列(field-programmable gate array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器1304,处理器1303读取存储器1304中的信息,结合其硬件完成上述方法的步骤。
接收器1301可用于接收输入的数字或字符信息,以及产生与音频编码设备的相关设置以及功能控制有关的信号输入,发射器1302可包括显示屏等显示设备,发射器1302可用于通过外接接口输出数字或字符信息。
本申请实施例中,处理器1303,用于执行前述图2所示的音频编码方法。
接下来介绍本申请实施例提供的另一种音频解码设备,请参阅图14所示,音频解码设备1400包括:
接收器1401、发射器1402、处理器1403和存储器1404(其中音频解码设备1400中的处理器1403的数量可以一个或多个,图14中以一个处理器为例)。在本申请的一些实施例中,接收器1401、发射器1402、处理器1403和存储器1404可通过总线或其它方式连接,其中,图14中以通过总线连接为例。
存储器1404可以包括只读存储器和随机存取存储器,并向处理器1403提供指令和数据。存储器1404的一部分还可以包括NVRAM。存储器1404存储有操作系统和操作指令、可执行模块或者数据结构,或者它们的子集,或者它们的扩展集,其中,操作指令可包括各种操作指令,用于实现各种操作。操作系统可包括各种系统程序,用于实现各种基础业务以及处理基于硬件的任务。
处理器1403控制音频解码设备的操作,处理器1403还可以称为CPU。具体的应用中,音频解码设备的各个组件通过总线系统耦合在一起,其中总线系统除包括数据总线之外,还可以包括电源总线、控制总线和状态信号总线等。但是为了清楚说明起见,在图中将各种总线都称为总线系统。
上述本申请实施例揭示的方法可以应用于处理器1403中,或者由处理器1403实现。处理器1403可以是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步骤可以通过处理器1403中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器1403可以是通用处理器、DSP、ASIC、FPGA或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读 存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器1404,处理器1403读取存储器1404中的信息,结合其硬件完成上述方法的步骤。
本申请实施例中,处理器1403,用于执行前述图3所示的音频解码方法。
在另一种可能的设计中,当音频编码设备或音频解码设备为终端内的芯片时,芯片包括:处理单元和通信单元,所述处理单元例如可以是处理器,所述通信单元例如可以是输入/输出接口、管脚或电路等。该处理单元可执行存储单元存储的计算机执行指令,以使该终端内的芯片执行上述第一方面任意一项的方法。可选地,所述存储单元为所述芯片内的存储单元,如寄存器、缓存等,所述存储单元还可以是所述终端内的位于所述芯片外部的存储单元,如只读存储器(read-only memory,ROM)或可存储静态信息和指令的其他类型的静态存储设备,随机存取存储器(random access memory,RAM)等。
其中,上述任一处提到的处理器,可以是一个通用中央处理器,微处理器,ASIC,或一个或多个用于控制上述第一方面方法的程序执行的集成电路。
另外需说明的是,以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。另外,本申请提供的装置实施例附图中,模块之间的连接关系表示它们之间具有通信连接,具体可以实现为一条或多条通信总线或信号线。
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件的方式来实现,当然也可以通过专用硬件包括专用集成电路、专用CPU、专用存储器、专用元器件等来实现。一般情况下,凡由计算机程序完成的功能都可以很容易地用相应的硬件来实现,而且,用来实现同一功能的具体硬件结构也可以是多种多样的,例如模拟电路、数字电路或专用电路等。但是,对本申请而言更多情况下软件程序实现是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在可读取的存储介质中,如计算机的软盘、U盘、移动硬盘、ROM、RAM、磁碟或者光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述的方法。
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。
所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算 机能够存储的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘(Solid State Disk,SSD))等。

Claims (28)

  1. 一种音频编码方法,其特征在于,所述方法包括:
    获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;
    根据所述高频带信号和所述低频带信号得到所述当前帧的兼容层编码参数;
    根据所述高频带信号得到所述当前帧的增强层编码参数;
    对所述兼容层编码参数和所述增强层编码参数进行码流复用,以得到编码码流。
  2. 根据权利要求1所述的方法,其特征在于,所述根据所述高频带信号得到所述当前帧的增强层编码参数,包括:
    获取所述当前帧的高频带信号的信号类型信息;
    当所述当前帧的高频带信号的信号类型信息指示预设信号类型时,对所述当前帧的高频带信号进行编码,以得到所述当前帧的增强层编码参数。
  3. 根据权利要求2所述的方法,其特征在于,所述预设信号类型包括如下至少一种:谐波信号类型,音调信号类型,类白噪声信号类型,瞬态信号类型,或摩擦音信号类型。
  4. 根据权利要求2或3所述的方法,其特征在于,所述当前帧的增强层编码参数还包括:所述当前帧的高频带信号的信号类型信息。
  5. 根据权利要求1所述的方法,其特征在于,所述根据所述高频带信号得到所述当前帧的增强层编码参数,包括:
    获取兼容层编码频带信息;
    根据所述兼容层编码频带信息确定所述当前帧的高频带信号中的待编码频带信号;
    对所述待编码频带信号进行编码,以得到所述增强层编码参数。
  6. 一种音频解码方法,其特征在于,所述方法包括:
    获取编码码流;
    对所述编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和所述当前帧的增强层编码参数;
    根据所述兼容层编码参数得到所述当前帧的兼容层信号,所述兼容层信号包括:所述当前帧的第一高频带信号和所述当前帧的第一低频带信号;
    根据所述增强层编码参数得到所述当前帧的增强层信号;
    根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号;
    根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号。
  7. 根据权利要求6所述的方法,其特征在于,所述根据所述增强层编码参数得到所述当前帧的增强层信号,包括:
    根据所述当前帧的增强层编码参数获取信号类型信息;
    按照所述信号类型信息指示的预设信号类型对所述当前帧的增强层编码参数进行解码,以得到所述当前帧的增强层信号。
  8. 根据权利要求6或7所述的方法,其特征在于,所述根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第 二高频带信号,包括:
    根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取兼容层高频带调整参数;
    使用所述兼容层高频带调整参数对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
  9. 根据权利要求8所述的方法,其特征在于,所述根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取兼容层高频带调整参数,包括:
    获取所述当前帧的增强层编码参数或增强层信号对应的包络信息,以及获取所述当前帧的第一高频带信号的包络信息;
    根据所述增强层编码参数或增强层信号对应的包络信息和所述第一高频带信号的包络信息获取所述兼容层高频带调整参数。
  10. 根据权利要求6或7所述的方法,其特征在于,所述根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号,包括:
    根据预设高频带频谱选择规则从所述当前帧的增强层信号中选择出所述当前帧的增强层高频带频谱信号;
    对所述增强层高频带频谱信号与所述当前帧的第一高频带信号进行组合处理,以得到所述当前帧的第二高频带信号。
  11. 根据权利要求10所述的方法,其特征在于,所述根据预设高频带频谱选择规则从所述当前帧的增强层信号中选择出所述当前帧的增强层高频带频谱信号,包括:
    获取所述当前帧的第一高频带信号中包括的兼容层解码信号和兼容层频带扩展信号;
    确定所述当前帧的增强层信号中与所述兼容层频带扩展信号对应的信号为所述当前帧的增强层高频带频谱信号。
  12. 根据权利要求6或7所述的方法,其特征在于,所述根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号,包括:
    使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
  13. 根据权利要求12所述的方法,其特征在于,所述使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号,包括:
    根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;
    使用所述增强层高频带调整参数对所述当前帧的增强层信号进行适配处理,以得到适配处理后的增强层信号;
    使用所述适配处理后的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号。
  14. 根据权利要求12所述的方法,其特征在于,所述使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号,包括:
    根据所述当前帧的增强层编码参数或增强层信号和所述当前帧的第一高频带信号获取增强层高频带调整参数;
    使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到替换后的第一高频带信号;
    使用所述增强层高频带调整参数对所述替换后的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号。
  15. 根据权利要求12所述的方法,其特征在于,所述使用所述当前帧的增强层信号对所述当前帧的第一高频带信号进行替换,以得到所述当前帧的第二高频带信号,包括:
    对所述当前帧的增强层信号和所述当前帧的第一高频带信号进行频谱成分对比选择,以从所述当前帧的增强层信号中选择出第一增强层子信号;
    使用所述第一增强层子信号对所述当前帧的第一高频带信号中与所述第一增强层子信号的频谱相同的信号进行替换,以得到所述当前帧的第二高频带信号。
  16. 根据权利要求6所述的方法,其特征在于,所述根据所述增强层编码参数得到所述当前帧的增强层信号,包括:
    根据所述增强层编码参数和所述兼容层编码参数确定所述增强层编码参数中的待解码增强层高频信号;
    对所述增强层编码参数中的待解码增强层高频信号进行解码,以得到所述当前帧的增强层信号。
  17. 根据权利要求6或7所述的方法,其特征在于,所述根据所述当前帧的增强层编码参数或增强层信号对所述当前帧的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号,包括:
    获取所述当前帧的兼容层信号中的兼容层解码信号和兼容层频带扩展信号;
    对所述兼容层频带扩展信号和所述当前帧的增强层信号进行组合处理,以得到所述当前帧的第二高频带信号。
  18. 根据权利要求17所述的方法,其特征在于,所述兼容层信号的频谱范围为[0,FL],其中,所述兼容层解码信号的频谱范围为[0,FT],所述兼容层频带扩展信号的频谱范围为[FT,FL];所述增强层信号的频谱范围为[FX,FY];所述音频输出信号的频谱范围为[0,FY];
    所述FL=FY,所述FX<=FT,所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
    所述FL=FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
    所述FL<FY,所述FX<=FT,确定所述音频输出信号通过如下方式确定:所述音频输出信号中频谱范围为[0,FT]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FT,FL]的信号通过所述兼容层信号和所述增强层信号得到;或者,
    所述FL<FY,所述FX>FT,确定所述音频输出信号通过如下方式确定:所述音频输出信 号中频谱范围为[0,FX]的信号通过所述兼容层信号得到,所述音频输出信号中频谱范围为[FX,FL]的信号通过所述兼容层信号和所述增强层信号得到。
  19. 根据权利要求6至18中任一项所述的方法,其特征在于,所述根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之后,所述方法还包括:
    对所述当前帧的音频输出信号进行后处理。
  20. 根据权利要求6至18中任一项所述的方法,其特征在于,所述根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号之前,所述方法还包括:
    根据所述兼容层信号获取后处理参数;
    使用所述后处理参数对所述增强层信号进行后处理,以得到完成所述后处理的增强层信号。
  21. 一种音频编码设备,其特征在于,所述音频编码设备,包括至少一个处理器,所述至少一个处理器用于与存储器耦合,读取并执行所述存储器中的指令,以实现如权利要求1至5中任一项所述的方法。
  22. 根据权利要求21所述的音频编码设备,其特征在于,所述音频编码设备还包括:所述存储器。
  23. 一种音频解码设备,其特征在于,所述音频解码设备,包括至少一个处理器,所述至少一个处理器用于与存储器耦合,读取并执行所述存储器中的指令,以实现如权利要求6至20中任一项所述的方法。
  24. 根据权利要求23所述的音频解码设备,其特征在于,所述音频解码设备还包括:所述存储器。
  25. 一种音频编码设备,其特征在于,所述音频编码设备包括:兼容层编码器、增强层编码器和码流复用器,其中,
    所述兼容层编码器,用于获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;根据所述高频带信号和所述低频带信号得到所述当前帧的兼容层编码参数;
    所述增强层编码器,用于获取音频信号的当前帧,所述当前帧包括:高频带信号和低频带信号;根据所述高频带信号得到所述当前帧的增强层编码参数;
    所述码流复用器,用于对所述兼容层编码参数和所述增强层编码参数进行码流复用,以得到编码码流。
  26. 一种音频解码设备,其特征在于,所述音频解码设备包括:码流解复用器、兼容层解码器、增强层解码器、适配处理器和组合器,其中,
    所述码流解复用器,用于获取编码码流;对所述编码码流进行码流解复用,以得到音频信号的当前帧的兼容层编码参数和所述当前帧的增强层编码参数;
    所述兼容层解码器,用于根据所述兼容层编码参数得到所述当前帧的兼容层信号,所述兼容层信号包括:所述当前帧的第一高频带信号和所述当前帧的第一低频带信号;
    所述增强层解码器,用于根据所述增强层编码参数得到所述当前帧的增强层信号;
    所述适配处理器,用于根据所述当前帧的增强层编码参数或增强层信号对所述当前帧 的第一高频带信号进行适配处理,以得到所述当前帧的第二高频带信号;
    所述组合器,用于根据所述当前帧的增强层信号、所述当前帧的第二高频带信号和所述当前帧的第一低频带信号得到所述当前帧的音频输出信号。
  27. 一种计算机可读存储介质,包括指令,当其在计算机上运行时,使得计算机执行如权利要求1至5、或者6至20任意一项所述的方法。
  28. 一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行如权利要求1至5、或者6至20任意一项所述的方法。
PCT/CN2021/070831 2020-01-10 2021-01-08 一种音频编解码方法和音频编解码设备 WO2021139757A1 (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
KR1020227025669A KR20220117332A (ko) 2020-01-10 2021-01-08 오디오 인코딩 방법 및 디바이스 그리고 오디오 디코딩 방법 및 디바이스
JP2022542238A JP7481457B2 (ja) 2020-01-10 2021-01-08 オーディオ符号化方法およびデバイスならびにオーディオ復号方法およびデバイス
EP21738625.9A EP4071756B1 (en) 2020-01-10 2021-01-08 Audio encoding method and device and audio decoding method and device
US17/857,725 US20220335962A1 (en) 2020-01-10 2022-07-05 Audio encoding method and device and audio decoding method and device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010028452.6 2020-01-10
CN202010028452.6A CN113113032B (zh) 2020-01-10 2020-01-10 一种音频编解码方法和音频编解码设备

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US17/857,725 Continuation US20220335962A1 (en) 2020-01-10 2022-07-05 Audio encoding method and device and audio decoding method and device

Publications (1)

Publication Number Publication Date
WO2021139757A1 true WO2021139757A1 (zh) 2021-07-15

Family

ID=76708692

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/070831 WO2021139757A1 (zh) 2020-01-10 2021-01-08 一种音频编解码方法和音频编解码设备

Country Status (6)

Country Link
US (1) US20220335962A1 (zh)
EP (1) EP4071756B1 (zh)
JP (1) JP7481457B2 (zh)
KR (1) KR20220117332A (zh)
CN (1) CN113113032B (zh)
WO (1) WO2021139757A1 (zh)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114333862B (zh) * 2021-11-10 2024-05-03 腾讯科技(深圳)有限公司 音频编码方法、解码方法、装置、设备、存储介质及产品

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105280190A (zh) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 带宽扩展编码和解码方法以及装置
CN105869653A (zh) * 2016-05-31 2016-08-17 华为技术有限公司 话音信号处理方法和相关装置和系统
WO2019148112A1 (en) * 2018-01-26 2019-08-01 Dolby International Ab Backward-compatible integration of high frequency reconstruction techniques for audio signals
CN110178180A (zh) * 2017-03-23 2019-08-27 杜比国际公司 用于音频信号的高频重建的谐波转置器的后向兼容集成
WO2019210068A1 (en) * 2018-04-25 2019-10-31 Dolby Laboratories Licensing Corporation Integration of high frequency reconstruction techniques with reduced post-processing delay
WO2020009842A1 (en) * 2018-07-03 2020-01-09 Qualcomm Incorporated Embedding enhanced audio transports in backward compatible audio bitstreams

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0878790A1 (en) * 1997-05-15 1998-11-18 Hewlett-Packard Company Voice coding system and method
SE512719C2 (sv) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion
US7711123B2 (en) * 2001-04-13 2010-05-04 Dolby Laboratories Licensing Corporation Segmenting audio signals into auditory events
CN1266673C (zh) * 2002-03-12 2006-07-26 诺基亚有限公司 可伸缩音频编码的有效改进
JP3881943B2 (ja) * 2002-09-06 2007-02-14 松下電器産業株式会社 音響符号化装置及び音響符号化方法
JP4977472B2 (ja) * 2004-11-05 2012-07-18 パナソニック株式会社 スケーラブル復号化装置
KR100707186B1 (ko) * 2005-03-24 2007-04-13 삼성전자주식회사 오디오 부호화 및 복호화 장치와 그 방법 및 기록 매체
KR100818268B1 (ko) * 2005-04-14 2008-04-02 삼성전자주식회사 오디오 데이터 부호화 및 복호화 장치와 방법
US8285555B2 (en) * 2006-11-21 2012-10-09 Samsung Electronics Co., Ltd. Method, medium, and system scalably encoding/decoding audio/speech
CN101325059B (zh) * 2007-06-15 2011-12-21 华为技术有限公司 语音编解码收发方法及装置
US7885819B2 (en) * 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
US8532998B2 (en) * 2008-09-06 2013-09-10 Huawei Technologies Co., Ltd. Selective bandwidth extension for encoding/decoding audio/speech signal
CN102081927B (zh) * 2009-11-27 2012-07-18 中兴通讯股份有限公司 一种可分层音频编码、解码方法及系统
US8447617B2 (en) * 2009-12-21 2013-05-21 Mindspeed Technologies, Inc. Method and system for speech bandwidth extension
US8442837B2 (en) * 2009-12-31 2013-05-14 Motorola Mobility Llc Embedded speech and audio coding using a switchable model core
CN102737636B (zh) * 2011-04-13 2014-06-04 华为技术有限公司 一种音频编码方法及装置
EP3544006A1 (en) * 2011-11-11 2019-09-25 Dolby International AB Upsampling using oversampled sbr
CN103165135B (zh) * 2013-03-04 2015-03-25 深圳广晟信源技术有限公司 一种数字音频粗分层编码方法和装置
CN103413553B (zh) * 2013-08-20 2016-03-09 腾讯科技(深圳)有限公司 音频编码方法、音频解码方法、编码端、解码端和系统

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105280190A (zh) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 带宽扩展编码和解码方法以及装置
CN105869653A (zh) * 2016-05-31 2016-08-17 华为技术有限公司 话音信号处理方法和相关装置和系统
CN110178180A (zh) * 2017-03-23 2019-08-27 杜比国际公司 用于音频信号的高频重建的谐波转置器的后向兼容集成
WO2019148112A1 (en) * 2018-01-26 2019-08-01 Dolby International Ab Backward-compatible integration of high frequency reconstruction techniques for audio signals
WO2019210068A1 (en) * 2018-04-25 2019-10-31 Dolby Laboratories Licensing Corporation Integration of high frequency reconstruction techniques with reduced post-processing delay
WO2020009842A1 (en) * 2018-07-03 2020-01-09 Qualcomm Incorporated Embedding enhanced audio transports in backward compatible audio bitstreams

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4071756A4

Also Published As

Publication number Publication date
JP7481457B2 (ja) 2024-05-10
JP2023509548A (ja) 2023-03-08
CN113113032B (zh) 2024-08-09
EP4071756A1 (en) 2022-10-12
EP4071756B1 (en) 2024-09-25
KR20220117332A (ko) 2022-08-23
US20220335962A1 (en) 2022-10-20
CN113113032A (zh) 2021-07-13
EP4071756A4 (en) 2023-01-11

Similar Documents

Publication Publication Date Title
US10522168B2 (en) Audio signal synthesizer and audio signal encoder
TWI497485B (zh) 用以重塑經合成輸出音訊信號之時域包絡以更接近輸入音訊信號之時域包絡的方法
JP4918490B2 (ja) エネルギー整形装置及びエネルギー整形方法
US9530424B2 (en) Upsampling using oversampled SBR
JPWO2004010415A1 (ja) オーディオ復号装置と復号方法およびプログラム
JP2007528025A (ja) オーディオ配信システム、オーディオエンコーダ、オーディオデコーダ、及びそれらの動作方法
WO2021143694A1 (zh) 一种音频编解码方法和音频编解码设备
WO2021208792A1 (zh) 音频信号编码方法、解码方法、编码设备以及解码设备
JP2021507316A (ja) オーディオ信号の高周波再構成技術の後方互換性のある統合
WO2021143692A1 (zh) 一种音频编解码方法和音频编解码设备
WO2021244417A1 (zh) 一种音频编码方法和音频编码装置
US20230040515A1 (en) Audio signal coding method and apparatus
EP2360684B1 (en) Audio reproducing device and audio reproducing method
WO2021139757A1 (zh) 一种音频编解码方法和音频编解码设备
WO2021143691A1 (zh) 一种音频编解码方法和音频编解码设备
JP2004053940A (ja) オーディオ復号化装置およびオーディオ復号化方法
US20230154473A1 (en) Audio coding method and related apparatus, and computer-readable storage medium
US10762910B2 (en) Hierarchical fine quantization for audio coding
JP5943982B2 (ja) オーディオ再生装置及びオーディオ再生方法
CN117935822A (zh) 音频编码方法、装置、介质、设备和程序产品

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21738625

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2022542238

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2021738625

Country of ref document: EP

Effective date: 20220708

ENP Entry into the national phase

Ref document number: 20227025669

Country of ref document: KR

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE