WO2021143691A1 - 一种音频编解码方法和音频编解码设备 - Google Patents
一种音频编解码方法和音频编解码设备 Download PDFInfo
- Publication number
- WO2021143691A1 WO2021143691A1 PCT/CN2021/071327 CN2021071327W WO2021143691A1 WO 2021143691 A1 WO2021143691 A1 WO 2021143691A1 CN 2021071327 W CN2021071327 W CN 2021071327W WO 2021143691 A1 WO2021143691 A1 WO 2021143691A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frequency region
- parameter
- component
- current frequency
- current
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G10L19/0208—Subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0017—Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
Definitions
- This application relates to the technical field of audio signal coding and decoding, and in particular to an audio coding and decoding method and audio coding and decoding equipment.
- the embodiments of the present application provide an audio coding and decoding method and an audio coding and decoding device, which can improve the quality of decoded audio signals.
- an audio encoding method comprising: obtaining a current frame of an audio signal, the current frame including a high frequency band signal; obtaining the high frequency band of the current frame according to the high frequency band signal Parameters, the high-frequency band parameters are used to indicate the position, quantity, and amplitude or energy of the tonal components included in the high-frequency band signal; code stream multiplexing is performed on the high-frequency band coding parameters to obtain a coded bit stream.
- the high-frequency band parameters include a position quantity parameter of a pitch component, and an amplitude parameter or an energy parameter of the pitch component.
- the high frequency band corresponding to the high frequency band signal includes at least one frequency region, one frequency region includes at least one subband, and Obtaining the high-frequency band parameters of the current frame according to the high-frequency band signal includes: determining the position quantity parameter of the tonal component of the current frequency region according to the high-frequency band signal of the current frequency region in the at least one frequency region And the amplitude parameter or energy parameter of the tonal component in the current frequency region.
- the method includes: determining whether a tonal component is included in the current frequency region; when a tonal component is included in the current frequency region, according to The high-band signal of the current frequency region in the at least one frequency region determines the position quantity parameter of the tone component of the current frequency region and the amplitude parameter or energy parameter of the tone component of the current frequency region.
- the high-band parameters of the current frame further include tone component indication information, and the tone component indication information is used to indicate the current frequency region Whether to include tonal components.
- the determination of the tone component of the current frequency region based on the high-band signal of the current frequency region in the at least one frequency region includes: performing a peak search in the current frequency region according to a high-band signal of the current frequency region in the at least one frequency region to obtain At least one of peak quantity information, peak position information, and peak amplitude information of the current area; determining the current frequency area according to at least one of peak quantity information, peak position information, and peak amplitude information The position quantity parameter of the tone component in the frequency region and the amplitude parameter or energy parameter of the tone component in the current frequency region.
- a peak search is performed in the current frequency region according to the high-band signal of the current frequency region in the at least one frequency region to obtain
- the at least one of the peak quantity information, peak position information, and peak amplitude information of the current region includes: according to at least one of a power spectrum, an energy spectrum, or an amplitude spectrum of the current frequency region in the at least one frequency region.
- a peak search is performed in the current frequency region to obtain at least one of peak quantity information, peak position information, and peak amplitude information in the current region.
- the current frequency region is determined according to at least one of peak quantity information, peak position information, and peak amplitude information.
- the position quantity parameter of the tone component of the frequency region and the amplitude parameter or energy parameter of the tone component of the current frequency region include: according to at least one of peak quantity information, peak position information, and peak amplitude information of the current frequency region, Determine the position information, quantity information and amplitude information of the tonal components in the current frequency region; determine the position and quantity parameters of the tonal components in the current frequency region according to the position information, quantity information and amplitude information of the tonal components in the current frequency region And the amplitude parameter or energy parameter of the tonal component in the current frequency region.
- the position quantity parameter of the pitch component of the current frequency region includes N bits, where N is the number of subbands included in the current frequency region , The N bits have a one-to-one correspondence with the subbands included in the current frequency region; wherein, if the first subband included in the current frequency region has a peak, then the N bits and the first subband have a peak value.
- the value of the bit corresponding to a subband is the first value; or if there is no peak in the second subband included in the current frequency region, the bit corresponding to the second subband among the N bits
- the value of is a second value, and the first value is different from the second value.
- the position quantity parameter of the pitch component of the current frequency region includes N bits, where N is the number of subbands included in the current frequency region , The N bits have a one-to-one correspondence with the sub-bands included in the current frequency region; wherein, if the first sub-band included in the current frequency region has a tonal component, the N bits and the sub-bands The value of the bit corresponding to the first subband is the first value; or if the second subband included in the current frequency region does not have a tonal component, then among the N bits corresponding to the second subband The value of the bit position is a second value, and the first value is different from the second value.
- the high-band parameter further includes a noise floor parameter of the high-band signal.
- an audio decoding method including: obtaining an encoded bitstream; demultiplexing the encoded bitstream to obtain the high-frequency band parameters of the current frame of the audio signal.
- the parameter is used to indicate the position, quantity, and amplitude or energy of the tonal components included in the high-frequency signal of the current frame; obtain the reconstructed high-frequency signal of the current frame according to the high-frequency parameter; The reconstructed high-band signal obtains the audio output signal of the current frame.
- the high-frequency band parameters include a position quantity parameter of a pitch component of the high-frequency signal of the current frame and an amplitude parameter or an energy parameter of the pitch component.
- the high frequency band corresponding to the high frequency band signal includes at least one frequency region, and one frequency region includes at least one subband;
- the high-frequency band parameter includes the position quantity parameter of the pitch component of the high-frequency signal of the current frame, and the parameter includes the position quantity parameter of the respective pitch component of the at least one frequency region, and the amplitude of the pitch component of the high-frequency signal of the current frame.
- the parameters or energy parameters include the amplitude parameters or energy parameters of the respective tonal components of the at least one frequency region.
- the demultiplexing the code stream to obtain the high frequency band parameters of the current frame of the audio signal includes: obtaining The position quantity parameter of the tone component of the current frequency region of the at least one frequency region; and the amplitude parameter of the tone component of the current frequency region is analyzed from the coded code stream according to the position quantity parameter of the tone component of the current frequency region Or energy parameters.
- the pitch of the current frequency region is parsed from the code stream according to the position quantity parameter of the pitch component of the current frequency region
- the amplitude parameter or energy parameter of the component includes: determining the quantity parameter of the pitch component of the current frequency region according to the position quantity parameter of the pitch component of the current frequency region; according to the quantity parameter of the pitch component of the current frequency region, Analyze the amplitude parameter or the energy parameter of the tonal component of the current frequency region from the coded code stream.
- the demultiplexing the code stream to obtain the high frequency band parameters of the current frame of the audio signal includes: obtaining The position quantity parameter of the tonal component in the current frequency area of the at least one frequency area; and determine the position parameter of the tonal component in the current frequency area and the value of the tonal component in the current frequency area according to the position quantity parameter of the tonal component in the current frequency area Quantity parameter; parse the amplitude parameter or energy parameter of the tonal component of the current frequency region from the code stream according to the quantity parameter of the tonal component of the current frequency region.
- the method before acquiring the position quantity parameter of the tonal components of the current frequency region of the at least one frequency region, the method includes: acquiring the information of the current frequency region Tonal component indication information; the tonal component indication information is used to indicate whether a tonal component is included in the current frequency region; when a tonal component is included in the current frequency region, the tone of the current frequency region of the at least one frequency region is acquired The position quantity parameter of the component.
- the acquiring the position quantity parameter of the tonal components of the current frequency region of the at least one frequency region includes: according to the current frequency region included The number of subbands read N bits from the code stream, where the N bits are the position quantity parameter of the pitch component of the current frequency region, where N is the subband included in the current frequency region The number of the N bits corresponds to the subbands included in the current frequency region in a one-to-one correspondence.
- the obtaining the reconstructed high-band signal of the current frame according to the high-band parameter includes: according to the pitch of the current frequency region The position quantity parameter of the component determines the position of the tonal component in the current frequency region; the amplitude or energy corresponding to the position of the tonal component is determined according to the amplitude parameter or energy parameter of the tonal component in the current frequency region; according to the current frequency The position of the tone component in the area and the amplitude or energy corresponding to the position of the tone component obtain the reconstructed high-band signal.
- the position of the tonal component in the current frequency region is determined according to the position quantity parameter of the tonal component of the high-frequency signal in the current frequency region
- the method includes: determining the position parameter of the tonal component of the current frequency region according to the position quantity parameter of the tonal component of the high-frequency signal in the current frequency region; and determining the position parameter of the tonal component of the current frequency region The position of the tonal component in the frequency region.
- the obtaining the reconstructed high-band signal of the current frame according to the high-band parameter includes: according to the pitch of the current frequency region The position parameter of the component determines the position of the tonal component in the current frequency region; the amplitude or energy corresponding to the position of the tonal component is determined according to the amplitude parameter or the energy parameter of the tonal component in the current frequency region; according to the current frequency region The position of the mid-tone component and the amplitude or energy corresponding to the position of the tonal component obtain the reconstructed high-band signal.
- the position parameter of the tonal component in the current frequency region is used to indicate the sequence number of the subband including the tonal component in the current frequency region.
- the position of the tone component in the current frequency region is located at a designated position in the subband where the tone component is located in the current frequency region.
- the designated position of the sub-band is the center position of the sub-band.
- the reconstructed high frequency band is obtained according to the position of the tone component in the current frequency region and the amplitude corresponding to the position of the tone component
- the signal includes: the frequency domain signal that determines the position of the tonal component according to the following calculation formula:
- pSpectralData represents the reconstructed high-band frequency domain signal in the current frequency region
- tone_val represents the amplitude value corresponding to the position of the tone component in the current frequency region
- tone_pos represents the position of the tone component in the current frequency region
- an audio encoder including: a signal acquisition unit for acquiring a current frame of an audio signal, where the current frame includes a high-band signal; and a parameter acquisition unit for acquiring a signal based on the high-band signal Obtain the high-frequency band parameters of the current frame, where the high-frequency band parameters are used to indicate the position, quantity, and amplitude or energy of the tonal components included in the high-frequency band signal; The coding parameters are coded stream multiplexed to obtain the coded code stream.
- the high-band parameters include a position quantity parameter of a pitch component, and an amplitude parameter or an energy parameter of the pitch component.
- the high frequency band corresponding to the high frequency band signal includes at least one frequency region, and one frequency region includes at least one subband;
- the parameter acquisition unit is specifically configured to determine the position quantity parameter of the tonal component in the current frequency region and the amplitude of the tonal component in the current frequency region according to the high-band signal of the current frequency region in the at least one frequency region Parameters or energy parameters.
- the audio encoder further includes: a determining unit, configured to determine whether tonal components are included in the current frequency region; and the parameter acquiring unit , Specifically used to determine the position quantity parameter of the tonal component in the current frequency region and the tonal component in the current frequency region according to the high-band signal of the current frequency region in the at least one frequency region when the tonal component is included in the current frequency region.
- the amplitude parameter or energy parameter of the tonal component in the current frequency region is Specifically used to determine the position quantity parameter of the tonal component in the current frequency region and the tonal component in the current frequency region according to the high-band signal of the current frequency region in the at least one frequency region when the tonal component is included in the current frequency region.
- the high-band parameters of the current frame further include tone component indication information, and the tone component indication information is used to indicate the current frequency region Whether to include tonal components.
- the parameter acquisition unit is specifically configured to: according to the high-frequency band signal in the current frequency region in the at least one frequency region, Perform a peak search in the current frequency region to obtain at least one of peak number information, peak position information, and peak amplitude information of the current region; according to the peak number information, peak position information, and peak amplitude information of the current frequency region At least one of determining the position quantity parameter of the tone component in the current frequency region and the amplitude parameter or energy parameter of the tone component in the current frequency region.
- the parameter acquisition unit is specifically configured to: according to the power spectrum, energy spectrum or amplitude of the current frequency region in the at least one frequency region At least one of the spectrums performs a peak search in the current frequency region to obtain at least one of peak number information, peak position information, and peak amplitude information in the current region.
- the parameter acquisition unit is specifically configured to: according to the peak number information, peak position information, and peak amplitude information of the current frequency region At least one of determining the position information, quantity information, and amplitude information of the tonal components of the current frequency region; and determining the tonal components of the current frequency region according to the position information, quantity information, and amplitude information of the tonal components of the current frequency region The position quantity parameter and the amplitude parameter or energy parameter of the tonal component of the current frequency region.
- the position quantity parameter of the pitch component of the current frequency region includes N bits, where N is the number of subbands included in the current frequency region , The N bits have a one-to-one correspondence with the subbands included in the current frequency region; wherein, if there is a peak in the first subband included in the current frequency region, the The value of the bit corresponding to a subband is the first value; or if there is no peak in the second subband included in the current frequency region, the bit corresponding to the second subband among the N bits The value of is a second value, and the first value is different from the second value.
- the position quantity parameter of the pitch component of the current frequency region includes N bits, where N is the number of subbands included in the current frequency region , The N bits have a one-to-one correspondence with the sub-bands included in the current frequency region; wherein, if the first sub-band included in the current frequency region has a tonal component, the N bits and the sub-bands The value of the bit corresponding to the first subband is the first value; or if the second subband included in the current frequency region does not have a tonal component, then among the N bits corresponding to the second subband The value of the bit position is a second value, and the first value is different from the second value.
- the high-band parameter further includes a noise floor parameter of the high-band signal.
- a fourth aspect provides an audio decoder, including: a receiving unit for obtaining a coded stream; a demultiplexing unit for demultiplexing the coded stream to obtain the current frame of the audio signal
- the high-band parameters are used to indicate the position, number, and amplitude or energy of the tonal components included in the high-band signal of the current frame; the reconstruction unit is used to obtain the high-band parameters according to the high-band parameters.
- the reconstructed high-band signal of the current frame; and the audio output signal of the current frame is obtained according to the reconstructed high-band signal of the current frame.
- the high-frequency band parameter includes a position quantity parameter of a pitch component of the high-frequency signal of the current frame and an amplitude parameter or an energy parameter of the pitch component.
- the high frequency band corresponding to the high frequency band signal includes at least one frequency region, and one frequency region includes at least one subband;
- the high-frequency band parameter includes the position quantity parameter of the pitch component of the high-frequency signal of the current frame, and the parameter includes the position quantity parameter of the respective pitch component of the at least one frequency region, and the amplitude of the pitch component of the high-frequency signal of the current frame.
- the parameters or energy parameters include the amplitude parameters or energy parameters of the respective tonal components of the at least one frequency region.
- the demultiplexing unit is specifically configured to: obtain the position quantity parameter of the tonal component of the current frequency region of the at least one frequency region; Analyze the amplitude parameter or the energy parameter of the tonal component in the current frequency region from the code stream according to the position quantity parameter of the tonal component in the current frequency region.
- the demultiplexing unit is specifically configured to: determine the current frequency according to the position quantity parameter of the tone component in the current frequency region The quantity parameter of the tonal component of the region; according to the quantity parameter of the tonal component of the current frequency region, the amplitude parameter or the energy parameter of the tonal component of the current frequency region is parsed from the coded code stream.
- the demultiplexing unit is specifically configured to: obtain the position quantity parameter of the tonal component of the current frequency region of the at least one frequency region; Determine the position parameter of the tonal component in the current frequency region and the quantitative parameter of the tonal component in the current frequency region according to the position quantity parameter of the tonal component in the current frequency region; and determine the position parameter of the tonal component in the current frequency region according to the quantity parameter of the tonal component in the current frequency region from the Analyze the amplitude parameter or energy parameter of the tonal component in the current frequency region in the coded stream.
- the demultiplexing unit is specifically configured to: obtain the tone component indication information of the current frequency region; the tone component indication information is used for To indicate whether tonal components are included in the current frequency region; when the current frequency region includes tonal components, obtain the position quantity parameter of the tonal components in the current frequency region of the at least one frequency region.
- the demultiplexing unit is specifically configured to: read from the code stream according to the number of subbands included in the current frequency region Take N bits, the N bits are the position quantity parameter of the pitch component of the current frequency region, where N is the number of subbands included in the current frequency region, and the N bits are the same as the The subbands included in the current frequency region have a one-to-one correspondence.
- the reconstruction unit is specifically configured to: determine the tone in the current frequency region according to the position quantity parameter of the tone component in the current frequency region The position of the component; determine the amplitude or energy corresponding to the position of the tonal component according to the amplitude parameter or energy parameter of the tonal component in the current frequency region; according to the position of the tonal component in the current frequency region and the position of the tonal component The corresponding amplitude or energy obtains the reconstructed high frequency band signal.
- the reconstruction unit is specifically configured to: determine the position quantity parameter of the tonal component of the high-frequency signal in the current frequency region The position parameter of the tonal component in the current frequency area; and the position of the tonal component in the current frequency area is determined according to the position parameter of the tonal component in the current frequency area.
- the reconstruction unit is specifically configured to: determine the tone in the current frequency region according to the position parameter of the tone component in the current frequency region Component position; determine the amplitude or energy corresponding to the position of the tonal component according to the amplitude parameter or energy parameter of the tonal component in the current frequency region; according to the position of the tonal component in the current frequency region and the position of the tonal component corresponding The amplitude or energy of the reconstructed high-band signal is obtained.
- the position parameter of the tonal component in the current frequency region is used to indicate the sequence number of the subband including the tonal component in the current frequency region.
- the position of the tonal component in the current frequency region is located at a designated position in the subband where the tonal component is located in the current frequency region.
- the designated position of the sub-band is the center position of the sub-band.
- the reconstructed high frequency band is obtained according to the position of the tone component in the current frequency region and the amplitude corresponding to the position of the tone component
- the signal includes: the frequency domain signal that determines the position of the tonal component according to the following calculation formula:
- pSpectralData represents the reconstructed high-band frequency domain signal in the current frequency region
- tone_val represents the amplitude value corresponding to the position of the tone component in the current frequency region
- tone_pos represents the position of the tone component in the current frequency region
- the embodiments of the present application provide a computer-readable storage medium that stores instructions in the computer-readable storage medium, which when run on a computer, causes the computer to execute the above-mentioned first or second aspect. The method described.
- the embodiments of the present application provide a computer program product containing instructions, which when run on a computer, cause the computer to execute the method described in the first aspect or the second aspect.
- an embodiment of the present application provides an audio encoder, including a processor and a memory; the memory is used to store instructions; the processor is used to execute the instructions in the memory, so that the audio encoding
- the device performs any one of the methods of the aforementioned first aspect.
- an embodiment of the present application provides an audio decoder, including a processor and a memory; the memory is used to store instructions; the processor is used to execute the instructions in the memory, so that the audio decoding
- the device executes any one of the methods of the aforementioned second aspect.
- an embodiment of the present application provides a communication device.
- the communication device may include entities such as audio codec equipment or a chip.
- the communication device includes a processor and optionally a memory; the memory is used for Storing instructions; the processor is configured to execute the instructions in the memory, so that the communication device executes the method according to any one of the foregoing first aspect or second aspect.
- this application provides a chip system that includes a processor for supporting audio codec devices to implement the functions involved in the above aspects, for example, sending or processing the data and/or involved in the above methods Or information.
- the chip system further includes a memory, and the memory is used to store necessary program instructions and data of the audio codec device.
- the chip system can be composed of chips, and can also include chips and other discrete devices.
- the audio encoder in the embodiment of the present invention encodes the position, quantity, and amplitude or energy of the tonal components in the high-band signal, so that the audio decoder recovers according to the position, quantity, and amplitude or energy of the tonal components
- the tonal component makes the position and energy of the restored tonal component more accurate, thereby improving the quality of the decoded signal.
- FIG. 1 is a schematic structural diagram of an audio codec system provided by an embodiment of the application
- FIG. 2 is a schematic flowchart of an audio coding method provided by an embodiment of the application
- FIG. 3 is a schematic flowchart of an audio decoding method provided by an embodiment of this application.
- FIG. 4 is a schematic diagram of a mobile terminal according to an embodiment of the application.
- Fig. 5 is a schematic diagram of a network element according to an embodiment of the application.
- FIG. 6 is a schematic diagram of the composition structure of an audio coding device provided by an embodiment of the application.
- FIG. 7 is a schematic diagram of the composition structure of an audio decoding device provided by an embodiment of the application.
- FIG. 8 is a schematic diagram of the composition structure of another audio coding device provided by an embodiment of the application.
- FIG. 9 is a schematic diagram of the composition structure of another audio decoding device provided by an embodiment of the application.
- the audio signal in the embodiment of the present application refers to the input signal in the audio encoding device.
- the audio signal may include multiple frames.
- the current frame may specifically refer to a certain frame in the audio signal.
- the current frame The audio signal coding and decoding are illustrated by examples.
- the previous frame or the next frame of the current frame in the audio signal can be coded and decoded according to the coding and decoding mode of the current frame audio signal.
- the audio signal in the embodiment of the present application may be a mono audio signal, or may also be a stereo signal.
- the stereo signal can be the original stereo signal, it can also be a stereo signal composed of two signals (left channel signal and right channel signal) included in the multi-channel signal, or it can be composed of the multi-channel signal.
- Fig. 1 is a schematic structural diagram of an audio coding and decoding system according to an exemplary embodiment of the application.
- the audio codec system includes an encoding component 110 and a decoding component 120.
- the encoding component 110 is used to encode the current frame (audio signal) in the frequency domain or the time domain.
- the encoding component 110 can be implemented by software; alternatively, it can also be implemented by hardware; or, it can also be implemented by a combination of software and hardware, which is not limited in the embodiments of the present application.
- the encoding component 110 encodes the current frame in the frequency domain or the time domain, in a possible implementation manner, the steps shown in FIG. 2 may be included.
- the encoding component 110 can generate an encoded bitstream after encoding is completed, and the encoding component 110 can send the encoded bitstream to the decoding component 120, so that the decoding component 120 can receive the encoded bitstream, and then decode the encoded bitstream.
- the component 120 obtains the audio output signal from the coded stream.
- the encoding method shown in FIG. 2 is only an example and not a limitation.
- the embodiment of the present application does not limit the execution order of the steps in FIG. 2 and the encoding method shown in FIG. 2 may also include more Or fewer steps, which are not limited in the embodiments of the present application.
- the encoding component 110 and the decoding component 120 may be connected in a wired or wireless manner, and the decoding component 120 may obtain the encoded bitstream generated by the encoding component 110 through the connection between the encoding component 110 and the encoding component 110; or, the encoding component 110 may The generated code stream is stored in the memory, and the decoding component 120 reads the code stream in the memory.
- the decoding component 120 can be implemented by software; alternatively, it can also be implemented by hardware; or, it can also be implemented by a combination of software and hardware, which is not limited in the embodiment of the present application.
- the decoding component 120 decodes the current frame (audio signal) in the frequency domain or the time domain, in a possible implementation manner, the steps shown in FIG. 3 may be included.
- the encoding component 110 and the decoding component 120 can be provided in the same device; or, they can also be provided in different devices.
- the device can be a terminal with audio signal processing functions such as mobile phones, tablet computers, laptop computers and desktop computers, Bluetooth speakers, voice recorders, wearable devices, etc., or it can be a core network or wireless network with audio signal processing capabilities This embodiment does not limit this.
- the encoding component 110 is installed in the mobile terminal 130
- the decoding component 120 is installed in the mobile terminal 140.
- the mobile terminal 130 and the mobile terminal 140 are independent of each other and have audio signal processing capabilities.
- the electronic device may be a mobile phone, a wearable device, a virtual reality (VR) device, or an augmented reality (AR) device, etc., and the mobile terminal 130 and the mobile terminal 140 are connected wirelessly or wiredly. Take network connection as an example.
- the mobile terminal 130 may include an acquisition component 131, an encoding component 110, and a channel encoding component 132, where the acquisition component 131 is connected to the encoding component 110, and the encoding component 110 is connected to the encoding component 132.
- the mobile terminal 140 may include an audio playing component 141, a decoding component 120, and a channel decoding component 142.
- the audio playing component 141 is connected to the decoding component 120
- the decoding component 120 is connected to the channel decoding component 142.
- the mobile terminal 130 After the mobile terminal 130 collects the audio signal through the collection component 131, it encodes the audio signal through the encoding component 110 to obtain an encoded code stream; then, the channel encoding component 132 encodes the encoded code stream to obtain a transmission signal.
- the mobile terminal 130 transmits the transmission signal to the mobile terminal 140 through a wireless or wired network.
- the mobile terminal 140 After receiving the transmission signal, the mobile terminal 140 decodes the transmission signal through the channel decoding component 142 to obtain a code stream; decodes the code stream through the decoding component 110 to obtain an audio signal; and plays the audio signal through the audio playback component. It can be understood that the mobile terminal 130 may also include components included in the mobile terminal 140, and the mobile terminal 140 may also include components included in the mobile terminal 130.
- the encoding component 110 and the decoding component 120 are provided in a network element 150 capable of processing audio signals in the same core network or wireless network as an example for description.
- the network element 150 includes a channel decoding component 151, a decoding component 120, an encoding component 110, and a channel encoding component 152.
- the channel decoding component 151 is connected to the decoding component 120
- the decoding component 120 is connected to the encoding component 110
- the encoding component 110 is connected to the channel encoding component 152.
- the channel decoding component 151 After the channel decoding component 151 receives the transmission signal sent by other devices, it decodes the transmission signal to obtain the first coded code stream; the decoding component 120 decodes the coded code stream to obtain the audio signal; the coding component 110 performs the decoding on the audio signal Encode to obtain a second coded code stream; use the channel coding component 152 to encode the second coded code stream to obtain a transmission signal.
- the other device may be a mobile terminal with audio signal processing capability; or, it may also be other network elements with audio signal processing capability, which is not limited in this embodiment.
- the encoding component 110 and the decoding component 120 in the network element can transcode the encoded code stream sent by the mobile terminal.
- the device installed with the encoding component 110 may be referred to as an audio encoding device.
- the audio encoding device may also have an audio decoding function, which is not limited in the implementation of this application.
- the device installed with the decoding component 120 may be referred to as an audio decoding device.
- the audio decoding device may also have an audio encoding function, which is not limited in the implementation of this application.
- Figure 2 describes the flow of an audio coding method provided by an embodiment of the present invention, including:
- the current frame can be any frame in the audio signal, and the current frame can include a high-band signal and a low-band signal.
- the division of the high-band signal and the low-band signal can be determined by the frequency band threshold, which is higher than the frequency band threshold.
- the frequency band threshold signal is a high frequency band signal, and the signal below the frequency band threshold value is a low frequency band signal.
- the frequency band threshold can be determined according to the transmission bandwidth, the data processing capability of the encoding component 110 and the decoding component 120, and it will not be done here. limited.
- the high-band signal and the low-band signal are relative. For example, a signal below a certain frequency is a low-band signal, but a signal above this frequency is a high-band signal (the signal corresponding to the frequency can be classified into the low-band Signals can also be assigned to high-band signals).
- the frequency varies according to the bandwidth of the current frame. For example, when the current frame is a 0-8khz wideband signal, the frequency may be 4khz; when the current frame is a 0-16khz ultra-wideband signal, the frequency may be 8khz.
- the high-frequency band parameter is used to indicate the position, quantity, and amplitude or energy of the tonal components included in the high-frequency band signal.
- the high frequency band parameters include a position quantity parameter of the tone component, and an amplitude parameter or an energy parameter of the tone component.
- the number of positions parameter indicates that the position of the tonal component and the number of tonal components are represented by the same parameter.
- the high-band parameters include the position parameter of the tonal component, the quantity parameter of the tonal component, and the amplitude parameter or energy parameter of the tonal component; in this case, the position and quantity of the tonal component are different. The parameter representation.
- the high frequency band corresponding to the high frequency band signal includes at least one frequency region (Tile), and one frequency region includes at least one subband.
- the high-band parameter of the current frame includes: determining the position quantity parameter of the tone component of the current frequency region and the tone component of the current frequency region according to the high-band signal of the current frequency region in the at least one frequency region The amplitude parameter or energy parameter.
- the method includes: determining whether a tonal component is included in the current frequency region; when a tonal component is included in the current frequency region, according to the current frequency region in the at least one frequency region For a high-band signal, the position quantity parameter of the tonal component in the current frequency region and the amplitude parameter or energy parameter of the tonal component in the current frequency region are determined. In this way, only the parameters of the frequency region with tonal components are acquired, thereby improving the coding efficiency.
- the high-band parameters of the current frame further include pitch component indication information, and the pitch component indication information is used to indicate whether a pitch component is included in the current frequency region.
- the audio decoder to perform decoding according to the indication information, which improves decoding efficiency.
- the position quantity parameter of the tone component of the current frequency region and the tone component of the current frequency region are determined according to the high-band signal of the current frequency region in the at least one frequency region
- the amplitude parameter or energy parameter includes: performing a peak search in the current frequency region according to the high-band signal of the current frequency region in the at least one frequency region, to obtain peak quantity information and peak position information of the current region And at least one of peak amplitude information; according to at least one of peak number information, peak position information, and peak amplitude information in the current frequency region, determine the position quantity parameter of the tonal component in the current frequency region and the The amplitude parameter or energy parameter of the tonal component in the current frequency region.
- the high-band signal for peak search may be a frequency domain signal or a time domain signal.
- the peak search may be specifically performed according to at least one of a power spectrum, an energy spectrum, or an amplitude spectrum of the current frequency region.
- the position quantity parameter of the tonal component of the current frequency region and the position quantity parameter of the tonal component of the current frequency region and the peak amplitude information are determined according to at least one of the peak quantity information, peak position information, and peak amplitude information of the current frequency region.
- the amplitude parameter or energy parameter of the tonal component in the current frequency region includes: determining the position information of the tonal component in the current frequency region according to at least one of peak quantity information, peak position information, and peak amplitude information in the current frequency region , Quantity information and amplitude information; according to the position information, quantity information and amplitude information of the tonal component of the current frequency region, determine the position quantity parameter of the tonal component of the current frequency region and the amplitude parameter of the tone component of the current frequency region Or energy parameters.
- the position quantity parameter of the pitch component of the current frequency region includes N bits, where N is the number of subbands included in the current frequency region, and the N bits are related to the current frequency.
- the subbands included in the region have a one-to-one correspondence; wherein, if the first subband included in the current frequency region has a peak, the value of the bit corresponding to the first subband among the N bits is the first Value; or if there is no peak in the second subband included in the current frequency region, the value of the bit corresponding to the second subband in the N bits is the second value, and the first value Different from the second value.
- the position quantity parameter of the pitch component of the current frequency region includes N bits, where N is the number of subbands included in the current frequency region, and the N bits are related to the current frequency.
- the subbands included in the region have a one-to-one correspondence; wherein, if the first subband included in the current frequency region has a tonal component, the value of the bit corresponding to the first subband in the N bits is the first Or if the second subband included in the current frequency region does not have tonal components, the value of the bit corresponding to the second subband in the N bits is the second value, and the first One value is different from the second value.
- the high-band parameters may further include a noise floor parameter of the high-band signal.
- the audio encoding method may include the following procedures:
- the high frequency band parameters include position parameters, quantity parameters, and amplitude parameters of tonal components.
- the high-band parameter which may specifically be:
- peak search according to the power spectrum of the high-frequency signal to obtain peak number information, peak position information, and peak amplitude information.
- the embodiment of the present invention does not limit the specific way of peak search. For example, if the value of the power spectrum corresponding to the current frequency point is significantly different from the value of the power spectrum corresponding to the left and right adjacent frequency points, the frequency point is the peak value.
- filtering is performed according to at least one of the peak position, the peak amplitude, and the number of peaks to determine the position parameter, the quantity parameter, and the amplitude parameter of the tonal component.
- filtering based on the peak amplitude may be: the peak amplitude is greater than a preset threshold value as a preset condition.
- the number of peaks that meet the preset condition can be used as the number parameter of the tonal components.
- the corresponding peak position is used as the position parameter of the tonal component, or the position parameter of the tonal component is determined according to the corresponding peak position.
- the subband sequence number corresponding to the peak position is obtained according to the corresponding peak position, and the subband sequence number corresponding to the peak position is used as the position parameter of the pitch component.
- the corresponding peak amplitude is used as the amplitude parameter of the tonal component or the amplitude parameter of the tonal component is determined according to the corresponding peak amplitude.
- the peak amplitude can be characterized by the energy of the frequency domain signal or the power of the frequency domain signal.
- the amplitude parameter of the tonal component can be replaced with the energy parameter of the tonal component as the high-frequency band parameter.
- the high frequency band is divided into K frequency regions (tile), and each frequency region is divided into N subbands.
- the determination of the high-band parameters based on the high-band signals can also be performed in various frequency regions.
- K and N are both integers greater than or equal to 1.
- the high frequency band parameters include the position quantity parameter and the amplitude parameter of the tonal component.
- the high frequency band can be divided into K frequency regions (tile), and each frequency region is divided into N subbands.
- the determination of the high-frequency band parameters can be performed in units of frequency regions. Take a frequency region as an example.
- the method for determining the high-band parameters according to the high-band signal may specifically be:
- Peak search is performed in units of frequency regions.
- a peak search is performed on the power spectrum of a high-band signal in a frequency region to obtain peak number information, peak position information, and peak amplitude information in the frequency region.
- Screening is performed according to at least one of the peak position, the peak amplitude, and the number of peaks, and the position quantity parameter and the amplitude parameter of the tone component are determined.
- Screening is performed according to at least one of the peak position, the peak amplitude, and the number of peaks, and the position parameter, quantity parameter, and amplitude parameter of the tonal component are determined.
- the position parameter of the tonal component may be the sequence number of the subband with the peak in the frequency region.
- the parameter of the number of tonal components is the number of subbands with peaks in the frequency region.
- the amplitude parameter of the tonal component may be equal to the peak amplitude of the subband with the peak in the frequency region or calculated according to the peak amplitude of the subband with the peak in the frequency region.
- the peak amplitude can be characterized by the energy of the frequency domain signal or the power of the frequency domain signal.
- the amplitude parameter of the tonal component can be replaced with the energy parameter of the tonal component as the high-frequency band parameter.
- the position quantity parameter of the tonal component is determined.
- the position quantity parameter of the tone component can be represented by an N-bit sequence, where N is the number of subbands in a frequency region.
- N is the number of subbands in a frequency region.
- the bit sequence from low to high indicates that the sequence numbers of the subbands are from small to large.
- the bit sequence from low to high indicates the sequence numbers of the subbands from large to small.
- the sequence number of the subband corresponding to each bit of the bit sequence can also be specified in advance.
- sequence number of the subband with the peak in the frequency region it is determined whether there is a peak in the subband corresponding to each bit in the N-bit sequence, and the N-bit sequence is obtained, that is, the position quantity parameter of the tonal component. If the sequence number of the subband corresponding to the bit is equal to the sequence number of the subband with the peak in the frequency region, the value of the bit is 1, otherwise the value of the bit is 0.
- the number of subbands in a frequency region is 5, the position quantity parameter of the tone component is represented by a 5-bit sequence, and the binary representation of the 5-bit sequence value is 10011. Assuming that the 5-bit bit sequence from low to high indicates that the subband's sequence number is from small to large, the value of this bit sequence indicates that there are peaks in the 0th, 1st, and 4th subbands in the frequency region, that is, the sequence number of the subbands with peaks is 0, 1, 4.
- Case 3 The high-band parameters may also include noise floor parameters. Case 3 can be implemented in combination with Case 1 or Case 2.
- the method further includes:
- Case 4 The high-band parameters may also include signal type information. Case 3 can be implemented in combination with Cases 1-3.
- the determination of the high-frequency parameter further includes: determining the signal type information according to the quantity parameter of the tonal component or the position quantity parameter of the tonal component. specifically:
- the signal type information is determined. For example, if the value of the quantity parameter of tonal components is greater than 0, the signal type information indicates the tonal signal type.
- the signal type information is determined. It may be that the quantity parameter of the tonal component is obtained according to the position quantity parameter of the tonal component; the signal type information is determined according to the quantity parameter of the tonal component. It is worth noting that if the quantitative parameter of the tonal component has been obtained by determining the position quantity parameter of the tonal component, there is no need to obtain the quantity parameter of the tonal component according to the position quantity parameter of the tonal component, and the signal is determined directly according to the quantity parameter of the tonal component. Type information is fine.
- the signal type information can be indicated by the presence or absence of tonal components.
- the flag of the presence or absence of a tone component may also be referred to as tone component indication information.
- the flag value of the presence or absence of a tonal component is 1, which indicates that there is a tonal component.
- the signal type information can be represented by a flag indicating whether there are tonal components in the frequency region. For example, the flag value of whether there is a tonal component in the frequency region is 1, which indicates that there is a tonal component in the frequency region.
- Special processing for case 4 If the signal type information indicates the tone signal type, the signal type information and high-frequency band parameters other than the signal type information need to be written into the code stream. Otherwise, write the signal type information into the code stream. If it is coded according to the frequency region, the frequency region is processed in sequence: If the signal type information corresponding to the frequency region indicates the tone signal type, the signal type information and the high-frequency band parameters other than the signal type information need to be written into the code Stream; otherwise, write the signal type information into the code stream.
- the audio encoder in the embodiment of the present invention encodes the position, quantity, and amplitude or energy of the tonal components in the high-band signal, so that the audio decoder recovers according to the position, quantity, and amplitude or energy of the tonal components
- the tonal component makes the position and energy of the restored tonal component more accurate, thereby improving the quality of the decoded signal.
- Figure 3 describes the flow of an audio decoding method provided by an embodiment of the present invention, including:
- the high frequency band parameters include a position quantity parameter of the tone component, and an amplitude parameter or an energy parameter of the tone component.
- the number of positions parameter indicates that the position of the tonal component and the number of tonal components are represented by the same parameter.
- the high-band parameters include the position parameter of the tonal component, the quantity parameter of the tonal component, and the amplitude parameter or energy parameter of the tonal component; in this case, the position and quantity of the tonal component are different. The parameter representation.
- the high frequency band corresponding to the high frequency band signal includes at least one frequency region, and one frequency region includes at least one subband; accordingly, the high frequency band parameter includes the high frequency of the current frame.
- the position quantity parameter of the tonal component of the high-frequency signal includes the position quantity parameter of the respective tonal component of the at least one frequency region, and the amplitude parameter or energy parameter of the tonal component of the high-frequency signal of the current frame includes each of the at least one frequency region.
- the amplitude parameter or energy parameter of the tonal component includes
- the demultiplexing the code stream to obtain the high-frequency band parameters of the current frame of the audio signal includes: obtaining the pitch component of the current frequency region of the at least one frequency region Position quantity parameter; parse the amplitude parameter or energy parameter of the pitch component of the current frequency region from the code stream according to the position quantity parameter of the pitch component of the current frequency region.
- the parsing the amplitude parameter or energy parameter of the pitch component of the current frequency region from the code stream according to the position quantity parameter of the pitch component of the current frequency region includes: The position quantity parameter of the tonal component in the frequency region determines the quantity parameter of the tonal component in the current frequency region; according to the quantity parameter of the tonal component in the current frequency region, the current frequency region is parsed from the code stream. The amplitude parameter or energy parameter of the tonal component.
- the demultiplexing the code stream to obtain the high-frequency band parameters of the current frame of the audio signal includes: obtaining the pitch component of the current frequency region of the at least one frequency region Position quantity parameter; according to the position quantity parameter of the tonal component in the current frequency region, determine the position parameter of the tonal component in the current frequency region and the quantity parameter of the tonal component in the current frequency region; according to the number of tonal components in the current frequency region
- the parameter analyzes the amplitude parameter or the energy parameter of the pitch component of the current frequency region from the coded code stream.
- the method before acquiring the position quantity parameter of the pitch component of the current frequency region of the at least one frequency region, the method includes: acquiring the pitch component indication information of the current frequency region; the pitch component indication information is used to indicate Whether the current frequency region includes a tonal component; when the current frequency region includes a tonal component, acquiring a position quantity parameter of the tonal component in the current frequency region of the at least one frequency region. Therefore, it is possible to decode the parameters of the tonal components only in the frequency region including the tonal components, thereby improving the decoding efficiency.
- the obtaining the reconstructed high-band signal of the current frame according to the high-band parameter includes: determining the tonal component in the current frequency region according to the position quantity parameter of the tonal component in the current frequency region Determine the amplitude or energy corresponding to the position of the tonal component according to the amplitude parameter or energy parameter of the tonal component in the current frequency region; according to the position of the tonal component in the current frequency region and the position of the tonal component corresponding to The amplitude or energy of the reconstructed high-band signal is obtained.
- the determining the position of the tonal component in the current frequency region according to the position quantity parameter of the tonal component of the high-frequency signal in the current frequency region may include: The position quantity parameter determines the position parameter of the tonal component in the current frequency region; and determines the position of the tonal component in the current frequency region according to the position parameter of the tonal component in the current frequency region.
- the obtaining the reconstructed high-band signal of the current frame according to the high-band parameter may specifically include: determining the current frequency region according to the position parameter of the tonal component of the current frequency region Tone component position; determine the amplitude or energy corresponding to the position of the tonal component according to the amplitude parameter or energy parameter of the tonal component in the current frequency region; according to the position of the tonal component in the current frequency region and the position of the tonal component The corresponding amplitude or energy obtains the reconstructed high frequency band signal.
- the obtaining of the reconstructed high-band signal according to the position of the tone component in the current frequency region and the amplitude corresponding to the position of the tone component may be performed in the following manner:
- pSpectralData represents the reconstructed high-band frequency domain signal in the current frequency region
- tone_val represents the amplitude value corresponding to the position of the tone component in the current frequency region
- tone_pos represents the position of the tone component in the current frequency region
- the position quantity parameter of the tonal component of the current frequency region includes N bits. Accordingly, the obtaining the position quantity parameter of the tonal component of the current frequency region of the at least one frequency region includes: according to The number of subbands included in the current frequency region reads N bits from the code stream, where the N bits are the position quantity parameters of the tonal components in the current frequency region, where N is the The number of subbands included in the current frequency region, and the N bits have a one-to-one correspondence with the subbands included in the current frequency region.
- the position parameter of the tonal component in the current frequency region is used to indicate the sequence number of the sub-band including the tonal component in the current frequency region.
- the position of the tonal component in the current frequency region is located at a designated position in the subband where the tonal component is located in the current frequency region.
- the designated position of the subband may be the center position of the subband, or the start position of the subband, or the end position of the subband.
- Another embodiment of the present invention provides an audio decoding method, including the following processes:
- the high frequency band can be divided into K frequency regions (tile), and each frequency region is divided into N subbands.
- the determination of the high-frequency band parameters can be performed in units of frequency regions.
- the following takes the method of obtaining high-frequency band parameters according to the code stream in a frequency region as an example.
- the methods for obtaining high-band parameters in different frequency regions according to the coded bitstream can be the same or different.
- the high frequency band parameters can be obtained through the following process:
- the code stream is analyzed to determine the position parameter of the tonal component.
- the code stream is analyzed to determine the amplitude parameter of the tonal component.
- the high frequency band parameters can be obtained through the following process:
- the position quantity parameter of the tonal component represents the position information of the tonal component and the quantity information of the tonal component.
- the decoding side parses the code stream, and first obtains the position quantity parameter of the tonal component.
- the position quantity parameter of the tone component can be represented by an N-bit sequence, where N is the number of subbands in a frequency region.
- the frequency domain resolution tone_res[p] may be preset, or it may be obtained by analyzing the obtained code stream. Assuming that the bandwidth of the p-th frequency region is tile_width[p], the number of subbands in the frequency region can be
- num_subband tile_width[p]/tone_res[p]
- the number of subbands in the frequency region is 5, and 5 bits are read from the code stream, and the binary representation of the number of positions of the tonal components is 10011.
- the number of subbands num_subband in the frequency region can also be preset, and num_subband bits can be read from the code stream directly according to the number of subbands in the frequency region num_subband, which is the position quantity parameter of the tone component.
- the quantity parameter of the tone component is obtained according to the position quantity parameter of the tone component.
- it may be: determining the number of subbands of the tonal component in the frequency region according to the position quantity parameter of the tonal component, that is, the quantity parameter tone_cnt[p] of the tonal component.
- the number of subbands of tonal components in the frequency region is equal to the number of bits with a value of 1 in the binary representation of the number of positions of the tonal components parameter.
- the code stream is analyzed according to the quantity parameter of the tonal component, and the amplitude parameter of the tonal component is determined.
- the amplitude parameters of the tonal components are sequentially analyzed from the code stream according to the preset number of bits, and the number of amplitude parameters of the tonal components is equal to the quantity parameter of the tonal components.
- the amplitude parameter tone_val_q[p][i], i 0,...,tone_cnt[p]-1 of the tone component.
- the high-band parameters may also include the noise floor parameters of the tonal components.
- Obtaining the high-frequency band parameters according to the coded code stream also includes: parsing the code stream to determine the noise floor parameters. Specifically, it may be: analyzing the noise floor parameter noise_floor[p] from the code stream according to the preset number of bits.
- the high frequency band parameters also include signal type information.
- Obtaining the high frequency band parameters according to the coded code stream also includes: parsing the code stream to determine the signal type information.
- the high frequency band parameters are obtained, which can be specifically:
- the signal type information can be a flag indicating whether there are tonal components in the frequency region, and can also be referred to as tonal component indication information.
- the signal type information it is determined whether it is necessary to decode other high-frequency band parameters except the signal type information.
- the flag value of whether there is a tone component in the frequency region is 1, that is, the signal type information indicates the tone signal type, then the code stream analysis is continued.
- the method of parsing the code stream to determine other high-band parameters except the signal type information can be any of Case 1, Case 2, and Case 3 on the decoding side.
- the high frequency band can be divided into K frequency regions (tile), and each frequency region is divided into N subbands.
- the reconstruction of the high-band signal can be performed in units of frequency regions.
- the method for obtaining the reconstructed high-band signal according to the high-band parameters in different frequency regions may be the same or different.
- the reconstructed high-frequency signal in each frequency region the reconstructed high-frequency signal is obtained.
- the high frequency band signal can be a frequency domain signal or a time domain signal
- the position parameter of the tonal component represents the subband sequence number corresponding to the position of the tonal component.
- the quantity parameter of tonal components characterizes the quantity of tonal components. According to the quantity parameter of the tonal component, the position parameter and the amplitude parameter of the tonal component, the high frequency band signal of the current frame is reconstructed.
- tone_pos tile[p]+(sfb+0.5)*tone_res[p]
- tone_val pow(2.0,0.25*tone_val_q[p][tone_idx]–4.0)
- tile[p] is the starting frequency point of the p-th frequency region
- sfb is the position parameter of the tonal component (that is, the subband number corresponding to the position of the tonal component)
- tone_res[p] is the frequency domain resolution of the subband
- Tone_pos represents the position of the tone component corresponding to the tone_idxth tone component in the p-th frequency region
- tone_val_q[p][tone_idx] represents the amplitude parameter of the tone component corresponding to the tone_idx-th tone component in the p-th frequency region
- tone_val represents the amplitude value corresponding to the tone_idx-th tone component in the p-th frequency region.
- pSpectralData[tone_pos] represents the frequency domain signal corresponding to the position tone_pos of the tone component.
- the value range of tone_idx belongs to [0, tone_cnt[p]-1], and tone_cnt[p] is the quantity parameter of the tone component.
- the frequency domain signal on this frequency point can be directly set to 0.
- the present invention does not limit the reconstruction method of other frequency points without tonal components.
- the position quantity parameter of the tone component can be represented by an N-bit sequence, where N is the number of subbands in a frequency region. Specifically, the position quantity parameter of the pitch component may be shifted to determine the subband sequence number of the pitch component in the frequency region and the number of subbands with the pitch component.
- the subband number of the tone component in the frequency region is the position parameter of the tone component.
- the number of subbands of tonal components in the frequency region is the parameter of the number of tonal components.
- bit sequence from low to high indicates that the sequence numbers of the subbands are from small to large.
- the number of subbands in the frequency region is 5, the lowest bit of the 5-bit sequence corresponds to the subband sequence number 0, and the highest bit of the 5-bit sequence corresponds to the subband sequence number 4.
- the binary representation of the position quantity parameter of the tone component is 10011, the subband numbers of the tone component in the frequency region are 0, 1, and 4, respectively.
- bit sequence from low to high indicates the sequence numbers of the subbands from large to small.
- the number of subbands in the frequency region is 5, the lowest bit of the 5-bit sequence corresponds to the subband sequence number 4, and the highest bit of the 5-bit sequence corresponds to the subband sequence number 0.
- the binary representation of the position quantity parameter of the tone component is 10011, the subband numbers of the tone component in the frequency region are 0, 3, and 4, respectively.
- sequence number of the subband corresponding to each bit of the bit sequence may also be predetermined, which is not limited in the present invention.
- the quantity parameter of the tonal component can be obtained.
- the number of subband numbers of tonal components in the frequency region is the quantity parameter of the tonal components.
- the high-frequency band signal is reconstructed.
- it may be: calculating the position of the pitch component according to the position parameter of the pitch component.
- tone_pos tile[p]+(sfb+0.5)*tone_res[p]
- tile[p] is the starting frequency point of the p-th frequency region
- sfb is the subband number of the tone component in the frequency region
- tone_res[p] is the frequency-domain resolution of the p-th frequency region.
- the subband number of the tone component in the frequency region is the position parameter of the tone component. 0.5 means that the position of the tonal component in the sub-band where the tonal component exists is at the center of the sub-band.
- the reconstructed tonal components can also be located in other positions of the subband.
- it may be: calculating the amplitude of the tonal component according to the amplitude parameter of the tonal component.
- tone_val pow(2.0,0.25*tone_val_q[p][tone_idx]–4.0)
- tone_val_q[p][tone_idx] represents the amplitude parameter corresponding to the tone_idx position parameter in the p-th frequency region
- tone_val represents the amplitude value of the frequency point corresponding to the tone_idx position parameter in the p-th frequency region.
- tone_idx belongs to [0, tone_cnt[p]-1], and tone_cnt[p] is the quantity parameter of the tone component.
- the frequency domain signal corresponding to the position tone_pos of the tone component satisfies:
- tone_pos represents the frequency domain signal corresponding to the position tone_pos of the tone component
- tone_val represents the amplitude value of the frequency point corresponding to the tone_idx position parameter in the p-th frequency region
- tone_pos indicates the position of the tone component corresponding to the tone_idx position parameter in the p-th frequency region.
- the frequency domain signal of the frequency point can be directly set to 0.
- the present invention does not limit the reconstruction method of other frequency points without tonal components.
- the audio signal of the current frame is obtained.
- the third embodiment of the present invention provides an audio decoding method, including the following processes:
- the high frequency band can be divided into K frequency regions (tile), and each frequency region is divided into N subbands.
- the determination of the high-frequency band parameters can be performed in units of frequency regions. The following takes the method of obtaining high-frequency band parameters according to the code stream in a frequency region as an example.
- the position quantity parameter of the tonal component represents the position information of the tonal component and the quantity information of the tonal component.
- the decoding side parses the code stream, and first obtains the position quantity parameter of the tonal component.
- the position quantity parameter of the tone component can be represented by an N-bit sequence, where N is the number of subbands in a frequency region.
- the frequency domain resolution tone_res[p] may be preset, or it may be obtained by analyzing the obtained code stream. Assuming that the bandwidth of the p-th frequency region is tile_width[p], the number of subbands in the frequency region can be
- num_subband tile_width[p]/tone_res[p]
- the number of subbands in the frequency region is 5, and 5 bits are read from the code stream, and the binary representation of the position quantity parameter of the tonal component obtained is 10011.
- the number of subbands num_subband in the frequency region can also be preset, and num_subband bits can be read from the code stream directly according to the number of subbands in the frequency region num_subband, which is the position quantity parameter of the tone component.
- the position quantity parameter of the tone component can be represented by an N-bit sequence, where N is the number of subbands in a frequency region. Specifically, the position quantity parameter of the pitch component may be shifted to determine the subband sequence number of the pitch component in the frequency region and the number of subbands with the pitch component.
- the subband number of the tone component in the frequency region is the position parameter of the tone component.
- the number of subbands of tonal components in the frequency region is the parameter of the number of tonal components.
- bit sequence from low to high indicates that the sequence numbers of the subbands are from small to large.
- the number of subbands in the frequency region is 5, the lowest bit of the 5-bit sequence corresponds to the subband sequence number 0, and the highest bit of the 5-bit sequence corresponds to the subband sequence number 4.
- the binary representation of the position quantity parameter of the tone component is 10011, the subband numbers of the tone component in the frequency region are 0, 1, and 4, respectively.
- bit sequence from low to high indicates the sequence numbers of the subbands from large to small.
- the number of subbands in the frequency region is 5, the lowest bit of the 5-bit sequence corresponds to the subband sequence number 4, and the highest bit of the 5-bit sequence corresponds to the subband sequence number 0.
- the binary representation of the position quantity parameter of the tone component is 10011, the subband numbers of the tone component in the frequency region are 0, 3, and 4, respectively.
- sequence number of the subband corresponding to each bit of the bit sequence may also be predetermined, which is not limited in the present invention.
- the quantity parameter of the tonal component can be obtained.
- the number of subband numbers of tonal components in the frequency region is the quantity parameter of the tonal components.
- it may be: determining the number of subbands of the tonal component in the frequency region according to the position quantity parameter of the tonal component, that is, the quantity parameter tone_cnt[p] of the tonal component.
- the number of subbands of tonal components in the frequency region is equal to the number of bits with a value of 1 in the binary representation of the number of positions of the tonal components parameter.
- the amplitude parameters of the tonal components are sequentially analyzed from the code stream according to the preset number of bits, and the number of amplitude parameters of the tonal components is equal to the quantity parameter of the tonal components.
- the amplitude parameter tone_val_q[p][i], i 0,...,tone_cnt[p]-1 of the tone component.
- the high frequency band can be divided into K frequency regions (tile), and each frequency region is divided into N subbands.
- the reconstruction of the high-band signal can be performed in units of frequency regions. The following are examples of methods for obtaining a reconstructed high-band signal based on high-band parameters in a frequency region. According to the reconstructed high-frequency signal in each frequency region, the reconstructed high-frequency signal is obtained.
- the high frequency band signal can be a frequency domain signal or a time domain signal.
- the high-frequency signal of the current frame may be reconstructed according to the position parameter, the quantity parameter, and the amplitude parameter of the pitch component.
- the number of tonal components parameter table proves the number of tonal components.
- the reconstruction method of the tonal component at a position can be specifically:
- it may be: calculating the position of the pitch component according to the position parameter of the pitch component.
- tone_pos tile[p]+(sfb+0.5)*tone_res[p]
- tile[p] is the starting frequency point of the p-th frequency region
- sfb is the subband number of the tone component in the frequency region
- tone_res[p] is the frequency-domain resolution of the p-th frequency region.
- the subband number of the tone component in the frequency region is the position parameter of the tone component. 0.5 means that the position of the tonal component in the sub-band where the tonal component exists is at the center of the sub-band.
- the reconstructed tonal components can also be located in other positions of the subband.
- it may be: calculating the amplitude of the tonal component according to the amplitude parameter of the tonal component.
- tone_val pow(2.0,0.25*tone_val_q[p][tone_idx]–4.0)
- tone_val_q[p][tone_idx] represents the amplitude parameter corresponding to the tone_idx position parameter in the p-th frequency region
- tone_val represents the amplitude value of the frequency point corresponding to the tone_idx position parameter in the p-th frequency region.
- tone_idx belongs to [0, tone_cnt[p]-1], and tone_cnt[p] is the number of tone components.
- the high-frequency band signal is reconstructed.
- the frequency domain signal corresponding to the position tone_pos of the tone component satisfies:
- tone_pos represents the frequency domain signal corresponding to the position tone_pos of the tone component
- tone_val represents the amplitude value of the frequency point corresponding to the tone_idx position parameter in the p-th frequency region
- tone_pos indicates the position of the tone component corresponding to the tone_idx position parameter in the p-th frequency region.
- the frequency domain signal of the frequency point can be directly set to 0.
- the present invention does not limit the reconstruction method of other frequency points without tonal components.
- the audio signal of the current frame is obtained.
- the audio encoder in the embodiment of the present invention encodes the position, quantity, and amplitude or energy of the tonal components in the high-band signal, so that the audio decoder recovers according to the position, quantity, and amplitude or energy of the tonal components
- the tonal component makes the position and energy of the restored tonal component more accurate, thereby improving the quality of the decoded signal.
- Figure 6 depicts the structure of an audio encoder provided by an embodiment of the present invention, including:
- the signal acquisition unit 601 is configured to acquire a current frame of an audio signal, where the current frame includes a high frequency band signal;
- the parameter obtaining unit 602 is configured to obtain the high-band parameters of the current frame according to the high-band signal, where the high-band parameters are used to indicate the position, quantity, and amplitude of the tonal components included in the high-band signal Or energy
- the encoding unit 603 is configured to perform code stream multiplexing on the high frequency band coding parameters to obtain an encoded code stream.
- the audio encoder may further include: a determining unit, configured to determine whether a pitch component is included in the current frequency region; and the parameter acquiring unit, specifically configured to include a pitch component in the current frequency region
- the position quantity parameter of the tone component of the current frequency region and the amplitude parameter or energy parameter of the tone component of the current frequency region are determined according to the high-band signal of the current frequency region in the at least one frequency region.
- the specific implementation of the audio encoder can refer to the aforementioned audio encoding method, which will not be repeated here.
- the audio encoder in the embodiment of the present invention encodes the position, quantity, and amplitude or energy of the tonal components in the high-band signal, so that the audio decoder recovers according to the position, quantity, and amplitude or energy of the tonal components
- the tonal component makes the position and energy of the restored tonal component more accurate, thereby improving the quality of the decoded signal.
- Figure 7 illustrates the structure of an audio decoder provided by an embodiment of the present invention, including:
- the receiving unit 701 is configured to obtain an encoding code stream
- the demultiplexing unit 702 is configured to demultiplex the code stream to obtain the high frequency band parameters of the current frame of the audio signal, and the high frequency band parameters are used to indicate the high frequency of the current frame.
- the reconstruction unit 703 is configured to obtain the reconstructed high frequency band signal of the current frame according to the high frequency band parameter; and obtain the audio output signal of the current frame according to the reconstructed high frequency band signal of the current frame.
- the specific implementation of the audio decoder can refer to the aforementioned audio coding method, which will not be repeated here.
- the audio encoder in the embodiment of the present invention encodes the position, quantity, and amplitude or energy of the tonal components in the high-band signal, so that the audio decoder recovers according to the position, quantity, and amplitude or energy of the tonal components
- the tonal component makes the position and energy of the restored tonal component more accurate, thereby improving the quality of the decoded signal.
- An embodiment of the present application also provides a computer storage medium, wherein the computer storage medium stores a program, and the program executes some or all of the steps recorded in the above method embodiments.
- the audio coding device 800 includes:
- the receiver 801, the transmitter 802, the processor 803, and the memory 804 (the number of the processors 803 in the audio encoding device 800 may be one or more, and one processor is taken as an example in FIG. 8).
- the receiver 801, the transmitter 802, the processor 803, and the memory 804 may be connected by a bus or in other ways, wherein the connection by a bus is taken as an example in FIG. 8.
- the memory 804 may include a read-only memory and a random access memory, and provides instructions and data to the processor 803. A part of the memory 804 may also include a non-volatile random access memory (NVRAM).
- NVRAM non-volatile random access memory
- the memory 804 stores an operating system and operating instructions, executable modules or data structures, or a subset of them, or an extended set of them.
- the operating instructions may include various operating instructions for implementing various operations.
- the operating system may include various system programs for implementing various basic services and processing hardware-based tasks.
- the processor 803 controls the operation of the audio encoding device, and the processor 803 may also be referred to as a central processing unit (CPU).
- the various components of the audio encoding device are coupled together through a bus system.
- the bus system may also include a power bus, a control bus, and a status signal bus.
- various buses are referred to as bus systems in the figure.
- the method disclosed in the foregoing embodiment of the present application may be applied to the processor 803 or implemented by the processor 803.
- the processor 803 may be an integrated circuit chip with signal processing capability. In the implementation process, the steps of the foregoing method can be completed by an integrated logic circuit of hardware in the processor 803 or instructions in the form of software.
- the aforementioned processor 803 may be a general-purpose processor, a digital signal processing (digital signal processing, DSP), an application specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or Other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components.
- DSP digital signal processing
- ASIC application specific integrated circuit
- FPGA field-programmable gate array
- Other programmable logic devices discrete gates or transistor logic devices, discrete hardware components.
- the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
- the steps of the method disclosed in the embodiments of the present application may be directly embodied as being executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in the decoding processor.
- the software module can be located in a mature storage medium in the field, such as random access memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers.
- the storage medium is located in the memory 804, and the processor 803 reads the information in the memory 804, and completes the steps of the foregoing method in combination with its hardware.
- the receiver 801 can be used to receive input digital or character information, and generate signal input related to the related settings and function control of the audio coding device.
- the transmitter 802 can include display devices such as a display screen, and the transmitter 802 can be used to output through an external interface Number or character information.
- the processor 803 is configured to execute the audio encoding method shown in FIG. 2 above.
- the audio decoding device 900 includes:
- the receiver 901, the transmitter 902, the processor 903, and the memory 904 (the number of the processors 903 in the audio decoding device 900 may be one or more, and one processor is taken as an example in FIG. 9).
- the receiver 901, the transmitter 902, the processor 903, and the memory 904 may be connected by a bus or in other ways, wherein the connection by a bus is taken as an example in FIG. 9.
- the memory 904 may include a read-only memory and a random access memory, and provides instructions and data to the processor 903. A part of the memory 904 may also include NVRAM.
- the memory 904 stores an operating system and operating instructions, executable modules or data structures, or a subset of them, or an extended set of them.
- the operating instructions may include various operating instructions for implementing various operations.
- the operating system may include various system programs for implementing various basic services and processing hardware-based tasks.
- the processor 903 controls the operation of the audio decoding device, and the processor 903 may also be referred to as a CPU.
- the various components of the audio decoding device are coupled together through a bus system, where the bus system may include a power bus, a control bus, and a status signal bus in addition to a data bus.
- bus system may include a power bus, a control bus, and a status signal bus in addition to a data bus.
- various buses are referred to as bus systems in the figure.
- the method disclosed in the foregoing embodiment of the present application may be applied to the processor 903 or implemented by the processor 903.
- the processor 903 may be an integrated circuit chip with signal processing capability. In the implementation process, the steps of the foregoing method may be completed by an integrated logic circuit of hardware in the processor 903 or instructions in the form of software.
- the aforementioned processor 903 may be a general-purpose processor, DSP, ASIC, FPGA or other programmable logic device, discrete gate or transistor logic device, or discrete hardware component.
- the methods, steps, and logical block diagrams disclosed in the embodiments of the present application can be implemented or executed.
- the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
- the steps of the method disclosed in the embodiments of the present application may be directly embodied as being executed and completed by a hardware decoding processor, or executed and completed by a combination of hardware and software modules in the decoding processor.
- the software module can be located in a mature storage medium in the field, such as random access memory, flash memory, read-only memory, programmable read-only memory, or electrically erasable programmable memory, registers.
- the storage medium is located in the memory 904, and the processor 903 reads the information in the memory 904, and completes the steps of the foregoing method in combination with its hardware.
- the processor 903 is configured to execute the audio decoding method shown in FIG. 3.
- the chip when the audio encoding device or the audio decoding device is a chip in the terminal, the chip includes: a processing unit and a communication unit.
- the processing unit may be, for example, a processor, and the communication unit may be, for example, Input/output interface, pin or circuit, etc.
- the processing unit can execute the computer-executable instructions stored in the storage unit, so that the chip in the terminal executes the method of any one of the above-mentioned first aspects.
- the storage unit is a storage unit in the chip, such as a register, a cache, etc., and the storage unit may also be a storage unit in the terminal located outside the chip, such as a read-only memory (read-only memory). -only memory, ROM) or other types of static storage devices that can store static information and instructions, random access memory (RAM), etc.
- processor mentioned in any of the foregoing may be a general-purpose central processing unit, a microprocessor, an ASIC, or one or more integrated circuits used to control the execution of the program of the method in the first aspect.
- the device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physically separate.
- the physical unit can be located in one place or distributed across multiple network units. Some or all of the modules can be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
- the connection relationship between the modules indicates that they have a communication connection between them, which may be specifically implemented as one or more communication buses or signal lines.
- this application can be implemented by means of software plus necessary general hardware.
- it can also be implemented by dedicated hardware including dedicated integrated circuits, dedicated CPUs, dedicated memory, Dedicated components and so on to achieve.
- all functions completed by computer programs can be easily implemented with corresponding hardware.
- the specific hardware structures used to achieve the same function can also be diverse, such as analog circuits, digital circuits or special-purpose circuits. Circuit etc.
- software program implementation is a better implementation in more cases.
- the technical solution of this application essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a readable storage medium, such as a computer floppy disk. , U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk, etc., including several instructions to make a computer device (which can be a personal computer, server, or network device, etc.) execute the methods described in each embodiment of this application .
- a computer device which can be a personal computer, server, or network device, etc.
- the computer program product includes one or more computer instructions.
- the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable devices.
- the computer instructions may be stored in a computer-readable storage medium or transmitted from one computer-readable storage medium to another computer-readable storage medium.
- the computer instructions may be transmitted from a website, computer, server, or data center. Transmission to another website, computer, server or data center via wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.).
- wired such as coaxial cable, optical fiber, digital subscriber line (DSL)
- wireless such as infrared, wireless, microwave, etc.
- the computer-readable storage medium may be any available medium that can be stored by a computer or a data storage device such as a server or a data center integrated with one or more available media.
- the usable medium may be a magnetic medium (for example, a floppy disk, a hard disk, and a magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium (for example, a solid state disk (SSD)).
Abstract
Description
Claims (53)
- 一种音频编码方法,其特征在于,所述方法包括:获取音频信号的当前帧,所述当前帧包括高频带信号;根据所述高频带信号获得所述当前帧的高频带参数,所述高频带参数用于表示所述高频带信号包括的音调成分的位置、数量以及幅度或能量;对所述高频带编码参数进行码流复用,以得到编码码流。
- 根据权利要求1所述的方法,其特征在于,所述高频带参数包括音调成分的位置数量参数、以及所述音调成分的幅度参数或能量参数。
- 根据权利要求2所述的方法,其特征在于,所述高频带信号对应的高频带包括至少一个频率区域,一个所述频率区域包括至少一个子带,所述根据所述高频带信号获得所述当前帧的高频带参数包括:根据所述至少一个频率区域中的当前频率区域的高频带信号,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求3所述的方法,其特征在于,所述根据所述至少一个频率区域中的当前频率区域的高频带信号,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数前,所述方法包括:确定所述当前频率区域内是否包括音调成分;在所述当前频率区域内包括音调成分时,根据所述至少一个频率区域中的当前频率区域的高频带信号,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求4所述的方法,其特征在于,所述当前帧的高频带参数还包括音调成分指示信息,所述音调成分指示信息用于指示所述当前频率区域内是否包括音调成分。
- 根据权利要求3至5任一所述的方法,其特征在于,所述根据所述至少一个频率区域中的当前频率区域的高频带信号,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数包括:根据所述至少一个频率区域中的当前频率区域的高频带信号在所述当前频率区域内进行峰值搜索,以获得所述当前区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种;根据所述当前频率区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求6所述的方法,其特征在于,根据所述至少一个频率区域中的当前频率区域的高频带信号在所述当前频率区域内进行峰值搜索,以获得所述当前区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种包括:根据所述至少一个频率区域中的当前频率区域的功率谱、能量谱或幅度谱中的至少一种在所述当前频率区域内进行峰值搜索,以获得所述当前区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种。
- 根据权利要求6所述的方法,其特征在于,所述根据所述当前频率区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数包括:根据所述当前频率区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种,确定所述当前频率区域的音调成分的位置信息,数量信息以及幅度信息;根据所述当前频率区域的音调成分的位置信息,数量信息以及幅度信息确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求3至8任一所述所述的方法,其特征在于,所述当前频率区域的音调成分的位置数量参数包括N个比特位,N为所述当前频率区域包括的子带数量,所述N个比特位与所述当前频率区域包括的子带一一对应;其中,若所述当前频率区域包括的第一子带存在音调成分,则所述N个比特位中与所述第一子带对应的比特位的值为第一值;或若所述当前频率区域包括的第二子带不存在音调成分,则所述N个比特位中与所述第二子带对应的比特位的值为第二值,所述第一值与所述第二值不同。
- 根据权利要求1至9任一所述的方法,其特征在于,所述高频带参数还包括所述高频带信号的噪声基底参数。
- 一种音频解码方法,其特征在于,包括:获取编码码流;对所述编码码流进行码流解复用,以得到音频信号的当前帧的高频带参数,所述高频带参数用于表示所述当前帧的高频带信号包括的音调成分的位置、数量以及幅度或能量;根据所述高频带参数获得所述当前帧的重建高频带信号;根据所述当前帧的重建高频带信号获得所述当前帧的音频输出信号。
- 根据权利要求11所述的方法,其特征在于,所述高频带参数包括所述当前帧的高频信号的音调成分的位置数量参数和所述音调成分的幅度参数或能量参数。
- 根据权利要求12所述的方法,其特征在于,所述高频带信号对应的高频带包括至少一个频率区域,一个所述频率区域包括至少一个子带;所述高频带参数包括所述当前帧的高频信号的音调成分的位置数量参数包括所述至少一个频率区域各自的音调成分的位置数量参数,所述当前帧的高频信号的音调成分的幅度参数或能量参数包括所述至少一个频率区域各自的音调成分的幅度参数或能量参数。
- 根据权利要求13所述的方法,其特征在于,所述对所述编码码流进行码流解复用,以得到音频信号的当前帧的高频带参数包括:获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数;根据所述当前频率区域的音调成分的位置数量参数从所述编码码流中解析所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求14所述的方法,其特征在于,所述根据所述当前频率区域的音调成分的位置数量参数从所述编码码流中解析所述当前频率区域的音调成分的幅度参数或能量参数,包括:根据所述当前频率区域的音调成分的位置数量参数,确定所述当前频率区域的音调成分的数量参数;根据所述当前频率区域的音调成分的数量参数,从所述编码码流中解析所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求13所述的方法,其特征在于,所述对所述编码码流进行码流解复用,以得到音频信号的当前帧的高频带参数包括:获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数;根据所述当前频率区域的音调成分的位置数量参数,确定当前频率区域的音调成分的位置参数和当前频率区域的音调成分的数量参数;根据所述当前频率区域的音调成分的数量参数从所述编码码流中解析所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求14至16任一所述的方法,其特征在于,所述获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数前包括:获取所述当前频率区域的音调成分指示信息;所述音调成分指示信息用于指示所述当前频率区域内是否包括音调成分;当所述当前频率区域内包括音调成分时,获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数。
- 根据权利要求14至17任一所述的方法,其特征在于,所述获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数包括:根据所述当前频率区域包括的子带数量从所述编码码流中读取N个比特位,所述N个比特位为所述当前频率区域的音调成分的位置数量参数,其中,N为所述当前频率区域包括的子带数量,所述N个比特位与所述当前频率区域包括的子带一一对应。
- 根据权利要求14,15,17和18任一所述的方法,其特征在于,所述根据所述高频带参数获得所述当前帧的重建高频带信号包括:根据所述当前频率区域的音调成分的位置数量参数确定所述当前频率区域中音调成分的位置;根据所述当前频率区域的音调成分的幅度参数或能量参数确定所述音调成分的位置对应的幅度或能量;根据所述当前频率区域中音调成分的位置和所述音调成分的位置对应的幅度或能量获得所述重建高频带信号。
- 根据权利要求19所述的方法,其特征在于,所述根据所述当前频率区域的高频信号的音调成分的位置数量参数确定所述当前频率区域中音调成分的位置包括:根据所述当前频率区域的高频信号的音调成分的位置数量参数,确定所述当前频率区域的音调成分的位置参数;根据所述当前频率区域的音调成分的位置参数,确定所述当前频率区域中音调成分位置。
- 根据权利要求16至18任一所述的方法,其特征在于,所述根据所述高频带参数 获得所述当前帧的重建高频带信号包括:根据所述当前频率区域的音调成分的位置参数,确定所述当前频率区域中音调成分位置;根据所述当前频率区域的音调成分的幅度参数或能量参数确定所述音调成分的位置对应的幅度或能量;根据所述当前频率区域中音调成分的位置和所述音调成分的位置对应的幅度或能量获得所述重建高频带信号。
- 根据权利要求16至21任一所述的方法,其特征在于,所述当前频率区域的音调成分的位置参数用于指示所述当前频率区域中包括音调成分的子带的序号。
- 根据权利要求20或21任一所述的方法,其特征在于,所述当前频率区域中音调成分位置位于所述当前频率区域中音调成分所在子带的指定位置。
- 根据权利要求23所述的方法,其特征在于,所述子带的指定位置为子带的中心位置。
- 根据权利要求19至21任一所述的方法,其特征在于,所述根据所述当前频率区域中音调成分的位置和所述音调成分的位置对应的幅度获得所述重建高频带信号包括:根据如下计算式确定音调成分的位置的频域信号:pSpectralData[tone_pos]=tone_val其中,pSpectralData表示所述当前频率区域的重建高频带频域信号,tone_val表示所述当前频率区域内音调成分的位置对应的幅度值,tone_pos表示所述当前频率区域内音调成分的位置。
- 一种音频编码器,其特征在于,包括:信号获取单元,用于获取音频信号的当前帧,所述当前帧包括高频带信号;参数获取单元,用于根据所述高频带信号获得所述当前帧的高频带参数,所述高频带参数用于表示所述高频带信号包括的音调成分的位置、数量以及幅度或能量;编码单元,用于对所述高频带编码参数进行码流复用,以得到编码码流。
- 根据权利要求26所述的音频编码器,其特征在于,所述高频带参数包括音调成分的位置数量参数、以及所述音调成分的幅度参数或能量参数。
- 根据权利要求27所述的音频编码器,其特征在于,所述高频带信号对应的高频带包括至少一个频率区域,一个所述频率区域包括至少一个子带;所述参数获取单元,具体用于:根据所述至少一个频率区域中的当前频率区域的高频带信号,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求28所述的音频编码器,其特征在于,所述音频编码器还包括:确定单元,用于确定所述当前频率区域内是否包括音调成分;所述参数获取单元,具体用于在所述当前频率区域内包括音调成分时,根据所述至少一个频率区域中的当前频率区域的高频带信号,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求29所述的音频编码器,其特征在于,所述当前帧的高频带参数还包括音调成分指示信息,所述音调成分指示信息用于指示所述当前频率区域内是否包括音调成分。
- 根据权利要求28至30任一所述的音频编码器,其特征在于,所述参数获取单元,具体用于:根据所述至少一个频率区域中的当前频率区域的高频带信号在所述当前频率区域内进行峰值搜索,以获得所述当前区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种;根据所述当前频率区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种,确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求31所述的音频编码器,其特征在于,所述参数获取单元,具体用于:根据所述至少一个频率区域中的当前频率区域的功率谱、能量谱或幅度谱中的至少一种在所述当前频率区域内进行峰值搜索,以获得所述当前区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种。
- 根据权利要求31所述的音频编码器,其特征在于,所述参数获取单元,具体用于:根据所述当前频率区域的峰值数量信息、峰值位置信息以及峰值幅度信息中的至少一种,确定所述当前频率区域的音调成分的位置信息,数量信息以及幅度信息;根据所述当前频率区域的音调成分的位置信息,数量信息以及幅度信息确定所述当前频率区域的音调成分的位置数量参数和所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求28至33任一所述所述的音频编码器,其特征在于,所述当前频率区域的音调成分的位置数量参数包括N个比特位,N为所述当前频率区域包括的子带数量,所述N个比特位与所述当前频率区域包括的子带一一对应;其中,若所述当前频率区域包括的第一子带存在音调成分,则所述N个比特位中与所述第一子带对应的比特位的值为第一值;或若所述当前频率区域包括的第二子带不存在音调成分,则所述N个比特位中与所述第二子带对应的比特位的值为第二值,所述第一值与所述第二值不同。
- 根据权利要求26至34任一所述的音频编码器,其特征在于,所述高频带参数还包括所述高频带信号的噪声基底参数。
- 一种音频解码器,其特征在于,包括:接收单元,用于获取编码码流;解复用单元,用于对所述编码码流进行码流解复用,以得到音频信号的当前帧的高频带参数,所述高频带参数用于表示所述当前帧的高频带信号包括的音调成分的位置、数量以及幅度或能量;重建单元,用于根据所述高频带参数获得所述当前帧的重建高频带信号;根据所述当前帧的重建高频带信号获得所述当前帧的音频输出信号。
- 根据权利要求36所述的音频解码器,其特征在于,所述高频带参数包括所述当前 帧的高频信号的音调成分的位置数量参数和所述音调成分的幅度参数或能量参数。
- 根据权利要求37所述的音频解码器,其特征在于,将所述高频带信号对应的高频带包括至少一个频率区域,一个所述频率区域包括至少一个子带;所述高频带参数包括所述当前帧的高频信号的音调成分的位置数量参数包括所述至少一个频率区域各自的音调成分的位置数量参数,所述当前帧的高频信号的音调成分的幅度参数或能量参数包括所述至少一个频率区域各自的音调成分的幅度参数或能量参数。
- 根据权利要求38所述的音频解码器,其特征在于,所述解复用单元,具体用于:获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数;根据所述当前频率区域的音调成分的位置数量参数从所述编码码流中解析所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求39所述的音频解码器,其特征在于,所述解复用单元,具体用于:根据所述当前频率区域的音调成分的位置数量参数,确定所述当前频率区域的音调成分的数量参数;根据所述当前频率区域的音调成分的数量参数,从所述编码码流中解析所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求38所述的音频解码器,其特征在于,所述解复用单元,具体用于:获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数;根据所述当前频率区域的音调成分的位置数量参数,确定当前频率区域的音调成分的位置参数和当前频率区域的音调成分的数量参数;根据所述当前频率区域的音调成分的数量参数从所述编码码流中解析所述当前频率区域的音调成分的幅度参数或能量参数。
- 根据权利要求39至41任一所述的音频解码器,其特征在于,解复用单元,具体用于:获取所述当前频率区域的音调成分指示信息;所述音调成分指示信息用于指示所述当前频率区域内是否包括音调成分;当所述当前频率区域内包括音调成分时,获取所述至少一个频率区域的当前频率区域的音调成分的位置数量参数。
- 根据权利要求39至42任一所述的音频解码器,其特征在于,所述解复用单元,具体用于:根据所述当前频率区域包括的子带数量从所述编码码流中读取N个比特位,所述N个比特位为所述当前频率区域的音调成分的位置数量参数,其中,N为所述当前频率区域包括的子带数量,所述N个比特位与所述当前频率区域包括的子带一一对应。
- 根据权利要求39,40,42和43任一所述的音频解码器,其特征在于,所述重建单元,具体用于:根据所述当前频率区域的音调成分的位置数量参数确定所述当前频率区域中音调成分的位置;根据所述当前频率区域的音调成分的幅度参数或能量参数确定所述音调成分的位置对应的幅度或能量;根据所述当前频率区域中音调成分的位置和所述音调成分的位置对应的幅度或能量获 得所述重建高频带信号。
- 根据权利要求44所述的音频解码器,其特征在于,所述重建单元,具体用于:根据所述当前频率区域的高频信号的音调成分的位置数量参数,确定所述当前频率区域的音调成分的位置参数;根据所述当前频率区域的音调成分的位置参数,确定所述当前频率区域中音调成分位置。
- 根据权利要求41至43任一所述的音频解码器,其特征在于,所述重建单元,具体用于:根据所述当前频率区域的音调成分的位置参数,确定所述当前频率区域中音调成分位置;根据所述当前频率区域的音调成分的幅度参数或能量参数确定所述音调成分的位置对应的幅度或能量;根据所述当前频率区域中音调成分的位置和所述音调成分的位置对应的幅度或能量获得所述重建高频带信号。
- 根据权利要求41至46任一所述的音频解码器,其特征在于,所述当前频率区域的音调成分的位置参数用于指示所述当前频率区域中包括音调成分的子带的序号。
- 根据权利要求45或46任一所述的音频解码器,其特征在于,所述当前频率区域中音调成分位置位于所述当前频率区域中音调成分所在子带的指定位置。
- 根据权利要求48所述的音频解码器,其特征在于,所述子带的指定位置为子带的中心位置。
- 根据权利要求44至49任一所述的音频解码器,其特征在于,所述根据所述当前频率区域中音调成分的位置和所述音调成分的位置对应的幅度获得所述重建高频带信号包括:根据如下计算式确定音调成分的位置的频域信号:pSpectralData[tone_pos]=tone_val其中,pSpectralData表示所述当前频率区域的重建高频带频域信号,tone_val表示所述当前频率区域内音调成分的位置对应的幅度值,tone_pos表示所述当前频率区域内音调成分的位置。
- 一种计算机可读存储介质,包括指令,当其在计算机上运行时,使得计算机执行如权利要求1至25任意一项所述的方法。
- 一种音频编码设备,其特征在于,包括至少一个处理器,所述至少一个处理器用于与存储器耦合,读取并执行所述存储器中的指令,以实现如权利要求1至10任一所述的方法。
- 一种音频解码设备,其特征在于,包括至少一个处理器,所述至少一个处理器用于与存储器耦合,读取并执行所述存储器中的指令,以实现如权利要求11至15中任一项所述的方法。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP21740645.3A EP4080503A4 (en) | 2020-01-13 | 2021-01-12 | AUDIO ENCODING AND DECODING METHODS AND AUDIO ENCODING AND DECODING DEVICES |
KR1020227026986A KR20220117340A (ko) | 2020-01-13 | 2021-01-12 | 오디오 인코딩 및 디코딩 방법과 오디오 인코딩 및 디코딩 디바이스 |
JP2022542159A JP2023509201A (ja) | 2020-01-13 | 2021-01-12 | オーディオ符号化及び復号方法、並びにオーディオ符号化及び復号デバイス |
US17/862,712 US11887610B2 (en) | 2020-01-13 | 2022-07-12 | Audio encoding and decoding method and audio encoding and decoding device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010033973.0A CN113192517B (zh) | 2020-01-13 | 一种音频编解码方法和音频编解码设备 | |
CN202010033973.0 | 2020-01-13 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/862,712 Continuation US11887610B2 (en) | 2020-01-13 | 2022-07-12 | Audio encoding and decoding method and audio encoding and decoding device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021143691A1 true WO2021143691A1 (zh) | 2021-07-22 |
Family
ID=76863583
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/071327 WO2021143691A1 (zh) | 2020-01-13 | 2021-01-12 | 一种音频编解码方法和音频编解码设备 |
Country Status (5)
Country | Link |
---|---|
US (1) | US11887610B2 (zh) |
EP (1) | EP4080503A4 (zh) |
JP (1) | JP2023509201A (zh) |
KR (1) | KR20220117340A (zh) |
WO (1) | WO2021143691A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113808596A (zh) * | 2020-05-30 | 2021-12-17 | 华为技术有限公司 | 一种音频编码方法和音频编码装置 |
CN113808597A (zh) * | 2020-05-30 | 2021-12-17 | 华为技术有限公司 | 一种音频编码方法和音频编码装置 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008096567A (ja) * | 2006-10-10 | 2008-04-24 | Matsushita Electric Ind Co Ltd | オーディオ符号化装置およびオーディオ符号化方法ならびにプログラム |
US20080312912A1 (en) * | 2007-06-12 | 2008-12-18 | Samsung Electronics Co., Ltd | Audio signal encoding/decoding method and apparatus |
CN102750954A (zh) * | 2007-04-30 | 2012-10-24 | 三星电子株式会社 | 对高频带编码和解码的方法和设备 |
CN104584124A (zh) * | 2013-01-22 | 2015-04-29 | 松下电器产业株式会社 | 带宽扩展参数生成装置、编码装置、解码装置、带宽扩展参数生成方法、编码方法、以及解码方法 |
CN105408957A (zh) * | 2013-06-11 | 2016-03-16 | 松下电器(美国)知识产权公司 | 进行语音信号的频带扩展的装置及方法 |
US10224048B2 (en) * | 2016-12-27 | 2019-03-05 | Fujitsu Limited | Audio coding device and audio coding method |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08162963A (ja) * | 1994-11-30 | 1996-06-21 | Sony Corp | データ符号化装置および復号装置 |
CN1163870C (zh) | 1996-08-02 | 2004-08-25 | 松下电器产业株式会社 | 声音编码装置和方法,声音译码装置,以及声音译码方法 |
JP2003233395A (ja) | 2002-02-07 | 2003-08-22 | Matsushita Electric Ind Co Ltd | オーディオ信号の符号化方法及び装置、並びに符号化及び復号化システム |
JP4736812B2 (ja) * | 2006-01-13 | 2011-07-27 | ソニー株式会社 | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
WO2008007698A1 (fr) | 2006-07-12 | 2008-01-17 | Panasonic Corporation | Procédé de compensation des pertes de blocs, appareil de codage audio et appareil de décodage audio |
US20080027012A1 (en) | 2006-07-24 | 2008-01-31 | Heejin Kim | Bridged carbamate macrolides |
CN102194458B (zh) | 2010-03-02 | 2013-02-27 | 中兴通讯股份有限公司 | 频带复制方法、装置及音频解码方法、系统 |
CN103098130B (zh) | 2010-10-06 | 2014-11-26 | 松下电器产业株式会社 | 编码装置、解码装置、编码方法以及解码方法 |
EP3220390B1 (en) * | 2012-03-29 | 2018-09-26 | Telefonaktiebolaget LM Ericsson (publ) | Transform encoding/decoding of harmonic audio signals |
EP2830063A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus, method and computer program for decoding an encoded audio signal |
FR3017484A1 (fr) * | 2014-02-07 | 2015-08-14 | Orange | Extension amelioree de bande de frequence dans un decodeur de signaux audiofrequences |
BR112016020988B1 (pt) * | 2014-03-14 | 2022-08-30 | Telefonaktiebolaget Lm Ericsson (Publ) | Método e codificador para codificação de um sinal de áudio, e, dispositivo de comunicação |
PL3174050T3 (pl) * | 2014-07-25 | 2019-04-30 | Fraunhofer Ges Forschung | Urządzenie do kodowania sygnałów audio, urządzenie do dekodowania sygnałów audio i ich sposoby |
ES2823250T3 (es) | 2014-07-25 | 2021-05-06 | Fraunhofer Ges Forschung | Aparato de codificación de señal de audio, dispositivo de decodificación de señal de audio y métodos del mismo |
EP2980792A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for generating an enhanced signal using independent noise-filling |
CN113593586A (zh) * | 2020-04-15 | 2021-11-02 | 华为技术有限公司 | 音频信号编码方法、解码方法、编码设备以及解码设备 |
-
2021
- 2021-01-12 JP JP2022542159A patent/JP2023509201A/ja active Pending
- 2021-01-12 KR KR1020227026986A patent/KR20220117340A/ko active Search and Examination
- 2021-01-12 WO PCT/CN2021/071327 patent/WO2021143691A1/zh unknown
- 2021-01-12 EP EP21740645.3A patent/EP4080503A4/en active Pending
-
2022
- 2022-07-12 US US17/862,712 patent/US11887610B2/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008096567A (ja) * | 2006-10-10 | 2008-04-24 | Matsushita Electric Ind Co Ltd | オーディオ符号化装置およびオーディオ符号化方法ならびにプログラム |
CN102750954A (zh) * | 2007-04-30 | 2012-10-24 | 三星电子株式会社 | 对高频带编码和解码的方法和设备 |
US20080312912A1 (en) * | 2007-06-12 | 2008-12-18 | Samsung Electronics Co., Ltd | Audio signal encoding/decoding method and apparatus |
CN104584124A (zh) * | 2013-01-22 | 2015-04-29 | 松下电器产业株式会社 | 带宽扩展参数生成装置、编码装置、解码装置、带宽扩展参数生成方法、编码方法、以及解码方法 |
CN105408957A (zh) * | 2013-06-11 | 2016-03-16 | 松下电器(美国)知识产权公司 | 进行语音信号的频带扩展的装置及方法 |
US10224048B2 (en) * | 2016-12-27 | 2019-03-05 | Fujitsu Limited | Audio coding device and audio coding method |
Non-Patent Citations (1)
Title |
---|
See also references of EP4080503A4 |
Also Published As
Publication number | Publication date |
---|---|
KR20220117340A (ko) | 2022-08-23 |
EP4080503A4 (en) | 2023-05-03 |
JP2023509201A (ja) | 2023-03-07 |
CN113192517A (zh) | 2021-07-30 |
US20220343926A1 (en) | 2022-10-27 |
US11887610B2 (en) | 2024-01-30 |
EP4080503A1 (en) | 2022-10-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7937271B2 (en) | Audio decoding using variable-length codebook application ranges | |
JP5162588B2 (ja) | 音声符号化システム | |
US10089997B2 (en) | Method for predicting high frequency band signal, encoding device, and decoding device | |
WO2021143694A1 (zh) | 一种音频编解码方法和音频编解码设备 | |
WO2021143692A1 (zh) | 一种音频编解码方法和音频编解码设备 | |
JP2012238034A (ja) | マルチチャンネルオーディオ信号復号化方法 | |
JP2001094433A (ja) | サブバンド符号化・復号方法 | |
WO2021143691A1 (zh) | 一种音频编解码方法和音频编解码设备 | |
US9799339B2 (en) | Stereo audio signal encoder | |
AU2015235133B2 (en) | Audio decoding device, audio encoding device, audio decoding method, audio encoding method, audio decoding program, and audio encoding program | |
CN100489964C (zh) | 音频解码 | |
EP2610867A1 (en) | Audio reproducing device and audio reproducing method | |
WO2021213128A1 (zh) | 音频信号编码方法和装置 | |
US20220335961A1 (en) | Audio signal encoding method and apparatus, and audio signal decoding method and apparatus | |
CN113192517B (zh) | 一种音频编解码方法和音频编解码设备 | |
WO2022012677A1 (zh) | 音频编解码方法和相关装置及计算机可读存储介质 | |
WO2021139757A1 (zh) | 一种音频编解码方法和音频编解码设备 | |
WO2021136343A1 (zh) | 音频信号的编解码方法和编解码装置 | |
WO2024021732A1 (zh) | 音频编解码方法、装置、存储介质及计算机程序产品 | |
WO2022141658A1 (zh) | 在lc3音频码流中添加额外信息的方法、系统及介质 | |
CN115881139A (zh) | 编解码方法、装置、设备、存储介质及计算机程序 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21740645 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2022542159 Country of ref document: JP Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2021740645 Country of ref document: EP Effective date: 20220721 |
|
ENP | Entry into the national phase |
Ref document number: 20227026986 Country of ref document: KR Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |