CN111768793B - LC3 audio encoder coding optimization method, system and storage medium - Google Patents

LC3 audio encoder coding optimization method, system and storage medium Download PDF

Info

Publication number
CN111768793B
CN111768793B CN202010665658.XA CN202010665658A CN111768793B CN 111768793 B CN111768793 B CN 111768793B CN 202010665658 A CN202010665658 A CN 202010665658A CN 111768793 B CN111768793 B CN 111768793B
Authority
CN
China
Prior art keywords
audio
coding
encoding
audio encoder
encoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010665658.XA
Other languages
Chinese (zh)
Other versions
CN111768793A (en
Inventor
李强
王尧
叶东翔
朱勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Barrot Wireless Co Ltd
Original Assignee
Barrot Wireless Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Barrot Wireless Co Ltd filed Critical Barrot Wireless Co Ltd
Priority to CN202010665658.XA priority Critical patent/CN111768793B/en
Publication of CN111768793A publication Critical patent/CN111768793A/en
Application granted granted Critical
Publication of CN111768793B publication Critical patent/CN111768793B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Abstract

The application discloses a coding optimization method, a system, a storage medium and equipment of an LC3 audio coder, and belongs to the technical field of audio coding. The LC3 audio encoder coding optimization method comprises the following steps: filtering the first coded audio in the bass enhancement channel to obtain second coded audio; reducing and adjusting the coding rate of the LC3 audio coder; in the LC3 audio encoder, the second encoded audio is modified according to the adjusted encoding rate, where the modified encoding includes: setting a first audio coding parameter according to the adjusted coding code rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in the LC3 audio encoder according to the first audio encoding parameters. The application of the application can effectively reduce the coding rate of the LC3 audio coder, reduce the operand in the coding process and further reduce the power consumption of the LC3 audio coder.

Description

LC3 audio encoder coding optimization method, system and storage medium
Technical Field
The application relates to the technical field of audio coding, in particular to a coding optimization method, a coding optimization system, a storage medium and a device of an LC3 audio coder.
Background
In the prior art, the mainstream bluetooth audio encoder includes: the SBC audio encoder is most widely used according to the mandatory requirements of the A2DP protocol, and all Bluetooth audio devices are supported, but the tone quality is general; the AAC-LC audio encoder has good tone quality and wide application, a plurality of mainstream mobile phones support, but compared with the SBC audio encoder, the memory occupation is large, the operation complexity is high, a plurality of Bluetooth devices are based on embedded platforms, the battery capacity is limited, the operation capability of a processor is poor, and the memory is limited; the audio encoder of the apt X series has better tone quality, but the code rate is very high, apt X needs 384kbps of code rate, and the code rate of apt X-HD is 576kbps, and is the exclusive technology of the high pass, relatively closed; the LDAC audio encoder has better tone quality, but the code rates are also very high, namely 330kbps,660kbps and 990kbps, and the stable support of such high code rate is difficult due to the special complex wireless environment in which the Bluetooth device is positioned, and the technology which is unique to Sony is also very closed.
For the above reasons, the Bluetooth international union Bluetooth Sig has been introduced by a number of manufacturers in combination with LC3 audio encoders, which have the advantages of low delay, high sound quality and coding gain, and no patent fee in the Bluetooth field, and have been paid attention by the manufacturers. Since the LC3 audio encoder is originally proposed to meet the audio application in the bluetooth low energy field, the power consumption requirements are very strict.
The existing audio coding channels include a common audio channel and a bass enhancement channel, and in the prior art, the common audio channel and the audio in the bass enhancement channel are coded by a standard coding flow of full-band coding. However, the encoded audio in the bass enhancement channel is narrower in frequency band, and if the standard code rate is used as full-band encoding, precious bandwidth resources are wasted, and the operation amount of the audio encoder is increased, resulting in unnecessary power consumption. In addition, the bandwidth detection module of the LC3 audio encoder can only detect the bandwidth of 4kHz at the lowest, so that the detected bandwidth of the encoded audio in the bass enhancement channel is not consistent with the actual bandwidth, the normal operation of other modules in the LC3 audio encoding is affected, and finally, the frequency spectrum leakage is caused.
Disclosure of Invention
Aiming at the technical problems in the prior art, the application provides a coding optimization method, a system, a storage medium and equipment of an LC3 audio coder.
In one aspect of the present application, there is provided an LC3 audio encoder coding optimization method, including: filtering the first coded audio in the bass enhancement channel to obtain second coded audio; reducing and adjusting the coding rate of the LC3 audio coder; in the LC3 audio encoder, the second encoded audio is modified according to the adjusted encoding rate, where the modified encoding includes: setting a first audio coding parameter according to the adjusted coding code rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in the LC3 audio encoder according to the first audio encoding parameters.
In another aspect of the present application, there is provided an LC3 audio encoder coding optimization system, comprising: the filtering module is used for filtering the first coded audio in the bass enhancement channel to obtain second coded audio; the code rate adjusting module is used for adjusting the coding code rate of the LC3 audio coder; and an improved encoding module for improved encoding of the second encoded audio according to the adjusted encoding rate, the improved encoding comprising: setting a first audio coding parameter according to the adjusted coding code rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in the LC3 audio encoder according to the first audio encoding parameters.
In another aspect of the present application, a computer readable storage medium is provided, in which computer executable instructions are stored, where the computer executable instructions when executed by a processor implement the LC3 audio encoder coding optimization method in the first aspect.
In another aspect of the present application, there is provided an apparatus comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the LC3 audio encoder coding optimization method of scheme one
The beneficial effects of the application are as follows: when the method is applied, the code rate of audio coding can be effectively reduced in the coding process, unnecessary coding flow is reduced, the operand in the audio coding process is reduced, the coding speed is increased, and the power consumption of the LC3 audio coder is reduced.
Drawings
FIG. 1 is a flow chart of one embodiment of the LC3 audio encoder code optimization method of the present application;
FIG. 2 is a flow diagram of an embodiment of the improved encoding flow of the LC3 audio encoder according to the present application;
FIG. 3 is a flow chart of an embodiment of the LC3 audio encoder code optimization method of the present application;
fig. 4 is a schematic diagram of one embodiment of the LC3 audio encoder coding optimization system of the present application.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present application more apparent, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application, and it is apparent that the described embodiments are some embodiments of the present application, but not all embodiments of the present application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
The terms "first," "second," "third," "fourth," and the like in the description, in the claims, in the abstract and in the drawings, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the application described herein may be implemented, for example, in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
Aiming at the tone quality problem, the operation complexity and the technical unique problem of a plurality of audio encoders, the Bluetooth International Union Sig is combined with a plurality of manufacturers to provide an LC3 audio encoder which has the advantages of lower delay, higher tone quality, higher coding gain, no patent fee in the Bluetooth field and the like, and is paid attention to by the vast manufacturers. Since the LC3 audio encoder is originally proposed to meet the audio application in the bluetooth low energy field, the power consumption requirements are very strict.
The existing audio coding channels comprise a common audio channel and a bass enhancement channel, and in the prior art, all the audio in the common audio channel and the bass enhancement channel are coded by a standard coding flow of full-band coding. However, the encoded audio in the bass enhancement channel is narrower in frequency band, and if the standard code rate is used as full-band encoding, precious bandwidth resources are wasted, and the operation amount of the audio encoder is increased, resulting in unnecessary power consumption. In addition, the bandwidth detection module of the LC3 audio encoder can only detect the bandwidth of 4kHz at the lowest, so that the detected bandwidth of the encoded audio in the bass enhancement channel is not consistent with the actual bandwidth, the normal operation of other modules in the LC3 audio encoding is affected, and finally, the frequency spectrum leakage is caused.
In order to solve the problems, the application provides a method, a system, a storage medium and equipment for optimizing the coding of an LC3 audio coder.
Fig. 1 is a flow chart of an embodiment of the LC3 audio encoder coding optimization method of the present application, in which the audio coding optimization method of the present application includes: step S01, filtering the first coded audio in the bass enhancement channel to obtain second coded audio; step S102, reducing and adjusting the coding rate of an LC3 audio coder; and step S103, in the LC3 audio encoder, carrying out improved encoding on the second encoded audio according to the adjusted encoding code rate, wherein the improved encoding comprises: setting a first audio coding parameter according to the adjusted coding code rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in the LC3 audio encoder according to the first audio encoding parameters.
The LC3 audio encoder coding optimization method of the present application is described below with reference to fig. 1 and the specific embodiment.
In the specific embodiment shown in fig. 1, the LC3 audio encoder coding optimization method of the present application includes step S101: and filtering the first coded audio in the bass enhancement channel to obtain second coded audio. And filtering the impurity signals in the first coded audio through filtering processing, so that when the coded audio is coded, the coding accuracy is improved, and the power consumption of the LC3 audio coder is reduced.
In a specific embodiment of the present application, the audio signal greater than the first cut-off frequency in the first encoded audio is filtered by low-pass filtering. In this particular embodiment, the second encoded audio is obtained by low-pass filtering the first encoded audio in the bass enhancement channel with a low-pass filter by setting the first cut-off frequency. In the step, the deviation audio signal or impurity signal in the first coded audio is filtered through low-pass filtering processing, so that the coding accuracy is improved.
In one example of the present application, the representation of the channels is typically 6 channels, 5.1 channels, etc. Wherein the ".1" channel of the 5.1 channels represents a bass enhancement channel. According to different audio standard specifications, the frequency band range of the bass enhancement channel is different, wherein according to the Dolby AC-3 standard, the frequency band range of the bass enhancement channel is 20Hz-120Hz; according to the DTS standard, the frequency band range of the bass enhancement sound channel is 20Hz-80Hz. The first encoded audio is low pass filtered by selecting an appropriate cut-off frequency according to different bass enhancement channel audio standards.
In this example, when the Dolby AC-3 standard is selected, the cutoff frequency may be selected to be 120Hz; when the DTS standard is selected, the cut-off frequency may be chosen to be 80Hz. The specific value of the cut-off frequency can be determined according to the relevant parameters of the actual encoded audio or other encoding requirements, and the application is not particularly limited with respect to the specific selection of the cut-off frequency.
And filtering the deviation audio signals or impurity signals in the first encoded audio through filtering the first encoded audio, so that the encoding accuracy is improved, the number of audio signals in the first encoded audio is reduced, and the power consumption of the LC3 audio encoder is further reduced.
In the specific embodiment shown in fig. 1, the LC3 audio encoder coding optimization method of the present application includes step S102: and reducing and adjusting the coding rate of the LC3 audio coder. In the bass enhancement channel, the frequency band of the audio coding signal is narrower, if coding is performed according to the full-band standard coding flow of the LC3 audio coder, coding resources are wasted, so that the power consumption of the LC3 audio coder is increased, and the coding effect of the coded audio is adversely affected, such as overflow. Therefore, when the encoded audio is encoded in the bass enhancement channel, the encoding code rate needs to be adjusted according to the characteristics of the frequency band and the like of the encoded audio, so that the encoding resource of the LC3 audio encoder is saved, and the power consumption of the LC3 audio encoder is reduced.
In a specific embodiment of the present application, the adjusted coding rate of the LC3 audio encoder is set to be no greater than half of the original coding rate of the LC3 audio encoder, where the original coding rate is the coding rate at which the LC3 audio encoder encodes the encoded audio in the non-bass enhancement channel. When coding the coded audio in the bass enhancement channel, the coding rate of the LC3 audio coder is adjusted to be half or less of the original coding rate of the LC3 audio coder, and the power consumption of the LC3 audio coder is reduced through the reduction of the coding rate.
In this particular embodiment, there is a relatively large difference in the second encoded audio in the bass-enhanced channel from the encoded audio in the other non-bass-enhanced channels. Wherein the second encoded audio in the bass enhancement channel is narrower in frequency band. For example, under the DTS standard, the frequency band of the encoded audio in the bass enhancement channel ranges from 20Hz to 80Hz, which is much smaller than the frequency bands of the encoded audio in the other audio channels. The coding rate can be adjusted to be lower when the LC3 audio encoder encodes the encoded audio in the bass enhancement channel. Preferably, the encoding rate of the LC3 audio encoder may be adjusted to be half or less of the original encoding rate, for example, when the original rate is 125kbps, the rate may be adjusted to be 62.5kpbs or less. The specific value for the code rate adjustment may be determined based on the coding requirements of the audio encoder or the relevant characteristics of the encoded audio. .
In the specific embodiment shown in fig. 1, the LC3 audio encoder coding optimization method of the present application includes step S103: in the LC3 audio encoder, the second encoded audio is modified according to the adjusted encoding rate, where the modified encoding includes: setting a first audio coding parameter according to the adjusted coding code rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in the LC3 audio encoder according to the first audio encoding parameters.
In this embodiment, when encoding the encoded audio in the bass enhancement channel, after step S102, the encoding rate of the LC3 audio encoder is adjusted to be reduced. Preferably, the adjusted coding rate is set to be half or less than half of the LC3 original coding rate. The code rate adjustment is based on the following steps: in the encoding process of the LC3 audio encoder, for the minimum bandwidth configuration such as 4kHz, the recommended code rate is 24 kbps-28 kbps when encoding. The bandwidth of the bass enhancement channel is about 80-120 hz, and in order to ensure the tone quality of the bass enhancement channel and save the code rate, it is recommended that the code rate of the bass enhancement channel is 24 kbps-64 kbps, and when the audio in other non-bass enhancement channels is encoded, the encoding code rate is about 125kbps. Therefore, when the encoded audio in the bass enhancement channel is encoded, the adjusted encoding rate is set to be half or less than half of the original encoding rate of LC3, and in addition, the configuration of the encoding rate can also reduce quantization noise to the maximum extent.
The second encoded audio is then encoded in an LC3 audio encoder. The encoded audio in the bass-enhanced channel differs significantly from the encoded audio in other non-bass-enhanced channels, e.g. the encoded audio in the bass-enhanced channel is narrower in frequency band. And setting a first audio coding parameter in the LC3 audio coder according to the reduced coding rate of the adjusted LC3 audio coder and/or the characteristic of the coded audio in the bass enhancement channel, and then adjusting the coding state of a coding module in the LC3 audio coder according to the set first audio coding parameter so as to code a second coded audio in the bass enhancement channel.
In a specific embodiment of the present application, the first audio coding parameters include: an attack detection parameter, wherein the attack detection parameter is an attack flag of each frame of encoded audio, and the attack detection parameter is set to 0 according to the sampling rate of the second encoded audio and the adjusted encoding code rate, so that the execution process of the attack detection module is skipped in the improved encoding process; and a bandwidth parameter indicating a bandwidth characteristic of the encoded audio of each frame, the bandwidth parameter being set to 0 according to the bandwidth characteristic of the second encoded audio such that an execution process of the bandwidth detection module is skipped in the modified encoding process.
In this particular embodiment, the LC3 audio encoder includes an attack detection module, where there are two conditions under which the attack detection is performed. Wherein the first condition is when the sampling rate of the encoded audioWhen the corresponding LC3 audio encoder encodes one frame, the total number of bytes which can be used is more than 80; the second condition is when the sampling rate of the encoded audio is +.>When the corresponding LC3 audio encoder encodes one frame, the total number of bytes that can be used is greater than or equal to 100. When the LC3 audio encoder encodes the encoded audio of the bass enhancement channel, step S102 is performed first, and the encoding rate of the LC3 audio encoder is adjusted in a reduced manner, so that the attock detection process of the attock detection module is not required when the encoded audio of the bass enhancement channel is encoded. For example, when the encoded audio is 10ms frame length, if the sampling rate is 48KHz, the total number of bytes that can be used when encoding audio for an atack detection needs to be in line with one frame of an LC3 audio encoder is greater than or equal to 100. Wherein, the total number of bytes which can be used in one frame coding is 100, and the corresponding code rate condition is 80kbps. Because the rate of the LC3 audio encoder has fallen below 80kpbs after the rate reduction adjustment by step S102, the atack check is no longer metThe test module tests the test conditions, thus the test parameters, i.e. the test flag of each frame +.>And the value is set to 0, wherein k represents a frame number, so that in the improved coding process of the LC3 audio coder, an attach detection process of an attach detection module is skipped, the operand of the LC3 audio coder is reduced, and the power consumption of the LC3 audio coder is further reduced.
In this particular embodiment, the LC3 audio encoder includes a bandwidth detection module. The bandwidth detection module is used for detecting the bandwidth of the coded audio, and the minimum value of the detected bandwidth is 4KHz. When the LC3 audio encoder encodes the encoded audio in the bass-enhancement channel, the encoded audio in the bass-enhancement channel has a very narrow frequency band. For example, under the Dolby AC-3 standard, the frequency band of the encoded audio in the bass-enhancement channel is 20Hz-120Hz; under the DTS standard, the frequency band range of the encoded audio in the bass-enhancement channel is 20Hz-80Hz. Therefore, the frequency band range of the encoded audio in the bass enhancement channel is far smaller than the minimum value 4KHz of the bandwidth detected by the bandwidth detection module, and therefore, in the improved encoding process of the LC3 audio encoder, the bandwidth parameter is set to 0, so that the bandwidth detection process of the bandwidth detection module is skipped, the operation amount of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is further reduced.
When the LC3 audio encoder carries out improved encoding on the encoded audio in the bass enhancement channel, corresponding first audio encoding parameters are set according to the frequency band characteristics of the encoded audio in the bass enhancement channel and/or the adjusted encoding code rate reduced by the LC3 audio encoder, so that the LC3 audio encoding process is adjusted, unnecessary modules or processes are skipped, the operation amount of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is further reduced.
In a specific embodiment of the application, the improved encoding of the LC3 audio encoder further comprises: the second audio coding parameter is set according to the characteristics of the bass boost channel. Wherein the second audio encoding parameters include: an output flag parameter, which is an output flag parameter of a long-term post-filter module in the LC3 audio encoder, the output flag parameter being set to 0 such that an execution process of the long-term post-filter module is skipped in the improved encoding process; the reflection coefficient is a reflection coefficient of a time domain noise shaping module in the LC3 audio encoder, and the reflection coefficient is set to 0, so that the execution process of the time domain noise shaping module is skipped in the improved encoding process; and other parameters including a setting parameter of a noise estimation module in the LC3 audio encoder, wherein the setting parameter of the noise estimation module is set to 7 such that an execution process of the noise estimation module is skipped in the modified encoding process.
In this specific embodiment, the bass enhancement channel performs multi-channel optimization enhancement processing on the audio when the whole audio channel is in action, so that a user experiences an "immersive" sound effect, therefore, when the encoded audio in the bass enhancement channel is encoded, the output flag parameter of the long-term post-filter module in the LC3 audio encoder can be set to 0, and the corresponding workflow of the resampling module and the long-term post-filter module is no longer performed on the encoded audio in the bass enhancement channel; the reflection coefficient of the time domain noise shaping module in the LC3 audio encoder is set to be 0, so that the encoded audio in the bass enhancement channel does not carry out the corresponding workflow of the time domain noise shaping module, and meanwhile, the occurrence of spectrum leakage during encoding of the encoded audio in the bass enhancement channel is avoided, and the final encoding result is not influenced well. In addition, the second audio coding parameters also include other parameters. Wherein the other parameters include a noise block parameter of a noise estimation block in the LC3 audio encoder, noise filling when the LC3 audio encoder encodes encoded audio of the bass enhancement channel can be avoided by setting the noise block parameter of the noise estimation block to 7.
When the LC3 audio encoder carries out improved coding on the coded audio in the bass enhancement channel, the second audio coding parameters are set, the LC3 audio coding process is adjusted, unnecessary modules or processes are skipped, the operation amount of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is further reduced.
In a specific embodiment of the present application, the audio coding optimization method of the present application further includes, before step S101, step S102, and step S103: and judging whether a bass enhancement channel exists in the audio channels, and when the bass enhancement channel does not exist, performing LC3 audio encoder standard encoding on the encoded audio, and when the bass enhancement channel exists, encoding the encoded audio by using an LC3 audio encoder encoding optimization method, wherein the LC3 audio encoder standard encoding is conventional encoding on the encoded audio by an LC3 audio encoder. Wherein the bass enhancement channels include, but are not limited to, 5.1 channels or 2.1 channels, etc., the related methods of the present application can be performed as long as they belong to the bass enhancement channels.
Fig. 2 is a flow chart of an embodiment of the LC3 audio encoder enhancement encoding of the present application. As shown in fig. 2, in the LC3 audio encoder, the modified encoding flow is composed of a low-delay modified discrete cosine transform step, a transform domain noise shaping step, a quantization step, an arithmetic encoding and residual encoding step, and a code stream encapsulation step. Compared with the standard coding flow of the LC3 audio coder in fig. 2, the improved coding flow of the LC3 audio coder omits the workflow of the attach detection, bandwidth detection, long-term post-filter module, resampling module, noise estimation module and time-domain noise shaping module. By skipping unnecessary modules or processes, the operand of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is further reduced.
When the LC3 audio encoder carries out improved encoding on the encoded audio in the bass enhancement channel, unnecessary encoding flow is omitted, so that the power consumption of the LC3 audio encoder in the LC3 audio encoding process is reduced, and the service time of the LC3 audio encoder is prolonged. The method has the advantages that the encoded audio in the bass enhancement sound channel is encoded by using the proper code rate through updating the encoding code rate, so that the waste of the code rate in the encoding process is avoided, in addition, the encoded audio is encoded by the improved encoding process, the unnecessary encoding process in the LC3 audio encoder is reduced, the operand is reduced, and the power consumption of the audio encoder is reduced.
Fig. 3 shows a specific example of the LC3 audio encoder coding optimization method of the present application. As shown in fig. 3, when the LC3 audio encoder coding optimization method of the present application is executed, it is first determined whether a bass enhancement channel exists, and when no bass enhancement channel exists, the coding is performed using the coding flow of the standard LC3 audio encoder, and the specific coding flow is the standard coding flow of the LC3 audio encoder shown in the left drawing in fig. 2. When the bass enhancement channel is present, the encoded audio signal is first low pass filtered. Second, the coding rate is adjusted because of the specificity of the bass enhancement channel, and when the audio in the bass enhancement channel is encoded, the coding rate can be adjusted to be half or less than half of the original coding rate of the LC3 audio encoder. And through the reduction and adjustment of the code rate, the coding power consumption of the LC3 audio coder is reduced, and the service time of the LC3 audio coder is prolonged. And finally, performing the coding flow of the improved LC3 audio coder on the coded audio in the bass enhancement channel by using the adjusted coding rate. The coding flow of the specific improved LC3 audio coder is shown in the right-hand drawing of fig. 2.
By using the LC3 audio encoder coding optimization method, the coded audio in the bass enhancement sound channel is coded by adopting a proper coding rate, so that unnecessary code rate waste is avoided, and the power consumption of the LC3 audio encoder is reduced; the coding process of the improved LC3 audio coder is used for coding the coded audio in the bass enhancement channel, so that unnecessary coding processes are reduced, the coding load of the LC3 audio coder is reduced, the power consumption of the LC3 audio coder is reduced, and the service time of the LC3 audio coder is prolonged.
Fig. 4 shows an embodiment of the LC3 audio encoder coding optimization system of the present application. As shown in fig. 4, the LC3 audio encoder coding optimization system includes:
the filtering module is used for filtering the first coded audio in the bass enhancement channel to obtain second coded audio; the code rate adjusting module is used for adjusting the coding code rate of the LC3 audio coder; and an improved encoding module for improved encoding of the second encoded audio according to the adjusted encoding rate, the improved encoding comprising: setting a first audio coding parameter according to the adjusted coding code rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in the LC3 audio encoder according to the first audio encoding parameters.
In a specific embodiment of the present application, the LC3 audio encoder coding optimization system of the present application further includes a channel detection module that determines whether a bass enhancement channel is present in the audio channels, performs LC3 audio encoder standard coding on the encoded audio when the bass enhancement channel is not present, and uses the LC3 audio encoder coding optimization method to encode the encoded audio when the bass enhancement channel is present, where the LC3 audio encoder standard coding is a conventional coding of the encoded audio by the LC3 audio encoder.
In a specific embodiment of the present application, in the filtering module, an audio signal in the first encoded audio that is greater than the first cut-off frequency is filtered out. And the filtering module is used for filtering impurity signals in the first coded audio, so that when the first coded audio is coded, the coding accuracy is improved, and the power consumption of the LC3 audio coder is reduced.
In a specific embodiment of the present application, in the code rate adjustment module, the number of encoded bytes of the LC3 audio encoder is adjusted according to the frequency band characteristic of the second encoded audio, and the encoding code rate of the LC3 audio encoder is updated according to the adjusted number of encoded bytes.
The encoded audio in the bass enhancement channel is encoded by updating the encoding rate and using a proper code rate, so that the waste of the code rate in the encoding process of the LC3 audio encoder is avoided; in addition, the improved coding flow of the LC3 audio coder is used for coding the coded audio, so that unnecessary coding processes in the audio coder are reduced, the operation amount of the audio coder is reduced, and the power consumption is reduced.
In a specific embodiment of the present application, this embodiment also provides a computer-readable storage medium having stored therein computer-executable instructions that, when executed by at least one processor of a user equipment, perform the various possible methods described above.
Among them, computer-readable media include computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. In addition, the ASIC may reside in a user device. The processor and the storage medium may reside as discrete components in a communication device.
Those skilled in the art will appreciate that: all or part of the steps for implementing the method embodiments described above may be performed by hardware associated with program instructions. The foregoing program may be stored in a computer readable storage medium. The program, when executed, performs steps including the method embodiments described above; and the aforementioned storage medium includes: various media that can store program code, such as ROM, RAM, magnetic or optical disks.
In a specific embodiment of the present application, this embodiment also provides an apparatus comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the various possible methods described above.
The foregoing description is only illustrative of the present application and is not intended to limit the scope of the application, and all equivalent structural changes made by the present application and the accompanying drawings, or direct or indirect application in other related technical fields, are included in the scope of the present application.

Claims (10)

1. A method for optimizing the encoding of an LC3 audio encoder, comprising:
filtering the first coded audio in the bass enhancement channel to obtain second coded audio;
reducing and adjusting the coding rate of the LC3 audio coder;
in the LC3 audio encoder, performing modified encoding on the second encoded audio according to the adjusted encoding rate, the modified encoding including:
setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio;
encoding the second encoded audio in the LC3 audio encoder according to the first audio encoding parameters, wherein the first audio encoding parameters include:
and the attack detection parameter is an attack mark of each frame of encoded audio of an attack detection module in the LC3 audio encoder, and is determined according to the sampling rate of the second encoded audio and the adjusted encoding code rate, so that the execution process of the attack detection module is skipped in the improved encoding process.
2. The LC3 audio encoder coding optimization method of claim 1, wherein the first audio encoding parameters comprise:
and the bandwidth parameter is the bandwidth parameter of the audio coded by each frame of the bandwidth detection module in the LC3 audio coder, and is determined according to the frequency band characteristic of the second coded audio.
3. The LC3 audio encoder coding optimization method of claim 2, wherein the attack detection parameter is set to 0; setting the bandwidth parameter to 0 so that execution of the bandwidth detection module is skipped in the modified encoding process.
4. The LC3 audio encoder coding optimization method of claim 1, wherein the improved coding further comprises: setting a second audio coding parameter according to the characteristic of the bass enhancement channel, wherein the second audio coding parameter comprises:
an output flag parameter that is an output flag parameter of the long-term post-filter module in the LC3 audio encoder, the output flag parameter being set to 0 such that an execution process of the long-term post-filter module is skipped in the modified encoding process;
a reflection coefficient, which is a reflection coefficient of a time domain noise shaping module in the LC3 audio encoder, the reflection coefficient being set to 0 such that an execution process of the time domain noise shaping module is skipped in the modified encoding process; and
and other parameters including setting parameters of a noise estimation module in the LC3 audio encoder, wherein the setting parameters of the noise estimation module are set to 7, so that the execution process of the noise estimation module is skipped in the modified encoding process.
5. The LC3 audio encoder coding optimization method of claim 1, wherein the reducing the coding rate of the LC3 audio encoder comprises: and setting the adjusted coding rate of the LC3 audio encoder to be not more than half of the original coding rate of the LC3 audio encoder, wherein the original coding rate is the coding rate when the LC3 audio encoder codes the coding audio in the non-bass enhancement channel.
6. The LC3 audio encoder coding optimization method of claim 1, further comprising:
and judging whether the bass enhancement channel exists in the audio channel, when the bass enhancement channel does not exist, carrying out LC3 audio encoder standard encoding on the encoded audio, and when the bass enhancement channel exists, using the LC3 audio encoder encoding optimization method to encode the encoded audio, wherein the LC3 audio encoder standard encoding is the conventional encoding carried out on the encoded audio by the LC3 audio encoder.
7. The LC3 audio encoder coding optimization method of claim 1, wherein the filtering process is filtering out audio signals in the first encoded audio that are greater than a first cut-off frequency.
8. An LC3 audio encoder coding optimization system, comprising:
the filtering module is used for filtering the first coded audio in the bass enhancement channel to obtain second coded audio;
the code rate adjusting module is used for adjusting the coding code rate of the LC3 audio coder; and
an improved encoding module for improved encoding of the second encoded audio according to the adjusted encoding rate, the improved encoding comprising:
setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio;
encoding the second encoded audio in the LC3 audio encoder according to the first audio encoding parameters, wherein the first audio encoding parameters include:
and the attack detection parameter is an attack mark of each frame of encoded audio of an attack detection module in the LC3 audio encoder, and is determined according to the sampling rate of the second encoded audio and the adjusted encoding code rate, so that the execution process of the attack detection module is skipped in the improved encoding process.
9. A computer readable storage medium having stored therein computer executable instructions which when executed by a processor implement the method of any of claims 1-7.
10. A computer device, comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7.
CN202010665658.XA 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium Active CN111768793B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010665658.XA CN111768793B (en) 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010665658.XA CN111768793B (en) 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium

Publications (2)

Publication Number Publication Date
CN111768793A CN111768793A (en) 2020-10-13
CN111768793B true CN111768793B (en) 2023-09-01

Family

ID=72726605

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010665658.XA Active CN111768793B (en) 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium

Country Status (1)

Country Link
CN (1) CN111768793B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112270928A (en) * 2020-10-28 2021-01-26 北京百瑞互联技术有限公司 Method, device and storage medium for reducing code rate of audio encoder
CN112309408A (en) * 2020-11-10 2021-02-02 北京百瑞互联技术有限公司 Method, device and storage medium for expanding LC3 audio encoding and decoding bandwidth
CN112489666A (en) * 2020-11-26 2021-03-12 北京百瑞互联技术有限公司 Bluetooth LE audio propagation data processing method and device and storage medium
CN112435674A (en) * 2020-12-09 2021-03-02 北京百瑞互联技术有限公司 Method, apparatus, medium for optimizing LC3 arithmetic coding search table of spectrum data
CN114582361B (en) * 2022-04-29 2022-07-08 北京百瑞互联技术有限公司 High-resolution audio coding and decoding method and system based on generation countermeasure network

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5737720A (en) * 1993-10-26 1998-04-07 Sony Corporation Low bit rate multichannel audio coding methods and apparatus using non-linear adaptive bit allocation
CN101136200A (en) * 2006-08-30 2008-03-05 财团法人工业技术研究院 Audio signal transform coding method and system thereof
CN101188878A (en) * 2007-12-05 2008-05-28 武汉大学 A space parameter quantification and entropy coding method for 3D audio signals and its system architecture
CN103165135A (en) * 2013-03-04 2013-06-19 深圳广晟信源技术有限公司 Digital audio coarse layering coding method and digital audio coarse layering coding device
CN105280190A (en) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 Bandwidth extension encoding and decoding method and device
CN106373583A (en) * 2016-09-28 2017-02-01 北京大学 Ideal ratio mask (IRM) multi-audio object coding and decoding method
CN106653035A (en) * 2016-12-26 2017-05-10 广州广晟数码技术有限公司 Bit rate allocation method and bit rate allocation device in digital audio coding
CN104170007B (en) * 2012-06-19 2017-09-26 深圳广晟信源技术有限公司 To monophonic or the stereo method encoded
CN108847248A (en) * 2018-06-15 2018-11-20 珠海市杰理科技股份有限公司 Bluetooth equipment audio-frequency processing method, system, readable storage medium storing program for executing and bluetooth equipment
CN109448741A (en) * 2018-11-22 2019-03-08 广州广晟数码技术有限公司 A kind of 3D audio coding, coding/decoding method and device
CN110832581A (en) * 2017-03-31 2020-02-21 弗劳恩霍夫应用研究促进协会 Apparatus for post-processing audio signals using transient position detection

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7272567B2 (en) * 2004-03-25 2007-09-18 Zoran Fejzo Scalable lossless audio codec and authoring tool
US8027479B2 (en) * 2006-06-02 2011-09-27 Coding Technologies Ab Binaural multi-channel decoder in the context of non-energy conserving upmix rules
KR101414736B1 (en) * 2010-03-09 2014-08-06 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Apparatus and method for processing an input audio signal using cascaded filterbanks

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5737720A (en) * 1993-10-26 1998-04-07 Sony Corporation Low bit rate multichannel audio coding methods and apparatus using non-linear adaptive bit allocation
CN101136200A (en) * 2006-08-30 2008-03-05 财团法人工业技术研究院 Audio signal transform coding method and system thereof
CN101188878A (en) * 2007-12-05 2008-05-28 武汉大学 A space parameter quantification and entropy coding method for 3D audio signals and its system architecture
CN104170007B (en) * 2012-06-19 2017-09-26 深圳广晟信源技术有限公司 To monophonic or the stereo method encoded
CN103165135A (en) * 2013-03-04 2013-06-19 深圳广晟信源技术有限公司 Digital audio coarse layering coding method and digital audio coarse layering coding device
CN105280190A (en) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 Bandwidth extension encoding and decoding method and device
CN106373583A (en) * 2016-09-28 2017-02-01 北京大学 Ideal ratio mask (IRM) multi-audio object coding and decoding method
CN106653035A (en) * 2016-12-26 2017-05-10 广州广晟数码技术有限公司 Bit rate allocation method and bit rate allocation device in digital audio coding
CN110832581A (en) * 2017-03-31 2020-02-21 弗劳恩霍夫应用研究促进协会 Apparatus for post-processing audio signals using transient position detection
CN108847248A (en) * 2018-06-15 2018-11-20 珠海市杰理科技股份有限公司 Bluetooth equipment audio-frequency processing method, system, readable storage medium storing program for executing and bluetooth equipment
CN109448741A (en) * 2018-11-22 2019-03-08 广州广晟数码技术有限公司 A kind of 3D audio coding, coding/decoding method and device

Also Published As

Publication number Publication date
CN111768793A (en) 2020-10-13

Similar Documents

Publication Publication Date Title
CN111768793B (en) LC3 audio encoder coding optimization method, system and storage medium
US11533575B2 (en) Loudness adjustment for downmixed audio content
CN111681663B (en) Method, system, storage medium and device for reducing audio coding computation amount
CN110706715B (en) Method and apparatus for encoding and decoding signal
HU219255B (en) Method and apparatus for suppressing noise, speech coder
EP2664161A1 (en) Loudness maximization with constrained loudspeaker excursion
CN101149926B (en) Audio signal interpolation method and device
JP6616470B2 (en) Encoding method, decoding method, encoding device, and decoding device
JP2015522847A (en) Effective pre-echo attenuation in digital audio signals
JP2012524304A (en) Method and apparatus for adjusting channel delay parameters of multi-channel signals
CN111681664A (en) Method, system, storage medium and equipment for reducing audio coding rate
CN102543090B (en) Code rate automatic control system applicable to variable bit rate voice and audio coding
US11527253B2 (en) Stereo encoding method and stereo encoder
JP2015505991A (en) Method and apparatus for audio encoding
KR101907808B1 (en) Method for estimating noise in an audio signal, noise estimator, audio encoder, audio decoder and system for transmitting audio signals
JP2001102930A (en) Method and device for correcting quantization error, and method and device for decoding audio information
EP3133600B1 (en) Codec method, device and system
US20200175999A1 (en) Time-domain stereo encoding and decoding method and related product
US20140214412A1 (en) Apparatus and method for processing voice signal
JP4409733B2 (en) Encoding apparatus, encoding method, and recording medium therefor
CN101211561A (en) Music signal quality enhancement method and device
CN101377926A (en) Audio encoding method capable of quickening quantification circulation program
WO2008072524A1 (en) Audio signal encoding method and decoding method
US20130346072A1 (en) Noise feedback coding for delta modulation and other codecs
KR20240042184A (en) Stereo encoding method and stereo encoder

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address

Address after: A1009, floor 9, block a, No. 9, Shangdi Third Street, Haidian District, Beijing 100085

Patentee after: Beijing Bairui Internet Technology Co.,Ltd.

Address before: 7-1-1, building C, 7 / F, building 2-1, No.2, Shangdi Information Road, Haidian District, Beijing 100085

Patentee before: BARROT WIRELESS Co.,Ltd.

CP03 Change of name, title or address