CN111768793A - LC3 audio encoder coding optimization method, system and storage medium - Google Patents

LC3 audio encoder coding optimization method, system and storage medium Download PDF

Info

Publication number
CN111768793A
CN111768793A CN202010665658.XA CN202010665658A CN111768793A CN 111768793 A CN111768793 A CN 111768793A CN 202010665658 A CN202010665658 A CN 202010665658A CN 111768793 A CN111768793 A CN 111768793A
Authority
CN
China
Prior art keywords
audio
encoding
audio encoder
coding
encoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010665658.XA
Other languages
Chinese (zh)
Other versions
CN111768793B (en
Inventor
李强
王尧
叶东翔
朱勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Barrot Wireless Co Ltd
Original Assignee
Barrot Wireless Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Barrot Wireless Co Ltd filed Critical Barrot Wireless Co Ltd
Priority to CN202010665658.XA priority Critical patent/CN111768793B/en
Publication of CN111768793A publication Critical patent/CN111768793A/en
Application granted granted Critical
Publication of CN111768793B publication Critical patent/CN111768793B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The application discloses a LC3 audio encoder encoding optimization method, a system, a storage medium and equipment, belonging to the technical field of audio encoding. The LC3 audio encoder coding optimization method comprises the following steps: filtering a first coded audio in a bass enhanced sound channel to obtain a second coded audio; performing reduction adjustment on the encoding code rate of the LC3 audio encoder; in the LC3 audio encoder, performing modified encoding on the second encoded audio according to the adjusted encoding rate, the modified encoding including: setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in an LC3 audio encoder according to the first audio encoding parameters. The application of the application can effectively reduce the coding code rate of the LC3 audio encoder, reduce the operation amount in the coding process and further reduce the power consumption of the LC3 audio encoder.

Description

LC3 audio encoder coding optimization method, system and storage medium
Technical Field
The present application relates to the field of audio coding technologies, and in particular, to a coding optimization method, system, storage medium, and device for an LC3 audio coder.
Background
In the prior art, the mainstream bluetooth audio encoder includes: the SBC audio encoder is most widely used according to the mandatory requirements of an A2DP protocol, and is supported by all Bluetooth audio equipment, but the tone quality is general; the AAC-LC audio encoder has good tone quality and wide application range, is supported by a plurality of mainstream mobile phones, but has larger memory occupation and high operation complexity compared with the SBC audio encoder, and a plurality of Bluetooth devices are based on an embedded platform, so that the battery capacity is limited, the operation capability of a processor is poor and the memory is limited; the aptX series audio coder has good sound quality but high code rate, wherein the aptX needs the code rate of 384kbps, and the code rate of the aptX-HD is 576kbps, is a unique technology of high pass, and is relatively closed; LDAC audio frequency encoder, its tone quality is better, but the code rate is also very high, 330kbps, 660kbps and 990kbps respectively, because the wireless environment that bluetooth equipment is located is very complicated, and stable support such high code rate has certain difficulty, and is the unique technique of sony, and is also very closed.
For the above reasons, the Bluetooth international association Bluetooth Sig has introduced the LC3 audio encoder by combining with many manufacturers, which has the advantages of low delay, high sound quality, high coding gain, no special fee in the Bluetooth field, and the like, and is paid attention by the manufacturers. Since the LC3 audio encoder was originally proposed to satisfy the audio application in the bluetooth low energy domain, the power consumption requirement is very strict.
The existing audio coding channels include a common audio channel and a bass enhancement channel, and in the prior art, the audio in the common audio channel and the bass enhancement channel is coded by a standard coding process of full-band coding. However, the frequency band of the encoded audio in the bass enhanced channel is narrow, and if the standard code rate is used as the full-band encoding, the precious bandwidth resource is wasted, and the operation amount of the audio encoder is increased, which causes unnecessary power consumption. In addition, the bandwidth detection module of the LC3 audio encoder can only detect the bandwidth of 4kHz at the lowest, which causes the detected bandwidth of the encoded audio in the bass enhanced channel to be inconsistent with the actual bandwidth, affects the normal operation of other modules in the LC3 audio encoding, and finally causes the spectrum leakage.
Disclosure of Invention
In view of the above technical problems in the prior art, the present application provides an encoding optimization method, system, storage medium and device for an LC3 audio encoder.
In one aspect of the present application, there is provided an encoding optimization method for an LC3 audio encoder, including: filtering a first coded audio in a bass enhanced sound channel to obtain a second coded audio; performing reduction adjustment on the encoding code rate of the LC3 audio encoder; in the LC3 audio encoder, performing modified encoding on the second encoded audio according to the adjusted encoding rate, the modified encoding including: setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in an LC3 audio encoder according to the first audio encoding parameters.
In another aspect of the present application, there is provided an LC3 audio encoder encoding optimization system, including: the filtering module is used for filtering the first coded audio in the bass enhancement sound channel to obtain a second coded audio; the code rate adjusting module is used for adjusting the coding code rate of the LC3 audio coder; and an improved encoding module for performing improved encoding of the second encoded audio according to the adjusted encoding rate, the improved encoding comprising: setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in an LC3 audio encoder according to the first audio encoding parameters.
In another technical solution of the present application, a computer-readable storage medium is provided, where computer-executable instructions are stored in the computer-readable storage medium, and when the computer-executable instructions are executed by a processor, the LC3 audio encoder encoding optimization method in the first technical solution may be implemented.
In another aspect of the present application, there is provided an apparatus, comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the LC3 audio encoder encoding optimization method of scheme one
The beneficial effect of this application is: when the method is applied, the code rate of the audio coding can be effectively reduced, unnecessary coding processes are reduced, the operation amount in the audio coding process is reduced, the coding speed is accelerated, and the power consumption of the LC3 audio coder is reduced.
Drawings
FIG. 1 is a schematic flow chart diagram of an embodiment of an encoding optimization method of an LC3 audio encoder of the present application;
FIG. 2 is a schematic flow chart of a specific example of an improved encoding flow of an LC3 audio encoder in the present application;
FIG. 3 is a schematic flow chart of an embodiment of an encoding optimization method of an LC3 audio encoder of the present application;
FIG. 4 is a schematic diagram of an embodiment of an encoding optimization system of an LC3 audio encoder of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are some embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The terms "first," "second," "third," "fourth," and the like in the description, the claims, the abstract, and the drawings of the application, if any, are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are, for example, capable of operation in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
The LC3 audio encoder is provided by combining a plurality of manufacturers aiming at the problems of tone quality, operation complexity and unique technology of a plurality of audio encoders, namely the Bluetooth International Association Bluetooth Sig, has the advantages of low delay, high tone quality, high coding gain, no special fee in the Bluetooth field and the like, and is concerned by the majority of manufacturers. Since the LC3 audio encoder was originally proposed to satisfy the audio application in the bluetooth low energy domain, the power consumption requirement is very strict.
The existing audio coding channels include a normal audio channel and a bass enhancement channel, and in the prior art, the audio in the normal audio channel and the bass enhancement channel is coded by a standard coding process of full-band coding. However, the frequency band of the encoded audio in the bass enhanced channel is narrow, and if the standard code rate is used as the full-band encoding, the precious bandwidth resource is wasted, and the operation amount of the audio encoder is increased, which causes unnecessary power consumption. In addition, the bandwidth detection module of the LC3 audio encoder can only detect the bandwidth of 4kHz at the lowest, which causes the detected bandwidth of the encoded audio in the bass enhanced channel to be inconsistent with the actual bandwidth, affects the normal operation of other modules in the LC3 audio encoding, and finally causes the spectrum leakage.
In view of the above problems, the present application provides an LC3 audio encoder encoding optimization method, system, storage medium, and apparatus.
Fig. 1 is a schematic flowchart of an embodiment of an encoding optimization method of an LC3 audio encoder according to the present application, in which the encoding optimization method of the present application includes: step S01, filtering the first coding audio in the bass enhanced sound channel to obtain a second coding audio; step S102, the encoding code rate of the LC3 audio encoder is reduced and adjusted; and step S103, in the LC3 audio encoder, performing modified encoding on the second encoded audio according to the adjusted encoding rate, wherein the modified encoding comprises: setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in an LC3 audio encoder according to the first audio encoding parameters.
The encoding optimization method of the LC3 audio encoder of the present application is described below with reference to fig. 1 and a specific embodiment.
In the embodiment shown in fig. 1, the LC3 audio encoder encoding optimization method of the present application includes step S101: and filtering the first coded audio in the bass enhanced sound channel to obtain a second coded audio. Through filtering processing, impurity signals in the first coded audio are filtered, so that when the coded audio is coded, the coding accuracy is improved, and the power consumption of the LC3 audio coder is reduced.
In a specific embodiment of the present application, the audio signal of the first encoded audio greater than the first cut-off frequency is filtered by low-pass filtering. In this specific embodiment, by setting the first cut-off frequency, the first encoded audio in the bass-enhanced channel is subjected to low-pass filtering processing using a low-pass filter, resulting in the second encoded audio. In the step, the deviation audio signal or the impurity signal in the first coded audio is filtered through low-pass filtering processing, so that the coding accuracy is improved.
In one example of the present application, the expression channels is typically 6 channels, 5.1 channels, etc. Wherein the ". 1" channel of the 5.1 channels represents a bass enhancement channel. According to different audio standard specifications, the frequency band range of a bass enhanced sound channel is different, wherein according to the Dolby AC-3 standard, the frequency band range of the bass enhanced sound channel is 20Hz-120 Hz; according to the DTS standard, the frequency band range of the bass enhanced sound channel is 20Hz-80 Hz. The first encoded audio is low-pass filtered by selecting an appropriate cut-off frequency according to different bass-enhancement channel audio standards.
In this example, when the Dolby AC-3 standard is used, a cutoff frequency of 120Hz may be selected; when DTS criteria are chosen, a cutoff frequency of 80Hz may be chosen. The specific value of the cut-off frequency may be determined according to the relevant parameters of the actually encoded audio or other encoding requirements, and the specific selection of the cut-off frequency is not specifically limited in this application.
Through carrying out filtering processing to first code audio frequency, carry out the filtering with deviation audio signal or impurity signal in the first code audio frequency, improve the accuracy of coding, reduce the audio signal quantity in the first code audio frequency simultaneously, and then reduce LC3 audio encoder's power consumption.
In the embodiment shown in fig. 1, the LC3 audio encoder encoding optimization method of the present application includes step S102: and performing reduction adjustment on the encoding code rate of the LC3 audio encoder. In the bass enhancement channel, the frequency band of the audio coding signal is narrow, and if coding is performed according to the full-band standard coding flow of the LC3 audio coder, coding resources are wasted, power consumption of the LC3 audio coder is increased, and adverse effects such as overflow are generated on the coding effect of the coded audio. Therefore, when encoding the encoded audio in the bass enhanced channel, the encoding rate needs to be adjusted according to the characteristics of the frequency band and the like of the encoded audio, so that the encoding resources of the LC3 audio encoder are saved, and the power consumption of the LC3 audio encoder is reduced.
In a specific embodiment of the present application, the adjusted encoding rate of the LC3 audio encoder is set to be not greater than half of the original encoding rate of the LC3 audio encoder, where the original encoding rate is the encoding rate of the LC3 audio encoder when encoding the encoded audio in the non-bass enhanced channel. When the encoded audio in the bass enhancement channel is encoded, the encoding rate of the LC3 audio encoder is adjusted to be half or less than the original encoding rate of the LC3 audio encoder, and the power consumption of the LC3 audio encoder is reduced by reducing the encoding rate.
In this particular embodiment, the second encoded audio in the bass-enhanced channel is significantly different from the encoded audio in the other non-bass-enhanced channels. Wherein the second encoded audio in the bass enhancement channel is narrower in frequency band. For example, under the DTS standard, the frequency band of the encoded audio in the bass enhancement channel ranges from 20Hz-80Hz, which is much smaller than the frequency bands of the encoded audio in the other audio channels. Therefore, when the LC3 audio encoder encodes the encoded audio in the bass enhanced channel, the encoding rate can be adjusted to be lower. Preferably, the encoding rate of the LC3 audio encoder may be adjusted to be half or less than half of the original encoding rate, for example, when the original encoding rate is 125kbps, the encoding rate may be adjusted to be 62.5kpbs or less. The specific value of the code rate adjustment can be determined according to the encoding requirements of the audio encoder or the relevant characteristics of the encoded audio. .
In the embodiment shown in fig. 1, the LC3 audio encoder encoding optimization method of the present application includes step S103: in the LC3 audio encoder, performing modified encoding on the second encoded audio according to the adjusted encoding rate, the modified encoding including: setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in an LC3 audio encoder according to the first audio encoding parameters.
In this embodiment, when encoding the encoded audio in the bass-enhanced channel, after step S102, the encoding rate of the LC3 audio encoder is adjusted to be lower. Preferably, the adjusted coding rate is set to be half or less than half of the original coding rate of the LC 3. The code rate is adjusted according to the following steps: in the encoding process of the LC3 audio encoder, the minimum bandwidth configuration is 4kHz, and the recommended code rate is 24 kbps-28 kbps when encoding is carried out. The bandwidth of the bass enhanced channel is about 80-120 Hz, in order to ensure the sound quality of the bass enhanced channel and save the code rate, the code rate of the bass enhanced channel is recommended to be 24-64 kbps, and when the audio in other non-bass enhanced channels is coded, the coding code rate is about 125 kbps. Therefore, when the coded audio in the bass enhanced channel is coded, the adjusted coding rate is set to be half or less than half of the original coding rate of the LC3, and the configuration of the coding rate can reduce the quantization noise to the maximum extent.
The second encoded audio is then modified encoded in an LC3 audio encoder. The encoded audio in its bass-enhanced channel is significantly different from the encoded audio in other non-bass-enhanced channels, e.g., the encoded audio in the bass-enhanced channel is narrower in frequency band. Setting a first audio coding parameter in the LC3 audio encoder according to the reduction of the adjusted coding rate of the LC3 audio encoder and/or the characteristic of the coded audio in the bass enhancement channel, then adjusting the coding state of a coding module in the LC3 audio encoder according to the set first audio coding parameter, and further coding a second coded audio in the bass enhancement channel.
In a specific embodiment of the present application, the first audio coding parameter comprises: the attack detection parameter is an attack mark of each frame of coded audio, and is set to be 0 according to the sampling rate of the second coded audio and the adjusted coding rate, so that the execution process of an attack detection module is skipped in the improved coding process; a bandwidth parameter representing a bandwidth characteristic of each frame of the encoded audio, the bandwidth parameter being set to 0 according to a frequency band characteristic of the second encoded audio such that an implementation of the bandwidth detection module is skipped in the modified encoding process.
In this particular embodiment, the LC3 audio encoder includes an attack detection module, where there are two conditions for attack detection in the attack detection module. Wherein the first condition is when the sampling rate of the audio is encoded
Figure 759056DEST_PATH_IMAGE001
When the number of the bytes is larger than 80, the LC3 audio encoder can use the total number of bytes when encoding a frame; the second condition is when the sampling rate of the encoded audio is
Figure 535513DEST_PATH_IMAGE002
The total number of bytes that can be used by the corresponding LC3 audio encoder to encode a frame is greater than or equal to 100. On the other hand, when the LC3 audio encoder encodes the encoded audio of the bass-enhanced channel, step S102 is performed firstThe encoding rate of the LC3 audio encoder is adjusted to be reduced, so that the attack detection process of the attack detection module is not needed when the encoded audio in the bass enhanced channel is encoded. For example, when the encoded audio is 10ms frames long, if the sampling rate is 48KHz, the total number of bytes that can be used when the encoded audio is subjected to attack detection requires a frame encoding consistent with the LC3 audio encoder is greater than or equal to 100. The code rate condition is 80kbps, wherein the total number of bytes used for encoding a frame is 100. Since the bitrate of the LC3 audio encoder has already decreased to below 80kpbs after the bitrate reduction adjustment through step S102, the attack detection condition in the attack detection module is no longer met, and therefore, the attack detection parameter, i.e., the attack flag of each frame, is set
Figure 802546DEST_PATH_IMAGE003
And setting the value to be 0, wherein k represents the frame number, so that in the improved encoding process of the LC3 audio encoder, the attack detection process of an attack detection module is skipped, the operation amount of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is reduced.
In this particular embodiment, the LC3 audio encoder includes a bandwidth detection module. The bandwidth detection module is used for detecting the bandwidth of the coded audio, and the minimum value of the detected bandwidth is 4 KHz. When the LC3 audio encoder encodes the encoded audio in the bass enhancement channel, the encoded audio in the bass enhancement channel has a narrow frequency band range. For example, under the DolbyAC-3 standard, the frequency band range of the encoded audio in the bass enhancement channel is 20Hz-120 Hz; under the DTS standard, the frequency band of the encoded audio in the bass enhancement channel ranges from 20Hz to 80 Hz. Therefore, the frequency band range of the coded audio in the bass enhanced channel is far less than the minimum value 4KHz of the bandwidth which can be detected by the bandwidth detection module, so that in the improved coding process of the LC3 audio coder, the bandwidth parameter is set to be 0, so that the bandwidth detection process of the bandwidth detection module is skipped, the operation amount of the LC3 audio coder is reduced, and the power consumption of the LC3 audio coder is reduced.
When the LC3 audio encoder performs improved encoding on the encoded audio in the bass enhancement channel, corresponding first audio encoding parameters are set according to the frequency band characteristics of the encoded audio in the bass enhancement channel and/or the LC3 audio encoder reduces the adjusted encoding rate, so that the LC3 audio encoding process is adjusted, unnecessary modules or processes are skipped, the operation amount of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is reduced.
In a specific embodiment of the present application, the improved encoding of the LC3 audio encoder further comprises: the second audio coding parameters are set according to characteristics of the bass enhancement channel. Wherein the second audio coding parameters include: an output flag parameter, which is an output flag parameter of a long-term postfilter module in an LC3 audio encoder, setting the output flag parameter to 0 so that an execution process of the long-term postfilter module is skipped in the modified encoding process; a reflection coefficient, which is a reflection coefficient of a temporal noise shaping module in the LC3 audio encoder, is set to 0 so that an execution process of the temporal noise shaping module is skipped in the modified encoding process; and other parameters including setting parameters of a noise estimation module in the LC3 audio encoder, wherein the setting parameters of the noise estimation module are set to 7, so that the execution process of the noise estimation module is skipped in the modified encoding process.
In this specific embodiment, the bass enhancement channel performs multi-channel optimization enhancement processing on the audio under the action of the entire audio channel, so that a user experiences an "immersive" sound effect, therefore, when encoding the encoded audio in the bass enhancement channel, the output flag parameter of the long-term post-filter module in the LC3 audio encoder may be set to 0, and the encoded audio in the bass enhancement channel is not subjected to the corresponding workflow of the resampling module and the long-term post-filter module; the reflection coefficient of the time domain noise shaping module in the LC3 audio encoder is set to be 0, so that the corresponding work flow of the time domain noise shaping module is not carried out on the coded audio in the bass enhanced sound channel any more, and the frequency spectrum leakage generated when the coded audio in the bass enhanced sound channel is coded is avoided, and the adverse effect on the final coding result is generated. In addition, the second audio coding parameters also comprise other parameters. Wherein the other parameters include noise module parameters of a noise estimation module in the LC3 audio encoder, noise filling when the LC3 audio encoder encodes the encoded audio of the bass-enhanced channel may be avoided by setting the noise module parameters of the noise estimation module to 7.
When the LC3 audio encoder carries out improved encoding on the encoded audio in the bass enhanced channel, the second audio encoding parameters are set, the LC3 audio encoding process is adjusted, unnecessary modules or processes are skipped, the operation amount of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is reduced.
In a specific embodiment of the present application, the audio coding optimization method further includes, before step S101, step S102, and step S103: and judging whether a bass enhanced channel exists in the audio channels, when the bass enhanced channel does not exist, performing LC3 audio encoder standard encoding on the encoded audio, and when the bass enhanced channel exists, encoding the encoded audio by using an LC3 audio encoder encoding optimization method, wherein the LC3 audio encoder standard encoding is the conventional encoding of the encoded audio by an LC3 audio encoder. Wherein the bass enhanced channel includes, but is not limited to, 5.1 channel or 2.1 channel, etc., and the related method of the present application can be performed as long as it belongs to the bass enhanced channel.
Fig. 2 is a schematic flow diagram of a specific example of improved encoding of an LC3 audio encoder in the present application. As shown in fig. 2, in the LC3 audio encoder, the modified encoding process consists of a low-delay modified discrete cosine transform step, a transform domain noise shaping step, a quantization step, an arithmetic coding and residual coding step, and a code stream encapsulation step. Compared with the standard encoding flow of the LC3 audio encoder in fig. 2, the LC3 audio encoder modified encoding flow omits the work flows of attach detection, bandwidth detection, long-term post-filter module, resampling module, noise estimation module, and time-domain noise shaping module. By skipping unnecessary modules or processes, the operation amount of the LC3 audio encoder is reduced, and the power consumption of the LC3 audio encoder is reduced.
When the LC3 audio encoder performs improved encoding on the encoded audio in the bass enhanced channel, unnecessary encoding processes are omitted, so that the power consumption of the LC3 audio encoder in the LC3 audio encoding process is reduced, and the service life of the LC3 audio encoder is prolonged. The coding rate of the coded audio in the bass enhancement sound channel is updated, so that the audio is coded by using a proper code rate, the waste of the code rate in the coding process is avoided, in addition, the coded audio is coded by an improved coding process, the unnecessary coding process in an LC3 audio coder is reduced, the operation amount is reduced, and the power consumption of the audio coder is reduced.
Fig. 3 shows a specific example of the encoding optimization method of the LC3 audio encoder of the present application. As shown in fig. 3, the LC3 audio encoder encoding optimization method of the present application, when executed, first determines whether a bass enhancement channel exists, and when the bass enhancement channel does not exist, performs encoding using the encoding flow of the standard LC3 audio encoder, where a specific encoding flow is the standard encoding flow of the LC3 audio encoder shown in the left drawing in fig. 2. When a bass-enhanced channel is present, the audio signal to be encoded is first subjected to a low-pass filtering process. Secondly, the coding rate is adjusted, because of the particularity of the bass enhanced channel, when the audio in the bass enhanced channel is coded, the coding rate can be adjusted to be half or less than half of the original coding rate of the LC3 audio coder. By the adjustment of the reduction of the code rate, the encoding power consumption of the LC3 audio encoder is reduced, and the service time of the LC3 audio encoder is prolonged. And finally, carrying out the encoding process of the improved LC3 audio encoder on the encoded audio in the bass enhanced channel by using the adjusted encoding rate. The encoding flow of the specific modified LC3 audio encoder is shown in the right-hand drawing of fig. 2.
By using the LC3 audio encoder encoding optimization method, the encoded audio in the bass enhanced channel is encoded by adopting a proper encoding code rate, so that unnecessary code rate waste is avoided, and the power consumption of the LC3 audio encoder is reduced; the encoding process of the improved LC3 audio encoder is used for encoding the encoding audio in the bass enhanced channel, so that unnecessary encoding processes are reduced, the encoding load of the LC3 audio encoder is reduced, the power consumption of the LC3 audio encoder is reduced, and the service time of the LC3 audio encoder is prolonged.
Fig. 4 shows a specific embodiment of the LC3 audio encoder encoding optimization system of the present application. As shown in fig. 4, the LC3 audio encoder encoding optimization system includes:
the filtering module is used for filtering the first coded audio in the bass enhancement sound channel to obtain a second coded audio; the code rate adjusting module is used for adjusting the coding code rate of the LC3 audio coder; and an improved encoding module for performing improved encoding of the second encoded audio according to the adjusted encoding rate, the improved encoding comprising: setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio; the second encoded audio is encoded in an LC3 audio encoder according to the first audio encoding parameters.
In a specific embodiment of the present application, the LC3 audio encoder encoding optimization system of the present application further includes a channel detection module that determines whether a bass enhancement channel is present in the audio channel, performs LC3 audio encoder standard encoding on the encoded audio when the bass enhancement channel is not present, and encodes the encoded audio using an LC3 audio encoder encoding optimization method when the bass enhancement channel is present, the LC3 audio encoder standard encoding being the conventional encoding of the encoded audio by an LC3 audio encoder.
In a specific embodiment of the present application, in the filtering module, the audio signals of the first encoded audio that are greater than the first cut-off frequency are filtered. Through the filtering module, impurity signals in the first coded audio are filtered, so that when the first coded audio is coded, the coding accuracy is improved, and the power consumption of the LC3 audio coder is reduced.
In a specific embodiment of the present application, in the rate adjustment module, the number of encoding bytes of the LC3 audio encoder is adjusted according to the frequency band characteristic of the second encoded audio, and the encoding rate of the LC3 audio encoder is updated according to the adjusted number of encoding bytes.
By updating the coding rate, the coding audio in the bass enhanced sound channel is coded by using a proper code rate, so that the waste of the code rate in the coding process of an LC3 audio coder is avoided; in addition, the encoded audio is encoded through the improved encoding process of the LC3 audio encoder, so that the unnecessary encoding process in the audio encoder is reduced, the operation amount of the audio encoder is reduced, and the power consumption is reduced.
In a specific embodiment of the present application, the embodiment further provides a computer-readable storage medium, in which computer-executable instructions are stored, and when the at least one processor of the user equipment executes the computer-executable instructions, the user equipment executes the above-mentioned various possible methods.
Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer. An exemplary storage medium is coupled to the processor such the processor can read information from, and write information to, the storage medium. Of course, the storage medium may also be integral to the processor. The processor and the storage medium may reside in an ASIC. Additionally, the ASIC may reside in user equipment. Of course, the processor and the storage medium may reside as discrete components in a communication device.
Those skilled in the art will understand that: all or a portion of the steps of implementing the above-described method embodiments may be performed by hardware associated with program instructions. The program may be stored in a computer-readable storage medium. When executed, the program performs steps comprising the method embodiments described above; and the aforementioned storage medium includes: various media that can store program codes, such as ROM, RAM, magnetic or optical disks.
In a particular embodiment of the present application, this embodiment also provides an apparatus comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the various possible methods described above.
The above description is only an example of the present application and is not intended to limit the scope of the present application, and all equivalent structural changes made by using the contents of the specification and the drawings, which are directly or indirectly applied to other related technical fields, are included in the scope of the present application.

Claims (10)

1. An LC3 audio encoder encoding optimization method, comprising:
filtering a first coded audio in a bass enhanced sound channel to obtain a second coded audio;
performing reduction adjustment on the encoding code rate of the LC3 audio encoder;
in the LC3 audio encoder, performing modified encoding on the second encoded audio according to the adjusted encoding rate, where the modified encoding includes:
setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio;
encoding the second encoded audio in the LC3 audio encoder according to the first audio encoding parameters.
2. The LC3 audio encoder encoding optimization method of claim 1, wherein the first audio encoding parameters include:
an attack detection parameter, which is an attack flag of each frame of encoded audio of an attack detection module in the LC3 audio encoder and is determined jointly according to the sampling rate of the second encoded audio and the adjusted encoding rate; and
a bandwidth parameter of each frame of encoded audio of a bandwidth detection module in the LC3 audio encoder, determined from a frequency band characteristic of the second encoded audio.
3. The LC3 audio encoder encoding optimization method of claim 1, wherein the attack detection parameter is set to 0 such that the execution of the attack detection module is skipped during the modified encoding process; setting the bandwidth parameter to 0 such that an execution process of the bandwidth detection module is skipped in the modified encoding process.
4. The LC3 audio encoder encoding optimization method of claim 1, wherein the modified encoding further comprises: setting a second audio coding parameter according to a characteristic of the bass enhancement channel, the second audio coding parameter comprising:
an output flag parameter, which is an output flag parameter of a long term post-filter module in the LC3 audio encoder, and sets the output flag parameter to 0, so that the execution process of the long term post-filter module is skipped in the improved encoding process;
a reflection coefficient, which is a reflection coefficient of a temporal noise shaping module in the LC3 audio encoder, the reflection coefficient being set to 0 such that an execution process of the temporal noise shaping module is skipped in the modified encoding process; and
other parameters including a setting parameter of a noise estimation module in the LC3 audio encoder, wherein the setting parameter of the noise estimation module is set to 7 such that the execution of the noise estimation module is skipped in the modified encoding process.
5. The LC3 audio encoder encoding optimization method of claim 1, wherein the downward adjustment of the encoding rate of the LC3 audio encoder comprises: setting the adjusted encoding rate of the LC3 audio encoder to be not greater than half of an original encoding rate of the LC3 audio encoder, the original encoding rate being an encoding rate when the LC3 audio encoder encodes encoded audio in a non-bass enhanced channel.
6. The LC3 audio encoder encoding optimization method of claim 1, further comprising:
determining whether the bass enhancement channel is present in an audio channel, when the bass enhancement channel is not present, encoding the encoded audio with the LC3 audio encoder standard, when the bass enhancement channel is present, encoding the encoded audio with the LC3 audio encoder encoding optimization method, wherein the LC3 audio encoder standard encodes the encoded audio with a conventional encoding performed by the LC3 audio encoder.
7. The LC3 audio encoder encoding optimization method of claim 1, wherein the filtering process is to filter out audio signals in the first encoded audio that are greater than a first cut-off frequency.
8. An LC3 audio encoder encoding optimization system, comprising:
the filtering module is used for filtering the first coded audio in the bass enhancement sound channel to obtain a second coded audio;
the code rate adjusting module is used for adjusting the coding code rate of the LC3 audio coder; and
a modified coding module for modified coding the second coded audio according to the adjusted coding rate, wherein the modified coding comprises:
setting a first audio coding parameter according to the adjusted coding rate and/or the frequency band characteristic of the second coded audio;
encoding the second encoded audio in the LC3 audio encoder according to the first audio encoding parameters.
9. A computer-readable storage medium having computer-executable instructions stored thereon which, when executed by a processor, implement the method of any one of claims 1-7.
10. An apparatus, comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7.
CN202010665658.XA 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium Active CN111768793B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010665658.XA CN111768793B (en) 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010665658.XA CN111768793B (en) 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium

Publications (2)

Publication Number Publication Date
CN111768793A true CN111768793A (en) 2020-10-13
CN111768793B CN111768793B (en) 2023-09-01

Family

ID=72726605

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010665658.XA Active CN111768793B (en) 2020-07-11 2020-07-11 LC3 audio encoder coding optimization method, system and storage medium

Country Status (1)

Country Link
CN (1) CN111768793B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112270928A (en) * 2020-10-28 2021-01-26 北京百瑞互联技术有限公司 Method, device and storage medium for reducing code rate of audio encoder
CN112309408A (en) * 2020-11-10 2021-02-02 北京百瑞互联技术有限公司 Method, device and storage medium for expanding LC3 audio encoding and decoding bandwidth
CN112435674A (en) * 2020-12-09 2021-03-02 北京百瑞互联技术有限公司 Method, apparatus, medium for optimizing LC3 arithmetic coding search table of spectrum data
CN112489666A (en) * 2020-11-26 2021-03-12 北京百瑞互联技术有限公司 Bluetooth LE audio propagation data processing method and device and storage medium
CN112599138A (en) * 2020-12-08 2021-04-02 北京百瑞互联技术有限公司 Multi-PCM signal coding method, device and medium of LC3 audio coder
CN114582361A (en) * 2022-04-29 2022-06-03 北京百瑞互联技术有限公司 High-resolution audio coding and decoding method and system based on generation countermeasure network

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5737720A (en) * 1993-10-26 1998-04-07 Sony Corporation Low bit rate multichannel audio coding methods and apparatus using non-linear adaptive bit allocation
US20050216262A1 (en) * 2004-03-25 2005-09-29 Digital Theater Systems, Inc. Lossless multi-channel audio codec
US20070280485A1 (en) * 2006-06-02 2007-12-06 Lars Villemoes Binaural multi-channel decoder in the context of non-energy conserving upmix rules
CN101188878A (en) * 2007-12-05 2008-05-28 武汉大学 A space parameter quantification and entropy coding method for 3D audio signals and its system architecture
US20130090933A1 (en) * 2010-03-09 2013-04-11 Lars Villemoes Apparatus and method for processing an input audio signal using cascaded filterbanks
CN103165135A (en) * 2013-03-04 2013-06-19 深圳广晟信源技术有限公司 Digital audio coarse layering coding method and digital audio coarse layering coding device
CN105280190A (en) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 Bandwidth extension encoding and decoding method and device
CN106373583A (en) * 2016-09-28 2017-02-01 北京大学 Ideal ratio mask (IRM) multi-audio object coding and decoding method
CN106653035A (en) * 2016-12-26 2017-05-10 广州广晟数码技术有限公司 Bit rate allocation method and bit rate allocation device in digital audio coding
CN104170007B (en) * 2012-06-19 2017-09-26 深圳广晟信源技术有限公司 To monophonic or the stereo method encoded
CN108847248A (en) * 2018-06-15 2018-11-20 珠海市杰理科技股份有限公司 Bluetooth equipment audio-frequency processing method, system, readable storage medium storing program for executing and bluetooth equipment
CN109448741A (en) * 2018-11-22 2019-03-08 广州广晟数码技术有限公司 A kind of 3D audio coding, coding/decoding method and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101136200B (en) * 2006-08-30 2011-04-20 财团法人工业技术研究院 Audio signal transform coding method and system thereof
EP3382700A1 (en) * 2017-03-31 2018-10-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for post-processing an audio signal using a transient location detection

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5737720A (en) * 1993-10-26 1998-04-07 Sony Corporation Low bit rate multichannel audio coding methods and apparatus using non-linear adaptive bit allocation
US20050216262A1 (en) * 2004-03-25 2005-09-29 Digital Theater Systems, Inc. Lossless multi-channel audio codec
US20070280485A1 (en) * 2006-06-02 2007-12-06 Lars Villemoes Binaural multi-channel decoder in the context of non-energy conserving upmix rules
CN101188878A (en) * 2007-12-05 2008-05-28 武汉大学 A space parameter quantification and entropy coding method for 3D audio signals and its system architecture
US20130090933A1 (en) * 2010-03-09 2013-04-11 Lars Villemoes Apparatus and method for processing an input audio signal using cascaded filterbanks
CN104170007B (en) * 2012-06-19 2017-09-26 深圳广晟信源技术有限公司 To monophonic or the stereo method encoded
CN103165135A (en) * 2013-03-04 2013-06-19 深圳广晟信源技术有限公司 Digital audio coarse layering coding method and digital audio coarse layering coding device
CN105280190A (en) * 2015-09-16 2016-01-27 深圳广晟信源技术有限公司 Bandwidth extension encoding and decoding method and device
CN106373583A (en) * 2016-09-28 2017-02-01 北京大学 Ideal ratio mask (IRM) multi-audio object coding and decoding method
CN106653035A (en) * 2016-12-26 2017-05-10 广州广晟数码技术有限公司 Bit rate allocation method and bit rate allocation device in digital audio coding
CN108847248A (en) * 2018-06-15 2018-11-20 珠海市杰理科技股份有限公司 Bluetooth equipment audio-frequency processing method, system, readable storage medium storing program for executing and bluetooth equipment
CN109448741A (en) * 2018-11-22 2019-03-08 广州广晟数码技术有限公司 A kind of 3D audio coding, coding/decoding method and device

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112270928A (en) * 2020-10-28 2021-01-26 北京百瑞互联技术有限公司 Method, device and storage medium for reducing code rate of audio encoder
CN112309408A (en) * 2020-11-10 2021-02-02 北京百瑞互联技术有限公司 Method, device and storage medium for expanding LC3 audio encoding and decoding bandwidth
CN112489666A (en) * 2020-11-26 2021-03-12 北京百瑞互联技术有限公司 Bluetooth LE audio propagation data processing method and device and storage medium
CN112489666B (en) * 2020-11-26 2024-05-28 北京百瑞互联技术股份有限公司 Bluetooth LE audio propagation data processing method, device and storage medium
CN112599138A (en) * 2020-12-08 2021-04-02 北京百瑞互联技术有限公司 Multi-PCM signal coding method, device and medium of LC3 audio coder
CN112599138B (en) * 2020-12-08 2024-05-24 北京百瑞互联技术股份有限公司 Multi-PCM signal coding method, device and medium of LC3 audio coder
CN112435674A (en) * 2020-12-09 2021-03-02 北京百瑞互联技术有限公司 Method, apparatus, medium for optimizing LC3 arithmetic coding search table of spectrum data
CN114582361A (en) * 2022-04-29 2022-06-03 北京百瑞互联技术有限公司 High-resolution audio coding and decoding method and system based on generation countermeasure network
CN114582361B (en) * 2022-04-29 2022-07-08 北京百瑞互联技术有限公司 High-resolution audio coding and decoding method and system based on generation countermeasure network

Also Published As

Publication number Publication date
CN111768793B (en) 2023-09-01

Similar Documents

Publication Publication Date Title
CN111768793B (en) LC3 audio encoder coding optimization method, system and storage medium
KR102060208B1 (en) Adaptive voice intelligibility processor
KR101942913B1 (en) Metadata driven dynamic range control
EP3175454B1 (en) Apparatus and method for processing an audio signal using a harmonic post-filter
JP4275855B2 (en) Decoding method and system with adaptive postfilter
CN111681663B (en) Method, system, storage medium and device for reducing audio coding computation amount
JP5312680B2 (en) Method and apparatus for adjusting channel delay parameters of multi-channel signals
JP6271531B2 (en) Effective pre-echo attenuation in digital audio signals
JP6616470B2 (en) Encoding method, decoding method, encoding device, and decoding device
CN111681664A (en) Method, system, storage medium and equipment for reducing audio coding rate
CN111429925A (en) Method and system for reducing audio coding rate
CN104956437A (en) Systems and methods of performing gain control
JP2015505991A (en) Method and apparatus for audio encoding
CN112669860B (en) Method and device for increasing effective bandwidth of LC3 audio coding and decoding
KR102251639B1 (en) Stereo encoding method and stereo encoder
CN112002338A (en) Method and system for optimizing audio coding quantization times
WO2008072524A1 (en) Audio signal encoding method and decoding method
JPH0637725A (en) Radio communication equipment
JP3519764B2 (en) Speech coding communication system and its device
CN111862996A (en) Method, system and storage medium for balancing load of audio codec
EP2814029B1 (en) Audio signal bandwidth to codec bandwidth analysis and response
CN114999503A (en) Full-bandwidth spectral coefficient generation method and system based on generation countermeasure network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: A1009, floor 9, block a, No. 9, Shangdi Third Street, Haidian District, Beijing 100085

Patentee after: Beijing Bairui Internet Technology Co.,Ltd.

Address before: 7-1-1, building C, 7 / F, building 2-1, No.2, Shangdi Information Road, Haidian District, Beijing 100085

Patentee before: BARROT WIRELESS Co.,Ltd.