CN114724567A - Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data - Google Patents
Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data Download PDFInfo
- Publication number
- CN114724567A CN114724567A CN202011500408.7A CN202011500408A CN114724567A CN 114724567 A CN114724567 A CN 114724567A CN 202011500408 A CN202011500408 A CN 202011500408A CN 114724567 A CN114724567 A CN 114724567A
- Authority
- CN
- China
- Prior art keywords
- data
- frame
- audio
- voice
- original
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000006835 compression Effects 0.000 title claims abstract description 155
- 238000007906 compression Methods 0.000 title claims abstract description 155
- 238000000034 method Methods 0.000 title claims abstract description 54
- 230000005236 sound signal Effects 0.000 claims description 7
- 230000006837 decompression Effects 0.000 description 17
- 238000010586 diagram Methods 0.000 description 9
- 230000005540 biological transmission Effects 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0017—Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The application discloses a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data, which comprises the following steps: calculating a predetermined maximum energy value that each frame of the plurality of frames can be compressed losslessly according to the set bandwidth; reading original voice and information data of each frame; calculating the energy value of the original voice and information data of each frame; and judging whether the energy value of the original audio-visual data of each frame is larger than a preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame, and if not, performing lossless compression on the original audio-visual data of the frame.
Description
Technical Field
The present application relates to a compression method of audio-visual data stream, and more particularly, to a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data.
Background
The digital audio data is usually encoded in a frame (frame) and then transmitted to a receiving end via a wired or wireless network for decoding and playing. The encoding method is classified into a lossy compression (lossy compression) and a lossless compression (lossless compression). Lossless compression can obtain audio and video data that is completely consistent with that before compression after decompression, but lossless compression is generally poor in compression efficiency compared with lossy compression, which removes sound nuances that are generally not discernible by human ears. General users are interested in convenient and stable use, and music enthusiasts who pursue high tone quality want the degree of distortion of the tone signal to be as small as possible, and the tone signal is preferably reproduced as the original tone.
To solve the above problems, MPEG-4SLS proposes a method of mixing lossy and lossless compression to satisfy different requirements of two users, and adds a difference part with the original Audio and video data on a lossy-compressed base, and only part or all of the lossy-compressed data can be transmitted according to the requirements during playing, such as the base-added difference part of Advanced Audio Coding (AAC) lossy compression, to complete lossless compression. However, MPEG-4SLS has a large increase in bandwidth 128Kbps compared to AAC, and is no longer a fixed bandwidth, but rather increases complexity and instability of network transmission and processing.
Disclosure of Invention
In view of the above, the present application provides a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data, which includes the following steps: calculating a predetermined maximum energy value that each frame of the plurality of frames can be compressed losslessly according to the set bandwidth; reading original voice and information data of each frame; calculating the energy value of the original voice and information data of each frame; and judging whether the energy value of the original audio-visual data of each frame is larger than a preset maximum energy value or not, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, performing lossless compression on the original audio-visual data of the frame to generate audio-visual lossless compressed data.
According to an example embodiment, the method for dynamically switching lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: calculating a square value of the amplitude of each waveform in a plurality of waveforms of the audio signal of the original audio data of each frame; summing a plurality of squared values of the plurality of waveforms, respectively, to generate a total energy value; and dividing the total energy value by the number of the plurality of waveforms to obtain an average energy value, taking a logarithm value of the average energy value to calculate the energy of the original voice information of each frame, which is expressed by the following equation: e ═ log ((x)1 2+x2 2+…+xn 2) N) where E represents the energy of the original audio signal data for each frame, x1To xnRespectively representThe amplitudes of the plurality of waveforms, respectively, n represents the number of the plurality of waveforms.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: judging whether the energy value of the original audio-visual data of each frame is larger than a preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, executing the next step; judging whether the previous frame of the current frame is lossless compression or not, if so, performing lossless compression on the original voice and information data of the current frame to generate voice and information lossless compression data, not executing subsequent steps, and if not, sequentially executing the subsequent steps; applying the window function (window function) of Modified Discrete Cosine Transform (MDCT) to the original voice data of the previous frame, and calculating function voice data according to Time-Domain Aliasing Cancellation (TDAC); and lossless compressing the function voice and information data and the original voice and information data of the current frame to generate voice and information lossless compression data.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: calculating a preset maximum length allowed after the original audio and video data of each frame is compressed according to the set bandwidth; and judging whether the data length of the audio-visual lossless compression data of each frame is greater than the preset maximum length, if not, retaining the audio-visual lossless compression data, and if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compression data to replace the audio-visual lossless compression data with the data length greater than the preset maximum length.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: carrying out lossy compression on the original audio-visual data of the previous frame; determining to perform lossy compression on the original audio and video data of the current frame; performing improved discrete cosine transform after applying a window function to the original voice and message data of the previous frame and the original voice and message data of the current frame, and calculating function voice and message data; and lossy compressing the function voice and information data to generate voice and information lossy compressed data.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: carrying out lossy compression on the original audio-visual data of the previous frame; determining to perform lossless compression on the original voice and information data of the current frame; applying an improved discrete cosine transform window function to the original voice and information data of the previous frame, and calculating function voice and information data according to a time domain aliasing cancellation method; and lossless compressing the function voice and information data together with the complete original voice and information data of the current frame to generate voice and information lossless compression data.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: lossless compression is carried out on the original voice and information data of the previous frame to generate voice and information lossless compression data of the previous frame; and determining to perform lossless compression on the original voice and information data of the current frame so as to generate voice and information lossless compression data of the current frame.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: lossless compression is carried out on the original voice and information data of the previous frame to generate voice and information lossless compression data; determining the original audio-visual data of the previous frame and the original audio-visual data of the current frame, applying a window function and then performing improved discrete cosine transform to calculate a function audio-visual data; and performing lossy compression on the function voice and information data.
Drawings
Fig. 1 is a flowchart of the compression steps of the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 2 is a flowchart of the decompression steps of the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 3 is a schematic diagram illustrating overlapping of original audio and video data of multiple frames in a dynamic switching method for lossy or lossless compression of fixed-bandwidth audio and video data according to an embodiment of the present invention.
Fig. 4 is a schematic diagram of decompression of a previous frame and a current frame of lossy compression in the dynamic switching method of lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 5 is a schematic diagram of decompression of a previous frame of lossy compression and a current frame of lossless compression in the dynamic switching method of lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 6 is a schematic diagram of decompression of a previous frame and a current frame of lossless compression according to the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data in the embodiment of the present invention.
Fig. 7 is a schematic diagram of the dynamic switching method of lossy or lossless compression of fixed-bandwidth audio-visual data, the decompression of the previous frame of lossless compression and the current frame of lossy compression according to the embodiment of the present invention.
Detailed Description
Please refer to fig. 1, which is a flowchart illustrating a compression procedure of a dynamic switching method for lossy or lossless compression of fixed bandwidth audio data according to an embodiment of the present invention. As shown in fig. 1, the method for dynamically switching lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention may include steps S101 to S115, and the execution sequence, content, and number of the steps may be adjusted according to actual requirements. The steps S101 to S115 are performed for each of a plurality of frames to individually determine whether lossy compression or lossless compression is applied to each frame under a predetermined bandwidth/fixed bandwidth. Different frames may be compressed differently, such as but not limited to lossy compression for the previous frame and lossless compression for the current frame. For convenience of explanation, hereinafter, the currently processed frame is simply referred to as a current frame.
In step S101, the energy value of the original audio data of the current frame is calculated. For example, a square value of the amplitude of each of a plurality of waveforms of the audio signal of the original audio data of each frame is calculated. Then, the square values of the waveforms are summed to generate a total energy value. The total energy value is divided by the number of the plurality of waveforms to obtain an average energy value. Then, taking the logarithm value of the average energy value to calculate the energy of the original voice data of each frame. Expressed by the following equation: e=log((x1 2+x2 2+…+xn 2) N) where E represents the energy of the original audio signal data for each frame, x1To xnEach represents a plurality of amplitudes of the plurality of waveforms, and n represents the number of the plurality of waveforms/the number of the plurality of acquired amplitudes.
In step S103, a predetermined maximum energy value capable of lossless compression is calculated according to a set bandwidth, and whether the energy value of the original audio-visual data of the current frame is greater than the predetermined maximum energy value is determined. If the energy value of the original audio/video data of the current frame is greater than the predetermined maximum energy value, E > Emax (where E represents the energy value of the original audio/video data of the current frame, and Emax represents the predetermined maximum energy value), the lossy compression method is determined to be selected, and step S105 is executed. Conversely, if the energy value of the original audio data of the current frame is not greater than (i.e. equal to or less than) the predetermined maximum energy value, then the lossless compression method is selected, and step S107 is executed.
In step S105, the original audio signal data of the current frame having an energy value greater than the predetermined maximum energy value is lossy-compressed to generate audio signal lossy-compressed data. Therefore, the situation that when the energy value of the original voice data is too large, lossless compression is still tried to cause waste of operation time can be avoided. If necessary, step S115 may be performed next.
In step S107, it is determined whether the original audio-visual data of the frame previous to the current frame is compressed in a lossless compression manner.
If the original audio-visual data of the previous frame is compressed in a lossy manner, the original audio-visual data of the previous frame and the original audio-visual data of the current frame need to be obtained in order to successfully restore the complete original audio-visual data of the previous frame during decompression. Therefore, if the original audio-visual data of the previous frame is compressed in a lossy manner, step S109 is executed and then step S111 is executed when performing lossless compression of the original audio-visual data of the current frame.
In step S109, the original voice data of the previous frame (for example, the previous frame B) is applied to a window function (window function) of Modified Discrete Cosine Transform (MDCT) to calculate a function voice data according to Time-Domain Aliasing Cancellation (TDAC).
If the original audio-visual data of the previous frame is compressed by a lossy compression method, the lossless compression range of the current frame needs to be extended to the overlapping (overlap) part of the current frame and the previous frame. Therefore, after step S109 is executed, step S111 is executed to perform lossless compression on the function voice data together with the complete original voice data of the current frame to generate a voice lossless compression data.
On the contrary, if it is determined that the original audio data of the previous frame and the current frame are both losslessly compressed, when losslessly compressing the original audio data of the current frame, the compression range of the current frame does not need to be extended to the previous frame (i.e. step S109 does not need to be executed), but step S111 is directly executed to losslessly compress the original audio data of the current frame.
In step S113, a predetermined maximum length allowed after lossless compression of each frame is calculated according to the set bandwidth, and it is determined whether the data length of the audio/video lossless compression data of the current frame is greater than the predetermined maximum length.
If the data length of the audio-visual lossless compressed data of the current frame is not greater than the predetermined maximum length, i.e., L ≦ Lmax (where L represents the data length of the audio-visual lossless compressed data of the current frame and Lmax represents the predetermined maximum length), step S115 may then be performed. On the contrary, if the data length of the audio-visual lossless compression data of the current frame is greater than the predetermined maximum length, step S105 is executed to perform lossy compression on the original audio-visual data of the current frame, and then step S115 is executed.
In step S115, the compressed audio data, i.e., the audio lossy compressed data or the audio lossless compressed data, is transmitted.
Please refer to fig. 2, which is a flowchart illustrating a decompression step of a dynamic switching method for lossy or lossless compression of fixed bandwidth audio data according to an embodiment of the present invention.
As shown in fig. 2, the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data of the present embodiment may include steps S201 to S213.
In step S201, a header of compressed audio/video data of a current frame is read.
In step S203, it is determined whether lossless compression is applied to the current frame (i.e. whether the current frame is the audio/video lossless compression data) according to the read content. If yes, step S205 is executed next. If not, step S207 is executed next.
In step S205, lossless decompression is performed on the original audio/video data of the current frame.
In step S207, lossy decompression and inverse modified discrete cosine transform are performed on the original audio-visual data of the current frame.
In step S209, it is determined whether or not lossless compression is applied to the compressed audio/video data of the frame immediately preceding the current frame. If the last frame is losslessly compressed, the overlapped portion between the current frame and the previous frame is eliminated without using Time Domain Aliasing Cancellation (TDAC) to restore the previous frame, so step S213 is performed next. Conversely, if the previous frame is lossy compressed, the steps S211 and S213 are performed sequentially.
In step S211, the first half of the decompressed data of the current frame is overlapped with the second half of the modified inverse discrete cosine transform result of the previous frame according to the time domain aliasing cancellation method (TDAC), so as to restore the original audio and video data of the previous frame.
In step S213, the original audio data of the previous frame after decompression is output.
After the above steps of the embodiment of the present invention are repeatedly performed, under a fixed bandwidth, lossy compression or lossless compression may be individually selected to be used according to the characteristics of each frame, so that the compression modes used for consecutive frames may be the same or different. The main purpose of the present invention is that the compression modes used for the consecutive frames may be different, for example, the previous frame uses lossless compression, but the next frame (i.e. the current frame) of the previous frame may be dynamically switched to lossy compression, and the next frame of the current frame may be dynamically switched to lossless compression, which is only illustrated herein, but the present invention is not limited thereto. As described in more detail below.
Please refer to fig. 4, which is a schematic diagram illustrating a decompression of a previous frame and a current frame of lossy compression according to a dynamic switching method of lossy or lossless compression of fixed bandwidth audio-visual data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applied to the previous frame B, the previous frame a of the previous frame, and the current frame C, which all adopt lossy compression, as shown in fig. 4.
First, as shown in fig. 4, the second half of the improved inverse discrete cosine transform result of the previous frame B and the first half of the improved inverse discrete cosine transform result of the current frame C are overlapped according to the time-domain aliasing cancellation method (TDAC), so as to restore the original audio-visual data of the previous frame.
The previous frame B as shown in fig. 3 applies the general MDCT window function and operates as follows according to the time-domain aliasing cancellation method: IMDCT (MDCT (WA, W)RB) W in the latter half of the formula)R(WRB+(WRB)R)=WR 2B+WWRBRAnd IMDCT (WB, W)RC) The first half of (B) is W (WB-W)RBR)=W2B–WWRBRAdding the two halves (W)R 2B+WWRBR)+(W2B–WWRBR)=(WR 2+W2) B ═ B, where (W)R 2+W2) Where B represents the original audio data of the previous frame (i.e. the previous frame mentioned above), R represents reverse (inverse), BRRepresenting the inverse of the B waveform.
As described above, in the case of restoring the audio data of each frame by lossy compression, TDAC operation is required to be performed on the audio data of the previous and next frames in succession, and each frame by lossy compression can be restored.
Please refer to fig. 5, which is a schematic diagram illustrating a decompression of a previous frame of lossy compression and a current frame of lossless compression in a dynamic switching method of lossy or lossless compression of fixed bandwidth audio-visual data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applicable to the previous frame B using lossy compression and the current frame C using lossless compression as shown in fig. 5.
First, as shown in fig. 5, the last frame B is subjected to lossy decompression and an inverse modified discrete cosine transform, where the second half of the transform result is WR 2B+WWRBR。
Lossless compressing the current frame C to obtain the original audio data of the current frame C and applying the window function of modified discrete cosine transform to the previous frame B in step S109 to obtain W2B–WWRBR。
According to the time-domain aliasing cancellation method, W is divided intoR 2B+WWRBRAnd W2B–WWRBRAnd after addition operation, restoring complete original voice and information data of the previous frame B.
Please refer to fig. 6, which is a schematic diagram illustrating a decompression of a previous frame and a current frame of lossless compression according to a dynamic switching method for lossy or lossless compression of fixed bandwidth audio data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applied to the previous frame B and the current frame C that are both losslessly compressed as shown in fig. 6.
First, as shown in fig. 6, if the audio-visual data of the previous frame B is lossless compressed and the audio-visual data of the current frame C is lossless compressed, the lossless compressed audio-visual data of the previous frame is decompressed to restore the original audio-visual data of the previous frame B.
As described above, the original audio/video data of each frame compressed without loss can be directly restored after decompression without performing MDCT operation with the original audio/video data of the previous and next frames in succession.
Please refer to fig. 7, which is a diagram illustrating a decompression of a last frame of lossless compression and a next frame of lossy compression according to a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applicable to the previous frame B using lossless compression and the current frame C using lossy compression as shown in fig. 7.
If the audio-visual data of the previous frame B is compressed losslessly and the audio-visual data of the current frame C is compressed losslessly, the audio-visual data of the previous frame B is decompressed losslessly, and the original audio-visual data of the previous frame B can be restored.
As described above, the original audio data of each frame, which is losslessly compressed, can be directly decompressed to restore the complete original audio data without performing TDAC operation with the original audio data of the previous and next frames of the consecutive frame. However, in the restoration of the original audio data of the frame using the lossy compression, TDAC operation is required to be performed with the original audio data of the previous frame and the original audio data of the next frame, so that each frame using the lossy compression can be restored.
In summary, the present invention provides a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data, which has the following main features:
dynamically switching a lossy lossless compression mode under a fixed bandwidth;
supporting a Modified Discrete Cosine Transform (MDCT);
frames which cannot be subjected to lossless compression in a fixed bandwidth are filtered according to the energy, lossless compression is not required to be tried on each frame, and time consumption of compression operation is reduced;
the invention directly uses the lossless compression to the original audio-visual data of a frame, which is not the most lossless compression mode in the market, and calculates the difference between the original audio-visual data and the lossy compression restoration.
For a better understanding of the features and technical content of the present invention, reference should be made to the following detailed description and accompanying drawings, which are provided for purposes of illustration and description only and are not intended to limit the invention.
The disclosure above is only a preferred embodiment of the present invention and is not intended to limit the claims, so that all the modifications and equivalents of the disclosure and drawings are included in the claims.
Claims (8)
1. A method for dynamic switching of lossy or lossless compression of fixed bandwidth audio-visual data, the method comprising the steps of:
calculating a predetermined maximum energy value that each frame of a plurality of frames can be compressed losslessly according to a set bandwidth;
reading the original voice and information data of each frame;
calculating the energy value of the original voice data of each frame; and judging whether the energy value of the original audio-visual data of each frame is larger than the preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, performing lossless compression on the original audio-visual data of the frame to generate audio-visual lossless compressed data.
2. The method of claim 1, further comprising the steps of:
calculating a square value of an amplitude of each of a plurality of waveforms of a voice signal of the original voice data of each of the frames;
summing the plurality of squared values of the plurality of waveforms, respectively, to produce a total energy value; and dividing the total energy value by the number of the plurality of waveforms to obtain an average energy value, taking a logarithmic value of the average energy value to calculate the energy of the original audio-visual data of each frame, which is expressed by the following equation: e ═ log ((x)1 2+x2 2+…+xn 2) N) where E represents the energy of the original audio signal material for each of the frames, x1To xnRespectively represent the amplitudes of the waveforms, n represents the waveformsThe number of the cells.
3. The method of claim 1, further comprising the steps of:
judging whether the energy value of the original audio-visual data of each frame is larger than the preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, executing the next step;
judging whether the last frame of the current frame is lossless compression or not, if so, performing lossless compression on the original voice and information data of the current frame to generate voice and information lossless compression data, not executing subsequent steps, and if not, sequentially executing the subsequent steps;
applying a window function (window function) of Modified Discrete Cosine Transform (MDCT) to the original voice data of the previous frame, and calculating function voice data according to Time-Domain Aliasing Cancellation (TDAC); and lossless compressing the function voice and information data and the original voice and information data of the current frame to generate the voice and information lossless compression data.
4. The method of claim 1, further comprising the steps of:
calculating the allowable preset maximum length of the original audio-visual data of each frame after compression according to the set bandwidth;
and judging whether the data length of the audio-visual lossless compression data of each frame is greater than the preset maximum length, if not, retaining the audio-visual lossless compression data, and if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compression data to replace the audio-visual lossless compression data with the data length greater than the preset maximum length.
5. The method of claim 1, further comprising the steps of:
performing lossy compression on the original audio and information data of the previous frame;
determining to perform lossy compression on the original audio and video data of the current frame;
carrying out improved discrete cosine transform after applying a window function to the original voice and information data of the previous frame and the original voice and information data of the current frame, and calculating function voice and information data; and carrying out lossy compression on the function voice and information data to generate the voice and information lossy compression data.
6. The method of claim 1, further comprising the steps of:
performing lossy compression on the original audio and information data of the previous frame;
determining to perform lossless compression on the original voice and information data of the current frame;
applying an improved discrete cosine transform window function to the original voice and information data of the previous frame, and calculating function voice and information data according to a time domain aliasing cancellation method; and lossless compressing the function voice and information data and the complete original voice and information data of the current frame together to generate the voice and information lossless compression data.
7. The method of claim 1, further comprising the steps of:
lossless compressing the original audio-visual data of the previous frame to generate the audio-visual lossless compressed data of the previous frame; and determining to perform lossless compression on the original audio-visual data of the current frame to generate the audio-visual lossless compression data of the current frame.
8. The method of claim 1, further comprising the steps of:
lossless compressing the original voice and information data of the previous frame to generate voice and information lossless compressed data;
determining to perform lossy compression on the original audio and video data of the current frame;
carrying out improved discrete cosine transform after applying a window function to the original voice and information data of the previous frame and the original voice and information data of the current frame, and calculating function voice and information data; and performing lossy compression on the function voice and information data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011500408.7A CN114724567A (en) | 2020-12-18 | 2020-12-18 | Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011500408.7A CN114724567A (en) | 2020-12-18 | 2020-12-18 | Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114724567A true CN114724567A (en) | 2022-07-08 |
Family
ID=82229873
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011500408.7A Pending CN114724567A (en) | 2020-12-18 | 2020-12-18 | Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114724567A (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5819215A (en) * | 1995-10-13 | 1998-10-06 | Dobson; Kurt | Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data |
US20020016161A1 (en) * | 2000-02-10 | 2002-02-07 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for compression of speech encoded parameters |
US20040044521A1 (en) * | 2002-09-04 | 2004-03-04 | Microsoft Corporation | Unified lossy and lossless audio compression |
CN101490748A (en) * | 2006-07-24 | 2009-07-22 | 汤姆森许可贸易公司 | Method and apparatus for lossless encoding of a source signal, using a lossy encoded data stream and a lossless extension data stream |
CN101784976A (en) * | 2007-09-07 | 2010-07-21 | 高通股份有限公司 | Power efficient batch-frame audio decoding apparatus, system and method |
US20120290306A1 (en) * | 2011-05-12 | 2012-11-15 | Cambridge Silicon Radio Ltd. | Hybrid coded audio data streaming apparatus and method |
CN109147805A (en) * | 2018-06-05 | 2019-01-04 | 安克创新科技股份有限公司 | Audio sound quality enhancing based on deep learning |
CN110265043A (en) * | 2019-06-03 | 2019-09-20 | 同响科技股份有限公司 | Adaptively damage or lossless message compression and decompression calculation method |
TW202223881A (en) * | 2020-12-11 | 2022-06-16 | 同響科技股份有限公司 | Method of dynamically switching lossy compression and lossless compression that will be performed on audio data in constant bandwidth |
-
2020
- 2020-12-18 CN CN202011500408.7A patent/CN114724567A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5819215A (en) * | 1995-10-13 | 1998-10-06 | Dobson; Kurt | Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data |
US20020016161A1 (en) * | 2000-02-10 | 2002-02-07 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for compression of speech encoded parameters |
US20040044521A1 (en) * | 2002-09-04 | 2004-03-04 | Microsoft Corporation | Unified lossy and lossless audio compression |
CN101490748A (en) * | 2006-07-24 | 2009-07-22 | 汤姆森许可贸易公司 | Method and apparatus for lossless encoding of a source signal, using a lossy encoded data stream and a lossless extension data stream |
CN101784976A (en) * | 2007-09-07 | 2010-07-21 | 高通股份有限公司 | Power efficient batch-frame audio decoding apparatus, system and method |
US20120290306A1 (en) * | 2011-05-12 | 2012-11-15 | Cambridge Silicon Radio Ltd. | Hybrid coded audio data streaming apparatus and method |
CN109147805A (en) * | 2018-06-05 | 2019-01-04 | 安克创新科技股份有限公司 | Audio sound quality enhancing based on deep learning |
CN110265043A (en) * | 2019-06-03 | 2019-09-20 | 同响科技股份有限公司 | Adaptively damage or lossless message compression and decompression calculation method |
TW202223881A (en) * | 2020-12-11 | 2022-06-16 | 同響科技股份有限公司 | Method of dynamically switching lossy compression and lossless compression that will be performed on audio data in constant bandwidth |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20200395031A1 (en) | Companding system and method to reduce quantization noise using advanced spectral extension | |
EP0737350B1 (en) | System and method for performing voice compression | |
US7986797B2 (en) | Signal processing system, signal processing apparatus and method, recording medium, and program | |
US10861475B2 (en) | Signal-dependent companding system and method to reduce quantization noise | |
JP5065687B2 (en) | Audio data processing device and terminal device | |
JPH05304479A (en) | High efficient encoder of audio signal | |
US11640825B2 (en) | Time-domain stereo encoding and decoding method and related product | |
JP4021124B2 (en) | Digital acoustic signal encoding apparatus, method and recording medium | |
JP2015507764A (en) | Method, apparatus and system for processing audio data | |
CN114550732B (en) | Coding and decoding method and related device for high-frequency audio signal | |
JP4657570B2 (en) | Music information encoding apparatus and method, music information decoding apparatus and method, program, and recording medium | |
US20020173969A1 (en) | Method for decompressing a compressed audio signal | |
JP3353868B2 (en) | Audio signal conversion encoding method and decoding method | |
TWI826754B (en) | Method of dynamically switching lossy compression and lossless compression that will be performed on audio data in constant bandwidth | |
KR102637514B1 (en) | Time-domain stereo coding and decoding method and related product | |
US10650834B2 (en) | Audio processing method and non-transitory computer readable medium | |
CN114724567A (en) | Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data | |
US6678653B1 (en) | Apparatus and method for coding audio data at high speed using precision information | |
JP4645869B2 (en) | DIGITAL SIGNAL PROCESSING METHOD, LEARNING METHOD, DEVICE THEREOF, AND PROGRAM STORAGE MEDIUM | |
JPH03233500A (en) | Voice synthesis system and device used for same | |
JP4538705B2 (en) | Digital signal processing method, learning method and apparatus, and program storage medium | |
JP5491193B2 (en) | Speech coding method and apparatus | |
CN115188394B (en) | Sound mixing method, device, electronic equipment and storage medium | |
JP3510493B2 (en) | Audio signal encoding / decoding method and recording medium recording the program | |
JP2001148632A (en) | Encoding device, encoding method and recording medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |