CN114724567A - Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data - Google Patents

Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data Download PDF

Info

Publication number
CN114724567A
CN114724567A CN202011500408.7A CN202011500408A CN114724567A CN 114724567 A CN114724567 A CN 114724567A CN 202011500408 A CN202011500408 A CN 202011500408A CN 114724567 A CN114724567 A CN 114724567A
Authority
CN
China
Prior art keywords
data
frame
audio
voice
original
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011500408.7A
Other languages
Chinese (zh)
Inventor
李敬祥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
T'ung Hsiang Technologies Co ltd
Original Assignee
T'ung Hsiang Technologies Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by T'ung Hsiang Technologies Co ltd filed Critical T'ung Hsiang Technologies Co ltd
Priority to CN202011500408.7A priority Critical patent/CN114724567A/en
Publication of CN114724567A publication Critical patent/CN114724567A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0017Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The application discloses a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data, which comprises the following steps: calculating a predetermined maximum energy value that each frame of the plurality of frames can be compressed losslessly according to the set bandwidth; reading original voice and information data of each frame; calculating the energy value of the original voice and information data of each frame; and judging whether the energy value of the original audio-visual data of each frame is larger than a preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame, and if not, performing lossless compression on the original audio-visual data of the frame.

Description

Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data
Technical Field
The present application relates to a compression method of audio-visual data stream, and more particularly, to a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data.
Background
The digital audio data is usually encoded in a frame (frame) and then transmitted to a receiving end via a wired or wireless network for decoding and playing. The encoding method is classified into a lossy compression (lossy compression) and a lossless compression (lossless compression). Lossless compression can obtain audio and video data that is completely consistent with that before compression after decompression, but lossless compression is generally poor in compression efficiency compared with lossy compression, which removes sound nuances that are generally not discernible by human ears. General users are interested in convenient and stable use, and music enthusiasts who pursue high tone quality want the degree of distortion of the tone signal to be as small as possible, and the tone signal is preferably reproduced as the original tone.
To solve the above problems, MPEG-4SLS proposes a method of mixing lossy and lossless compression to satisfy different requirements of two users, and adds a difference part with the original Audio and video data on a lossy-compressed base, and only part or all of the lossy-compressed data can be transmitted according to the requirements during playing, such as the base-added difference part of Advanced Audio Coding (AAC) lossy compression, to complete lossless compression. However, MPEG-4SLS has a large increase in bandwidth 128Kbps compared to AAC, and is no longer a fixed bandwidth, but rather increases complexity and instability of network transmission and processing.
Disclosure of Invention
In view of the above, the present application provides a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data, which includes the following steps: calculating a predetermined maximum energy value that each frame of the plurality of frames can be compressed losslessly according to the set bandwidth; reading original voice and information data of each frame; calculating the energy value of the original voice and information data of each frame; and judging whether the energy value of the original audio-visual data of each frame is larger than a preset maximum energy value or not, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, performing lossless compression on the original audio-visual data of the frame to generate audio-visual lossless compressed data.
According to an example embodiment, the method for dynamically switching lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: calculating a square value of the amplitude of each waveform in a plurality of waveforms of the audio signal of the original audio data of each frame; summing a plurality of squared values of the plurality of waveforms, respectively, to generate a total energy value; and dividing the total energy value by the number of the plurality of waveforms to obtain an average energy value, taking a logarithm value of the average energy value to calculate the energy of the original voice information of each frame, which is expressed by the following equation: e ═ log ((x)1 2+x2 2+…+xn 2) N) where E represents the energy of the original audio signal data for each frame, x1To xnRespectively representThe amplitudes of the plurality of waveforms, respectively, n represents the number of the plurality of waveforms.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: judging whether the energy value of the original audio-visual data of each frame is larger than a preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, executing the next step; judging whether the previous frame of the current frame is lossless compression or not, if so, performing lossless compression on the original voice and information data of the current frame to generate voice and information lossless compression data, not executing subsequent steps, and if not, sequentially executing the subsequent steps; applying the window function (window function) of Modified Discrete Cosine Transform (MDCT) to the original voice data of the previous frame, and calculating function voice data according to Time-Domain Aliasing Cancellation (TDAC); and lossless compressing the function voice and information data and the original voice and information data of the current frame to generate voice and information lossless compression data.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: calculating a preset maximum length allowed after the original audio and video data of each frame is compressed according to the set bandwidth; and judging whether the data length of the audio-visual lossless compression data of each frame is greater than the preset maximum length, if not, retaining the audio-visual lossless compression data, and if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compression data to replace the audio-visual lossless compression data with the data length greater than the preset maximum length.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: carrying out lossy compression on the original audio-visual data of the previous frame; determining to perform lossy compression on the original audio and video data of the current frame; performing improved discrete cosine transform after applying a window function to the original voice and message data of the previous frame and the original voice and message data of the current frame, and calculating function voice and message data; and lossy compressing the function voice and information data to generate voice and information lossy compressed data.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: carrying out lossy compression on the original audio-visual data of the previous frame; determining to perform lossless compression on the original voice and information data of the current frame; applying an improved discrete cosine transform window function to the original voice and information data of the previous frame, and calculating function voice and information data according to a time domain aliasing cancellation method; and lossless compressing the function voice and information data together with the complete original voice and information data of the current frame to generate voice and information lossless compression data.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: lossless compression is carried out on the original voice and information data of the previous frame to generate voice and information lossless compression data of the previous frame; and determining to perform lossless compression on the original voice and information data of the current frame so as to generate voice and information lossless compression data of the current frame.
According to an example embodiment, the method for dynamically switching between lossy or lossless compression of fixed bandwidth audio-visual data further comprises the steps of: lossless compression is carried out on the original voice and information data of the previous frame to generate voice and information lossless compression data; determining the original audio-visual data of the previous frame and the original audio-visual data of the current frame, applying a window function and then performing improved discrete cosine transform to calculate a function audio-visual data; and performing lossy compression on the function voice and information data.
Drawings
Fig. 1 is a flowchart of the compression steps of the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 2 is a flowchart of the decompression steps of the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 3 is a schematic diagram illustrating overlapping of original audio and video data of multiple frames in a dynamic switching method for lossy or lossless compression of fixed-bandwidth audio and video data according to an embodiment of the present invention.
Fig. 4 is a schematic diagram of decompression of a previous frame and a current frame of lossy compression in the dynamic switching method of lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 5 is a schematic diagram of decompression of a previous frame of lossy compression and a current frame of lossless compression in the dynamic switching method of lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention.
Fig. 6 is a schematic diagram of decompression of a previous frame and a current frame of lossless compression according to the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data in the embodiment of the present invention.
Fig. 7 is a schematic diagram of the dynamic switching method of lossy or lossless compression of fixed-bandwidth audio-visual data, the decompression of the previous frame of lossless compression and the current frame of lossy compression according to the embodiment of the present invention.
Detailed Description
Please refer to fig. 1, which is a flowchart illustrating a compression procedure of a dynamic switching method for lossy or lossless compression of fixed bandwidth audio data according to an embodiment of the present invention. As shown in fig. 1, the method for dynamically switching lossy or lossless compression of fixed-bandwidth audio-visual data according to the embodiment of the present invention may include steps S101 to S115, and the execution sequence, content, and number of the steps may be adjusted according to actual requirements. The steps S101 to S115 are performed for each of a plurality of frames to individually determine whether lossy compression or lossless compression is applied to each frame under a predetermined bandwidth/fixed bandwidth. Different frames may be compressed differently, such as but not limited to lossy compression for the previous frame and lossless compression for the current frame. For convenience of explanation, hereinafter, the currently processed frame is simply referred to as a current frame.
In step S101, the energy value of the original audio data of the current frame is calculated. For example, a square value of the amplitude of each of a plurality of waveforms of the audio signal of the original audio data of each frame is calculated. Then, the square values of the waveforms are summed to generate a total energy value. The total energy value is divided by the number of the plurality of waveforms to obtain an average energy value. Then, taking the logarithm value of the average energy value to calculate the energy of the original voice data of each frame. Expressed by the following equation: e=log((x1 2+x2 2+…+xn 2) N) where E represents the energy of the original audio signal data for each frame, x1To xnEach represents a plurality of amplitudes of the plurality of waveforms, and n represents the number of the plurality of waveforms/the number of the plurality of acquired amplitudes.
In step S103, a predetermined maximum energy value capable of lossless compression is calculated according to a set bandwidth, and whether the energy value of the original audio-visual data of the current frame is greater than the predetermined maximum energy value is determined. If the energy value of the original audio/video data of the current frame is greater than the predetermined maximum energy value, E > Emax (where E represents the energy value of the original audio/video data of the current frame, and Emax represents the predetermined maximum energy value), the lossy compression method is determined to be selected, and step S105 is executed. Conversely, if the energy value of the original audio data of the current frame is not greater than (i.e. equal to or less than) the predetermined maximum energy value, then the lossless compression method is selected, and step S107 is executed.
In step S105, the original audio signal data of the current frame having an energy value greater than the predetermined maximum energy value is lossy-compressed to generate audio signal lossy-compressed data. Therefore, the situation that when the energy value of the original voice data is too large, lossless compression is still tried to cause waste of operation time can be avoided. If necessary, step S115 may be performed next.
In step S107, it is determined whether the original audio-visual data of the frame previous to the current frame is compressed in a lossless compression manner.
If the original audio-visual data of the previous frame is compressed in a lossy manner, the original audio-visual data of the previous frame and the original audio-visual data of the current frame need to be obtained in order to successfully restore the complete original audio-visual data of the previous frame during decompression. Therefore, if the original audio-visual data of the previous frame is compressed in a lossy manner, step S109 is executed and then step S111 is executed when performing lossless compression of the original audio-visual data of the current frame.
In step S109, the original voice data of the previous frame (for example, the previous frame B) is applied to a window function (window function) of Modified Discrete Cosine Transform (MDCT) to calculate a function voice data according to Time-Domain Aliasing Cancellation (TDAC).
If the original audio-visual data of the previous frame is compressed by a lossy compression method, the lossless compression range of the current frame needs to be extended to the overlapping (overlap) part of the current frame and the previous frame. Therefore, after step S109 is executed, step S111 is executed to perform lossless compression on the function voice data together with the complete original voice data of the current frame to generate a voice lossless compression data.
On the contrary, if it is determined that the original audio data of the previous frame and the current frame are both losslessly compressed, when losslessly compressing the original audio data of the current frame, the compression range of the current frame does not need to be extended to the previous frame (i.e. step S109 does not need to be executed), but step S111 is directly executed to losslessly compress the original audio data of the current frame.
In step S113, a predetermined maximum length allowed after lossless compression of each frame is calculated according to the set bandwidth, and it is determined whether the data length of the audio/video lossless compression data of the current frame is greater than the predetermined maximum length.
If the data length of the audio-visual lossless compressed data of the current frame is not greater than the predetermined maximum length, i.e., L ≦ Lmax (where L represents the data length of the audio-visual lossless compressed data of the current frame and Lmax represents the predetermined maximum length), step S115 may then be performed. On the contrary, if the data length of the audio-visual lossless compression data of the current frame is greater than the predetermined maximum length, step S105 is executed to perform lossy compression on the original audio-visual data of the current frame, and then step S115 is executed.
In step S115, the compressed audio data, i.e., the audio lossy compressed data or the audio lossless compressed data, is transmitted.
Please refer to fig. 2, which is a flowchart illustrating a decompression step of a dynamic switching method for lossy or lossless compression of fixed bandwidth audio data according to an embodiment of the present invention.
As shown in fig. 2, the dynamic switching method for lossy or lossless compression of fixed-bandwidth audio-visual data of the present embodiment may include steps S201 to S213.
In step S201, a header of compressed audio/video data of a current frame is read.
In step S203, it is determined whether lossless compression is applied to the current frame (i.e. whether the current frame is the audio/video lossless compression data) according to the read content. If yes, step S205 is executed next. If not, step S207 is executed next.
In step S205, lossless decompression is performed on the original audio/video data of the current frame.
In step S207, lossy decompression and inverse modified discrete cosine transform are performed on the original audio-visual data of the current frame.
In step S209, it is determined whether or not lossless compression is applied to the compressed audio/video data of the frame immediately preceding the current frame. If the last frame is losslessly compressed, the overlapped portion between the current frame and the previous frame is eliminated without using Time Domain Aliasing Cancellation (TDAC) to restore the previous frame, so step S213 is performed next. Conversely, if the previous frame is lossy compressed, the steps S211 and S213 are performed sequentially.
In step S211, the first half of the decompressed data of the current frame is overlapped with the second half of the modified inverse discrete cosine transform result of the previous frame according to the time domain aliasing cancellation method (TDAC), so as to restore the original audio and video data of the previous frame.
In step S213, the original audio data of the previous frame after decompression is output.
After the above steps of the embodiment of the present invention are repeatedly performed, under a fixed bandwidth, lossy compression or lossless compression may be individually selected to be used according to the characteristics of each frame, so that the compression modes used for consecutive frames may be the same or different. The main purpose of the present invention is that the compression modes used for the consecutive frames may be different, for example, the previous frame uses lossless compression, but the next frame (i.e. the current frame) of the previous frame may be dynamically switched to lossy compression, and the next frame of the current frame may be dynamically switched to lossless compression, which is only illustrated herein, but the present invention is not limited thereto. As described in more detail below.
Please refer to fig. 4, which is a schematic diagram illustrating a decompression of a previous frame and a current frame of lossy compression according to a dynamic switching method of lossy or lossless compression of fixed bandwidth audio-visual data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applied to the previous frame B, the previous frame a of the previous frame, and the current frame C, which all adopt lossy compression, as shown in fig. 4.
First, as shown in fig. 4, the second half of the improved inverse discrete cosine transform result of the previous frame B and the first half of the improved inverse discrete cosine transform result of the current frame C are overlapped according to the time-domain aliasing cancellation method (TDAC), so as to restore the original audio-visual data of the previous frame.
The previous frame B as shown in fig. 3 applies the general MDCT window function and operates as follows according to the time-domain aliasing cancellation method: IMDCT (MDCT (WA, W)RB) W in the latter half of the formula)R(WRB+(WRB)R)=WR 2B+WWRBRAnd IMDCT (WB, W)RC) The first half of (B) is W (WB-W)RBR)=W2B–WWRBRAdding the two halves (W)R 2B+WWRBR)+(W2B–WWRBR)=(WR 2+W2) B ═ B, where (W)R 2+W2) Where B represents the original audio data of the previous frame (i.e. the previous frame mentioned above), R represents reverse (inverse), BRRepresenting the inverse of the B waveform.
As described above, in the case of restoring the audio data of each frame by lossy compression, TDAC operation is required to be performed on the audio data of the previous and next frames in succession, and each frame by lossy compression can be restored.
Please refer to fig. 5, which is a schematic diagram illustrating a decompression of a previous frame of lossy compression and a current frame of lossless compression in a dynamic switching method of lossy or lossless compression of fixed bandwidth audio-visual data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applicable to the previous frame B using lossy compression and the current frame C using lossless compression as shown in fig. 5.
First, as shown in fig. 5, the last frame B is subjected to lossy decompression and an inverse modified discrete cosine transform, where the second half of the transform result is WR 2B+WWRBR
Lossless compressing the current frame C to obtain the original audio data of the current frame C and applying the window function of modified discrete cosine transform to the previous frame B in step S109 to obtain W2B–WWRBR
According to the time-domain aliasing cancellation method, W is divided intoR 2B+WWRBRAnd W2B–WWRBRAnd after addition operation, restoring complete original voice and information data of the previous frame B.
Please refer to fig. 6, which is a schematic diagram illustrating a decompression of a previous frame and a current frame of lossless compression according to a dynamic switching method for lossy or lossless compression of fixed bandwidth audio data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applied to the previous frame B and the current frame C that are both losslessly compressed as shown in fig. 6.
First, as shown in fig. 6, if the audio-visual data of the previous frame B is lossless compressed and the audio-visual data of the current frame C is lossless compressed, the lossless compressed audio-visual data of the previous frame is decompressed to restore the original audio-visual data of the previous frame B.
As described above, the original audio/video data of each frame compressed without loss can be directly restored after decompression without performing MDCT operation with the original audio/video data of the previous and next frames in succession.
Please refer to fig. 7, which is a diagram illustrating a decompression of a last frame of lossless compression and a next frame of lossy compression according to a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to an embodiment of the present invention.
Steps S101 to S115 and S201 to S213 included in the dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data according to the embodiment of the present invention are applicable to the previous frame B using lossless compression and the current frame C using lossy compression as shown in fig. 7.
If the audio-visual data of the previous frame B is compressed losslessly and the audio-visual data of the current frame C is compressed losslessly, the audio-visual data of the previous frame B is decompressed losslessly, and the original audio-visual data of the previous frame B can be restored.
As described above, the original audio data of each frame, which is losslessly compressed, can be directly decompressed to restore the complete original audio data without performing TDAC operation with the original audio data of the previous and next frames of the consecutive frame. However, in the restoration of the original audio data of the frame using the lossy compression, TDAC operation is required to be performed with the original audio data of the previous frame and the original audio data of the next frame, so that each frame using the lossy compression can be restored.
In summary, the present invention provides a dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data, which has the following main features:
dynamically switching a lossy lossless compression mode under a fixed bandwidth;
supporting a Modified Discrete Cosine Transform (MDCT);
frames which cannot be subjected to lossless compression in a fixed bandwidth are filtered according to the energy, lossless compression is not required to be tried on each frame, and time consumption of compression operation is reduced;
the invention directly uses the lossless compression to the original audio-visual data of a frame, which is not the most lossless compression mode in the market, and calculates the difference between the original audio-visual data and the lossy compression restoration.
For a better understanding of the features and technical content of the present invention, reference should be made to the following detailed description and accompanying drawings, which are provided for purposes of illustration and description only and are not intended to limit the invention.
The disclosure above is only a preferred embodiment of the present invention and is not intended to limit the claims, so that all the modifications and equivalents of the disclosure and drawings are included in the claims.

Claims (8)

1. A method for dynamic switching of lossy or lossless compression of fixed bandwidth audio-visual data, the method comprising the steps of:
calculating a predetermined maximum energy value that each frame of a plurality of frames can be compressed losslessly according to a set bandwidth;
reading the original voice and information data of each frame;
calculating the energy value of the original voice data of each frame; and judging whether the energy value of the original audio-visual data of each frame is larger than the preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, performing lossless compression on the original audio-visual data of the frame to generate audio-visual lossless compressed data.
2. The method of claim 1, further comprising the steps of:
calculating a square value of an amplitude of each of a plurality of waveforms of a voice signal of the original voice data of each of the frames;
summing the plurality of squared values of the plurality of waveforms, respectively, to produce a total energy value; and dividing the total energy value by the number of the plurality of waveforms to obtain an average energy value, taking a logarithmic value of the average energy value to calculate the energy of the original audio-visual data of each frame, which is expressed by the following equation: e ═ log ((x)1 2+x2 2+…+xn 2) N) where E represents the energy of the original audio signal material for each of the frames, x1To xnRespectively represent the amplitudes of the waveforms, n represents the waveformsThe number of the cells.
3. The method of claim 1, further comprising the steps of:
judging whether the energy value of the original audio-visual data of each frame is larger than the preset maximum energy value, if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compressed data, and if not, executing the next step;
judging whether the last frame of the current frame is lossless compression or not, if so, performing lossless compression on the original voice and information data of the current frame to generate voice and information lossless compression data, not executing subsequent steps, and if not, sequentially executing the subsequent steps;
applying a window function (window function) of Modified Discrete Cosine Transform (MDCT) to the original voice data of the previous frame, and calculating function voice data according to Time-Domain Aliasing Cancellation (TDAC); and lossless compressing the function voice and information data and the original voice and information data of the current frame to generate the voice and information lossless compression data.
4. The method of claim 1, further comprising the steps of:
calculating the allowable preset maximum length of the original audio-visual data of each frame after compression according to the set bandwidth;
and judging whether the data length of the audio-visual lossless compression data of each frame is greater than the preset maximum length, if not, retaining the audio-visual lossless compression data, and if so, performing lossy compression on the original audio-visual data of the frame to generate audio-visual lossy compression data to replace the audio-visual lossless compression data with the data length greater than the preset maximum length.
5. The method of claim 1, further comprising the steps of:
performing lossy compression on the original audio and information data of the previous frame;
determining to perform lossy compression on the original audio and video data of the current frame;
carrying out improved discrete cosine transform after applying a window function to the original voice and information data of the previous frame and the original voice and information data of the current frame, and calculating function voice and information data; and carrying out lossy compression on the function voice and information data to generate the voice and information lossy compression data.
6. The method of claim 1, further comprising the steps of:
performing lossy compression on the original audio and information data of the previous frame;
determining to perform lossless compression on the original voice and information data of the current frame;
applying an improved discrete cosine transform window function to the original voice and information data of the previous frame, and calculating function voice and information data according to a time domain aliasing cancellation method; and lossless compressing the function voice and information data and the complete original voice and information data of the current frame together to generate the voice and information lossless compression data.
7. The method of claim 1, further comprising the steps of:
lossless compressing the original audio-visual data of the previous frame to generate the audio-visual lossless compressed data of the previous frame; and determining to perform lossless compression on the original audio-visual data of the current frame to generate the audio-visual lossless compression data of the current frame.
8. The method of claim 1, further comprising the steps of:
lossless compressing the original voice and information data of the previous frame to generate voice and information lossless compressed data;
determining to perform lossy compression on the original audio and video data of the current frame;
carrying out improved discrete cosine transform after applying a window function to the original voice and information data of the previous frame and the original voice and information data of the current frame, and calculating function voice and information data; and performing lossy compression on the function voice and information data.
CN202011500408.7A 2020-12-18 2020-12-18 Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data Pending CN114724567A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011500408.7A CN114724567A (en) 2020-12-18 2020-12-18 Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011500408.7A CN114724567A (en) 2020-12-18 2020-12-18 Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data

Publications (1)

Publication Number Publication Date
CN114724567A true CN114724567A (en) 2022-07-08

Family

ID=82229873

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011500408.7A Pending CN114724567A (en) 2020-12-18 2020-12-18 Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data

Country Status (1)

Country Link
CN (1) CN114724567A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5819215A (en) * 1995-10-13 1998-10-06 Dobson; Kurt Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data
US20020016161A1 (en) * 2000-02-10 2002-02-07 Telefonaktiebolaget Lm Ericsson (Publ) Method and apparatus for compression of speech encoded parameters
US20040044521A1 (en) * 2002-09-04 2004-03-04 Microsoft Corporation Unified lossy and lossless audio compression
CN101490748A (en) * 2006-07-24 2009-07-22 汤姆森许可贸易公司 Method and apparatus for lossless encoding of a source signal, using a lossy encoded data stream and a lossless extension data stream
CN101784976A (en) * 2007-09-07 2010-07-21 高通股份有限公司 Power efficient batch-frame audio decoding apparatus, system and method
US20120290306A1 (en) * 2011-05-12 2012-11-15 Cambridge Silicon Radio Ltd. Hybrid coded audio data streaming apparatus and method
CN109147805A (en) * 2018-06-05 2019-01-04 安克创新科技股份有限公司 Audio sound quality enhancing based on deep learning
CN110265043A (en) * 2019-06-03 2019-09-20 同响科技股份有限公司 Adaptively damage or lossless message compression and decompression calculation method
TW202223881A (en) * 2020-12-11 2022-06-16 同響科技股份有限公司 Method of dynamically switching lossy compression and lossless compression that will be performed on audio data in constant bandwidth

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5819215A (en) * 1995-10-13 1998-10-06 Dobson; Kurt Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of digital audio or other sensory data
US20020016161A1 (en) * 2000-02-10 2002-02-07 Telefonaktiebolaget Lm Ericsson (Publ) Method and apparatus for compression of speech encoded parameters
US20040044521A1 (en) * 2002-09-04 2004-03-04 Microsoft Corporation Unified lossy and lossless audio compression
CN101490748A (en) * 2006-07-24 2009-07-22 汤姆森许可贸易公司 Method and apparatus for lossless encoding of a source signal, using a lossy encoded data stream and a lossless extension data stream
CN101784976A (en) * 2007-09-07 2010-07-21 高通股份有限公司 Power efficient batch-frame audio decoding apparatus, system and method
US20120290306A1 (en) * 2011-05-12 2012-11-15 Cambridge Silicon Radio Ltd. Hybrid coded audio data streaming apparatus and method
CN109147805A (en) * 2018-06-05 2019-01-04 安克创新科技股份有限公司 Audio sound quality enhancing based on deep learning
CN110265043A (en) * 2019-06-03 2019-09-20 同响科技股份有限公司 Adaptively damage or lossless message compression and decompression calculation method
TW202223881A (en) * 2020-12-11 2022-06-16 同響科技股份有限公司 Method of dynamically switching lossy compression and lossless compression that will be performed on audio data in constant bandwidth

Similar Documents

Publication Publication Date Title
US20200395031A1 (en) Companding system and method to reduce quantization noise using advanced spectral extension
EP0737350B1 (en) System and method for performing voice compression
US7986797B2 (en) Signal processing system, signal processing apparatus and method, recording medium, and program
US10861475B2 (en) Signal-dependent companding system and method to reduce quantization noise
JP5065687B2 (en) Audio data processing device and terminal device
JPH05304479A (en) High efficient encoder of audio signal
US11640825B2 (en) Time-domain stereo encoding and decoding method and related product
JP4021124B2 (en) Digital acoustic signal encoding apparatus, method and recording medium
JP2015507764A (en) Method, apparatus and system for processing audio data
CN114550732B (en) Coding and decoding method and related device for high-frequency audio signal
JP4657570B2 (en) Music information encoding apparatus and method, music information decoding apparatus and method, program, and recording medium
US20020173969A1 (en) Method for decompressing a compressed audio signal
JP3353868B2 (en) Audio signal conversion encoding method and decoding method
TWI826754B (en) Method of dynamically switching lossy compression and lossless compression that will be performed on audio data in constant bandwidth
KR102637514B1 (en) Time-domain stereo coding and decoding method and related product
US10650834B2 (en) Audio processing method and non-transitory computer readable medium
CN114724567A (en) Dynamic switching method for lossy or lossless compression of fixed bandwidth audio-visual data
US6678653B1 (en) Apparatus and method for coding audio data at high speed using precision information
JP4645869B2 (en) DIGITAL SIGNAL PROCESSING METHOD, LEARNING METHOD, DEVICE THEREOF, AND PROGRAM STORAGE MEDIUM
JPH03233500A (en) Voice synthesis system and device used for same
JP4538705B2 (en) Digital signal processing method, learning method and apparatus, and program storage medium
JP5491193B2 (en) Speech coding method and apparatus
CN115188394B (en) Sound mixing method, device, electronic equipment and storage medium
JP3510493B2 (en) Audio signal encoding / decoding method and recording medium recording the program
JP2001148632A (en) Encoding device, encoding method and recording medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination