EP0661826A2 - Codage perceptuel en sous-bandes dans laquelle le rapport signal/masquage est calculés à partir des signaux dans les sous-bandes - Google Patents
Codage perceptuel en sous-bandes dans laquelle le rapport signal/masquage est calculés à partir des signaux dans les sous-bandes Download PDFInfo
- Publication number
- EP0661826A2 EP0661826A2 EP94308601A EP94308601A EP0661826A2 EP 0661826 A2 EP0661826 A2 EP 0661826A2 EP 94308601 A EP94308601 A EP 94308601A EP 94308601 A EP94308601 A EP 94308601A EP 0661826 A2 EP0661826 A2 EP 0661826A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- subband
- signal
- subbands
- processing system
- data processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
- H04B1/665—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using psychoacoustic properties of the ear, e.g. masking effect
Definitions
- the present invention relates in general to an improved method and apparatus for efficiently transmitting data from a source apparatus to a receiving apparatus.
- the present invention relates to a method and apparatus for compressing data for transmission.
- the present invention relates to a method and apparatus for compressing digital audio data.
- a system data bus may be utilized to transmit data to and from a central processing unit, direct access storage devices, communications input/output processors (IOPs), and other peripheral devices.
- IOPs communications input/output processors
- the bandwidth of a data bus is the rate, expressed in bytes per second, at which data can be conveyed from a source to a target, such as a workstation or other receiving device connected on the bus. Such bandwidth is limited by the electrical characteristics of the transceivers connected to the system data bus, and the electrical characteristics of the system data bus itself.
- a communication link may be utilized to transmit data from a source processor to a workstation within a distributed data processing system.
- a communication link also has a finite bandwidth which limits the capacity or volume of information that may be transmitted via the communications link.
- data transmission capacity is a resource that may be divided among several devices connected to such communication channels. As more devices are connected to such communications channels, and as the volume of data communicated between devices on such channels increases, the need to conserve channel capacity and optimize channel usage becomes increasingly important.
- Multimedia data is a collection of "time-related" or “time-based” data files which may be utilized to represent video, sound, and animation.
- Such multimedia data files are typically quite large. For example, at 300 pixels per inch and 24 bits per pixel, an 8 1/2-by-11-inch colour picture requires more than 25 megabytes of data storage.
- One method of increasing the capacity of the system data bus or the communications link is to transmit data more efficiently by transmitting data in a compressed format.
- Data compression is the process of eliminating gaps, empty fields, redundancies, and unnecessary data in order to shorten the length of a data file.
- MPEG Moving Pictures Experts Group
- ISO International Standards Organization
- IEC International Electrotechnical Commission
- the MPEG standards for audio may be found in ISO-IEC/JTC1 SC29/WG11, Coding of Moving Pictures And Associated Audio For Digital Storage Media At Up to About 1.5 Mbits/s - Part 3: Audio , DIS, 11172, April 1992.
- MPEG sets forth standards for data compression and may be applied to various signals such as audio and video.
- any data object such as a page of text, an image, a segment of speech or music, or a video sequence
- the compression of any data object may be thought of as a series of steps, including: (1) a decomposition of that object into a collection of "tokens"; (2) the representation of those tokens by binary strings which have a minimal length in some sense; and (3) the concatenation of the strings in a well defined order.
- subband coding is employed to compress audio data.
- the tokens for audio data are subbands.
- a "subband" is a frequency band in a frequency domain.
- the present invention provides, in a first aspect, a method in a data processing system for efficiently compressing a digital audio signal, wherein said digital audio signal includes a plurality of samples, said method comprising: separating each of said plurality of samples into a plurality of subbands; predicting a signal to mask ratio for each of said plurality of subbands utilizing a model of relationships between energy values within each of said plurality of subbands and signal to mask ratios values based on a predetermined psychoacoustic model; allocating a number of bits in response to said predicted signal to mask ratio and a preselected bit-rate; and quantizing each of said plurality of subbands based on said number of bits allocated, wherein said digital audio signal may be efficiently compressed.
- the present invention provides a data processing system for compressing a digital audio signal, wherein said digital audio signal includes a plurality of samples, said data processing system comprising: separation means for separating each of said plurality of samples into a plurality of subbands; prediction means for predicting a signal to mask ratio for each of said plurality of subbands utilizing a model of relationships between energy values within each of said plurality of subbands and signal to mask ratios values based on a predetermined psychoacoustic model; allocation means for allocating a number of bits in response to said predicted signal to mask ratio and a preselected bit-rate; and quantization means for quantizing each of said plurality of subbands based on said number of bits allocated, wherein said digital audio signal may be efficiently compressed.
- a device is used in determining bit allocation, which in turn provides the required input to enable adaptive quantization of a digital audio signal that has been divided into subbands.
- the method and system of a preferred embodiment of the present invention permit the efficient compressing of a digital audio signal, wherein the digital audio signal includes a plurality of samples. Each of the samples are separated into a subbands. A signal to mask ratio (SMR) for each the subbands is predicted utilizing a model of relationships between energy values within each of the subbands and SMR values based on a predetermined psychoacoustic model. A number of bits are allocated in response to the predicted SMR. Then each of the subbands are quantized based on the number of bits allocated, wherein the digital audio signal may be efficiently compressed.
- SMR signal to mask ratio
- the method proposed by MPEG for compression of digital audio is based on subband coding (SBC).
- SBC subband coding
- a SBC scheme initially splits the incoming signal into multiple signals that correspond to various bandwidths that comprise the entire spectrum of the signal. Then the signals are quantized according to either a pre-specified or a dynamic bit-allocation scheme.
- the compression algorithms that attempt to preserve the original quality as much as possible usually employ a dynamic bit allocation scheme.
- the bit-allocation is based upon a perceptual model of the human ear.
- the perceptual model commonly known as a psychoacoustic model, utilizes the spectral information content of the incoming signal and outputs a vector of values that correspond to the signal to mask ratios (SMR) in each subband.
- SMR signal to mask ratios
- MPEG recommends two different such models, Psychoacoustic Model 1 (PM1) and Psychoacoustic Model 2 (PM2). More information on MPEG and PM1 and PM2 may be found in ISO-IEC/JTC1 SC29/WG11, Coding Of Moving Pictures And Associated Audio for Digital Storage Media At Up to About 1.5 Mbits/s - Part 3: Audio, DIS, 11172, April 1992.
- PCM pulse code modulation
- SMR signal to mask ratio
- Bit allocation is performed to allocate bits available for storage or transmission of PCM samples in a subband. The number of bits allocated depends on the SMR value computed in block 202 .
- SMR values are used in conjunction with signal to Noise Ratios (SNR) resulting from quantization of the signal to allocate the number of bits needed for quantization in each subband.
- SNR signal to Noise Ratios
- a high SMR results in more bits being allocated, while a low SMR causes less bits to be allocated for encoding.
- United States Patent No. 4,899,384 teaches table controlled bit allocation in a variable rate subband speech coder
- United States Patent No. 5,185,800 discloses a bit allocation device for transformed digital audio signals with adaptive quantization based on psychoauditive criterion..
- PCM samples also are processed utilizing subband analysis, as illustrated in block 208 .
- Subband analysis involves producing subbands for encoding.
- the subbands may be selected by the user or specified by an encoding standard, such as MPEG.
- the subbands may be produced from the PCM samples by filtering the PCM samples with cosine modulated filters to produce the desired subbands. Each filter is employed to separate a subband from the PCM samples.
- a number of different filters may be utilized to select the desired subbands from the PCM samples, depending on the subbands desired or specified. Examples of various filter designs may be found in H. S.
- a scale factor is then determined and coded for each of the subbands separated from each of the PCM samples factor, as illustrated in block 210 .
- the absolute maximum of the 12 samples is taken as the scale factor. To prevent an infinite number of choices for the scale factor, only 64 values are used in Layer I and II. Hence the scale factor value that is higher and closest to this absolute maximum value is chosen and indicated to the decoder by an index. The decoder is assumed to know the value indexed.
- the scale factor requires bits for coding and is taken into account when bit allocation is performed in block 206 .
- Each subband value is divided by the scale factor value corresponding to the subband.
- the scaled subband samples are quantized by quantizers whose step sizes are determined by the SMA and SNR values.
- the bits resulting from the quantization process are packed to conform to the MPEG audio bit stream definitions in the case of MPEG or any other standard that is used.
- United States Patent No. 5,185,800 discloses a bit allocation device for transformed digital audio signals with adaptive quantization based on psychoauditive criterion. More information on quantizing and encoding also may be found in Ziemer et al, Signals and Systems: Continuous and Discrete, Macmillian Publishing Co. (2d ed. 1989).
- Layers I and II split the signal into 32 uniformly spaced subbands using a cosine modulated filter bank as specified in ISO-IEC/JTC1 SC29/WG11, Coding Of Moving Pictures And Associated Audio for Digital Storage Media At Up to About 1.5 Mbits/s - Part 3: Audio, DIS, 11172, April 1992.
- Layer III also uses 32 subbands in the initial stage but further splitting is performed within the subbands to obtain subband samples of finer frequency divisions.
- the 384 samples are grouped together in a frame and a new bit-allocation table is computed for each of these frames.
- the psychoacoustic models use a 512-point discrete Fourier transform (DFT) to compute the spectrum. For the permitted samples rates of 32, 44.1 and 48 kHz, this translates into the requirement of performing bit-allocation computation for each 12, 8.7 and 8 milliseconds. For Layer II, 1152 (3x384) samples are grouped together in a frame and a 1024-point DFT is used for spectral analysis.
- DFT discrete Fourier transform
- the computational requirement for computing the PM2 while Layer II is employed can be derived as 26,314 multiplies, 37,341 adds, 1024 compares, 1135 logarithms, 1201 table index operations, 859 divides, 768 square roots and 512 inverse tangents per 6 ms or approximately 170 times a second for a two-channel (stereo) audio. See ISO-IEC/JTC1 SC29/WG11, Comments On Audio CD And Analysis Of Audio Complexity, May 1991 for more information.
- a preferred embodiment of the present invention provides a process for bit allocation that can be computationally 70 times more efficient than the PM2 for Layer II, and about 60 times more efficient than PM1 for Layer I.
- the present invention is well suited for use with standard digital processor architectures.
- the present invention in the preferred embodiment, predicts SMR values based on the energy in a subband rather than by spectral analysis as depicted in Figure 1 .
- the subbands obtained from subband analysis are utilized to predict the SMR value utilized in bit allocation.
- the subband energy is employed in accordance with a preferred embodiment of the present invention.
- the prediction of the SMR value is accomplished by utilizing a matrix of prediction coefficients indexed by subbands.
- the prediction coefficients are found by utilizing actual psychoacoustic models, such as PM1 and PM2. Details of the methodology used in accordance with a preferred embodiment of the present invention are presented in the following sections. Utilizing this approach, dynamic bit allocation schemes for any subband codes may be developed in accordance with a preferred embodiment of the present invention.
- PM1 and PM2 can be found in ISO-IEC/JTC1 SC29/WG11, Coding Of Moving Pictures And Associated Audio for Digital Storage Media At Up to About 1.5 Mbits/s - Part 3: Audio, DIS, 11172, April 1992.
- PM1 first the DFT is performed to obtain the power density spectrum of the signal. From the power spectrum, tonal and non-tonal components of the signal are computed since it is well known that these components have different masking characteristics. These masking characteristics can cross the boundaries (or cut-off frequencies) of the subbands. The global masking thresholds at various frequency points are then computed. Minima of these values within each subband represent the SMR.
- PM2 requires more complex operations using both magnitude and phase of the DFT and is detailed in ISO-IEC/JTC1 SC29/WG11, Coding Of Moving Pictures And Associated Audio for Digital Storage Media At Up to About 1.5 Mbits/s - Part 3: Audio, DIS, 11172, April 1992
- the subband samples represent the temporal information within their respective bandwidths. Assuming that each subband provides perfect bandpass characteristics, the summation of the square of each subband value within a subband reflects the energy in that frequency band by the application of Parseval's Theorem as described in A.V. Oppenheim and R.W. Schafer, Digital Processing of Signals, Englewood Cliffs, NJ: Prentice Hall, 1979.
- the analysis filter bank that provides subband decomposition has been designed using a prototype filter that provides more than 96 dB attenuation in the stop band. See K. Brandenberg and G. Stoll, "The ISO/MPEG-Audio codec: A generic standard for coding of high quality digital audio," Proc. of the 92nd Convention of the Audio Engineering Society, Vienna, March 1992 for more information.
- the problem of finding a linear model translates simply into estimating a matrix of dimension 32 by 33 to map the energy values into an array of SMR values.
- the initial step is to obtain data for modelling. Once the data is obtained, finding the best model that fits the data is the next step in the process. First, the mechanism for collecting the data will be examined. Next, the appropriate input and output data sets will be selected. Then, the linear hypothesis will be tested to support the arguments for a linear model. Finally, actual estimation of the matrix will be conducted in accordance with a preferred embodiment of the present invention.
- the data collection procedure requires that a good psychoacoustic model be used to obtain sample SMR values.
- Software has been used to obtain SMR values via the two psychoacoustic models described in ISO-IEC/JTC1 SC29/WG11, Coding of Moving Pictures And Associated Audio for Digital Storage Media At Up to About 1.5 Mbits/s - Part 3: Audio, DIS, 11172, April 1992.
- PM1 and PM2 both have been used in experiments.
- To obtain a set of data for the estimation problem a variety of music and speech signals are needed. A multitude of audio samples from classical and popular music, and some speech signals varying between 20 and 30 seconds of duration was captured in monoaural mode at 44.1 kHz sampling rate with 16 bit resolution per sample using the IBM Audio Capture and Playback Adapter (ACPA).
- ACPA IBM Audio Capture and Playback Adapter
- a similar approach may be taken to capture data at 32 and 48 kHz samples techniques as well. If the samples are available digitally, there will be no need for using an audio capturing hardware. A table of time domain energy values in each subband and the corresponding SMR values from an established psychoacoustic model for several frames of audio may be produced using the simulation programs. If the data from all of the different musical samples were to be collected, one would end up with a prohibitively large data set. To circumvent this problem, a sampling technique was employed. A pseudo-random number generator with uniform distribution characteristics was utilized for sampling purposes. Let the random number that lies between 0 and 215 - 1 be denoted by w i .
- Data corresponding to Layer I and Layer II were obtained using PM1 and also PM2.
- the absolute values of the subband samples were considered instead of the square of the samples in accordance with a preferred embodiment of the present invention. This was done to minimize the computational or cycle requirements in programmable DSPs.
- the absolute values of the subband samples are referred to herein as "pseudo-energy" values. While a modelling for Layer 1, in each frame for each subband 12 absolute values of the samples were summed together to obtain the energy value in that subband. While using Layer II, 36 absolute values were summed to obtain the pseudo-energy values.
- a "frame" contains a number of adjacent audio samples. The aim is to obtain an estimate of ⁇ k,i such that the errors are small for the given data.
- the number of parameters to be estimated are 32 X 33.
- the additional 32 parameters come from the requirement to estimate a bias vector that correspond to x 33(.).
- a SMR values are determined for a random audio sample utilizing a psychoacoustic model, such as PM1 or PM2, as depicted in block 300 . Thereafter energy values for the subbands in the sample are determined, as illustrated in block 302 . Then a prediction coefficient is determined for each subband and data point, as depicted in block 304 . The prediction coefficients are ⁇ k,i as shown in equation (1). Then, a determination of whether more samples are present is made, as illustrated in block 306 . If no more samples are present, the process terminates. Otherwise, the process returns to block 300 to process another audio sample in accordance with a preferred embodiment of the present invention.
- a psychoacoustic model such as PM1 or PM2
- MLE maximum likelihood estimate
- Table 1 Variance Ratio Table for Hypothesis Testing Subband F o F ⁇ ,32,N-33 1 7.66 1.9 7 22.34 1.9 11 55.68 1.9 19 134.06 1.9 24 217.36 1.9 31 159.79 1.9
- the results in Table 1 are typical for all the data that was gathered; Layer II, Layer I with various combinations of PM1 and PM2. Thus, it is very clear that the null hypothesis should be rejected and estimation should proceed.
- equation (2) provides the Best Linear Unbiased Estimate of b k under the normality assumptions.
- outliers the data points at which the errors are considerably larger, and the lack of knowledge about the distribution of the errors
- estimators may have to be utilized.
- a typical result of using least squares estimate of b k by plotting the errors for sample frames is illustrated in Figure 3 .
- Figure 3 it is indeed clear that elimination of certain points can very well contribute to better estimation of b k .
- a technique known as robust estimation has been deemed as an appropriate alternative to least squares technique in the presence of outliers.
- the appropriate selection of k from the above estimations is based on two tests, a subjective and an objective one in accordance with a preferred embodiment of the present invention.
- the music quality is subjectively evaluated against both the original and compressed/decompressed music pieces that were obtained by using either PM1 or PM2.
- the bit-allocation deviations from a corresponding MPEG implementation using either PM1 or PM2 are employed.
- the deviations are computed for sampled frames and the average deviation per frame is taken as an indication of the amount of digression from an implementation that uses the recommended psychoacoustic models.
- PCM Pulse code modulation
- Table 2 provides a comparison between PM2 and a preferred embodiment of the present invention.
- Subband No. No. Bits PM2 No. Bits Proposed Deviation Average bits/frame 1 23161 23035 0.242090 2 16361 16970 0.136139 3 13483 13782 0.123077 4 13853 13659 0.211030 5 12425 12550 0.149492 6 13240 13000 0.197097 7 12078 11998 0.169811 8 11769 11503 0.195356 9 11132 10725 0.161393 10 10755 10521 0.109434 11 10558 10451 0.079245 12 9999 10193 0.057765 13 9053 9198 0.109724 14 7168 7159 0.102467 15 5866 6076 0.123948 16 4110 4193 0.137881 17 1740 1559 0.182003 18 54 43 0.028157
- FIG. 5 a high level flowchart of a process for compressing PCM samples is depicted in accordance with a preferred embodiment of the present invention.
- Subband analysis is performed on the PCM samples to produce the desired subbands for each samples.
- Each subband may be produced by filtering the sample utilizing known filtering systems in accordance with a preferred embodiment of the present invention.
- SMR is predicted for a subband utilizing a model of relationships between energy values within each of the subbands and utilizing SMR values based on a predetermined psychoacoustic model, as illustrated in block 402 .
- the predicted SMR is employed to determine a bit allocation for the sample, as depicted in block 404 .
- a desired bit-rate also is considered in bit allocation, as illustrated in block 406 .
- Scale factor coding is performed for each of the subbands in a PCM sample, as depicted in block 408 .
- Quantization and bit packing is performed, as illustrated in block 410 , utilizing the bit-allocation and scale factor from blocks 404 and 408 .
- the need for spectral analysis of PCM samples being compressed is eliminated.
- FIG. 6 a flowchart of a process for predicting SMR values in block 402 of Figure 5 is illustrated in accordance with a preferred embodiment of the present invention.
- L is 12 for Layer I and 36 for Layer II under MPEG standards.
- the process of the present invention may be "tuned" for a specific type of music. For example, if a user sending an audio signal for classic music desires to encode only the classical violin, samples from a classic violin source may be collected in estimating ⁇ i,j 's. The estimated ⁇ i,j 's will be better suited for classical violin. Furthermore, a user may obtain several sets of ⁇ i,j 's corresponding to different types of music and one set may be selected by the user appropriately.
- squared-energy values for subband samples S i,l may be employed instead of absolute values in equation (6):
- the constant 'C' can be selected using empirical observations. By attempting to equate X's to normalized sound pressure levels, C can be set to about 82.53 dB.
- the determination of prediction coefficients, ⁇ i,j , in equation (1) also will replace pseudo-energy values with squared-energy values in accordance with a preferred embodiment of the present invention.
- data processing system 10 includes a system 12 , a video display terminal 14 , a keyboard 16 , and a mouse 18 .
- Data processing system 10 may be implemented utilizing any suitable computer, such as an IBM PS/2 or IBM RISC SYSTEM/6000 computer, both products of International Business Machines Corporation, located in Armonk, New York. "PS/2" and "RISC SYSTEM/6000" are trademarks of International Business Machines Corporation.
- PS/2 and "RISC SYSTEM/6000” are trademarks of International Business Machines Corporation.
- the depicted embodiment is a personal computer, a preferred embodiment of the present invention may be implemented in other types of data processing systems, such as, for example, intelligent workstations, mini computers, local area networks, or special purpose multimedia devices using standard digital signal processors.
- System bus 11 provides a connection between various components within data processing system 10 .
- Central processing unit (CPU) 22 provides the decision making capability in data processing system 10 .
- CPU 12 may include one or more processors, such as an 80486 processor or a Pentium processor available from Intel Corporation in Santa Clara, California. "Pentium” is a trademark of Intel Corporation. Other processors that may be used include Power PC available from IBM/Motorola or Alpha AXP processors from Digital Equipment.
- Memory 24 provides a storage for data processing system 10 and may include both read only memory (ROM) and random access memory (RAM).
- Direct access storage device (DASD) 26 provides additional storage for data processing system 10 .
- DASD 26 typically provides long term storage for data processing system 10 .
- DASD 26 may include, for example, a hard disk drive or a floppy disk drive.
- an audio capture and playback adapter (ACPA) 25 may be employed to obtain audio samples.
- ACPA audio capture and playback adapter
- an IBM Audio Capture and Playback Adapter available from International Business Machines Corporation, may be utilized.
- Popular Sound Blaster and other sound cords may also be utilized. if audio data can be directly read from the CD or DAT, these sources also may be utilized.
- Communications unit 28 provides the interface between the data processing system 10 and some other data processing system such as another personal computer or a network.
- the digital audio signal processed by the present invention may originate from stored data in DASD 26 , or may be received at communications unit 28 , or from some other source of data that is connected to the data processing system, such as ACPA 25 .
- a preferred embodiment of the present invention may be implemented in an IBM RISC SYSTEM/6000 computer, which is a product of International Business Machines Corporation, located in Armonk, New York. "RISC SYSTEM/6000" is a trademark of International Business Machines Corporation.
- the processes of the present invention may be implemented within the data processing system depicted in Figures 7 and 8 or in hardware.
- the present invention allows a simpler implementation than the process depicted in Figure 1 .
- the present invention also may be utilized with psychoacoustic models other than those specified by MPEG.
- the operation can be performed in one instruction cycle. Furthermore, in a fixed-point DSP, the truncation can be made to the result in the accumulator at the end of all addition, thus preventing round-off errors after each accumulation.
- process of the present invention is faster because the number of instruction cycles required are much less than a process utilizing PM1 or PM2 in a standard DSP environment in accordance with a preferred embodiment of the present invention.
- the performance gains provided by the present invention provides a more efficient encoding process for data.
- the a preferred embodiment of the present invention may be implemented with a single DSP.
- subband coding employing a psychovisual weighting may be implemented in accordance with a preferred embodiment of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US175900 | 1988-03-31 | ||
US08/175,900 US5764698A (en) | 1993-12-30 | 1993-12-30 | Method and apparatus for efficient compression of high quality digital audio |
Publications (1)
Publication Number | Publication Date |
---|---|
EP0661826A2 true EP0661826A2 (fr) | 1995-07-05 |
Family
ID=22642132
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP94308601A Withdrawn EP0661826A2 (fr) | 1993-12-30 | 1994-11-22 | Codage perceptuel en sous-bandes dans laquelle le rapport signal/masquage est calculés à partir des signaux dans les sous-bandes |
Country Status (3)
Country | Link |
---|---|
US (1) | US5764698A (fr) |
EP (1) | EP0661826A2 (fr) |
JP (1) | JP2904472B2 (fr) |
Cited By (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1998037698A1 (fr) * | 1996-12-17 | 1998-08-27 | Adaptive Media Technologies | Procede et dispositif constituant un systeme de remise de supports conformables |
WO2004015689A1 (fr) * | 2002-08-08 | 2004-02-19 | Qualcomm Incorporated | Quantification adaptative a une largeur de bande |
WO2004112003A1 (fr) * | 2003-06-13 | 2004-12-23 | Vixs Systems Inc. | Systeme et methode pour traiter des trames audio |
US7054964B2 (en) | 2001-07-30 | 2006-05-30 | Vixs Systems, Inc. | Method and system for bit-based data access |
US7106715B1 (en) | 2001-11-16 | 2006-09-12 | Vixs Systems, Inc. | System for providing data to multiple devices and method thereof |
US7120253B2 (en) | 2002-05-02 | 2006-10-10 | Vixs Systems, Inc. | Method and system for protecting video data |
US7130350B1 (en) | 2003-02-28 | 2006-10-31 | Vixs Systems, Inc. | Method and system for encoding and decoding data in a video stream |
US7133452B1 (en) | 2003-02-24 | 2006-11-07 | Vixs Systems, Inc. | Method and system for transcoding video data |
US7139330B1 (en) | 2001-10-31 | 2006-11-21 | Vixs Systems, Inc. | System for signal mixing and method thereof |
US7165180B1 (en) | 2001-11-27 | 2007-01-16 | Vixs Systems, Inc. | Monolithic semiconductor device for preventing external access to an encryption key |
US7277101B2 (en) | 2003-09-29 | 2007-10-02 | Vixs Systems Inc | Method and system for scaling images |
US7310679B1 (en) | 2002-04-29 | 2007-12-18 | Vixs Systems Inc. | Method and system for transmitting video content while preventing other transmissions in a contention-based network |
US7327784B2 (en) | 2003-02-24 | 2008-02-05 | Vixs Systems, Inc. | Method and system for transcoding video data |
US7356079B2 (en) | 2001-11-21 | 2008-04-08 | Vixs Systems Inc. | Method and system for rate control during video transcoding |
US7400869B2 (en) | 2005-03-22 | 2008-07-15 | Vixs Systems Inc. | System and method for adaptive DC offset compensation in wireless transmissions |
US7403564B2 (en) | 2001-11-21 | 2008-07-22 | Vixs Systems, Inc. | System and method for multiple channel video transcoding |
US7406598B2 (en) | 2004-02-17 | 2008-07-29 | Vixs Systems Inc. | Method and system for secure content distribution |
US7408989B2 (en) | 2003-01-16 | 2008-08-05 | Vix5 Systems Inc | Method of video encoding using windows and system thereof |
US7421048B2 (en) | 2005-01-20 | 2008-09-02 | Vixs Systems, Inc. | System and method for multimedia delivery in a wireless environment |
US7596127B1 (en) | 2001-10-31 | 2009-09-29 | Vixs Systems, Inc. | System for allocating data in a communications system and method thereof |
US7602847B1 (en) | 2001-03-27 | 2009-10-13 | Vixs Systems, Inc. | Device and method for compression of a video stream |
US7606305B1 (en) | 2003-02-24 | 2009-10-20 | Vixs Systems, Inc. | Method and system for transcoding video data |
US7609766B2 (en) | 2005-02-08 | 2009-10-27 | Vixs Systems, Inc. | System of intra-picture complexity preprocessing |
US7668396B2 (en) | 2003-09-29 | 2010-02-23 | Vixs Systems, Inc. | Method and system for noise reduction in an image |
US7675972B1 (en) | 2001-07-30 | 2010-03-09 | Vixs Systems, Inc. | System and method for multiple channel video transcoding |
US7707485B2 (en) | 2005-09-28 | 2010-04-27 | Vixs Systems, Inc. | System and method for dynamic transrating based on content |
US8107524B2 (en) | 2001-03-30 | 2012-01-31 | Vixs Systems, Inc. | Adaptive bandwidth footprint matching for multiple compressed video streams in a fixed bandwidth network |
US8131995B2 (en) | 2006-01-24 | 2012-03-06 | Vixs Systems, Inc. | Processing feature revocation and reinvocation |
US8949920B2 (en) | 2005-03-17 | 2015-02-03 | Vixs Systems Inc. | System and method for storage device emulation in a multimedia processing system |
US8958459B2 (en) | 2005-01-31 | 2015-02-17 | St-Ericsson Sa | Method and apparatus for despread data in wireless communication system |
Families Citing this family (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB9606680D0 (en) * | 1996-03-29 | 1996-06-05 | Philips Electronics Nv | Compressed audio signal processing |
US5941936A (en) * | 1996-10-31 | 1999-08-24 | Taylor Group Of Companies, Inc. | One-bit run-length encoding and playback system |
US6278735B1 (en) * | 1998-03-19 | 2001-08-21 | International Business Machines Corporation | Real-time single pass variable bit rate control strategy and encoder |
US6161088A (en) * | 1998-06-26 | 2000-12-12 | Texas Instruments Incorporated | Method and system for encoding a digital audio signal |
US6421464B1 (en) * | 1998-12-16 | 2002-07-16 | Fastvdo Llc | Fast lapped image transforms using lifting steps |
US6240379B1 (en) * | 1998-12-24 | 2001-05-29 | Sony Corporation | System and method for preventing artifacts in an audio data encoder device |
EP1076297A1 (fr) * | 1999-08-09 | 2001-02-14 | Deutsche Thomson-Brandt Gmbh | Méthode de transformation de Fourier rapide pour signaux audio |
US6567781B1 (en) | 1999-12-30 | 2003-05-20 | Quikcat.Com, Inc. | Method and apparatus for compressing audio data using a dynamical system having a multi-state dynamical rule set and associated transform basis function |
US6745162B1 (en) * | 2000-06-22 | 2004-06-01 | Sony Corporation | System and method for bit allocation in an audio encoder |
JP2002014700A (ja) * | 2000-06-30 | 2002-01-18 | Canon Inc | 音声信号処理方法、装置および記憶媒体 |
US6826546B1 (en) * | 2000-08-17 | 2004-11-30 | Ideaflood, Inc. | Method and system for licensing a copy of a copyright protected work |
JP2002196792A (ja) * | 2000-12-25 | 2002-07-12 | Matsushita Electric Ind Co Ltd | 音声符号化方式、音声符号化方法およびそれを用いる音声符号化装置、記録媒体、ならびに音楽配信システム |
US6882976B1 (en) | 2001-02-28 | 2005-04-19 | Advanced Micro Devices, Inc. | Efficient finite length POW10 calculation for MPEG audio encoding |
DE10150519B4 (de) * | 2001-10-12 | 2014-01-09 | Hewlett-Packard Development Co., L.P. | Verfahren und Anordnung zur Sprachverarbeitung |
US7085675B2 (en) * | 2002-02-06 | 2006-08-01 | The University Of Chicago | Subband domain signal validation |
US7313520B2 (en) * | 2002-03-20 | 2007-12-25 | The Directv Group, Inc. | Adaptive variable bit rate audio compression encoding |
US20080075377A1 (en) * | 2003-07-29 | 2008-03-27 | Topiwala Pankaj N | Fast lapped image transforms using lifting steps |
KR100554680B1 (ko) * | 2003-08-20 | 2006-02-24 | 한국전자통신연구원 | 크기 변화에 강인한 양자화 기반 오디오 워터마킹 장치 및방법 |
KR100571824B1 (ko) * | 2003-11-26 | 2006-04-17 | 삼성전자주식회사 | 부가정보 삽입된 mpeg-4 오디오 bsac부호화/복호화 방법 및 장치 |
US7720145B2 (en) * | 2004-05-13 | 2010-05-18 | Ittiam Systems (P) Ltd. | Model based bit rate control for a macroblock encoder |
KR100695125B1 (ko) * | 2004-05-28 | 2007-03-14 | 삼성전자주식회사 | 디지털 신호 부호화/복호화 방법 및 장치 |
KR100634506B1 (ko) * | 2004-06-25 | 2006-10-16 | 삼성전자주식회사 | 저비트율 부호화/복호화 방법 및 장치 |
BRPI0518133A (pt) * | 2004-10-13 | 2008-10-28 | Matsushita Electric Ind Co Ltd | codificador escalável, decodificador escalável, e método de codificação escalável |
DE102004059979B4 (de) | 2004-12-13 | 2007-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zur Berechnung einer Signalenergie eines Informationssignals |
US7231974B2 (en) * | 2005-04-08 | 2007-06-19 | Chevron U.S.A. | Self-leaving in-situ device and method for passively removing oil from water wells |
JP4822507B2 (ja) * | 2005-10-27 | 2011-11-24 | 株式会社メガチップス | 画像処理装置および画像処理装置に接続される装置 |
US7720300B1 (en) * | 2006-12-05 | 2010-05-18 | Calister Technologies | System and method for effectively performing an adaptive quantization procedure |
US8446976B2 (en) * | 2007-09-21 | 2013-05-21 | Qualcomm Incorporated | Signal generator with adjustable phase |
US8385474B2 (en) * | 2007-09-21 | 2013-02-26 | Qualcomm Incorporated | Signal generator with adjustable frequency |
US7965805B2 (en) | 2007-09-21 | 2011-06-21 | Qualcomm Incorporated | Signal generator with signal tracking |
GB2454208A (en) * | 2007-10-31 | 2009-05-06 | Cambridge Silicon Radio Ltd | Compression using a perceptual model and a signal-to-mask ratio (SMR) parameter tuned based on target bitrate and previously encoded data |
US8831936B2 (en) * | 2008-05-29 | 2014-09-09 | Qualcomm Incorporated | Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement |
US8538749B2 (en) * | 2008-07-18 | 2013-09-17 | Qualcomm Incorporated | Systems, methods, apparatus, and computer program products for enhanced intelligibility |
US9202456B2 (en) * | 2009-04-23 | 2015-12-01 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for automatic control of active noise cancellation |
JP5539992B2 (ja) * | 2009-08-20 | 2014-07-02 | トムソン ライセンシング | レート制御装置、レート制御方法及びレート制御プログラム |
US8788277B2 (en) * | 2009-09-11 | 2014-07-22 | The Trustees Of Columbia University In The City Of New York | Apparatus and methods for processing a signal using a fixed-point operation |
US9053697B2 (en) | 2010-06-01 | 2015-06-09 | Qualcomm Incorporated | Systems, methods, devices, apparatus, and computer program products for audio equalization |
US11545159B1 (en) | 2021-06-10 | 2023-01-03 | Nice Ltd. | Computerized monitoring of digital audio signals |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4631746A (en) * | 1983-02-14 | 1986-12-23 | Wang Laboratories, Inc. | Compression and expansion of digitized voice signals |
US4688246A (en) * | 1985-12-20 | 1987-08-18 | Zenith Electronics Corporation | CATV scrambling system with compressed digital audio in synchronizing signal intervals |
US4899384A (en) * | 1986-08-25 | 1990-02-06 | Ibm Corporation | Table controlled dynamic bit allocation in a variable rate sub-band speech coder |
US4969192A (en) * | 1987-04-06 | 1990-11-06 | Voicecraft, Inc. | Vector adaptive predictive coder for speech and audio |
EP0400222A1 (fr) * | 1989-06-02 | 1990-12-05 | ETAT FRANCAIS représenté par le Ministère des Postes, des Télécommunications et de l'Espace | Système de transmission numérique utilisant le codage par sous-bandes d'un signal numérique |
US5185800A (en) * | 1989-10-13 | 1993-02-09 | Centre National D'etudes Des Telecommunications | Bit allocation device for transformed digital audio broadcasting signals with adaptive quantization based on psychoauditive criterion |
CA2032765C (fr) * | 1989-12-21 | 1995-12-12 | Hidetaka Yoshikawa | Appareil de communication a vitesse de codage variable |
JP2906646B2 (ja) * | 1990-11-09 | 1999-06-21 | 松下電器産業株式会社 | 音声帯域分割符号化装置 |
US5150401A (en) * | 1990-12-04 | 1992-09-22 | Chips International, Inc. | Retrofittable encryption/decryption apparatus using modified frequency modulation |
EP0506394A2 (fr) * | 1991-03-29 | 1992-09-30 | Sony Corporation | Dispositif pour le codage de signaux digitaux |
BR9204799A (pt) * | 1991-03-29 | 1993-07-13 | Sony Corp | Processo de codificacao para um sinal digital |
US5231484A (en) * | 1991-11-08 | 1993-07-27 | International Business Machines Corporation | Motion video compression system with adaptive bit allocation and quantization |
US5315670A (en) * | 1991-11-12 | 1994-05-24 | General Electric Company | Digital data compression system including zerotree coefficient coding |
JP2976701B2 (ja) * | 1992-06-24 | 1999-11-10 | 日本電気株式会社 | 量子化ビット数割当方法 |
JP3508146B2 (ja) * | 1992-09-11 | 2004-03-22 | ソニー株式会社 | ディジタル信号符号化復号化装置、ディジタル信号符号化装置及びディジタル信号復号化装置 |
JPH0750589A (ja) * | 1993-08-04 | 1995-02-21 | Sanyo Electric Co Ltd | サブバンド符号化装置 |
-
1993
- 1993-12-30 US US08/175,900 patent/US5764698A/en not_active Expired - Lifetime
-
1994
- 1994-11-22 EP EP94308601A patent/EP0661826A2/fr not_active Withdrawn
- 1994-11-28 JP JP6292810A patent/JP2904472B2/ja not_active Expired - Lifetime
Cited By (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5953506A (en) * | 1996-12-17 | 1999-09-14 | Adaptive Media Technologies | Method and apparatus that provides a scalable media delivery system |
WO1998037698A1 (fr) * | 1996-12-17 | 1998-08-27 | Adaptive Media Technologies | Procede et dispositif constituant un systeme de remise de supports conformables |
US7602847B1 (en) | 2001-03-27 | 2009-10-13 | Vixs Systems, Inc. | Device and method for compression of a video stream |
US9826259B2 (en) | 2001-03-30 | 2017-11-21 | Vixs Systems Inc. | Managed degradation of a video stream |
US8107524B2 (en) | 2001-03-30 | 2012-01-31 | Vixs Systems, Inc. | Adaptive bandwidth footprint matching for multiple compressed video streams in a fixed bandwidth network |
US7054964B2 (en) | 2001-07-30 | 2006-05-30 | Vixs Systems, Inc. | Method and system for bit-based data access |
US7675972B1 (en) | 2001-07-30 | 2010-03-09 | Vixs Systems, Inc. | System and method for multiple channel video transcoding |
US7139330B1 (en) | 2001-10-31 | 2006-11-21 | Vixs Systems, Inc. | System for signal mixing and method thereof |
US7596127B1 (en) | 2001-10-31 | 2009-09-29 | Vixs Systems, Inc. | System for allocating data in a communications system and method thereof |
US7106715B1 (en) | 2001-11-16 | 2006-09-12 | Vixs Systems, Inc. | System for providing data to multiple devices and method thereof |
US10129552B2 (en) | 2001-11-21 | 2018-11-13 | Vixs Systems Inc. | Method and system for rate control during video transcoding |
US7356079B2 (en) | 2001-11-21 | 2008-04-08 | Vixs Systems Inc. | Method and system for rate control during video transcoding |
US7403564B2 (en) | 2001-11-21 | 2008-07-22 | Vixs Systems, Inc. | System and method for multiple channel video transcoding |
US7165180B1 (en) | 2001-11-27 | 2007-01-16 | Vixs Systems, Inc. | Monolithic semiconductor device for preventing external access to an encryption key |
US7310679B1 (en) | 2002-04-29 | 2007-12-18 | Vixs Systems Inc. | Method and system for transmitting video content while preventing other transmissions in a contention-based network |
US7120253B2 (en) | 2002-05-02 | 2006-10-10 | Vixs Systems, Inc. | Method and system for protecting video data |
US8090577B2 (en) | 2002-08-08 | 2012-01-03 | Qualcomm Incorported | Bandwidth-adaptive quantization |
WO2004015689A1 (fr) * | 2002-08-08 | 2004-02-19 | Qualcomm Incorporated | Quantification adaptative a une largeur de bande |
US7408989B2 (en) | 2003-01-16 | 2008-08-05 | Vix5 Systems Inc | Method of video encoding using windows and system thereof |
US7133452B1 (en) | 2003-02-24 | 2006-11-07 | Vixs Systems, Inc. | Method and system for transcoding video data |
US7606305B1 (en) | 2003-02-24 | 2009-10-20 | Vixs Systems, Inc. | Method and system for transcoding video data |
US7327784B2 (en) | 2003-02-24 | 2008-02-05 | Vixs Systems, Inc. | Method and system for transcoding video data |
US7130350B1 (en) | 2003-02-28 | 2006-10-31 | Vixs Systems, Inc. | Method and system for encoding and decoding data in a video stream |
WO2004112003A1 (fr) * | 2003-06-13 | 2004-12-23 | Vixs Systems Inc. | Systeme et methode pour traiter des trames audio |
US7739105B2 (en) | 2003-06-13 | 2010-06-15 | Vixs Systems, Inc. | System and method for processing audio frames |
US7668396B2 (en) | 2003-09-29 | 2010-02-23 | Vixs Systems, Inc. | Method and system for noise reduction in an image |
US7277101B2 (en) | 2003-09-29 | 2007-10-02 | Vixs Systems Inc | Method and system for scaling images |
US7406598B2 (en) | 2004-02-17 | 2008-07-29 | Vixs Systems Inc. | Method and system for secure content distribution |
US7421048B2 (en) | 2005-01-20 | 2008-09-02 | Vixs Systems, Inc. | System and method for multimedia delivery in a wireless environment |
US8958459B2 (en) | 2005-01-31 | 2015-02-17 | St-Ericsson Sa | Method and apparatus for despread data in wireless communication system |
US7609766B2 (en) | 2005-02-08 | 2009-10-27 | Vixs Systems, Inc. | System of intra-picture complexity preprocessing |
US8949920B2 (en) | 2005-03-17 | 2015-02-03 | Vixs Systems Inc. | System and method for storage device emulation in a multimedia processing system |
US7400869B2 (en) | 2005-03-22 | 2008-07-15 | Vixs Systems Inc. | System and method for adaptive DC offset compensation in wireless transmissions |
US7707485B2 (en) | 2005-09-28 | 2010-04-27 | Vixs Systems, Inc. | System and method for dynamic transrating based on content |
US8131995B2 (en) | 2006-01-24 | 2012-03-06 | Vixs Systems, Inc. | Processing feature revocation and reinvocation |
Also Published As
Publication number | Publication date |
---|---|
US5764698A (en) | 1998-06-09 |
JP2904472B2 (ja) | 1999-06-14 |
JPH07210195A (ja) | 1995-08-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US5764698A (en) | Method and apparatus for efficient compression of high quality digital audio | |
US5535300A (en) | Perceptual coding of audio signals using entropy coding and/or multiple power spectra | |
CA2027136C (fr) | Codage perceptif des signaux audio | |
US8615391B2 (en) | Method and apparatus to extract important spectral component from audio signal and low bit-rate audio signal coding and/or decoding method and apparatus using the same | |
Hans et al. | Lossless compression of digital audio | |
FI84538C (fi) | Foerfarande foer transmission av digitaliska audiosignaler. | |
US5845243A (en) | Method and apparatus for wavelet based data compression having adaptive bit rate control for compression of audio information | |
US7333929B1 (en) | Modular scalable compressed audio data stream | |
KR100397690B1 (ko) | 데이터부호화장치및그방법 | |
US5758315A (en) | Encoding/decoding method and apparatus using bit allocation as a function of scale factor | |
EP0661827A2 (fr) | Filtrage en sous-bandes utilisant une transformation discrète inverse de cosinus | |
Musmann | The ISO audio coding standard | |
US8149927B2 (en) | Method of and apparatus for encoding/decoding digital signal using linear quantization by sections | |
JPH09134200A (ja) | ディジタル・オーディオ符号化方法及びその装置 | |
EP0376553B1 (fr) | Codage de signaux audio tenant compte de la perception | |
US6161088A (en) | Method and system for encoding a digital audio signal | |
JP3465341B2 (ja) | オーディオ信号符号化方法 | |
JP3146121B2 (ja) | 符号化復号化装置 | |
Teh et al. | Subband coding of high-fidelity quality audio signals at 128 kbps | |
KR0144297B1 (ko) | 적응적 디지탈 오디오 부호화 장치 | |
KR100204471B1 (ko) | 디지탈 오디오 부호화기의 비트 할당 장치 | |
CA2467466A1 (fr) | Systeme et methode de compression et de reconstitution des fichiers sonores | |
Teh et al. | A neural network based perceptual audio coder | |
Bhaskar | Low rate coding of audio by a predictive transform coder for efficient satellite transmission | |
Kandadai | Perceptual Audio Coding That Scales to Low Bitrates |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE FR GB |
|
17P | Request for examination filed |
Effective date: 19951024 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
|
18W | Application withdrawn |
Withdrawal date: 19971107 |