WO1993019536A1 - Verfahren zur datenreduktioin bei der speicherung und/oder übertragung digitaler audiosignale für studioanwendungen mit perceptiver codierung und code variabler länge - Google Patents
Verfahren zur datenreduktioin bei der speicherung und/oder übertragung digitaler audiosignale für studioanwendungen mit perceptiver codierung und code variabler länge Download PDFInfo
- Publication number
- WO1993019536A1 WO1993019536A1 PCT/DE1993/000177 DE9300177W WO9319536A1 WO 1993019536 A1 WO1993019536 A1 WO 1993019536A1 DE 9300177 W DE9300177 W DE 9300177W WO 9319536 A1 WO9319536 A1 WO 9319536A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- audio signals
- coding
- khz
- digital audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H20/00—Arrangements for broadcast or for distribution combined with broadcast
- H04H20/86—Arrangements characterised by the broadcast information itself
- H04H20/95—Arrangements characterised by the broadcast information itself characterised by a specific format, e.g. an encoded audio stream
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/42—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
Definitions
- the invention relates to a method for data reduction in the storage and / or transmission of digital audio signals according to the preamble of claim 1.
- Such data reduction methods are used when interference-free, high-quality sound is to be achieved when storing and / or transmitting audio signals, but the transmission or memory bandwidth is not as high as, for example, a compact disc (CD).
- CD compact disc
- Examples are the planned digital terrestrial broadcasting, the sound channel of digital video recorders or tape recorders with a stationary sound head and many more.
- a method according to the preamble has become known, for example, from DE-OS 3912605.
- the quantized spectral values are encoded with an optimal encoder, in which the code word used for encoding is shorter the more frequently the spectra coefficient occurs.
- the code is taken from a table, the length of which corresponds to the number of code Words corresponds to (Huffman code).
- the object of the invention is to develop a method for reducing the data according to the preamble such that it is suitable for professional applications in studios.
- the audio signal is mapped from the time range into the frequency range with a filter bank with perfect reconstruction. This means that without quantization and coding, the signal transmitted back into the time domain completely matches the input signal.
- a block section under consideration is coded with such a small number of bits that the noise caused by the quantization is still below the monitoring threshold, that is to say is covered by the music signal.
- the initially concealed quantization noise can become an audible disturbing noise.
- the invention at most half of the psychoacoustic concealment effect, compared to the aforementioned ASPEC method, is used in the quantization and coding. As a result, a distance between the concealed and concealing signal is achieved, which enables postprocessing of the music signal.
- the coding is carried out with a variable word length, a cascaded, multidimensional Huffman code being used as the code.
- the Huffman coding is thus carried out differently than with known methods (eg ASPEC).
- known methods predominantly have low values due to the high data reduction which have to be encoded.
- Code values are assigned to these values using a Huffman code table.
- the coding of very large quantized values, which are not included in the table, takes place with the help of an identifier, which initiates a further coding step.
- the coding is carried out using different Huffman code tables.
- the code word is assigned in accordance with this area by means of the suitable Huffman code table.
- the portion exceeding the threshold is coded scalarly. With small values, the coding is done in a known manner.
- the curve of the monitoring thresholds (English “spreading function") is used to calculate the masking effects.
- a curve with at least twice the slope is selected according to claim 2.
- the tendrils of the curve of the listening thresholds are preferably chosen to be so steep that only a slight concealment is assumed. With this coding, the quantization noise does not reach an audible range due to post-processing such as filtering for matching the signal.
- the amplitude of the noise resulting from the quantization is at least 14 dB below the masking audio signal. This ensures that there is no audible deterioration of the music signal even when the method is used several times.
- a level is assumed for the basic hearing threshold which is at least 10 dB below the amplitude resolution ("least significant bit").
- a modified discrete cosine transformation is used as the ferbank, which is used according to claim 6 with a block length of 256 or 512 samples. At a sampling frequency of 48 kHz, this results in block lengths of 5.33 and 10.67 msec.
- the MDCT is a transformation with perfect reconstruction, that is to say that the output signal of the synthesis filter bank is exactly the same as the input signal of the analysis filter bank if no quantization is carried out.
- the option to use a block length of 256 samples enables the time resolution to be increased while the coding efficiency is reduced.
- the analysis window contains 512 or 1024 samples and is therefore 10.67 or 21.33 msec long at a sampling rate of 48 kHz.
- the method also supports a sampling frequency of 96 kHz. At this sampling frequency, frequencies far above human perception can be transmitted with the method. This is sometimes called a requirement for studio applications.
- the encoded signal is then post-processed, for example hidden or shown.
- the coded signal can be entered into a computer and processed directly. With this method, great savings can be achieved in digital process processing and memory allocation. Since many music channels are usually mixed in a recording studio, up to 48 channels can be processed in parallel in a further development of the invention.
- An important advantage of the invention is that a digital music signal processed by the method is suitable for professional applications in recording studios.
- the signal can be mixed and faded, it can be played at very different volumes, it can be encoded and decoded several times in succession without a deterioration of the sound signal being perceptible.
- the signal can be provided with reverberation or other effects. All of the measures mentioned can be applied to individual channels which are combined to form an overall signal.
- the method according to the invention achieves a significantly greater reduction.
- a digital audio signal is mapped by a modified discrete cosine transformation from the time domain to the frequency domain in order to split the input signal into undersampled spectral components lay.
- the sampling frequency is 48 kHz
- the block length is 516 samples or 10.67 msec.
- the window width is 1024 samples.
- the output signals from this filter bank are used to calculate estimates of the respective signal-dependent monitoring thresholds.
- regularities known from psychoacoustics are used.
- a modified acoustic model is used to determine the minimum listening threshold.
- Model 2 of the encoder of the ISO / MPEG standard 11172-3 International Standardization Organization / Moving Pictures Experts Group
- the slope of the edges of the curve of the listening threshold is increased so much that only a minimal masking is taken into account.
- the use of a curve of the listening threshold with an incline of the flanks that has at least twice the slope of the curve of model 2 of the ISO / MPEG standard has proven to be particularly advantageous.
- the distance between the masking signal and the masked noise is 20 dB instead of 6-9 dB for noise-like signals.
- the signal is quantized using a linear quantizer because this enables an optimal ratio of masking signal and masked noise.
- the quantized spectral values are encoded using Huffman encoding. Since much more large values are to be encoded than, for example, in the known ASPEC method, the code tables are organized differently. Large values are encoded by determining the range in which the value falls. The corresponding area is then selected from various Huffman code tables. If the quantized values exceed a certain threshold, scalar values are coded instead of value pairs by the table.
- the coded values and the page information are combined to form an output bit stream. Since the psychoacoustic model is only used in the encoder and not in the decoder, the model can be changed without changing the definition of the bit stream.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
- Circuits Of Receivers In General (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Stereo-Broadcasting Methods (AREA)
Priority Applications (8)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP5516152A JPH07508375A (ja) | 1992-03-23 | 1993-02-25 | スタジオ用デジタルオーディオ信号の記憶及び又は通信時のデータ整理方法 |
| AU36254/93A AU670068B2 (en) | 1992-03-23 | 1993-02-25 | Data compression process during storage and/or transmission of digital audio signals for studio applications with perceptive coding and variable length code |
| CA2131806A CA2131806A1 (en) | 1992-03-23 | 1993-02-25 | Data compression process during storage and/or transmission of digital audio signals for studio applications with perceptive coding and variable length code |
| DE59310119T DE59310119D1 (de) | 1992-03-23 | 1993-02-25 | Verfahren zur datenreduktion bei der speicherung und/oder übertragung digitaler audiosignale für studio-anwendungen |
| EP93905148A EP0632940B1 (de) | 1992-03-23 | 1993-02-25 | Verfahren zur datenreduktion bei der speicherung und/oder übertragung digitaler audiosignale für studio-anwendungen |
| AT93905148T ATE197105T1 (de) | 1992-03-23 | 1993-02-25 | Verfahren zur datenreduktion bei der speicherung und/oder übertragung digitaler audiosignale für studio-anwendungen |
| NO943520A NO943520D0 (no) | 1992-03-23 | 1994-09-22 | "Datareduksjon" ved lagring/overföring av digitale lydsignaler for studiobruk |
| KR1019940703290A KR950701161A (ko) | 1992-03-23 | 1994-09-22 | 가변장코드와 감지가능코딩으로 스튜디오용 디지탈 오디오 신호의 전송 및/또는 저장에 있어서의 데이타 감축 방법 |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE4209382A DE4209382C1 (enExample) | 1992-03-23 | 1992-03-23 | |
| DEP4209382.1 | 1992-03-23 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO1993019536A1 true WO1993019536A1 (de) | 1993-09-30 |
Family
ID=6454791
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/DE1993/000177 Ceased WO1993019536A1 (de) | 1992-03-23 | 1993-02-25 | Verfahren zur datenreduktioin bei der speicherung und/oder übertragung digitaler audiosignale für studioanwendungen mit perceptiver codierung und code variabler länge |
Country Status (9)
| Country | Link |
|---|---|
| EP (1) | EP0632940B1 (enExample) |
| JP (1) | JPH07508375A (enExample) |
| KR (1) | KR950701161A (enExample) |
| AT (1) | ATE197105T1 (enExample) |
| AU (1) | AU670068B2 (enExample) |
| CA (1) | CA2131806A1 (enExample) |
| DE (2) | DE4209382C1 (enExample) |
| NO (1) | NO943520D0 (enExample) |
| WO (1) | WO1993019536A1 (enExample) |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1998042090A1 (de) * | 1997-03-14 | 1998-09-24 | Siemens Aktiengesellschaft | Verfahren zur informationsübermittlung an einen mobilen empfänger, wie einen pager, unter verwendung von rundfunksendern |
| DE10024959B4 (de) | 2000-05-22 | 2014-08-21 | Endress + Hauser Gmbh + Co. Kg | Vorrichtung zum unidirektionalen oder bidirektionalen Austausch von Daten |
| DE10119980C1 (de) * | 2001-04-24 | 2002-11-07 | Bosch Gmbh Robert | Verfahren zur Codierung von Audiodaten |
| DE10240135B4 (de) | 2002-08-30 | 2006-10-26 | Infineon Technologies Ag | Verfahren und Vorrichtung zur digitalen Filterung interpolierter Werte |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4546342A (en) * | 1983-12-14 | 1985-10-08 | Digital Recording Research Limited Partnership | Data compression method and apparatus |
| US4656500A (en) * | 1983-04-27 | 1987-04-07 | Fuji Photo Film Co., Ltd. | Adaptive type compression method for compressing a color image by imparting predetermined variable-length codes to combinations of quantized values based upon quantized prediction error components |
| DE3930760A1 (de) * | 1989-09-14 | 1991-03-28 | Bosch Gmbh Robert | Transformationscodierungssystem |
| US5028995A (en) * | 1987-10-28 | 1991-07-02 | Hitachi, Ltd. | Picture signal processor, picture signal coder and picture signal interpolator |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE3943880B4 (de) * | 1989-04-17 | 2008-07-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Digitales Codierverfahren |
-
1992
- 1992-03-23 DE DE4209382A patent/DE4209382C1/de not_active Expired - Lifetime
-
1993
- 1993-02-25 CA CA2131806A patent/CA2131806A1/en not_active Abandoned
- 1993-02-25 JP JP5516152A patent/JPH07508375A/ja active Pending
- 1993-02-25 AT AT93905148T patent/ATE197105T1/de active
- 1993-02-25 WO PCT/DE1993/000177 patent/WO1993019536A1/de not_active Ceased
- 1993-02-25 EP EP93905148A patent/EP0632940B1/de not_active Expired - Lifetime
- 1993-02-25 DE DE59310119T patent/DE59310119D1/de not_active Expired - Lifetime
- 1993-02-25 AU AU36254/93A patent/AU670068B2/en not_active Expired
-
1994
- 1994-09-22 KR KR1019940703290A patent/KR950701161A/ko not_active Ceased
- 1994-09-22 NO NO943520A patent/NO943520D0/no unknown
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4656500A (en) * | 1983-04-27 | 1987-04-07 | Fuji Photo Film Co., Ltd. | Adaptive type compression method for compressing a color image by imparting predetermined variable-length codes to combinations of quantized values based upon quantized prediction error components |
| US4546342A (en) * | 1983-12-14 | 1985-10-08 | Digital Recording Research Limited Partnership | Data compression method and apparatus |
| US5028995A (en) * | 1987-10-28 | 1991-07-02 | Hitachi, Ltd. | Picture signal processor, picture signal coder and picture signal interpolator |
| DE3930760A1 (de) * | 1989-09-14 | 1991-03-28 | Bosch Gmbh Robert | Transformationscodierungssystem |
Also Published As
| Publication number | Publication date |
|---|---|
| CA2131806A1 (en) | 1993-09-30 |
| AU670068B2 (en) | 1996-07-04 |
| JPH07508375A (ja) | 1995-09-14 |
| NO943520L (no) | 1994-09-22 |
| EP0632940B1 (de) | 2000-10-18 |
| ATE197105T1 (de) | 2000-11-15 |
| KR950701161A (ko) | 1995-02-20 |
| NO943520D0 (no) | 1994-09-22 |
| DE4209382C1 (enExample) | 1993-03-18 |
| DE59310119D1 (de) | 2000-11-23 |
| EP0632940A1 (de) | 1995-01-11 |
| AU3625493A (en) | 1993-10-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE69233094T2 (de) | Verfahren und Anordnung zur Datenkompression bei welchem Quantisierungsbits einem Block in einem gegenwärtigen Rahmen in Abhängigkeit eines Blocks in einem vergangenen Rahmen zugeteilt werden | |
| DE60214027T2 (de) | Kodiervorrichtung und dekodiervorrichtung | |
| DE3639753C2 (enExample) | ||
| DE60015030T2 (de) | Auf Block Umschaltung basierender Teilband-Audiokodierer | |
| EP0601437B1 (de) | Verfahren zur kompatiblen Übertragung und/oder Speicherung und Decodierung eines Zusatzsignals | |
| DE69732619T2 (de) | Verfahren zum Codieren von digitalen Audiosignalen | |
| DE4320990B4 (de) | Verfahren zur Redundanzreduktion | |
| EP0954909B1 (de) | Verfahren zum codieren eines audiosignals | |
| EP0931386B1 (de) | Verfahren zum signalisieren einer rauschsubstitution beim codieren eines audiosignals | |
| DE69804478T2 (de) | Verfahren und vorrichtung zur codierung und decodierung mehrere tonkanäle mit geringer bitrate | |
| DE69933119T2 (de) | Verfahren und vorrichtung zur maskierung des quantisierungsrauschens von audiosignalen | |
| DE19921122C1 (de) | Verfahren und Vorrichtung zum Verschleiern eines Fehlers in einem codierten Audiosignal und Verfahren und Vorrichtung zum Decodieren eines codierten Audiosignals | |
| DE69515907T2 (de) | Verfahren und gerät zum anwenden von wellenformprädiktion auf teilbänder in einem perzeptiven kodiersystem | |
| WO2005083678A1 (de) | Vorrichtung und verfahren zum verarbeiten eines multikanalsignals | |
| DE10328777A1 (de) | Vorrichtung und Verfahren zum Codieren eines Audiosignals und Vorrichtung und Verfahren zum Decodieren eines codierten Audiosignals | |
| DE60311334T2 (de) | Verfahren und Vorrichtung zur Kodierung und Dekodierung eines digitalen Informationssignals | |
| WO1990009063A1 (de) | Verfahren zur übertragung eines signals | |
| DE60206269T2 (de) | Editieren von audiosignalen | |
| EP0464534B1 (de) | Transformationskodierer mit adaptiver Fensterfunktion | |
| EP0494918A1 (de) | Verfahren zur übertragung eines signals. | |
| EP0632940B1 (de) | Verfahren zur datenreduktion bei der speicherung und/oder übertragung digitaler audiosignale für studio-anwendungen | |
| EP0378609B1 (de) | Verfahren zur übertragung eines audiosignals | |
| DE19742201C1 (de) | Verfahren und Vorrichtung zum Codieren von Audiosignalen | |
| DE4222150C2 (de) | Verfahren zur Übertragung und/oder Speicherung digitaler Audiosignale nach dem ISO-MPEG-Audio-Standard mit erweiterten Abtastfrequenzen und Bitraten | |
| DE69534799T2 (de) | Übertragungssystem mit anwendung verschiedener kodierprinzipen |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA JP KR NO RU UA US |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE |
|
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 1993905148 Country of ref document: EP |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2131806 Country of ref document: CA |
|
| ENP | Entry into the national phase |
Ref document number: 1994 307664 Country of ref document: US Date of ref document: 19941221 Kind code of ref document: A |
|
| WWP | Wipo information: published in national office |
Ref document number: 1993905148 Country of ref document: EP |
|
| WWG | Wipo information: grant in national office |
Ref document number: 1993905148 Country of ref document: EP |