CA2482427C - Dispositif et procede pour coder un signal audio a temps discret et dispositif et procede pour decoder des donnees audio codees - Google Patents
Dispositif et procede pour coder un signal audio a temps discret et dispositif et procede pour decoder des donnees audio codees Download PDFInfo
- Publication number
- CA2482427C CA2482427C CA002482427A CA2482427A CA2482427C CA 2482427 C CA2482427 C CA 2482427C CA 002482427 A CA002482427 A CA 002482427A CA 2482427 A CA2482427 A CA 2482427A CA 2482427 C CA2482427 C CA 2482427C
- Authority
- CA
- Canada
- Prior art keywords
- block
- integer
- difference
- spectral values
- quantization
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 65
- 238000000034 method Methods 0.000 title claims abstract description 30
- 230000003595 spectral effect Effects 0.000 claims abstract description 224
- 238000013139 quantization Methods 0.000 claims abstract description 111
- 238000012545 processing Methods 0.000 claims description 36
- 239000011159 matrix material Substances 0.000 claims description 19
- 230000002123 temporal effect Effects 0.000 claims description 18
- 238000006243 chemical reaction Methods 0.000 claims description 8
- 238000004590 computer program Methods 0.000 claims description 4
- 230000001419 dependent effect Effects 0.000 claims description 4
- 238000013507 mapping Methods 0.000 claims description 4
- 230000008569 process Effects 0.000 abstract description 5
- 230000005540 biological transmission Effects 0.000 description 12
- 238000001228 spectrum Methods 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 9
- 238000010586 diagram Methods 0.000 description 8
- 230000008901 benefit Effects 0.000 description 5
- 230000002349 favourable effect Effects 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 4
- 244000089409 Erythrina poeppigiana Species 0.000 description 3
- 235000009776 Rathbunia alamosensis Nutrition 0.000 description 3
- 238000013144 data compression Methods 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 230000001052 transient effect Effects 0.000 description 3
- 241001442234 Cosa Species 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000002592 echocardiography Methods 0.000 description 2
- 230000000873 masking effect Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000007493 shaping process Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0017—Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Abstract
Selon la présente invention, un signal audio à temps discret est traité (52) afin de fournir (52) un bloc de quantification avec des valeurs spectrales quantifiées. Une représentation spectrale en nombres entiers est produite à partir du signal audio à temps discret, par utilisation d'un algorithme de transformation (56) en nombres entiers. Le bloc de quantification qui a été produit à l'aide d'un modèle psychoacoustique (54) est inversement quantifié et arrondi (58) afin d'établir une différence entre les valeurs spectrales en nombres entiers et les valeurs spectrales arrondies inversement quantifiées. Le bloc de quantification seul fournit, après le décodage, un signal audio à codage/décodage psychoacoustique avec pertes, alors que le bloc de quantification avec le bloc de combinaison fournit, lors du décodage, un signal audio codé ou à nouveau décodé, sans perte ou quasiment sans perte. La production du signal différentiel dans le domaine fréquentiel permet d'obtenir une structure de codeur/décodeur simplifiée.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10217297.8 | 2002-04-18 | ||
DE10217297A DE10217297A1 (de) | 2002-04-18 | 2002-04-18 | Vorrichtung und Verfahren zum Codieren eines zeitdiskreten Audiosignals und Vorrichtung und Verfahren zum Decodieren von codierten Audiodaten |
PCT/EP2002/013623 WO2003088212A1 (fr) | 2002-04-18 | 2002-12-02 | Dispositif et procede pour coder un signal audio a temps discret et dispositif et procede pour decoder des donnees audio codees |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2482427A1 CA2482427A1 (fr) | 2003-10-23 |
CA2482427C true CA2482427C (fr) | 2010-01-19 |
Family
ID=28798541
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002482427A Expired - Lifetime CA2482427C (fr) | 2002-04-18 | 2002-12-02 | Dispositif et procede pour coder un signal audio a temps discret et dispositif et procede pour decoder des donnees audio codees |
Country Status (9)
Country | Link |
---|---|
EP (1) | EP1495464B1 (fr) |
JP (1) | JP4081447B2 (fr) |
KR (1) | KR100892152B1 (fr) |
CN (1) | CN1258172C (fr) |
AT (1) | ATE305655T1 (fr) |
CA (1) | CA2482427C (fr) |
DE (2) | DE10217297A1 (fr) |
HK (1) | HK1077391A1 (fr) |
WO (1) | WO2003088212A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11817111B2 (en) | 2018-04-11 | 2023-11-14 | Dolby Laboratories Licensing Corporation | Perceptually-based loss functions for audio encoding and decoding based on machine learning |
Families Citing this family (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070276894A1 (en) * | 2003-09-29 | 2007-11-29 | Agency For Science, Technology And Research | Process And Device For Determining A Transforming Element For A Given Transformation Function, Method And Device For Transforming A Digital Signal From The Time Domain Into The Frequency Domain And Vice Versa And Computer Readable Medium |
KR101141247B1 (ko) * | 2003-10-10 | 2012-05-04 | 에이전시 포 사이언스, 테크놀로지 앤드 리서치 | 디지털 신호를 확장성 비트스트림으로 인코딩하는 방법;확장성 비트스트림을 디코딩하는 방법 |
DE102004007200B3 (de) * | 2004-02-13 | 2005-08-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audiocodierung |
DE102004007184B3 (de) * | 2004-02-13 | 2005-09-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Verfahren und Vorrichtung zum Quantisieren eines Informationssignals |
DE102004059979B4 (de) | 2004-12-13 | 2007-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zur Berechnung einer Signalenergie eines Informationssignals |
US8494667B2 (en) | 2005-06-30 | 2013-07-23 | Lg Electronics Inc. | Apparatus for encoding and decoding audio signal and method thereof |
ATE455348T1 (de) | 2005-08-30 | 2010-01-15 | Lg Electronics Inc | Vorrichtung und verfahren zur dekodierung eines audiosignals |
KR100878833B1 (ko) | 2005-10-05 | 2009-01-14 | 엘지전자 주식회사 | 신호 처리 방법 및 이의 장치, 그리고 인코딩 및 디코딩방법 및 이의 장치 |
US7653533B2 (en) | 2005-10-24 | 2010-01-26 | Lg Electronics Inc. | Removing time delays in signal paths |
EP1852849A1 (fr) | 2006-05-05 | 2007-11-07 | Deutsche Thomson-Brandt Gmbh | Procédé et appareil d'encodage sans perte d'un signal source utilisant un courant de données encodées avec perte et un courant d'extension de données encodées sans perte |
EP1883067A1 (fr) * | 2006-07-24 | 2008-01-30 | Deutsche Thomson-Brandt Gmbh | Méthode et appareil pour l'encodage sans perte d'un signal source, utilisant un flux de données encodées avec pertes et un flux de données d'extension sans perte. |
EP1903559A1 (fr) | 2006-09-20 | 2008-03-26 | Deutsche Thomson-Brandt Gmbh | Procédé et dispositif de transcodage de signaux audio |
DE102006051673A1 (de) * | 2006-11-02 | 2008-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Nachbearbeiten von Spektralwerten und Encodierer und Decodierer für Audiosignale |
DE102007003187A1 (de) * | 2007-01-22 | 2008-10-02 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Vorrichtung und Verfahren zum Erzeugen eines zu sendenden Signals oder eines decodierten Signals |
KR101149448B1 (ko) * | 2007-02-12 | 2012-05-25 | 삼성전자주식회사 | 오디오 부호화 및 복호화 장치와 그 방법 |
EP2015293A1 (fr) * | 2007-06-14 | 2009-01-14 | Deutsche Thomson OHG | Procédé et appareil pour coder et décoder un signal audio par résolution temporelle à commutation adaptative dans le domaine spectral |
MX2010001763A (es) * | 2007-08-27 | 2010-03-10 | Ericsson Telefon Ab L M | Analisis/sintesis espectral de baja complejidad utilizando la resolucion temporal seleccionable. |
EP2063417A1 (fr) * | 2007-11-23 | 2009-05-27 | Deutsche Thomson OHG | Formage de l'erreur d'arrondi pour le codage et décodage basés sur des transformées entières |
EP2144230A1 (fr) | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Schéma de codage/décodage audio à taux bas de bits disposant des commutateurs en cascade |
CN102177426B (zh) * | 2008-10-08 | 2014-11-05 | 弗兰霍菲尔运输应用研究公司 | 多分辨率切换音频编码/解码方案 |
CN102918590B (zh) * | 2010-03-31 | 2014-12-10 | 韩国电子通信研究院 | 编码方法和装置、以及解码方法和装置 |
US20120029926A1 (en) * | 2010-07-30 | 2012-02-02 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals |
US9208792B2 (en) | 2010-08-17 | 2015-12-08 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for noise injection |
JP5799707B2 (ja) * | 2011-09-26 | 2015-10-28 | ソニー株式会社 | オーディオ符号化装置およびオーディオ符号化方法、オーディオ復号装置およびオーディオ復号方法、並びにプログラム |
EP2830058A1 (fr) * | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codage audio en domaine de fréquence supportant la commutation de longueur de transformée |
CN105632503B (zh) * | 2014-10-28 | 2019-09-03 | 南宁富桂精密工业有限公司 | 信息隐藏方法及系统 |
US10354667B2 (en) * | 2017-03-22 | 2019-07-16 | Immersion Networks, Inc. | System and method for processing audio data |
EP3471271A1 (fr) * | 2017-10-16 | 2019-04-17 | Acoustical Beauty | Convolutions améliorées de signaux numériques utilisant une optimisation des exigences de bits d'un signal numérique cible |
EP3483879A1 (fr) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Fonction de fenêtrage d'analyse/de synthèse pour une transformation chevauchante modulée |
WO2019091576A1 (fr) * | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codeurs audio, décodeurs audio, procédés et programmes informatiques adaptant un codage et un décodage de bits les moins significatifs |
CN107911122A (zh) * | 2017-11-13 | 2018-04-13 | 南京大学 | 基于分解压缩的分布式光纤振动传感数据无损压缩方法 |
US11281312B2 (en) | 2018-01-08 | 2022-03-22 | Immersion Networks, Inc. | Methods and apparatuses for producing smooth representations of input motion in time and space |
DE102019204527B4 (de) * | 2019-03-29 | 2020-11-19 | Technische Universität München | Kodierungs-/dekodierungsvorrichtungen und verfahren zur kodierung/dekodierung von vibrotaktilen signalen |
KR102250835B1 (ko) * | 2019-08-05 | 2021-05-11 | 국방과학연구소 | 수동 소나의 협대역 신호를 탐지하기 위한 lofar 또는 demon 그램의 압축 장치 |
CN118571234A (zh) * | 2023-02-28 | 2024-08-30 | 华为技术有限公司 | 音频编解码方法及相关装置 |
-
2002
- 2002-04-18 DE DE10217297A patent/DE10217297A1/de not_active Withdrawn
- 2002-12-02 JP JP2003585070A patent/JP4081447B2/ja not_active Expired - Lifetime
- 2002-12-02 DE DE50204426T patent/DE50204426D1/de not_active Expired - Lifetime
- 2002-12-02 AT AT02792858T patent/ATE305655T1/de active
- 2002-12-02 CN CNB028289749A patent/CN1258172C/zh not_active Expired - Lifetime
- 2002-12-02 KR KR1020047016744A patent/KR100892152B1/ko active IP Right Grant
- 2002-12-02 EP EP02792858A patent/EP1495464B1/fr not_active Expired - Lifetime
- 2002-12-02 CA CA002482427A patent/CA2482427C/fr not_active Expired - Lifetime
- 2002-12-02 WO PCT/EP2002/013623 patent/WO2003088212A1/fr active IP Right Grant
-
2005
- 2005-10-20 HK HK05109316A patent/HK1077391A1/xx not_active IP Right Cessation
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11817111B2 (en) | 2018-04-11 | 2023-11-14 | Dolby Laboratories Licensing Corporation | Perceptually-based loss functions for audio encoding and decoding based on machine learning |
Also Published As
Publication number | Publication date |
---|---|
CA2482427A1 (fr) | 2003-10-23 |
JP2005527851A (ja) | 2005-09-15 |
DE50204426D1 (de) | 2005-11-03 |
HK1077391A1 (en) | 2006-02-10 |
JP4081447B2 (ja) | 2008-04-23 |
CN1625768A (zh) | 2005-06-08 |
KR100892152B1 (ko) | 2009-04-10 |
WO2003088212A1 (fr) | 2003-10-23 |
AU2002358578A1 (en) | 2003-10-27 |
DE10217297A1 (de) | 2003-11-06 |
CN1258172C (zh) | 2006-05-31 |
KR20050007312A (ko) | 2005-01-17 |
EP1495464A1 (fr) | 2005-01-12 |
EP1495464B1 (fr) | 2005-09-28 |
ATE305655T1 (de) | 2005-10-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2482427C (fr) | Dispositif et procede pour coder un signal audio a temps discret et dispositif et procede pour decoder des donnees audio codees | |
US7275036B2 (en) | Apparatus and method for coding a time-discrete audio signal to obtain coded audio data and for decoding coded audio data | |
USRE49492E1 (en) | Audio or video encoder, audio or video decoder and related methods for processing multi-channel audio or video signals using a variable prediction direction | |
US7343287B2 (en) | Method and apparatus for scalable encoding and method and apparatus for scalable decoding | |
US8195730B2 (en) | Apparatus and method for conversion into a transformed representation or for inverse conversion of the transformed representation | |
US7917564B2 (en) | Device and method for processing a signal having a sequence of discrete values | |
US20080319739A1 (en) | Low complexity decoder for complex transform coding of multi-channel sound | |
US7512539B2 (en) | Method and device for processing time-discrete audio sampled values | |
JP2013528822A (ja) | オーディオエンコーダ、オーディオデコーダ、及び複素数予測を使用したマルチチャンネルオーディオ信号処理方法 | |
KR20070098930A (ko) | 근접-투명 또는 투명 멀티-채널 인코더/디코더 구성 | |
EP1974470A1 (fr) | Codage de canal de transformee complexe avec codage de frequence a bande etendue | |
CN103329197A (zh) | 用于反相声道的改进的立体声参数编码/解码 | |
EP2279562A2 (fr) | Factorisation de transformées chevauchantes en deux transformées par blocs | |
Geiger et al. | IntMDCT-A link between perceptual and lossless audio coding | |
TW201928947A (zh) | 用於統一語音及音訊之解碼及編碼去關聯濾波器之改良之方法、裝置及系統 | |
WO2008114080A1 (fr) | Décodage audio | |
Herre | Audio Coding Based on Integer Transforms | |
Fraunhofer | INTMDCT-A LINK BETWEEN PERCEPTUAL AND LOSSLESS AUDIO CODING |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKEX | Expiry |
Effective date: 20221202 |
|
MKEX | Expiry |
Effective date: 20221202 |