EP2599079A2 - Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio - Google Patents
Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audioInfo
- Publication number
- EP2599079A2 EP2599079A2 EP11745635.0A EP11745635A EP2599079A2 EP 2599079 A2 EP2599079 A2 EP 2599079A2 EP 11745635 A EP11745635 A EP 11745635A EP 2599079 A2 EP2599079 A2 EP 2599079A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- subbands
- frame
- encoded
- target frame
- location
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims description 107
- 230000005236 sound signal Effects 0.000 title claims description 88
- 239000013598 vector Substances 0.000 claims description 33
- 238000012545 processing Methods 0.000 claims description 32
- 238000013139 quantization Methods 0.000 claims description 10
- 230000008569 process Effects 0.000 claims description 5
- 238000010586 diagram Methods 0.000 description 40
- 238000004891 communication Methods 0.000 description 25
- 238000001228 spectrum Methods 0.000 description 21
- 238000003491 array Methods 0.000 description 12
- 230000001965 increasing effect Effects 0.000 description 9
- 230000005540 biological transmission Effects 0.000 description 8
- 230000003287 optical effect Effects 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 7
- 230000003247 decreasing effect Effects 0.000 description 7
- 102100028138 F-box/WD repeat-containing protein 7 Human genes 0.000 description 6
- 101001060231 Homo sapiens F-box/WD repeat-containing protein 7 Proteins 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 238000013507 mapping Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000013500 data storage Methods 0.000 description 3
- 239000000835 fiber Substances 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 230000000873 masking effect Effects 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 239000004065 semiconductor Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 230000000737 periodic effect Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 101000822695 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C1 Proteins 0.000 description 1
- 101000655262 Clostridium perfringens (strain 13 / Type A) Small, acid-soluble spore protein C2 Proteins 0.000 description 1
- 101000655256 Paraclostridium bifermentans Small, acid-soluble spore protein alpha Proteins 0.000 description 1
- 101000655264 Paraclostridium bifermentans Small, acid-soluble spore protein beta Proteins 0.000 description 1
- 238000012952 Resampling Methods 0.000 description 1
- 238000005056 compaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000013138 pruning Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/093—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using sinusoidal excitation models
Definitions
- FIG. 8E shows a block diagram of an implementation A130 of apparatus A120.
- the locations of regions of significant energy in the frequency domain at a given time may be relatively persistent over time. It may be desirable to perform efficient transform-domain coding of an audio signal by exploiting such a correlation over time.
- a scheme as described herein for coding a set of transform coefficients that represent an audio-frequency range of a signal exploits time-persistence of energy distribution across the signal spectrum by encoding the locations of regions of significant energy in the frequency domain relative to locations of such regions in an earlier frame of the signal as decoded.
- VQ coding scheme e.g., GSVQ
- GSVQ VQ coding scheme
- method MB 110 is arranged to encode regions of significant energy in a frequency range of an UB-MDCT spectrum.
- FIG. 14B shows a block diagram of an implementation of the path of FIG. 14A in which transform module MM1 is implemented using an MDCT transform module.
- Modified DCT module MM 10 performs an MDCT operation on each audio frame to produce a set of MDCT domain coefficients.
- FIG. 16 shows front, rear, and side views of a handset H100 (e.g., a smartphone) having two voice microphones MVlO-1 and MV10-3 arranged on the front face, a voice microphone MV10-2 arranged on the rear face, an error microphone ME 10 located in a top corner of the front face, and a noise reference microphone MR 10 located on the back face.
- a loudspeaker LS10 is arranged in the top center of the front face near error microphone ME10, and two other loudspeakers LS20L, LS20R are also provided (e.g., for speakerphone applications).
- a maximum distance between the microphones of such a handset is typically about ten or twelve centimeters.
- Important design requirements for implementation of a configuration as disclosed herein may include minimizing processing delay and/or computational complexity (typically measured in millions of instructions per second or MIPS), especially for computation-intensive applications, such as playback of compressed audio or audiovisual information (e.g., a file or stream encoded according to a compression format, such as one of the examples identified herein) or applications for wideband communications (e.g., voice communications at sampling rates higher than eight kilohertz, such as 12, 16, 44.1, 48, or 192 kHz).
- MIPS processing delay and/or computational complexity
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Un système destiné à coder un ensemble de coefficients de transformation qui représentent une plage de fréquences audio d'un signal utilise des informations en provenance d'une trame de référence qui décrit une trame précédente du signal de façon à déterminer dans le domaine fréquentiel des emplacements de régions qui présentent une énergie importante dans une trame cible du signal.
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US36966210P | 2010-07-30 | 2010-07-30 | |
US36970510P | 2010-07-31 | 2010-07-31 | |
US36975110P | 2010-08-01 | 2010-08-01 | |
US37456510P | 2010-08-17 | 2010-08-17 | |
US201061384237P | 2010-09-17 | 2010-09-17 | |
US201161470438P | 2011-03-31 | 2011-03-31 | |
US13/193,542 US20120029926A1 (en) | 2010-07-30 | 2011-07-28 | Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals |
PCT/US2011/045865 WO2012016128A2 (fr) | 2010-07-30 | 2011-07-29 | Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio |
Publications (1)
Publication Number | Publication Date |
---|---|
EP2599079A2 true EP2599079A2 (fr) | 2013-06-05 |
Family
ID=48173935
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP11745635.0A Withdrawn EP2599079A2 (fr) | 2010-07-30 | 2011-07-29 | Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio |
Country Status (1)
Country | Link |
---|---|
EP (1) | EP2599079A2 (fr) |
-
2011
- 2011-07-29 EP EP11745635.0A patent/EP2599079A2/fr not_active Withdrawn
Non-Patent Citations (1)
Title |
---|
See references of WO2012016128A2 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2599080B1 (fr) | Systèmes, procédés, appareil et supports lisibles par ordinateur permettant de coder des signaux harmoniques | |
KR101445512B1 (ko) | 잡음 주입을 위한 시스템, 방법, 장치, 및 컴퓨터 판독가능 매체 | |
CN104995678B (zh) | 用于控制平均编码率的系统和方法 | |
EP2599079A2 (fr) | Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio | |
ES2653799T3 (es) | Sistemas, procedimientos, aparatos y medios legibles por ordenador para la decodificación de señales armónicas |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20130114 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
DAX | Request for extension of the european patent (deleted) | ||
17Q | First examination report despatched |
Effective date: 20150108 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20150519 |