EP2599079A2 - Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio - Google Patents

Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio

Info

Publication number
EP2599079A2
EP2599079A2 EP11745635.0A EP11745635A EP2599079A2 EP 2599079 A2 EP2599079 A2 EP 2599079A2 EP 11745635 A EP11745635 A EP 11745635A EP 2599079 A2 EP2599079 A2 EP 2599079A2
Authority
EP
European Patent Office
Prior art keywords
subbands
frame
encoded
target frame
location
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP11745635.0A
Other languages
German (de)
English (en)
Inventor
Venkatesh Krishnan
Vivek Rajendran
Ethan R. Duni
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US13/193,542 external-priority patent/US20120029926A1/en
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of EP2599079A2 publication Critical patent/EP2599079A2/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • G10L19/038Vector quantisation, e.g. TwinVQ audio
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/093Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using sinusoidal excitation models

Definitions

  • FIG. 8E shows a block diagram of an implementation A130 of apparatus A120.
  • the locations of regions of significant energy in the frequency domain at a given time may be relatively persistent over time. It may be desirable to perform efficient transform-domain coding of an audio signal by exploiting such a correlation over time.
  • a scheme as described herein for coding a set of transform coefficients that represent an audio-frequency range of a signal exploits time-persistence of energy distribution across the signal spectrum by encoding the locations of regions of significant energy in the frequency domain relative to locations of such regions in an earlier frame of the signal as decoded.
  • VQ coding scheme e.g., GSVQ
  • GSVQ VQ coding scheme
  • method MB 110 is arranged to encode regions of significant energy in a frequency range of an UB-MDCT spectrum.
  • FIG. 14B shows a block diagram of an implementation of the path of FIG. 14A in which transform module MM1 is implemented using an MDCT transform module.
  • Modified DCT module MM 10 performs an MDCT operation on each audio frame to produce a set of MDCT domain coefficients.
  • FIG. 16 shows front, rear, and side views of a handset H100 (e.g., a smartphone) having two voice microphones MVlO-1 and MV10-3 arranged on the front face, a voice microphone MV10-2 arranged on the rear face, an error microphone ME 10 located in a top corner of the front face, and a noise reference microphone MR 10 located on the back face.
  • a loudspeaker LS10 is arranged in the top center of the front face near error microphone ME10, and two other loudspeakers LS20L, LS20R are also provided (e.g., for speakerphone applications).
  • a maximum distance between the microphones of such a handset is typically about ten or twelve centimeters.
  • Important design requirements for implementation of a configuration as disclosed herein may include minimizing processing delay and/or computational complexity (typically measured in millions of instructions per second or MIPS), especially for computation-intensive applications, such as playback of compressed audio or audiovisual information (e.g., a file or stream encoded according to a compression format, such as one of the examples identified herein) or applications for wideband communications (e.g., voice communications at sampling rates higher than eight kilohertz, such as 12, 16, 44.1, 48, or 192 kHz).
  • MIPS processing delay and/or computational complexity

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

Un système destiné à coder un ensemble de coefficients de transformation qui représentent une plage de fréquences audio d'un signal utilise des informations en provenance d'une trame de référence qui décrit une trame précédente du signal de façon à déterminer dans le domaine fréquentiel des emplacements de régions qui présentent une énergie importante dans une trame cible du signal.
EP11745635.0A 2010-07-30 2011-07-29 Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio Withdrawn EP2599079A2 (fr)

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US36966210P 2010-07-30 2010-07-30
US36970510P 2010-07-31 2010-07-31
US36975110P 2010-08-01 2010-08-01
US37456510P 2010-08-17 2010-08-17
US201061384237P 2010-09-17 2010-09-17
US201161470438P 2011-03-31 2011-03-31
US13/193,542 US20120029926A1 (en) 2010-07-30 2011-07-28 Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals
PCT/US2011/045865 WO2012016128A2 (fr) 2010-07-30 2011-07-29 Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio

Publications (1)

Publication Number Publication Date
EP2599079A2 true EP2599079A2 (fr) 2013-06-05

Family

ID=48173935

Family Applications (1)

Application Number Title Priority Date Filing Date
EP11745635.0A Withdrawn EP2599079A2 (fr) 2010-07-30 2011-07-29 Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio

Country Status (1)

Country Link
EP (1) EP2599079A2 (fr)

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2012016128A2 *

Similar Documents

Publication Publication Date Title
EP2599080B1 (fr) Systèmes, procédés, appareil et supports lisibles par ordinateur permettant de coder des signaux harmoniques
KR101445512B1 (ko) 잡음 주입을 위한 시스템, 방법, 장치, 및 컴퓨터 판독가능 매체
CN104995678B (zh) 用于控制平均编码率的系统和方法
EP2599079A2 (fr) Systèmes, procédés, appareil et supports pouvant être lus par un ordinateur destinés à un codage à mode dépendant de signaux audio
ES2653799T3 (es) Sistemas, procedimientos, aparatos y medios legibles por ordenador para la decodificación de señales armónicas

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20130114

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DAX Request for extension of the european patent (deleted)
17Q First examination report despatched

Effective date: 20150108

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20150519