CA2865651C - Phase coherence control for harmonic signals in perceptual audio codecs - Google Patents

Phase coherence control for harmonic signals in perceptual audio codecs Download PDF

Info

Publication number
CA2865651C
CA2865651C CA2865651A CA2865651A CA2865651C CA 2865651 C CA2865651 C CA 2865651C CA 2865651 A CA2865651 A CA 2865651A CA 2865651 A CA2865651 A CA 2865651A CA 2865651 C CA2865651 C CA 2865651C
Authority
CA
Canada
Prior art keywords
audio signal
control information
phase
vpc
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CA2865651A
Other languages
English (en)
French (fr)
Other versions
CA2865651A1 (en
Inventor
Sascha Disch
Jurgen Herre
Bernd Edler
Frederik Nagel
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Original Assignee
Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV filed Critical Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Publication of CA2865651A1 publication Critical patent/CA2865651A1/en
Application granted granted Critical
Publication of CA2865651C publication Critical patent/CA2865651C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
CA2865651A 2012-02-27 2013-02-26 Phase coherence control for harmonic signals in perceptual audio codecs Active CA2865651C (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201261603773P 2012-02-27 2012-02-27
US61/603,773 2012-02-27
EP12178265.0 2012-07-27
EP12178265.0A EP2631906A1 (en) 2012-02-27 2012-07-27 Phase coherence control for harmonic signals in perceptual audio codecs
PCT/EP2013/053831 WO2013127801A1 (en) 2012-02-27 2013-02-26 Phase coherence control for harmonic signals in perceptual audio codecs

Publications (2)

Publication Number Publication Date
CA2865651A1 CA2865651A1 (en) 2013-09-06
CA2865651C true CA2865651C (en) 2017-05-02

Family

ID=47076051

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2865651A Active CA2865651C (en) 2012-02-27 2013-02-26 Phase coherence control for harmonic signals in perceptual audio codecs

Country Status (14)

Country Link
US (1) US10818304B2 (https=)
EP (2) EP2631906A1 (https=)
JP (1) JP5873936B2 (https=)
KR (1) KR101680953B1 (https=)
CN (1) CN104170009B (https=)
AU (1) AU2013225076B2 (https=)
BR (1) BR112014021054B1 (https=)
CA (1) CA2865651C (https=)
ES (1) ES2673319T3 (https=)
IN (1) IN2014KN01766A (https=)
MX (1) MX338526B (https=)
RU (1) RU2612584C2 (https=)
TR (1) TR201808452T4 (https=)
WO (1) WO2013127801A1 (https=)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BRPI0718738B1 (pt) 2006-12-12 2023-05-16 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Codificador, decodificador e métodos para codificação e decodificação de segmentos de dados representando uma corrente de dados de domínio de tempo
KR20160087827A (ko) * 2013-11-22 2016-07-22 퀄컴 인코포레이티드 고대역 코딩에서의 선택적 위상 보상
EP2963648A1 (en) 2014-07-01 2016-01-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio processor and method for processing an audio signal using vertical phase correction
CA2976864C (en) 2015-02-26 2020-07-14 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for processing an audio signal to obtain a processed audio signal using a target time-domain envelope
TWI693594B (zh) * 2015-03-13 2020-05-11 瑞典商杜比國際公司 解碼具有增強頻譜帶複製元資料在至少一填充元素中的音訊位元流
WO2016046421A1 (en) * 2015-11-19 2016-03-31 Telefonaktiebolaget L M Ericsson (Publ) Method and apparatus for voiced speech detection
CN106653004B (zh) * 2016-12-26 2019-07-26 苏州大学 感知语谱规整耳蜗滤波系数的说话人识别特征提取方法
US11771779B2 (en) 2018-01-26 2023-10-03 Hadasit Medical Research Services & Development Limited Non-metallic magnetic resonance contrast agent
IL313348B2 (en) 2018-04-25 2025-08-01 Dolby Int Ab Integration of high frequency reconstruction techniques with reduced post-processing delay
KR20250130700A (ko) 2018-04-25 2025-09-02 돌비 인터네셔널 에이비 고주파 오디오 재구성 기술의 통합
CN110728970B (zh) * 2019-09-29 2022-02-25 东莞市中光通信科技有限公司 一种数字辅助隔音处理的方法及装置
US12347447B2 (en) 2019-12-05 2025-07-01 Dolby Laboratories Licensing Corporation Psychoacoustic model for audio processing
CN113990334B (zh) * 2021-10-28 2024-11-01 深圳市智创一切科技有限公司 用于语音编码的蓝牙音频的传送方法、系统和电子设备
EP4276824A1 (en) 2022-05-13 2023-11-15 Alta Voce Method for modifying an audio signal without phasiness
CN116486835B (zh) * 2023-05-31 2025-09-02 平安科技(深圳)有限公司 合成语音检测方法和系统、计算机设备、存储介质

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5054072A (en) * 1987-04-02 1991-10-01 Massachusetts Institute Of Technology Coding of acoustic waveforms
RU2009585C1 (ru) * 1991-06-19 1994-03-15 Евгений Николаевич Пестов Способ ударного возбуждения фазовой когерентности одновременно по крайней мере в двух квантовых системах
FR2692091B1 (fr) * 1992-06-03 1995-04-14 France Telecom Procédé et dispositif de dissimulation d'erreurs de transmission de signaux audio-numériques codés par transformée fréquentielle.
US6766300B1 (en) * 1996-11-07 2004-07-20 Creative Technology Ltd. Method and apparatus for transient detection and non-distortion time scaling
JPH11251918A (ja) * 1998-03-03 1999-09-17 Takayoshi Hirata 音声信号波形符号化伝送方式
US6397175B1 (en) * 1999-07-19 2002-05-28 Qualcomm Incorporated Method and apparatus for subsampling phase spectrum information
US6549884B1 (en) * 1999-09-21 2003-04-15 Creative Technology Ltd. Phase-vocoder pitch-shifting
KR100348790B1 (ko) * 1999-12-21 2002-08-17 엘지전자주식회사 큐에이엠 수신기
US7006636B2 (en) * 2002-05-24 2006-02-28 Agere Systems Inc. Coherence-based audio coding and synthesis
US20030187663A1 (en) * 2002-03-28 2003-10-02 Truman Michael Mead Broadband frequency translation for high frequency regeneration
JP4313993B2 (ja) * 2002-07-19 2009-08-12 パナソニック株式会社 オーディオ復号化装置およびオーディオ復号化方法
CN1231889C (zh) * 2002-11-19 2005-12-14 华为技术有限公司 多通道声码器的语音处理方法
SE527669C2 (sv) * 2003-12-19 2006-05-09 Ericsson Telefon Ab L M Förbättrad felmaskering i frekvensdomänen
SE0303498D0 (sv) * 2003-12-19 2003-12-19 Ericsson Telefon Ab L M Spectral loss conccalment in transform codecs
JP4513556B2 (ja) * 2003-12-25 2010-07-28 カシオ計算機株式会社 音声分析合成装置、及びプログラム
WO2006003550A1 (en) * 2004-06-28 2006-01-12 Koninklijke Philips Electronics N.V. Wireless audio
JP4734961B2 (ja) * 2005-02-28 2011-07-27 カシオ計算機株式会社 音響効果付与装置、及びプログラム
US7856355B2 (en) * 2005-07-05 2010-12-21 Alcatel-Lucent Usa Inc. Speech quality assessment method and system
US7546237B2 (en) * 2005-12-23 2009-06-09 Qnx Software Systems (Wavemakers), Inc. Bandwidth extension of narrowband speech
US9697844B2 (en) * 2006-05-17 2017-07-04 Creative Technology Ltd Distributed spatial audio decoder
EP1918911A1 (en) * 2006-11-02 2008-05-07 RWTH Aachen University Time scale modification of an audio signal
KR101453732B1 (ko) * 2007-04-16 2014-10-24 삼성전자주식회사 스테레오 신호 및 멀티 채널 신호 부호화 및 복호화 방법및 장치
KR101230479B1 (ko) * 2008-03-10 2013-02-06 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 트랜지언트 이벤트를 갖는 오디오 신호를 조작하기 위한 장치 및 방법
EP2237266A1 (en) * 2009-04-03 2010-10-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for determining a plurality of local center of gravity frequencies of a spectrum of an audio signal
WO2011039668A1 (en) * 2009-09-29 2011-04-07 Koninklijke Philips Electronics N.V. Apparatus for mixing a digital audio
EP2704143B1 (en) * 2009-10-21 2015-01-07 Panasonic Intellectual Property Corporation of America Apparatus, method and computer program for audio signal processing
EP2545551B1 (en) * 2010-03-09 2017-10-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Improved magnitude response and temporal alignment in phase vocoder based bandwidth extension for audio signals
JP6037156B2 (ja) * 2011-08-24 2016-11-30 ソニー株式会社 符号化装置および方法、並びにプログラム
FR3008533A1 (fr) * 2013-07-12 2015-01-16 Orange Facteur d'echelle optimise pour l'extension de bande de frequence dans un decodeur de signaux audiofrequences

Also Published As

Publication number Publication date
KR20140130225A (ko) 2014-11-07
IN2014KN01766A (https=) 2015-10-23
WO2013127801A1 (en) 2013-09-06
JP2015508911A (ja) 2015-03-23
CN104170009B (zh) 2017-02-22
EP2631906A1 (en) 2013-08-28
JP5873936B2 (ja) 2016-03-01
MX2014010098A (es) 2014-09-16
KR101680953B1 (ko) 2016-12-12
RU2612584C2 (ru) 2017-03-09
EP2820647B1 (en) 2018-03-21
ES2673319T3 (es) 2018-06-21
EP2820647A1 (en) 2015-01-07
BR112014021054B1 (pt) 2022-04-26
US10818304B2 (en) 2020-10-27
CA2865651A1 (en) 2013-09-06
TR201808452T4 (tr) 2018-07-23
MX338526B (es) 2016-04-20
BR112014021054A2 (pt) 2021-05-25
CN104170009A (zh) 2014-11-26
AU2013225076B2 (en) 2016-04-21
AU2013225076A1 (en) 2014-09-04
RU2014138820A (ru) 2016-04-20
US20140372131A1 (en) 2014-12-18

Similar Documents

Publication Publication Date Title
CA2865651C (en) Phase coherence control for harmonic signals in perceptual audio codecs
CA2987808C (en) Apparatus and method for encoding or decoding an audio multi-channel signal using spectral-domain resampling
CN108369810B (zh) 用于对多声道音频信号进行编码的自适应声道缩减处理
JP5426680B2 (ja) 信号処理方法及び装置
CA2589623C (en) Temporal envelope shaping for spatial audio coding using frequency domain wiener filtering
JP6535730B2 (ja) 独立したノイズ充填を用いた強化された信号を生成するための装置および方法
EP2169666B1 (en) A method and an apparatus for processing a signal
EP4123645B1 (en) Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision
WO2010097748A1 (en) Parametric stereo encoding and decoding
CA2887228A1 (en) Encoder, decoder and methods for backward compatible multi-resolution spatial-audio-object-coding
KR101837686B1 (ko) 공간적 오디오 객체 코딩에 오디오 정보를 적응시키기 위한 장치 및 방법
Lindblom et al. Flexible sum-difference stereo coding based on time-aligned signal components

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20140826

MPN Maintenance fee for patent paid

Free format text: FEE DESCRIPTION TEXT: MF (PATENT, 12TH ANNIV.) - STANDARD

Year of fee payment: 12

U00 Fee paid

Free format text: ST27 STATUS EVENT CODE: A-4-4-U10-U00-U101 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE REQUEST RECEIVED

Effective date: 20250218

U11 Full renewal or maintenance fee paid

Free format text: ST27 STATUS EVENT CODE: A-4-4-U10-U11-U102 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE FEE PAYMENT DETERMINED COMPLIANT

Effective date: 20250218

Free format text: ST27 STATUS EVENT CODE: A-4-4-U10-U11-U102 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE FEE PAYMENT PAID IN FULL

Effective date: 20250218

MPN Maintenance fee for patent paid

Free format text: FEE DESCRIPTION TEXT: MF (PATENT, 13TH ANNIV.) - STANDARD

Year of fee payment: 13

U00 Fee paid

Free format text: ST27 STATUS EVENT CODE: A-4-4-U10-U00-U101 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE REQUEST RECEIVED

Effective date: 20260212

U11 Full renewal or maintenance fee paid

Free format text: ST27 STATUS EVENT CODE: A-4-4-U10-U11-U102 (AS PROVIDED BY THE NATIONAL OFFICE); EVENT TEXT: MAINTENANCE FEE PAYMENT PAID IN FULL

Effective date: 20260212