TWI575510B - 用於增強對話之解碼方法、電腦程式產品及解碼器 - Google Patents

用於增強對話之解碼方法、電腦程式產品及解碼器 Download PDF

Info

Publication number
TWI575510B
TWI575510B TW104132168A TW104132168A TWI575510B TW I575510 B TWI575510 B TW I575510B TW 104132168 A TW104132168 A TW 104132168A TW 104132168 A TW104132168 A TW 104132168A TW I575510 B TWI575510 B TW I575510B
Authority
TW
Taiwan
Prior art keywords
parameters
subset
dialog
enhanced
channels
Prior art date
Application number
TW104132168A
Other languages
English (en)
Chinese (zh)
Other versions
TW201627983A (zh
Inventor
傑倫 科本斯
皮爾 伊斯坦德
Original Assignee
杜比國際公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 杜比國際公司 filed Critical 杜比國際公司
Publication of TW201627983A publication Critical patent/TW201627983A/zh
Application granted granted Critical
Publication of TWI575510B publication Critical patent/TWI575510B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/03Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Telephonic Communication Services (AREA)
TW104132168A 2014-10-02 2015-09-30 用於增強對話之解碼方法、電腦程式產品及解碼器 TWI575510B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201462059015P 2014-10-02 2014-10-02
US201562128331P 2015-03-04 2015-03-04

Publications (2)

Publication Number Publication Date
TW201627983A TW201627983A (zh) 2016-08-01
TWI575510B true TWI575510B (zh) 2017-03-21

Family

ID=54199263

Family Applications (1)

Application Number Title Priority Date Filing Date
TW104132168A TWI575510B (zh) 2014-10-02 2015-09-30 用於增強對話之解碼方法、電腦程式產品及解碼器

Country Status (19)

Country Link
US (1) US10170131B2 (https=)
EP (1) EP3201918B1 (https=)
JP (1) JP6728146B2 (https=)
KR (1) KR102426965B1 (https=)
CN (1) CN106796804B (https=)
AU (1) AU2015326856B2 (https=)
BR (1) BR112017006325B1 (https=)
CA (1) CA2962806C (https=)
DK (1) DK3201918T3 (https=)
ES (1) ES2709327T3 (https=)
IL (1) IL251263B (https=)
MX (1) MX364166B (https=)
MY (1) MY179448A (https=)
PL (1) PL3201918T3 (https=)
RU (1) RU2701055C2 (https=)
SG (1) SG11201702301SA (https=)
TW (1) TWI575510B (https=)
UA (1) UA120372C2 (https=)
WO (1) WO2016050854A1 (https=)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK3201918T3 (en) * 2014-10-02 2019-02-25 Dolby Int Ab DECODING PROCEDURE AND DECODS FOR DIALOGUE IMPROVEMENT
CN106303897A (zh) 2015-06-01 2017-01-04 杜比实验室特许公司 处理基于对象的音频信号
EP3409029B1 (en) 2016-01-29 2024-10-30 Dolby Laboratories Licensing Corporation Binaural dialogue enhancement
TWI658458B (zh) * 2018-05-17 2019-05-01 張智星 歌聲分離效能提升之方法、非暫態電腦可讀取媒體及電腦程式產品
WO2020216459A1 (en) * 2019-04-23 2020-10-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method or computer program for generating an output downmix representation
JP7810612B2 (ja) * 2022-06-16 2026-02-03 シャープ株式会社 放送システム、受信機、受信方法、及びプログラム

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040252850A1 (en) * 2003-04-24 2004-12-16 Lorenzo Turicchia System and method for spectral enhancement employing compression and expansion
US20060271354A1 (en) * 2005-05-31 2006-11-30 Microsoft Corporation Audio codec post-filter
US20110119061A1 (en) * 2009-11-17 2011-05-19 Dolby Laboratories Licensing Corporation Method and system for dialog enhancement
TW201325269A (zh) * 2011-07-01 2013-06-16 Dolby Lab Licensing Corp 用於適應性音頻信號的產生、譯碼與呈現之系統與方法
US8577676B2 (en) * 2008-04-18 2013-11-05 Dolby Laboratories Licensing Corporation Method and apparatus for maintaining speech audibility in multi-channel audio with minimal impact on surround experience

Family Cites Families (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6463410B1 (en) * 1998-10-13 2002-10-08 Victor Company Of Japan, Ltd. Audio signal processing apparatus
US7158933B2 (en) 2001-05-11 2007-01-02 Siemens Corporate Research, Inc. Multi-channel speech enhancement system and method based on psychoacoustic masking effects
CA2992051C (en) 2004-03-01 2019-01-22 Dolby Laboratories Licensing Corporation Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters
US7447317B2 (en) * 2003-10-02 2008-11-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V Compatible multi-channel coding/decoding by weighting the downmix channel
KR20050049103A (ko) * 2003-11-21 2005-05-25 삼성전자주식회사 포만트 대역을 이용한 다이얼로그 인핸싱 방법 및 장치
SE0402652D0 (sv) * 2004-11-02 2004-11-02 Coding Tech Ab Methods for improved performance of prediction based multi- channel reconstruction
DE602006015294D1 (de) 2005-03-30 2010-08-19 Dolby Int Ab Mehrkanal-audiocodierung
RU2376655C2 (ru) * 2005-04-19 2009-12-20 Коудинг Текнолоджиз Аб Зависящее от энергии квантование для эффективного кодирования пространственных параметров звука
EP1946294A2 (en) 2005-06-30 2008-07-23 LG Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
JP5227794B2 (ja) 2005-06-30 2013-07-03 エルジー エレクトロニクス インコーポレイティド オーディオ信号をエンコーディング及びデコーディングするための装置とその方法
EP1906706B1 (en) * 2005-07-15 2009-11-25 Panasonic Corporation Audio decoder
KR101001835B1 (ko) * 2006-03-28 2010-12-15 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 멀티 채널 오디오 재구성에서 신호 셰이핑을 위한 개선 방법
WO2007111568A2 (en) 2006-03-28 2007-10-04 Telefonaktiebolaget L M Ericsson (Publ) Method and arrangement for a decoder for multi-channel surround sound
ATE527833T1 (de) * 2006-05-04 2011-10-15 Lg Electronics Inc Verbesserung von stereo-audiosignalen mittels neuabmischung
TWI308739B (en) 2006-06-23 2009-04-11 Mstar Semiconductor Inc Audio processing circuit and method
WO2008006108A2 (en) 2006-07-07 2008-01-10 Srs Labs, Inc. Systems and methods for multi-dialog surround audio
KR101061132B1 (ko) 2006-09-14 2011-08-31 엘지전자 주식회사 다이알로그 증폭 기술
US7463170B2 (en) 2006-11-30 2008-12-09 Broadcom Corporation Method and system for processing multi-rate audio from a plurality of audio processing sources
US8050434B1 (en) 2006-12-21 2011-11-01 Srs Labs, Inc. Multi-channel audio enhancement system
WO2008100503A2 (en) * 2007-02-12 2008-08-21 Dolby Laboratories Licensing Corporation Improved ratio of speech to non-speech audio such as for elderly or hearing-impaired listeners
KR101336237B1 (ko) * 2007-03-02 2013-12-03 삼성전자주식회사 멀티 채널 스피커 시스템의 멀티 채널 신호 재생 방법 및장치
CA2684975C (en) 2007-04-26 2016-08-02 Dolby Sweden Ab Apparatus and method for synthesizing an output signal
JP5883561B2 (ja) * 2007-10-17 2016-03-15 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ アップミックスを使用した音声符号器
US8831936B2 (en) * 2008-05-29 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
US8315396B2 (en) * 2008-07-17 2012-11-20 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio output signals using object based metadata
US8639502B1 (en) 2009-02-16 2014-01-28 Arrowhead Center, Inc. Speaker model-based speech enhancement system
BR122019023947B1 (pt) 2009-03-17 2021-04-06 Dolby International Ab Sistema codificador, sistema decodificador, método para codificar um sinal estéreo para um sinal de fluxo de bits e método para decodificar um sinal de fluxo de bits para um sinal estéreo
CN102414743A (zh) 2009-04-21 2012-04-11 皇家飞利浦电子股份有限公司 音频信号合成
US8204742B2 (en) 2009-09-14 2012-06-19 Srs Labs, Inc. System for processing an audio signal to enhance speech intelligibility
JP5400225B2 (ja) * 2009-10-05 2014-01-29 ハーマン インターナショナル インダストリーズ インコーポレイテッド オーディオ信号の空間的抽出のためのシステム
KR101411759B1 (ko) * 2009-10-20 2014-06-25 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 오디오 신호 인코더, 오디오 신호 디코더, 앨리어싱-소거를 이용하여 오디오 신호를 인코딩 또는 디코딩하는 방법
TWI459828B (zh) * 2010-03-08 2014-11-01 Dolby Lab Licensing Corp 在多頻道音訊中決定語音相關頻道的音量降低比例的方法及系統
JP5856295B2 (ja) 2011-07-01 2016-02-09 ドルビー ラボラトリーズ ライセンシング コーポレイション 適応的オーディオシステムのための同期及びスイッチオーバ方法及びシステム
US8615394B1 (en) 2012-01-27 2013-12-24 Audience, Inc. Restoration of noise-reduced speech
EP2690621A1 (en) * 2012-07-26 2014-01-29 Thomson Licensing Method and Apparatus for downmixing MPEG SAOC-like encoded audio signals at receiver side in a manner different from the manner of downmixing at encoder side
US9055362B2 (en) 2012-12-19 2015-06-09 Duo Zhang Methods, apparatus and systems for individualizing audio, music and speech adaptively, intelligently and interactively
BR112015029132B1 (pt) 2013-05-24 2022-05-03 Dolby International Ab Método para codificar um mosaico de tempo/frequência de uma cena de áudio, codificador que codifica um mosaico de tempo/frequência de uma cena de áudio, método para decodificar um mosaico de tempo-frequência de uma cena de áudio, decodificador que decodifica um mosaico de tempo-frequência de uma cena de áudio e meio legível em computador.
EP2830047A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for low delay object metadata coding
DK3201918T3 (en) * 2014-10-02 2019-02-25 Dolby Int Ab DECODING PROCEDURE AND DECODS FOR DIALOGUE IMPROVEMENT

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040252850A1 (en) * 2003-04-24 2004-12-16 Lorenzo Turicchia System and method for spectral enhancement employing compression and expansion
US20060271354A1 (en) * 2005-05-31 2006-11-30 Microsoft Corporation Audio codec post-filter
US8577676B2 (en) * 2008-04-18 2013-11-05 Dolby Laboratories Licensing Corporation Method and apparatus for maintaining speech audibility in multi-channel audio with minimal impact on surround experience
US20110119061A1 (en) * 2009-11-17 2011-05-19 Dolby Laboratories Licensing Corporation Method and system for dialog enhancement
TW201325269A (zh) * 2011-07-01 2013-06-16 Dolby Lab Licensing Corp 用於適應性音頻信號的產生、譯碼與呈現之系統與方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Digital Audio Compression (AC-4) Standard, Technical Specification, 20140401 European Telecommunications Standards Institute (ETSI), 650, route des Lucioles ; F-06921 Sophia-Antipolis ; France, Vol:BROADCAS, V1.1.1. *

Also Published As

Publication number Publication date
AU2015326856A1 (en) 2017-04-06
BR112017006325A2 (pt) 2018-01-16
KR20170063667A (ko) 2017-06-08
JP6728146B2 (ja) 2020-07-22
CN106796804B (zh) 2020-09-18
IL251263B (en) 2019-07-31
IL251263A0 (en) 2017-05-29
ES2709327T3 (es) 2019-04-16
DK3201918T3 (en) 2019-02-25
PL3201918T3 (pl) 2019-04-30
RU2017110842A3 (https=) 2019-05-15
MX364166B (es) 2019-04-15
JP2017534904A (ja) 2017-11-24
RU2701055C2 (ru) 2019-09-24
RU2017110842A (ru) 2018-10-01
SG11201702301SA (en) 2017-04-27
US20170309288A1 (en) 2017-10-26
CA2962806C (en) 2023-03-14
UA120372C2 (uk) 2019-11-25
KR102426965B1 (ko) 2022-08-01
US10170131B2 (en) 2019-01-01
BR112017006325B1 (pt) 2023-12-26
CA2962806A1 (en) 2016-04-07
MX2017004194A (es) 2017-05-19
TW201627983A (zh) 2016-08-01
AU2015326856B2 (en) 2021-04-08
EP3201918A1 (en) 2017-08-09
CN106796804A (zh) 2017-05-31
EP3201918B1 (en) 2018-12-12
MY179448A (en) 2020-11-06
WO2016050854A1 (en) 2016-04-07

Similar Documents

Publication Publication Date Title
TWI575510B (zh) 用於增強對話之解碼方法、電腦程式產品及解碼器
US8116459B2 (en) Enhanced method for signal shaping in multi-channel audio reconstruction
JP5818913B2 (ja) 音声信号フレームにおけるイベントのスロット位置の符号化および復号化
US8249883B2 (en) Channel extension coding for multi-channel source
JP6640849B2 (ja) マルチチャネル・オーディオ信号のパラメトリック・エンコードおよびデコード
JP6732739B2 (ja) オーディオ・エンコーダおよびデコーダ
JP2023530409A (ja) マルチチャンネル入力信号内の空間バックグラウンドノイズを符号化および/または復号するための方法およびデバイス
EP3005352B1 (en) Audio object encoding and decoding
HK1235540B (en) Decoding method and decoder for dialog enhancement
HK1235540A1 (en) Decoding method and decoder for dialog enhancement