JP6728146B2 - ダイアログ向上のためのデコード方法およびデコーダ - Google Patents

ダイアログ向上のためのデコード方法およびデコーダ Download PDF

Info

Publication number
JP6728146B2
JP6728146B2 JP2017517237A JP2017517237A JP6728146B2 JP 6728146 B2 JP6728146 B2 JP 6728146B2 JP 2017517237 A JP2017517237 A JP 2017517237A JP 2017517237 A JP2017517237 A JP 2017517237A JP 6728146 B2 JP6728146 B2 JP 6728146B2
Authority
JP
Japan
Prior art keywords
dialog
parameter
parameters
subset
channels
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2017517237A
Other languages
English (en)
Japanese (ja)
Other versions
JP2017534904A (ja
JP2017534904A5 (https=
Inventor
コッペンス,イェルーン
エクストランド,ペール
Original Assignee
ドルビー・インターナショナル・アーベー
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ドルビー・インターナショナル・アーベー filed Critical ドルビー・インターナショナル・アーベー
Publication of JP2017534904A publication Critical patent/JP2017534904A/ja
Publication of JP2017534904A5 publication Critical patent/JP2017534904A5/ja
Application granted granted Critical
Publication of JP6728146B2 publication Critical patent/JP6728146B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/03Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Telephonic Communication Services (AREA)
JP2017517237A 2014-10-02 2015-09-30 ダイアログ向上のためのデコード方法およびデコーダ Active JP6728146B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US201462059015P 2014-10-02 2014-10-02
US62/059,015 2014-10-02
US201562128331P 2015-03-04 2015-03-04
US62/128,331 2015-03-04
PCT/EP2015/072578 WO2016050854A1 (en) 2014-10-02 2015-09-30 Decoding method and decoder for dialog enhancement

Publications (3)

Publication Number Publication Date
JP2017534904A JP2017534904A (ja) 2017-11-24
JP2017534904A5 JP2017534904A5 (https=) 2018-11-08
JP6728146B2 true JP6728146B2 (ja) 2020-07-22

Family

ID=54199263

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2017517237A Active JP6728146B2 (ja) 2014-10-02 2015-09-30 ダイアログ向上のためのデコード方法およびデコーダ

Country Status (19)

Country Link
US (1) US10170131B2 (https=)
EP (1) EP3201918B1 (https=)
JP (1) JP6728146B2 (https=)
KR (1) KR102426965B1 (https=)
CN (1) CN106796804B (https=)
AU (1) AU2015326856B2 (https=)
BR (1) BR112017006325B1 (https=)
CA (1) CA2962806C (https=)
DK (1) DK3201918T3 (https=)
ES (1) ES2709327T3 (https=)
IL (1) IL251263B (https=)
MX (1) MX364166B (https=)
MY (1) MY179448A (https=)
PL (1) PL3201918T3 (https=)
RU (1) RU2701055C2 (https=)
SG (1) SG11201702301SA (https=)
TW (1) TWI575510B (https=)
UA (1) UA120372C2 (https=)
WO (1) WO2016050854A1 (https=)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DK3201918T3 (en) * 2014-10-02 2019-02-25 Dolby Int Ab DECODING PROCEDURE AND DECODS FOR DIALOGUE IMPROVEMENT
CN106303897A (zh) 2015-06-01 2017-01-04 杜比实验室特许公司 处理基于对象的音频信号
EP3409029B1 (en) 2016-01-29 2024-10-30 Dolby Laboratories Licensing Corporation Binaural dialogue enhancement
TWI658458B (zh) * 2018-05-17 2019-05-01 張智星 歌聲分離效能提升之方法、非暫態電腦可讀取媒體及電腦程式產品
WO2020216459A1 (en) * 2019-04-23 2020-10-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method or computer program for generating an output downmix representation
JP7810612B2 (ja) * 2022-06-16 2026-02-03 シャープ株式会社 放送システム、受信機、受信方法、及びプログラム

Family Cites Families (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6463410B1 (en) * 1998-10-13 2002-10-08 Victor Company Of Japan, Ltd. Audio signal processing apparatus
US7158933B2 (en) 2001-05-11 2007-01-02 Siemens Corporate Research, Inc. Multi-channel speech enhancement system and method based on psychoacoustic masking effects
CA2992051C (en) 2004-03-01 2019-01-22 Dolby Laboratories Licensing Corporation Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters
WO2004097799A1 (en) 2003-04-24 2004-11-11 Massachusetts Institute Of Technology System and method for spectral enhancement employing compression and expansion
US7447317B2 (en) * 2003-10-02 2008-11-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V Compatible multi-channel coding/decoding by weighting the downmix channel
KR20050049103A (ko) * 2003-11-21 2005-05-25 삼성전자주식회사 포만트 대역을 이용한 다이얼로그 인핸싱 방법 및 장치
SE0402652D0 (sv) * 2004-11-02 2004-11-02 Coding Tech Ab Methods for improved performance of prediction based multi- channel reconstruction
DE602006015294D1 (de) 2005-03-30 2010-08-19 Dolby Int Ab Mehrkanal-audiocodierung
RU2376655C2 (ru) * 2005-04-19 2009-12-20 Коудинг Текнолоджиз Аб Зависящее от энергии квантование для эффективного кодирования пространственных параметров звука
US7707034B2 (en) * 2005-05-31 2010-04-27 Microsoft Corporation Audio codec post-filter
EP1946294A2 (en) 2005-06-30 2008-07-23 LG Electronics Inc. Apparatus for encoding and decoding audio signal and method thereof
JP5227794B2 (ja) 2005-06-30 2013-07-03 エルジー エレクトロニクス インコーポレイティド オーディオ信号をエンコーディング及びデコーディングするための装置とその方法
EP1906706B1 (en) * 2005-07-15 2009-11-25 Panasonic Corporation Audio decoder
KR101001835B1 (ko) * 2006-03-28 2010-12-15 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 멀티 채널 오디오 재구성에서 신호 셰이핑을 위한 개선 방법
WO2007111568A2 (en) 2006-03-28 2007-10-04 Telefonaktiebolaget L M Ericsson (Publ) Method and arrangement for a decoder for multi-channel surround sound
ATE527833T1 (de) * 2006-05-04 2011-10-15 Lg Electronics Inc Verbesserung von stereo-audiosignalen mittels neuabmischung
TWI308739B (en) 2006-06-23 2009-04-11 Mstar Semiconductor Inc Audio processing circuit and method
WO2008006108A2 (en) 2006-07-07 2008-01-10 Srs Labs, Inc. Systems and methods for multi-dialog surround audio
KR101061132B1 (ko) 2006-09-14 2011-08-31 엘지전자 주식회사 다이알로그 증폭 기술
US7463170B2 (en) 2006-11-30 2008-12-09 Broadcom Corporation Method and system for processing multi-rate audio from a plurality of audio processing sources
US8050434B1 (en) 2006-12-21 2011-11-01 Srs Labs, Inc. Multi-channel audio enhancement system
WO2008100503A2 (en) * 2007-02-12 2008-08-21 Dolby Laboratories Licensing Corporation Improved ratio of speech to non-speech audio such as for elderly or hearing-impaired listeners
KR101336237B1 (ko) * 2007-03-02 2013-12-03 삼성전자주식회사 멀티 채널 스피커 시스템의 멀티 채널 신호 재생 방법 및장치
CA2684975C (en) 2007-04-26 2016-08-02 Dolby Sweden Ab Apparatus and method for synthesizing an output signal
JP5883561B2 (ja) * 2007-10-17 2016-03-15 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ アップミックスを使用した音声符号器
MY159890A (en) * 2008-04-18 2017-02-15 Dolby Laboratories Licensing Corp Method and apparatus for maintaining speech audibiliy in multi-channel audio with minimal impact on surround experience
US8831936B2 (en) * 2008-05-29 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
US8315396B2 (en) * 2008-07-17 2012-11-20 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating audio output signals using object based metadata
US8639502B1 (en) 2009-02-16 2014-01-28 Arrowhead Center, Inc. Speaker model-based speech enhancement system
BR122019023947B1 (pt) 2009-03-17 2021-04-06 Dolby International Ab Sistema codificador, sistema decodificador, método para codificar um sinal estéreo para um sinal de fluxo de bits e método para decodificar um sinal de fluxo de bits para um sinal estéreo
CN102414743A (zh) 2009-04-21 2012-04-11 皇家飞利浦电子股份有限公司 音频信号合成
US8204742B2 (en) 2009-09-14 2012-06-19 Srs Labs, Inc. System for processing an audio signal to enhance speech intelligibility
JP5400225B2 (ja) * 2009-10-05 2014-01-29 ハーマン インターナショナル インダストリーズ インコーポレイテッド オーディオ信号の空間的抽出のためのシステム
KR101411759B1 (ko) * 2009-10-20 2014-06-25 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. 오디오 신호 인코더, 오디오 신호 디코더, 앨리어싱-소거를 이용하여 오디오 신호를 인코딩 또는 디코딩하는 방법
US9324337B2 (en) 2009-11-17 2016-04-26 Dolby Laboratories Licensing Corporation Method and system for dialog enhancement
TWI459828B (zh) * 2010-03-08 2014-11-01 Dolby Lab Licensing Corp 在多頻道音訊中決定語音相關頻道的音量降低比例的方法及系統
JP5856295B2 (ja) 2011-07-01 2016-02-09 ドルビー ラボラトリーズ ライセンシング コーポレイション 適応的オーディオシステムのための同期及びスイッチオーバ方法及びシステム
KR102185941B1 (ko) * 2011-07-01 2020-12-03 돌비 레버러토리즈 라이쎈싱 코오포레이션 적응형 오디오 신호 생성, 코딩 및 렌더링을 위한 시스템 및 방법
US8615394B1 (en) 2012-01-27 2013-12-24 Audience, Inc. Restoration of noise-reduced speech
EP2690621A1 (en) * 2012-07-26 2014-01-29 Thomson Licensing Method and Apparatus for downmixing MPEG SAOC-like encoded audio signals at receiver side in a manner different from the manner of downmixing at encoder side
US9055362B2 (en) 2012-12-19 2015-06-09 Duo Zhang Methods, apparatus and systems for individualizing audio, music and speech adaptively, intelligently and interactively
BR112015029132B1 (pt) 2013-05-24 2022-05-03 Dolby International Ab Método para codificar um mosaico de tempo/frequência de uma cena de áudio, codificador que codifica um mosaico de tempo/frequência de uma cena de áudio, método para decodificar um mosaico de tempo-frequência de uma cena de áudio, decodificador que decodifica um mosaico de tempo-frequência de uma cena de áudio e meio legível em computador.
EP2830047A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for low delay object metadata coding
DK3201918T3 (en) * 2014-10-02 2019-02-25 Dolby Int Ab DECODING PROCEDURE AND DECODS FOR DIALOGUE IMPROVEMENT

Also Published As

Publication number Publication date
AU2015326856A1 (en) 2017-04-06
BR112017006325A2 (pt) 2018-01-16
KR20170063667A (ko) 2017-06-08
CN106796804B (zh) 2020-09-18
IL251263B (en) 2019-07-31
IL251263A0 (en) 2017-05-29
ES2709327T3 (es) 2019-04-16
DK3201918T3 (en) 2019-02-25
PL3201918T3 (pl) 2019-04-30
RU2017110842A3 (https=) 2019-05-15
MX364166B (es) 2019-04-15
JP2017534904A (ja) 2017-11-24
RU2701055C2 (ru) 2019-09-24
RU2017110842A (ru) 2018-10-01
SG11201702301SA (en) 2017-04-27
US20170309288A1 (en) 2017-10-26
CA2962806C (en) 2023-03-14
UA120372C2 (uk) 2019-11-25
KR102426965B1 (ko) 2022-08-01
US10170131B2 (en) 2019-01-01
BR112017006325B1 (pt) 2023-12-26
CA2962806A1 (en) 2016-04-07
MX2017004194A (es) 2017-05-19
TWI575510B (zh) 2017-03-21
TW201627983A (zh) 2016-08-01
AU2015326856B2 (en) 2021-04-08
EP3201918A1 (en) 2017-08-09
CN106796804A (zh) 2017-05-31
EP3201918B1 (en) 2018-12-12
MY179448A (en) 2020-11-06
WO2016050854A1 (en) 2016-04-07

Similar Documents

Publication Publication Date Title
JP6728146B2 (ja) ダイアログ向上のためのデコード方法およびデコーダ
JP5158814B2 (ja) 減数されたチャネルへの復号化
RU2625444C2 (ru) Система обработки аудио
RU2639952C2 (ru) Гибридное усиление речи с кодированием формы сигнала и параметрическим кодированием
JP6231093B2 (ja) オーディオ信号の符号化及び復号
KR101761569B1 (ko) 오디오 현장의 코딩
CN101151658B (zh) 多声道音频编码和解码方法、编码器和解码器
JP6640849B2 (ja) マルチチャネル・オーディオ信号のパラメトリック・エンコードおよびデコード
JP6732739B2 (ja) オーディオ・エンコーダおよびデコーダ
KR101756838B1 (ko) 다채널 오디오 신호를 다운 믹스하는 방법 및 장치
JP6248186B2 (ja) オーディオ・エンコードおよびデコード方法、対応するコンピュータ可読媒体ならびに対応するオーディオ・エンコーダおよびデコーダ
TW202101427A (zh) 降混器及降混方法
HK1235540A1 (en) Decoding method and decoder for dialog enhancement
HK1235540B (en) Decoding method and decoder for dialog enhancement

Legal Events

Date Code Title Description
A529 Written submission of copy of amendment under article 34 pct

Free format text: JAPANESE INTERMEDIATE CODE: A529

Effective date: 20170424

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20180925

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20180925

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20191015

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20191023

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20200116

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20200602

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20200701

R150 Certificate of patent or registration of utility model

Ref document number: 6728146

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250