JP6214765B2 - 音声デコーダ、符号化音声出力データを生成するための装置、及びデコーダの初期化を可能にする方法 - Google Patents

音声デコーダ、符号化音声出力データを生成するための装置、及びデコーダの初期化を可能にする方法 Download PDF

Info

Publication number
JP6214765B2
JP6214765B2 JP2016523221A JP2016523221A JP6214765B2 JP 6214765 B2 JP6214765 B2 JP 6214765B2 JP 2016523221 A JP2016523221 A JP 2016523221A JP 2016523221 A JP2016523221 A JP 2016523221A JP 6214765 B2 JP6214765 B2 JP 6214765B2
Authority
JP
Japan
Prior art keywords
frame
encoded
decoder
special frame
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2016523221A
Other languages
English (en)
Japanese (ja)
Other versions
JP2016539357A (ja
Inventor
ダーニエル フィッシャー、
ダーニエル フィッシャー、
ベルント チェルハン、
ベルント チェルハン、
マックス ノイエンドルフ、
マックス ノイエンドルフ、
ニコラウス レッテルバッハ、
ニコラウス レッテルバッハ、
インゴ ホーフマン、
インゴ ホーフマン、
ハーラルト フックス、
ハーラルト フックス、
シュテファン デーラ、
シュテファン デーラ、
ニコラウス フェルバー、
ニコラウス フェルバー、
Original Assignee
フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー.
フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=49378190&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=JP6214765(B2) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー., フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. filed Critical フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー.
Publication of JP2016539357A publication Critical patent/JP2016539357A/ja
Application granted granted Critical
Publication of JP6214765B2 publication Critical patent/JP6214765B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Mathematical Physics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
JP2016523221A 2013-10-18 2014-10-14 音声デコーダ、符号化音声出力データを生成するための装置、及びデコーダの初期化を可能にする方法 Active JP6214765B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP13189328.1A EP2863386A1 (en) 2013-10-18 2013-10-18 Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder
EP13189328.1 2013-10-18
PCT/EP2014/072063 WO2015055683A1 (en) 2013-10-18 2014-10-14 Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder

Publications (2)

Publication Number Publication Date
JP2016539357A JP2016539357A (ja) 2016-12-15
JP6214765B2 true JP6214765B2 (ja) 2017-10-18

Family

ID=49378190

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2016523221A Active JP6214765B2 (ja) 2013-10-18 2014-10-14 音声デコーダ、符号化音声出力データを生成するための装置、及びデコーダの初期化を可能にする方法

Country Status (19)

Country Link
US (11) US9928845B2 (https=)
EP (2) EP2863386A1 (https=)
JP (1) JP6214765B2 (https=)
KR (1) KR101809390B1 (https=)
CN (2) CN105745704B (https=)
AR (1) AR098075A1 (https=)
AU (1) AU2014336243B2 (https=)
BR (4) BR122021004490B1 (https=)
CA (1) CA2925653C (https=)
ES (1) ES2644370T3 (https=)
MX (1) MX355274B (https=)
MY (1) MY177213A (https=)
PL (1) PL3044782T3 (https=)
PT (1) PT3044782T (https=)
RU (1) RU2651190C2 (https=)
SG (1) SG11201602971SA (https=)
TW (1) TWI579832B (https=)
WO (1) WO2015055683A1 (https=)
ZA (1) ZA201603154B (https=)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2863386A1 (en) * 2013-10-18 2015-04-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder
AU2018208522B2 (en) * 2017-01-10 2020-07-02 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier
CN115691519A (zh) 2018-02-22 2023-02-03 杜比国际公司 用于处理嵌入在mpeg-h 3d音频流中的辅媒体流的方法及设备
US10586546B2 (en) 2018-04-26 2020-03-10 Qualcomm Incorporated Inversely enumerated pyramid vector quantizers for efficient rate adaptation in audio coding
US10734006B2 (en) 2018-06-01 2020-08-04 Qualcomm Incorporated Audio coding based on audio pattern recognition
US10580424B2 (en) * 2018-06-01 2020-03-03 Qualcomm Incorporated Perceptual audio coding as sequential decision-making problems
BR112021003104A2 (pt) * 2018-08-21 2021-05-11 Dolby International Ab métodos, aparelho e sistemas para geração, transporte e processamento de quadros de reprodução imediata (ipfs)
JP7576582B2 (ja) 2019-07-02 2024-10-31 ドルビー・インターナショナル・アーベー 離散指向性情報の表現、符号化、および復号化のための方法、装置、およびシステム
EP4002358A4 (en) * 2019-07-19 2023-03-22 Intellectual Discovery Co., Ltd. ADAPTIVE AUDIO PROCESSING METHOD, APPARATUS, COMPUTER PROGRAM AND RECORDING MEDIUM THEREFORE IN A WIRELESS COMMUNICATIONS SYSTEM
US12205607B2 (en) * 2019-08-15 2025-01-21 Dolby Laboratories Licensing Corporation Methods and devices for generation and processing of modified bitstreams
CN113518322B (zh) * 2020-04-10 2024-09-17 华为技术有限公司 无线通信方法和通信装置
EP4154249B1 (en) 2020-05-20 2024-01-24 Dolby International AB Methods and apparatus for unified speech and audio decoding improvements
GB2614482A (en) * 2020-09-25 2023-07-05 Apple Inc Seamless scalable decoding of channels, objects, and hoa audio content
WO2022135507A1 (en) * 2020-12-23 2022-06-30 Beijing Bytedance Network Technology Co., Ltd. Video decoder initialization information
CN114093375B (zh) 2021-03-02 2025-09-12 北京沃东天骏信息技术有限公司 解码方法、装置和计算机可读存储介质
US12568238B2 (en) * 2021-04-12 2026-03-03 Lg Electronics Inc. Method for image coding based on signaling of information related to decoder initialization
CN118103906A (zh) 2021-08-19 2024-05-28 弗劳恩霍夫应用研究促进协会 音频编码器、用于提供音频信息的编码表示的方法、计算机程序、以及使用立即播出帧的编码音频表示
US12184425B2 (en) * 2021-09-21 2024-12-31 Qualcomm Incorporated Lossy compressed feedback for multiple incremental redundancy scheme (MIRS)
DE102021006419A1 (de) 2021-12-30 2023-07-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung eingetragener Verein Streaming-Techniken
CN115240692A (zh) * 2022-06-30 2022-10-25 哲库科技(上海)有限公司 接收音频数据的方法、装置以及音频播放设备
CN121794991A (zh) * 2023-08-29 2026-04-03 三星电子株式会社 用于沉浸式画面的显示装置及其控制方法
US20250086956A1 (en) * 2023-09-13 2025-03-13 Qualcomm Incorporated Multi-view convolutional neural networks for video processing
GB2636866A (en) * 2023-12-28 2025-07-02 Nokia Technologies Oy An apparatus and method for immersive audio rendering

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100304092B1 (ko) 1998-03-11 2001-09-26 마츠시타 덴끼 산교 가부시키가이샤 오디오 신호 부호화 장치, 오디오 신호 복호화 장치 및 오디오 신호 부호화/복호화 장치
US7315815B1 (en) * 1999-09-22 2008-01-01 Microsoft Corporation LPC-harmonic vocoder with superframe structure
US6604070B1 (en) * 1999-09-22 2003-08-05 Conexant Systems, Inc. System of encoding and decoding speech signals
US7460629B2 (en) * 2001-06-29 2008-12-02 Agere Systems Inc. Method and apparatus for frame-based buffer control in a communication system
JP2003273939A (ja) * 2002-03-13 2003-09-26 Nec Corp 多重伝送システムおよび変換装置と警報転送方法
US7536305B2 (en) * 2002-09-04 2009-05-19 Microsoft Corporation Mixed lossless audio compression
US7392195B2 (en) 2004-03-25 2008-06-24 Dts, Inc. Lossless multi-channel audio codec
WO2005099243A1 (ja) * 2004-04-09 2005-10-20 Nec Corporation 音声通信方法及び装置
US7596486B2 (en) * 2004-05-19 2009-09-29 Nokia Corporation Encoding an audio signal using different audio coder modes
DE102004043521A1 (de) 2004-09-08 2006-03-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zum Erzeugen eines Multikanalsignals oder eines Parameterdatensatzes
US7610195B2 (en) * 2006-06-01 2009-10-27 Nokia Corporation Decoding of predictively coded data using buffer adaptation
RU2452042C1 (ru) * 2008-03-04 2012-05-27 ЭлДжи ЭЛЕКТРОНИКС ИНК. Способ и устройство для обработки аудиосигнала
EP2131590A1 (en) * 2008-06-02 2009-12-09 Deutsche Thomson OHG Method and apparatus for generating or cutting or changing a frame based bit stream format file including at least one header section, and a corresponding data structure
US8380523B2 (en) * 2008-07-07 2013-02-19 Lg Electronics Inc. Method and an apparatus for processing an audio signal
PL3002750T3 (pl) * 2008-07-11 2018-06-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Koder i dekoder audio do kodowania i dekodowania próbek audio
EP2224433B1 (en) * 2008-09-25 2020-05-27 Lg Electronics Inc. An apparatus for processing an audio signal and method thereof
US9237387B2 (en) * 2009-10-06 2016-01-12 Microsoft Technology Licensing, Llc Low latency cacheable media streaming
US8428936B2 (en) 2010-03-05 2013-04-23 Motorola Mobility Llc Decoder for audio signal including generic audio and speech frames
CN102934161B (zh) * 2010-06-14 2015-08-26 松下电器产业株式会社 音频混合编码装置以及音频混合解码装置
US8948249B2 (en) * 2011-08-19 2015-02-03 Google Technology Holdings LLC Encoder-aided segmentation for adaptive streaming
US20140109153A1 (en) * 2012-10-11 2014-04-17 Affirmed Networks, Inc. Expansion of a Stream Set and Transcoding of HTTP Adaptive Streaming Videos in a Mobile Network
EP2863386A1 (en) * 2013-10-18 2015-04-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder

Also Published As

Publication number Publication date
MX355274B (es) 2018-04-13
EP2863386A1 (en) 2015-04-22
JP2016539357A (ja) 2016-12-15
US20240203434A1 (en) 2024-06-20
US20220215850A1 (en) 2022-07-07
CN110444218A (zh) 2019-11-12
CN105745704A (zh) 2016-07-06
US11423919B2 (en) 2022-08-23
US9928845B2 (en) 2018-03-27
US20230335146A1 (en) 2023-10-19
EP3044782B1 (en) 2017-09-06
US12170093B2 (en) 2024-12-17
US10229694B2 (en) 2019-03-12
PT3044782T (pt) 2017-12-04
CA2925653C (en) 2018-07-24
US20160232910A1 (en) 2016-08-11
AR098075A1 (es) 2016-04-27
CN110444218B (zh) 2023-10-24
ZA201603154B (en) 2017-11-29
TW201523587A (zh) 2015-06-16
BR122021004494B1 (pt) 2023-02-23
US12165664B2 (en) 2024-12-10
RU2016118985A (ru) 2017-11-23
TWI579832B (zh) 2017-04-21
US20240212697A1 (en) 2024-06-27
ES2644370T3 (es) 2017-11-28
US20180197556A1 (en) 2018-07-12
US20250061906A1 (en) 2025-02-20
US12094479B2 (en) 2024-09-17
WO2015055683A1 (en) 2015-04-23
US20200234726A1 (en) 2020-07-23
US12080309B2 (en) 2024-09-03
BR122021004490B1 (pt) 2023-02-23
BR122021004485B1 (pt) 2023-02-28
CN105745704B (zh) 2019-08-23
AU2014336243A1 (en) 2016-05-26
KR20160060686A (ko) 2016-05-30
US20240203432A1 (en) 2024-06-20
US20240203433A1 (en) 2024-06-20
CA2925653A1 (en) 2015-04-23
MX2016004845A (es) 2016-07-26
US10614824B2 (en) 2020-04-07
BR112016008415B1 (pt) 2022-11-29
US11670314B2 (en) 2023-06-06
MY177213A (en) 2020-09-09
PL3044782T3 (pl) 2018-02-28
AU2014336243B2 (en) 2017-02-02
BR112016008415A2 (https=) 2017-08-22
KR101809390B1 (ko) 2018-01-18
RU2651190C2 (ru) 2018-04-18
EP3044782A1 (en) 2016-07-20
US20190156844A1 (en) 2019-05-23
SG11201602971SA (en) 2016-05-30
US12094478B2 (en) 2024-09-17

Similar Documents

Publication Publication Date Title
JP6214765B2 (ja) 音声デコーダ、符号化音声出力データを生成するための装置、及びデコーダの初期化を可能にする方法
JP2013535023A (ja) 基本層および少なくとも一つの向上層を含む層構造の階層的ビットストリームを探索し、再生する方法および装置
HK40020053A (en) Method and apparatus for encoding and decoding audio data
HK1226549A1 (en) Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder
HK1226549B (en) Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder
HK40020053B (zh) 用於编码和解码音频数据的装置以及方法

Legal Events

Date Code Title Description
A529 Written submission of copy of amendment under article 34 pct

Free format text: JAPANESE INTERMEDIATE CODE: A529

Effective date: 20160414

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20160414

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20170516

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20170808

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20170822

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20170919

R150 Certificate of patent or registration of utility model

Ref document number: 6214765

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250