JP2023526627A - 音声音響統合復号の向上のための方法及び機器 - Google Patents

音声音響統合復号の向上のための方法及び機器 Download PDF

Info

Publication number
JP2023526627A
JP2023526627A JP2022570444A JP2022570444A JP2023526627A JP 2023526627 A JP2023526627 A JP 2023526627A JP 2022570444 A JP2022570444 A JP 2022570444A JP 2022570444 A JP2022570444 A JP 2022570444A JP 2023526627 A JP2023526627 A JP 2023526627A
Authority
JP
Japan
Prior art keywords
usac
decoder
bitstream
configuration
current
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2022570444A
Other languages
English (en)
Japanese (ja)
Inventor
フランツ ベーア,ミヒャエル
ルービン,エイタン
フィッシャー,ダニエル
フェルシュ,クリストフ
ヴェルナー,マルクス
Original Assignee
ドルビー・インターナショナル・アーベー
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ドルビー・インターナショナル・アーベー filed Critical ドルビー・インターナショナル・アーベー
Publication of JP2023526627A publication Critical patent/JP2023526627A/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07Line spectrum pair [LSP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
JP2022570444A 2020-05-20 2021-05-18 音声音響統合復号の向上のための方法及び機器 Pending JP2023526627A (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US202063027594P 2020-05-20 2020-05-20
EP20175652 2020-05-20
US63/027,594 2020-05-20
EP20175652.5 2020-05-20
PCT/EP2021/063092 WO2021233886A2 (en) 2020-05-20 2021-05-18 Methods and apparatus for unified speech and audio decoding improvements

Publications (1)

Publication Number Publication Date
JP2023526627A true JP2023526627A (ja) 2023-06-22

Family

ID=75904960

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022570444A Pending JP2023526627A (ja) 2020-05-20 2021-05-18 音声音響統合復号の向上のための方法及び機器

Country Status (8)

Country Link
US (1) US20230186928A1 (de)
EP (1) EP4154249B1 (de)
JP (1) JP2023526627A (de)
KR (1) KR20230011416A (de)
CN (1) CN115668365A (de)
BR (1) BR112022023245A2 (de)
ES (1) ES2972833T3 (de)
WO (1) WO2021233886A2 (de)

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
PL3352168T3 (pl) * 2009-06-23 2021-03-08 Voiceage Corporation Postępowe tłumienie aliasingu w dziedzinie czasu z zastosowaniem w ważonej albo pierwotnej dziedzinie sygnału
US9093066B2 (en) * 2010-01-13 2015-07-28 Voiceage Corporation Forward time-domain aliasing cancellation using linear-predictive filtering to cancel time reversed and zero input responses of adjacent frames
CN117037804A (zh) * 2017-01-10 2023-11-10 弗劳恩霍夫应用研究促进协会 音频解码器和编码器、提供解码的音频信号的方法、提供编码的音频信号的方法、使用流标识符的音频流、音频流提供器和计算机程序
WO2019121982A1 (en) * 2017-12-19 2019-06-27 Dolby International Ab Methods and apparatus for unified speech and audio decoding qmf based harmonic transposer improvements

Also Published As

Publication number Publication date
EP4154249B1 (de) 2024-01-24
WO2021233886A2 (en) 2021-11-25
KR20230011416A (ko) 2023-01-20
EP4154249C0 (de) 2024-01-24
US20230186928A1 (en) 2023-06-15
WO2021233886A3 (en) 2021-12-30
CN115668365A (zh) 2023-01-31
BR112022023245A2 (pt) 2022-12-20
ES2972833T3 (es) 2024-06-17
EP4154249A2 (de) 2023-03-29

Similar Documents

Publication Publication Date Title
JP5208901B2 (ja) 音声信号および音楽信号を符号化する方法
JP5722040B2 (ja) スケーラブルなスピーチおよびオーディオコーデックにおける、量子化mdctスペクトルに対するコードブックインデックスのエンコーディング/デコーディングのための技術
JP5171842B2 (ja) 時間領域データストリームを表している符号化および復号化のための符号器、復号器およびその方法
RU2459282C2 (ru) Масштабируемое кодирование речи и аудио с использованием комбинаторного кодирования mdct-спектра
JP5606433B2 (ja) オーディオエンコーダ及びオーディオデコーダ
RU2584463C2 (ru) Кодирование звука с малой задержкой, содержащее чередующиеся предсказательное кодирование и кодирование с преобразованием
US20080010062A1 (en) Adaptive encoding and decoding methods and apparatuses
KR20100007738A (ko) 음성/오디오 통합 신호의 부호화/복호화 장치
JP2010539625A (ja) スピーチおよびオーディオコーディングアプリケーションのためのmdct/imdctフィルタバンクの効率的な設計
JP6687599B2 (ja) Fd/lpd遷移コンテキストにおけるフレーム喪失管理
WO2013061584A1 (ja) 音信号ハイブリッドデコーダ、音信号ハイブリッドエンコーダ、音信号復号方法、及び音信号符号化方法
RU2675216C1 (ru) Переход от кодирования/декодирования с преобразованием к кодированию/декодированию с предсказанием
JP2019194711A (ja) スムーズな遷移を取得するために、ゼロ入力応答を用いるオーディオ・デコーダ、方法及びコンピュータ・プログラム
JP2005515486A (ja) Celpによる音声符号間のトランスコーディング・スキーム
JP2023526627A (ja) 音声音響統合復号の向上のための方法及び機器
RU2574849C2 (ru) Устройство и способ для кодирования и декодирования аудиосигнала с использованием выровненной части опережающего просмотра

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230213

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20240513