KR102741508B1 - Encoding device and encoding method, decoding device and decoding method, and program - Google Patents

Encoding device and encoding method, decoding device and decoding method, and program

Info

Publication number
KR102741508B1
Authority
KR
South Korea
Prior art keywords
priority information
audio signal
unit
decoding
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020237005472A
Other languages
English (en)
Korean (ko)
Other versions
KR20230027329A (ko)
Inventor
Toru Chinen
Masayuki Nishiguchi
Runyu Shi
Mitsuyuki Hatanaka
Yuki Yamamoto
Original Assignee
Sony Group Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Group Corporation
Priority to KR1020247040609A (KR20250002792A)
Publication of KR20230027329A
Application granted
Publication of KR102741508B1
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/22 Mode decision, i.e. based on audio signal content versus external parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
KR1020237005472A 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program Active KR102741508B1 (ko)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020247040609A KR20250002792A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
JPJP-P-2014-060486 2014-03-24
JP2014060486 2014-03-24
JPJP-P-2014-136633 2014-07-02
JP2014136633A JP6439296B2 (ja) 2014-03-24 2014-07-02 Decoding device and method, and program
KR1020217028231A KR20210111897A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program
PCT/JP2015/001432 WO2015146057A1 (en) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
KR1020217028231A Division KR20210111897A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Related Child Applications (1)

Application Number Title Priority Date Filing Date
KR1020247040609A Division KR20250002792A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Publications (2)

Publication Number Publication Date
KR20230027329A (ko) 2023-02-27
KR102741508B1 (ko) 2024-12-12

Family

ID=53039543

Family Applications (4)

Application Number Title Priority Date Filing Date
KR1020237005472A Active KR102741508B1 (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program
KR1020217028231A Ceased KR20210111897A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program
KR1020167021269A Active KR102300062B1 (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program
KR1020247040609A Pending KR20250002792A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Family Applications After (3)

Application Number Title Priority Date Filing Date
KR1020217028231A Ceased KR20210111897A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program
KR1020167021269A Active KR102300062B1 (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program
KR1020247040609A Pending KR20250002792A (ko) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Country Status (8)

Country Link
US (4) US20180033440A1
EP (3) EP3123470B1
JP (1) JP6439296B2
KR (4) KR102741508B1
CN (2) CN111489758B
BR (1) BR112016021407B1
RU (2) RU2019112504A
WO (1) WO2015146057A1

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
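WO2015056383A1 (ja) * 2013-10-17 2015-04-23 Panasonic Corporation Audio encoding device and audio decoding device
JP6777071B2 (ja) * 2015-04-08 2020-10-28 Sony Corporation Transmission device, transmission method, reception device, and reception method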
US10477269B2 (en) * 2015-04-08 2019-11-12 Sony Corporation Transmission apparatus, transmission method, reception apparatus, and reception method
US10424307B2 (en) * 2017-01-03 2019-09-24 Nokia Technologies Oy Adapting a distributed audio recording for end user free viewpoint monitoring
US10891962B2 (en) * 2017-03-06 2021-01-12 Dolby International Ab Integrated reconstruction and rendering of audio signals
US11574644B2 (en) 2017-04-26 2023-02-07 Sony Corporation Signal processing device and method, and program
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
US10657974B2 (en) * 2017-12-21 2020-05-19 Qualcomm Incorporated Priority information for higher order ambisonic audio data
US11270711B2 (en) 2017-12-21 2022-03-08 Qualcomm Incorporated Higher order ambisonic audio data
GB2578715A (en) * 2018-07-20 2020-05-27 Nokia Technologies Oy Controlling audio focus for spatial audio processing
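KR102677399B1 (ko) 2018-10-16 2024-06-24 Sony Group Corporation Signal processing device and method, and program
CN111081226B (zh) * 2018-10-18 2024-02-13 Beijing Sogou Technology Development Co., Ltd. Speech recognition decoding optimization method and device
KR20210092728A (ko) * 2018-11-20 2021-07-26 Sony Group Corporation Information processing device and method, and program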
US20230105632A1 (en) * 2020-04-01 2023-04-06 Sony Group Corporation Signal processing apparatus and method, and program
EP4210048A4 (en) * 2020-09-03 2024-02-21 Sony Group Corporation SIGNAL PROCESSING APPARATUS AND METHOD, LEARNING APPARATUS AND METHOD AND PROGRAM
GB2614482A (en) * 2020-09-25 2023-07-05 Apple Inc Seamless scalable decoding of channels, objects, and hoa audio content
CN112634914B (zh) * 2020-12-15 2024-03-29 University of Science and Technology of China Neural network vocoder training method based on short-time spectral consistency
US11710491B2 (en) * 2021-04-20 2023-07-25 Tencent America LLC Method and apparatus for space of interest of audio scene
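CN114974273B (zh) * 2021-08-10 2023-08-15 China Mobile Internet Co., Ltd. Conference audio mixing method and device
CN114550732B (zh) * 2022-04-15 2022-07-08 Tencent Technology (Shenzhen) Co., Ltd. Encoding and decoding method for high-frequency audio signals and related device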

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011020065A1 (en) * 2009-08-14 2011-02-17 Srs Labs, Inc. Object-oriented audio streaming system
WO2012125855A1 (en) * 2011-03-16 2012-09-20 Dts, Inc. Encoding and reproduction of three dimensional audio soundtracks

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6330644B1 (en) * 1994-10-27 2001-12-11 Canon Kabushiki Kaisha Signal processor with a plurality of kinds of processors and a shared memory accessed through a versatile control means
JP3519722B2 (ja) * 1997-03-17 2004-04-19 Matsushita Electric Industrial Co., Ltd. Data processing method and data processing device
SE512719C2 (sv) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd A method and device for data flow reduction based on harmonic bandwidth expansion
US6230130B1 (en) * 1998-05-18 2001-05-08 U.S. Philips Corporation Scalable mixing for speech streaming
JP2005292702A (ja) * 2004-04-05 2005-10-20 Kddi Corp Fade-in/fade-out processing device and program for audio frames
US8787594B1 (en) * 2005-01-28 2014-07-22 Texas Instruments Incorporated Multi-stream audio level controller
RU2383941C2 (ru) * 2005-06-30 2010-03-10 LG Electronics Inc. Method and device for encoding and decoding audio signals
US7974422B1 (en) * 2005-08-25 2011-07-05 Tp Lab, Inc. System and method of adjusting the sound of multiple audio objects directed toward an audio output device
JP4396683B2 (ja) * 2006-10-02 2010-01-13 Casio Computer Co., Ltd. Speech encoding device, speech encoding method, and program
RU2431940C2 (ru) * 2006-10-16 2011-10-20 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for multichannel parametric transformation
US8085786B2 (en) * 2007-03-16 2011-12-27 Qualcomm Incorporated H-ARQ throughput optimization by prioritized decoding
FR2929466A1 (fr) * 2008-03-28 2009-10-02 France Telecom Transmission error concealment in a digital signal in a hierarchical decoding structure
CA2781310C (en) * 2009-11-20 2015-12-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus for providing an upmix signal representation on the basis of the downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods, computer programs and bitstream representing a multi-channel audio signal using a linear combination parameter
US9531761B2 (en) * 2010-07-01 2016-12-27 Broadcom Corporation Method and system for prioritizing and scheduling services in an IP multimedia network
JP2012108451A (ja) * 2010-10-18 2012-06-07 Sony Corp Audio processing device and method, and program
WO2013181272A2 (en) * 2012-05-31 2013-12-05 Dts Llc Object-based audio system using vector base amplitude panning
US9025458B2 (en) * 2012-10-23 2015-05-05 Verizon Patent And Licensing Inc. Reducing congestion of media delivery over a content delivery network
CN104885151B (zh) * 2012-12-21 2017-12-22 Dolby Laboratories Licensing Corporation Object clustering for rendering object-based audio content based on perceptual criteria
BR112015016593B1 (pt) * 2013-01-15 2021-10-05 Koninklijke Philips N.V. Apparatus for processing an audio signal; apparatus for generating a bitstream; audio processing method; method for generating a bitstream; and bitstream
WO2015056383A1 (ja) * 2013-10-17 2015-04-23 Panasonic Corporation Audio encoding device and audio decoding device
KR102160254B1 (ko) * 2014-01-10 2020-09-25 Samsung Electronics Co., Ltd. Method and apparatus for reproducing stereophonic sound using an active downmix scheme

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
ISO/IEC CD 23008-3. Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio. ISO/IEC JTC 1/SC 29/WG 11. 2014.04.04.
ISO/IEC DIS 23008-3. Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio. ISO/IEC JTC 1/SC 29/WG 11. 2014.07.25.
ISO/IEC DIS 23008-3. Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio. ISO/IEC JTC 1/SC 29/WG 11. 2014.08.05.
ISO/IEC FDIS 23003-3:2011(E), Information technology - MPEG audio technologies - Part 3: Unified speech and audio coding. ISO/IEC JTC 1/SC 29/WG 11. 2011.09.20.*
ISO/IEC WD1 23008-3. Information technology - High efficiency coding and media delivery in heterogeneous environments - Part 3: 3D audio. ISO/IEC JTC 1/SC 29/WG 11. 2014.01.24. (107 meeting w14263)

Also Published As

Publication number Publication date
WO2015146057A1 (en) 2015-10-01
CN106133828A (zh) 2016-11-16
US20200135216A1 (en) 2020-04-30
CN111489758A (zh) 2020-08-04
US20210398546A1 (en) 2021-12-23
EP3745397A1 (en) 2020-12-02
EP3745397B1 (en) 2023-06-07
CN106133828B (zh) 2020-04-10
RU2016137197A3 2018-10-22
BR112016021407A2 (pt) 2022-07-19
BR112016021407B1 (pt) 2022-09-27
RU2016137197A (ru) 2018-03-21
EP3123470B1 (en) 2020-08-12
JP6439296B2 (ja) 2018-12-19
EP3123470A1 (en) 2017-02-01
EP4243016A2 (en) 2023-09-13
US20240055007A1 (en) 2024-02-15
RU2689438C2 (ru) 2019-05-28
EP4243016A3 (en) 2023-11-08
KR20160136278A (ko) 2016-11-29
KR20230027329A (ko) 2023-02-27
KR102300062B1 (ko) 2021-09-09
RU2019112504A (ru) 2019-05-06
JP2015194666A (ja) 2015-11-05
CN111489758B (zh) 2023-12-01
US20180033440A1 (en) 2018-02-01
KR20250002792A (ko) 2025-01-07
KR20210111897A (ko) 2021-09-13

Similar Documents

Publication Publication Date Title
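KR102741508B1 (ko) Encoding device and encoding method, decoding device and decoding method, and program
KR101921403B1 (ko) Higher-order ambisonic signal compression
KR102294767B1 (ko) Multiplet-based matrix mixing for high-channel-count multichannel audio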
US8817991B2 (en) Advanced encoding of multi-channel digital audio signals
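TW201729180A (zh) Apparatus and method for encoding or decoding a multi-channel signal using a broadband alignment parameter and a plurality of narrowband alignment parameters
CN114008704B (zh) Coding scaled spatial components
CN114008705B (zh) Performing psychoacoustic audio coding based on operating conditions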
EP3987513B1 (en) Quantizing spatial components based on bit allocations determined for psychoacoustic audio coding
EP3987514B1 (en) Correlating scene-based audio data for psychoacoustic audio coding
JP2025061919A (ja) Information processing device and method, and program
KR20210071972A (ko) Signal processing device and method, and program

Legal Events

Date Code Title Description
PA0104 Divisional application for international application

St.27 status event code: A-0-1-A10-A16-div-PA0104

St.27 status event code: A-0-1-A10-A18-div-PA0104

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

E90F Notification of reason for final refusal
PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701

PA0104 Divisional application for international application

St.27 status event code: A-0-1-A10-A16-div-PA0104

PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

Fee payment year number: 1

St.27 status event code: A-2-2-U10-U12-oth-PR1002

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601