RU2019112504A - Encoding device and encoding method, decoding device and decoding method, and program - Google Patents

Encoding device and encoding method, decoding device and decoding method, and program

Info

Publication number
RU2019112504A
Authority
RU
Russia
Prior art keywords
priority
degree
decoding
encoded audio
objects
Prior art date
Application number
RU2019112504A
Other languages
English (en)
Russian (ru)
Inventor
Toru Chinen
Masayuki Nishiguchi
Runyu Shi
Mitsuyuki Hatanaka
Yuki Yamamoto
Original Assignee
Sony Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corporation
Publication of RU2019112504A


Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/22 Mode decision, i.e. based on audio signal content versus external parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
RU2019112504A 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program RU2019112504A (ru)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2014-060486 2014-03-24
JP2014060486 2014-03-24
JP2014-136633 2014-07-02
JP2014136633A JP6439296B2 (ja) 2014-03-24 2014-07-02 Decoding device and method, and program

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
RU2016137197A Division RU2689438C2 (ru) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Publications (1)

Publication Number Publication Date
RU2019112504A (ru) 2019-05-06

Family

ID=53039543

Family Applications (2)

Application Number Title Priority Date Filing Date
RU2019112504A RU2019112504A (ru) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program
RU2016137197A RU2689438C2 (ru) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Family Applications After (1)

Application Number Title Priority Date Filing Date
RU2016137197A RU2689438C2 (ru) 2014-03-24 2015-03-16 Encoding device and encoding method, decoding device and decoding method, and program

Country Status (8)

Country Link
US (4) US20180033440A1
EP (3) EP4243016A3
JP (1) JP6439296B2
KR (4) KR20210111897A
CN (2) CN111489758B
BR (1) BR112016021407B1
RU (2) RU2019112504A
WO (1) WO2015146057A1

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3059732B1 (en) * 2013-10-17 2018-10-10 Socionext Inc. Audio decoding device
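JP6904250B2 (ja) * 2015-04-08 2021-07-14 Sony Group Corporation Transmission device, transmission method, reception device, and reception method
WO2016163329A1 (ja) * 2015-04-08 2016-10-13 Sony Corporation Transmission device, transmission method, reception device, and reception method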
US10424307B2 (en) * 2017-01-03 2019-09-24 Nokia Technologies Oy Adapting a distributed audio recording for end user free viewpoint monitoring
EP4054213A1 (en) * 2017-03-06 2022-09-07 Dolby International AB Rendering in dependence on the number of loudspeaker channels
EP3618067B1 (en) * 2017-04-26 2024-04-10 Sony Group Corporation Signal processing device, method, and program
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
US10657974B2 (en) * 2017-12-21 2020-05-19 Qualcomm Incorporated Priority information for higher order ambisonic audio data
US11270711B2 (en) 2017-12-21 2022-03-08 Qualcomm Incorproated Higher order ambisonic audio data
GB2578715A (en) * 2018-07-20 2020-05-27 Nokia Technologies Oy Controlling audio focus for spatial audio processing
JP7447798B2 (ja) * 2018-10-16 2024-03-12 Sony Group Corporation Signal processing device and method, and program
CN111081226B (zh) * 2018-10-18 2024-02-13 Beijing Sogou Technology Development Co., Ltd. Speech recognition decoding optimization method and device
CN113016032B (zh) * 2018-11-20 2024-08-20 Sony Group Corporation Information processing device and method, and program
WO2021200260A1 (ja) * 2020-04-01 2021-10-07 Sony Group Corporation Signal processing device and method, and program
MX2023002255A (es) * 2020-09-03 2023-05-16 Sony Group Corp Signal processing device and method, learning device and method, and program
DE112021005027T5 (de) * 2020-09-25 2023-08-10 Apple Inc. Seamless scalable decoding of channels, objects, and HOA audio content
CN112634914B (zh) * 2020-12-15 2024-03-29 University of Science and Technology of China Neural network vocoder training method based on short-time spectrum consistency
US11710491B2 (en) * 2021-04-20 2023-07-25 Tencent America LLC Method and apparatus for space of interest of audio scene
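CN114974273B (zh) * 2021-08-10 2023-08-15 China Mobile Internet Co., Ltd. Conference audio mixing method and device
CN114550732B (zh) * 2022-04-15 2022-07-08 Tencent Technology (Shenzhen) Co., Ltd. Encoding and decoding method for high-frequency audio signals and related device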

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6330644B1 (en) * 1994-10-27 2001-12-11 Canon Kabushiki Kaisha Signal processor with a plurality of kinds of processors and a shared memory accessed through a versatile control means
JP3519722B2 (ja) * 1997-03-17 2004-04-19 Matsushita Electric Industrial Co., Ltd. Data processing method and data processing device
SE512719C2 (sv) * 1997-06-10 2000-05-02 Lars Gustaf Liljeryd A method and device for data-flow reduction based on harmonic bandwidth expansion
US6230130B1 (en) * 1998-05-18 2001-05-08 U.S. Philips Corporation Scalable mixing for speech streaming
JP2005292702A (ja) * 2004-04-05 2005-10-20 Kddi Corp Fade-in/fade-out processing device and program for audio frames
US8787594B1 (en) * 2005-01-28 2014-07-22 Texas Instruments Incorporated Multi-stream audio level controller
RU2383941C2 (ru) * 2005-06-30 2010-03-10 LG Electronics Inc. Method and device for encoding and decoding audio signals
US7974422B1 (en) * 2005-08-25 2011-07-05 Tp Lab, Inc. System and method of adjusting the sound of multiple audio objects directed toward an audio output device
JP4396683B2 (ja) * 2006-10-02 2010-01-13 Casio Computer Co., Ltd. Speech encoding device, speech encoding method, and program
MY144273A (en) * 2006-10-16 2011-08-29 Fraunhofer Ges Forschung Apparatus and method for multi-channel parameter transformation
US8085786B2 (en) * 2007-03-16 2011-12-27 Qualcomm Incorporated H-ARQ throughput optimization by prioritized decoding
FR2929466A1 (fr) * 2008-03-28 2009-10-02 France Telecom Concealment of transmission error in a digital signal in a hierarchical decoding structure
US8396577B2 (en) * 2009-08-14 2013-03-12 Dts Llc System for creating audio objects for streaming
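CN102714038B (zh) * 2009-11-20 2014-11-05 Fraunhofer-Gesellschaft Apparatus for providing an upmix signal representation based on a downmix signal representation, apparatus for providing a bitstream representing a multi-channel audio signal, methods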
US9531761B2 (en) * 2010-07-01 2016-12-27 Broadcom Corporation Method and system for prioritizing and scheduling services in an IP multimedia network
JP2012108451A (ja) * 2010-10-18 2012-06-07 Sony Corp Audio processing device and method, and program
KR102374897B1 (ko) * 2011-03-16 2022-03-17 DTS, Inc. Encoding and reproduction of three-dimensional audio soundtracks
WO2013181272A2 (en) * 2012-05-31 2013-12-05 Dts Llc Object-based audio system using vector base amplitude panning
US9025458B2 (en) * 2012-10-23 2015-05-05 Verizon Patent And Licensing Inc. Reducing congestion of media delivery over a content delivery network
US9805725B2 (en) * 2012-12-21 2017-10-31 Dolby Laboratories Licensing Corporation Object clustering for rendering object-based audio content based on perceptual criteria
US9860663B2 (en) * 2013-01-15 2018-01-02 Koninklijke Philips N.V. Binaural audio processing
EP3059732B1 (en) * 2013-10-17 2018-10-10 Socionext Inc. Audio decoding device
KR102160254B1 (ko) * 2014-01-10 2020-09-25 Samsung Electronics Co., Ltd. Method and apparatus for stereophonic sound reproduction using an active downmix scheme

Also Published As

Publication number Publication date
KR102300062B1 (ko) 2021-09-09
EP3123470A1 (en) 2017-02-01
CN111489758A (zh) 2020-08-04
JP2015194666A (ja) 2015-11-05
KR20160136278A (ko) 2016-11-29
WO2015146057A1 (en) 2015-10-01
BR112016021407B1 (pt) 2022-09-27
EP4243016A3 (en) 2023-11-08
CN111489758B (zh) 2023-12-01
US20200135216A1 (en) 2020-04-30
EP3123470B1 (en) 2020-08-12
KR20210111897A (ko) 2021-09-13
CN106133828B (zh) 2020-04-10
EP3745397A1 (en) 2020-12-02
US20240055007A1 (en) 2024-02-15
RU2016137197A (ru) 2018-03-21
BR112016021407A2 (pt) 2022-07-19
KR20250002792A (ko) 2025-01-07
US20210398546A1 (en) 2021-12-23
CN106133828A (zh) 2016-11-16
RU2689438C2 (ru) 2019-05-28
KR102741508B1 (ko) 2024-12-12
EP4243016A2 (en) 2023-09-13
KR20230027329A (ko) 2023-02-27
JP6439296B2 (ja) 2018-12-19
RU2016137197A3 2018-10-22
EP3745397B1 (en) 2023-06-07
US20180033440A1 (en) 2018-02-01

Similar Documents

Publication Publication Date Title
RU2019112504A (ru) Encoding device and encoding method, decoding device and decoding method, and program
DK3931748T3 (da) Sub-picture-based slice addresses in video coding
IL280228A (en) Video encoder, video decoder, and corresponding encoding and decoding methods
TN2017000286A1 (en) Enhanced multiple transforms for prediction residual
PL4090025T3 (pl) Video decoding method, video encoding method, and transmission method
HUE040132T2 (hu) Method and device for decoding multi-layer video data by determining the capability of the decoder based on the profile, tier, and level associated with a partition containing one or more layers
JP2015188244A5
EP3580649A4 (en) CONTENT STORAGE OPTIMIZATION BY CREATION OF TALONS
WO2016033480A3 (en) Intermediate compression for higher order ambisonic audio data
BR112015004956A2 (pt) Image encoding and decoding apparatuses, and image encoding and decoding methods
WO2018128679A3 (en) Efficient list decoding of ldpc codes
EP4170947C0 (en) AVOIDANCE OF MULTIPLE SIGNALING RETRANSMISSIONS CARRIED BY A 5G NAS TRANSPORT
PT3125243T (pt) Audio decoding device, audio encoding device, audio decoding method, audio encoding method, audio decoding program, and audio encoding program
PL3859734T3 (pl) Audio signal decoding device, audio signal decoding method, program, and recording medium
DK3958572T3 (da) Method for encoding multi-view video, method for decoding multi-view video, and storage medium therefor
EP3345397A4 (en) VIDEO CODING WITH DELAYED RECONSTRUCTION
KR102384691B9 (ko) Method, apparatus, and medium for decoding or encoding
PL3139380T3 (pl) Encoder, decoder, encoding method, decoding method, encoding program, decoding program, and recording medium
EP3295670A4 (en) Data-charge phase data compression architecture
DK3642839T3 (da) Audio signal encoding and decoding
TR201910102T4 (tr) Encoder, decoder, encoding method, decoding method, and program
TH1501000014B (th) Video predictive encoding device, video predictive encoding method, video predictive encoding program, video predictive decoding device, video predictive decoding method, and video predictive decoding program
TH1501005845A (th) Decoder, encoder, and method for informed loudness estimation employing by-pass audio object signals in object-based audio coding systems
TH1501004429A (th) Audio encoder and decoder with program information or structure metadata
TH1501004212B (th) Noise filling in perceptual transform audio coding