PL3114681T3 - Post-encoding bitrate reduction of multiple object audio

Post-encoding bitrate reduction of multiple object audio

Info

Publication number
PL3114681T3
Authority
PL
Poland
Prior art keywords
post
multiple object
object audio
encoding bitrate
bitrate reduction
Application number
PL15758957T
Other languages
English (en)
Inventor
Zoran Fejzo
Original Assignee
Dts, Inc.
Priority date
2014-03-06
Filing date
2015-02-26
Publication date
2018-12-31
Application filed by Dts, Inc.
Publication of PL3114681T3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002 Dynamic bit allocation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/40 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132 Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25 Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266 Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662 Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434 Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
PL15758957T 2014-03-06 2015-02-26 Post-encoding bitrate reduction of multiple object audio PL3114681T3 (pl)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US14/199,706 US9564136B2 (en) 2014-03-06 2014-03-06 Post-encoding bitrate reduction of multiple object audio
EP15758957.3A EP3114681B1 (en) 2014-03-06 2015-02-26 Post-encoding bitrate reduction of multiple object audio
PCT/US2015/017732 WO2015134272A1 (en) 2014-03-06 2015-02-26 Post-encoding bitrate reduction of multiple object audio

Publications (1)

Publication Number Publication Date
PL3114681T3 (pl) 2018-12-31

Family

ID=54017971

Family Applications (1)

Application Number Title Priority Date Filing Date
PL15758957T PL3114681T3 (pl) 2014-03-06 2015-02-26 Post-encoding bitrate reduction of multiple object audio

Country Status (7)

Country Link
US (2) US9564136B2 (pl)
EP (2) EP3114681B1 (pl)
JP (1) JP6620108B2 (pl)
KR (1) KR102451342B1 (pl)
CN (1) CN106233380B (pl)
PL (1) PL3114681T3 (pl)
WO (1) WO2015134272A1 (pl)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI530941B (zh) 2013-04-03 2016-04-21 杜比實驗室特許公司 Method and system for interactive rendering of object-based audio
EP3167620A1 (en) * 2014-07-07 2017-05-17 Thomson Licensing Enhancing video content according to metadata
CN111556426B (zh) 2015-02-06 2022-03-25 杜比实验室特许公司 Hybrid priority-based rendering system and method for adaptive audio
JP2017168967A (ja) * 2016-03-15 2017-09-21 富士ゼロックス株式会社 Information processing apparatus
US10362082B2 (en) * 2016-04-12 2019-07-23 Baidu Usa Llc Method for streaming-based distributed media data processing
WO2018113953A1 (en) * 2016-12-21 2018-06-28 Telefonaktiebolaget Lm Ericsson (Publ) Region of interest classification
WO2018142947A1 (ja) * 2017-01-31 2018-08-09 ソニー株式会社 Information processing device and method
EP4358085A2 (en) * 2017-04-26 2024-04-24 Sony Group Corporation Signal processing device, method, and program
US10771789B2 (en) * 2017-05-19 2020-09-08 Google Llc Complexity adaptive rate control
EP3734594A4 (en) * 2017-12-28 2020-11-11 Sony Corporation INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD AND PROGRAM
CN110535810A (zh) * 2018-05-25 2019-12-03 视联动力信息技术股份有限公司 Video data processing method and terminal
GB2578715A (en) * 2018-07-20 2020-05-27 Nokia Technologies Oy Controlling audio focus for spatial audio processing
JP2022506338A (ja) * 2018-11-02 2022-01-17 ドルビー・インターナショナル・アーベー Audio encoder and audio decoder
WO2020253941A1 (en) * 2019-06-17 2020-12-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder with a signal-dependent number and precision control, audio decoder, and related methods and computer programs
US11361776B2 (en) * 2019-06-24 2022-06-14 Qualcomm Incorporated Coding scaled spatial components
US11538489B2 (en) 2019-06-24 2022-12-27 Qualcomm Incorporated Correlating scene-based audio data for psychoacoustic audio coding
CA3145045A1 (en) * 2019-07-08 2021-01-14 Voiceage Corporation Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation
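CN110718211B (zh) * 2019-09-26 2021-12-21 东南大学 Keyword recognition system based on a hybrid compressed convolutional neural network
CN113593585A (zh) * 2020-04-30 2021-11-02 华为技术有限公司 Bit allocation method and apparatus for audio signals
CN111583898B (zh) * 2020-05-26 2021-06-29 苏州双福智能科技有限公司 Multi-directional selective noise reduction system and method for a spatial environment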
US11355139B2 (en) * 2020-09-22 2022-06-07 International Business Machines Corporation Real-time vs non-real time audio streaming
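CN114884974B (zh) * 2022-04-08 2024-02-23 海南车智易通信息技术有限公司 Data multiplexing method, system, and computing device
WO2024080597A1 (ko) * 2022-10-12 2024-04-18 삼성전자주식회사 Electronic device, method, and non-transitory computer-readable storage medium for adaptively processing an audio bitstream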

Family Cites Families (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4095052A (en) * 1977-08-02 1978-06-13 Bell Telephone Laboratories, Incorporated Digital speech interpolation trunk priority rotator
GB8330885D0 (en) * 1983-11-18 1983-12-29 British Telecomm Data transmission
EP0805564A3 (en) * 1991-08-02 1999-10-13 Sony Corporation Digital encoder with dynamic quantization bit allocation
WO1995017745A1 (en) 1993-12-16 1995-06-29 Voice Compression Technologies Inc. System and method for performing voice compression
US5519779A (en) * 1994-08-05 1996-05-21 Motorola, Inc. Method and apparatus for inserting signaling in a communication system
US5742734A (en) 1994-08-10 1998-04-21 Qualcomm Incorporated Encoding rate selection in a variable rate vocoder
US6041295A (en) * 1995-04-10 2000-03-21 Corporate Computer Systems Comparing CODEC input/output to adjust psycho-acoustic parameters
US5835495A (en) * 1995-10-11 1998-11-10 Microsoft Corporation System and method for scaleable streamed audio transmission over a network
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
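KR100257613B1 (ko) * 1996-10-15 2000-06-01 모리시타 요이찌 Video and audio coding method, coding apparatus, and coding program recording medium
JP3835034B2 (ja) * 1998-02-26 2006-10-18 株式会社日立製作所 Receiving device, information output device, and information output method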
US6349286B2 (en) 1998-09-03 2002-02-19 Siemens Information And Communications Network, Inc. System and method for automatic synchronization for multimedia presentations
US6775325B1 (en) * 1998-10-07 2004-08-10 Sarnoff Corporation Method and apparatus for converting the bitrate of an encoded bitstream without full re-encoding
US7003449B1 (en) * 1999-10-30 2006-02-21 Stmicroelectronics Asia Pacific Pte Ltd. Method of encoding an audio signal using a quality value for bit allocation
US6697776B1 (en) 2000-07-31 2004-02-24 Mindspeed Technologies, Inc. Dynamic signal detector system and method
US20020131496A1 (en) * 2001-01-18 2002-09-19 Vinod Vasudevan System and method for adjusting bit rate and cost of delivery of digital data
DE10102159C2 (de) * 2001-01-18 2002-12-12 Fraunhofer Ges Forschung Method and device for generating or decoding a scalable data stream taking a bit savings bank into account, encoder and scalable encoder
US6694293B2 (en) 2001-02-13 2004-02-17 Mindspeed Technologies, Inc. Speech coding system with a music classifier
US7333929B1 (en) * 2001-09-13 2008-02-19 Chmounk Dmitri V Modular scalable compressed audio data stream
US7313520B2 (en) 2002-03-20 2007-12-25 The Directv Group, Inc. Adaptive variable bit rate audio compression encoding
US8244895B2 (en) * 2002-07-15 2012-08-14 Hewlett-Packard Development Company, L.P. Method and apparatus for applying receiving attributes using constraints
US7398204B2 (en) * 2002-08-27 2008-07-08 Her Majesty In Right Of Canada As Represented By The Minister Of Industry Bit rate reduction in audio encoders by exploiting inharmonicity effects and auditory temporal masking
US7804897B1 (en) * 2002-12-16 2010-09-28 Apple Inc. Method for implementing an improved quantizer in a multimedia compression and encoding system
KR100528325B1 (ko) * 2002-12-18 2005-11-15 삼성전자주식회사 Bit-rate-scalable stereo audio encoding and decoding method and apparatus therefor
US7075460B2 (en) * 2004-02-13 2006-07-11 Hewlett-Packard Development Company, L.P. Methods for scaling encoded data without requiring knowledge of the encoding scheme
DE602005022641D1 (de) 2004-03-01 2010-09-09 Dolby Lab Licensing Corp Multichannel audio decoding
US7272567B2 (en) 2004-03-25 2007-09-18 Zoran Fejzo Scalable lossless audio codec and authoring tool
WO2006010951A1 (en) 2004-07-30 2006-02-02 U-Myx Limited Multi-channel audio data distribution format, method and system
US7930184B2 (en) 2004-08-04 2011-04-19 Dts, Inc. Multi-channel audio coding/decoding of random access points and transients
US8370514B2 (en) * 2005-04-28 2013-02-05 DISH Digital L.L.C. System and method of minimizing network bandwidth retrieved from an external network
US7548853B2 (en) * 2005-06-17 2009-06-16 Shmunk Dmitry V Scalable compressed audio bit stream and codec using a hierarchical filterbank and multichannel joint coding
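CN101411080B (zh) * 2006-03-27 2013-05-01 维德约股份有限公司 System and method for managing scalability information in scalable video and audio coding systems using control messages
JP2007264154A (ja) * 2006-03-28 2007-10-11 Sony Corp Audio signal encoding method, program for the audio signal encoding method, recording medium storing the program for the audio signal encoding method, and audio signal encoding device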
EP1855271A1 (en) * 2006-05-12 2007-11-14 Deutsche Thomson-Brandt Gmbh Method and apparatus for re-encoding signals
US8279889B2 (en) * 2007-01-04 2012-10-02 Qualcomm Incorporated Systems and methods for dimming a first packet associated with a first bit rate to a second packet associated with a second bit rate
US20090099851A1 (en) 2007-10-11 2009-04-16 Broadcom Corporation Adaptive bit pool allocation in sub-band coding
US20090210436A1 (en) 2007-10-30 2009-08-20 General Instrument Corporation Encoding a hierarchical multi-layer data package
US8239210B2 (en) 2007-12-19 2012-08-07 Dts, Inc. Lossless multi-channel audio codec
EP2144231A1 (en) 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme with common preprocessing
KR101209213B1 (ko) * 2008-08-19 2012-12-06 광주과학기술원 Hierarchical parametric stereo encoding apparatus and decoding apparatus for audio signals
US8396577B2 (en) * 2009-08-14 2013-03-12 Dts Llc System for creating audio objects for streaming
CN102081927B (zh) * 2009-11-27 2012-07-18 中兴通讯股份有限公司 Hierarchical audio coding and decoding method and system
TWI476761B (zh) * 2011-04-08 2015-03-11 Dolby Lab Licensing Corp Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
DE102011106033A1 (de) * 2011-06-30 2013-01-03 Zte Corporation Method and system for audio coding and decoding and method for estimating the noise level

Also Published As

Publication number Publication date
US9984692B2 (en) 2018-05-29
JP6620108B2 (ja) 2019-12-11
KR20160129876A (ko) 2016-11-09
EP3416165A1 (en) 2018-12-19
EP3114681A4 (en) 2017-08-02
KR102451342B1 (ko) 2022-10-05
WO2015134272A1 (en) 2015-09-11
US20160099000A1 (en) 2016-04-07
EP3114681A1 (en) 2017-01-11
US20150255076A1 (en) 2015-09-10
CN106233380B (zh) 2019-11-08
EP3416165B1 (en) 2020-10-21
EP3114681B1 (en) 2018-07-25
JP2017507365A (ja) 2017-03-16
US9564136B2 (en) 2017-02-07
CN106233380A (zh) 2016-12-14

Similar Documents

Publication Publication Date Title
PL3114681T3 (pl) Post-encoding bitrate reduction of multiple object audio
PL3417725T3 (pl) Water pipe
HUE044919T2 (hu) Conference audio management
EP3155817A4 (en) Enhanced streaming media playback
EP3111306A4 (en) Continuous playback queue
HK1243415A1 (zh) Bromodomain inhibitors
EP3378235A4 (en) MEDIA STREAMING
EP3281199C0 (en) AUDIO BANDWIDTH SELECTION
EP3211916A4 (en) Audio playback device
GB2548208B (en) Audio watermarking for people monitoring
HK1244121A1 (zh) Link-aware streaming adaptation
HK1226169A1 (zh) Analyzing audio data
HUE050695T2 (hu) Encoding of multiple audio signals
SG11201701516TA (en) Audio splicing concept
GB201513555D0 (en) Audio system
EP3262850A4 (en) Audio devices
HK1216364A1 (zh) Audio mixer
PL3790007T3 (pl) Audio coding
GB201408606D0 (en) Audio mode selector
GB201620838D0 (en) Audio playback
GB2540673B (en) Audio enhancement
EP3117113B8 (fr) Threaded insert
ZA201701965B (en) Audio parameter quantization
GB201518892D0 (en) Multimedia playing system
GB201421513D0 (en) Real-time audio manipulation