CN104428835B - Encoding and decoding of audio signals - Google Patents
Encoding and decoding of audio signals
- Publication number
- CN104428835B
- Authority
- CN
- China
- Prior art keywords
- frequency
- contracting
- mixed
- time
- pieced together
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
- G10L19/265—Pre-filtering, e.g. high frequency emphasis prior to encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201261669197P | 2012-07-09 | 2012-07-09 | |
| US61/669197 | 2012-07-09 | | |
| PCT/IB2013/055628 WO2014009878A2 (en) | 2012-07-09 | 2013-07-09 | Encoding and decoding of audio signals |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN104428835A (zh) | 2015-03-18 |
| CN104428835B (zh) | 2017-10-31 |
Family
ID=49170767
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201380036886.9A (Active), CN104428835B (zh) | 2012-07-09 | 2013-07-09 | Encoding and decoding of audio signals |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US9478228B2 |
| EP (2) | EP3748632A1 |
| JP (1) | JP6231093B2 |
| CN (1) | CN104428835B |
| BR (1) | BR112015000247B1 |
| MX (1) | MX342150B |
| RU (1) | RU2643644C2 |
| WO (1) | WO2014009878A2 |
| ZA (1) | ZA201500888B |
Families Citing this family (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9674587B2 (en) * | 2012-06-26 | 2017-06-06 | Sonos, Inc. | Systems and methods for networked music playback including remote add to queue |
| US9489954B2 (en) * | 2012-08-07 | 2016-11-08 | Dolby Laboratories Licensing Corporation | Encoding and rendering of object based audio indicative of game audio content |
| BR112015029031B1 (pt) | 2013-05-24 | 2021-02-23 | Dolby International Ab | Method and encoder for encoding a vector of parameters in an audio coding system, method and decoder for decoding a vector of entropy-coded symbols in an audio decoding system, and computer-readable storage medium |
| US9774974B2 (en) | 2014-09-24 | 2017-09-26 | Electronics And Telecommunications Research Institute | Audio metadata providing apparatus and method, and multichannel audio data playback apparatus and method to support dynamic format conversion |
| TWI587286B (zh) | 2014-10-31 | 2017-06-11 | Dolby International AB | Method and system for decoding and encoding audio signals, computer program product, and computer-readable medium |
| AU2016269886B2 (en) * | 2015-06-02 | 2020-11-12 | Sony Corporation | Transmission device, transmission method, media processing device, media processing method, and reception device |
| US10693936B2 (en) * | 2015-08-25 | 2020-06-23 | Qualcomm Incorporated | Transporting coded audio data |
| US9961467B2 (en) * | 2015-10-08 | 2018-05-01 | Qualcomm Incorporated | Conversion from channel-based audio to HOA |
| US9854375B2 (en) * | 2015-12-01 | 2017-12-26 | Qualcomm Incorporated | Selection of coded next generation audio data for transport |
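| KR102261905B1 (ko) * | 2016-03-15 | 2021-06-08 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus, method, or computer program for generating a sound field description |
| CN113242508B (zh) | 2017-03-06 | 2022-12-06 | Dolby International AB | Method, decoder system, and medium for rendering audio output based on an audio data stream |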
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| BR112021009306A2 (pt) | 2018-11-20 | 2021-08-10 | Sony Group Corporation | Information processing device and method, and program |
| GB2587614A (en) * | 2019-09-26 | 2021-04-07 | Nokia Technologies Oy | Audio encoding and audio decoding |
| US11930349B2 (en) * | 2020-11-24 | 2024-03-12 | Naver Corporation | Computer system for producing audio content for realizing customized being-there and method thereof |
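| KR102500694B1 (ko) * | 2020-11-24 | 2023-02-16 | Naver Corporation | Computer system for producing audio content for realizing user-customized being-there and method thereof |
| JP7536733B2 (ja) * | 2020-11-24 | 2024-08-20 | Naver Corporation | Computer system for realizing user-customized being-there in association with audio and method thereof |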
| EP4320876A4 (en) * | 2021-04-08 | 2024-11-06 | Nokia Technologies Oy | SEPARATION OF SPATIAL AUDIO OBJECTS |
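| CN115497485B (zh) * | 2021-06-18 | 2024-10-18 | Huawei Technologies Co., Ltd. | Three-dimensional audio signal encoding method, apparatus, encoder, and system |
| JP7745100B2 (ja) * | 2021-11-02 | 2025-09-26 | Beijing Xiaomi Mobile Software Co., Ltd. | Signal encoding and decoding method, apparatus, user equipment, network-side device, and storage medium |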
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101490745A (zh) * | 2006-11-24 | 2009-07-22 | LG Electronics Inc. | Method and apparatus for encoding and decoding object-based audio signals |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7644003B2 (en) * | 2001-05-04 | 2010-01-05 | Agere Systems Inc. | Cue-based audio coding/decoding |
| ES2307160T3 (es) * | 2004-04-05 | 2008-11-16 | Koninklijke Philips Electronics N.V. | Multichannel encoder |
| US7831434B2 (en) | 2006-01-20 | 2010-11-09 | Microsoft Corporation | Complex-transform channel coding with extended-band frequency coding |
| US9014377B2 (en) * | 2006-05-17 | 2015-04-21 | Creative Technology Ltd | Multichannel surround format conversion and generalized upmix |
| US8345899B2 (en) * | 2006-05-17 | 2013-01-01 | Creative Technology Ltd | Phase-amplitude matrixed surround decoder |
| UA94117C2 (ru) * | 2006-10-16 | 2011-04-11 | Dolby Sweden AB | Enhanced coding and parameter representation of multichannel downmixed object coding |
| KR101102401B1 (ko) * | 2006-11-24 | 2012-01-05 | LG Electronics Inc. | Method and apparatus for encoding and decoding object-based audio signals |
| JP2008252834A (ja) * | 2007-03-30 | 2008-10-16 | Toshiba Corp | Audio reproduction device |
| US8612237B2 (en) * | 2007-04-04 | 2013-12-17 | Apple Inc. | Method and apparatus for determining audio spatial quality |
| CA2702986C (en) * | 2007-10-17 | 2016-08-16 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio coding using downmix |
| CA2710560C (en) * | 2008-01-01 | 2015-10-27 | Lg Electronics Inc. | A method and an apparatus for processing an audio signal |
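| KR101596504B1 (ko) * | 2008-04-23 | 2016-02-23 | Electronics and Telecommunications Research Institute | Method for generating/playing object-based audio content, and computer-readable recording medium storing data having a file format structure for an object-based audio service |
| JPWO2010005050A1 (ja) * | 2008-07-11 | 2012-01-05 | NEC Corporation | Signal analysis device, signal control device, method therefor, and program |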
| US8504184B2 (en) * | 2009-02-04 | 2013-08-06 | Panasonic Corporation | Combination device, telecommunication system, and combining method |
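| KR101387902B1 (ko) * | 2009-06-10 | 2014-04-22 | Electronics and Telecommunications Research Institute | Method and apparatus for encoding a multi-object audio signal, decoding method and decoding apparatus, and transcoding method and transcoder |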
| SG177277A1 (en) * | 2009-06-24 | 2012-02-28 | Fraunhofer Ges Forschung | Audio signal decoder, method for decoding an audio signal and computer program using cascaded audio object processing stages |
| KR101615262B1 (ko) * | 2009-08-12 | 2016-04-26 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding and decoding multi-channel audio using semantic information |
| BR112012007138B1 (pt) * | 2009-09-29 | 2021-11-30 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Audio signal decoder, audio signal encoder, method for providing an upmix signal representation, method for providing a downmix signal representation, and bitstream using a common intra-object correlation parameter value |
| KR101666465B1 (ko) * | 2010-07-22 | 2016-10-17 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding/decoding a multichannel audio signal |
| EP2686654A4 (en) * | 2011-03-16 | 2015-03-11 | Dts Inc | Encoding and reproduction of three-dimensional audio soundtracks |
| KR20130093798A (ko) * | 2012-01-02 | 2013-08-23 | Electronics and Telecommunications Research Institute | Apparatus and method for encoding and decoding a multichannel signal |
| Date | Country | Application | Publication | Status |
|---|---|---|---|---|
| 2013-07-09 | CN | CN201380036886.9A | CN104428835B (zh) | Active |
| 2013-07-09 | JP | JP2015521121A | JP6231093B2 (ja) | Active |
| 2013-07-09 | MX | MX2015000113A | MX342150B (es) | IP Right Grant |
| 2013-07-09 | EP | EP20182398.6A | EP3748632A1 (en) | Withdrawn |
| 2013-07-09 | RU | RU2015104074A | RU2643644C2 (ru) | Active |
| 2013-07-09 | WO | PCT/IB2013/055628 | WO2014009878A2 (en) | Ceased |
| 2013-07-09 | BR | BR112015000247-1A | BR112015000247B1 (pt) | IP Right Grant |
| 2013-07-09 | US | US14/413,234 | US9478228B2 (en) | Active |
| 2013-07-09 | EP | EP13762579.4A | EP2870603B1 (en) | Active |
| 2015-02-06 | ZA | ZA2015/00888A | ZA201500888B (en) | Unknown |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2014009878A3 (en) | 2014-03-13 |
| BR112015000247A2 (pt) | 2017-06-27 |
| BR112015000247B1 (pt) | 2021-08-03 |
| MX2015000113A (es) | 2015-08-10 |
| RU2015104074A (ru) | 2016-08-27 |
| JP6231093B2 (ja) | 2017-11-15 |
| EP2870603B1 (en) | 2020-09-30 |
| US9478228B2 (en) | 2016-10-25 |
| EP2870603A2 (en) | 2015-05-13 |
| EP3748632A1 (en) | 2020-12-09 |
| US20150142453A1 (en) | 2015-05-21 |
| CN104428835A (zh) | 2015-03-18 |
| WO2014009878A2 (en) | 2014-01-16 |
| MX342150B (es) | 2016-09-15 |
| RU2643644C2 (ru) | 2018-02-02 |
| ZA201500888B (en) | 2017-01-25 |
| JP2015527609A (ja) | 2015-09-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
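| CN104428835B (zh) | | Encoding and decoding of audio signals |
| CN102800320B (zh) | | Method and apparatus for generating a side-information bitstream of a multi-object audio signal |
| TWI508578B (zh) | | Audio encoding and decoding |
| CN105981411B (zh) | | Multiplet-based matrix mixing for high-channel-count multichannel audio |
| KR102374897B1 (ko) | | Encoding and reproduction of three-dimensional audio soundtracks |
| CN103890841B (zh) | | Audio object encoding and decoding |
| US9299353B2 (en) | | Method and apparatus for three-dimensional acoustic field encoding and optimal reconstruction |
| TWI359620B | | Apparatus and method for multi-channel parameter t |
| TWI427621B (zh) | | Method, apparatus, and machine-readable medium for encoding audio channels and decoding transmitted audio channels |
| CN105247611B (zh) | | Coding of audio scenes |
| TWI379287B | | Method, audio coder and apparatus for encoding c input audio |
| CN106664500B (zh) | | Method and apparatus for rendering a sound signal, and computer-readable recording medium |
| ES2433316T3 (es) | | Generation of multichannel audio signals |
| CN113170274B (zh) | | Ambient audio representation and associated rendering |
| JP2015509212A (ja) | | Spatial audio rendering and encoding |
| CN107533843A (zh) | | System and method for capturing, encoding, distributing, and decoding immersive audio |
| CN106063297A (zh) | | Method and apparatus for reproducing three-dimensional audio |
| CN107077861A (zh) | | Audio encoder and decoder |
| WO2008084436A1 (en) | | An object-oriented audio decoder |
| CN114944164A (zh) | | Multimodal-based immersive sound generation method and apparatus |
| KR20070081735A (ko) | | Method and apparatus for encoding/decoding an audio signal |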
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | C06 | Publication | |
| | PB01 | Publication | |
| | EXSB | Decision made by SIPO to initiate substantive examination | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |