TWI679633B - 對於高階保真立體音響資料框表示之壓縮判定用於描述非差分增益值表示的最低整數位元數之方法與設備 - Google Patents
對於高階保真立體音響資料框表示之壓縮判定用於描述非差分增益值表示的最低整數位元數之方法與設備 Download PDFInfo
- Publication number
- TWI679633B TWI679633B TW104120627A TW104120627A TWI679633B TW I679633 B TWI679633 B TW I679633B TW 104120627 A TW104120627 A TW 104120627A TW 104120627 A TW104120627 A TW 104120627A TW I679633 B TWI679633 B TW I679633B
- Authority
- TW
- Taiwan
- Prior art keywords
- hoa
- signal
- data frame
- matrix
- representation
- Prior art date
Links
- 238000007906 compression Methods 0.000 title claims abstract description 21
- 230000006835 compression Effects 0.000 title claims abstract description 20
- 238000000034 method Methods 0.000 title claims description 19
- 239000011159 matrix material Substances 0.000 claims description 61
- 239000013598 vector Substances 0.000 claims description 60
- 238000012545 processing Methods 0.000 claims description 31
- 230000005236 sound signal Effects 0.000 claims description 31
- 238000012937 correction Methods 0.000 claims description 20
- 238000002156 mixing Methods 0.000 claims description 13
- 238000010606 normalization Methods 0.000 claims description 12
- 230000009466 transformation Effects 0.000 claims description 12
- 230000008859 change Effects 0.000 claims description 10
- 230000002159 abnormal effect Effects 0.000 claims description 7
- 238000004364 calculation method Methods 0.000 claims description 6
- 238000009826 distribution Methods 0.000 claims description 5
- 230000008447 perception Effects 0.000 claims description 4
- 230000005856 abnormality Effects 0.000 claims 1
- 238000009827 uniform distribution Methods 0.000 claims 1
- 230000006870 function Effects 0.000 description 12
- 238000000354 decomposition reaction Methods 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 230000005540 biological transmission Effects 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 230000006837 decompression Effects 0.000 description 5
- 238000005070 sampling Methods 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 4
- 238000013139 quantization Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 241001306293 Ophrys insectifera Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000002547 anomalous effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 230000015654 memory Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 230000005428 wave function Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Mathematical Physics (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Mathematical Analysis (AREA)
- Theoretical Computer Science (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- General Physics & Mathematics (AREA)
- Algebra (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP14306024.2 | 2014-06-27 | ||
| EP14306024 | 2014-06-27 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201603001A TW201603001A (zh) | 2016-01-16 |
| TWI679633B true TWI679633B (zh) | 2019-12-11 |
Family
ID=51178840
Family Applications (3)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW110117878A TWI809394B (zh) | 2014-06-27 | 2015-06-26 | 用於將聲音或聲場的高階保真立體音響(hoa)表示予以解碼的方法及裝置 |
| TW108142368A TWI728563B (zh) | 2014-06-27 | 2015-06-26 | 用於將聲音或聲場的高階保真立體音響(hoa)表示予以解碼的方法及裝置 |
| TW104120627A TWI679633B (zh) | 2014-06-27 | 2015-06-26 | 對於高階保真立體音響資料框表示之壓縮判定用於描述非差分增益值表示的最低整數位元數之方法與設備 |
Family Applications Before (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW110117878A TWI809394B (zh) | 2014-06-27 | 2015-06-26 | 用於將聲音或聲場的高階保真立體音響(hoa)表示予以解碼的方法及裝置 |
| TW108142368A TWI728563B (zh) | 2014-06-27 | 2015-06-26 | 用於將聲音或聲場的高階保真立體音響(hoa)表示予以解碼的方法及裝置 |
Country Status (8)
| Country | Link |
|---|---|
| US (4) | US9792924B2 (enExample) |
| EP (3) | EP3162086B1 (enExample) |
| JP (5) | JP6641304B2 (enExample) |
| KR (5) | KR102381202B1 (enExample) |
| CN (7) | CN110415712B (enExample) |
| ES (1) | ES2974440T3 (enExample) |
| TW (3) | TWI809394B (enExample) |
| WO (1) | WO2015197514A1 (enExample) |
Families Citing this family (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2960903A1 (en) | 2014-06-27 | 2015-12-30 | Thomson Licensing | Method and apparatus for determining for the compression of an HOA data frame representation a lowest integer number of bits required for representing non-differential gain values |
| CN119864039A (zh) * | 2014-06-27 | 2025-04-22 | 杜比国际公司 | 针对hoa数据帧表示的压缩确定表示非差分增益值所需的最小整数比特数的方法 |
| JP6656182B2 (ja) * | 2014-06-27 | 2020-03-04 | ドルビー・インターナショナル・アーベー | Hoaデータ・フレーム表現のデータ・フレームの個々のもののチャネル信号に関連付けられた非差分的な利得値を含む符号化されたhoaデータ・フレーム表現 |
| DE102016104665A1 (de) * | 2016-03-14 | 2017-09-14 | Ask Industries Gmbh | Verfahren und Vorrichtung zur Aufbereitung eines verlustbehaftet komprimierten Audiosignals |
| US10332530B2 (en) * | 2017-01-27 | 2019-06-25 | Google Llc | Coding of a soundfield representation |
| US10015618B1 (en) * | 2017-08-01 | 2018-07-03 | Google Llc | Incoherent idempotent ambisonics rendering |
| US10264386B1 (en) * | 2018-02-09 | 2019-04-16 | Google Llc | Directional emphasis in ambisonics |
| GB2572761A (en) * | 2018-04-09 | 2019-10-16 | Nokia Technologies Oy | Quantization of spatial audio parameters |
| KR102824806B1 (ko) * | 2018-12-07 | 2025-06-25 | 프라운호퍼-게젤샤프트 추르 푀르데룽 데어 안제반텐 포르슝 에 파우 | 방향 컴포넌트 보상을 사용하는 DirAC 기반 공간 오디오 코딩과 관련된 인코딩, 디코딩, 장면 처리 및 기타 절차를 위한 장치, 방법 및 컴퓨터 프로그램 |
| BR112023001616A2 (pt) * | 2020-07-30 | 2023-02-23 | Fraunhofer Ges Forschung | Aparelho, método e programa de computador para codificar um sinal de áudio ou para decodificar uma cena de áudio codificada |
| CN116325525B (zh) * | 2020-10-22 | 2025-09-23 | 上海诺基亚贝尔股份有限公司 | 用于通信的装置执行的方法、相关装置 |
| CN113314129B (zh) * | 2021-04-30 | 2022-08-05 | 北京大学 | 一种适应环境的声场重放空间解码方法 |
| CN113345448B (zh) * | 2021-05-12 | 2022-08-05 | 北京大学 | 一种基于独立成分分析的hoa信号压缩方法 |
| CN115376528A (zh) * | 2021-05-17 | 2022-11-22 | 华为技术有限公司 | 三维音频信号编码方法、装置和编码器 |
| CN115376529B (zh) * | 2021-05-17 | 2024-10-11 | 华为技术有限公司 | 三维音频信号编码方法、装置和编码器 |
| CN115376530A (zh) * | 2021-05-17 | 2022-11-22 | 华为技术有限公司 | 三维音频信号编码方法、装置和编码器 |
| CN115497485B (zh) * | 2021-06-18 | 2024-10-18 | 华为技术有限公司 | 三维音频信号编码方法、装置、编码器和系统 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6664662B2 (en) * | 2000-02-28 | 2003-12-16 | Scania Cv Aktiebolag (Publ) | Method and device for control of an auxiliary unit in a motor vehicle |
| US20120155653A1 (en) * | 2010-12-21 | 2012-06-21 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
| US20130216070A1 (en) * | 2010-11-05 | 2013-08-22 | Florian Keiler | Data structure for higher order ambisonics audio data |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1138254C (zh) * | 2001-03-19 | 2004-02-11 | 北京阜国数字技术有限公司 | 一种基于小波变换的音频信号压缩编/解码方法 |
| AU2005219956B2 (en) * | 2004-03-01 | 2009-05-28 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
| CN1677492A (zh) * | 2004-04-01 | 2005-10-05 | 北京宫羽数字技术有限责任公司 | 一种增强音频编解码装置及方法 |
| EP1851866B1 (en) * | 2005-02-23 | 2011-08-17 | Telefonaktiebolaget LM Ericsson (publ) | Adaptive bit allocation for multi-channel audio encoding |
| US20080232601A1 (en) * | 2007-03-21 | 2008-09-25 | Ville Pulkki | Method and apparatus for enhancement of audio reconstruction |
| US8788264B2 (en) * | 2007-06-27 | 2014-07-22 | Nec Corporation | Audio encoding method, audio decoding method, audio encoding device, audio decoding device, program, and audio encoding/decoding system |
| US8509454B2 (en) * | 2007-11-01 | 2013-08-13 | Nokia Corporation | Focusing on a portion of an audio scene for an audio signal |
| EP2077551B1 (en) * | 2008-01-04 | 2011-03-02 | Dolby Sweden AB | Audio encoder and decoder |
| EP2301262B1 (en) * | 2008-06-17 | 2017-09-27 | Earlens Corporation | Optical electro-mechanical hearing devices with combined power and signal architectures |
| JP4512172B2 (ja) * | 2008-09-17 | 2010-07-28 | パナソニック株式会社 | 記録媒体、再生装置、及び集積回路 |
| KR102622947B1 (ko) * | 2010-03-26 | 2024-01-10 | 돌비 인터네셔널 에이비 | 오디오 재생을 위한 오디오 사운드필드 표현을 디코딩하는 방법 및 장치 |
| CA3125378C (en) * | 2010-04-09 | 2023-02-07 | Dolby International Ab | Audio upmixer operable in prediction or non-prediction mode |
| EP2541547A1 (en) * | 2011-06-30 | 2013-01-02 | Thomson Licensing | Method and apparatus for changing the relative positions of sound objects contained within a higher-order ambisonics representation |
| EP2637427A1 (en) * | 2012-03-06 | 2013-09-11 | Thomson Licensing | Method and apparatus for playback of a higher-order ambisonics audio signal |
| EP2645748A1 (en) * | 2012-03-28 | 2013-10-02 | Thomson Licensing | Method and apparatus for decoding stereo loudspeaker signals from a higher-order Ambisonics audio signal |
| EP2665208A1 (en) | 2012-05-14 | 2013-11-20 | Thomson Licensing | Method and apparatus for compressing and decompressing a Higher Order Ambisonics signal representation |
| EP2688066A1 (en) * | 2012-07-16 | 2014-01-22 | Thomson Licensing | Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction |
| KR102479737B1 (ko) * | 2012-07-16 | 2022-12-21 | 돌비 인터네셔널 에이비 | 오디오 재생을 위한 오디오 음장 표현을 렌더링하는 방법 및 장치 |
| EP2743922A1 (en) * | 2012-12-12 | 2014-06-18 | Thomson Licensing | Method and apparatus for compressing and decompressing a higher order ambisonics representation for a sound field |
| EP2800401A1 (en) | 2013-04-29 | 2014-11-05 | Thomson Licensing | Method and Apparatus for compressing and decompressing a Higher Order Ambisonics representation |
| EP2824661A1 (en) | 2013-07-11 | 2015-01-14 | Thomson Licensing | Method and Apparatus for generating from a coefficient domain representation of HOA signals a mixed spatial/coefficient domain representation of said HOA signals |
-
2015
- 2015-06-22 EP EP15729523.9A patent/EP3162086B1/en active Active
- 2015-06-22 US US15/319,707 patent/US9792924B2/en active Active
- 2015-06-22 CN CN201910861296.9A patent/CN110415712B/zh active Active
- 2015-06-22 WO PCT/EP2015/063914 patent/WO2015197514A1/en not_active Ceased
- 2015-06-22 CN CN201910861280.8A patent/CN110459229B/zh active Active
- 2015-06-22 EP EP24158677.5A patent/EP4354432A3/en active Pending
- 2015-06-22 CN CN201910861274.2A patent/CN110556120B/zh active Active
- 2015-06-22 KR KR1020167036547A patent/KR102381202B1/ko active Active
- 2015-06-22 KR KR1020247010754A patent/KR102816984B1/ko active Active
- 2015-06-22 JP JP2016575019A patent/JP6641304B2/ja active Active
- 2015-06-22 KR KR1020227035215A patent/KR102654275B1/ko active Active
- 2015-06-22 CN CN201580035125.0A patent/CN106471822B/zh active Active
- 2015-06-22 CN CN201910922110.6A patent/CN110662158B/zh active Active
- 2015-06-22 KR KR1020227010252A patent/KR102454747B1/ko active Active
- 2015-06-22 ES ES21159478T patent/ES2974440T3/es active Active
- 2015-06-22 EP EP21159478.3A patent/EP3860154B1/en active Active
- 2015-06-22 CN CN202311558626.XA patent/CN117612540A/zh active Pending
- 2015-06-22 CN CN202311556422.2A patent/CN117636885A/zh active Pending
- 2015-06-22 KR KR1020257018085A patent/KR20250085845A/ko active Pending
- 2015-06-26 TW TW110117878A patent/TWI809394B/zh active
- 2015-06-26 TW TW108142368A patent/TWI728563B/zh active
- 2015-06-26 TW TW104120627A patent/TWI679633B/zh active
-
2017
- 2017-09-12 US US15/702,418 patent/US10037764B2/en active Active
-
2018
- 2018-06-26 US US16/019,288 patent/US10262670B2/en active Active
-
2019
- 2019-04-08 US US16/377,661 patent/US10580426B2/en active Active
- 2019-12-27 JP JP2019237716A patent/JP6874115B2/ja active Active
-
2021
- 2021-04-21 JP JP2021071874A patent/JP7267340B2/ja active Active
-
2023
- 2023-04-19 JP JP2023068243A patent/JP7512470B2/ja active Active
-
2024
- 2024-06-26 JP JP2024102467A patent/JP7751696B2/ja active Active
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6664662B2 (en) * | 2000-02-28 | 2003-12-16 | Scania Cv Aktiebolag (Publ) | Method and device for control of an auxiliary unit in a motor vehicle |
| US20130216070A1 (en) * | 2010-11-05 | 2013-08-22 | Florian Keiler | Data structure for higher order ambisonics audio data |
| US20120155653A1 (en) * | 2010-12-21 | 2012-06-21 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
Also Published As
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI679633B (zh) | 對於高階保真立體音響資料框表示之壓縮判定用於描述非差分增益值表示的最低整數位元數之方法與設備 | |
| TWI686793B (zh) | 用於確定用於hoa資料框表示之壓縮的最低整數位元數的方法及設備,以及用於解碼聲音或聲場的壓縮的高階保真立體音響(hoa)聲音表示的方法及設備 | |
| TWI689916B (zh) | 用以判定用於描述將振幅變化對應為2之指數之非差分增益值之表示之最低整數位元數以用於hoa資料框表示壓縮之方法及裝置以及用於執行其的電腦程式產品、編碼之hoa資料框表示以及用於儲存其的儲存媒體,以及解碼聲音或聲場之壓縮高階保真立體音響(hoa)聲音表示之方法及裝置 | |
| TWI681385B (zh) | 對於高階保真立體音響資料框表示之壓縮判定用於描述非差分增益值表示的最低整數位元數之方法與裝置 | |
| TWI899581B (zh) | 用於將聲音或聲場的高階保真立體音響(hoa)表示予以解碼的方法及裝置 | |
| TWI903247B (zh) | 用以判定用於描述將振幅變化對應為2之指數之非差分增益值之表示之最低整數位元數以用於hoa資料框表示壓縮之方法及裝置以及用於執行其的電腦程式產品、編碼之hoa資料框表示以及用於儲存其的儲存媒體,以及解碼聲音或聲場之壓縮高階保真立體音響(hoa)聲音表示之方法及裝置 | |
| HK40039253B (zh) | 声音或声场的压缩hoa声音表示的解码方法和装置 | |
| HK1233104B (zh) | 针对hoa数据帧表示的压缩确定表示非差分增益值所需的最小整数比特数的设备 | |
| HK1233043B (zh) | 针对hoa数据帧表示的压缩确定表示非差分增益值所需的最小整数比特数的方法 | |
| HK1233044B (zh) | 针对hoa数据帧表示的压缩确定表示非差分增益值所需的最小整数比特数的方法和设备 | |
| HK1238407B (zh) | 包括与hoa数据帧表示的特定数据帧的通道信号关联的非差分增益值的编码hoa数据帧表示 |