TWI669943B - Split gain shape vector coding - Google Patents
Split gain shape vector coding
- Publication number
- TWI669943B
- Authority
- TW
- Taiwan
- Prior art keywords
- vector
- segments
- segment
- avg
- final
- Prior art date
Links
- 239000013598 vector Substances 0.000 title claims abstract description 224
- 238000000034 method Methods 0.000 claims abstract description 88
- 238000004590 computer program Methods 0.000 claims description 13
- 238000012545 processing Methods 0.000 description 33
- 230000006870 function Effects 0.000 description 14
- 230000008859 change Effects 0.000 description 13
- 238000013139 quantization Methods 0.000 description 12
- 238000005516 engineering process Methods 0.000 description 8
- 238000004891 communication Methods 0.000 description 7
- 230000009471 action Effects 0.000 description 6
- 230000008901 benefit Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 238000005259 measurement Methods 0.000 description 4
- 230000014509 gene expression Effects 0.000 description 3
- 230000007423 decrease Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000008447 perception Effects 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008450 motivation Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000001568 sexual effect Effects 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/3082—Vector coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
- G10L19/038—Vector quantisation, e.g. TwinVQ audio
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
- H04B1/665—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using psychoacoustic properties of the ear, e.g. masking effect
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Error Detection And Correction (AREA)
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361903024P | 2013-11-12 | 2013-11-12 | |
| US61/903,024 | 2013-11-12 | | |
| PCT/SE2014/051339 | 2014-11-11 | | |
| PCT/SE2014/051339 WO2015072914A1 (en) | 2013-11-12 | 2014-11-11 | Split gain shape vector coding |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201532423A (zh) | 2015-08-16 |
| TWI669943B (zh) | 2019-08-21 |
Family
ID=52001045
Family Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW103139294A TWI669943B (zh) | 2013-11-12 | 2014-11-12 | Split gain shape vector coding |
| TW109142236A TWI776298B (zh) | 2013-11-12 | 2014-11-12 | Split gain shape vector coding |
| TW108125153A TWI708501B (zh) | 2013-11-12 | 2014-11-12 | Split gain shape vector coding |
Family Applications After (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| TW109142236A TWI776298B (zh) | 2013-11-12 | 2014-11-12 | Split gain shape vector coding |
| TW108125153A TWI708501B (zh) | 2013-11-12 | 2014-11-12 | Split gain shape vector coding |
Country Status (12)
| Country | Link |
|---|---|
| US (3) | US9385750B2 |
| EP (4) | EP3913808B1 |
| CN (3) | CN110649925B |
| AR (2) | AR099351A1 |
| BR (1) | BR112016009785B1 |
| DK (2) | DK3624347T3 |
| ES (3) | ES2773958T3 |
| MX (3) | MX352106B |
| PL (1) | PL3069449T3 |
| PT (2) | PT3624347T |
| TW (3) | TWI669943B |
| WO (1) | WO2015072914A1 |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| IN2014DN07726A * | 2012-03-29 | 2015-05-15 | Ericsson Telefon Ab L M | |
| MX352106B (es) * | 2013-11-12 | 2017-11-09 | Ericsson Telefon Ab L M | Split gain shape vector coding |
| MY203900A (en) | 2014-07-28 | 2024-07-23 | Ericsson Telefon Ab L M | Pyramid vector quantizer shape search |
| US10559315B2 (en) | 2018-03-28 | 2020-02-11 | Qualcomm Incorporated | Extended-range coarse-fine quantization for audio coding |
| US10762910B2 (en) | 2018-06-01 | 2020-09-01 | Qualcomm Incorporated | Hierarchical fine quantization for audio coding |
| CN111061907B (zh) * | 2019-12-10 | 2023-06-20 | 腾讯科技(深圳)有限公司 | Media data processing method, device and storage medium |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7310598B1 (en) * | 2002-04-12 | 2007-12-18 | University Of Central Florida Research Foundation, Inc. | Energy based split vector quantizer employing signal representation in multiple transform domains |
Family Cites Families (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7343291B2 (en) * | 2003-07-18 | 2008-03-11 | Microsoft Corporation | Multi-pass variable bitrate media encoding |
| CN1327408C (zh) * | 2004-12-31 | 2007-07-18 | 苏州大学 | A low bit rate speech coder |
| KR101768207B1 (ko) * | 2010-01-19 | 2017-08-16 | 삼성전자주식회사 | Method and apparatus for encoding and decoding a motion vector based on reduced motion vector predictor candidates |
| US20120029926A1 (en) * | 2010-07-30 | 2012-02-02 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for dependent-mode coding of audio signals |
| WO2012122299A1 (en) * | 2011-03-07 | 2012-09-13 | Xiph. Org. | Bit allocation and partitioning in gain-shape vector quantization for audio coding |
| EP2697795B1 (en) * | 2011-04-15 | 2015-06-17 | Telefonaktiebolaget L M Ericsson (PUBL) | Adaptive gain-shape rate sharing |
| KR20130047643A (ko) * | 2011-10-28 | 2013-05-08 | 한국전자통신연구원 | Apparatus and method for signal codec in a communication system |
| US9860604B2 (en) * | 2011-11-23 | 2018-01-02 | Oath Inc. | Systems and methods for internet video delivery |
| JP2013131918A (ja) * | 2011-12-21 | 2013-07-04 | Jvc Kenwood Corp | Moving picture decoding device, moving picture decoding method, and moving picture decoding program |
| MX352106B (es) * | 2013-11-12 | 2017-11-09 | Ericsson Telefon Ab L M | Split gain shape vector coding |
-
2014
- 2014-11-11 MX MX2016005806A patent/MX352106B/es active IP Right Grant
- 2014-11-11 DK DK19186188.9T patent/DK3624347T3/da active
- 2014-11-11 ES ES14805698T patent/ES2773958T3/es active Active
- 2014-11-11 WO PCT/SE2014/051339 patent/WO2015072914A1/en not_active Ceased
- 2014-11-11 EP EP21185475.7A patent/EP3913808B1/en active Active
- 2014-11-11 EP EP14805698.9A patent/EP3069449B1/en active Active
- 2014-11-11 CN CN201911003152.6A patent/CN110649925B/zh active Active
- 2014-11-11 US US14/440,713 patent/US9385750B2/en active Active
- 2014-11-11 ES ES19186188T patent/ES2891050T3/es active Active
- 2014-11-11 CN CN201480061092.2A patent/CN105706369B/zh active Active
- 2014-11-11 PL PL14805698T patent/PL3069449T3/pl unknown
- 2014-11-11 EP EP19186188.9A patent/EP3624347B1/en active Active
- 2014-11-11 PT PT191861889T patent/PT3624347T/pt unknown
- 2014-11-11 BR BR112016009785-8A patent/BR112016009785B1/pt active IP Right Grant
- 2014-11-11 PT PT148056989T patent/PT3069449T/pt unknown
- 2014-11-11 EP EP25168912.1A patent/EP4593009A3/en active Pending
- 2014-11-11 MX MX2019006311A patent/MX394518B/es unknown
- 2014-11-11 MX MX2017013371A patent/MX365684B/es unknown
- 2014-11-11 CN CN201911003154.5A patent/CN110708075B/zh active Active
- 2014-11-11 DK DK14805698.9T patent/DK3069449T3/da active
- 2014-11-11 ES ES21185475T patent/ES3019932T3/es active Active
- 2014-11-12 TW TW103139294A patent/TWI669943B/zh active
- 2014-11-12 TW TW109142236A patent/TWI776298B/zh active
- 2014-11-12 AR ARP140104266A patent/AR099351A1/es active IP Right Grant
- 2014-11-12 TW TW108125153A patent/TWI708501B/zh active
-
2016
- 2016-06-22 US US15/189,627 patent/US9602128B2/en active Active
-
2017
- 2017-02-07 US US15/426,483 patent/US9853659B2/en active Active
-
2018
- 2018-02-01 AR ARP180100239A patent/AR111014A2/es active IP Right Grant
Non-Patent Citations (1)
| Title |
|---|
| A. M. Kondoz, Digital Speech: Coding for Low Bit Rate Communication Systems, 2nd edition, John Wiley & Sons, 2006/01/03, pages 1, 2, 50-52 * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI669943B (zh) | | Split gain shape vector coding |
| RU2765886C1 (ru) | | Encoding and decoding of spectral peak positions |
| US9741349B2 (en) | | Audio coding method and apparatus |
| WO2020236976A1 (en) | | Linear neural reconstruction for deep neural network compression |
| WO2018044897A1 (en) | | Quantizer with index coding and bit scheduling |
| CN107666472B (zh) | | Method and device for hybrid digital-analog coding and decoding |
| TW202337209A (zh) | | Encoding and decoding method, apparatus, device, storage medium and computer program product |