TWI543152B - 用以產生音訊輸出信號或多個頻譜型樣之裝置及方法、及相關電腦程 式 - Google Patents
用以產生音訊輸出信號或多個頻譜型樣之裝置及方法、及相關電腦程 式 Download PDFInfo
- Publication number
- TWI543152B TWI543152B TW102136550A TW102136550A TWI543152B TW I543152 B TWI543152 B TW I543152B TW 102136550 A TW102136550 A TW 102136550A TW 102136550 A TW102136550 A TW 102136550A TW I543152 B TWI543152 B TW I543152B
- Authority
- TW
- Taiwan
- Prior art keywords
- spectral
- coefficients
- pattern
- frequency
- spectrum
- Prior art date
Links
- 230000003595 spectral effect Effects 0.000 title claims description 656
- 238000000034 method Methods 0.000 title claims description 54
- 238000004590 computer program Methods 0.000 title claims description 13
- 238000001228 spectrum Methods 0.000 claims description 335
- 230000005236 sound signal Effects 0.000 claims description 181
- 238000012545 processing Methods 0.000 claims description 46
- 230000006978 adaptation Effects 0.000 claims description 29
- 238000006243 chemical reaction Methods 0.000 claims description 17
- 238000012805 post-processing Methods 0.000 claims description 17
- 230000001131 transforming effect Effects 0.000 claims description 7
- 230000003044 adaptive effect Effects 0.000 claims description 4
- 230000008569 process Effects 0.000 claims description 4
- 239000003607 modifier Substances 0.000 description 37
- 230000004048 modification Effects 0.000 description 12
- 238000012986 modification Methods 0.000 description 12
- 230000005484 gravity Effects 0.000 description 11
- 230000008439 repair process Effects 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 6
- 238000005070 sampling Methods 0.000 description 6
- 239000002131 composite material Substances 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 238000013139 quantization Methods 0.000 description 5
- 230000014759 maintenance of location Effects 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 230000000007 visual effect Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 230000002194 synthesizing effect Effects 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000013213 extrapolation Methods 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000007493 shaping process Methods 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 229910001369 Brass Inorganic materials 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000010951 brass Substances 0.000 description 1
- 230000001149 cognitive effect Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012804 iterative process Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000000059 patterning Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000000682 scanning probe acoustic microscopy Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Computer Networks & Wireless Communication (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201261712013P | 2012-10-10 | 2012-10-10 | |
| EP12199266.3A EP2720222A1 (en) | 2012-10-10 | 2012-12-21 | Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns |
| PCT/EP2013/069592 WO2014056705A1 (en) | 2012-10-10 | 2013-09-20 | Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| TW201419268A TW201419268A (zh) | 2014-05-16 |
| TWI543152B true TWI543152B (zh) | 2016-07-21 |
Family
ID=47715790
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| TW102136550A TWI543152B (zh) | 2012-10-10 | 2013-10-09 | 用以產生音訊輸出信號或多個頻譜型樣之裝置及方法、及相關電腦程 式 |
Country Status (17)
| Country | Link |
|---|---|
| US (1) | US9570085B2 (enExample) |
| EP (3) | EP2720222A1 (enExample) |
| JP (3) | JP6563338B2 (enExample) |
| KR (1) | KR101777485B1 (enExample) |
| CN (1) | CN104903956B (enExample) |
| AR (1) | AR092958A1 (enExample) |
| AU (3) | AU2013329734B2 (enExample) |
| BR (1) | BR112015008114B1 (enExample) |
| CA (2) | CA2944927C (enExample) |
| ES (1) | ES2896016T3 (enExample) |
| MX (1) | MX344955B (enExample) |
| MY (1) | MY193732A (enExample) |
| RU (1) | RU2633136C2 (enExample) |
| SG (2) | SG10201702285QA (enExample) |
| TW (1) | TWI543152B (enExample) |
| WO (1) | WO2014056705A1 (enExample) |
| ZA (1) | ZA201503152B (enExample) |
Families Citing this family (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2963645A1 (en) | 2014-07-01 | 2016-01-06 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Calculator and method for determining phase correction data for an audio signal |
| EP2980791A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Processor, method and computer program for processing an audio signal using truncated analysis or synthesis window overlap portions |
| WO2016091893A1 (en) * | 2014-12-09 | 2016-06-16 | Dolby International Ab | Mdct-domain error concealment |
| EP3107096A1 (en) | 2015-06-16 | 2016-12-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Downscaled decoding |
| US10146500B2 (en) | 2016-08-31 | 2018-12-04 | Dts, Inc. | Transform-based audio codec and method with subband energy smoothing |
| US10362423B2 (en) | 2016-10-13 | 2019-07-23 | Qualcomm Incorporated | Parametric audio decoding |
| CN108074588B (zh) * | 2016-11-15 | 2020-12-01 | 北京唱吧科技股份有限公司 | 一种音高计算方法及装置 |
| US10638227B2 (en) | 2016-12-02 | 2020-04-28 | Dirac Research Ab | Processing of an audio input signal |
| EP4235662B1 (en) | 2017-01-10 | 2025-10-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier |
| CN106847294B (zh) * | 2017-01-17 | 2018-11-30 | 百度在线网络技术(北京)有限公司 | 基于人工智能的音频处理方法和装置 |
| US10210874B2 (en) * | 2017-02-03 | 2019-02-19 | Qualcomm Incorporated | Multi channel coding |
| KR102837794B1 (ko) * | 2019-07-02 | 2025-07-24 | 한국전자통신연구원 | 오디오의 고대역 부호화 방법 및 고대역 복호화 방법, 그리고 상기 방법을 수하는 부호화기 및 복호화기 |
| CN110867194B (zh) * | 2019-11-05 | 2022-05-17 | 腾讯音乐娱乐科技(深圳)有限公司 | 音频的评分方法、装置、设备及存储介质 |
| JP7491395B2 (ja) * | 2020-11-05 | 2024-05-28 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
Family Cites Families (30)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| PL173718B1 (pl) * | 1993-06-30 | 1998-04-30 | Sony Corp | Sposób i urządzenie do kodowania sygnałów cyfrowych |
| ATE276607T1 (de) * | 1994-04-01 | 2004-10-15 | Sony Corp | Verfahren und vorrichtung zum kodieren und dekodieren von nachrichten |
| TW384434B (en) * | 1997-03-31 | 2000-03-11 | Sony Corp | Encoding method, device therefor, decoding method, device therefor and recording medium |
| DE60017825T2 (de) * | 1999-03-23 | 2006-01-12 | Nippon Telegraph And Telephone Corp. | Verfahren und Vorrichtung zur Kodierung und Dekodierung von Audiosignalen und Aufzeichnungsträger mit Programmen dafür |
| WO2001052241A1 (fr) * | 2000-01-11 | 2001-07-19 | Matsushita Electric Industrial Co., Ltd. | Dispositif de codage vocal multimode et dispositif de decodage |
| EP1335496B1 (en) * | 2000-12-14 | 2009-06-10 | Sony Corporation | Coding and decoding |
| JP2002311996A (ja) * | 2001-02-09 | 2002-10-25 | Sony Corp | コンテンツ供給システム |
| JP4534382B2 (ja) * | 2001-02-09 | 2010-09-01 | ソニー株式会社 | 符号列生成装置及び方法、信号再生装置及び方法、並びにコンテンツ供給システム |
| JP2003029797A (ja) * | 2001-05-11 | 2003-01-31 | Matsushita Electric Ind Co Ltd | 符号化装置、復号化装置および放送システム |
| JP4012506B2 (ja) | 2001-08-24 | 2007-11-21 | 株式会社ケンウッド | 信号の周波数成分を適応的に補間するための装置および方法 |
| ES2294300T3 (es) * | 2002-07-12 | 2008-04-01 | Koninklijke Philips Electronics N.V. | Codificacion de audio. |
| CN1714584B (zh) * | 2002-12-20 | 2010-05-05 | 诺基亚有限公司 | 采用元信息来组织用户提供信息的方法及装置 |
| US7318035B2 (en) * | 2003-05-08 | 2008-01-08 | Dolby Laboratories Licensing Corporation | Audio coding systems and methods using spectral component coupling and spectral component regeneration |
| JP2007509363A (ja) | 2003-10-13 | 2007-04-12 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | オーディオ符号化方法及び装置 |
| US7693709B2 (en) * | 2005-07-15 | 2010-04-06 | Microsoft Corporation | Reordering coefficients for waveform coding or decoding |
| TWI330355B (en) * | 2005-12-05 | 2010-09-11 | Qualcomm Inc | Systems, methods, and apparatus for detection of tonal components |
| KR101346358B1 (ko) * | 2006-09-18 | 2013-12-31 | 삼성전자주식회사 | 대역폭 확장 기법을 이용한 오디오 신호의 부호화/복호화방법 및 장치 |
| US8041578B2 (en) * | 2006-10-18 | 2011-10-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Encoding an information signal |
| JP2008268384A (ja) * | 2007-04-17 | 2008-11-06 | Nec Lcd Technologies Ltd | 液晶表示装置 |
| US8527265B2 (en) * | 2007-10-22 | 2013-09-03 | Qualcomm Incorporated | Low-complexity encoding/decoding of quantized MDCT spectrum in scalable speech and audio codecs |
| US20100324708A1 (en) * | 2007-11-27 | 2010-12-23 | Nokia Corporation | encoder |
| EP2107556A1 (en) * | 2008-04-04 | 2009-10-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio transform coding using pitch correction |
| KR101576318B1 (ko) * | 2008-08-08 | 2015-12-09 | 파나소닉 인텔렉츄얼 프로퍼티 코포레이션 오브 아메리카 | 스펙트럼 평활화 장치, 부호화 장치, 복호 장치, 통신 단말 장치, 기지국 장치 및 스펙트럼 평활화 방법 |
| EP2407965B1 (en) | 2009-03-31 | 2012-12-12 | Huawei Technologies Co., Ltd. | Method and device for audio signal denoising |
| EP2237266A1 (en) * | 2009-04-03 | 2010-10-06 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for determining a plurality of local center of gravity frequencies of a spectrum of an audio signal |
| ES2400661T3 (es) * | 2009-06-29 | 2013-04-11 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificación y decodificación de extensión de ancho de banda |
| MY163358A (en) * | 2009-10-08 | 2017-09-15 | Fraunhofer-Gesellschaft Zur Förderung Der Angenwandten Forschung E V | Multi-mode audio signal decoder,multi-mode audio signal encoder,methods and computer program using a linear-prediction-coding based noise shaping |
| EP2676268B1 (en) * | 2011-02-14 | 2014-12-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing a decoded audio signal in a spectral domain |
| CN103582913B (zh) * | 2011-04-28 | 2016-05-11 | 杜比国际公司 | 有效内容分类及响度估计 |
| FR2996047B1 (fr) * | 2012-09-27 | 2014-09-05 | Renault Sa | Dispositif inductif limitant les oscillations acoustiques |
-
2012
- 2012-12-21 EP EP12199266.3A patent/EP2720222A1/en not_active Withdrawn
-
2013
- 2013-09-20 WO PCT/EP2013/069592 patent/WO2014056705A1/en not_active Ceased
- 2013-09-20 RU RU2015117432A patent/RU2633136C2/ru active
- 2013-09-20 JP JP2015536045A patent/JP6563338B2/ja active Active
- 2013-09-20 KR KR1020157011967A patent/KR101777485B1/ko active Active
- 2013-09-20 BR BR112015008114-2A patent/BR112015008114B1/pt active IP Right Grant
- 2013-09-20 AU AU2013329734A patent/AU2013329734B2/en active Active
- 2013-09-20 ES ES13766036T patent/ES2896016T3/es active Active
- 2013-09-20 SG SG10201702285QA patent/SG10201702285QA/en unknown
- 2013-09-20 CA CA2944927A patent/CA2944927C/en active Active
- 2013-09-20 MY MYPI2015000889A patent/MY193732A/en unknown
- 2013-09-20 EP EP13766036.1A patent/EP2907132B1/en active Active
- 2013-09-20 SG SG11201502744YA patent/SG11201502744YA/en unknown
- 2013-09-20 MX MX2015004506A patent/MX344955B/es active IP Right Grant
- 2013-09-20 CN CN201380064128.8A patent/CN104903956B/zh active Active
- 2013-09-20 CA CA2887188A patent/CA2887188C/en active Active
- 2013-09-20 EP EP16193357.7A patent/EP3133598A1/en not_active Withdrawn
- 2013-10-09 TW TW102136550A patent/TWI543152B/zh active
- 2013-10-09 AR ARP130103664A patent/AR092958A1/es active IP Right Grant
-
2015
- 2015-04-08 US US14/682,015 patent/US9570085B2/en active Active
- 2015-05-08 ZA ZA2015/03152A patent/ZA201503152B/en unknown
-
2016
- 2016-12-21 AU AU2016277636A patent/AU2016277636A1/en not_active Abandoned
-
2017
- 2017-11-13 JP JP2017217969A patent/JP6789915B2/ja active Active
-
2018
- 2018-10-19 AU AU2018250490A patent/AU2018250490B2/en active Active
-
2019
- 2019-08-14 JP JP2019148934A patent/JP7005564B2/ja active Active
Also Published As
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| TWI543152B (zh) | 用以產生音訊輸出信號或多個頻譜型樣之裝置及方法、及相關電腦程 式 | |
| TWI503815B (zh) | 用以利用正弦代換進行音訊編碼及解碼之裝置和方法 | |
| HK1234887A1 (en) | Apparatus and method for generating a plurality of spectral patterns | |
| HK1213688B (en) | Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns | |
| HK1192640B (en) | Apparatus and method for audio encoding and decoding employing sinusoidal substitution |