JP6174266B2 - ブラインド帯域幅拡張のシステムおよび方法 - Google Patents
ブラインド帯域幅拡張のシステムおよび方法 Download PDFInfo
- Publication number
- JP6174266B2 JP6174266B2 JP2016539147A JP2016539147A JP6174266B2 JP 6174266 B2 JP6174266 B2 JP 6174266B2 JP 2016539147 A JP2016539147 A JP 2016539147A JP 2016539147 A JP2016539147 A JP 2016539147A JP 6174266 B2 JP6174266 B2 JP 6174266B2
- Authority
- JP
- Japan
- Prior art keywords
- parameters
- band
- low
- highband
- energy value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims description 159
- 238000013139 quantization Methods 0.000 claims description 190
- 230000005236 sound signal Effects 0.000 claims description 123
- 230000007704 transition Effects 0.000 claims description 68
- 239000011159 matrix material Substances 0.000 claims description 41
- 230000004044 response Effects 0.000 claims description 32
- 238000004891 communication Methods 0.000 claims description 4
- 230000009471 action Effects 0.000 claims description 3
- 238000010295 mobile communication Methods 0.000 claims 3
- 230000001131 transforming effect Effects 0.000 claims 2
- 230000008878 coupling Effects 0.000 claims 1
- 238000010168 coupling process Methods 0.000 claims 1
- 238000005859 coupling reaction Methods 0.000 claims 1
- 239000013598 vector Substances 0.000 description 244
- 238000001514 detection method Methods 0.000 description 19
- 238000010586 diagram Methods 0.000 description 14
- 230000006870 function Effects 0.000 description 13
- 230000015572 biosynthetic process Effects 0.000 description 10
- 238000003786 synthesis reaction Methods 0.000 description 10
- 238000012545 processing Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 238000012549 training Methods 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000012805 post-processing Methods 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 238000005311 autocorrelation function Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/0017—Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361916264P | 2013-12-15 | 2013-12-15 | |
| US61/916,264 | 2013-12-15 | ||
| US201461939148P | 2014-02-12 | 2014-02-12 | |
| US61/939,148 | 2014-02-12 | ||
| US14/334,921 US9524720B2 (en) | 2013-12-15 | 2014-07-18 | Systems and methods of blind bandwidth extension |
| US14/334,921 | 2014-07-18 | ||
| PCT/US2014/069045 WO2015088957A1 (en) | 2013-12-15 | 2014-12-08 | Systems and methods of blind bandwidth extension |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2016540255A JP2016540255A (ja) | 2016-12-22 |
| JP2016540255A5 JP2016540255A5 (enExample) | 2017-04-06 |
| JP6174266B2 true JP6174266B2 (ja) | 2017-08-02 |
Family
ID=53369245
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2016539147A Expired - Fee Related JP6174266B2 (ja) | 2013-12-15 | 2014-12-08 | ブラインド帯域幅拡張のシステムおよび方法 |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US9524720B2 (enExample) |
| EP (1) | EP3080808A1 (enExample) |
| JP (1) | JP6174266B2 (enExample) |
| KR (1) | KR20160097232A (enExample) |
| CN (1) | CN105814631A (enExample) |
| WO (2) | WO2015088957A1 (enExample) |
Families Citing this family (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN104301064B (zh) | 2013-07-16 | 2018-05-04 | 华为技术有限公司 | 处理丢失帧的方法和解码器 |
| US9524720B2 (en) | 2013-12-15 | 2016-12-20 | Qualcomm Incorporated | Systems and methods of blind bandwidth extension |
| US9729215B2 (en) * | 2014-06-23 | 2017-08-08 | Samsung Electronics Co., Ltd. | OFDM signal compression |
| CN105225666B (zh) | 2014-06-25 | 2016-12-28 | 华为技术有限公司 | 处理丢失帧的方法和装置 |
| CN105554332A (zh) * | 2016-01-22 | 2016-05-04 | 深圳市中兴物联科技股份有限公司 | 一种基于voip的语音连接方法和装置 |
| US20190051286A1 (en) * | 2017-08-14 | 2019-02-14 | Microsoft Technology Licensing, Llc | Normalization of high band signals in network telephony communications |
| JP6996185B2 (ja) * | 2017-09-15 | 2022-01-17 | 富士通株式会社 | 発話区間検出装置、発話区間検出方法及び発話区間検出用コンピュータプログラム |
| TWI869186B (zh) | 2018-01-26 | 2025-01-01 | 瑞典商都比國際公司 | 用於執行一音訊信號之高頻重建之方法、音訊處理單元及非暫時性電腦可讀媒體 |
| CN110322891B (zh) * | 2019-07-03 | 2021-12-10 | 南方科技大学 | 一种语音信号的处理方法、装置、终端及存储介质 |
| CN114822569B (zh) * | 2021-01-21 | 2025-07-25 | 腾讯科技(深圳)有限公司 | 音频信号处理方法、装置、设备及计算机可读存储介质 |
| CN113113030B (zh) * | 2021-03-22 | 2022-03-22 | 浙江大学 | 一种基于降噪自编码器的高维受损数据无线传输方法 |
Family Cites Families (47)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4521646A (en) * | 1980-06-26 | 1985-06-04 | Callaghan Edward P | Methods and apparatus for bandwidth reduction |
| WO1986003873A1 (en) * | 1984-12-20 | 1986-07-03 | Gte Laboratories Incorporated | Method and apparatus for encoding speech |
| JP3194481B2 (ja) * | 1991-10-22 | 2001-07-30 | 日本電信電話株式会社 | 音声符号化法 |
| JP2779886B2 (ja) | 1992-10-05 | 1998-07-23 | 日本電信電話株式会社 | 広帯域音声信号復元方法 |
| US5455888A (en) * | 1992-12-04 | 1995-10-03 | Northern Telecom Limited | Speech bandwidth extension method and apparatus |
| US5657423A (en) | 1993-02-22 | 1997-08-12 | Texas Instruments Incorporated | Hardware filter circuit and address circuitry for MPEG encoded data |
| US5715372A (en) * | 1995-01-10 | 1998-02-03 | Lucent Technologies Inc. | Method and apparatus for characterizing an input signal |
| FI102445B1 (fi) | 1996-02-08 | 1998-11-30 | Nokia Telecommunications Oy | Transmissiolaitteisto keskusten väliselle yhteydelle |
| FI106082B (fi) | 1996-12-05 | 2000-11-15 | Nokia Networks Oy | Menetelmä puhekanavan takaisinkytkemisen havaitsemiseksi sekä puheenkäsittelylaite |
| US6014623A (en) * | 1997-06-12 | 2000-01-11 | United Microelectronics Corp. | Method of encoding synthetic speech |
| US6044268A (en) * | 1997-07-16 | 2000-03-28 | Telefonaktiebolaget Lm Ericsson Ab | System and method for providing intercom and multiple voice channels in a private telephone system |
| DE19804581C2 (de) | 1998-02-05 | 2000-08-17 | Siemens Ag | Verfahren und Funk-Kommunikationssystem zur Übertragung von Sprachinformation |
| US6445686B1 (en) | 1998-09-03 | 2002-09-03 | Lucent Technologies Inc. | Method and apparatus for improving the quality of speech signals transmitted over wireless communication facilities |
| US6539355B1 (en) * | 1998-10-15 | 2003-03-25 | Sony Corporation | Signal band expanding method and apparatus and signal synthesis method and apparatus |
| US6226616B1 (en) * | 1999-06-21 | 2001-05-01 | Digital Theater Systems, Inc. | Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility |
| CN1335980A (zh) | 1999-11-10 | 2002-02-13 | 皇家菲利浦电子有限公司 | 借助于映射矩阵的宽频带语音合成 |
| US7088704B1 (en) * | 1999-12-10 | 2006-08-08 | Lucent Technologies Inc. | Transporting voice telephony and data via a single ATM transport link |
| US6704711B2 (en) | 2000-01-28 | 2004-03-09 | Telefonaktiebolaget Lm Ericsson (Publ) | System and method for modifying speech signals |
| JP2001282246A (ja) * | 2000-03-31 | 2001-10-12 | Kawai Musical Instr Mfg Co Ltd | 波形データ時間伸張圧縮装置 |
| US7330814B2 (en) * | 2000-05-22 | 2008-02-12 | Texas Instruments Incorporated | Wideband speech coding with modulated noise highband excitation system and method |
| FI109393B (fi) * | 2000-07-14 | 2002-07-15 | Nokia Corp | Menetelmä mediavirran enkoodaamiseksi skaalautuvasti, skaalautuva enkooderi ja päätelaite |
| US6842733B1 (en) | 2000-09-15 | 2005-01-11 | Mindspeed Technologies, Inc. | Signal processing system for filtering spectral content of a signal for speech coding |
| US7289461B2 (en) | 2001-03-15 | 2007-10-30 | Qualcomm Incorporated | Communications using wideband terminals |
| EP1400139B1 (en) | 2001-06-26 | 2006-06-07 | Nokia Corporation | Method for transcoding audio signals, network element, wireless communications network and communications system |
| US6988066B2 (en) * | 2001-10-04 | 2006-01-17 | At&T Corp. | Method of bandwidth extension for narrow-band speech |
| PT1423847E (pt) * | 2001-11-29 | 2005-05-31 | Coding Tech Ab | Reconstrucao de componentes de frequencia elevada |
| US20040138876A1 (en) * | 2003-01-10 | 2004-07-15 | Nokia Corporation | Method and apparatus for artificial bandwidth expansion in speech processing |
| FR2852172A1 (fr) * | 2003-03-04 | 2004-09-10 | France Telecom | Procede et dispositif de reconstruction spectrale d'un signal audio |
| KR100636145B1 (ko) * | 2004-06-04 | 2006-10-18 | 삼성전자주식회사 | 확장된 고해상도 오디오 신호 부호화 및 복호화 장치 |
| CN101006495A (zh) | 2004-08-31 | 2007-07-25 | 松下电器产业株式会社 | 语音编码装置、语音解码装置、通信装置以及语音编码方法 |
| JP4871501B2 (ja) | 2004-11-04 | 2012-02-08 | パナソニック株式会社 | ベクトル変換装置及びベクトル変換方法 |
| JP4903053B2 (ja) * | 2004-12-10 | 2012-03-21 | パナソニック株式会社 | 広帯域符号化装置、広帯域lsp予測装置、帯域スケーラブル符号化装置及び広帯域符号化方法 |
| UA93677C2 (ru) * | 2005-04-01 | 2011-03-10 | Квелкомм Инкорпорейтед | Способы и устройства кодирования и декодирования части речевого сигнала диапазона высоких частот |
| US7953604B2 (en) | 2006-01-20 | 2011-05-31 | Microsoft Corporation | Shape and scale parameters for extended-band frequency coding |
| US8295507B2 (en) * | 2006-11-09 | 2012-10-23 | Sony Corporation | Frequency band extending apparatus, frequency band extending method, player apparatus, playing method, program and recording medium |
| AU2007332508B2 (en) | 2006-12-13 | 2012-08-16 | Iii Holdings 12, Llc | Encoding device, decoding device, and method thereof |
| US8229106B2 (en) | 2007-01-22 | 2012-07-24 | D.S.P. Group, Ltd. | Apparatus and methods for enhancement of speech |
| US8392198B1 (en) | 2007-04-03 | 2013-03-05 | Arizona Board Of Regents For And On Behalf Of Arizona State University | Split-band speech compression based on loudness estimation |
| WO2010028292A1 (en) | 2008-09-06 | 2010-03-11 | Huawei Technologies Co., Ltd. | Adaptive frequency prediction |
| ES2374486T3 (es) | 2009-03-26 | 2012-02-17 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Dispositivo y método para manipular una señal de audio. |
| US8856011B2 (en) | 2009-11-19 | 2014-10-07 | Telefonaktiebolaget L M Ericsson (Publ) | Excitation signal bandwidth extension |
| CN101964189B (zh) * | 2010-04-28 | 2012-08-08 | 华为技术有限公司 | 语音频信号切换方法及装置 |
| RU2552184C2 (ru) | 2010-05-25 | 2015-06-10 | Нокиа Корпорейшн | Устройство для расширения полосы частот |
| KR101826331B1 (ko) * | 2010-09-15 | 2018-03-22 | 삼성전자주식회사 | 고주파수 대역폭 확장을 위한 부호화/복호화 장치 및 방법 |
| JP5707842B2 (ja) * | 2010-10-15 | 2015-04-30 | ソニー株式会社 | 符号化装置および方法、復号装置および方法、並びにプログラム |
| CN105469805B (zh) * | 2012-03-01 | 2018-01-12 | 华为技术有限公司 | 一种语音频信号处理方法和装置 |
| US9524720B2 (en) | 2013-12-15 | 2016-12-20 | Qualcomm Incorporated | Systems and methods of blind bandwidth extension |
-
2014
- 2014-07-18 US US14/334,921 patent/US9524720B2/en not_active Expired - Fee Related
- 2014-07-18 US US14/334,988 patent/US20150170655A1/en not_active Abandoned
- 2014-12-08 JP JP2016539147A patent/JP6174266B2/ja not_active Expired - Fee Related
- 2014-12-08 EP EP14827897.1A patent/EP3080808A1/en not_active Withdrawn
- 2014-12-08 CN CN201480065995.8A patent/CN105814631A/zh active Pending
- 2014-12-08 KR KR1020167016860A patent/KR20160097232A/ko not_active Abandoned
- 2014-12-08 WO PCT/US2014/069045 patent/WO2015088957A1/en not_active Ceased
- 2014-12-09 WO PCT/US2014/069336 patent/WO2015089066A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| US20150170655A1 (en) | 2015-06-18 |
| JP2016540255A (ja) | 2016-12-22 |
| KR20160097232A (ko) | 2016-08-17 |
| EP3080808A1 (en) | 2016-10-19 |
| CN105814631A (zh) | 2016-07-27 |
| US9524720B2 (en) | 2016-12-20 |
| WO2015088957A1 (en) | 2015-06-18 |
| WO2015089066A1 (en) | 2015-06-18 |
| US20150170654A1 (en) | 2015-06-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP6174266B2 (ja) | ブラインド帯域幅拡張のシステムおよび方法 | |
| TW497335B (en) | Method and apparatus for variable rate coding of speech | |
| TWI672691B (zh) | 解碼方法 | |
| JP5373217B2 (ja) | 可変レートスピーチ符号化 | |
| US9697843B2 (en) | High band excitation signal generation | |
| JP6470857B2 (ja) | 音声処理のための無声/有声判定 | |
| JP6290434B2 (ja) | オーディオ信号の高調波帯域幅拡張 | |
| ES2943588T3 (es) | Decodificador para generar una señal de audio mejorada en frecuencia, procedimiento de decodificación, codificador para generar una señal codificada y procedimiento de codificación que utiliza información lateral de selección compacta | |
| JP6526096B2 (ja) | 平均符号化レートを制御するためのシステムおよび方法 | |
| US9293143B2 (en) | Bandwidth extension mode selection | |
| CN102934163A (zh) | 用于宽带语音编码的系统、方法、设备和计算机程序产品 | |
| TW200912897A (en) | Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding | |
| CN101180676A (zh) | 用于谱包络表示的向量量化的方法和设备 | |
| TW201434033A (zh) | 用於判定音調脈衝週期信號界限之系統及方法 | |
| KR20110086919A (ko) | 에스엠브이 및 에이엠알 음성 부호화 기법을 위한 상호부호화 방법 및 장치 | |
| WO2025240194A1 (en) | Signal synthesis using temporal upsampling of input features | |
| WO2025240227A1 (en) | Generative audio codec for signal synthesis based on joint coding of spectral envelope features and pitch information | |
| WO2025240231A1 (en) | Generative audio codec for signal synthesis based on groupwise joint coding of spectral envelope features and pitch information |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20160825 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20170228 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20170228 |
|
| A871 | Explanation of circumstances concerning accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A871 Effective date: 20170228 |
|
| A975 | Report on accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A971005 Effective date: 20170427 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20170606 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20170705 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 6174266 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
| R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
| LAPS | Cancellation because of no payment of annual fees |