BR112016025850B1 - Métodos para codificar um sinal de áudio e para discriminação de sinal de áudio, codificador para codificação de um sinal de áudio, discriminador de sinal de áudio, dispositivo de comunicação, e, meio de armazenamento legível por computador - Google Patents
Métodos para codificar um sinal de áudio e para discriminação de sinal de áudio, codificador para codificação de um sinal de áudio, discriminador de sinal de áudio, dispositivo de comunicação, e, meio de armazenamento legível por computador Download PDFInfo
- Publication number
- BR112016025850B1 BR112016025850B1 BR112016025850-9A BR112016025850A BR112016025850B1 BR 112016025850 B1 BR112016025850 B1 BR 112016025850B1 BR 112016025850 A BR112016025850 A BR 112016025850A BR 112016025850 B1 BR112016025850 B1 BR 112016025850B1
- Authority
- BR
- Brazil
- Prior art keywords
- audio signal
- peak
- coefficients
- spectral
- encoding
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 55
- 238000000034 method Methods 0.000 title claims abstract description 45
- 238000004891 communication Methods 0.000 title claims abstract description 18
- 230000003595 spectral effect Effects 0.000 claims abstract description 44
- 238000012545 processing Methods 0.000 description 19
- 238000005516 engineering process Methods 0.000 description 17
- 230000006870 function Effects 0.000 description 7
- 238000004422 calculation algorithm Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 238000004590 computer program Methods 0.000 description 3
- 238000012935 Averaging Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 238000010183 spectrum analysis Methods 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/81—Detection of presence or absence of voice signals for discriminating voice from music
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201461990354P | 2014-05-08 | 2014-05-08 | |
| US61/990354 | 2014-05-08 | ||
| PCT/SE2015/050503 WO2015171061A1 (en) | 2014-05-08 | 2015-05-07 | Audio signal discriminator and coder |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| BR112016025850A2 BR112016025850A2 (https=) | 2017-08-15 |
| BR112016025850B1 true BR112016025850B1 (pt) | 2022-08-16 |
Family
ID=53200274
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| BR112016025850-9A BR112016025850B1 (pt) | 2014-05-08 | 2015-05-07 | Métodos para codificar um sinal de áudio e para discriminação de sinal de áudio, codificador para codificação de um sinal de áudio, discriminador de sinal de áudio, dispositivo de comunicação, e, meio de armazenamento legível por computador |
Country Status (11)
| Country | Link |
|---|---|
| US (3) | US9620138B2 (https=) |
| EP (3) | EP3594948B1 (https=) |
| CN (3) | CN110619891B (https=) |
| BR (1) | BR112016025850B1 (https=) |
| DK (2) | DK3379535T3 (https=) |
| ES (3) | ES2690577T3 (https=) |
| HU (1) | HUE046477T2 (https=) |
| MX (2) | MX356883B (https=) |
| MY (1) | MY182165A (https=) |
| PL (2) | PL3140831T3 (https=) |
| WO (1) | WO2015171061A1 (https=) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| MY176776A (en) | 2013-10-18 | 2020-08-21 | Ericsson Telefon Ab L M | Coding and decoding of spectral peak positions |
| CN110619891B (zh) * | 2014-05-08 | 2023-01-17 | 瑞典爱立信有限公司 | 音频信号区分器和编码器 |
| CN112992164B (zh) * | 2014-07-28 | 2024-12-06 | 日本电信电话株式会社 | 编码方法、装置、程序产品以及记录介质 |
| CN110211580B (zh) * | 2019-05-15 | 2021-07-16 | 海尔优家智能科技(北京)有限公司 | 多智能设备应答方法、装置、系统及存储介质 |
| CA3184152A1 (en) * | 2020-06-30 | 2022-01-06 | Rivarol VERGIN | Cumulative average spectral entropy analysis for tone and speech classification |
| CN113890492B (zh) * | 2021-10-09 | 2025-07-18 | 深圳市创成微电子有限公司 | 音频功率放大器的供电电压控制方法、控制器和音频设备 |
| US20250201255A1 (en) * | 2023-12-13 | 2025-06-19 | Qualcomm Incorporated | Content-based switchable audio codec |
Family Cites Families (26)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1146130C (zh) * | 1998-05-27 | 2004-04-14 | 微软公司 | 输入信号处理系统的编码器和屏蔽频信号量化噪声方法 |
| US6226608B1 (en) * | 1999-01-28 | 2001-05-01 | Dolby Laboratories Licensing Corporation | Data framing for adaptive-block-length coding system |
| US6959274B1 (en) * | 1999-09-22 | 2005-10-25 | Mindspeed Technologies, Inc. | Fixed rate speech compression system and method |
| US6785645B2 (en) * | 2001-11-29 | 2004-08-31 | Microsoft Corporation | Real-time speech and music classifier |
| KR100762596B1 (ko) * | 2006-04-05 | 2007-10-01 | 삼성전자주식회사 | 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법 |
| US20070282601A1 (en) * | 2006-06-02 | 2007-12-06 | Texas Instruments Inc. | Packet loss concealment for a conjugate structure algebraic code excited linear prediction decoder |
| CN101145345B (zh) * | 2006-09-13 | 2011-02-09 | 华为技术有限公司 | 音频分类方法 |
| US8990073B2 (en) * | 2007-06-22 | 2015-03-24 | Voiceage Corporation | Method and device for sound activity detection and sound signal classification |
| CN101399039B (zh) * | 2007-09-30 | 2011-05-11 | 华为技术有限公司 | 一种确定非噪声音频信号类别的方法及装置 |
| KR101599875B1 (ko) * | 2008-04-17 | 2016-03-14 | 삼성전자주식회사 | 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 부호화 방법 및 장치, 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 복호화 방법 및 장치 |
| PL2346029T3 (pl) * | 2008-07-11 | 2013-11-29 | Fraunhofer Ges Forschung | Koder sygnału audio, sposób kodowania sygnału audio i odpowiadający mu program komputerowy |
| EP2210944A1 (en) | 2009-01-22 | 2010-07-28 | ATG:biosynthetics GmbH | Methods for generation of RNA and (poly)peptide libraries and their use |
| CN102044246B (zh) * | 2009-10-15 | 2012-05-23 | 华为技术有限公司 | 一种音频信号检测方法和装置 |
| KR101754970B1 (ko) * | 2010-01-12 | 2017-07-06 | 삼성전자주식회사 | 무선 통신 시스템의 채널 상태 측정 기준신호 처리 장치 및 방법 |
| US9652999B2 (en) * | 2010-04-29 | 2017-05-16 | Educational Testing Service | Computer-implemented systems and methods for estimating word accuracy for automatic speech recognition |
| US8977542B2 (en) * | 2010-07-16 | 2015-03-10 | Telefonaktiebolaget L M Ericsson (Publ) | Audio encoder and decoder and methods for encoding and decoding an audio signal |
| RU2010152225A (ru) * | 2010-12-20 | 2012-06-27 | ЭлЭсАй Корпорейшн (US) | Обнаружение музыки с использованием анализа спектральных пиков |
| CN102982804B (zh) * | 2011-09-02 | 2017-05-03 | 杜比实验室特许公司 | 音频分类方法和系统 |
| CN102522082B (zh) * | 2011-12-27 | 2013-07-10 | 重庆大学 | 一种公共场所异常声音的识别与定位方法 |
| US9111531B2 (en) * | 2012-01-13 | 2015-08-18 | Qualcomm Incorporated | Multiple coding mode signal classification |
| US20130282373A1 (en) * | 2012-04-23 | 2013-10-24 | Qualcomm Incorporated | Systems and methods for audio signal processing |
| AU2013283568B2 (en) * | 2012-06-28 | 2016-05-12 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Linear prediction based audio coding using improved probability distribution estimation |
| US9401153B2 (en) * | 2012-10-15 | 2016-07-26 | Digimarc Corporation | Multi-mode audio recognition and auxiliary data encoding and decoding |
| CN110619891B (zh) * | 2014-05-08 | 2023-01-17 | 瑞典爱立信有限公司 | 音频信号区分器和编码器 |
| WO2015168925A1 (en) | 2014-05-09 | 2015-11-12 | Qualcomm Incorporated | Restricted aperiodic csi measurement reporting in enhanced interference management and traffic adaptation |
| TWI602172B (zh) * | 2014-08-27 | 2017-10-11 | 弗勞恩霍夫爾協會 | 使用參數以加強隱蔽之用於編碼及解碼音訊內容的編碼器、解碼器及方法 |
-
2015
- 2015-05-07 CN CN201910918149.0A patent/CN110619891B/zh active Active
- 2015-05-07 CN CN201910919030.5A patent/CN110619892B/zh active Active
- 2015-05-07 HU HUE18172361A patent/HUE046477T2/hu unknown
- 2015-05-07 EP EP19195287.8A patent/EP3594948B1/en active Active
- 2015-05-07 ES ES15724098.7T patent/ES2690577T3/es active Active
- 2015-05-07 BR BR112016025850-9A patent/BR112016025850B1/pt active IP Right Grant
- 2015-05-07 DK DK18172361.0T patent/DK3379535T3/da active
- 2015-05-07 EP EP18172361.0A patent/EP3379535B1/en active Active
- 2015-05-07 PL PL15724098T patent/PL3140831T3/pl unknown
- 2015-05-07 ES ES18172361T patent/ES2763280T3/es active Active
- 2015-05-07 CN CN201580023968.9A patent/CN106463141B/zh active Active
- 2015-05-07 ES ES19195287T patent/ES2874757T3/es active Active
- 2015-05-07 US US14/649,689 patent/US9620138B2/en active Active
- 2015-05-07 MX MX2016014534A patent/MX356883B/es active IP Right Grant
- 2015-05-07 EP EP15724098.7A patent/EP3140831B1/en active Active
- 2015-05-07 MY MYPI2016703844A patent/MY182165A/en unknown
- 2015-05-07 PL PL19195287T patent/PL3594948T3/pl unknown
- 2015-05-07 WO PCT/SE2015/050503 patent/WO2015171061A1/en not_active Ceased
- 2015-05-07 DK DK15724098.7T patent/DK3140831T3/en active
-
2016
- 2016-11-04 MX MX2018007257A patent/MX2018007257A/es unknown
-
2017
- 2017-03-07 US US15/451,551 patent/US10242687B2/en active Active
-
2019
- 2019-02-14 US US16/275,701 patent/US10984812B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| MX2018007257A (es) | 2022-08-25 |
| US20160086615A1 (en) | 2016-03-24 |
| HUE046477T2 (hu) | 2020-03-30 |
| EP3140831A1 (en) | 2017-03-15 |
| EP3140831B1 (en) | 2018-07-11 |
| CN106463141B (zh) | 2019-11-01 |
| MY182165A (en) | 2021-01-18 |
| EP3594948A1 (en) | 2020-01-15 |
| PL3140831T3 (pl) | 2018-12-31 |
| MX2016014534A (es) | 2017-02-20 |
| MX356883B (es) | 2018-06-19 |
| DK3140831T3 (en) | 2018-10-15 |
| CN106463141A (zh) | 2017-02-22 |
| CN110619892B (zh) | 2023-04-11 |
| ES2763280T3 (es) | 2020-05-27 |
| WO2015171061A1 (en) | 2015-11-12 |
| US9620138B2 (en) | 2017-04-11 |
| US20190198032A1 (en) | 2019-06-27 |
| US20170178660A1 (en) | 2017-06-22 |
| PL3594948T3 (pl) | 2021-08-30 |
| DK3379535T3 (da) | 2019-12-16 |
| CN110619892A (zh) | 2019-12-27 |
| ES2874757T3 (es) | 2021-11-05 |
| US10984812B2 (en) | 2021-04-20 |
| CN110619891B (zh) | 2023-01-17 |
| ES2690577T3 (es) | 2018-11-21 |
| EP3594948B1 (en) | 2021-03-03 |
| EP3379535B1 (en) | 2019-09-18 |
| BR112016025850A2 (https=) | 2017-08-15 |
| EP3379535A1 (en) | 2018-09-26 |
| CN110619891A (zh) | 2019-12-27 |
| US10242687B2 (en) | 2019-03-26 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| BR112016025850B1 (pt) | Métodos para codificar um sinal de áudio e para discriminação de sinal de áudio, codificador para codificação de um sinal de áudio, discriminador de sinal de áudio, dispositivo de comunicação, e, meio de armazenamento legível por computador | |
| CN106415717B (zh) | 音频信号分类和编码 | |
| BR112014017708B1 (pt) | Método e aparelho para detectar atividade de voz na presença de ruído de fundo, e, memória legível por computador | |
| CN104469804B (zh) | 物理下行控制信道的盲检方法 | |
| WO2019001252A1 (zh) | 时延估计方法及装置 | |
| CN106452627B (zh) | 一种用于宽带频谱感知的噪声功率估计方法和装置 | |
| CN109219055A (zh) | 一种主用户占空比估计方法 | |
| Liu et al. | Adaptive compressive spectrum sensing using a deterministic estimation model for wideband cognitive radios | |
| CN103915099A (zh) | 语音基音周期检测方法和装置 | |
| WO2013000240A1 (zh) | 一种多节点联合的频谱感知方法和系统 | |
| Treeumnuk et al. | Energy detector with adaptive sensing window for improved spectrum utilization in dynamic cognitive radio systems | |
| CN117612286B (zh) | 一种楼堂馆所门禁管理系统及其控制方法 | |
| CN113765607B (zh) | 一种频谱检测方法及装置 | |
| Song et al. | Voice Activity Detection Based on Generalized Normal-Laplace Distribution Incorporating Conditional MAP | |
| Wen et al. | Deformation analysis of dam with the improved wavelet threshold | |
| Liu | Traffic-Aware Spectrum Sharing Protocols | |
| JP2026501166A (ja) | マルチモードオーディオデコーダにおける改善された遷移 | |
| Zhang et al. | Dynamic-Dual-Threshold Cooperative Spectrum Sensing Algorithm Based on DS Evidence Theory | |
| CN117376972A (zh) | 一种移动网络流量压抑的检测方法、装置、设备及介质 | |
| CN114257339A (zh) | Pdcch盲检方法、装置、电子设备和存储介质 | |
| Kopytov et al. | Persistent Short Time Series Data Acquisition Algorithm for Wireless Smart Sensor Networks | |
| Cai | Wi-fi-based indoor positioning by distributed machine-learning data analytics on smart gateways | |
| WO2018127156A1 (zh) | 编码方法和编码器 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| B06U | Preliminary requirement: requests with searches performed by other patent offices: procedure suspended [chapter 6.21 patent gazette] | ||
| B350 | Update of information on the portal [chapter 15.35 patent gazette] | ||
| B09A | Decision: intention to grant [chapter 9.1 patent gazette] | ||
| B16A | Patent or certificate of addition of invention granted [chapter 16.1 patent gazette] |
Free format text: PRAZO DE VALIDADE: 20 (VINTE) ANOS CONTADOS A PARTIR DE 07/05/2015, OBSERVADAS AS CONDICOES LEGAIS |