MX2018007257A - Audio signal discriminator and coder - Google Patents
Audio signal discriminator and coder
- Publication number
- MX2018007257A
- Authority
- MX
- Mexico
- Prior art keywords
- audio signal
- coding
- coder
- signal discriminator
- segment
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/81—Detection of presence or absence of voice signals for discriminating voice from music
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention relates to a codec and a discriminator, and to methods therein, for audio signal discrimination and coding. Embodiments of a method performed by an encoder comprise, for a segment of the audio signal: identifying a set of spectral peaks; determining a mean distance S between peaks in the set; and determining a ratio, PNR, between a peak envelope and a noise floor envelope. The method further comprises selecting a coding mode, out of a plurality of coding modes, based at least on the mean distance S and the ratio PNR; and applying the selected coding mode for coding of the segment of the audio signal.
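The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the peak picker, the envelope trackers, the thresholds, the direction of the comparison, and the mode labels `mode_a` and `mode_b` are all hypothetical choices made for illustration. The abstract only states that the mode is selected based at least on S and PNR.

```python
from statistics import mean


def find_spectral_peaks(mag, threshold):
    """Return bin indices of local maxima whose magnitude exceeds `threshold`.

    Assumption: a simple local-maximum test stands in for peak identification.
    """
    return [
        k for k in range(1, len(mag) - 1)
        if mag[k] > threshold and mag[k] > mag[k - 1] and mag[k] >= mag[k + 1]
    ]


def mean_peak_distance(peaks):
    """Mean distance S (in bins) between neighbouring peaks, or None if < 2 peaks."""
    if len(peaks) < 2:
        return None
    return (peaks[-1] - peaks[0]) / (len(peaks) - 1)


def peak_to_noise_floor_ratio(mag, decay=0.9, eps=1e-12):
    """PNR as the average peak envelope divided by the average noise-floor envelope.

    Assumption: both envelopes are one-pole trackers run across frequency bins.
    """
    peak_env, floor_env = [mag[0]], [mag[0]]
    for m in mag[1:]:
        # Peak envelope jumps up to new maxima and decays slowly otherwise.
        peak_env.append(max(m, decay * peak_env[-1]))
        # Noise-floor envelope drops to new minima and rises slowly otherwise.
        floor_env.append(min(m, floor_env[-1] / decay))
    return mean(peak_env) / max(mean(floor_env), eps)


def select_coding_mode(mag, peak_threshold, s_threshold, pnr_threshold):
    """Pick one of two hypothetical modes from S and PNR for one segment."""
    peaks = find_spectral_peaks(mag, peak_threshold)
    s = mean_peak_distance(peaks)
    pnr = peak_to_noise_floor_ratio(mag)
    # Illustrative rule only: widely spaced, prominent peaks select "mode_a".
    if s is not None and s >= s_threshold and pnr >= pnr_threshold:
        return "mode_a"
    return "mode_b"
```

With a magnitude spectrum that is flat except for four strong peaks ten bins apart, `select_coding_mode(mag, 5.0, 5.0, 3.0)` returns `"mode_a"` under these assumed thresholds, while a completely flat spectrum falls through to `"mode_b"`.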
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201461990354P | 2014-05-08 | 2014-05-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2018007257A (es) | 2022-08-25 |
Family
ID=53200274
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MX2016014534A MX356883B (es) | 2014-05-08 | 2015-05-07 | Audio signal discriminator and coder. |
MX2018007257A MX2018007257A (es) | 2014-05-08 | 2016-11-04 | Audio signal discriminator and coder. |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MX2016014534A MX356883B (es) | 2014-05-08 | 2015-05-07 | Audio signal discriminator and coder. |
Country Status (11)
Country | Link |
---|---|
US (3) | US9620138B2 (es) |
EP (3) | EP3140831B1 (es) |
CN (3) | CN106463141B (es) |
BR (1) | BR112016025850B1 (es) |
DK (2) | DK3379535T3 (es) |
ES (3) | ES2874757T3 (es) |
HU (1) | HUE046477T2 (es) |
MX (2) | MX356883B (es) |
MY (1) | MY182165A (es) |
PL (2) | PL3140831T3 (es) |
WO (1) | WO2015171061A1 (es) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101940464B1 (ko) | 2013-10-18 | 2019-01-18 | 텔레폰악티에볼라겟엘엠에릭슨(펍) | 스펙트럼의 피크 위치의 코딩 및 디코딩 |
EP3140831B1 (en) * | 2014-05-08 | 2018-07-11 | Telefonaktiebolaget LM Ericsson (publ) | Audio signal discriminator and coder |
US10304472B2 (en) * | 2014-07-28 | 2019-05-28 | Nippon Telegraph And Telephone Corporation | Method, device and recording medium for coding based on a selected coding processing |
CN110211580B (zh) * | 2019-05-15 | 2021-07-16 | 海尔优家智能科技(北京)有限公司 | 多智能设备应答方法、装置、系统及存储介质 |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100361405C (zh) * | 1998-05-27 | 2008-01-09 | 微软公司 | 利用可升级的音频编码器和解码器处理输入信号的方法 |
US6226608B1 (en) * | 1999-01-28 | 2001-05-01 | Dolby Laboratories Licensing Corporation | Data framing for adaptive-block-length coding system |
US6959274B1 (en) * | 1999-09-22 | 2005-10-25 | Mindspeed Technologies, Inc. | Fixed rate speech compression system and method |
US6785645B2 (en) * | 2001-11-29 | 2004-08-31 | Microsoft Corporation | Real-time speech and music classifier |
KR100762596B1 (ko) * | 2006-04-05 | 2007-10-01 | 삼성전자주식회사 | 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법 |
US20070282601A1 (en) * | 2006-06-02 | 2007-12-06 | Texas Instruments Inc. | Packet loss concealment for a conjugate structure algebraic code excited linear prediction decoder |
CN101145345B (zh) * | 2006-09-13 | 2011-02-09 | 华为技术有限公司 | 音频分类方法 |
EP2162880B1 (en) * | 2007-06-22 | 2014-12-24 | VoiceAge Corporation | Method and device for estimating the tonality of a sound signal |
CN101399039B (zh) * | 2007-09-30 | 2011-05-11 | 华为技术有限公司 | 一种确定非噪声音频信号类别的方法及装置 |
KR101599875B1 (ko) * | 2008-04-17 | 2016-03-14 | 삼성전자주식회사 | 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 부호화 방법 및 장치, 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 복호화 방법 및 장치 |
PL2346030T3 (pl) | 2008-07-11 | 2015-03-31 | Fraunhofer Ges Forschung | Koder audio, sposób kodowania sygnału audio oraz program komputerowy |
EP2210944A1 (en) | 2009-01-22 | 2010-07-28 | ATG:biosynthetics GmbH | Methods for generation of RNA and (poly)peptide libraries and their use |
CN102044246B (zh) * | 2009-10-15 | 2012-05-23 | 华为技术有限公司 | 一种音频信号检测方法和装置 |
KR101754970B1 (ko) * | 2010-01-12 | 2017-07-06 | 삼성전자주식회사 | 무선 통신 시스템의 채널 상태 측정 기준신호 처리 장치 및 방법 |
US9652999B2 (en) * | 2010-04-29 | 2017-05-16 | Educational Testing Service | Computer-implemented systems and methods for estimating word accuracy for automatic speech recognition |
CN102985966B (zh) * | 2010-07-16 | 2016-07-06 | 瑞典爱立信有限公司 | 音频编码器和解码器及用于音频信号的编码和解码的方法 |
RU2010152225A (ru) * | 2010-12-20 | 2012-06-27 | ЭлЭсАй Корпорейшн (US) | Обнаружение музыки с использованием анализа спектральных пиков |
CN102982804B (zh) * | 2011-09-02 | 2017-05-03 | 杜比实验室特许公司 | 音频分类方法和系统 |
CN102522082B (zh) * | 2011-12-27 | 2013-07-10 | 重庆大学 | 一种公共场所异常声音的识别与定位方法 |
US9111531B2 (en) * | 2012-01-13 | 2015-08-18 | Qualcomm Incorporated | Multiple coding mode signal classification |
US9305567B2 (en) * | 2012-04-23 | 2016-04-05 | Qualcomm Incorporated | Systems and methods for audio signal processing |
RU2651187C2 (ru) | 2012-06-28 | 2018-04-18 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Основанное на линейном предсказании кодирование аудио с использованием улучшенной оценки распределения вероятностей |
US9401153B2 (en) * | 2012-10-15 | 2016-07-26 | Digimarc Corporation | Multi-mode audio recognition and auxiliary data encoding and decoding |
EP3140831B1 (en) * | 2014-05-08 | 2018-07-11 | Telefonaktiebolaget LM Ericsson (publ) | Audio signal discriminator and coder |
WO2015168925A1 (en) | 2014-05-09 | 2015-11-12 | Qualcomm Incorporated | Restricted aperiodic csi measurement reporting in enhanced interference management and traffic adaptation |
TWI602172B (zh) * | 2014-08-27 | 2017-10-11 | 弗勞恩霍夫爾協會 | 使用參數以加強隱蔽之用於編碼及解碼音訊內容的編碼器、解碼器及方法 |
-
2015
- 2015-05-07 EP EP15724098.7A patent/EP3140831B1/en active Active
- 2015-05-07 ES ES19195287T patent/ES2874757T3/es active Active
- 2015-05-07 MY MYPI2016703844A patent/MY182165A/en unknown
- 2015-05-07 CN CN201580023968.9A patent/CN106463141B/zh active Active
- 2015-05-07 HU HUE18172361A patent/HUE046477T2/hu unknown
- 2015-05-07 MX MX2016014534A patent/MX356883B/es active IP Right Grant
- 2015-05-07 CN CN201910918149.0A patent/CN110619891B/zh active Active
- 2015-05-07 EP EP19195287.8A patent/EP3594948B1/en active Active
- 2015-05-07 BR BR112016025850-9A patent/BR112016025850B1/pt active IP Right Grant
- 2015-05-07 PL PL15724098T patent/PL3140831T3/pl unknown
- 2015-05-07 ES ES18172361T patent/ES2763280T3/es active Active
- 2015-05-07 DK DK18172361.0T patent/DK3379535T3/da active
- 2015-05-07 US US14/649,689 patent/US9620138B2/en active Active
- 2015-05-07 ES ES15724098.7T patent/ES2690577T3/es active Active
- 2015-05-07 PL PL19195287T patent/PL3594948T3/pl unknown
- 2015-05-07 DK DK15724098.7T patent/DK3140831T3/en active
- 2015-05-07 WO PCT/SE2015/050503 patent/WO2015171061A1/en active Application Filing
- 2015-05-07 EP EP18172361.0A patent/EP3379535B1/en active Active
- 2015-05-07 CN CN201910919030.5A patent/CN110619892B/zh active Active
-
2016
- 2016-11-04 MX MX2018007257A patent/MX2018007257A/es unknown
-
2017
- 2017-03-07 US US15/451,551 patent/US10242687B2/en active Active
-
2019
- 2019-02-14 US US16/275,701 patent/US10984812B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
MX2016014534A (es) | 2017-02-20 |
US10984812B2 (en) | 2021-04-20 |
EP3594948A1 (en) | 2020-01-15 |
BR112016025850A2 (es) | 2017-08-15 |
PL3594948T3 (pl) | 2021-08-30 |
US9620138B2 (en) | 2017-04-11 |
CN110619892B (zh) | 2023-04-11 |
PL3140831T3 (pl) | 2018-12-31 |
EP3379535A1 (en) | 2018-09-26 |
WO2015171061A1 (en) | 2015-11-12 |
ES2763280T3 (es) | 2020-05-27 |
BR112016025850B1 (pt) | 2022-08-16 |
US20170178660A1 (en) | 2017-06-22 |
US20190198032A1 (en) | 2019-06-27 |
HUE046477T2 (hu) | 2020-03-30 |
EP3140831A1 (en) | 2017-03-15 |
DK3140831T3 (en) | 2018-10-15 |
ES2690577T3 (es) | 2018-11-21 |
EP3140831B1 (en) | 2018-07-11 |
CN110619891B (zh) | 2023-01-17 |
MX356883B (es) | 2018-06-19 |
US20160086615A1 (en) | 2016-03-24 |
CN106463141A (zh) | 2017-02-22 |
ES2874757T3 (es) | 2021-11-05 |
CN110619892A (zh) | 2019-12-27 |
CN110619891A (zh) | 2019-12-27 |
EP3379535B1 (en) | 2019-09-18 |
US10242687B2 (en) | 2019-03-26 |
DK3379535T3 (da) | 2019-12-16 |
CN106463141B (zh) | 2019-11-01 |
MY182165A (en) | 2021-01-18 |
EP3594948B1 (en) | 2021-03-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2018009140A (es) | Multichannel audio decoder, multichannel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal. | |
MX2018007257A (es) | Audio signal discriminator and coder. | |
MX2018003242A (es) | Method and system for encoding a stereo sound signal using coding parameters of a primary channel to encode a secondary channel. | |
MX2023002585A (es) | Intra-prediction method, and encoder and decoder using same. | |
MX340453B (es) | Biomarkers for lung cancer. | |
MX2015009600A (es) | Noise filling in perceptual transform audio coding. | |
MX2016011218A (es) | Level definitions for multi-layer video codecs. | |
MX364419B (es) | Apparatus and method for encoding or decoding a multichannel signal. | |
GB2538392A (en) | Ranging using current profiling | |
MY179139A (en) | Noise filling in multichannel audio coding | |
EP3131094A4 (en) | Noise signal processing and generation method, encoder/decoder and encoding/decoding system | |
MX2019011956A (es) | Audio signal classification and coding. | |
MY176776A (en) | Coding and decoding of spectral peak positions | |
MX2019012777A (es) | Audio coding method and apparatus. | |
MX347410B (es) | Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm. | |
MX2019006311A (es) | Split gain shape vector coding. | |
EP4235661A3 (en) | Comfort noise generation method and device | |
EP3547311A4 (en) | STEREO ENCODING METHOD AND STEREO ENCODER | |
IN2013MU01493A (es) | ||
HK1223726A1 (zh) | 用於通過採用分佈量化與編碼來分裂音頻信號包絡以進行音頻信號包絡編碼、處理和解碼的設備及方法 | |
MY179202A (en) | Method for producing specific α,β-unsaturated aldehydes | |
TH1501007373A (th) | เครื่องและวิธีการสำหรับการเข้ารหัส การประมวลผล และการถอดรหัสเอนเวโลปของ สัญญาณเสียงโดยการแยกเอนเวโลปของสัญญาณเสียงนั้นซึ่งใช้งานการควอนไทซ์ การแจกแจงและการลงรหัส | |
MY175324A (en) | Ranging using current profiling | |
TH1501004211B (th) | เครื่องและวิธีการสำหรับการดำเนินการเติมสัญญาณรบกวนบนสเปกตรัมของสัญญาณเสียงตัวเข้ารหัสเสียงและตัวถอดรหัสเสียง ที่รองรับการเติมสัญญาณรบกวน |