MX356883B - Codificador y discriminador de señal de audio. - Google Patents
Codificador y discriminador de señal de audio.Info
- Publication number
- MX356883B MX356883B MX2016014534A MX2016014534A MX356883B MX 356883 B MX356883 B MX 356883B MX 2016014534 A MX2016014534 A MX 2016014534A MX 2016014534 A MX2016014534 A MX 2016014534A MX 356883 B MX356883 B MX 356883B
- Authority
- MX
- Mexico
- Prior art keywords
- audio signal
- coding
- coder
- signal discriminator
- segment
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 4
- 230000003595 spectral effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/81—Detection of presence or absence of voice signals for discriminating voice from music
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
La invención se refiere a un códec y un discriminador y métodos en el mismo para la discriminación de la señal de audio y la codificación. Las modalidades de un método realizado por un codificador que comprende, por un segmento de la señal de audio: identificar un conjunto de picos espectrales; determinar una distancia S media entre picos en el conjunto; y determinar una proporción, PNR, entre un pico de la envolvente y una envolvente de umbral mínimo de ruido. El método comprende además la selección de un modo de codificación, fuera de una pluralidad de modos de codificación, basándose al menos en la distancia S media y la proporción PNR; y aplicar el modo de codificación seleccionado para la codificación del segmento de la señal de audio.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201461990354P | 2014-05-08 | 2014-05-08 | |
PCT/SE2015/050503 WO2015171061A1 (en) | 2014-05-08 | 2015-05-07 | Audio signal discriminator and coder |
Publications (2)
Publication Number | Publication Date |
---|---|
MX2016014534A MX2016014534A (es) | 2017-02-20 |
MX356883B true MX356883B (es) | 2018-06-19 |
Family
ID=53200274
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2016014534A MX356883B (es) | 2014-05-08 | 2015-05-07 | Codificador y discriminador de señal de audio. |
MX2018007257A MX2018007257A (es) | 2014-05-08 | 2016-11-04 | Codificador y discriminador de señal de audio. |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2018007257A MX2018007257A (es) | 2014-05-08 | 2016-11-04 | Codificador y discriminador de señal de audio. |
Country Status (11)
Country | Link |
---|---|
US (3) | US9620138B2 (es) |
EP (3) | EP3594948B1 (es) |
CN (3) | CN110619891B (es) |
BR (1) | BR112016025850B1 (es) |
DK (2) | DK3140831T3 (es) |
ES (3) | ES2763280T3 (es) |
HU (1) | HUE046477T2 (es) |
MX (2) | MX356883B (es) |
MY (1) | MY182165A (es) |
PL (2) | PL3594948T3 (es) |
WO (1) | WO2015171061A1 (es) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ES2638201T3 (es) | 2013-10-18 | 2017-10-19 | Telefonaktiebolaget Lm Ericsson (Publ) | Codificación de las posiciones de los picos espectrales |
DK3140831T3 (en) * | 2014-05-08 | 2018-10-15 | Ericsson Telefon Ab L M | Audio signal discriminator and codes |
WO2016017238A1 (ja) * | 2014-07-28 | 2016-02-04 | 日本電信電話株式会社 | 符号化方法、装置、プログラム及び記録媒体 |
CN110211580B (zh) * | 2019-05-15 | 2021-07-16 | 海尔优家智能科技(北京)有限公司 | 多智能设备应答方法、装置、系统及存储介质 |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1080462B1 (en) * | 1998-05-27 | 2005-02-02 | Microsoft Corporation | System and method for entropy encoding quantized transform coefficients of a signal |
US6226608B1 (en) * | 1999-01-28 | 2001-05-01 | Dolby Laboratories Licensing Corporation | Data framing for adaptive-block-length coding system |
US6959274B1 (en) * | 1999-09-22 | 2005-10-25 | Mindspeed Technologies, Inc. | Fixed rate speech compression system and method |
US6785645B2 (en) * | 2001-11-29 | 2004-08-31 | Microsoft Corporation | Real-time speech and music classifier |
KR100762596B1 (ko) * | 2006-04-05 | 2007-10-01 | 삼성전자주식회사 | 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법 |
US20070282601A1 (en) * | 2006-06-02 | 2007-12-06 | Texas Instruments Inc. | Packet loss concealment for a conjugate structure algebraic code excited linear prediction decoder |
CN101145345B (zh) * | 2006-09-13 | 2011-02-09 | 华为技术有限公司 | 音频分类方法 |
ES2533358T3 (es) * | 2007-06-22 | 2015-04-09 | Voiceage Corporation | Procedimiento y dispositivo para estimar la tonalidad de una señal de sonido |
CN101399039B (zh) * | 2007-09-30 | 2011-05-11 | 华为技术有限公司 | 一种确定非噪声音频信号类别的方法及装置 |
KR101599875B1 (ko) * | 2008-04-17 | 2016-03-14 | 삼성전자주식회사 | 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 부호화 방법 및 장치, 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 복호화 방법 및 장치 |
CA2871268C (en) | 2008-07-11 | 2015-11-03 | Nikolaus Rettelbach | Audio encoder, audio decoder, methods for encoding and decoding an audio signal, audio stream and computer program |
EP2210944A1 (en) | 2009-01-22 | 2010-07-28 | ATG:biosynthetics GmbH | Methods for generation of RNA and (poly)peptide libraries and their use |
CN102044246B (zh) * | 2009-10-15 | 2012-05-23 | 华为技术有限公司 | 一种音频信号检测方法和装置 |
KR101754970B1 (ko) * | 2010-01-12 | 2017-07-06 | 삼성전자주식회사 | 무선 통신 시스템의 채널 상태 측정 기준신호 처리 장치 및 방법 |
US9652999B2 (en) * | 2010-04-29 | 2017-05-16 | Educational Testing Service | Computer-implemented systems and methods for estimating word accuracy for automatic speech recognition |
EP2593937B1 (en) * | 2010-07-16 | 2015-11-11 | Telefonaktiebolaget LM Ericsson (publ) | Audio encoder and decoder and methods for encoding and decoding an audio signal |
RU2010152225A (ru) * | 2010-12-20 | 2012-06-27 | ЭлЭсАй Корпорейшн (US) | Обнаружение музыки с использованием анализа спектральных пиков |
CN102982804B (zh) * | 2011-09-02 | 2017-05-03 | 杜比实验室特许公司 | 音频分类方法和系统 |
CN102522082B (zh) * | 2011-12-27 | 2013-07-10 | 重庆大学 | 一种公共场所异常声音的识别与定位方法 |
US9111531B2 (en) * | 2012-01-13 | 2015-08-18 | Qualcomm Incorporated | Multiple coding mode signal classification |
US20130282372A1 (en) * | 2012-04-23 | 2013-10-24 | Qualcomm Incorporated | Systems and methods for audio signal processing |
MY168806A (en) | 2012-06-28 | 2018-12-04 | Fraunhofer Ges Forschung | Linear prediction based audio coding using improved probability distribution estimation |
US9401153B2 (en) * | 2012-10-15 | 2016-07-26 | Digimarc Corporation | Multi-mode audio recognition and auxiliary data encoding and decoding |
DK3140831T3 (en) * | 2014-05-08 | 2018-10-15 | Ericsson Telefon Ab L M | Audio signal discriminator and codes |
WO2015168925A1 (en) | 2014-05-09 | 2015-11-12 | Qualcomm Incorporated | Restricted aperiodic csi measurement reporting in enhanced interference management and traffic adaptation |
TWI602172B (zh) * | 2014-08-27 | 2017-10-11 | 弗勞恩霍夫爾協會 | 使用參數以加強隱蔽之用於編碼及解碼音訊內容的編碼器、解碼器及方法 |
-
2015
- 2015-05-07 DK DK15724098.7T patent/DK3140831T3/en active
- 2015-05-07 DK DK18172361.0T patent/DK3379535T3/da active
- 2015-05-07 ES ES18172361T patent/ES2763280T3/es active Active
- 2015-05-07 BR BR112016025850-9A patent/BR112016025850B1/pt active IP Right Grant
- 2015-05-07 CN CN201910918149.0A patent/CN110619891B/zh active Active
- 2015-05-07 EP EP19195287.8A patent/EP3594948B1/en active Active
- 2015-05-07 PL PL19195287T patent/PL3594948T3/pl unknown
- 2015-05-07 PL PL15724098T patent/PL3140831T3/pl unknown
- 2015-05-07 MY MYPI2016703844A patent/MY182165A/en unknown
- 2015-05-07 EP EP18172361.0A patent/EP3379535B1/en active Active
- 2015-05-07 EP EP15724098.7A patent/EP3140831B1/en active Active
- 2015-05-07 CN CN201910919030.5A patent/CN110619892B/zh active Active
- 2015-05-07 ES ES15724098.7T patent/ES2690577T3/es active Active
- 2015-05-07 ES ES19195287T patent/ES2874757T3/es active Active
- 2015-05-07 CN CN201580023968.9A patent/CN106463141B/zh active Active
- 2015-05-07 MX MX2016014534A patent/MX356883B/es active IP Right Grant
- 2015-05-07 HU HUE18172361A patent/HUE046477T2/hu unknown
- 2015-05-07 US US14/649,689 patent/US9620138B2/en active Active
- 2015-05-07 WO PCT/SE2015/050503 patent/WO2015171061A1/en active Application Filing
-
2016
- 2016-11-04 MX MX2018007257A patent/MX2018007257A/es unknown
-
2017
- 2017-03-07 US US15/451,551 patent/US10242687B2/en active Active
-
2019
- 2019-02-14 US US16/275,701 patent/US10984812B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
ES2690577T3 (es) | 2018-11-21 |
MX2018007257A (es) | 2022-08-25 |
US20170178660A1 (en) | 2017-06-22 |
ES2874757T3 (es) | 2021-11-05 |
EP3594948A1 (en) | 2020-01-15 |
US10242687B2 (en) | 2019-03-26 |
CN110619891A (zh) | 2019-12-27 |
US20190198032A1 (en) | 2019-06-27 |
BR112016025850B1 (pt) | 2022-08-16 |
EP3379535B1 (en) | 2019-09-18 |
US9620138B2 (en) | 2017-04-11 |
PL3140831T3 (pl) | 2018-12-31 |
EP3140831B1 (en) | 2018-07-11 |
CN110619892A (zh) | 2019-12-27 |
MX2016014534A (es) | 2017-02-20 |
US20160086615A1 (en) | 2016-03-24 |
EP3140831A1 (en) | 2017-03-15 |
CN106463141B (zh) | 2019-11-01 |
HUE046477T2 (hu) | 2020-03-30 |
CN110619891B (zh) | 2023-01-17 |
DK3140831T3 (en) | 2018-10-15 |
ES2763280T3 (es) | 2020-05-27 |
PL3594948T3 (pl) | 2021-08-30 |
BR112016025850A2 (es) | 2017-08-15 |
MY182165A (en) | 2021-01-18 |
WO2015171061A1 (en) | 2015-11-12 |
CN106463141A (zh) | 2017-02-22 |
EP3379535A1 (en) | 2018-09-26 |
EP3594948B1 (en) | 2021-03-03 |
DK3379535T3 (da) | 2019-12-16 |
US10984812B2 (en) | 2021-04-20 |
CN110619892B (zh) | 2023-04-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2018007257A (es) | Codificador y discriminador de señal de audio. | |
MX2018003242A (es) | Metodo y sistema para codificar una señal de sonido estereo utilizando los parametros de codificacion de un canal primario para codificar un canal secundario. | |
MX362424B (es) | Codificador y decodificador de audio usando un procesador de dominio de frecuencia con un relleno de intervalo de banda completa y un procesador de dominio de tiempo. | |
MY198121A (en) | Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal | |
MX337507B (es) | Metodo de intra - prediccion, y codificador y decodificador que lo utilizan. | |
MX340453B (es) | Biomarcadores para cancer de pulmon. | |
MX2015009600A (es) | Llenado con ruido en la codificacion de audio por transformada perceptual. | |
MX2016011218A (es) | Definiciones de nivel para codecs de video de multiples capas. | |
MX2017011495A (es) | Aparato y método para codificar o decodificar una señal multicanal. | |
GB2538392A (en) | Ranging using current profiling | |
MY179139A (en) | Noise filling in multichannel audio coding | |
MX2019011956A (es) | Clasificacion y codificacion de señal de audio. | |
MX2019012777A (es) | Metodo y aparato de codificacion de audio. | |
AR110378A1 (es) | Métodos para determinar el estado del cáncer colorrectal en una persona | |
MX347410B (es) | Aparato y metodo para seleccionar uno de un primer algoritmo de codificacion y un segundo algoritmo de codificacion. | |
MX2019006311A (es) | Codificacion de vector de ganancia y forma dividida. | |
MY176776A (en) | Coding and decoding of spectral peak positions | |
MY178529A (en) | Method for estimating noise in an audio signal, noise estimator, audio encoder, audio decoder, and system for transmitting audio signals | |
EP3547311A4 (en) | STEREOPHONES CODING PROCESS AND STEREOPHONE CODIER | |
HK1223726A1 (zh) | 用於通過採用分佈量化與編碼來分裂音頻信號包絡以進行音頻信號包絡編碼、處理和解碼的設備及方法 | |
MY179202A (en) | Method for producing specific ?,b-unsaturated aldehydes | |
TH1501007373A (th) | เครื่องและวิธีการสำหรับการเข้ารหัส การประมวลผล และการถอดรหัสเอนเวโลปของ สัญญาณเสียงโดยการแยกเอนเวโลปของสัญญาณเสียงนั้นซึ่งใช้งานการควอนไทซ์ การแจกแจงและการลงรหัส | |
TH1501007374B (th) | เครื่องเเละวิธีการสำหรับการเข้ารหัส การประมวลผล และการถอดรหัสเอนเวโลปของ สัญญาณเสียงโดยการจำลองเเบบการแสดงถึงผลบวกสะสมซึ่งใช้งานการควอนไทซ์ การแจกแจง และการลงรหัส | |
MY175324A (en) | Ranging using current profiling | |
MX2016014335A (es) | Calsificacion y codificacion de señal de audio. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant or registration |