AR030386A1 - Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio - Google Patents
Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audioInfo
- Publication number
- AR030386A1 AR030386A1 ARP990105966A ARP990105966A AR030386A1 AR 030386 A1 AR030386 A1 AR 030386A1 AR P990105966 A ARP990105966 A AR P990105966A AR P990105966 A ARP990105966 A AR P990105966A AR 030386 A1 AR030386 A1 AR 030386A1
- Authority
- AR
- Argentina
- Prior art keywords
- audio signal
- determination
- information
- speech
- coding
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 11
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
Un método para conservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio comprende efectuar una primera determinacion en cuanto a si la senal de audio comprende informacion de habla o de ruido, efectuar una segunda determinacion en cuanto a si la senal de audio incluye informacion que no es de habla y si es perceptivamente relevante para el oyente, y anular selectivamente dicha primera determinacion, en respuesta a dicha segunda determinacion. Un aparato que comprende un clasificador para recibir la senal de audio es utilizado para efectuar una primera determinacion en cuanto a si se considera que la senal de audio comprende informacion de habla o ruido, comprende además un detector para recibir la senal de audio y efectuar la segunda determinacion en cuanto a si la senal de audio incluye informacion que no es de habla y sí perceptivamente relevante para el oyente, y logica acoplada al clasificador y al detector, tiene una salida para indicar si la senal de audio incluye informacion perceptivamente relevante, siendo la logica operable para proporcionar selectivamente en la salida la informacion indicativa de la primera determinacion y que también responde a la segunda determinacion para anular selectivamente en la salida la informacion indicativa de la primera determinacion.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10955698P | 1998-11-23 | 1998-11-23 | |
US09/434,787 US6424938B1 (en) | 1998-11-23 | 1999-11-05 | Complex signal activity detection for improved speech/noise classification of an audio signal |
Publications (1)
Publication Number | Publication Date |
---|---|
AR030386A1 true AR030386A1 (es) | 2003-08-20 |
Family
ID=26807081
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ARP990105966A AR030386A1 (es) | 1998-11-23 | 1999-11-23 | Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio |
Country Status (15)
Country | Link |
---|---|
US (1) | US6424938B1 (es) |
EP (1) | EP1224659B1 (es) |
JP (1) | JP4025018B2 (es) |
KR (1) | KR100667008B1 (es) |
CN (2) | CN1828722B (es) |
AR (1) | AR030386A1 (es) |
AU (1) | AU763409B2 (es) |
BR (1) | BR9915576B1 (es) |
CA (1) | CA2348913C (es) |
DE (1) | DE69925168T2 (es) |
HK (1) | HK1097080A1 (es) |
MY (1) | MY124630A (es) |
RU (1) | RU2251750C2 (es) |
WO (1) | WO2000031720A2 (es) |
ZA (1) | ZA200103150B (es) |
Families Citing this family (46)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7072832B1 (en) | 1998-08-24 | 2006-07-04 | Mindspeed Technologies, Inc. | System for speech encoding having an adaptive encoding arrangement |
US6424938B1 (en) * | 1998-11-23 | 2002-07-23 | Telefonaktiebolaget L M Ericsson | Complex signal activity detection for improved speech/noise classification of an audio signal |
US6633841B1 (en) * | 1999-07-29 | 2003-10-14 | Mindspeed Technologies, Inc. | Voice activity detection speech coding to accommodate music signals |
US6694012B1 (en) * | 1999-08-30 | 2004-02-17 | Lucent Technologies Inc. | System and method to provide control of music on hold to the hold party |
US20030205124A1 (en) * | 2002-05-01 | 2003-11-06 | Foote Jonathan T. | Method and system for retrieving and sequencing music by rhythmic similarity |
US20040064314A1 (en) * | 2002-09-27 | 2004-04-01 | Aubert Nicolas De Saint | Methods and apparatus for speech end-point detection |
EP1569200A1 (en) * | 2004-02-26 | 2005-08-31 | Sony International (Europe) GmbH | Identification of the presence of speech in digital audio data |
US7983906B2 (en) * | 2005-03-24 | 2011-07-19 | Mindspeed Technologies, Inc. | Adaptive voice mode extension for a voice activity detector |
US8874437B2 (en) * | 2005-03-28 | 2014-10-28 | Tellabs Operations, Inc. | Method and apparatus for modifying an encoded signal for voice quality enhancement |
WO2006136179A1 (en) * | 2005-06-20 | 2006-12-28 | Telecom Italia S.P.A. | Method and apparatus for transmitting speech data to a remote device in a distributed speech recognition system |
KR100785471B1 (ko) * | 2006-01-06 | 2007-12-13 | 와이더댄 주식회사 | 통신망을 통해 가입자 단말기로 전송되는 오디오 신호의출력 품질 개선을 위한 오디오 신호의 처리 방법 및 상기방법을 채용한 오디오 신호 처리 장치 |
US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
US9966085B2 (en) * | 2006-12-30 | 2018-05-08 | Google Technology Holdings LLC | Method and noise suppression circuit incorporating a plurality of noise suppression techniques |
ES2533358T3 (es) | 2007-06-22 | 2015-04-09 | Voiceage Corporation | Procedimiento y dispositivo para estimar la tonalidad de una señal de sonido |
CN101889432B (zh) * | 2007-12-07 | 2013-12-11 | 艾格瑞系统有限公司 | 处于保持时的音乐的终端用户控制 |
US20090154718A1 (en) * | 2007-12-14 | 2009-06-18 | Page Steven R | Method and apparatus for suppressor backfill |
DE102008009719A1 (de) * | 2008-02-19 | 2009-08-20 | Siemens Enterprise Communications Gmbh & Co. Kg | Verfahren und Mittel zur Enkodierung von Hintergrundrauschinformationen |
AU2009220321B2 (en) * | 2008-03-03 | 2011-09-22 | Intellectual Discovery Co., Ltd. | Method and apparatus for processing audio signal |
EP2259254B1 (en) * | 2008-03-04 | 2014-04-30 | LG Electronics Inc. | Method and apparatus for processing an audio signal |
MY154452A (en) | 2008-07-11 | 2015-06-15 | Fraunhofer Ges Forschung | An apparatus and a method for decoding an encoded audio signal |
CN102150201B (zh) | 2008-07-11 | 2013-04-17 | 弗劳恩霍夫应用研究促进协会 | 提供时间扭曲激活信号以及使用该时间扭曲激活信号对音频信号编码 |
KR101251045B1 (ko) * | 2009-07-28 | 2013-04-04 | 한국전자통신연구원 | 오디오 판별 장치 및 그 방법 |
JP5754899B2 (ja) * | 2009-10-07 | 2015-07-29 | ソニー株式会社 | 復号装置および方法、並びにプログラム |
CN102044243B (zh) * | 2009-10-15 | 2012-08-29 | 华为技术有限公司 | 语音激活检测方法与装置、编码器 |
JP5793500B2 (ja) * | 2009-10-19 | 2015-10-14 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | 音声区間検出器及び方法 |
JP5712220B2 (ja) | 2009-10-19 | 2015-05-07 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | 音声活動検出のための方法および背景推定器 |
US20110178800A1 (en) * | 2010-01-19 | 2011-07-21 | Lloyd Watts | Distortion Measurement for Noise Suppression System |
JP5609737B2 (ja) * | 2010-04-13 | 2014-10-22 | ソニー株式会社 | 信号処理装置および方法、符号化装置および方法、復号装置および方法、並びにプログラム |
CN102237085B (zh) * | 2010-04-26 | 2013-08-14 | 华为技术有限公司 | 音频信号的分类方法及装置 |
US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
EP3726530B1 (en) | 2010-12-24 | 2024-05-22 | Huawei Technologies Co., Ltd. | Method and apparatus for adaptively detecting a voice activity in an input audio signal |
EP2477188A1 (en) | 2011-01-18 | 2012-07-18 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Encoding and decoding of slot positions of events in an audio signal frame |
WO2012127278A1 (en) * | 2011-03-18 | 2012-09-27 | Nokia Corporation | Apparatus for audio signal processing |
CN103187065B (zh) * | 2011-12-30 | 2015-12-16 | 华为技术有限公司 | 音频数据的处理方法、装置和系统 |
US9208798B2 (en) | 2012-04-09 | 2015-12-08 | Board Of Regents, The University Of Texas System | Dynamic control of voice codec data rate |
JP6127143B2 (ja) * | 2012-08-31 | 2017-05-10 | テレフオンアクチーボラゲット エルエム エリクソン(パブル) | 音声アクティビティ検出のための方法及び装置 |
US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
CN111145767B (zh) | 2012-12-21 | 2023-07-25 | 弗劳恩霍夫应用研究促进协会 | 解码器及用于产生和处理编码频比特流的系统 |
BR112015014212B1 (pt) | 2012-12-21 | 2021-10-19 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Geração de um ruído de conforto com alta resolução espectro-temporal em transmissão descontínua de sinais de audio |
KR101788484B1 (ko) | 2013-06-21 | 2017-10-19 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | Tcx ltp를 이용하여 붕괴되거나 붕괴되지 않은 수신된 프레임들의 재구성을 갖는 오디오 디코딩 |
US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
BR112016014104B1 (pt) | 2013-12-19 | 2020-12-29 | Telefonaktiebolaget Lm Ericsson (Publ) | método de estimativa de ruído de fundo, estimador de ruído de fundo, detector de atividade de som, codec, dispositivo sem fio, nó de rede, meio de armazenamento legível por computador |
CN106797512B (zh) | 2014-08-28 | 2019-10-25 | 美商楼氏电子有限公司 | 多源噪声抑制的方法、系统和非瞬时计算机可读存储介质 |
KR102299330B1 (ko) * | 2014-11-26 | 2021-09-08 | 삼성전자주식회사 | 음성 인식 방법 및 그 전자 장치 |
US10978096B2 (en) * | 2017-04-25 | 2021-04-13 | Qualcomm Incorporated | Optimized uplink operation for voice over long-term evolution (VoLte) and voice over new radio (VoNR) listen or silent periods |
CN113345446B (zh) * | 2021-06-01 | 2024-02-27 | 广州虎牙科技有限公司 | 音频处理方法、装置、电子设备和计算机可读存储介质 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58143394A (ja) * | 1982-02-19 | 1983-08-25 | 株式会社日立製作所 | 音声区間の検出・分類方式 |
US5276765A (en) * | 1988-03-11 | 1994-01-04 | British Telecommunications Public Limited Company | Voice activity detection |
BR9206143A (pt) * | 1991-06-11 | 1995-01-03 | Qualcomm Inc | Processos de compressão de final vocal e para codificação de taxa variável de quadros de entrada, aparelho para comprimir im sinal acústico em dados de taxa variável, codificador de prognóstico exitado por córdigo de taxa variável (CELP) e descodificador para descodificar quadros codificados |
US5659622A (en) * | 1995-11-13 | 1997-08-19 | Motorola, Inc. | Method and apparatus for suppressing noise in a communication system |
US5930749A (en) * | 1996-02-02 | 1999-07-27 | International Business Machines Corporation | Monitoring, identification, and selection of audio signal poles with characteristic behaviors, for separation and synthesis of signal contributions |
US6570991B1 (en) * | 1996-12-18 | 2003-05-27 | Interval Research Corporation | Multi-feature speech/music discrimination system |
US6097772A (en) * | 1997-11-24 | 2000-08-01 | Ericsson Inc. | System and method for detecting speech transmissions in the presence of control signaling |
US6173257B1 (en) * | 1998-08-24 | 2001-01-09 | Conexant Systems, Inc | Completed fixed codebook for speech encoder |
US6240386B1 (en) * | 1998-08-24 | 2001-05-29 | Conexant Systems, Inc. | Speech codec employing noise classification for noise compensation |
US6104992A (en) * | 1998-08-24 | 2000-08-15 | Conexant Systems, Inc. | Adaptive gain reduction to produce fixed codebook target signal |
US6188980B1 (en) * | 1998-08-24 | 2001-02-13 | Conexant Systems, Inc. | Synchronized encoder-decoder frame concealment using speech coding parameters including line spectral frequencies and filter coefficients |
US6260010B1 (en) * | 1998-08-24 | 2001-07-10 | Conexant Systems, Inc. | Speech encoder using gain normalization that combines open and closed loop gains |
US6424938B1 (en) * | 1998-11-23 | 2002-07-23 | Telefonaktiebolaget L M Ericsson | Complex signal activity detection for improved speech/noise classification of an audio signal |
-
1999
- 1999-11-05 US US09/434,787 patent/US6424938B1/en not_active Expired - Lifetime
- 1999-11-12 DE DE69925168T patent/DE69925168T2/de not_active Expired - Lifetime
- 1999-11-12 CN CN2006100733243A patent/CN1828722B/zh not_active Expired - Lifetime
- 1999-11-12 CN CNB998136255A patent/CN1257486C/zh not_active Expired - Lifetime
- 1999-11-12 KR KR1020017006424A patent/KR100667008B1/ko active IP Right Grant
- 1999-11-12 EP EP99958602A patent/EP1224659B1/en not_active Expired - Lifetime
- 1999-11-12 BR BRPI9915576-1A patent/BR9915576B1/pt active IP Right Grant
- 1999-11-12 WO PCT/SE1999/002073 patent/WO2000031720A2/en active IP Right Grant
- 1999-11-12 JP JP2000584462A patent/JP4025018B2/ja not_active Expired - Lifetime
- 1999-11-12 CA CA002348913A patent/CA2348913C/en not_active Expired - Lifetime
- 1999-11-12 RU RU2001117231/09A patent/RU2251750C2/ru active
- 1999-11-12 AU AU15938/00A patent/AU763409B2/en not_active Expired
- 1999-11-20 MY MYPI99005074A patent/MY124630A/en unknown
- 1999-11-23 AR ARP990105966A patent/AR030386A1/es active IP Right Grant
-
2001
- 2001-04-18 ZA ZA2001/03150A patent/ZA200103150B/en unknown
-
2007
- 2007-02-12 HK HK07101656.6A patent/HK1097080A1/xx not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
EP1224659B1 (en) | 2005-05-04 |
WO2000031720A2 (en) | 2000-06-02 |
BR9915576B1 (pt) | 2013-04-16 |
EP1224659A2 (en) | 2002-07-24 |
DE69925168D1 (de) | 2005-06-09 |
US6424938B1 (en) | 2002-07-23 |
HK1097080A1 (en) | 2007-06-15 |
CA2348913C (en) | 2009-09-15 |
KR20010078401A (ko) | 2001-08-20 |
CN1828722A (zh) | 2006-09-06 |
CN1419687A (zh) | 2003-05-21 |
CA2348913A1 (en) | 2000-06-02 |
KR100667008B1 (ko) | 2007-01-10 |
RU2251750C2 (ru) | 2005-05-10 |
MY124630A (en) | 2006-06-30 |
DE69925168T2 (de) | 2006-02-16 |
ZA200103150B (en) | 2002-06-26 |
JP4025018B2 (ja) | 2007-12-19 |
JP2002540441A (ja) | 2002-11-26 |
WO2000031720A3 (en) | 2002-03-21 |
AU1593800A (en) | 2000-06-13 |
CN1257486C (zh) | 2006-05-24 |
CN1828722B (zh) | 2010-05-26 |
AU763409B2 (en) | 2003-07-24 |
BR9915576A (pt) | 2001-08-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AR030386A1 (es) | Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio | |
WO2000017859A8 (en) | Noise suppression for low bitrate speech coder | |
EP0932141A3 (en) | Method for signal controlled switching between different audio coding schemes | |
AU7035298A (en) | Method for signalling a noise substitution during audio signal coding | |
MY133623A (en) | Controlling loudness of speech in signals that contain speech and other types of audio material | |
AU2001282454A1 (en) | Voice enhancement system | |
CA2177422A1 (en) | Voice/Unvoiced Classification of Speech for Use in Speech Decoding During Frame Erasures | |
AU2001284588A1 (en) | Multi-channel signal encoding and decoding | |
EP0746116A3 (en) | MPEG audio decoder | |
AU7520798A (en) | Method for coding an audio signal | |
WO2002029780A3 (en) | Speech detection with source separation | |
EP1647972A3 (de) | Verbesserung der Verständlichkeit von Sprache enthaltenden Audiosignalen | |
CA2306098A1 (en) | Multimode speech coding apparatus and decoding apparatus | |
AR024353A1 (es) | Audifono y equipo auxiliar interactivo con relacion de voz a audio remanente | |
DE60129771D1 (de) | Laguerre funktion für audiokodierung | |
ATE225553T1 (de) | Vorrichtung zur signal-rauschverhältnismessung in einem sprachsignal | |
WO2003048711A3 (fr) | System de detection de parole dans un signal audio en environnement bruite | |
EP1073039A3 (en) | Speech decoder with gain processing | |
IT1302026B1 (it) | Materiali immunologici e metodi per la rivelazione di diidropirimidinadeidrogenasi. | |
MX9602144A (es) | Atenuacion de ganancia del codigo cifrado durante borrado de cuadros. | |
EP1739654A3 (de) | Verfahren zum Betrieb einer Mehrfachmikrofonanordnung in einem Kraftfahrzeug sowie Mehrfachmikrofonanordnung selbst | |
EP0813183A3 (en) | Speech reproducing system | |
EP1204092A3 (en) | Speech decoder capable of decoding background noise signal with high quality | |
AU6446100A (en) | Method for selecting modulation detector in receiver, and receiver | |
DE68908845T2 (de) | Araliphatische Aldehyde. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG | Grant, registration |