AR030386A1 - Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio - Google Patents

Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio

Info

Publication number
AR030386A1
AR030386A1 ARP990105966A ARP990105966A AR030386A1 AR 030386 A1 AR030386 A1 AR 030386A1 AR P990105966 A ARP990105966 A AR P990105966A AR P990105966 A ARP990105966 A AR P990105966A AR 030386 A1 AR030386 A1 AR 030386A1
Authority
AR
Argentina
Prior art keywords
audio signal
determination
information
speech
coding
Prior art date
Application number
ARP990105966A
Other languages
English (en)
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=26807081&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=AR030386(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Ericsson Telefon Ab L M filed Critical Ericsson Telefon Ab L M
Publication of AR030386A1 publication Critical patent/AR030386A1/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

Un método para conservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio comprende efectuar una primera determinacion en cuanto a si la senal de audio comprende informacion de habla o de ruido, efectuar una segunda determinacion en cuanto a si la senal de audio incluye informacion que no es de habla y si es perceptivamente relevante para el oyente, y anular selectivamente dicha primera determinacion, en respuesta a dicha segunda determinacion. Un aparato que comprende un clasificador para recibir la senal de audio es utilizado para efectuar una primera determinacion en cuanto a si se considera que la senal de audio comprende informacion de habla o ruido, comprende además un detector para recibir la senal de audio y efectuar la segunda determinacion en cuanto a si la senal de audio incluye informacion que no es de habla y sí perceptivamente relevante para el oyente, y logica acoplada al clasificador y al detector, tiene una salida para indicar si la senal de audio incluye informacion perceptivamente relevante, siendo la logica operable para proporcionar selectivamente en la salida la informacion indicativa de la primera determinacion y que también responde a la segunda determinacion para anular selectivamente en la salida la informacion indicativa de la primera determinacion.
ARP990105966A 1998-11-23 1999-11-23 Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio AR030386A1 (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10955698P 1998-11-23 1998-11-23
US09/434,787 US6424938B1 (en) 1998-11-23 1999-11-05 Complex signal activity detection for improved speech/noise classification of an audio signal

Publications (1)

Publication Number Publication Date
AR030386A1 true AR030386A1 (es) 2003-08-20

Family

ID=26807081

Family Applications (1)

Application Number Title Priority Date Filing Date
ARP990105966A AR030386A1 (es) 1998-11-23 1999-11-23 Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio

Country Status (15)

Country Link
US (1) US6424938B1 (es)
EP (1) EP1224659B1 (es)
JP (1) JP4025018B2 (es)
KR (1) KR100667008B1 (es)
CN (2) CN1828722B (es)
AR (1) AR030386A1 (es)
AU (1) AU763409B2 (es)
BR (1) BR9915576B1 (es)
CA (1) CA2348913C (es)
DE (1) DE69925168T2 (es)
HK (1) HK1097080A1 (es)
MY (1) MY124630A (es)
RU (1) RU2251750C2 (es)
WO (1) WO2000031720A2 (es)
ZA (1) ZA200103150B (es)

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7072832B1 (en) 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
US6424938B1 (en) * 1998-11-23 2002-07-23 Telefonaktiebolaget L M Ericsson Complex signal activity detection for improved speech/noise classification of an audio signal
US6633841B1 (en) * 1999-07-29 2003-10-14 Mindspeed Technologies, Inc. Voice activity detection speech coding to accommodate music signals
US6694012B1 (en) * 1999-08-30 2004-02-17 Lucent Technologies Inc. System and method to provide control of music on hold to the hold party
US20030205124A1 (en) * 2002-05-01 2003-11-06 Foote Jonathan T. Method and system for retrieving and sequencing music by rhythmic similarity
US20040064314A1 (en) * 2002-09-27 2004-04-01 Aubert Nicolas De Saint Methods and apparatus for speech end-point detection
EP1569200A1 (en) * 2004-02-26 2005-08-31 Sony International (Europe) GmbH Identification of the presence of speech in digital audio data
US7983906B2 (en) * 2005-03-24 2011-07-19 Mindspeed Technologies, Inc. Adaptive voice mode extension for a voice activity detector
US8874437B2 (en) * 2005-03-28 2014-10-28 Tellabs Operations, Inc. Method and apparatus for modifying an encoded signal for voice quality enhancement
WO2006136179A1 (en) * 2005-06-20 2006-12-28 Telecom Italia S.P.A. Method and apparatus for transmitting speech data to a remote device in a distributed speech recognition system
KR100785471B1 (ko) * 2006-01-06 2007-12-13 와이더댄 주식회사 통신망을 통해 가입자 단말기로 전송되는 오디오 신호의출력 품질 개선을 위한 오디오 신호의 처리 방법 및 상기방법을 채용한 오디오 신호 처리 장치
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
US9966085B2 (en) * 2006-12-30 2018-05-08 Google Technology Holdings LLC Method and noise suppression circuit incorporating a plurality of noise suppression techniques
ES2533358T3 (es) 2007-06-22 2015-04-09 Voiceage Corporation Procedimiento y dispositivo para estimar la tonalidad de una señal de sonido
CN101889432B (zh) * 2007-12-07 2013-12-11 艾格瑞系统有限公司 处于保持时的音乐的终端用户控制
US20090154718A1 (en) * 2007-12-14 2009-06-18 Page Steven R Method and apparatus for suppressor backfill
DE102008009719A1 (de) * 2008-02-19 2009-08-20 Siemens Enterprise Communications Gmbh & Co. Kg Verfahren und Mittel zur Enkodierung von Hintergrundrauschinformationen
AU2009220321B2 (en) * 2008-03-03 2011-09-22 Intellectual Discovery Co., Ltd. Method and apparatus for processing audio signal
EP2259254B1 (en) * 2008-03-04 2014-04-30 LG Electronics Inc. Method and apparatus for processing an audio signal
MY154452A (en) 2008-07-11 2015-06-15 Fraunhofer Ges Forschung An apparatus and a method for decoding an encoded audio signal
CN102150201B (zh) 2008-07-11 2013-04-17 弗劳恩霍夫应用研究促进协会 提供时间扭曲激活信号以及使用该时间扭曲激活信号对音频信号编码
KR101251045B1 (ko) * 2009-07-28 2013-04-04 한국전자통신연구원 오디오 판별 장치 및 그 방법
JP5754899B2 (ja) * 2009-10-07 2015-07-29 ソニー株式会社 復号装置および方法、並びにプログラム
CN102044243B (zh) * 2009-10-15 2012-08-29 华为技术有限公司 语音激活检测方法与装置、编码器
JP5793500B2 (ja) * 2009-10-19 2015-10-14 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 音声区間検出器及び方法
JP5712220B2 (ja) 2009-10-19 2015-05-07 テレフオンアクチーボラゲット エル エム エリクソン(パブル) 音声活動検出のための方法および背景推定器
US20110178800A1 (en) * 2010-01-19 2011-07-21 Lloyd Watts Distortion Measurement for Noise Suppression System
JP5609737B2 (ja) * 2010-04-13 2014-10-22 ソニー株式会社 信号処理装置および方法、符号化装置および方法、復号装置および方法、並びにプログラム
CN102237085B (zh) * 2010-04-26 2013-08-14 华为技术有限公司 音频信号的分类方法及装置
US9558755B1 (en) 2010-05-20 2017-01-31 Knowles Electronics, Llc Noise suppression assisted automatic speech recognition
EP3726530B1 (en) 2010-12-24 2024-05-22 Huawei Technologies Co., Ltd. Method and apparatus for adaptively detecting a voice activity in an input audio signal
EP2477188A1 (en) 2011-01-18 2012-07-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoding and decoding of slot positions of events in an audio signal frame
WO2012127278A1 (en) * 2011-03-18 2012-09-27 Nokia Corporation Apparatus for audio signal processing
CN103187065B (zh) * 2011-12-30 2015-12-16 华为技术有限公司 音频数据的处理方法、装置和系统
US9208798B2 (en) 2012-04-09 2015-12-08 Board Of Regents, The University Of Texas System Dynamic control of voice codec data rate
JP6127143B2 (ja) * 2012-08-31 2017-05-10 テレフオンアクチーボラゲット エルエム エリクソン(パブル) 音声アクティビティ検出のための方法及び装置
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
CN111145767B (zh) 2012-12-21 2023-07-25 弗劳恩霍夫应用研究促进协会 解码器及用于产生和处理编码频比特流的系统
BR112015014212B1 (pt) 2012-12-21 2021-10-19 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Geração de um ruído de conforto com alta resolução espectro-temporal em transmissão descontínua de sinais de audio
KR101788484B1 (ko) 2013-06-21 2017-10-19 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. Tcx ltp를 이용하여 붕괴되거나 붕괴되지 않은 수신된 프레임들의 재구성을 갖는 오디오 디코딩
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
BR112016014104B1 (pt) 2013-12-19 2020-12-29 Telefonaktiebolaget Lm Ericsson (Publ) método de estimativa de ruído de fundo, estimador de ruído de fundo, detector de atividade de som, codec, dispositivo sem fio, nó de rede, meio de armazenamento legível por computador
CN106797512B (zh) 2014-08-28 2019-10-25 美商楼氏电子有限公司 多源噪声抑制的方法、系统和非瞬时计算机可读存储介质
KR102299330B1 (ko) * 2014-11-26 2021-09-08 삼성전자주식회사 음성 인식 방법 및 그 전자 장치
US10978096B2 (en) * 2017-04-25 2021-04-13 Qualcomm Incorporated Optimized uplink operation for voice over long-term evolution (VoLte) and voice over new radio (VoNR) listen or silent periods
CN113345446B (zh) * 2021-06-01 2024-02-27 广州虎牙科技有限公司 音频处理方法、装置、电子设备和计算机可读存储介质

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58143394A (ja) * 1982-02-19 1983-08-25 株式会社日立製作所 音声区間の検出・分類方式
US5276765A (en) * 1988-03-11 1994-01-04 British Telecommunications Public Limited Company Voice activity detection
BR9206143A (pt) * 1991-06-11 1995-01-03 Qualcomm Inc Processos de compressão de final vocal e para codificação de taxa variável de quadros de entrada, aparelho para comprimir im sinal acústico em dados de taxa variável, codificador de prognóstico exitado por córdigo de taxa variável (CELP) e descodificador para descodificar quadros codificados
US5659622A (en) * 1995-11-13 1997-08-19 Motorola, Inc. Method and apparatus for suppressing noise in a communication system
US5930749A (en) * 1996-02-02 1999-07-27 International Business Machines Corporation Monitoring, identification, and selection of audio signal poles with characteristic behaviors, for separation and synthesis of signal contributions
US6570991B1 (en) * 1996-12-18 2003-05-27 Interval Research Corporation Multi-feature speech/music discrimination system
US6097772A (en) * 1997-11-24 2000-08-01 Ericsson Inc. System and method for detecting speech transmissions in the presence of control signaling
US6173257B1 (en) * 1998-08-24 2001-01-09 Conexant Systems, Inc Completed fixed codebook for speech encoder
US6240386B1 (en) * 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
US6104992A (en) * 1998-08-24 2000-08-15 Conexant Systems, Inc. Adaptive gain reduction to produce fixed codebook target signal
US6188980B1 (en) * 1998-08-24 2001-02-13 Conexant Systems, Inc. Synchronized encoder-decoder frame concealment using speech coding parameters including line spectral frequencies and filter coefficients
US6260010B1 (en) * 1998-08-24 2001-07-10 Conexant Systems, Inc. Speech encoder using gain normalization that combines open and closed loop gains
US6424938B1 (en) * 1998-11-23 2002-07-23 Telefonaktiebolaget L M Ericsson Complex signal activity detection for improved speech/noise classification of an audio signal

Also Published As

Publication number Publication date
EP1224659B1 (en) 2005-05-04
WO2000031720A2 (en) 2000-06-02
BR9915576B1 (pt) 2013-04-16
EP1224659A2 (en) 2002-07-24
DE69925168D1 (de) 2005-06-09
US6424938B1 (en) 2002-07-23
HK1097080A1 (en) 2007-06-15
CA2348913C (en) 2009-09-15
KR20010078401A (ko) 2001-08-20
CN1828722A (zh) 2006-09-06
CN1419687A (zh) 2003-05-21
CA2348913A1 (en) 2000-06-02
KR100667008B1 (ko) 2007-01-10
RU2251750C2 (ru) 2005-05-10
MY124630A (en) 2006-06-30
DE69925168T2 (de) 2006-02-16
ZA200103150B (en) 2002-06-26
JP4025018B2 (ja) 2007-12-19
JP2002540441A (ja) 2002-11-26
WO2000031720A3 (en) 2002-03-21
AU1593800A (en) 2000-06-13
CN1257486C (zh) 2006-05-24
CN1828722B (zh) 2010-05-26
AU763409B2 (en) 2003-07-24
BR9915576A (pt) 2001-08-14

Similar Documents

Publication Publication Date Title
AR030386A1 (es) Metodo y aparato para preservar informacion perceptivamente relevante que no es de habla en una senal de audio durante la codificacion de la senal de audio
WO2000017859A8 (en) Noise suppression for low bitrate speech coder
EP0932141A3 (en) Method for signal controlled switching between different audio coding schemes
AU7035298A (en) Method for signalling a noise substitution during audio signal coding
MY133623A (en) Controlling loudness of speech in signals that contain speech and other types of audio material
AU2001282454A1 (en) Voice enhancement system
CA2177422A1 (en) Voice/Unvoiced Classification of Speech for Use in Speech Decoding During Frame Erasures
AU2001284588A1 (en) Multi-channel signal encoding and decoding
EP0746116A3 (en) MPEG audio decoder
AU7520798A (en) Method for coding an audio signal
WO2002029780A3 (en) Speech detection with source separation
EP1647972A3 (de) Verbesserung der Verständlichkeit von Sprache enthaltenden Audiosignalen
CA2306098A1 (en) Multimode speech coding apparatus and decoding apparatus
AR024353A1 (es) Audifono y equipo auxiliar interactivo con relacion de voz a audio remanente
DE60129771D1 (de) Laguerre funktion für audiokodierung
ATE225553T1 (de) Vorrichtung zur signal-rauschverhältnismessung in einem sprachsignal
WO2003048711A3 (fr) System de detection de parole dans un signal audio en environnement bruite
EP1073039A3 (en) Speech decoder with gain processing
IT1302026B1 (it) Materiali immunologici e metodi per la rivelazione di diidropirimidinadeidrogenasi.
MX9602144A (es) Atenuacion de ganancia del codigo cifrado durante borrado de cuadros.
EP1739654A3 (de) Verfahren zum Betrieb einer Mehrfachmikrofonanordnung in einem Kraftfahrzeug sowie Mehrfachmikrofonanordnung selbst
EP0813183A3 (en) Speech reproducing system
EP1204092A3 (en) Speech decoder capable of decoding background noise signal with high quality
AU6446100A (en) Method for selecting modulation detector in receiver, and receiver
DE68908845T2 (de) Araliphatische Aldehyde.

Legal Events

Date Code Title Description
FG Grant, registration