WO2003048711A3 - System for detecting speech in an audio signal in a noisy environment - Google Patents

System for detecting speech in an audio signal in a noisy environment

Info

Publication number
WO2003048711A3
Authority
WO
WIPO (PCT)
Prior art keywords
audio signal
speech detection
detection system
noisy surrounding
information
Prior art date
Application number
PCT/FR2002/003910
Other languages
English (en)
Other versions
WO2003048711A2 (fr)
Inventor
Arnaud Martin
Laurent Mauuary
Original Assignee
France Telecom
Arnaud Martin
Laurent Mauuary
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom, Arnaud Martin, Laurent Mauuary
Priority to EP02788059A (EP1451548A2)
Priority to US10/497,874 (US7359856B2)
Priority to AU2002352339A (AU2002352339A1)
Publication of WO2003048711A2
Publication of WO2003048711A3

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78: Detection of presence or absence of voice signals
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90: Pitch determination of speech signals
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93: Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)

Abstract

A method for detecting speech in an audio signal comprises a step of obtaining energy information from the audio signal, this energy information being used to detect speech in the audio signal. According to the invention, the method further comprises a step of obtaining voicing information from the audio signal, this voicing information being used jointly with the energy information to detect speech in the audio signal.
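
The abstract does not say how the energy and voicing information are computed or combined, so the following is only a minimal sketch of one possible reading. It assumes frame-wise mean-square energy in dB, a voicing measure taken as the peak of the normalized autocorrelation over plausible pitch lags, and a simple "both above threshold" decision. The function names, the 50-400 Hz pitch range, and both threshold values are hypothetical and are not taken from the patent.

```python
import math
import random


def frame_energy_db(frame):
    """Mean-square energy of one frame in dB (full scale = 1.0)."""
    mean_square = sum(s * s for s in frame) / len(frame)
    return 10.0 * math.log10(mean_square + 1e-12)  # small offset avoids log(0)


def voicing_measure(frame, sample_rate, f_min=50.0, f_max=400.0):
    """Peak of the autocorrelation, normalized by lag-0 energy,
    searched over lags matching an assumed pitch range."""
    r0 = sum(s * s for s in frame)
    if r0 == 0.0:
        return 0.0  # silent frame: no periodicity to measure
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)
    best = 0.0
    for lag in range(lag_min, lag_max + 1):
        r = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        best = max(best, r / r0)
    return best


def is_speech(frame, sample_rate,
              energy_threshold_db=-40.0, voicing_threshold=0.5):
    """Joint decision: the frame must be both energetic and voiced.
    Both thresholds are illustrative assumptions."""
    return (frame_energy_db(frame) > energy_threshold_db
            and voicing_measure(frame, sample_rate) > voicing_threshold)


# Three synthetic 30 ms frames at 8 kHz (illustrative test signals only).
SAMPLE_RATE = 8000
voiced_frame = [0.5 * math.sin(2 * math.pi * 200 * n / SAMPLE_RATE)
                for n in range(240)]                   # loud and periodic
quiet_frame = [s * 0.002 for s in voiced_frame]        # periodic but very weak
rng = random.Random(0)
noise_frame = [rng.uniform(-1.0, 1.0) for _ in range(240)]  # loud, aperiodic

decisions = [is_speech(f, SAMPLE_RATE)
             for f in (voiced_frame, quiet_frame, noise_frame)]
```

In this sketch the loud noise frame passes the energy test but fails the voicing test, which illustrates why a joint criterion may be more robust in a noisy environment than energy alone.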
PCT/FR2002/003910 2001-12-05 2002-11-15 System for detecting speech in an audio signal in a noisy environment WO2003048711A2 (fr)

Priority Applications (3)

Application Number Publication Number Priority Date Filing Date Title
EP02788059A EP1451548A2 (fr) 2001-12-05 2002-11-15 System for detecting speech in an audio signal in a noisy environment
US10/497,874 US7359856B2 (en) 2001-12-05 2002-11-15 Speech detection system in an audio signal in noisy surrounding
AU2002352339A AU2002352339A1 (en) 2001-12-05 2002-11-15 Speech detection system in an audio signal in noisy surrounding

Applications Claiming Priority (2)

Application Number Publication Number Priority Date Filing Date Title
FR0115685A FR2833103B1 (fr) 2001-12-05 2001-12-05 System for detecting speech in noise
FR01/15685 2001-12-05

Publications (2)

Publication Number Publication Date
WO2003048711A2 (fr) 2003-06-12
WO2003048711A3 (fr) 2004-02-12

Family

ID=8870113

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
PCT/FR2002/003910 WO2003048711A2 (fr) 2001-12-05 2002-11-15 System for detecting speech in an audio signal in a noisy environment

Country Status (5)

Country Link
US (1) US7359856B2 (fr)
EP (1) EP1451548A2 (fr)
AU (1) AU2002352339A1 (fr)
FR (1) FR2833103B1 (fr)
WO (1) WO2003048711A2 (fr)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2856506B1 (fr) * 2003-06-23 2005-12-02 France Telecom Method and device for detecting speech in an audio signal
FR2864319A1 (fr) * 2005-01-19 2005-06-24 France Telecom Method and device for detecting speech in an audio signal
CN1815550A (zh) * 2005-02-01 2006-08-09 松下电器产业株式会社 Method and system for identifying speech and non-speech in an environment
US8175877B2 (en) * 2005-02-02 2012-05-08 At&T Intellectual Property Ii, L.P. Method and apparatus for predicting word accuracy in automatic speech recognition systems
GB2450886B (en) * 2007-07-10 2009-12-16 Motorola Inc Voice activity detector and a method of operation
KR100930039B1 (ko) * 2007-12-18 2009-12-07 한국전자통신연구원 Apparatus and method for evaluating the performance of a speech recognizer
US8380497B2 (en) * 2008-10-15 2013-02-19 Qualcomm Incorporated Methods and apparatus for noise estimation
US8938389B2 (en) * 2008-12-17 2015-01-20 Nec Corporation Voice activity detector, voice activity detection program, and parameter adjusting method
EP2816560A1 (fr) * 2009-10-19 2014-12-24 Telefonaktiebolaget L M Ericsson (PUBL) Background estimator and method for voice activity detection
EP2561508A1 (fr) * 2010-04-22 2013-02-27 Qualcomm Incorporated Voice activity detection
CN102237081B (zh) * 2010-04-30 2013-04-24 国际商业机器公司 Method and system for evaluating speech prosody
US8898058B2 (en) 2010-10-25 2014-11-25 Qualcomm Incorporated Systems, methods, and apparatus for voice activity detection
JP5747562B2 (ja) * 2010-10-28 2015-07-15 ヤマハ株式会社 Sound processing device
US20150281853A1 (en) * 2011-07-11 2015-10-01 SoundFest, Inc. Systems and methods for enhancing targeted audibility
KR20140147587A (ko) * 2013-06-20 2014-12-30 한국전자통신연구원 Apparatus and method for detecting speech endpoints using WFST
WO2015098079A1 (fr) * 2013-12-26 2015-07-02 パナソニックIpマネジメント株式会社 Speech recognition processing device, speech recognition processing method, and display device
CN112927725A (zh) * 2014-07-29 2021-06-08 瑞典爱立信有限公司 Method for estimating background noise and background noise estimator
CN111739515B (zh) * 2019-09-18 2023-08-04 北京京东尚科信息技术有限公司 Speech recognition method, device, electronic device, server, and related system
KR20210089347A (ko) * 2020-01-08 2021-07-16 엘지전자 주식회사 Speech recognition device and method for learning speech data
CN111599377B (zh) * 2020-04-03 2023-03-31 厦门快商通科技股份有限公司 Device state detection method, system, and mobile terminal based on audio recognition
CN111554314A (zh) * 2020-05-15 2020-08-18 腾讯科技(深圳)有限公司 Noise detection method, apparatus, terminal, and storage medium
CN115602152B (zh) * 2022-12-14 2023-02-28 成都启英泰伦科技有限公司 Speech enhancement method based on a multi-stage attention network

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5276765A (en) * 1988-03-11 1994-01-04 British Telecommunications Public Limited Company Voice activity detection

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4696039A (en) * 1983-10-13 1987-09-22 Texas Instruments Incorporated Speech analysis/synthesis system with silence suppression
US5579431A (en) * 1992-10-05 1996-11-26 Panasonic Technologies, Inc. Speech detection in presence of noise by determining variance over time of frequency band limited energy
US5598466A (en) * 1995-08-28 1997-01-28 Intel Corporation Voice activity detector for half-duplex audio communication system
JPH0990974A (ja) * 1995-09-25 1997-04-04 Nippon Telegr & Teleph Corp <Ntt> 信号処理方法
US5819217A (en) * 1995-12-21 1998-10-06 Nynex Science & Technology, Inc. Method and system for differentiating between speech and noise
US5890109A (en) * 1996-03-28 1999-03-30 Intel Corporation Re-initializing adaptive parameters for encoding audio signals
US6023674A (en) * 1998-01-23 2000-02-08 Telefonaktiebolaget L M Ericsson Non-parametric voice activity detection
US6122531A (en) * 1998-07-31 2000-09-19 Motorola, Inc. Method for selectively including leading fricative sounds in a portable communication device operated in a speakerphone mode
US6327564B1 (en) * 1999-03-05 2001-12-04 Matsushita Electric Corporation Of America Speech detection using stochastic confidence measures on the frequency spectrum
US6775649B1 (en) * 1999-09-01 2004-08-10 Texas Instruments Incorporated Concealment of frame erasures for speech transmission and storage system and method

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
MARTIN A ET AL: "Robust speech/non-speech detection using LDA applied to MFCC", 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS (CAT. NO.01CH37221), 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS, SALT LAKE CITY, UT, USA, 7-11 MAY 2001, 2001, Piscataway, NJ, USA, IEEE, USA, pages 237 - 240 vol.1, XP002245514, ISBN: 0-7803-7041-4 *
MARTIN P: "COMPARISON OF PITCH DETECTION BY CEPSTRUM AND SPECTRAL COMB ANALYSIS", INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH & SIGNAL PROCESSING. ICASSP. PARIS, MAY 3 - 5, 1982, NEW YORK, IEEE, US, vol. 1 CONF. 7, 3 May 1982 (1982-05-03), pages 180 - 183, XP002906644 *
MORENO-BILBAO A ET AL: "PITCH DETECTOR IN SPEECH SIGNALS CORRUPTED BY NOISE", SIGNAL PROCESSING THEORIES AND APPLICATIONS. BARCELONA, SEPT. 18 - 21, 1990, PROCEEDINGS OF THE EUROPEAN SIGNAL PROCESSING CONFERENCE, AMSTERDAM, ELSEVIER, NL, vol. 2 CONF. 5, 18 September 1990 (1990-09-18), pages 1163 - 1166, XP000365761 *
RAMANA RAO G V ET AL: "Word boundary detection using pitch variations", PROCEEDINGS ICSLP 96. FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (CAT. NO.96TH8206), PROCEEDING OF FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING. ICSLP '96, PHILADELPHIA, PA, USA, 3-6 OCT. 1996, 1996, New York, NY, USA, IEEE, USA, pages 813 - 816 vol.2, XP002245515, ISBN: 0-7803-3555-4 *
See also references of EP1451548A2 *

Also Published As

Publication number Publication date
AU2002352339A8 (en) 2003-06-17
US20050143978A1 (en) 2005-06-30
EP1451548A2 (fr) 2004-09-01
WO2003048711A2 (fr) 2003-06-12
AU2002352339A1 (en) 2003-06-17
US7359856B2 (en) 2008-04-15
FR2833103B1 (fr) 2004-07-09
FR2833103A1 (fr) 2003-06-06

Similar Documents

Publication Publication Date Title
WO2003048711A3 (fr) System for detecting speech in an audio signal in a noisy environment
WO2002103695A3 (fr) Device and method for inserting a watermark into an audio signal
WO2001020965A3 (fr) Method for determining a momentary acoustic environment situation, use of the method, and hearing aid
EP0913952A3 (fr) Method for embedding a code in an audio signal and for detecting the embedded signal
WO2003015464A8 (fr) Directional audio signal processing using an oversampled filter bank
AU2003225928A1 (en) Method for robust voice recognition by analyzing redundant features of source signal
WO2003038804A3 (fr) Detection of unwanted intervention
AU2001284588A1 (en) Multi-channel signal encoding and decoding
DK1453194T3 (da) Method for automatic gain adjustment in a hearing aid, and a hearing aid
WO2002052542A3 (fr) Method and device for analyzing a sound signal from a sound source
DE60033132D1 (de) Detection of emotions in speech signals by analyzing a plurality of speech signal parameters
AU7750700A (en) Method and apparatus for the provision of information signals based upon speech recognition
AU2002322102A1 (en) Systems and methods for sensing an acoustic signal using microelectromechanical systems technology
DE502005003436D1 (de) Improving the intelligibility of audio signals containing speech
ATE381237T1 (de) Method for operating a hearing aid, and hearing aid
AU2002232795A1 (en) Perceptual audio signal compression system and method
WO2002007481A3 (fr) Multichannel stereo converter for deriving a stereo surround and/or audio centre signal
AU2003266191A1 (en) Method and device for monitoring brake signals in a vehicle
WO1998001956A3 (fr) System for suppressing microphone noise
AU2002237945A1 (en) Speech transcription, therapy, and analysis system and method
AU2003269418A1 (en) Method for operating a speech recognition system
WO2004095419A3 (fr) System and method for text-to-speech synthesis for a portable device
AU2003215220A1 (en) System and method for efficiently detecting the identification of a received signal
AU2002226922A1 (en) Method and apparatus for speech recognition incorporating location information
EP1335349A3 (fr) Methods and devices for extracting the fundamental frequency for speech coding using interpolation techniques

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
REEP Request for entry into the european phase

Ref document number: 2002788059

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 2002788059

Country of ref document: EP

WWP WIPO information: published in national office

Ref document number: 2002788059

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 10497874

Country of ref document: US

NENP Non-entry into the national phase

Ref country code: JP

WWW WIPO information: withdrawn in national office

Ref document number: JP