WO2003048711A3 - Speech detection system in an audio signal in noisy surrounding
- Publication number
- WO2003048711A3 (application PCT/FR2002/003910)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio signal
- speech detection
- detection system
- noisy surrounding
- information
- Prior art date
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/90—Pitch determination of speech signals
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
Abstract
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP02788059A EP1451548A2 (fr) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
US10/497,874 US7359856B2 (en) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
AU2002352339A AU2002352339A1 (en) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0115685A FR2833103B1 (fr) | 2001-12-05 | 2001-12-05 | System for detecting speech in noise |
FR01/15685 | 2001-12-05 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2003048711A2 (fr) | 2003-06-12 |
WO2003048711A3 (fr) | 2004-02-12 |
Family
ID=8870113
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/FR2002/003910 WO2003048711A2 (fr) | 2001-12-05 | 2002-11-15 | Speech detection system in an audio signal in noisy surrounding |
Country Status (5)
Country | Link |
---|---|
US (1) | US7359856B2 (fr) |
EP (1) | EP1451548A2 (fr) |
AU (1) | AU2002352339A1 (fr) |
FR (1) | FR2833103B1 (fr) |
WO (1) | WO2003048711A2 (fr) |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2856506B1 (fr) * | 2003-06-23 | 2005-12-02 | France Telecom | Method and device for detecting speech in an audio signal |
FR2864319A1 (fr) * | 2005-01-19 | 2005-06-24 | France Telecom | Method and device for detecting speech in an audio signal |
CN1815550A (zh) * | 2005-02-01 | 2006-08-09 | 松下电器产业株式会社 | Method and system for identifying speech and non-speech in an environment |
US8175877B2 (en) * | 2005-02-02 | 2012-05-08 | At&T Intellectual Property Ii, L.P. | Method and apparatus for predicting word accuracy in automatic speech recognition systems |
GB2450886B (en) * | 2007-07-10 | 2009-12-16 | Motorola Inc | Voice activity detector and a method of operation |
KR100930039B1 (ko) * | 2007-12-18 | 2009-12-07 | 한국전자통신연구원 | Apparatus and method for evaluating the performance of a speech recognizer |
US8380497B2 (en) * | 2008-10-15 | 2013-02-19 | Qualcomm Incorporated | Methods and apparatus for noise estimation |
US8938389B2 (en) * | 2008-12-17 | 2015-01-20 | Nec Corporation | Voice activity detector, voice activity detection program, and parameter adjusting method |
EP2816560A1 (fr) * | 2009-10-19 | 2014-12-24 | Telefonaktiebolaget L M Ericsson (PUBL) | Background estimator and method for voice activity detection |
EP2561508A1 (fr) * | 2010-04-22 | 2013-02-27 | Qualcomm Incorporated | Voice activity detection |
CN102237081B (zh) * | 2010-04-30 | 2013-04-24 | 国际商业机器公司 | Speech prosody evaluation method and system |
US8898058B2 (en) | 2010-10-25 | 2014-11-25 | Qualcomm Incorporated | Systems, methods, and apparatus for voice activity detection |
JP5747562B2 (ja) * | 2010-10-28 | 2015-07-15 | ヤマハ株式会社 | Sound processing device |
US20150281853A1 (en) * | 2011-07-11 | 2015-10-01 | SoundFest, Inc. | Systems and methods for enhancing targeted audibility |
KR20140147587A (ko) * | 2013-06-20 | 2014-12-30 | 한국전자통신연구원 | Apparatus and method for speech endpoint detection using WFST |
WO2015098079A1 (fr) * | 2013-12-26 | 2015-07-02 | パナソニックIpマネジメント株式会社 | Voice recognition processing device, voice recognition processing method, and display device |
CN112927725A (zh) * | 2014-07-29 | 2021-06-08 | 瑞典爱立信有限公司 | Method for estimating background noise and background noise estimator |
CN111739515B (zh) * | 2019-09-18 | 2023-08-04 | 北京京东尚科信息技术有限公司 | Speech recognition method, device, electronic device, server, and related system |
KR20210089347A (ko) * | 2020-01-08 | 2021-07-16 | 엘지전자 주식회사 | Speech recognition apparatus and method for learning speech data |
CN111599377B (zh) * | 2020-04-03 | 2023-03-31 | 厦门快商通科技股份有限公司 | Device state detection method, system, and mobile terminal based on audio recognition |
CN111554314A (zh) * | 2020-05-15 | 2020-08-18 | 腾讯科技(深圳)有限公司 | Noise detection method, apparatus, terminal, and storage medium |
CN115602152B (zh) * | 2022-12-14 | 2023-02-28 | 成都启英泰伦科技有限公司 | Speech enhancement method based on a multi-stage attention network |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5276765A (en) * | 1988-03-11 | 1994-01-04 | British Telecommunications Public Limited Company | Voice activity detection |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4696039A (en) * | 1983-10-13 | 1987-09-22 | Texas Instruments Incorporated | Speech analysis/synthesis system with silence suppression |
US5579431A (en) * | 1992-10-05 | 1996-11-26 | Panasonic Technologies, Inc. | Speech detection in presence of noise by determining variance over time of frequency band limited energy |
US5598466A (en) * | 1995-08-28 | 1997-01-28 | Intel Corporation | Voice activity detector for half-duplex audio communication system |
JPH0990974A (ja) * | 1995-09-25 | 1997-04-04 | Nippon Telegr & Teleph Corp <Ntt> | Signal processing method |
US5819217A (en) * | 1995-12-21 | 1998-10-06 | Nynex Science & Technology, Inc. | Method and system for differentiating between speech and noise |
US5890109A (en) * | 1996-03-28 | 1999-03-30 | Intel Corporation | Re-initializing adaptive parameters for encoding audio signals |
US6023674A (en) * | 1998-01-23 | 2000-02-08 | Telefonaktiebolaget L M Ericsson | Non-parametric voice activity detection |
US6122531A (en) * | 1998-07-31 | 2000-09-19 | Motorola, Inc. | Method for selectively including leading fricative sounds in a portable communication device operated in a speakerphone mode |
US6327564B1 (en) * | 1999-03-05 | 2001-12-04 | Matsushita Electric Corporation Of America | Speech detection using stochastic confidence measures on the frequency spectrum |
US6775649B1 (en) * | 1999-09-01 | 2004-08-10 | Texas Instruments Incorporated | Concealment of frame erasures for speech transmission and storage system and method |
2001
- 2001-12-05: FR, application FR0115685A, patent FR2833103B1 (fr), not active: Expired - Fee Related
2002
- 2002-11-15: EP, application EP02788059A, patent EP1451548A2 (fr), not active: Withdrawn
- 2002-11-15: WO, application PCT/FR2002/003910, patent WO2003048711A2 (fr), not active: Application Discontinuation
- 2002-11-15: AU, application AU2002352339A, patent AU2002352339A1 (en), not active: Abandoned
- 2002-11-15: US, application US10/497,874, patent US7359856B2 (en), not active: Expired - Fee Related
Non-Patent Citations (5)
Title |
---|
MARTIN A ET AL: "Robust speech/non-speech detection using LDA applied to MFCC", 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), Salt Lake City, UT, USA, 7-11 May 2001, IEEE, Piscataway, NJ, USA, vol. 1, pages 237-240, XP002245514, ISBN: 0-7803-7041-4 * |
MARTIN P: "Comparison of pitch detection by cepstrum and spectral comb analysis", International Conference on Acoustics, Speech & Signal Processing (ICASSP), Paris, 3-5 May 1982, IEEE, New York, US, vol. 1, conf. 7, 3 May 1982 (1982-05-03), pages 180-183, XP002906644 * |
MORENO-BILBAO A ET AL: "Pitch detector in speech signals corrupted by noise", Signal Processing Theories and Applications: Proceedings of the European Signal Processing Conference, Barcelona, 18-21 Sept. 1990, Elsevier, Amsterdam, NL, vol. 2, conf. 5, 18 September 1990 (1990-09-18), pages 1163-1166, XP000365761 * |
RAMANA RAO G V ET AL: "Word boundary detection using pitch variations", Proceedings ICSLP 96: Fourth International Conference on Spoken Language Processing (Cat. No.96TH8206), Philadelphia, PA, USA, 3-6 Oct. 1996, IEEE, New York, NY, USA, vol. 2, pages 813-816, XP002245515, ISBN: 0-7803-3555-4 * |
See also references of EP1451548A2 * |
Also Published As
Publication number | Publication date |
---|---|
AU2002352339A8 (en) | 2003-06-17 |
US20050143978A1 (en) | 2005-06-30 |
EP1451548A2 (fr) | 2004-09-01 |
WO2003048711A2 (fr) | 2003-06-12 |
AU2002352339A1 (en) | 2003-06-17 |
US7359856B2 (en) | 2008-04-15 |
FR2833103B1 (fr) | 2004-07-09 |
FR2833103A1 (fr) | 2003-06-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2003048711A3 (fr) | Speech detection system in an audio signal in noisy surrounding | |
WO2002103695A3 (fr) | Device and method for embedding a watermark in an audio signal | |
WO2001020965A3 (fr) | Method for determining a momentary acoustic environment situation, use of this method, and hearing aid | |
EP0913952A3 (fr) | Method for embedding a code in an audio signal and for detecting the embedded signal | |
WO2003015464A8 (fr) | Directional audio signal processing using an oversampled filter bank | |
AU2003225928A1 (en) | Method for robust voice recognition by analyzing redundant features of source signal | |
WO2003038804A3 (fr) | Detection of unwanted intervention | |
AU2001284588A1 (en) | Multi-channel signal encoding and decoding | |
DK1453194T3 (da) | Method for automatic gain adjustment in a hearing aid, and a hearing aid | |
WO2002052542A3 (fr) | Method and device for analyzing a sound signal from a sound source | |
DE60033132D1 (de) | Detection of emotions in speech signals by analysis of a plurality of speech signal parameters | |
AU7750700A (en) | Method and apparatus for the provision of information signals based upon speech recognition | |
AU2002322102A1 (en) | Systems and methods for sensing an acoustic signal using microelectromechanical systems technology | |
DE502005003436D1 (de) | Improving the intelligibility of audio signals containing speech | |
ATE381237T1 (de) | Method for operating a hearing aid, and hearing aid | |
AU2002232795A1 (en) | Perceptual audio signal compression system and method | |
WO2002007481A3 (fr) | Multichannel stereo converter for deriving a stereo surround and/or audio center signal | |
AU2003266191A1 (en) | Method and device for monitoring brake signals in a vehicle | |
WO1998001956A3 (fr) | System for suppressing microphone noise | |
AU2002237945A1 (en) | Speech transcription, therapy, and analysis system and method | |
AU2003269418A1 (en) | Method for operating a speech recognition system | |
WO2004095419A3 (fr) | System and method for text-to-speech synthesis on a portable device | |
AU2003215220A1 (en) | System and method for efficiently detecting the identification of a received signal | |
AU2002226922A1 (en) | Method and apparatus for speech recognition incorporating location information | |
EP1335349A3 (fr) | Methods and devices for extracting the fundamental frequency for speech coding using interpolation techniques |
Legal Events
Code | Title | Description |
---|---|---|
AK | Designated states | Kind code of ref document: A2. Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
AL | Designated countries for regional patents | Kind code of ref document: A2. Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
REEP | Request for entry into the European phase | Ref document number: 2002788059. Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 2002788059. Country of ref document: EP |
WWP | WIPO information: published in national office | Ref document number: 2002788059. Country of ref document: EP |
WWE | WIPO information: entry into national phase | Ref document number: 10497874. Country of ref document: US |
NENP | Non-entry into the national phase | Ref country code: JP |
WWW | WIPO information: withdrawn in national office | Ref document number: JP |