DE60033636T2 - Pause detection for speech recognition - Google Patents

Pause detection for speech recognition

Info

Publication number
DE60033636T2
Authority
DE
Germany
Prior art keywords
subbands
power
thr
max
pause
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60033636T
Other languages
German (de)
English (en)
Other versions
DE60033636D1 (de)
Inventor
Kari Laurila
Juha Häkkinen
Ramalingam Hariharan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Inc
Original Assignee
Nokia Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1999-01-18
Filing date
2000-01-17
Publication date
2007-06-21
Application filed by Nokia Inc
Application granted
Publication of DE60033636D1
Publication of DE60033636T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G10L25/87 Detection of discrete points within a voice signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Circuits Of Receivers In General (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Alarm Systems (AREA)
  • Telephone Function (AREA)
DE60033636T 1999-01-18 2000-01-17 Pause detection for speech recognition Expired - Lifetime DE60033636T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FI990078A FI118359B (fi) 1999-01-18 1999-01-18 Method in speech recognition, a speech recognition device and a wireless communication device
FI990078 1999-01-18
PCT/FI2000/000028 WO2000042600A2 (en) 1999-01-18 2000-01-17 Method in speech recognition and a speech recognition device

Publications (2)

Publication Number Publication Date
DE60033636D1 DE60033636D1 (de) 2007-04-12
DE60033636T2 DE60033636T2 (de) 2007-06-21

Family

ID=8553379

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60033636T Expired - Lifetime DE60033636T2 (de) 1999-01-18 2000-01-17 Pause detection for speech recognition

Country Status (8)

Country Link
US (1) US7146318B2 (en)
EP (1) EP1153387B1 (en)
JP (1) JP2002535708A (en)
AT (1) ATE355588T1 (en)
AU (1) AU2295800A (en)
DE (1) DE60033636T2 (en)
FI (1) FI118359B (en)
WO (1) WO2000042600A2 (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI118359B (fi) * 1999-01-18 2007-10-15 Nokia Corp Method in speech recognition, a speech recognition device and a wireless communication device
JP2002041073A (ja) * 2000-07-31 2002-02-08 Alpine Electronics Inc Speech recognition device
US20030004720A1 (en) * 2001-01-30 2003-01-02 Harinath Garudadri System and method for computing and transmitting parameters in a distributed voice recognition system
US6771706B2 (en) 2001-03-23 2004-08-03 Qualcomm Incorporated Method and apparatus for utilizing channel state information in a wireless communication system
US7941313B2 (en) * 2001-05-17 2011-05-10 Qualcomm Incorporated System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system
CN101320559B (zh) * 2007-06-07 2011-05-18 华为技术有限公司 Voice activity detection device and method
US8082148B2 (en) * 2008-04-24 2011-12-20 Nuance Communications, Inc. Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
US9135809B2 (en) * 2008-06-20 2015-09-15 At&T Intellectual Property I, Lp Voice enabled remote control for a set-top box
DE112009005215T8 (de) * 2009-08-04 2013-01-03 Nokia Corp. Method and device for audio signal classification
HUE053127T2 (hu) 2010-12-24 2021-06-28 Huawei Tech Co Ltd Method and apparatus for adaptively detecting voice activity in an input audio signal
EP3438979B1 (en) * 2013-12-19 2020-06-24 Telefonaktiebolaget LM Ericsson (publ) Estimation of background noise in audio signals
US10332564B1 (en) * 2015-06-25 2019-06-25 Amazon Technologies, Inc. Generating tags during video upload
US10090005B2 (en) * 2016-03-10 2018-10-02 Aspinity, Inc. Analog voice activity detection
US10825471B2 (en) * 2017-04-05 2020-11-03 Avago Technologies International Sales Pte. Limited Voice energy detection
RU2761940C1 (ru) 2018-12-18 2021-12-14 Общество С Ограниченной Ответственностью "Яндекс" Methods and electronic devices for identifying a user utterance from a digital audio signal
CN111327395B (zh) * 2019-11-21 2023-04-11 沈连腾 Blind detection method, apparatus, device and storage medium for wideband signals

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4015088A (en) * 1975-10-31 1977-03-29 Bell Telephone Laboratories, Incorporated Real-time speech analyzer
EP0167364A1 (en) * 1984-07-06 1986-01-08 AT&T Corp. Speech-silence detection with subband coding
GB8613327D0 (en) * 1986-06-02 1986-07-09 British Telecomm Speech processor
US4811404A (en) * 1987-10-01 1989-03-07 Motorola, Inc. Noise suppression system
FI100840B (fi) * 1995-12-12 1998-02-27 Nokia Mobile Phones Ltd Noise suppressor and method for suppressing background noise in noisy speech, and a mobile station
US5794199A (en) 1996-01-29 1998-08-11 Texas Instruments Incorporated Method and system for improved discontinuous speech transmission
US6108610A (en) * 1998-10-13 2000-08-22 Noise Cancellation Technologies, Inc. Method and system for updating noise estimates during pauses in an information signal
FI118359B (fi) * 1999-01-18 2007-10-15 Nokia Corp Method in speech recognition, a speech recognition device and a wireless communication device

Also Published As

Publication number Publication date
EP1153387A2 (en) 2001-11-14
FI118359B (fi) 2007-10-15
JP2002535708A (ja) 2002-10-22
EP1153387B1 (en) 2007-02-28
FI990078A0 (fi) 1999-01-18
ATE355588T1 (de) 2006-03-15
AU2295800A (en) 2000-08-01
US7146318B2 (en) 2006-12-05
WO2000042600A3 (en) 2000-09-28
WO2000042600A2 (en) 2000-07-20
DE60033636D1 (de) 2007-04-12
FI990078L (fi) 2000-07-19
US20040236571A1 (en) 2004-11-25

Similar Documents

Publication Publication Date Title
DE60033636T2 (de) Pause detection for speech recognition
DE69926851T2 (de) Method and device for voice activity detection
DE60024236T2 (de) Speech endpoint determination in a noisy signal
DE3856280T2 (de) Noise suppression system
DE69426969T2 (de) Speech recognition with weighted decision
DE10041512B4 (de) Method and device for artificially extending the bandwidth of speech signals
DE3752288T2 (de) Speech processor
DE69830017T2 (de) Method and device for speech recognition
DE69727895T2 (de) Method and device for speech coding
DE69614789T2 (de) User-selectable multiple threshold criteria for speech recognition
DE60204504T2 (de) Keyword detection in a noisy signal
DE60007637T2 (de) Avoidance of online speaker over-adaptation in speech recognition
DE112017007005B4 (de) Acoustic signal processing device, acoustic signal processing method and hands-free communication device
DE10006930B4 (de) System and method for speech recognition
DE3009677A1 (de) Method for detecting speech and speech pauses
WO1995007597A1 (de) Mobile radio device with hands-free unit
DE69918635T2 (de) Device and method for speech processing
DE69635141T2 (de) Method for generating speech feature signals and device for carrying it out
DE69411817T2 (de) Method and device for encoding/decoding background noise
EP0508547B1 (de) Circuit arrangement for speech recognition
DE69130687T2 (de) Speech signal processing device for extracting a speech signal from a noisy speech signal
DE10043064B4 (de) Method and device for eliminating loudspeaker interference from microphone signals
EP2080197B1 (de) Device for noise suppression in an audio signal
EP1456837B1 (de) Method and device for speech recognition
DE69922769T2 (de) Device and method for speech processing

Legal Events

Date Code Title Description
8364 No opposition during term of opposition