DE60033636T2 - Pausendetektion für die Spracherkennung - Google Patents

Pausendetektion für die Spracherkennung Download PDF

Info

Publication number
DE60033636T2
DE60033636T2 DE60033636T DE60033636T DE60033636T2 DE 60033636 T2 DE60033636 T2 DE 60033636T2 DE 60033636 T DE60033636 T DE 60033636T DE 60033636 T DE60033636 T DE 60033636T DE 60033636 T2 DE60033636 T2 DE 60033636T2
Authority
DE
Germany
Prior art keywords
subbands
power
thr
max
pause
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60033636T
Other languages
German (de)
English (en)
Other versions
DE60033636D1 (de
Inventor
Kari Laurila
Juha Häkkinen
Ramalingam Hariharan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Oyj
Original Assignee
Nokia Oyj
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Oyj filed Critical Nokia Oyj
Publication of DE60033636D1 publication Critical patent/DE60033636D1/de
Application granted granted Critical
Publication of DE60033636T2 publication Critical patent/DE60033636T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Circuits Of Receivers In General (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Telephone Function (AREA)
  • Alarm Systems (AREA)
  • Facsimile Transmission Control (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
DE60033636T 1999-01-18 2000-01-17 Pausendetektion für die Spracherkennung Expired - Lifetime DE60033636T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FI990078 1999-01-18
FI990078A FI118359B (fi) 1999-01-18 1999-01-18 Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin
PCT/FI2000/000028 WO2000042600A2 (en) 1999-01-18 2000-01-17 Method in speech recognition and a speech recognition device

Publications (2)

Publication Number Publication Date
DE60033636D1 DE60033636D1 (de) 2007-04-12
DE60033636T2 true DE60033636T2 (de) 2007-06-21

Family

ID=8553379

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60033636T Expired - Lifetime DE60033636T2 (de) 1999-01-18 2000-01-17 Pausendetektion für die Spracherkennung

Country Status (8)

Country Link
US (1) US7146318B2 (fi)
EP (1) EP1153387B1 (fi)
JP (1) JP2002535708A (fi)
AT (1) ATE355588T1 (fi)
AU (1) AU2295800A (fi)
DE (1) DE60033636T2 (fi)
FI (1) FI118359B (fi)
WO (1) WO2000042600A2 (fi)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI118359B (fi) * 1999-01-18 2007-10-15 Nokia Corp Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin
JP2002041073A (ja) * 2000-07-31 2002-02-08 Alpine Electronics Inc 音声認識装置
US20030004720A1 (en) * 2001-01-30 2003-01-02 Harinath Garudadri System and method for computing and transmitting parameters in a distributed voice recognition system
US6771706B2 (en) 2001-03-23 2004-08-03 Qualcomm Incorporated Method and apparatus for utilizing channel state information in a wireless communication system
US7941313B2 (en) * 2001-05-17 2011-05-10 Qualcomm Incorporated System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system
CN101320559B (zh) 2007-06-07 2011-05-18 华为技术有限公司 一种声音激活检测装置及方法
US8082148B2 (en) * 2008-04-24 2011-12-20 Nuance Communications, Inc. Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
US9135809B2 (en) * 2008-06-20 2015-09-15 At&T Intellectual Property I, Lp Voice enabled remote control for a set-top box
DE112009005215T8 (de) * 2009-08-04 2013-01-03 Nokia Corp. Verfahren und Vorrichtung zur Audiosignalklassifizierung
EP3493205B1 (en) 2010-12-24 2020-12-23 Huawei Technologies Co., Ltd. Method and apparatus for adaptively detecting a voice activity in an input audio signal
CN110265058B (zh) 2013-12-19 2023-01-17 瑞典爱立信有限公司 估计音频信号中的背景噪声
US10332564B1 (en) * 2015-06-25 2019-06-25 Amazon Technologies, Inc. Generating tags during video upload
US10090005B2 (en) * 2016-03-10 2018-10-02 Aspinity, Inc. Analog voice activity detection
US10825471B2 (en) * 2017-04-05 2020-11-03 Avago Technologies International Sales Pte. Limited Voice energy detection
RU2761940C1 (ru) 2018-12-18 2021-12-14 Общество С Ограниченной Ответственностью "Яндекс" Способы и электронные устройства для идентификации пользовательского высказывания по цифровому аудиосигналу
CN111327395B (zh) * 2019-11-21 2023-04-11 沈连腾 一种宽带信号的盲检测方法、装置、设备及存储介质

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4015088A (en) * 1975-10-31 1977-03-29 Bell Telephone Laboratories, Incorporated Real-time speech analyzer
EP0167364A1 (en) * 1984-07-06 1986-01-08 AT&T Corp. Speech-silence detection with subband coding
GB8613327D0 (en) * 1986-06-02 1986-07-09 British Telecomm Speech processor
US4811404A (en) * 1987-10-01 1989-03-07 Motorola, Inc. Noise suppression system
FI100840B (fi) * 1995-12-12 1998-02-27 Nokia Mobile Phones Ltd Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin
US5794199A (en) * 1996-01-29 1998-08-11 Texas Instruments Incorporated Method and system for improved discontinuous speech transmission
US6108610A (en) * 1998-10-13 2000-08-22 Noise Cancellation Technologies, Inc. Method and system for updating noise estimates during pauses in an information signal
FI118359B (fi) * 1999-01-18 2007-10-15 Nokia Corp Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin

Also Published As

Publication number Publication date
FI118359B (fi) 2007-10-15
EP1153387A2 (en) 2001-11-14
WO2000042600A3 (en) 2000-09-28
ATE355588T1 (de) 2006-03-15
US20040236571A1 (en) 2004-11-25
AU2295800A (en) 2000-08-01
FI990078A0 (fi) 1999-01-18
FI990078A (fi) 2000-07-19
DE60033636D1 (de) 2007-04-12
US7146318B2 (en) 2006-12-05
WO2000042600A2 (en) 2000-07-20
EP1153387B1 (en) 2007-02-28
JP2002535708A (ja) 2002-10-22

Similar Documents

Publication Publication Date Title
DE60033636T2 (de) Pausendetektion für die Spracherkennung
DE60131639T2 (de) Vorrichtungen und Verfahren zur Bestimmung von Leistungswerten für die Geräuschunterdrückung für ein Sprachkommunikationssystem
DE60024236T2 (de) Sprach endpunktbestimmung in einem rauschsignal
DE3856280T2 (de) Rauschunterdrückungssystem
DE10041512B4 (de) Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen
DE69432943T2 (de) Verfahren und Vorrichtung zur Sprachdetektion
DE69727895T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE3688614T2 (de) Worterkennung in einem spracherkennungssystem unter verwendung datenermässigter wortmuster.
DE69830017T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69614789T2 (de) Vom Anwender auswählbare mehrfache Schwellenwertkriterien für Spracherkennung
DE60204504T2 (de) Schlüsselworterkennung in einem verrauschten Signal
DE60007637T2 (de) Vermeidung von Online-Sprecherüberanpassung bei der Spracherkennung
DE10006930B4 (de) System und Verfahren zur Spracherkennung
DE60034772T2 (de) Zurückweisungsverfahren in der spracherkennung
DE112017007005B4 (de) Akustiksignal-verarbeitungsvorrichtung, akustiksignalverarbeitungsverfahren und freisprech-kommunikationsvorrichtung
DE3009677A1 (de) Verfahren zur erkennung von sprache und sprachpausen
EP0668007A1 (de) Mobilfunkgerät mit freisprecheinrichtung
DE3688749T2 (de) Verfahren und vorrichtung zur sprachsynthese ohne informationen über die stimme oder hinsichtlich stimmhöhe.
DE69918635T2 (de) Vorrichtung und Verfahren zur Sprachverarbeitung
DE69635141T2 (de) Verfahren zur Erzeugung von Sprachmerkmalsignalen und Vorrichtung zu seiner Durchführung
DE19521258A1 (de) Spracherkennungssystem
DE69411817T2 (de) Verfahren und vorrichtung zur kodierung/dekodierung von hintergrundgeräuschen
EP0508547B1 (de) Schaltungsanordnung zur Spracherkennung
DE69130687T2 (de) Sprachsignalverarbeitungsvorrichtung zum Herausschneiden von einem Sprachsignal aus einem verrauschten Sprachsignal
EP1456837B1 (de) Verfahren und vorrichtung zur spracherkennung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition