DE60033636T2 - Pausendetektion für die Spracherkennung - Google Patents
Pausendetektion für die Spracherkennung Download PDFInfo
- Publication number
- DE60033636T2 DE60033636T2 DE60033636T DE60033636T DE60033636T2 DE 60033636 T2 DE60033636 T2 DE 60033636T2 DE 60033636 T DE60033636 T DE 60033636T DE 60033636 T DE60033636 T DE 60033636T DE 60033636 T2 DE60033636 T2 DE 60033636T2
- Authority
- DE
- Germany
- Prior art keywords
- subbands
- power
- thr
- max
- pause
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 42
- 238000000034 method Methods 0.000 claims abstract description 36
- 238000001228 spectrum Methods 0.000 claims abstract 4
- 238000004364 calculation method Methods 0.000 claims description 27
- 230000000694 effects Effects 0.000 claims description 14
- 238000004891 communication Methods 0.000 claims description 11
- 230000004048 modification Effects 0.000 claims description 8
- 238000012986 modification Methods 0.000 claims description 8
- 238000001914 filtration Methods 0.000 claims description 6
- 230000003247 decreasing effect Effects 0.000 claims description 3
- 238000011895 specific detection Methods 0.000 claims 5
- 230000006870 function Effects 0.000 description 17
- 239000000872 buffer Substances 0.000 description 8
- 238000005070 sampling Methods 0.000 description 7
- 230000005236 sound signal Effects 0.000 description 4
- 238000013459 approach Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000003862 health status Effects 0.000 description 1
- 230000035987 intoxication Effects 0.000 description 1
- 231100000566 intoxication Toxicity 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/87—Detection of discrete points within a voice signal
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Mobile Radio Communication Systems (AREA)
- Circuits Of Receivers In General (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Telephone Function (AREA)
- Alarm Systems (AREA)
- Facsimile Transmission Control (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FI990078 | 1999-01-18 | ||
FI990078A FI118359B (fi) | 1999-01-18 | 1999-01-18 | Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin |
PCT/FI2000/000028 WO2000042600A2 (en) | 1999-01-18 | 2000-01-17 | Method in speech recognition and a speech recognition device |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60033636D1 DE60033636D1 (de) | 2007-04-12 |
DE60033636T2 true DE60033636T2 (de) | 2007-06-21 |
Family
ID=8553379
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60033636T Expired - Lifetime DE60033636T2 (de) | 1999-01-18 | 2000-01-17 | Pausendetektion für die Spracherkennung |
Country Status (8)
Country | Link |
---|---|
US (1) | US7146318B2 (fi) |
EP (1) | EP1153387B1 (fi) |
JP (1) | JP2002535708A (fi) |
AT (1) | ATE355588T1 (fi) |
AU (1) | AU2295800A (fi) |
DE (1) | DE60033636T2 (fi) |
FI (1) | FI118359B (fi) |
WO (1) | WO2000042600A2 (fi) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FI118359B (fi) * | 1999-01-18 | 2007-10-15 | Nokia Corp | Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin |
JP2002041073A (ja) * | 2000-07-31 | 2002-02-08 | Alpine Electronics Inc | 音声認識装置 |
US20030004720A1 (en) * | 2001-01-30 | 2003-01-02 | Harinath Garudadri | System and method for computing and transmitting parameters in a distributed voice recognition system |
US6771706B2 (en) | 2001-03-23 | 2004-08-03 | Qualcomm Incorporated | Method and apparatus for utilizing channel state information in a wireless communication system |
US7941313B2 (en) * | 2001-05-17 | 2011-05-10 | Qualcomm Incorporated | System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system |
CN101320559B (zh) | 2007-06-07 | 2011-05-18 | 华为技术有限公司 | 一种声音激活检测装置及方法 |
US8082148B2 (en) * | 2008-04-24 | 2011-12-20 | Nuance Communications, Inc. | Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise |
US9135809B2 (en) * | 2008-06-20 | 2015-09-15 | At&T Intellectual Property I, Lp | Voice enabled remote control for a set-top box |
DE112009005215T8 (de) * | 2009-08-04 | 2013-01-03 | Nokia Corp. | Verfahren und Vorrichtung zur Audiosignalklassifizierung |
EP3493205B1 (en) | 2010-12-24 | 2020-12-23 | Huawei Technologies Co., Ltd. | Method and apparatus for adaptively detecting a voice activity in an input audio signal |
CN110265058B (zh) | 2013-12-19 | 2023-01-17 | 瑞典爱立信有限公司 | 估计音频信号中的背景噪声 |
US10332564B1 (en) * | 2015-06-25 | 2019-06-25 | Amazon Technologies, Inc. | Generating tags during video upload |
US10090005B2 (en) * | 2016-03-10 | 2018-10-02 | Aspinity, Inc. | Analog voice activity detection |
US10825471B2 (en) * | 2017-04-05 | 2020-11-03 | Avago Technologies International Sales Pte. Limited | Voice energy detection |
RU2761940C1 (ru) | 2018-12-18 | 2021-12-14 | Общество С Ограниченной Ответственностью "Яндекс" | Способы и электронные устройства для идентификации пользовательского высказывания по цифровому аудиосигналу |
CN111327395B (zh) * | 2019-11-21 | 2023-04-11 | 沈连腾 | 一种宽带信号的盲检测方法、装置、设备及存储介质 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4015088A (en) * | 1975-10-31 | 1977-03-29 | Bell Telephone Laboratories, Incorporated | Real-time speech analyzer |
EP0167364A1 (en) * | 1984-07-06 | 1986-01-08 | AT&T Corp. | Speech-silence detection with subband coding |
GB8613327D0 (en) * | 1986-06-02 | 1986-07-09 | British Telecomm | Speech processor |
US4811404A (en) * | 1987-10-01 | 1989-03-07 | Motorola, Inc. | Noise suppression system |
FI100840B (fi) * | 1995-12-12 | 1998-02-27 | Nokia Mobile Phones Ltd | Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin |
US5794199A (en) * | 1996-01-29 | 1998-08-11 | Texas Instruments Incorporated | Method and system for improved discontinuous speech transmission |
US6108610A (en) * | 1998-10-13 | 2000-08-22 | Noise Cancellation Technologies, Inc. | Method and system for updating noise estimates during pauses in an information signal |
FI118359B (fi) * | 1999-01-18 | 2007-10-15 | Nokia Corp | Menetelmä puheentunnistuksessa ja puheentunnistuslaite ja langaton viestin |
-
1999
- 1999-01-18 FI FI990078A patent/FI118359B/fi not_active IP Right Cessation
-
2000
- 2000-01-17 WO PCT/FI2000/000028 patent/WO2000042600A2/en active IP Right Grant
- 2000-01-17 EP EP00901626A patent/EP1153387B1/en not_active Expired - Lifetime
- 2000-01-17 AT AT00901626T patent/ATE355588T1/de not_active IP Right Cessation
- 2000-01-17 AU AU22958/00A patent/AU2295800A/en not_active Abandoned
- 2000-01-17 DE DE60033636T patent/DE60033636T2/de not_active Expired - Lifetime
- 2000-01-17 JP JP2000594107A patent/JP2002535708A/ja active Pending
-
2004
- 2004-05-06 US US10/840,003 patent/US7146318B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
FI118359B (fi) | 2007-10-15 |
EP1153387A2 (en) | 2001-11-14 |
WO2000042600A3 (en) | 2000-09-28 |
ATE355588T1 (de) | 2006-03-15 |
US20040236571A1 (en) | 2004-11-25 |
AU2295800A (en) | 2000-08-01 |
FI990078A0 (fi) | 1999-01-18 |
FI990078A (fi) | 2000-07-19 |
DE60033636D1 (de) | 2007-04-12 |
US7146318B2 (en) | 2006-12-05 |
WO2000042600A2 (en) | 2000-07-20 |
EP1153387B1 (en) | 2007-02-28 |
JP2002535708A (ja) | 2002-10-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60033636T2 (de) | Pausendetektion für die Spracherkennung | |
DE60131639T2 (de) | Vorrichtungen und Verfahren zur Bestimmung von Leistungswerten für die Geräuschunterdrückung für ein Sprachkommunikationssystem | |
DE60024236T2 (de) | Sprach endpunktbestimmung in einem rauschsignal | |
DE3856280T2 (de) | Rauschunterdrückungssystem | |
DE10041512B4 (de) | Verfahren und Vorrichtung zur künstlichen Erweiterung der Bandbreite von Sprachsignalen | |
DE69432943T2 (de) | Verfahren und Vorrichtung zur Sprachdetektion | |
DE69727895T2 (de) | Verfahren und Vorrichtung zur Sprachkodierung | |
DE3688614T2 (de) | Worterkennung in einem spracherkennungssystem unter verwendung datenermässigter wortmuster. | |
DE69830017T2 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE69614789T2 (de) | Vom Anwender auswählbare mehrfache Schwellenwertkriterien für Spracherkennung | |
DE60204504T2 (de) | Schlüsselworterkennung in einem verrauschten Signal | |
DE60007637T2 (de) | Vermeidung von Online-Sprecherüberanpassung bei der Spracherkennung | |
DE10006930B4 (de) | System und Verfahren zur Spracherkennung | |
DE60034772T2 (de) | Zurückweisungsverfahren in der spracherkennung | |
DE112017007005B4 (de) | Akustiksignal-verarbeitungsvorrichtung, akustiksignalverarbeitungsverfahren und freisprech-kommunikationsvorrichtung | |
DE3009677A1 (de) | Verfahren zur erkennung von sprache und sprachpausen | |
EP0668007A1 (de) | Mobilfunkgerät mit freisprecheinrichtung | |
DE3688749T2 (de) | Verfahren und vorrichtung zur sprachsynthese ohne informationen über die stimme oder hinsichtlich stimmhöhe. | |
DE69918635T2 (de) | Vorrichtung und Verfahren zur Sprachverarbeitung | |
DE69635141T2 (de) | Verfahren zur Erzeugung von Sprachmerkmalsignalen und Vorrichtung zu seiner Durchführung | |
DE19521258A1 (de) | Spracherkennungssystem | |
DE69411817T2 (de) | Verfahren und vorrichtung zur kodierung/dekodierung von hintergrundgeräuschen | |
EP0508547B1 (de) | Schaltungsanordnung zur Spracherkennung | |
DE69130687T2 (de) | Sprachsignalverarbeitungsvorrichtung zum Herausschneiden von einem Sprachsignal aus einem verrauschten Sprachsignal | |
EP1456837B1 (de) | Verfahren und vorrichtung zur spracherkennung |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |