DE59809897D1 - Sprachaktivitätserkennung - Google Patents
SprachaktivitätserkennungInfo
- Publication number
- DE59809897D1 DE59809897D1 DE59809897T DE59809897T DE59809897D1 DE 59809897 D1 DE59809897 D1 DE 59809897D1 DE 59809897 T DE59809897 T DE 59809897T DE 59809897 T DE59809897 T DE 59809897T DE 59809897 D1 DE59809897 D1 DE 59809897D1
- Authority
- DE
- Germany
- Prior art keywords
- speech
- activity identification
- voice activity
- activity detection
- controlling
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000001514 detection method Methods 0.000 title 1
- 230000011218 segmentation Effects 0.000 abstract 1
- 230000009466 transformation Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Time-Division Multiplex Systems (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
- Geophysics And Detection Of Objects (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19716862A DE19716862A1 (de) | 1997-04-22 | 1997-04-22 | Sprachaktivitätserkennung |
Publications (1)
Publication Number | Publication Date |
---|---|
DE59809897D1 true DE59809897D1 (de) | 2003-11-20 |
Family
ID=7827317
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE19716862A Ceased DE19716862A1 (de) | 1997-04-22 | 1997-04-22 | Sprachaktivitätserkennung |
DE59809897T Expired - Lifetime DE59809897D1 (de) | 1997-04-22 | 1998-02-19 | Sprachaktivitätserkennung |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE19716862A Ceased DE19716862A1 (de) | 1997-04-22 | 1997-04-22 | Sprachaktivitätserkennung |
Country Status (4)
Country | Link |
---|---|
US (1) | US6374211B2 (de) |
EP (1) | EP0874352B1 (de) |
AT (1) | ATE252265T1 (de) |
DE (2) | DE19716862A1 (de) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE10026904A1 (de) | 2000-04-28 | 2002-01-03 | Deutsche Telekom Ag | Verfahren zur Berechnung des die Lautstärke mitbestimmenden Verstärkungsfaktors für ein codiert übertragenes Sprachsignal |
EP1279164A1 (de) | 2000-04-28 | 2003-01-29 | Deutsche Telekom AG | Verfahren zur berechnung einer sprachaktivitätsentscheidung (voice activity detector) |
US7505594B2 (en) * | 2000-12-19 | 2009-03-17 | Qualcomm Incorporated | Discontinuous transmission (DTX) controller system and method |
US6725191B2 (en) * | 2001-07-19 | 2004-04-20 | Vocaltec Communications Limited | Method and apparatus for transmitting voice over internet |
US8315865B2 (en) * | 2004-05-04 | 2012-11-20 | Hewlett-Packard Development Company, L.P. | Method and apparatus for adaptive conversation detection employing minimal computation |
US7574353B2 (en) * | 2004-11-18 | 2009-08-11 | Lsi Logic Corporation | Transmit/receive data paths for voice-over-internet (VoIP) communication systems |
US7983922B2 (en) * | 2005-04-15 | 2011-07-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing |
KR100655953B1 (ko) * | 2006-02-06 | 2006-12-11 | 한양대학교 산학협력단 | 웨이블릿 패킷 변환을 이용한 음성 처리 시스템 및 그 방법 |
US7680657B2 (en) * | 2006-08-15 | 2010-03-16 | Microsoft Corporation | Auto segmentation based partitioning and clustering approach to robust endpointing |
KR100789084B1 (ko) | 2006-11-21 | 2007-12-26 | 한양대학교 산학협력단 | 웨이블릿 패킷 영역에서 비선형 구조의 과중 이득에 의한음질 개선 방법 |
US9361883B2 (en) * | 2012-05-01 | 2016-06-07 | Microsoft Technology Licensing, Llc | Dictation with incremental recognition of speech |
CN104019885A (zh) | 2013-02-28 | 2014-09-03 | 杜比实验室特许公司 | 声场分析系统 |
US9979829B2 (en) | 2013-03-15 | 2018-05-22 | Dolby Laboratories Licensing Corporation | Normalization of soundfield orientations based on auditory scene analysis |
US10917611B2 (en) | 2015-06-09 | 2021-02-09 | Avaya Inc. | Video adaptation in conferencing using power or view indications |
WO2020252782A1 (zh) * | 2019-06-21 | 2020-12-24 | 深圳市汇顶科技股份有限公司 | 语音检测方法、语音检测装置、语音处理芯片以及电子设备 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5152007A (en) * | 1991-04-23 | 1992-09-29 | Motorola, Inc. | Method and apparatus for detecting speech |
GB2272554A (en) * | 1992-11-13 | 1994-05-18 | Creative Tech Ltd | Recognizing speech by using wavelet transform and transient response therefrom |
US5388182A (en) * | 1993-02-16 | 1995-02-07 | Prometheus, Inc. | Nonlinear method and apparatus for coding and decoding acoustic signals with data compression and noise suppression using cochlear filters, wavelet analysis, and irregular sampling reconstruction |
US5459814A (en) * | 1993-03-26 | 1995-10-17 | Hughes Aircraft Company | Voice activity detector for speech signals in variable background noise |
JP3090842B2 (ja) * | 1994-04-28 | 2000-09-25 | 沖電気工業株式会社 | ビタビ復号法に適応した送信装置 |
FR2727236B1 (fr) * | 1994-11-22 | 1996-12-27 | Alcatel Mobile Comm France | Detection d'activite vocale |
US5822726A (en) * | 1995-01-31 | 1998-10-13 | Motorola, Inc. | Speech presence detector based on sparse time-random signal samples |
ATE206841T1 (de) * | 1995-06-30 | 2001-10-15 | Deutsche Telekom Ag | Verfahren und anordnung zur klassifizierung von sprachsignalen |
DE19538852A1 (de) * | 1995-06-30 | 1997-01-02 | Deutsche Telekom Ag | Verfahren und Anordnung zur Klassifizierung von Sprachsignalen |
CA2188369C (en) * | 1995-10-19 | 2005-01-11 | Joachim Stegmann | Method and an arrangement for classifying speech signals |
-
1997
- 1997-04-22 DE DE19716862A patent/DE19716862A1/de not_active Ceased
-
1998
- 1998-02-19 EP EP98102842A patent/EP0874352B1/de not_active Expired - Lifetime
- 1998-02-19 AT AT98102842T patent/ATE252265T1/de active
- 1998-02-19 DE DE59809897T patent/DE59809897D1/de not_active Expired - Lifetime
- 1998-04-22 US US09/064,248 patent/US6374211B2/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
US6374211B2 (en) | 2002-04-16 |
ATE252265T1 (de) | 2003-11-15 |
EP0874352B1 (de) | 2003-10-15 |
EP0874352A2 (de) | 1998-10-28 |
EP0874352A3 (de) | 1999-06-02 |
US20010014854A1 (en) | 2001-08-16 |
DE19716862A1 (de) | 1998-10-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE59809897D1 (de) | Sprachaktivitätserkennung | |
US5228087A (en) | Speech recognition apparatus and methods | |
CA2228948A1 (en) | Pattern recognition | |
DE60117144D1 (de) | Sprachübertragungssystem und verfahren zur behandlung verlorener datenrahmen | |
ATE302991T1 (de) | Verfahren zur signalgesteuerten schaltung zwischen verschiedenen audiokodierungssystemen | |
EP1083541A3 (de) | Verfahren und Vorrichtung zur Sprachdetektion | |
ATE202232T1 (de) | Verfahren zur sprachkodierung | |
CA2343661A1 (en) | Method and apparatus for improving the intelligibility of digitally compressed speech | |
WO2000031719A3 (en) | Speech coding with comfort noise variability feature for increased fidelity | |
DE60033132D1 (de) | Detektion von emotionen in sprachsignalen mittels analyse einer vielzahl von sprachsignalparametern | |
FI102884B1 (fi) | Menetelmä ja laitteisto hissin toimintojen analysoimiseksi | |
CA2117932A1 (en) | Soft Decision Speech Recognition | |
SG43428A1 (en) | Speech encoding method and apparatus | |
DE60220485D1 (de) | Verfahren und Vorrichtung zur Verschleierung von Rahmenausfall von prädiktionskodierter Sprache unter Verwendung von Extrapolation der Wellenform | |
ATE360249T1 (de) | Verfahren und vorrichtung zur bestimmung von sprachkodierparametern | |
GB2307582A (en) | System for recognizing spoken sounds from continuous speech and method of using same | |
AU2001277647A1 (en) | Method for noise robust classification in speech coding | |
EP1093112A3 (de) | Verfahren zur Erzeugung von Sprachmerkmalsignalen und Vorrichtung zu seiner Durchführung | |
EP0651521A3 (de) | Verfahren zur Unterscheidung zwischen Geräusch und Empfangssignalen | |
ES2139112T3 (es) | Reconocimiento del habla basado en hmms. | |
ATE282235T1 (de) | Robuste merkmale für die erkennung von verrauschten sprachsignalen | |
CA2315324A1 (en) | Speech signal decoding method and apparatus | |
GB2188763A (en) | Noise compensation in speech recognition | |
DE69400229D1 (de) | Verhinderung von Artefackten bei Sprachkodierern auf CELP-Basis | |
FR2815457B1 (fr) | Procede de codage de la prosodie pour un codeur de parole a tres bas debit |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |