ATE316678T1 - Speech recognition - Google Patents

Speech recognition

Info

Publication number
ATE316678T1
Authority
AT
Austria
Prior art keywords
speech
additional
feature
spectral values
generating
Application number
AT00972938T
Inventor
Ramalingam Hariharan
Juha Haekkinen
Imre Kiss
Jilei Tian
Olli Viikki
Original Assignee
Nokia Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1999-10-29
Filing date
2000-10-27
Publication date
2006-02-15
Application filed by Nokia Corp
Application granted
Publication of ATE316678T1

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/02: Feature extraction for speech recognition; Selection of recognition unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Telephonic Communication Services (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
  • Meter Arrangements (AREA)
  • Navigation (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Measuring Fluid Pressure (AREA)
  • Golf Clubs (AREA)
  • Inorganic Insulating Materials (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
AT00972938T (priority 1999-10-29, filed 2000-10-27): Speech recognition, ATE316678T1 (de)

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
FI992350A, FI19992350A (fi) | 1999-10-29 | 1999-10-29 | Improved speech recognition

Publications (1)

Publication Number | Publication Date
ATE316678T1 (de) | 2006-02-15

Family

ID=8555534

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
AT00972938T | Speech recognition, ATE316678T1 (de) | 1999-10-29 | 2000-10-27

Country Status (7)

Country | Link
US (1) | US6721698B1 (de)
EP (1) | EP1250699B1 (de)
AT (1) | ATE316678T1 (de)
AU (1) | AU1149601A (de)
DE (1) | DE60025748T2 (de)
FI (1) | FI19992350A (de)
WO (1) | WO2001031633A2 (de)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20030004720A1 (en) * | 2001-01-30 | 2003-01-02 | Harinath Garudadri | System and method for computing and transmitting parameters in a distributed voice recognition system
EP1253581B1 (de) * | 2001-04-27 | 2004-06-30 | CSEM Centre Suisse d'Electronique et de Microtechnique S.A. - Recherche et Développement | Method and device for speech enhancement in a noisy environment
US7941313B2 (en) * | 2001-05-17 | 2011-05-10 | Qualcomm Incorporated | System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system
US7203643B2 (en) * | 2001-06-14 | 2007-04-10 | Qualcomm Incorporated | Method and apparatus for transmitting speech activity in distributed voice recognition systems
CA2359544A1 (en) * | 2001-10-22 | 2003-04-22 | Dspfactory Ltd. | Low-resource real-time speech recognition system using an oversampled filterbank
JP2003143256A (ja) * | 2001-10-30 | 2003-05-16 | Nec Corp | Terminal device and communication control method
US7197456B2 (en) * | 2002-04-30 | 2007-03-27 | Nokia Corporation | On-line parametric histogram normalization for noise robust speech recognition
US7027979B2 (en) * | 2003-01-14 | 2006-04-11 | Motorola, Inc. | Method and apparatus for speech reconstruction within a distributed speech recognition system
US7672838B1 (en) * | 2003-12-01 | 2010-03-02 | The Trustees Of Columbia University In The City Of New York | Systems and methods for speech recognition using frequency domain linear prediction polynomials to form temporal and spectral envelopes from frequency domain representations of signals
US7813931B2 (en) * | 2005-04-20 | 2010-10-12 | QNX Software Systems, Co. | System for improving speech quality and intelligibility with bandwidth compression/expansion
US8249861B2 (en) * | 2005-04-20 | 2012-08-21 | Qnx Software Systems Limited | High frequency compression integration
US8086451B2 (en) | 2005-04-20 | 2011-12-27 | Qnx Software Systems Co. | System for improving speech intelligibility through high frequency compression
US7480641B2 (en) * | 2006-04-07 | 2009-01-20 | Nokia Corporation | Method, apparatus, mobile terminal and computer program product for providing efficient evaluation of feature transformation
US8355913B2 (en) * | 2006-11-03 | 2013-01-15 | Nokia Corporation | Speech recognition with adjustable timeout period
US7983916B2 (en) * | 2007-07-03 | 2011-07-19 | General Motors Llc | Sampling rate independent speech recognition
US8306817B2 (en) * | 2008-01-08 | 2012-11-06 | Microsoft Corporation | Speech recognition with non-linear noise reduction on Mel-frequency cepstra
KR20110006004A (ko) * | 2009-07-13 | 2011-01-20 | 삼성전자주식회사 | Apparatus and method for optimizing combined recognition units
US9406313B2 (en) * | 2014-03-21 | 2016-08-02 | Intel Corporation | Adaptive microphone sampling rate techniques
US10089989B2 (en) | 2015-12-07 | 2018-10-02 | Semiconductor Components Industries, Llc | Method and apparatus for a low power voice trigger device
EP3504708B1 (de) | 2016-09-09 | 2020-07-15 | Huawei Technologies Co., Ltd. | Device and method for classifying an acoustic environment
CN108369813B (zh) * | 2017-07-31 | 2022-10-25 | 深圳和而泰智能家居科技有限公司 | Specific sound recognition method, device and storage medium
US10431242B1 (en) * | 2017-11-02 | 2019-10-01 | Gopro, Inc. | Systems and methods for identifying speech based on spectral features
CN110288981B (zh) * | 2019-07-03 | 2020-11-06 | 百度在线网络技术(北京)有限公司 | Method and apparatus for processing audio data
CN113592003B (zh) * | 2021-08-04 | 2023-12-26 | 智道网联科技(北京)有限公司 | Picture transmission method, apparatus, device and storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JPH0535293A (ja) | 1991-08-01 | 1993-02-12 | Fujitsu Ltd | Method for setting the number of recognition candidates in a speech recognition device
WO1994022132A1 (en) | 1993-03-25 | 1994-09-29 | British Telecommunications Public Limited Company | A method and apparatus for speaker recognition
ZA948426B (en) | 1993-12-22 | 1995-06-30 | Qualcomm Inc | Distributed voice recognition system
US6370504B1 (en) * | 1997-05-29 | 2002-04-09 | University Of Washington | Speech recognition on MPEG/Audio encoded files
US6292776B1 (en) * | 1999-03-12 | 2001-09-18 | Lucent Technologies Inc. | Hierarchial subband linear predictive cepstral features for HMM-based speech recognition

Also Published As

Publication number | Publication date
EP1250699A2 (de) | 2002-10-23
FI19992350A (fi) | 2001-04-30
WO2001031633A8 (en) | 2004-04-22
AU1149601A (en) | 2001-05-08
WO2001031633A2 (en) | 2001-05-03
DE60025748D1 (de) | 2006-04-13
WO2001031633A3 (en) | 2002-08-15
US6721698B1 (en) | 2004-04-13
DE60025748T2 (de) | 2006-08-03
EP1250699B1 (de) | 2006-01-25

Similar Documents

Publication | Title
ATE316678T1 (de) | Speech recognition
DE69811921T2 (de) | Device and method for distinguishing similar-sounding words in speech recognition
BR0114086A (pt) | Electronic sound selection system
WO2003039100A3 (en) | Asynchronous access to synchronous voice services
WO2004100126A3 (en) | Method for statistical language modeling in speech recognition
ATE297588T1 (de) | Adaptation of the phonetic context to improve speech recognition
SE500277C2 (sv) | Device for improving speech comprehension when translating speech from a first language to a second language
AU2003235782A1 (en) | System and method for speech recognition by multi-pass recognition generating refined context specific grammars
DE60125542D1 (de) | System and method for speech recognition with a plurality of speech recognition devices
DE69813180T2 (de) | Context-dependent phoneme networks for encoding speech information
ES2146107T3 (es) | Method for reducing interference in a speech signal.
DE59102755D1 (de) | Tinnitus masking device.
ATE343198T1 (de) | Generation of a unified task-dependent language model using information retrieval methods
HUP0101288A2 (hu) | Noise reduction method in telecommunications for transmitting useful acoustic signals, in particular speech
BRPI0409327A (pt) | Device for generating an output audio signal based on an input audio signal, method for providing an output audio signal based on an input audio signal, and apparatus for supplying an output audio signal
CN108172210B (zh) | Method for generating singing harmony based on singing voice rhythm
KR950020396A (ko) | Speech recognition method using bio-signals
US7561709B2 (en) | Modulation depth enhancement for tone perception
SE9600959D0 (sv) | Method and device for speech-to-speech translation
TW345611B (en) | Earthquake discriminating inference apparatus and gas meter applying thereof
AU2003269366A1 (en) | Method and apparatus for generating audio components
RU2007111063A (ru) | Method for protecting confidential acoustic information and device for implementing it
EP1406468A3 (de) | Hearing aid or hearing aid system with a clock generator
ATE374990T1 (de) | Method for synthesizing speech
WO2000026901A3 (en) | Performing spoken recorded actions

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties