WO2003096324A1 - Speech recognition device - Google Patents

Speech recognition device

Info

Publication number
WO2003096324A1
Authority
WO
WIPO (PCT)
Prior art keywords
speech
model
speech model
ram
recognition device
Prior art date
Application number
PCT/JP2003/005695
Other languages
English (en)
French (fr)
Inventor
Toshiyuki Miyazaki
Original Assignee
Asahi Kasei Kabushiki Kaisha
Priority date
2002-05-10
Filing date
2003-05-07
Publication date
2003-11-20
Application filed by Asahi Kasei Kabushiki Kaisha
Priority to AU2003235868A (AU2003235868A1)
Priority to JP2004508528A (JP4316494B2)
Priority to US10/513,753 (US7487091B2)
Priority to KR1020047018136A (KR100650473B1)
Priority to EP03723248A (EP1505573B1)
Priority to DE60323362T (DE60323362D1)
Publication of WO2003096324A1

Links

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19 Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193 Formal grammars, e.g. finite state automata, context free grammars or word networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/285 Memory allocation or algorithm optimisation to reduce hardware requirements
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142 Hidden Markov Models [HMMs]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
PCT/JP2003/005695 2002-05-10 2003-05-07 Speech recognition device WO2003096324A1 (fr)

Priority Applications (6)

Application Number Publication Number Priority Date Filing Date Title
AU2003235868A AU2003235868A1 (en) 2002-05-10 2003-05-07 Speech recognition device
JP2004508528A JP4316494B2 (ja) 2002-05-10 2003-05-07 Speech recognition device
US10/513,753 US7487091B2 (en) 2002-05-10 2003-05-07 Speech recognition device for recognizing a word sequence using a switching speech model network
KR1020047018136A KR100650473B1 (ko) 2002-05-10 2003-05-07 Speech recognition device
EP03723248A EP1505573B1 (en) 2002-05-10 2003-05-07 Speech recognition device
DE60323362T DE60323362D1 (de) 2002-05-10 2003-05-07 Speech recognition device

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2002135770 2002-05-10
JP2002-135770 2002-05-10

Publications (1)

Publication Number Publication Date
WO2003096324A1 (fr) 2003-11-20

Family

ID=29416761

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
PCT/JP2003/005695 WO2003096324A1 (fr) 2002-05-10 2003-05-07 Speech recognition device

Country Status (8)

Country Link
US (1) US7487091B2
EP (1) EP1505573B1
JP (1) JP4316494B2
KR (1) KR100650473B1
CN (1) CN1320520C
AU (1) AU2003235868A1
DE (1) DE60323362D1
WO (1) WO2003096324A1

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100814143B1 (ko) * 2003-10-03 2008-03-14 아사히 가세이 가부시키가이샤 Data processing device and data processing device control program

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3667332B2 (ja) * 2002-11-21 2005-07-06 松下電器産業株式会社 Standard model creation device and standard model creation method
US7865357B2 (en) * 2006-03-14 2011-01-04 Microsoft Corporation Shareable filler model for grammar authoring
PT2102619T (pt) * 2006-10-24 2017-05-25 Voiceage Corp Method and device for coding transition frames in speech signals
US8180641B2 (en) * 2008-09-29 2012-05-15 Microsoft Corporation Sequential speech recognition with two unequal ASR systems
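JP5692493B2 (ja) * 2009-02-05 2015-04-01 セイコーエプソン株式会社 Hidden Markov model creation program, information storage medium, hidden Markov model creation system, speech recognition system, and speech recognition method
KR20100136890A (ko) * 2009-06-19 2010-12-29 삼성전자주식회사 Context-based arithmetic encoding apparatus and method, and arithmetic decoding apparatus and method
EP2357647B1 (de) * 2010-01-11 2013-01-02 Svox AG Method for speech recognition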
US9001976B2 (en) * 2012-05-03 2015-04-07 Nexidia, Inc. Speaker adaptation
US9390708B1 (en) * 2013-05-28 2016-07-12 Amazon Technologies, Inc. Low latency and memory efficient keywork spotting
US9251806B2 (en) * 2013-09-05 2016-02-02 Intel Corporation Mobile phone with variable energy consuming speech recognition module
US9183830B2 (en) * 2013-11-01 2015-11-10 Google Inc. Method and system for non-parametric voice conversion
US9177549B2 (en) * 2013-11-01 2015-11-03 Google Inc. Method and system for cross-lingual voice conversion
US9542927B2 (en) 2014-11-13 2017-01-10 Google Inc. Method and system for building text-to-speech voice from diverse recordings
EP3280779A1 (en) * 2015-04-09 2018-02-14 Saudi Arabian Oil Company Encapsulated nanocompositions for increasing hydrocarbon recovery
US9792907B2 (en) 2015-11-24 2017-10-17 Intel IP Corporation Low resource key phrase detection for wake on voice
US9972313B2 (en) * 2016-03-01 2018-05-15 Intel Corporation Intermediate scoring and rejection loopback for improved key phrase detection
US10043521B2 (en) 2016-07-01 2018-08-07 Intel IP Corporation User defined key phrase detection by user dependent sequence modeling
US10083689B2 (en) * 2016-12-23 2018-09-25 Intel Corporation Linear scoring for low power wake on voice
CN110556103B (zh) * 2018-05-31 2023-05-30 阿里巴巴集团控股有限公司 Audio signal processing method, apparatus, system, device, and storage medium
US10714122B2 (en) 2018-06-06 2020-07-14 Intel Corporation Speech classification of audio for wake on voice
CN110875033A (zh) * 2018-09-04 2020-03-10 蔚来汽车有限公司 Method, apparatus, and computer storage medium for determining a speech end point
US10650807B2 (en) 2018-09-18 2020-05-12 Intel Corporation Method and system of neural network keyphrase detection
CN110364162B (zh) * 2018-11-15 2022-03-15 腾讯科技(深圳)有限公司 Artificial intelligence reset method and apparatus, and storage medium
KR20200063521A (ko) 2018-11-28 2020-06-05 삼성전자주식회사 Electronic device and control method thereof
US11127394B2 (en) 2019-03-29 2021-09-21 Intel Corporation Method and system of high accuracy keyphrase detection for low resource devices
KR20210001082A (ko) * 2019-06-26 2021-01-06 삼성전자주식회사 Electronic device for processing user utterances and operating method thereof
US11694685B2 (en) * 2020-12-10 2023-07-04 Google Llc Hotphrase triggering based on a sequence of detections
CN112786055A (zh) * 2020-12-25 2021-05-11 北京百度网讯科技有限公司 Resource mounting method, apparatus, device, storage medium, and computer program product

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH117292A (ja) * 1997-06-16 1999-01-12 Nec Corp Speech recognition device
JPH1115492A (ja) * 1997-06-24 1999-01-22 Mitsubishi Electric Corp Speech recognition device
JP2002297182A (ja) * 2001-03-29 2002-10-11 Sanyo Electric Co Ltd Speech recognition device and speech recognition method

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6417292A (en) * 1987-07-09 1989-01-20 Nec Corp Static memory circuit
US5920837A (en) * 1992-11-13 1999-07-06 Dragon Systems, Inc. Word recognition system which stores two models for some words and allows selective deletion of one such model
US6230128B1 (en) * 1993-03-31 2001-05-08 British Telecommunications Public Limited Company Path link passing speech recognition with vocabulary node being capable of simultaneously processing plural path links
JP2775140B2 (ja) * 1994-03-18 1998-07-16 株式会社エイ・ティ・アール人間情報通信研究所 Pattern recognition method, speech recognition method, and speech recognition device
US5842165A (en) * 1996-02-29 1998-11-24 Nynex Science & Technology, Inc. Methods and apparatus for generating and using garbage models for speaker dependent speech recognition purposes
US6076054A (en) * 1996-02-29 2000-06-13 Nynex Science & Technology, Inc. Methods and apparatus for generating and using out of vocabulary word models for speaker dependent speech recognition
CA2216224A1 (en) 1997-09-19 1999-03-19 Peter R. Stubley Block algorithm for pattern recognition
US6073095A (en) * 1997-10-15 2000-06-06 International Business Machines Corporation Fast vocabulary independent method and apparatus for spotting words in speech
US6061653A (en) * 1998-07-14 2000-05-09 Alcatel Usa Sourcing, L.P. Speech recognition system using shared speech models for multiple recognition processes
JP2000089782A (ja) 1998-09-17 2000-03-31 Kenwood Corp Speech recognition device and method, navigation system, and recording medium
FI116991B (fi) * 1999-01-18 2006-04-28 Nokia Corp Method for speech recognition, speech recognition device, and voice-controlled wireless communication device
US6526380B1 (en) 1999-03-26 2003-02-25 Koninklijke Philips Electronics N.V. Speech recognition system having parallel large vocabulary recognition engines
US6195639B1 (en) * 1999-05-14 2001-02-27 Telefonaktiebolaget Lm Ericsson (Publ) Matching algorithm for isolated speech recognition
JP4642953B2 (ja) 1999-09-09 2011-03-02 クラリオン株式会社 Voice search device and speech recognition navigation device
GB2364814A (en) * 2000-07-12 2002-02-06 Canon Kk Speech recognition
JP4116233B2 (ja) 2000-09-05 2008-07-09 パイオニア株式会社 Speech recognition device and method thereof
JP4283984B2 (ja) * 2000-10-12 2009-06-24 パイオニア株式会社 Speech recognition device and method
US6950796B2 (en) * 2001-11-05 2005-09-27 Motorola, Inc. Speech recognition by dynamical noise model adaptation
JP2003308091A (ja) * 2002-04-17 2003-10-31 Pioneer Electronic Corp Speech recognition device, speech recognition method, and speech recognition program

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
INOUE, TAKEDA, YAMAMOTO: "Garbage HMM o mochiita jiyu hatsuwabunchu no fuyogo shori hoho", THE TRANSACTIONS OF THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEER A, vol. J77-A, no. 2, 25 February 1994 (1994-02-25), pages 215 - 222, XP002971098 *
KONUMA, TAKEDA: "Garbage model to kobunteki kosoku o mochiita word spotting no kento", THE ACOUSTICAL SOCIETY OF JAPAN (ASJ) HEISEI 4 NENDO SHUKI KENKYU HYPPYO KOEN RONBUNSHU (2-1-17), October 1992 (1992-10-01), pages 111 - 112, XP002971097 *
See also references of EP1505573A4 *
TAKEDA, KONUMA: "Jiyu hatsuwabun rikai no tameno garbage HMM no riyo no kento", THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS GIJUTSU KENKYU HOKOKU (ONSEI)(SP92-127), vol. 92, no. 410, 19 January 1993 (1993-01-19), pages 33 - 40, XP002971096 *

Also Published As

Publication number Publication date
AU2003235868A1 (en) 2003-11-11
CN1320520C (zh) 2007-06-06
KR20040102224A (ko) 2004-12-03
KR100650473B1 (ko) 2006-11-29
EP1505573A1 (en) 2005-02-09
JP4316494B2 (ja) 2009-08-19
CN1653518A (zh) 2005-08-10
EP1505573A4 (en) 2005-07-13
US20050203737A1 (en) 2005-09-15
US7487091B2 (en) 2009-02-03
DE60323362D1 (de) 2008-10-16
JPWO2003096324A1 (ja) 2005-09-15
EP1505573B1 (en) 2008-09-03

Similar Documents

Publication Publication Date Title
WO2003096324A1 (fr) Speech recognition device
KR102335717B1 (ko) Voice control system and wake-up method thereof, wake-up device and household appliance, and coprocessor
CA2413100A1 (en) Improved method for upgrading firmware in an electronic device
EP3748631B1 (en) Low power integrated circuit to analyze a digitized audio stream
HK1054813A1 (en) Language independent voice-based user interface
AU2003271083A1 (en) Language model creation/accumulation device, speech recognition device, language model creation method, and speech recognition method
EP1197950A3 (en) Hierarchized dictionaries for speech recognition
WO2007051106A3 (en) Semantic processor for recognition of cause-effect relations in natural language documents
EP1355295A3 (en) Speech recognition apparatus, speech recognition method, and computer-readable recording medium in which speech recognition program is recorded
EP1134727A3 (en) Sound models for unknown words in speech recognition
EP0955628A3 (en) A method of and a device for speech recognition employing neural network and Markov model recognition techniques
EP1103951A3 (en) Adaptive wavelet extraction for speech recognition
WO2004036337A3 (en) Information extraction using an object based semantic network
EP1152326A3 (en) A technique for providing continuous speech recognition as an alternative input device to limited processing power devices
EP1538535A3 (en) Determination of meaning for text input in natural language understanding systems
EP1455484A3 (en) Integrating design, deployment, and management phases for systems
WO2003079196A3 (en) System and method of secure garbage collection on a mobile device
AUPR824501A0 (en) Methods and systems (npw003)
CA2373568A1 (en) Method of searching similar document, system for performing the same and program for processing the same
EP1193959A3 (en) Hierarchized dictionaries for speech recognition
WO2001084357A3 (en) Cluster and pruning-based language model compression
GB0522504D0 (en) A method and apparatus for distributed analyses of images
EP1054387A3 (en) Method and apparatus for activating voice controlled devices
WO2002029720A1 (fr) Device and method for verifying a fingerprint
WO2001091105A3 (en) Wireless voice recognition data retrieval system and method

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2004508528

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2003723248

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 10513753

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 20038105667

Country of ref document: CN

Ref document number: 1020047018136

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 1020047018136

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2003723248

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 2003723248

Country of ref document: EP