DE60204374D1 - Spracherkennungsvorrichtung - Google Patents

Spracherkennungsvorrichtung

Info

Publication number
DE60204374D1
DE60204374D1 DE60204374T DE60204374T DE60204374D1 DE 60204374 D1 DE60204374 D1 DE 60204374D1 DE 60204374 T DE60204374 T DE 60204374T DE 60204374 T DE60204374 T DE 60204374T DE 60204374 D1 DE60204374 D1 DE 60204374D1
Authority
DE
Germany
Prior art keywords
voice recognition
recognition device
voice
recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60204374T
Other languages
English (en)
Other versions
DE60204374T2 (de
Inventor
Koichi Shinoda
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Publication of DE60204374D1 publication Critical patent/DE60204374D1/de
Application granted granted Critical
Publication of DE60204374T2 publication Critical patent/DE60204374T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)
  • Complex Calculations (AREA)
  • Machine Translation (AREA)
DE60204374T 2001-03-13 2002-03-11 Spracherkennungsvorrichtung Expired - Lifetime DE60204374T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2001070108A JP4336865B2 (ja) 2001-03-13 2001-03-13 音声認識装置
JP2001070108 2001-03-13

Publications (2)

Publication Number Publication Date
DE60204374D1 true DE60204374D1 (de) 2005-07-07
DE60204374T2 DE60204374T2 (de) 2006-03-16

Family

ID=18928034

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60204374T Expired - Lifetime DE60204374T2 (de) 2001-03-13 2002-03-11 Spracherkennungsvorrichtung

Country Status (4)

Country Link
US (1) US7437288B2 (de)
EP (1) EP1241661B1 (de)
JP (1) JP4336865B2 (de)
DE (1) DE60204374T2 (de)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7966187B1 (en) * 2001-02-15 2011-06-21 West Corporation Script compliance and quality assurance using speech recognition
JP4069715B2 (ja) * 2002-09-19 2008-04-02 セイコーエプソン株式会社 音響モデル作成方法および音声認識装置
JP4194433B2 (ja) * 2003-07-07 2008-12-10 キヤノン株式会社 尤度算出装置および方法
JP2005156593A (ja) * 2003-11-20 2005-06-16 Seiko Epson Corp 音響モデル作成方法、音響モデル作成装置、音響モデル作成プログラムおよび音声認識装置
JP4442211B2 (ja) * 2003-12-12 2010-03-31 セイコーエプソン株式会社 音響モデル作成方法
JP4510517B2 (ja) * 2004-05-26 2010-07-28 日本電信電話株式会社 音響モデル雑音適応化方法およびこの方法を実施する装置
US20060058999A1 (en) * 2004-09-10 2006-03-16 Simon Barker Voice model adaptation
KR100664960B1 (ko) 2005-10-06 2007-01-04 삼성전자주식회사 음성 인식 장치 및 방법
US20070088552A1 (en) * 2005-10-17 2007-04-19 Nokia Corporation Method and a device for speech recognition
CN100502463C (zh) * 2005-12-14 2009-06-17 浙江工业大学 一种交通流信息视频检测中的特征采集方法
JP2007233308A (ja) * 2006-03-03 2007-09-13 Mitsubishi Electric Corp 音声認識装置
US7680664B2 (en) * 2006-08-16 2010-03-16 Microsoft Corporation Parsimonious modeling by non-uniform kernel allocation
US9141860B2 (en) 2008-11-17 2015-09-22 Liveclips Llc Method and system for segmenting and transmitting on-demand live-action video in real-time
US8725510B2 (en) * 2009-07-09 2014-05-13 Sony Corporation HMM learning device and method, program, and recording medium
US20130283143A1 (en) * 2012-04-24 2013-10-24 Eric David Petajan System for Annotating Media Content for Automatic Content Understanding
US9367745B2 (en) 2012-04-24 2016-06-14 Liveclips Llc System for annotating media content for automatic content understanding
JP5997114B2 (ja) * 2013-08-14 2016-09-28 日本電信電話株式会社 雑音抑圧装置、雑音抑圧方法、およびプログラム
US10333857B1 (en) 2014-10-30 2019-06-25 Pearson Education, Inc. Systems and methods for data packet metadata stabilization
US9667321B2 (en) 2014-10-31 2017-05-30 Pearson Education, Inc. Predictive recommendation engine
US10110486B1 (en) 2014-10-30 2018-10-23 Pearson Education, Inc. Automatic determination of initial content difficulty
WO2016070124A1 (en) 2014-10-30 2016-05-06 Pearson Education, Inc. Content database generation
US10116563B1 (en) 2014-10-30 2018-10-30 Pearson Education, Inc. System and method for automatically updating data packet metadata
US10218630B2 (en) 2014-10-30 2019-02-26 Pearson Education, Inc. System and method for increasing data transmission rates through a content distribution network
US10318499B2 (en) 2014-10-30 2019-06-11 Pearson Education, Inc. Content database generation
US10735402B1 (en) 2014-10-30 2020-08-04 Pearson Education, Inc. Systems and method for automated data packet selection and delivery
US10614368B2 (en) 2015-08-28 2020-04-07 Pearson Education, Inc. System and method for content provisioning with dual recommendation engines
US10642848B2 (en) 2016-04-08 2020-05-05 Pearson Education, Inc. Personalized automatic content aggregation generation
US10043133B2 (en) 2016-04-08 2018-08-07 Pearson Education, Inc. Systems and methods of event-based content provisioning
US11188841B2 (en) 2016-04-08 2021-11-30 Pearson Education, Inc. Personalized content distribution
US10789316B2 (en) 2016-04-08 2020-09-29 Pearson Education, Inc. Personalized automatic content aggregation generation

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4903305A (en) * 1986-05-12 1990-02-20 Dragon Systems, Inc. Method for representing word models for use in speech recognition
US5243686A (en) * 1988-12-09 1993-09-07 Oki Electric Industry Co., Ltd. Multi-stage linear predictive analysis method for feature extraction from acoustic signals
US5263120A (en) * 1991-04-29 1993-11-16 Bickel Michael A Adaptive fast fuzzy clustering system
US5325445A (en) * 1992-05-29 1994-06-28 Eastman Kodak Company Feature classification using supervised statistical pattern recognition
JP2531073B2 (ja) 1993-01-14 1996-09-04 日本電気株式会社 音声認識システム
JP2751856B2 (ja) * 1995-02-03 1998-05-18 日本電気株式会社 木構造を用いたパターン適応化方式
JP3092491B2 (ja) * 1995-08-30 2000-09-25 日本電気株式会社 記述長最小基準を用いたパターン適応化方式
JP2852210B2 (ja) 1995-09-19 1999-01-27 株式会社エイ・ティ・アール音声翻訳通信研究所 不特定話者モデル作成装置及び音声認識装置
US5787394A (en) * 1995-12-13 1998-07-28 International Business Machines Corporation State-dependent speaker clustering for speaker adaptation
JP2982689B2 (ja) * 1996-04-19 1999-11-29 日本電気株式会社 情報量基準を用いた標準パターン作成方式
US5806030A (en) * 1996-05-06 1998-09-08 Matsushita Electric Ind Co Ltd Low complexity, high accuracy clustering method for speech recognizer
US6064958A (en) * 1996-09-20 2000-05-16 Nippon Telegraph And Telephone Corporation Pattern recognition scheme using probabilistic models based on mixtures distribution of discrete distribution
JPH10149192A (ja) 1996-09-20 1998-06-02 Nippon Telegr & Teleph Corp <Ntt> パターン認識方法、装置およびその記憶媒体
US5708759A (en) * 1996-11-19 1998-01-13 Kemeny; Emanuel S. Speech recognition using phoneme waveform parameters
JP3088357B2 (ja) 1997-09-08 2000-09-18 株式会社エイ・ティ・アール音声翻訳通信研究所 不特定話者音響モデル生成装置及び音声認識装置
JP3009640B2 (ja) 1997-09-10 2000-02-14 株式会社エイ・ティ・アール音声翻訳通信研究所 音響モデル生成装置及び音声認識装置
US5937385A (en) * 1997-10-20 1999-08-10 International Business Machines Corporation Method and apparatus for creating speech recognition grammars constrained by counter examples
JPH11143486A (ja) 1997-11-10 1999-05-28 Fuji Xerox Co Ltd 話者適応装置および方法
US6141641A (en) * 1998-04-15 2000-10-31 Microsoft Corporation Dynamically configurable acoustic model for speech recognition system
US6246982B1 (en) * 1999-01-26 2001-06-12 International Business Machines Corporation Method for measuring distance between collections of distributions

Also Published As

Publication number Publication date
JP4336865B2 (ja) 2009-09-30
US20020184020A1 (en) 2002-12-05
EP1241661B1 (de) 2005-06-01
JP2002268675A (ja) 2002-09-20
EP1241661A1 (de) 2002-09-18
DE60204374T2 (de) 2006-03-16
US7437288B2 (en) 2008-10-14

Similar Documents

Publication Publication Date Title
DE60204374D1 (de) Spracherkennungsvorrichtung
DE60323362D1 (de) Spracherkennungseinrichtung
DE60217444D1 (de) Sprachgesteuertes elektronisches Gerät
DE60123747D1 (de) Spracherkennungsbasiertes Untertitelungssystem
DE60201867D1 (de) Gerät zur Gesichtserkennung
DE60226620D1 (de) Anschlagpuffervorrichtung
DE60231151D1 (de) Silizium-mikrofon
DE602004021716D1 (de) Spracherkennungssystem
DE50214067D1 (de) Hinweisvorrichtung
FI19992351A (fi) Puheentunnistus
DE60233561D1 (de) Sprachantwortsystem
DE50114574D1 (de) Sprachgesteuerte Vorrichtung
DE60200809D1 (de) Verbindungsvorrichtung
DE60229137D1 (de) Schnittstellenvorrichtung
DE60114968D1 (de) Geräuschrobuste Spracherkennung
DE60206248D1 (de) Cogenerations-vorrichtung
DE60234819D1 (de) Sprachausgabevorrichtung
ITMI20020071A0 (it) Dispositivo di avviamento
DE60222717D1 (de) Zündpille-Anschlusseinrichtung
FI20010668A (fi) Paikannuslaite
DE60225215D1 (de) Verbesserte Spracherkennung
DE60142729D1 (de) Spracherkennungssystem
DE50213599D1 (de) Sauggerät
DE60222413D1 (de) Spracherkennung
DE60210915D1 (de) Text-zu-Sprache Umsetzung

Legal Events

Date Code Title Description
8364 No opposition during term of opposition