AU2000276404A1 - Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system

Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system

Info

Publication number
AU2000276404A1
AU2000276404A1 AU2000276404A AU7640400A AU2000276404A1 AU 2000276404 A1 AU2000276404 A1 AU 2000276404A1 AU 2000276404 A AU2000276404 A AU 2000276404A AU 7640400 A AU7640400 A AU 7640400A AU 2000276404 A1 AU2000276404 A1 AU 2000276404A1
Authority
AU
Australia
Prior art keywords
lvcsr
building
speech recognition
compact model
continuous speech
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2000276404A
Inventor
Jielin Pan
Baosheng Yuan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intel Corp
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2000-09-30
Filing date
2000-09-30
Publication date
2002-04-15
Application filed by Intel Corp
Publication of AU2000276404A1
Current legal status: Abandoned

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/23 Clustering techniques
    • G06F18/232 Non-hierarchical techniques
    • G06F18/2321 Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213 Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0631 Creating reference templates; Clustering

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
AU2000276404A, priority date 2000-09-30, filing date 2000-09-30: Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system. Abandoned. Published as AU2000276404A1 (en).

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
PCT/CN2000/000306, WO2002029617A1 (en) | 2000-09-30 | 2000-09-30 | Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system

Publications (1)

Publication Number | Publication Date
AU2000276404A1 (en) | 2002-04-15

Family

ID=4574719

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
AU2000276404A (Abandoned; AU2000276404A1 (en)) | Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system | 2000-09-30 | 2000-09-30

Country Status (3)

Country | Link
US (1) | US7454341B1 (en)
AU (1) | AU2000276404A1 (en)
WO (1) | WO2002029617A1 (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US7930181B1 (en) | 2002-09-18 | 2011-04-19 | AT&T Intellectual Property II, L.P. | Low latency real-time speech transcription
US20070033044A1 (en) * | 2005-08-03 | 2007-02-08 | Texas Instruments, Incorporated | System and method for creating generalized tied-mixture hidden Markov models for automatic speech recognition
US7970613B2 (en) * | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation
US8019593B2 (en) * | 2006-06-30 | 2011-09-13 | Robert Bosch Corporation | Method and apparatus for generating features through logical and functional operations
US9678775B1 (en) * | 2008-04-09 | 2017-06-13 | Nvidia Corporation | Allocating memory for local variables of a multi-threaded program for execution in a single-threaded environment
US8776030B2 (en) * | 2008-04-09 | 2014-07-08 | Nvidia Corporation | Partitioning CUDA code for execution by a general purpose processor
US9785613B2 (en) * | 2011-12-19 | 2017-10-10 | Cypress Semiconductor Corporation | Acoustic processing unit interface for determining senone scores using a greater clock frequency than that corresponding to received audio
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis
WO2014123846A1 (en) | 2013-02-05 | 2014-08-14 | Fluidnet Corporation | Medical device management using associations
US10210156B2 (en) * | 2014-01-10 | 2019-02-19 | International Business Machines Corporation | Seed selection in corpora compaction for natural language processing
CN105913074B (en) * | 2016-04-05 | 2019-01-11 | 西安电子科技大学 | SAR image moving-target clustering method based on joint amplitude and radial velocity
JP6640896B2 (en) * | 2018-02-15 | 2020-02-05 | 株式会社東芝 | Data processing device, data processing method and program

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
JPH05188994A (en) * | 1992-01-07 | 1993-07-30 | Sony Corp | Noise suppression device
JP2522154B2 (en) * | 1993-06-03 | 1996-08-07 | 日本電気株式会社 | Voice recognition system
US5621859A (en) * | 1994-01-19 | 1997-04-15 | BBN Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer
JP3533696B2 (en) * | 1994-03-22 | 2004-05-31 | 三菱電機株式会社 | Speech recognition boundary estimation method and speech recognition device
CN1112269A (en) * | 1994-05-20 | 1995-11-22 | 北京超凡电子科技有限公司 | HMM speech recognition technique based on Chinese pronunciation characteristics
US5598505A (en) * | 1994-09-30 | 1997-01-28 | Apple Computer, Inc. | Cepstral correction vector quantizer for speech recognition
US5864810A (en) * | 1995-01-20 | 1999-01-26 | SRI International | Method and apparatus for speech recognition adapted to an individual speaker
US6070140A (en) * | 1995-06-05 | 2000-05-30 | Tran; Bao Q. | Speech recognizer
US5806034A (en) * | 1995-08-02 | 1998-09-08 | ITT Corporation | Speaker independent speech recognition method utilizing multiple training iterations
CN1061451C (en) * | 1996-09-26 | 2001-01-31 | 财团法人工业技术研究院 | Hidden Markov model Chinese word sound identifying method and apparatus thereof
GB2355834A (en) * | 1999-10-29 | 2001-05-02 | Nokia Mobile Phones Ltd | Speech recognition
US6526379B1 (en) * | 1999-11-29 | 2003-02-25 | Matsushita Electric Industrial Co., Ltd. | Discriminative clustering methods for automatic speech recognition

Also Published As

Publication number | Publication date
WO2002029617A1 (en) | 2002-04-11
US7454341B1 (en) | 2008-11-18

Similar Documents

Publication | Title
AU4065700A (en) | Method and apparatus for creating and editing grammars for speech recognition
AU2002243594A1 (en) | Method and apparatus for speech reconstruction in a distributed speech recognition system
AU2003295628A1 (en) | Method and apparatus for selective speech recognition
HK1074276A1 (en) | System and method for transmitting speech activity in a distributed voice recognition system
AU2169700A (en) | A method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system
AU2003271083A1 (en) | Language model creation/accumulation device, speech recognition device, language model creation method, and speech recognition method
EP1470548A4 (en) | System and method for speech recognition by multi-pass recognition using context specific grammars
AU2003295976A1 (en) | Method and apparatus for selective distributed speech recognition
AU2003293119A1 (en) | Method and apparatus for selective distributed speech recognition
AU2002353356A1 (en) | Method of operating a speech recognition system
AU2666399A (en) | Speech recognition apparatus and method for learning
AU2002222388A1 (en) | A method for activating context sensitive speech recognition in a terminal
AU2001245272A1 (en) | System and method for referencing object instances and invoking methods on those object instances from within speech recognition grammar
AU2002367354A1 (en) | Method and apparatus for multi-level distributed speech recognition
AU2002247043A1 (en) | System and method for computing and transmitting parameters in a distributed voice recognition system
AU2001269521A1 (en) | Speech recognition device and speech recognition method
AU2797199A (en) | Apparatus and method for providing speech input to a speech recognition system
EP0764319A4 (en) | Method, apparatus, and radio for optimizing hidden Markov model speech recognition
HK1090735A1 (en) | System and method for speech recognition utilizing a merged dictionary
AU2002364174A1 (en) | System and method for speech recognition and transcription
AU2000276402A1 (en) | Method, apparatus, and system for bottom-up tone integration to Chinese continuous speech recognition system
AU2002226922A1 (en) | Method and apparatus for speech recognition incorporating location information
AU2000276404A1 (en) | Method, apparatus, and system for building a compact model for large vocabulary continuous speech recognition (LVCSR) system
AU2003291397A1 (en) | Method and apparatus for coding gain information in a speech coding system
AU2000276400A1 (en) | Search method based on single triphone tree for large vocabulary continuous speech recognizer