WO2002103675A8 - Client-server based distributed speech recognition system architecture

Client-server based distributed speech recognition system architecture

Info

Publication number
WO2002103675A8
Authority
WO
WIPO (PCT)
Prior art keywords
client
dsr
server
speech recognition
recognition
Prior art date
Application number
PCT/CN2001/001030
Other languages
French (fr)
Other versions
WO2002103675A1 (en)
Inventor
Qingwei Zhao
Xiangdong Zhang
Yonghong Yan
Baosheng Yuan
Original Assignee
Intel Corp
Intel China Ltd
Qingwei Zhao
Xiangdong Zhang
Yonghong Yan
Baosheng Yuan
Priority date
2001-06-19 (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
2001-06-19
Publication date
2005-09-22
Application filed by Intel Corp, Intel China Ltd, Qingwei Zhao, Xiangdong Zhang, Yonghong Yan, Baosheng Yuan
Priority to CN01823555.7A (published as CN1223984C)
Priority to PCT/CN2001/001030 (published as WO2002103675A1)
Publication of WO2002103675A1
Publication of WO2002103675A8

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G10L15/30 Distributed recognition, e.g. in client-server systems, for mobile phones or network applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
  • Computer And Data Communications (AREA)

Abstract

In general, the new client-server based Distributed Speech Recognition (DSR) system provides an effective method of recognizing speech produced by a human at a client device and transmitted to a remote server over a network. The system distributes the speech recognition process between the client and the server so that a speaker-dependent language model may be utilized, yielding higher accuracy than traditional DSR systems. Accordingly, the client device is configured to generate a phonetic word graph by performing acoustic recognition using an acoustic model trained by the same end user whose speech is to be recognized. The resulting phonetic word graph is transmitted to the server, which handles the language processing and generates a recognized word sequence. Compared to a design that uses traditional DSR, the new DSR method and system produce a word error rate that is at least 2-3 times lower, resulting in a higher-accuracy recognition system.
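
The split of work described in the abstract can be illustrated with a small sketch. This is not the patent's implementation: the `Arc` graph format, the JSON wire format, the toy bigram table, the example words and scores, and every function name here are assumptions made for illustration only. The client stands in for acoustic recognition by emitting a small word graph with acoustic log-scores; the server adds language-model scores and returns the best-scoring word sequence.

```python
import json
import math
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class Arc:
    """One edge of a hypothetical word graph (assumes start < end)."""
    start: int
    end: int
    word: str
    acoustic_logp: float  # log-score from the client's acoustic model


def client_build_graph() -> str:
    """Stand-in for client-side acoustic recognition.

    A real client would derive these arcs from audio using a
    speaker-trained acoustic model; here they are hard-coded.
    """
    arcs = [
        Arc(0, 1, "recognize", -1.0),
        Arc(0, 1, "wreck a nice", -0.9),
        Arc(1, 2, "speech", -0.5),
        Arc(1, 2, "beach", -0.4),
    ]
    return json.dumps([asdict(a) for a in arcs])


# Toy bigram language model held by the server (illustrative values).
BIGRAM_LOGP = {
    ("<s>", "recognize"): math.log(0.6),
    ("<s>", "wreck a nice"): math.log(0.1),
    ("recognize", "speech"): math.log(0.7),
    ("recognize", "beach"): math.log(0.05),
    ("wreck a nice", "speech"): math.log(0.1),
    ("wreck a nice", "beach"): math.log(0.5),
}
UNSEEN_LOGP = math.log(1e-4)  # fallback score for unseen word pairs


def server_decode(payload: str, final_node: int) -> list[str]:
    """Combine acoustic and language-model scores; return the best path."""
    arcs = [Arc(**d) for d in json.loads(payload)]
    # best[(node, last_word)] = (total log-score, words so far)
    best = {(0, "<s>"): (0.0, [])}
    # Sorting by start node gives a valid processing order because
    # every arc is assumed to satisfy start < end.
    for arc in sorted(arcs, key=lambda a: a.start):
        for (node, prev), (score, words) in list(best.items()):
            if node != arc.start:
                continue
            lm = BIGRAM_LOGP.get((prev, arc.word), UNSEEN_LOGP)
            cand = (score + arc.acoustic_logp + lm, words + [arc.word])
            key = (arc.end, arc.word)
            if key not in best or cand[0] > best[key][0]:
                best[key] = cand
    finals = [v for (node, _), v in best.items() if node == final_node]
    return max(finals, key=lambda v: v[0])[1]


if __name__ == "__main__":
    print(server_decode(client_build_graph(), final_node=2))
```

In this toy graph the acoustic scores alone slightly favor "wreck a nice beach", while the combined score selects "recognize speech", which is the kind of correction the server-side language processing is meant to supply.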

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN01823555.7A CN1223984C (en) 2001-06-19 2001-06-19 Client-server based distributed speech recognition system
PCT/CN2001/001030 WO2002103675A1 (en) 2001-06-19 2001-06-19 Client-server based distributed speech recognition system architecture

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2001/001030 WO2002103675A1 (en) 2001-06-19 2001-06-19 Client-server based distributed speech recognition system architecture

Publications (2)

Publication Number Publication Date
WO2002103675A1 (en) 2002-12-27
WO2002103675A8 (en) 2005-09-22

Family

ID=4574816

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2001/001030 WO2002103675A1 (en) 2001-06-19 2001-06-19 Client-server based distributed speech recognition system architecture

Country Status (2)

Country Link
CN (1) CN1223984C (en)
WO (1) WO2002103675A1 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7533023B2 (en) * 2003-02-12 2009-05-12 Panasonic Corporation Intermediary speech processor in network environments transforming customized speech parameters
KR100622019B1 (en) 2004-12-08 2006-09-11 한국전자통신연구원 Voice interface system and method
GB0513820D0 (en) 2005-07-06 2005-08-10 Ibm Distributed voice recognition system and method
EP2851896A1 (en) 2013-09-19 2015-03-25 Maluuba Inc. Speech recognition using phoneme matching
CN103578467B (en) * 2013-10-18 2017-01-18 威盛电子股份有限公司 Acoustic model building method, speech recognition method and electronic device thereof
US9601108B2 (en) 2014-01-17 2017-03-21 Microsoft Technology Licensing, Llc Incorporating an exogenous large-vocabulary model into rule-based speech recognition
CN103956168A (en) * 2014-03-29 2014-07-30 深圳创维数字技术股份有限公司 Voice recognition method and device, and terminal
US10749989B2 (en) 2014-04-01 2020-08-18 Microsoft Technology Licensing Llc Hybrid client/server architecture for parallel processing
CN105609108A (en) * 2015-12-30 2016-05-25 生迪智慧科技有限公司 Distributed voice control method, system and wireless voice central controller
CN107068145B (en) * 2016-12-30 2019-02-15 中南大学 Voice evaluation method and system
US10971157B2 (en) 2017-01-11 2021-04-06 Nuance Communications, Inc. Methods and apparatus for hybrid speech recognition processing
JP2019124881A (en) * 2018-01-19 2019-07-25 トヨタ自動車株式会社 Speech recognition apparatus and speech recognition method
CN111916058B (en) * 2020-06-24 2024-08-16 西安交通大学 Speech recognition method and system based on incremental word graph rescoring
CN111883133B (en) * 2020-07-20 2023-08-29 深圳乐信软件技术有限公司 Customer service voice recognition method, device, server and storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU684872B2 (en) * 1994-03-10 1998-01-08 Cable And Wireless Plc Communication system
US5960399A (en) * 1996-12-24 1999-09-28 Gte Internetworking Incorporated Client/server speech processor/recognizer
US6456974B1 (en) * 1997-01-06 2002-09-24 Texas Instruments Incorporated System and method for adding speech recognition capabilities to java
DE19910234A1 (en) * 1999-03-09 2000-09-21 Philips Corp Intellectual Pty Method with multiple speech recognizers
CN1315721A (en) * 2000-03-23 2001-10-03 韦尔博泰克有限公司 Speech information transporting system and method for customer server

Also Published As

Publication number Publication date
CN1545694A (en) 2004-11-10
WO2002103675A1 (en) 2002-12-27
CN1223984C (en) 2005-10-19

Similar Documents

Publication Publication Date Title
US5960399A (en) Client/server speech processor/recognizer
WO2002103675A8 (en) Client-server based distributed speech recognition system architecture
Kenny et al. A linear predictive HMM for vector-valued observations with applications to speech recognition
US10079022B2 (en) Voice recognition terminal, voice recognition server, and voice recognition method for performing personalized voice recognition
CN105118501B (en) The method and system of speech recognition
US20020116196A1 (en) Speech recognizer
EP3092639B1 (en) A methodology for enhanced voice search experience
WO2003058603A3 (en) System and method for speech recognition by multi-pass recognition generating refined context specific grammars
WO2002054033A3 (en) Hierarchical language models for speech recognition
WO2005024780A3 (en) Methods and apparatus for providing services using speech recognition
EP1245023A4 (en) Distributed real time speech recognition system
CN101510424A (en) Method and system for encoding and synthesizing speech based on speech primitive
CN112309372B (en) Intent recognition method, device, equipment and storage medium based on intonation
AU2017428304B2 (en) Sound recognition apparatus
CN102237083A (en) Portable interpretation system based on WinCE platform and language recognition method thereof
CN109074809B (en) Information processing apparatus, information processing method, and computer-readable storage medium
EP2867890A1 (en) Meta-data inputs to front end processing for automatic speech recognition
CN118865942A (en) A low-latency real-time speech-to-text and text-to-speech transmission method
CN117041430B (en) Method and device for improving outbound quality and robustness of intelligent coordinated outbound system
Han et al. Towards distributed recognition of emotion from speech
Khaing et al. Myanmar continuous speech recognition system based on DTW and HMM
US20230386458A1 (en) Pre-wakeword speech processing
CN102314878A (en) Automatic phoneme splitting method
de Alencar et al. Transformations of LPC and LSF parameters to speech recognition features
Missaoui et al. Physiologically motivated feature extraction for robust automatic speech recognition

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase

Ref document number: 20018235557

Country of ref document: CN

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 EP: PCT application non-entry in European phase
NENP Non-entry into the national phase in:

Ref country code: JP

WR Later publication of a revised version of an international search report