WO2002103675A8

WO2002103675A8 - Client-server based distributed speech recognition system architecture

Info

Publication number: WO2002103675A8
Application number: PCT/CN2001/001030
Authority: WO
Inventors: Qingwei Zhao; Xiangdong Zhang; Yonghong Yan; Baosheng Yuan
Original assignee: Intel Corp; Intel China Ltd; Qingwei Zhao; Xiangdong Zhang; Yonghong Yan; Baosheng Yuan
Priority date: 2001-06-19
Filing date: 2001-06-19
Publication date: 2005-09-22
Also published as: CN1545694A; WO2002103675A1; CN1223984C

Abstract

In general, the new client-server based Distributed Speech Recognition (DSR) system provides an effective method of recognizing speech made by a human at a client device and transmitted to a remote server over a network. The system distributes the speech recognition process between the client and the server so that a speaker-dependent language model may be utilized yielding higher accuracy as compared to the tradition DSR systems. Accordingly, the client device is configured to generate a phonetic word graph by performing acoustic recognition using an acoustic model that is trained by the same end-user whose speech is to be recognized. The resulting phonetic word graph is transmitted to the server which will handle the language processing and generate a recognized word sequence. When compared to a design that uses the traditional DSR, the new DSR method and system produces a word error rate that is at least 2-3 times lower, resulting in a higher accuracy recognition system.