HK1076898A1 - System for predicting speech recognition accuracy and development for a dialog system - Google Patents
System for predicting speech recognition accuracy and development for a dialog systemInfo
- Publication number
- HK1076898A1 HK1076898A1 HK05111460.3A HK05111460A HK1076898A1 HK 1076898 A1 HK1076898 A1 HK 1076898A1 HK 05111460 A HK05111460 A HK 05111460A HK 1076898 A1 HK1076898 A1 HK 1076898A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- speech recognition
- recognition accuracy
- development
- dialog
- dialog system
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/01—Assessment or evaluation of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/193—Formal grammars, e.g. finite state automata, context free grammars or word networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2203/00—Aspects of automatic or semi-automatic exchanges
- H04M2203/35—Aspects of automatic or semi-automatic exchanges related to information services provided via a voice call
- H04M2203/355—Interactive dialogue design tools, features or methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4936—Speech interaction details
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Memory System Of A Hierarchy Structure (AREA)
- User Interface Of Digital Computer (AREA)
- Image Analysis (AREA)
- Paper (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2003900584A AU2003900584A0 (en) | 2003-02-11 | 2003-02-11 | System for predicting speech recognition accuracy and development for a dialog system |
PCT/AU2004/000156 WO2004072862A1 (en) | 2003-02-11 | 2004-02-11 | System for predicting speec recognition accuracy and development for a dialog system |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1076898A1 true HK1076898A1 (en) | 2006-01-27 |
Family
ID=30005284
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
HK05111460.3A HK1076898A1 (en) | 2003-02-11 | 2005-12-13 | System for predicting speech recognition accuracy and development for a dialog system |
Country Status (9)
Country | Link |
---|---|
US (1) | US7917363B2 (de) |
EP (1) | EP1593049B1 (de) |
AT (1) | ATE489680T1 (de) |
AU (1) | AU2003900584A0 (de) |
CA (1) | CA2515511C (de) |
DE (1) | DE602004030216D1 (de) |
HK (1) | HK1076898A1 (de) |
NZ (1) | NZ541471A (de) |
WO (1) | WO2004072862A1 (de) |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000078022A1 (en) | 1999-06-11 | 2000-12-21 | Telstra New Wave Pty Ltd | A method of developing an interactive system |
AU2002950336A0 (en) | 2002-07-24 | 2002-09-12 | Telstra New Wave Pty Ltd | System and process for developing a voice application |
AU2002951244A0 (en) * | 2002-09-06 | 2002-09-19 | Telstra New Wave Pty Ltd | A development system for a dialog system |
EP1567941A2 (de) | 2002-11-28 | 2005-08-31 | Koninklijke Philips Electronics N.V. | Verfahren zur zuordnung von wordklassifikationen |
US7505984B1 (en) * | 2002-12-09 | 2009-03-17 | Google Inc. | Systems and methods for information extraction |
AU2003900584A0 (en) | 2003-02-11 | 2003-02-27 | Telstra New Wave Pty Ltd | System for predicting speech recognition accuracy and development for a dialog system |
AU2003902020A0 (en) * | 2003-04-29 | 2003-05-15 | Telstra New Wave Pty Ltd | A process for grammatical inference |
US20070179784A1 (en) * | 2006-02-02 | 2007-08-02 | Queensland University Of Technology | Dynamic match lattice spotting for indexing speech content |
KR100717385B1 (ko) * | 2006-02-09 | 2007-05-11 | 삼성전자주식회사 | 인식 후보의 사전적 거리를 이용한 인식 신뢰도 측정 방법및 인식 신뢰도 측정 시스템 |
US7552047B2 (en) * | 2006-05-02 | 2009-06-23 | International Business Machines Corporation | Instance-based sentence boundary determination by optimization |
US8856002B2 (en) * | 2007-04-12 | 2014-10-07 | International Business Machines Corporation | Distance metrics for universal pattern processing tasks |
US8219407B1 (en) | 2007-12-27 | 2012-07-10 | Great Northern Research, LLC | Method for processing the output of a speech recognizer |
US8949122B2 (en) * | 2008-02-25 | 2015-02-03 | Nuance Communications, Inc. | Stored phrase reutilization when testing speech recognition |
DE102008062923A1 (de) * | 2008-12-23 | 2010-06-24 | Volkswagen Ag | Verfahren und Vorrichtung zur Erzeugung einer Trefferliste bei einer automatischen Spracherkennung |
US9659559B2 (en) * | 2009-06-25 | 2017-05-23 | Adacel Systems, Inc. | Phonetic distance measurement system and related methods |
EP2287835B1 (de) * | 2009-07-10 | 2012-05-23 | Deutsche Telekom AG | Automatisiertes Auswerten der Nutzbarkeit eines Sprachdialogsystems |
US8515734B2 (en) * | 2010-02-08 | 2013-08-20 | Adacel Systems, Inc. | Integrated language model, related systems and methods |
US10102860B2 (en) * | 2010-10-05 | 2018-10-16 | Infraware, Inc. | Common phrase identification and language dictation recognition systems and methods for using the same |
EP2619697A1 (de) * | 2011-01-31 | 2013-07-31 | Walter Rosenbaum | Verfahren und system zur informationserkennung |
US10019983B2 (en) | 2012-08-30 | 2018-07-10 | Aravind Ganapathiraju | Method and system for predicting speech recognition performance using accuracy scores |
US9513885B2 (en) | 2013-08-22 | 2016-12-06 | Peter Warren | Web application development platform with relationship modeling |
US9613619B2 (en) | 2013-10-30 | 2017-04-04 | Genesys Telecommunications Laboratories, Inc. | Predicting recognition quality of a phrase in automatic speech recognition systems |
US9384731B2 (en) | 2013-11-06 | 2016-07-05 | Microsoft Technology Licensing, Llc | Detecting speech input phrase confusion risk |
US20150179170A1 (en) * | 2013-12-20 | 2015-06-25 | Microsoft Corporation | Discriminative Policy Training for Dialog Systems |
US9508339B2 (en) | 2015-01-30 | 2016-11-29 | Microsoft Technology Licensing, Llc | Updating language understanding classifier models for a digital personal assistant based on crowd-sourcing |
CN108091328B (zh) * | 2017-11-20 | 2021-04-16 | 北京百度网讯科技有限公司 | 基于人工智能的语音识别纠错方法、装置及可读介质 |
CN108052499B (zh) * | 2017-11-20 | 2021-06-11 | 北京百度网讯科技有限公司 | 基于人工智能的文本纠错方法、装置及计算机可读介质 |
US10210861B1 (en) * | 2018-09-28 | 2019-02-19 | Apprente, Inc. | Conversational agent pipeline trained on synthetic data |
US10573296B1 (en) | 2018-12-10 | 2020-02-25 | Apprente Llc | Reconciliation between simulator and speech recognition output using sequence-to-sequence mapping |
JP2020160144A (ja) * | 2019-03-25 | 2020-10-01 | 株式会社Subaru | 音声認識装置 |
CN115223588B (zh) * | 2022-03-24 | 2024-08-13 | 华东师范大学 | 一种基于拼音距离和滑动窗口的儿童语音短语匹配方法 |
US11908476B1 (en) | 2023-09-21 | 2024-02-20 | Rabbit Inc. | System and method of facilitating human interactions with products and services over a network |
Family Cites Families (44)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH01102599A (ja) * | 1987-10-12 | 1989-04-20 | Internatl Business Mach Corp <Ibm> | 音声認識方法 |
US5241619A (en) * | 1991-06-25 | 1993-08-31 | Bolt Beranek And Newman Inc. | Word dependent N-best search method |
US5452397A (en) * | 1992-12-11 | 1995-09-19 | Texas Instruments Incorporated | Method and system for preventing entry of confusingly similar phases in a voice recognition system vocabulary list |
US5642519A (en) * | 1994-04-29 | 1997-06-24 | Sun Microsystems, Inc. | Speech interpreter with a unified grammer compiler |
CA2146890C (en) | 1994-06-03 | 2000-10-24 | At&T Corp. | Outline programming for developing communication services |
US5737723A (en) * | 1994-08-29 | 1998-04-07 | Lucent Technologies Inc. | Confusable word detection in speech recognition |
US6173261B1 (en) * | 1998-09-30 | 2001-01-09 | At&T Corp | Grammar fragment acquisition using syntactic and semantic clustering |
WO1998050907A1 (en) | 1997-05-06 | 1998-11-12 | Speechworks International, Inc. | System and method for developing interactive speech applications |
US5860063A (en) * | 1997-07-11 | 1999-01-12 | At&T Corp | Automated meaningful phrase clustering |
US6044347A (en) * | 1997-08-05 | 2000-03-28 | Lucent Technologies Inc. | Methods and apparatus object-oriented rule-based dialogue management |
US5995918A (en) | 1997-09-17 | 1999-11-30 | Unisys Corporation | System and method for creating a language grammar using a spreadsheet or table interface |
US5937385A (en) * | 1997-10-20 | 1999-08-10 | International Business Machines Corporation | Method and apparatus for creating speech recognition grammars constrained by counter examples |
US6016470A (en) * | 1997-11-12 | 2000-01-18 | Gte Internetworking Incorporated | Rejection grammar using selected phonemes for speech recognition system |
US6154722A (en) * | 1997-12-18 | 2000-11-28 | Apple Computer, Inc. | Method and apparatus for a speech recognition system language model that integrates a finite state grammar probability and an N-gram probability |
US6144938A (en) * | 1998-05-01 | 2000-11-07 | Sun Microsystems, Inc. | Voice user interface with personality |
US6411952B1 (en) * | 1998-06-24 | 2002-06-25 | Compaq Information Technologies Group, Lp | Method for learning character patterns to interactively control the scope of a web crawler |
US6269336B1 (en) | 1998-07-24 | 2001-07-31 | Motorola, Inc. | Voice browser for interactive services and methods thereof |
US6587822B2 (en) * | 1998-10-06 | 2003-07-01 | Lucent Technologies Inc. | Web-based platform for interactive voice response (IVR) |
US6321198B1 (en) * | 1999-02-23 | 2001-11-20 | Unisys Corporation | Apparatus for design and simulation of dialogue |
US6523016B1 (en) * | 1999-04-12 | 2003-02-18 | George Mason University | Learnable non-darwinian evolution |
US20050091057A1 (en) * | 1999-04-12 | 2005-04-28 | General Magic, Inc. | Voice application development methodology |
US6314402B1 (en) * | 1999-04-23 | 2001-11-06 | Nuance Communications | Method and apparatus for creating modifiable and combinable speech objects for acquiring information from a speaker in an interactive voice response system |
US6618697B1 (en) * | 1999-05-14 | 2003-09-09 | Justsystem Corporation | Method for rule-based correction of spelling and grammar errors |
US6604075B1 (en) * | 1999-05-20 | 2003-08-05 | Lucent Technologies Inc. | Web-based voice dialog interface |
WO2000078022A1 (en) | 1999-06-11 | 2000-12-21 | Telstra New Wave Pty Ltd | A method of developing an interactive system |
US6434521B1 (en) * | 1999-06-24 | 2002-08-13 | Speechworks International, Inc. | Automatically determining words for updating in a pronunciation dictionary in a speech recognition system |
US6510411B1 (en) * | 1999-10-29 | 2003-01-21 | Unisys Corporation | Task oriented dialog model and manager |
US6684183B1 (en) * | 1999-12-06 | 2004-01-27 | Comverse Ltd. | Generic natural language service creation environment |
US6847734B2 (en) * | 2000-01-28 | 2005-01-25 | Kabushiki Kaisha Toshiba | Word recognition method and storage medium that stores word recognition program |
ATE274204T1 (de) | 2000-10-31 | 2004-09-15 | Unisys Corp | Entwicklungswerkzeug für einen dialogflussinterpreter |
GB0028277D0 (en) * | 2000-11-20 | 2001-01-03 | Canon Kk | Speech processing system |
US20020087325A1 (en) * | 2000-12-29 | 2002-07-04 | Lee Victor Wai Leung | Dialogue application computer platform |
CN1156751C (zh) | 2001-02-02 | 2004-07-07 | 国际商业机器公司 | 用于自动生成语音xml文件的方法和系统 |
US6792408B2 (en) * | 2001-06-12 | 2004-09-14 | Dell Products L.P. | Interactive command recognition enhancement system and method |
AUPR579301A0 (en) | 2001-06-19 | 2001-07-12 | Syrinx Speech Systems Pty Limited | Neural network post-processor |
US20030007609A1 (en) * | 2001-07-03 | 2003-01-09 | Yuen Michael S. | Method and apparatus for development, deployment, and maintenance of a voice software application for distribution to one or more consumers |
US20030055651A1 (en) * | 2001-08-24 | 2003-03-20 | Pfeiffer Ralf I. | System, method and computer program product for extended element types to enhance operational characteristics in a voice portal |
US7013276B2 (en) * | 2001-10-05 | 2006-03-14 | Comverse, Inc. | Method of assessing degree of acoustic confusability, and system therefor |
US7181392B2 (en) * | 2002-07-16 | 2007-02-20 | International Business Machines Corporation | Determining speech recognition accuracy |
AU2002950336A0 (en) | 2002-07-24 | 2002-09-12 | Telstra New Wave Pty Ltd | System and process for developing a voice application |
AU2002951244A0 (en) * | 2002-09-06 | 2002-09-19 | Telstra New Wave Pty Ltd | A development system for a dialog system |
US8959019B2 (en) * | 2002-10-31 | 2015-02-17 | Promptu Systems Corporation | Efficient empirical determination, computation, and use of acoustic confusability measures |
AU2003900584A0 (en) | 2003-02-11 | 2003-02-27 | Telstra New Wave Pty Ltd | System for predicting speech recognition accuracy and development for a dialog system |
JP2010531492A (ja) * | 2007-06-25 | 2010-09-24 | グーグル・インコーポレーテッド | ワード確率決定 |
-
2003
- 2003-02-11 AU AU2003900584A patent/AU2003900584A0/en not_active Abandoned
-
2004
- 2004-02-11 CA CA2515511A patent/CA2515511C/en not_active Expired - Fee Related
- 2004-02-11 DE DE602004030216T patent/DE602004030216D1/de not_active Expired - Lifetime
- 2004-02-11 AT AT04709960T patent/ATE489680T1/de not_active IP Right Cessation
- 2004-02-11 US US10/545,762 patent/US7917363B2/en not_active Expired - Fee Related
- 2004-02-11 NZ NZ541471A patent/NZ541471A/en not_active IP Right Cessation
- 2004-02-11 EP EP04709960A patent/EP1593049B1/de not_active Expired - Lifetime
- 2004-02-11 WO PCT/AU2004/000156 patent/WO2004072862A1/en active Application Filing
-
2005
- 2005-12-13 HK HK05111460.3A patent/HK1076898A1/xx not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
NZ541471A (en) | 2009-11-27 |
AU2003900584A0 (en) | 2003-02-27 |
CA2515511C (en) | 2012-10-30 |
ATE489680T1 (de) | 2010-12-15 |
EP1593049B1 (de) | 2010-11-24 |
WO2004072862A1 (en) | 2004-08-26 |
CA2515511A1 (en) | 2004-08-26 |
US7917363B2 (en) | 2011-03-29 |
EP1593049A1 (de) | 2005-11-09 |
DE602004030216D1 (de) | 2011-01-05 |
EP1593049A4 (de) | 2009-04-22 |
US20060190252A1 (en) | 2006-08-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
HK1076898A1 (en) | System for predicting speech recognition accuracy and development for a dialog system | |
CN111971742B (zh) | 与语言无关的唤醒词检测的技术 | |
Wang et al. | A unified context-free grammar and n-gram model for spoken language processing | |
CN107679042B (zh) | 一种面向智能语音对话系统的多层级对话分析方法 | |
Echols | A role for stress in early speech segmentation | |
US8818801B2 (en) | Dialogue speech recognition system, dialogue speech recognition method, and recording medium for storing dialogue speech recognition program | |
KR101211796B1 (ko) | 외국어 학습 장치 및 그 제공 방법 | |
KR101590724B1 (ko) | 음성 인식 오류 수정 방법 및 이를 수행하는 장치 | |
US9135237B2 (en) | System and a method for generating semantically similar sentences for building a robust SLM | |
US11258671B1 (en) | Functionality management for devices | |
US20070192104A1 (en) | A system and method for providing large vocabulary speech processing based on fixed-point arithmetic | |
US10650306B1 (en) | User representation using a generative adversarial network | |
KR20120066530A (ko) | 언어 모델 가중치 추정 방법 및 이를 위한 장치 | |
ATE405920T1 (de) | Erzeugen einer spracherkennungsgrammatik für alphanumerische ausdrücke | |
WO2014085049A1 (en) | Speech transcription including written text | |
KR20190049260A (ko) | 차량의 음성인식 장치 및 방법 | |
Sennrich | Modelling and optimizing on syntactic n-grams for statistical machine translation | |
Henderson et al. | Mixture model POMDPs for efficient handling of uncertainty in dialogue management | |
Price et al. | Combining linguistic with statistical methods in modeling prosody | |
Kaljurand et al. | Controlled natural language in speech recognition based user interfaces | |
Weigelt et al. | Integrating a dialog component into a framework for spoken language understanding | |
JP6183147B2 (ja) | 情報処理装置、プログラム、及び方法 | |
US6128595A (en) | Method of determining a reliability measure | |
Korenevsky et al. | Unknown Words Modeling in Training and Using Language Models for Russian LVCSR System | |
Lhioui et al. | Towards a Hybrid Approach to Semantic Analysis of Spontaneous Arabic Speech. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PC | Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee) |
Effective date: 20170211 |
|
ARF | Application filed for restoration |
Effective date: 20170929 |
|
ARG | Restoration of standard patent granted |
Effective date: 20171031 |
|
PC | Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee) |
Effective date: 20180211 |