AU2002218274A1 - Robust voice recognition with data bank organisation - Google Patents
Robust voice recognition with data bank organisation
- Publication number
- AU2002218274A1
- Authority
- AU
- Australia
- Prior art keywords
- information
- segment
- database
- voice signal
- stored
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
- Image Analysis (AREA)
Abstract
A method for controlling an information system during the output of stored information segments via a signaling device (50a). Useful information is stored in a database (32) and can be requested from it. At least one information segment of that information is specified as a first data segment (W1) by a first voice signal (sa(t), sa(z)) and is either provided via a control output (20, 40, 50; 50a) or converted (50b) into a control signal for a technical device (G). The information in the database is organized so that the voice signal initially has access (4, 4a, 4b) only to a limited first information area (32a) of the stored information, from which the specified information segment is selected. If the information segment (W1) corresponding to a first voice signal segment (s1) of the first voice signal (sa(t)) is not contained in the first information area (32a), a further information area (32b, 32c, 32d) of the database (32) is activated (59, 70, 4c, 4d) as a second information area. Accessing the database in this way yields robust word recognition, and the request is processed successfully within a short time.
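The staged access described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the voice signal segment has already been turned into a candidate word string, it models each information area as a plain set of words, and the function name `lookup_segment`, the `AREAS` list, and the example vocabulary are all hypothetical. Only the area labels (32a, 32b, 32c) come from the abstract.

```python
from typing import Optional, Sequence, Set, Tuple

# Hypothetical model: each information area is a (label, vocabulary) pair.
# The labels follow the abstract's reference numerals; the words are invented.
AREAS: Sequence[Tuple[str, Set[str]]] = [
    ("32a", {"berlin", "hamburg"}),   # initially accessible, limited area
    ("32b", {"potsdam", "bremen"}),   # activated only after a miss in 32a
    ("32c", {"husum"}),               # activated only after a miss in 32b
]


def lookup_segment(
    word: str, areas: Sequence[Tuple[str, Set[str]]]
) -> Optional[Tuple[str, str]]:
    """Search the areas in order and stop at the first one containing the word.

    A later area is consulted only when every earlier area failed to contain
    the word, mirroring the activation of a further information area.
    Returns (area_label, word), or None if no area contains the word.
    """
    for label, vocabulary in areas:
        if word in vocabulary:
            return label, word
    return None


if __name__ == "__main__":
    print(lookup_segment("berlin", AREAS))  # found in the first area
    print(lookup_segment("husum", AREAS))   # found only after two fallbacks
    print(lookup_segment("paris", AREAS))   # not stored in any area
```

Keeping the first area small is what the abstract credits for robustness and speed: most requests are resolved against a limited vocabulary, and the larger areas are searched only when needed.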
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10054413 | 2000-11-03 | | |
DE10054413 | 2000-11-03 | | |
DE10107336 | 2001-02-16 | | |
DE10107336 | 2001-02-16 | | |
PCT/EP2001/012632 WO2002037473A1 (en) | 2000-11-03 | 2001-10-31 | Robust voice recognition with data bank organisation |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2002218274A1 (en) | 2002-05-15 |
Family
ID=26007554
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2002218274A (Abandoned; published as AU2002218274A1 (en)) | Robust voice recognition with data bank organisation | 2000-11-03 | 2001-10-31 |
Country Status (6)
Country | Link |
---|---|
US (1) | US7587322B2 (en) |
EP (1) | EP1330817B1 (en) |
AT (1) | ATE300083T1 (en) |
AU (1) | AU2002218274A1 (en) |
DE (2) | DE50106815D1 (en) |
WO (1) | WO2002037473A1 (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3827317B2 (en) * | 2004-06-03 | 2006-09-27 | 任天堂株式会社 | Command processing unit |
DE102004056164A1 (en) * | 2004-11-18 | 2006-05-24 | Deutsche Telekom Ag | Method for dialogue control and dialog system operating thereafter |
US7499903B2 (en) * | 2005-01-24 | 2009-03-03 | Nevin James B | Semantic to non-semantic routing for locating a live expert |
US8583436B2 (en) * | 2007-12-21 | 2013-11-12 | Nec Corporation | Word category estimation apparatus, word category estimation method, speech recognition apparatus, speech recognition method, program, and recording medium |
US8751230B2 (en) * | 2008-06-27 | 2014-06-10 | Koninklijke Philips N.V. | Method and device for generating vocabulary entry from acoustic data |
JP2010066365A (en) * | 2008-09-09 | 2010-03-25 | Toshiba Corp | Speech recognition apparatus, method, and program |
US8332205B2 (en) * | 2009-01-09 | 2012-12-11 | Microsoft Corporation | Mining transliterations for out-of-vocabulary query terms |
US8358747B2 (en) | 2009-11-10 | 2013-01-22 | International Business Machines Corporation | Real time automatic caller speech profiling |
US9390708B1 (en) * | 2013-05-28 | 2016-07-12 | Amazon Technologies, Inc. | Low latency and memory efficient keywork spotting |
JP6655835B2 (en) * | 2016-06-16 | 2020-02-26 | パナソニックIpマネジメント株式会社 | Dialogue processing method, dialogue processing system, and program |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3397372B2 (en) * | 1993-06-16 | 2003-04-14 | キヤノン株式会社 | Speech recognition method and apparatus |
NZ294296A (en) * | 1994-10-25 | 1999-04-29 | British Telecomm | Speech recognition for voice operated telephone services includes comparison against stored lists of expected words |
US6629069B1 (en) * | 1998-07-21 | 2003-09-30 | British Telecommunications A Public Limited Company | Speech recognizer using database linking |
US6499013B1 (en) | 1998-09-09 | 2002-12-24 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
JP2001005488A (en) * | 1999-06-18 | 2001-01-12 | Mitsubishi Electric Corp | Voice interactive system |
US7778816B2 (en) * | 2001-04-24 | 2010-08-17 | Microsoft Corporation | Method and system for applying input mode bias |
-
2001
- 2001-10-31 AU AU2002218274A patent/AU2002218274A1/en not_active Abandoned
- 2001-10-31 WO PCT/EP2001/012632 patent/WO2002037473A1/en active IP Right Grant
- 2001-10-31 DE DE50106815T patent/DE50106815D1/en not_active Expired - Lifetime
- 2001-10-31 DE DE10196793T patent/DE10196793D2/en not_active Expired - Lifetime
- 2001-10-31 EP EP01992999A patent/EP1330817B1/en not_active Expired - Lifetime
- 2001-10-31 AT AT01992999T patent/ATE300083T1/en not_active IP Right Cessation
- 2001-10-31 US US10/415,709 patent/US7587322B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
DE10196793D2 (en) | 2004-10-07 |
US7587322B2 (en) | 2009-09-08 |
EP1330817B1 (en) | 2005-07-20 |
US20040148167A1 (en) | 2004-07-29 |
WO2002037473A1 (en) | 2002-05-10 |
EP1330817A1 (en) | 2003-07-30 |
DE50106815D1 (en) | 2005-08-25 |
ATE300083T1 (en) | 2005-08-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TW347619B (en) | A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA). | |
AU2002218274A1 (en) | Robust voice recognition with data bank organisation | |
EP1505504A3 (en) | Remote copy system | |
TW368633B (en) | Semiconductor memory | |
EP0735736A3 (en) | Method for automatic speech recognition of arbitrary spoken words | |
EP2364067A3 (en) | Method and apparatus for controlling a lighting system in response to an audio input | |
WO2002037246A3 (en) | System and method for using location identity to control access to digital information | |
EP2254101A3 (en) | System and method for retrieving information while commanding operation of an appliance | |
EP1107254A3 (en) | Audio information reproducing apparatus, movable body, and audio information reproduction controlling system | |
WO2002029589A1 (en) | Comparing device, data communication system, and data communication method | |
EP0817106A3 (en) | Method and apparatus for caching file control information | |
DE69819690D1 (en) | LANGUAGE RECOGNITION USING A COMMAND LIKE | |
EP0959401A3 (en) | Audio control method and audio controlled device | |
WO2002039383A3 (en) | Method and arrangement for embedding a watermark in an information signal | |
EP0734173A3 (en) | Fixed rate transmission system with selection of compressor configuration and corresponding method | |
GB2402589A (en) | System and method for dynamically generating a textual description for a visual data representation | |
EP1219930A3 (en) | Data processing apparatus and method and data recording medium | |
EP0589219A3 (en) | Method and system for non-specific data retrieval in a data processing system | |
KR880008172A (en) | Data processing system with bus commands for another subsystem generated by one subsystem | |
CN1140859A (en) | Data recording apparatus and method for semiconductor memory card | |
EP0875893A3 (en) | Apparatus for processing a control command sequence as well as a method for generating a control command sequence, and a storage medium for storing a control command sequence | |
EP0768763A3 (en) | Variable-length decoder using a memory | |
CA2300021A1 (en) | Knocking activated device and method for operating an electromechanical device responsive to a control signal | |
EP0195091A3 (en) | Method of controlling automatic drawing machine | |
KR970023231A (en) | Variable transmission video-CD decoder device |