AU2002218274A1 - Robust voice recognition with data bank organisation - Google Patents

Robust voice recognition with data bank organisation

Info

Publication number
AU2002218274A1
Authority
AU
Australia
Prior art keywords
information
segment
database
voice signal
stored
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2002218274A
Inventor
Stefan Harbeck
Peter Plankensteiner
Klaus Schimmer
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
VOICECOM AG
Original Assignee
VOICECOM AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2000-11-03
Filing date
2001-10-31
Publication date
2002-05-15
Application filed by VOICECOM AG

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/08: Speech classification or search
    • G10L15/18: Speech classification or search using natural language modelling
    • G10L15/1822: Parsing for meaning understanding
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223: Execution procedure of a spoken command
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00: Speech recognition
    • G10L15/22: Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226: Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228: Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)
  • Image Analysis (AREA)

Abstract

A method for controlling an information system during the output of stored information segments via a signaling device (50a). Useful information is stored in a database (32), where it is available on request. A first voice signal (sa(t), sa(z)) specifies at least one information segment of that information as a first data segment (W1), which is either provided via a control output (20, 40, 50; 50a) or converted (50b) into a control signal for a technical device (G). The information is organized in the database such that, initially, only a limited first information area (32a) of the stored information is accessible (4, 4a, 4b) to the voice signal for selecting the specified information segment. A further information area (32b, 32c, 32d) of the database (32) is activated (59, 70, 4c, 4d) as a second information area if the information segment (W1) corresponding to a first voice signal segment (s1) of the first voice signal (sa(t)) is not contained in the first information area (32a). Accessing the database in this way yields robust word recognition, and the request is processed successfully within a short time.
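The staged access described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name find_segment, the example vocabularies, and the use of exact string matching in place of acoustic word recognition are all assumptions made for illustration. The sketch only shows the control flow in which a limited first information area is searched first and a further area is activated only when the segment is not found there.

```python
from typing import Optional, Sequence, Set, Tuple


def find_segment(recognized: str,
                 areas: Sequence[Set[str]]) -> Optional[Tuple[int, str]]:
    """Search the information areas in order.

    Only the first area is consulted initially; each further area is
    "activated" (searched) only if the segment was not found in the
    areas before it. Returns (area index, segment), or None on a miss.
    """
    for index, area in enumerate(areas):
        if recognized in area:
            return index, recognized
    return None


# Hypothetical vocabularies standing in for areas 32a and 32b.
first_area = {"mueller", "schmidt"}    # small, initially accessible area
second_area = {"harbeck", "schimmer"}  # activated only after a miss
areas = [first_area, second_area]

hit_first = find_segment("schmidt", areas)   # found without activating area 2
hit_second = find_segment("harbeck", areas)  # found after activating area 2
miss = find_segment("unknown", areas)        # contained in no area
```

Keeping the first area small is what the abstract credits for robustness: the recognizer initially has to discriminate between fewer candidates, and the larger areas are consulted only when that restricted search fails.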
AU2002218274A 2000-11-03 2001-10-31 Robust voice recognition with data bank organisation Abandoned AU2002218274A1 (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
DE10054413 2000-11-03
DE10107336 2001-02-16
PCT/EP2001/012632 WO2002037473A1 (en) 2000-11-03 2001-10-31 Robust voice recognition with data bank organisation

Publications (1)

Publication Number Publication Date
AU2002218274A1 (en) 2002-05-15

Family

ID=26007554

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2002218274A Abandoned AU2002218274A1 (en) 2000-11-03 2001-10-31 Robust voice recognition with data bank organisation

Country Status (6)

Country Link
US (1) US7587322B2 (en)
EP (1) EP1330817B1 (en)
AT (1) ATE300083T1 (en)
AU (1) AU2002218274A1 (en)
DE (2) DE50106815D1 (en)
WO (1) WO2002037473A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3827317B2 (en) * 2004-06-03 2006-09-27 任天堂株式会社 Command processing unit
DE102004056164A1 (en) * 2004-11-18 2006-05-24 Deutsche Telekom Ag Method for dialogue control and dialog system operating thereafter
US7499903B2 (en) * 2005-01-24 2009-03-03 Nevin James B Semantic to non-semantic routing for locating a live expert
US8583436B2 (en) * 2007-12-21 2013-11-12 Nec Corporation Word category estimation apparatus, word category estimation method, speech recognition apparatus, speech recognition method, program, and recording medium
US8751230B2 (en) * 2008-06-27 2014-06-10 Koninklijke Philips N.V. Method and device for generating vocabulary entry from acoustic data
JP2010066365A (en) * 2008-09-09 2010-03-25 Toshiba Corp Speech recognition apparatus, method, and program
US8332205B2 (en) * 2009-01-09 2012-12-11 Microsoft Corporation Mining transliterations for out-of-vocabulary query terms
US8358747B2 (en) 2009-11-10 2013-01-22 International Business Machines Corporation Real time automatic caller speech profiling
US9390708B1 (en) * 2013-05-28 2016-07-12 Amazon Technologies, Inc. Low latency and memory efficient keywork spotting
JP6655835B2 (en) * 2016-06-16 2020-02-26 パナソニックIpマネジメント株式会社 Dialogue processing method, dialogue processing system, and program

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3397372B2 (en) * 1993-06-16 2003-04-14 キヤノン株式会社 Speech recognition method and apparatus
NZ294296A (en) * 1994-10-25 1999-04-29 British Telecomm Speech recognition for voice operated telephone services includes comparison against stored lists of expected words
US6629069B1 (en) * 1998-07-21 2003-09-30 British Telecommunications A Public Limited Company Speech recognizer using database linking
US6499013B1 (en) 1998-09-09 2002-12-24 One Voice Technologies, Inc. Interactive user interface using speech recognition and natural language processing
JP2001005488A (en) * 1999-06-18 2001-01-12 Mitsubishi Electric Corp Voice interactive system
US7778816B2 (en) * 2001-04-24 2010-08-17 Microsoft Corporation Method and system for applying input mode bias

Also Published As

Publication number Publication date
DE10196793D2 (en) 2004-10-07
US7587322B2 (en) 2009-09-08
EP1330817B1 (en) 2005-07-20
US20040148167A1 (en) 2004-07-29
WO2002037473A1 (en) 2002-05-10
EP1330817A1 (en) 2003-07-30
DE50106815D1 (en) 2005-08-25
ATE300083T1 (en) 2005-08-15

Similar Documents

Publication Publication Date Title
TW347619B (en) A communication system and method using a speaker dependent time-scaling technique a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA).
AU2002218274A1 (en) Robust voice recognition with data bank organisation
EP1505504A3 (en) Remote copy system
TW368633B (en) Semiconductor memory
EP0735736A3 (en) Method for automatic speech recognition of arbitrary spoken words
EP2364067A3 (en) Method and apparatus for controlling a lighting system in response to an audio input
WO2002037246A3 (en) System and method for using location identity to control access to digital information
EP2254101A3 (en) System and method for retrieving information while commanding operation of an applicance
EP1107254A3 (en) Audio information reproducing apparatus, movable body, and audio information reproduction controlling system
WO2002029589A1 (en) Comparing device, data communication system, and data communication method
EP0817106A3 (en) Method and apparatus for caching file control information
DE69819690D1 (en) LANGUAGE RECOGNITION USING A COMMAND LIKE
EP0959401A3 (en) Audio control method and audio controlled device
WO2002039383A3 (en) Method and arrangement for embedding a watermark in an information signal
EP0734173A3 (en) Fixed rate transmission system with selection of compressor configuration and corresponding method
GB2402589A (en) System and method for dynamically generating a textual description for a visual data representation
EP1219930A3 (en) Data processing apparatus and method and data recording medium
EP0589219A3 (en) Method and system for non-specific data retrieval in a data processing system
KR880008172A (en) Data processing system with bus commands for another subsystem generated by one subsystem
CN1140859A (en) Data recording apparatus and method for semiconductor memory card
EP0875893A3 (en) Apparatus for processing a control command sequence as well as a method for generating a control command sequence, and a storage medium for storing a control command sequence
EP0768763A3 (en) Variable-length decoder using a memory
CA2300021A1 (en) Knocking activated device and method for operating an electromechanical device responsive to a control signal
EP0195091A3 (en) Method of controlling automatic drawing machine
KR970023231A (en) Variable transmission video-CD decoder device