JPH05197394A - 音声信号のワードシーケンス認識方法および装置 - Google Patents
音声信号のワードシーケンス認識方法および装置Info
- Publication number
- JPH05197394A JPH05197394A JP4244874A JP24487492A JPH05197394A JP H05197394 A JPH05197394 A JP H05197394A JP 4244874 A JP4244874 A JP 4244874A JP 24487492 A JP24487492 A JP 24487492A JP H05197394 A JPH05197394 A JP H05197394A
- Authority
- JP
- Japan
- Prior art keywords
- word
- score
- signal
- test signal
- list
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 27
- 230000005236 sound signal Effects 0.000 title claims description 7
- 238000012360 testing method Methods 0.000 claims description 51
- 238000005070 sampling Methods 0.000 claims description 4
- 230000000694 effects Effects 0.000 abstract description 4
- 230000001427 coherent effect Effects 0.000 abstract description 2
- 238000012937 correction Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 241000270666 Testudines Species 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000002250 progressing effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE4130632A DE4130632A1 (de) | 1991-09-14 | 1991-09-14 | Verfahren zum erkennen der gesprochenen woerter in einem sprachsignal |
DE4130632:5 | 1991-09-14 |
Publications (1)
Publication Number | Publication Date |
---|---|
JPH05197394A true JPH05197394A (ja) | 1993-08-06 |
Family
ID=6440626
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP4244874A Pending JPH05197394A (ja) | 1991-09-14 | 1992-09-14 | 音声信号のワードシーケンス認識方法および装置 |
Country Status (4)
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1991019352A1 (en) * | 1990-05-25 | 1991-12-12 | Toyo Communication Equipment Co., Ltd. | Ultra thin quartz crystal filter element of multiple mode |
Families Citing this family (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE4130631A1 (de) * | 1991-09-14 | 1993-03-18 | Philips Patentverwaltung | Verfahren zum erkennen der gesprochenen woerter in einem sprachsignal |
US6101468A (en) * | 1992-11-13 | 2000-08-08 | Dragon Systems, Inc. | Apparatuses and methods for training and operating speech recognition systems |
DE69607913T2 (de) * | 1995-05-03 | 2000-10-05 | Koninklijke Philips Electronics N.V., Eindhoven | Verfahren und vorrichtung zur spracherkennung auf der basis neuer wortmodelle |
DE19516106C2 (de) * | 1995-05-05 | 2003-04-03 | Philips Corp Intellectual Pty | Verfahren zum Bestimmen von Referenzwerten |
DE19516099C2 (de) * | 1995-05-05 | 2003-07-03 | Philips Intellectual Property | Verfahren zum Bestimmen von Sprachmodellwerten |
US5903864A (en) * | 1995-08-30 | 1999-05-11 | Dragon Systems | Speech recognition |
DE19533541C1 (de) * | 1995-09-11 | 1997-03-27 | Daimler Benz Aerospace Ag | Verfahren zur automatischen Steuerung eines oder mehrerer Geräte durch Sprachkommandos oder per Sprachdialog im Echtzeitbetrieb und Vorrichtung zum Ausführen des Verfahrens |
JP3535292B2 (ja) * | 1995-12-27 | 2004-06-07 | Kddi株式会社 | 音声認識システム |
US5835888A (en) * | 1996-06-10 | 1998-11-10 | International Business Machines Corporation | Statistical language model for inflected languages |
US6119114A (en) * | 1996-09-17 | 2000-09-12 | Smadja; Frank | Method and apparatus for dynamic relevance ranking |
US6173298B1 (en) | 1996-09-17 | 2001-01-09 | Asap, Ltd. | Method and apparatus for implementing a dynamic collocation dictionary |
US6374219B1 (en) | 1997-09-19 | 2002-04-16 | Microsoft Corporation | System for using silence in speech recognition |
US6321226B1 (en) * | 1998-06-30 | 2001-11-20 | Microsoft Corporation | Flexible keyboard searching |
US6856956B2 (en) * | 2000-07-20 | 2005-02-15 | Microsoft Corporation | Method and apparatus for generating and displaying N-best alternatives in a speech recognition system |
DE10220522B4 (de) * | 2002-05-08 | 2005-11-17 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse |
DE10220524B4 (de) | 2002-05-08 | 2006-08-10 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache |
EP1361740A1 (de) * | 2002-05-08 | 2003-11-12 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachinformationen eines Dialogs |
DE10220521B4 (de) * | 2002-05-08 | 2005-11-24 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen |
EP1363271A1 (de) | 2002-05-08 | 2003-11-19 | Sap Ag | Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs |
US7912716B2 (en) * | 2005-10-06 | 2011-03-22 | Sony Online Entertainment Llc | Generating words and names using N-grams of phonemes |
US20070118873A1 (en) * | 2005-11-09 | 2007-05-24 | Bbnt Solutions Llc | Methods and apparatus for merging media content |
US20070106685A1 (en) * | 2005-11-09 | 2007-05-10 | Podzinger Corp. | Method and apparatus for updating speech recognition databases and reindexing audio and video content using the same |
US20070106646A1 (en) * | 2005-11-09 | 2007-05-10 | Bbnt Solutions Llc | User-directed navigation of multimedia search results |
US9697230B2 (en) | 2005-11-09 | 2017-07-04 | Cxense Asa | Methods and apparatus for dynamic presentation of advertising, factual, and informational content using enhanced metadata in search-driven media applications |
US9697231B2 (en) * | 2005-11-09 | 2017-07-04 | Cxense Asa | Methods and apparatus for providing virtual media channels based on media search |
US7801910B2 (en) * | 2005-11-09 | 2010-09-21 | Ramp Holdings, Inc. | Method and apparatus for timed tagging of media content |
US8312022B2 (en) * | 2008-03-21 | 2012-11-13 | Ramp Holdings, Inc. | Search engine optimization |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3215868A1 (de) * | 1982-04-29 | 1983-11-03 | Philips Patentverwaltung Gmbh, 2000 Hamburg | Verfahren und anordnung zum erkennen der woerter in einer zusammenhaengenden wortkette |
US4903305A (en) * | 1986-05-12 | 1990-02-20 | Dragon Systems, Inc. | Method for representing word models for use in speech recognition |
DE3710507A1 (de) * | 1987-03-30 | 1988-10-20 | Philips Patentverwaltung | Verfahren zum erkennen kontinuierlich gesprochener woerter |
US4803729A (en) * | 1987-04-03 | 1989-02-07 | Dragon Systems, Inc. | Speech recognition method |
DE3723078A1 (de) * | 1987-07-11 | 1989-01-19 | Philips Patentverwaltung | Verfahren zur erkennung von zusammenhaengend gesprochenen woertern |
DE3930889A1 (de) * | 1989-09-15 | 1991-03-28 | Philips Patentverwaltung | Verfahren zur erkennung von n unterschiedlichen wortketten in einem sprachsignal |
US5208897A (en) * | 1990-08-21 | 1993-05-04 | Emerson & Stern Associates, Inc. | Method and apparatus for speech recognition based on subsyllable spellings |
-
1991
- 1991-09-14 DE DE4130632A patent/DE4130632A1/de not_active Withdrawn
-
1992
- 1992-09-11 EP EP92202784A patent/EP0533261A2/de not_active Ceased
- 1992-09-14 JP JP4244874A patent/JPH05197394A/ja active Pending
-
1994
- 1994-09-26 US US08/312,495 patent/US5613034A/en not_active Expired - Fee Related
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1991019352A1 (en) * | 1990-05-25 | 1991-12-12 | Toyo Communication Equipment Co., Ltd. | Ultra thin quartz crystal filter element of multiple mode |
Also Published As
Publication number | Publication date |
---|---|
EP0533261A2 (de) | 1993-03-24 |
US5613034A (en) | 1997-03-18 |
DE4130632A1 (de) | 1993-03-18 |
EP0533261A3 (GUID-C5D7CC26-194C-43D0-91A1-9AE8C70A9BFF.html) | 1994-03-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JPH05197394A (ja) | 音声信号のワードシーケンス認識方法および装置 | |
KR100512662B1 (ko) | 음성검출의조기종료와신뢰성있는바지-인에유용한연속적인음성인식에서의워드카운팅방법및장치 | |
US5719997A (en) | Large vocabulary connected speech recognition system and method of language representation using evolutional grammer to represent context free grammars | |
US6266634B1 (en) | Method and apparatus for generating deterministic approximate weighted finite-state automata | |
US5995930A (en) | Method and apparatus for recognizing spoken words in a speech signal by organizing the vocabulary in the form of a tree | |
US5634083A (en) | Method of and device for determining words in a speech signal | |
US20090112587A1 (en) | System and method for generating a phrase pronunciation | |
JP3747171B2 (ja) | 音声処理システム | |
US9858038B2 (en) | Correction menu enrichment with alternate choices and generation of choice lists in multi-pass recognition systems | |
EP2317507B1 (en) | Corpus compilation for language model generation | |
Schwartz et al. | Efficient, high-performance algorithms for n-best search | |
US20030009331A1 (en) | Grammars for speech recognition | |
JPH10105189A (ja) | シーケンス取出し方法及びその装置 | |
US8682668B2 (en) | Language model score look-ahead value imparting device, language model score look-ahead value imparting method, and program storage medium | |
US5909665A (en) | Speech recognition system | |
JPH05197393A (ja) | 音声信号のワードシーケンス認識方法および装置 | |
JP2003015686A (ja) | 音声対話装置、音声対話方法及び音声対話処理プログラム | |
JP2003208195A (ja) | 連続音声認識装置および連続音声認識方法、連続音声認識プログラム、並びに、プログラム記録媒体 | |
CN1103986C (zh) | 字序列的识别方法 | |
JPH08248980A (ja) | 音声認識装置 | |
JP2000056795A (ja) | 音声認識装置 | |
JP3440840B2 (ja) | 音声認識方法及びその装置 | |
JP2000276482A (ja) | 文書検索装置及び文書検索方法 | |
JP3484077B2 (ja) | 音声認識装置 | |
JPH1145097A (ja) | 連続音声認識方式 |