CN1120372A - Speech processing - Google Patents
Speech processing Download PDFInfo
- Publication number
- CN1120372A CN1120372A CN94191652A CN94191652A CN1120372A CN 1120372 A CN1120372 A CN 1120372A CN 94191652 A CN94191652 A CN 94191652A CN 94191652 A CN94191652 A CN 94191652A CN 1120372 A CN1120372 A CN 1120372A
- Authority
- CN
- China
- Prior art keywords
- node
- path link
- path
- model
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
Abstract
Description
Claims (19)
- The speech recognition systems that 1 one kinds of path link are transmitted, voice are read by the company that is used to discern input, and this recognition system comprises: the device that is used for deriving from an input speech signal recognition feature data; Treating apparatus, voice are imported in the expectation that is used for input voice component model that will expectation and is used for comparison recognition feature data and component model, and this treating apparatus has a plurality of lexical nodes related with the word list representation model; And the device that is used for indicating the identification of input speech signal according to comparative result; It is characterized in that at least one lexical node can handle the path link of one or more simultaneously.
- The 2 a kind of speech recognition systems according to claim 1 is characterized in that this at least one lexical node is related with word list representation model identical more than.
- The 3 a kind of speech recognition systems according to claim 2 is characterized in that these word list representation models are latent Markov model.
- 4 according to any one a kind of speech recognition system in the claim 1,2 or 3, it is characterized in that all lexical nodes all have the mark of distributing to them.
- 5 according to any one a kind of speech recognition system in the claim 1,2 or 3, it is characterized in that the lexical node that only appears at the decision-point front just has the mark of distributing to them.
- The 6 a kind of speech recognition systems according to one of claim 4 and 5 is characterized in that comprising in this path link an accumulation mark.
- 7 according to any one a kind of speech recognition system in the claim 4,5 or 6, it is characterized in that some node suffers restraints at least only to propagate the path link with certain predetermined labels.
- 8 according to any one a kind of speech recognition system in the claim 4 to 7, it is characterized in that this identification indicating device comprises that the score that is used for the comparison path link and mark determine to have with input continuous speech optimum matching and have the device in the path of the alternative coupling of suboptimum.
- The method of 9 one kinds of speech recognitions comprises: the model that constitutes the input voice of expectation; From an input speech signal, derive the recognition feature data; The input voice of characteristic and component model are compared, and according to comparative result indication speech recognition, the input voice of expectation are to constitute a network that comprises a plurality of lexical nodes that are associated with the word list representation model; It is characterized in that at least one lexical node can handle more than one input simultaneously.
- The 10 a kind of methods according to claim 9 is characterized in that this at least one lexical node is related with word list representation model identical more than.
- The 11 a kind of methods according to claim 10 is characterized in that this at least one lexical node is related with the several same word list representation model of the number that equals desired recognition result.
- The 12 a kind of methods according to one of claim 10 or 11 is characterized in that on each decision-point of network the relatively score of these path link, have only n bar top score path link just to propagate into following node.
- 13 according to any one a kind of method in the claim 10,11 or 12, it is characterized in that mark is distributed to all lexical nodes.
- 14 according to any one a kind of method in the claim 10,11 or 12, it is characterized in that only mark being distributed to the lexical node that appears at decision-point front in the network.
- The 15 a kind of methods according to one of claim 13 or 14 when dependent claims 12, is characterized in that also the relatively mark of path link, and only comprising not, the path link of isolabeling just propagates into following node.
- 16 according to any one a kind of method in the claim 13,14 or 15, it is characterized in that retraining the path link that has certain predetermined labels in the tag field that some node at least only is delivered in them,
- 17 according to any one a kind of method in the claim 10 to 16, it is characterized in that thinking that the input voice of having discerned determine by recall path link through network.
- 18 according to any one a kind of method in the claim 13 to 16, it is characterized in that thinking that the input speech signal of having discerned is that the accumulation mark of each path link is determined.
- 19 according to any one a kind of method in the claim 10 to 18, it is characterized in that best score path link is to be handled by the first word list representation model of a vocabulary point, suboptimum by second models treated, more than analogize, use until exhausted up to parallel model or the path link that enters.
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP93302538.9 | 1993-03-31 | ||
EP93302538 | 1993-03-31 | ||
EP93304993.4 | 1993-06-25 | ||
EP93304993 | 1993-06-25 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1120372A true CN1120372A (en) | 1996-04-10 |
CN1196104C CN1196104C (en) | 2005-04-06 |
Family
ID=26134252
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB941916529A Expired - Lifetime CN1196104C (en) | 1993-03-31 | 1994-03-31 | Speech processing |
Country Status (12)
Country | Link |
---|---|
JP (1) | JPH08508350A (en) |
KR (1) | KR100309205B1 (en) |
CN (1) | CN1196104C (en) |
AU (1) | AU682177B2 (en) |
CA (1) | CA2158064C (en) |
DE (1) | DE69416670T2 (en) |
FI (1) | FI954572A (en) |
HK (1) | HK1014390A1 (en) |
NO (1) | NO308756B1 (en) |
NZ (1) | NZ263223A (en) |
SG (1) | SG47716A1 (en) |
WO (1) | WO1994023424A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103035243A (en) * | 2012-12-18 | 2013-04-10 | 中国科学院自动化研究所 | Real-time feedback method and system of long voice continuous recognition and recognition result |
CN105913848A (en) * | 2016-04-13 | 2016-08-31 | 乐视控股(北京)有限公司 | Path storing method and path storing system based on minimal heap, and speech recognizer |
Families Citing this family (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5943438A (en) * | 1995-03-07 | 1999-08-24 | Siemens Aktiengesellschaft | Method for pattern recognition |
US7117149B1 (en) | 1999-08-30 | 2006-10-03 | Harman Becker Automotive Systems-Wavemakers, Inc. | Sound source classification |
US8073689B2 (en) | 2003-02-21 | 2011-12-06 | Qnx Software Systems Co. | Repetitive transient noise removal |
US7725315B2 (en) | 2003-02-21 | 2010-05-25 | Qnx Software Systems (Wavemakers), Inc. | Minimization of transient noises in a voice signal |
US8326621B2 (en) | 2003-02-21 | 2012-12-04 | Qnx Software Systems Limited | Repetitive transient noise removal |
US8271279B2 (en) | 2003-02-21 | 2012-09-18 | Qnx Software Systems Limited | Signature noise removal |
US7949522B2 (en) | 2003-02-21 | 2011-05-24 | Qnx Software Systems Co. | System for suppressing rain noise |
US7885420B2 (en) | 2003-02-21 | 2011-02-08 | Qnx Software Systems Co. | Wind noise suppression system |
US7895036B2 (en) | 2003-02-21 | 2011-02-22 | Qnx Software Systems Co. | System for suppressing wind noise |
US7716046B2 (en) | 2004-10-26 | 2010-05-11 | Qnx Software Systems (Wavemakers), Inc. | Advanced periodic signal enhancement |
US8170879B2 (en) | 2004-10-26 | 2012-05-01 | Qnx Software Systems Limited | Periodic signal enhancement system |
US8306821B2 (en) | 2004-10-26 | 2012-11-06 | Qnx Software Systems Limited | Sub-band periodic signal enhancement system |
US7680652B2 (en) | 2004-10-26 | 2010-03-16 | Qnx Software Systems (Wavemakers), Inc. | Periodic signal enhancement system |
US7949520B2 (en) | 2004-10-26 | 2011-05-24 | QNX Software Sytems Co. | Adaptive filter pitch extraction |
US8543390B2 (en) | 2004-10-26 | 2013-09-24 | Qnx Software Systems Limited | Multi-channel periodic signal enhancement system |
US8284947B2 (en) | 2004-12-01 | 2012-10-09 | Qnx Software Systems Limited | Reverberation estimation and suppression system |
US8027833B2 (en) | 2005-05-09 | 2011-09-27 | Qnx Software Systems Co. | System for suppressing passing tire hiss |
US8311819B2 (en) | 2005-06-15 | 2012-11-13 | Qnx Software Systems Limited | System for detecting speech with background voice estimates and noise estimates |
US8170875B2 (en) * | 2005-06-15 | 2012-05-01 | Qnx Software Systems Limited | Speech end-pointer |
US7844453B2 (en) | 2006-05-12 | 2010-11-30 | Qnx Software Systems Co. | Robust noise estimation |
US8326620B2 (en) | 2008-04-30 | 2012-12-04 | Qnx Software Systems Limited | Robust downlink speech and noise detector |
US8335685B2 (en) | 2006-12-22 | 2012-12-18 | Qnx Software Systems Limited | Ambient noise compensation system robust to high excitation noise |
US8904400B2 (en) | 2007-09-11 | 2014-12-02 | 2236008 Ontario Inc. | Processing system having a partitioning component for resource partitioning |
US8850154B2 (en) | 2007-09-11 | 2014-09-30 | 2236008 Ontario Inc. | Processing system having memory partitioning |
US8694310B2 (en) | 2007-09-17 | 2014-04-08 | Qnx Software Systems Limited | Remote control server protocol system |
US8209514B2 (en) | 2008-02-04 | 2012-06-26 | Qnx Software Systems Limited | Media processing system having resource partitioning |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4980918A (en) * | 1985-05-09 | 1990-12-25 | International Business Machines Corporation | Speech recognition system with efficient storage and rapid assembly of phonological graphs |
ATE108571T1 (en) * | 1986-06-02 | 1994-07-15 | Motorola Inc | CONTINUOUS SPEECH RECOGNITION SYSTEM. |
US5388183A (en) * | 1991-09-30 | 1995-02-07 | Kurzwell Applied Intelligence, Inc. | Speech recognition providing multiple outputs |
-
1994
- 1994-03-31 AU AU63829/94A patent/AU682177B2/en not_active Expired
- 1994-03-31 CA CA002158064A patent/CA2158064C/en not_active Expired - Lifetime
- 1994-03-31 DE DE69416670T patent/DE69416670T2/en not_active Expired - Lifetime
- 1994-03-31 WO PCT/GB1994/000704 patent/WO1994023424A1/en active IP Right Grant
- 1994-03-31 NZ NZ263223A patent/NZ263223A/en unknown
- 1994-03-31 JP JP6521853A patent/JPH08508350A/en not_active Ceased
- 1994-03-31 SG SG1996004023A patent/SG47716A1/en unknown
- 1994-03-31 KR KR1019950704196A patent/KR100309205B1/en not_active IP Right Cessation
- 1994-03-31 CN CNB941916529A patent/CN1196104C/en not_active Expired - Lifetime
-
1995
- 1995-09-27 FI FI954572A patent/FI954572A/en unknown
- 1995-09-29 NO NO953895A patent/NO308756B1/en not_active IP Right Cessation
-
1998
- 1998-12-24 HK HK98115660A patent/HK1014390A1/en not_active IP Right Cessation
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103035243A (en) * | 2012-12-18 | 2013-04-10 | 中国科学院自动化研究所 | Real-time feedback method and system of long voice continuous recognition and recognition result |
CN105913848A (en) * | 2016-04-13 | 2016-08-31 | 乐视控股(北京)有限公司 | Path storing method and path storing system based on minimal heap, and speech recognizer |
Also Published As
Publication number | Publication date |
---|---|
CN1196104C (en) | 2005-04-06 |
NO953895L (en) | 1995-11-28 |
KR100309205B1 (en) | 2001-12-17 |
DE69416670T2 (en) | 1999-06-24 |
HK1014390A1 (en) | 1999-09-24 |
DE69416670D1 (en) | 1999-04-01 |
CA2158064A1 (en) | 1994-10-13 |
NZ263223A (en) | 1997-11-24 |
FI954572A0 (en) | 1995-09-27 |
WO1994023424A1 (en) | 1994-10-13 |
JPH08508350A (en) | 1996-09-03 |
AU682177B2 (en) | 1997-09-25 |
SG47716A1 (en) | 1998-04-17 |
AU6382994A (en) | 1994-10-24 |
NO953895D0 (en) | 1995-09-29 |
FI954572A (en) | 1995-09-27 |
NO308756B1 (en) | 2000-10-23 |
CA2158064C (en) | 2000-10-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1196104C (en) | Speech processing | |
CN1058097C (en) | Connected speech recognition | |
Soong et al. | A Tree. Trellis based fast search for finding the n best sentence hypotheses in continuous speech recognition | |
EP0387602B1 (en) | Method and apparatus for the automatic determination of phonological rules as for a continuous speech recognition system | |
CN1303582C (en) | Automatic speech sound classifying method | |
CN1211779C (en) | Method and appts. for determining non-target language in speech identifying system | |
EP0769184B1 (en) | Speech recognition methods and apparatus on the basis of the modelling of new words | |
US20080294433A1 (en) | Automatic Text-Speech Mapping Tool | |
CN1215491A (en) | Speech processing | |
CN1199488A (en) | Pattern recognition | |
CN112712349A (en) | Intelligent paperless conference data information processing method based on artificial intelligence and big data analysis | |
CN1170472A (en) | Information processing system | |
Chen et al. | Discriminative training on language model | |
US6230128B1 (en) | Path link passing speech recognition with vocabulary node being capable of simultaneously processing plural path links | |
CN1223984C (en) | Client-server based distributed speech recognition system | |
CN1315721A (en) | Speech information transporting system and method for customer server | |
US10402492B1 (en) | Processing natural language grammar | |
CN1369830A (en) | Divergence elimination language model | |
CN1381005A (en) | Method and apparatus for iterative training of classification system | |
EP0692134B1 (en) | Speech processing | |
CN112668664A (en) | Intelligent voice-based talk training method | |
TW394926B (en) | Speech recognition system employing multiple grammar networks | |
Mitra et al. | Recognition of Isolated Speech Signals using Simplified Statistical Parameters | |
Yixiang et al. | The implementation of a practical high performance Mandarin and Sichuan Dialect continuous speech recognition system for parcels checking task | |
JPS59126599A (en) | Continuous word string recognition method and apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: BT RAVEN SCOTT CO., LTD. Free format text: FORMER OWNER: BRITISH TELECOMM Effective date: 20080620 Owner name: CISCO TECHNOLOGY COMPANY Free format text: FORMER OWNER: SUCRE WENDSCOTT LIMITED LIABILITY COMPANY Effective date: 20080620 |
|
C41 | Transfer of patent application or patent right or utility model | ||
C56 | Change in the name or address of the patentee |
Owner name: SUCRE WENDSCOTT LIMITED LIABILITY COMPANY Free format text: FORMER NAME OR ADDRESS: BT RAVEN SCOTT CO., LTD. |
|
CP03 | Change of name, title or address |
Address after: Delaware Patentee after: CISCO Levin Scott LLC Address before: American California Patentee before: BT Levin Scott LLC |
|
TR01 | Transfer of patent right |
Effective date of registration: 20080620 Address after: California, USA Patentee after: Cisco Technology, Inc. Address before: Delaware Patentee before: CISCO Levin Scott LLC Effective date of registration: 20080620 Address after: American California Patentee after: BT Levin Scott LLC Address before: London, England Patentee before: BRITISH TELECOMMUNICATIONS PLC |
|
C17 | Cessation of patent right | ||
CX01 | Expiry of patent term |
Expiration termination date: 20140331 Granted publication date: 20050406 |