CN1132147C - Method of extracting features in a voice recognition system - Google Patents
Method of extracting features in a voice recognition system
- Publication number: CN1132147C (application CN00102407A)
- Authority: CN (China)
- Prior art keywords: coefficient, eigenvector, cepstrum, speech recognition, definition
- Prior art date
- Legal status: Expired - Lifetime (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Concept ID | Name | Type | Score | Sections | Count |
---|---|---|---|---|---|
238000000034 | method | Methods | 0.000 | title, claims, abstract, description | 29 |
238000000605 | extraction | Methods | 0.000 | claims, description | 14 |
238000004891 | communication | Methods | 0.000 | claims, description | 3 |
230000008676 | import | Effects | 0.000 | claims, description | 3 |
230000007274 | generation of a signal involved in cell-cell signaling | Effects | 0.000 | claims, description | 2 |
239000013598 | vector | Substances | 0.000 | abstract, description | 9 |
230000015654 | memory | Effects | 0.000 | description | 19 |
238000012549 | training | Methods | 0.000 | description | 13 |
238000012545 | processing | Methods | 0.000 | description | 9 |
230000008569 | process | Effects | 0.000 | description | 8 |
230000008878 | coupling | Effects | 0.000 | description | 4 |
238000010168 | coupling process | Methods | 0.000 | description | 4 |
238000005859 | coupling reaction | Methods | 0.000 | description | 4 |
238000010586 | diagram | Methods | 0.000 | description | 4 |
239000000284 | extract | Substances | 0.000 | description | 4 |
230000000694 | effects | Effects | 0.000 | description | 3 |
230000035945 | sensitivity | Effects | 0.000 | description | 3 |
238000004458 | analytical method | Methods | 0.000 | description | 2 |
238000012512 | characterization method | Methods | 0.000 | description | 2 |
238000005516 | engineering process | Methods | 0.000 | description | 2 |
238000001914 | filtration | Methods | 0.000 | description | 2 |
230000004044 | response | Effects | 0.000 | description | 2 |
238000001228 | spectrum | Methods | 0.000 | description | 2 |
206010038743 | Restlessness | Diseases | 0.000 | description | 1 |
238000013459 | approach | Methods | 0.000 | description | 1 |
230000008901 | benefit | Effects | 0.000 | description | 1 |
238000004364 | calculation method | Methods | 0.000 | description | 1 |
230000001413 | cellular effect | Effects | 0.000 | description | 1 |
238000006243 | chemical reaction | Methods | 0.000 | description | 1 |
230000001427 | coherent effect | Effects | 0.000 | description | 1 |
230000002596 | correlated effect | Effects | 0.000 | description | 1 |
230000000875 | corresponding effect | Effects | 0.000 | description | 1 |
230000002939 | deleterious effect | Effects | 0.000 | description | 1 |
230000000994 | depressogenic effect | Effects | 0.000 | description | 1 |
239000011159 | matrix material | Substances | 0.000 | description | 1 |
238000012986 | modification | Methods | 0.000 | description | 1 |
230000004048 | modification | Effects | 0.000 | description | 1 |
230000008447 | perception | Effects | 0.000 | description | 1 |
238000004904 | shortening | Methods | 0.000 | description | 1 |
230000005236 | sound signal | Effects | 0.000 | description | 1 |
238000012360 | testing method | Methods | 0.000 | description | 1 |
238000012546 | transfer | Methods | 0.000 | description | 1 |
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B65—CONVEYING; PACKING; STORING; HANDLING THIN OR FILAMENTARY MATERIAL
- B65B—MACHINES, APPARATUS OR DEVICES FOR, OR METHODS OF, PACKAGING ARTICLES OR MATERIALS; UNPACKING
- B65B7/00—Closing containers or receptacles after filling
- B65B7/16—Closing semi-rigid or rigid containers or receptacles not deformed by, or not taking-up shape of, contents, e.g. boxes or cartons
- B65B7/162—Closing semi-rigid or rigid containers or receptacles not deformed by, or not taking-up shape of, contents, e.g. boxes or cartons by feeding web material to securing means
- B65B7/164—Securing by heat-sealing
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B65—CONVEYING; PACKING; STORING; HANDLING THIN OR FILAMENTARY MATERIAL
- B65B—MACHINES, APPARATUS OR DEVICES FOR, OR METHODS OF, PACKAGING ARTICLES OR MATERIALS; UNPACKING
- B65B43/00—Forming, feeding, opening or setting-up containers or receptacles in association with packaging
- B65B43/42—Feeding or positioning bags, boxes, or cartons in the distended, opened, or set-up state; Feeding preformed rigid containers, e.g. tins, capsules, glass tubes, glasses, to the packaging position; Locating containers or receptacles at the filling position; Supporting containers or receptacles during the filling operation
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B65—CONVEYING; PACKING; STORING; HANDLING THIN OR FILAMENTARY MATERIAL
- B65B—MACHINES, APPARATUS OR DEVICES FOR, OR METHODS OF, PACKAGING ARTICLES OR MATERIALS; UNPACKING
- B65B51/00—Devices for, or methods of, sealing or securing package folds or closures; Devices for gathering or twisting wrappers, or necks of bags
- B65B51/10—Applying or generating heat or pressure or combinations thereof
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B65—CONVEYING; PACKING; STORING; HANDLING THIN OR FILAMENTARY MATERIAL
- B65B—MACHINES, APPARATUS OR DEVICES FOR, OR METHODS OF, PACKAGING ARTICLES OR MATERIALS; UNPACKING
- B65B57/00—Automatic control, checking, warning, or safety devices
- B65B57/18—Automatic control, checking, warning, or safety devices causing operation of audible or visible alarm signals
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B65—CONVEYING; PACKING; STORING; HANDLING THIN OR FILAMENTARY MATERIAL
- B65B—MACHINES, APPARATUS OR DEVICES FOR, OR METHODS OF, PACKAGING ARTICLES OR MATERIALS; UNPACKING
- B65B59/00—Arrangements to enable machines to handle articles of different sizes, to produce packages of different sizes, to vary the contents of packages, to handle different types of packaging material, or to give access for cleaning or maintenance purposes
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B65—CONVEYING; PACKING; STORING; HANDLING THIN OR FILAMENTARY MATERIAL
- B65B—MACHINES, APPARATUS OR DEVICES FOR, OR METHODS OF, PACKAGING ARTICLES OR MATERIALS; UNPACKING
- B65B61/00—Auxiliary devices, not otherwise provided for, for operating on sheets, blanks, webs, binding material, containers or packages
- B65B61/04—Auxiliary devices, not otherwise provided for, for operating on sheets, blanks, webs, binding material, containers or packages for severing webs, or for separating joined packages
- B65B61/06—Auxiliary devices, not otherwise provided for, for operating on sheets, blanks, webs, binding material, containers or packages for severing webs, or for separating joined packages by cutting
Landscapes
- Engineering & Computer Science (AREA)
- Mechanical Engineering (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Mobile Radio Communication Systems (AREA)
- Telephonic Communication Services (AREA)
Claims (8)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/256,280 (US6182036B1, en) | 1999-02-23 | 1999-02-23 | Method of extracting features in a voice recognition system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1264889A (zh) | 2000-08-30 |
CN1132147C (zh) | 2003-12-24 |
Family
ID=22971643
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN00102407A (CN1132147C, zh; Expired - Lifetime) | 1999-02-23 | 2000-02-23 | Method of extracting features in a voice recognition system |
Country Status (5)
Country | Link |
---|---|
US (1) | US6182036B1 (zh) |
JP (1) | JP4912518B2 (zh) |
KR (1) | KR100321464B1 (zh) |
CN (1) | CN1132147C (zh) |
GB (1) | GB2347775B (zh) |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9076448B2 (en) * | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US8290768B1 (en) | 2000-06-21 | 2012-10-16 | International Business Machines Corporation | System and method for determining a set of attributes based on content of communications |
US6408277B1 (en) * | 2000-06-21 | 2002-06-18 | Banter Limited | System and method for automatic task prioritization |
US9699129B1 (en) | 2000-06-21 | 2017-07-04 | International Business Machines Corporation | System and method for increasing email productivity |
JP3877270B2 (ja) * | 2000-07-12 | 2007-02-07 | Alpine Electronics, Inc. | Speech feature extraction device |
US20020065649A1 (en) * | 2000-08-25 | 2002-05-30 | Yoon Kim | Mel-frequency linear prediction speech recognition apparatus and method |
US7644057B2 (en) | 2001-01-03 | 2010-01-05 | International Business Machines Corporation | System and method for electronic communication management |
US6633839B2 (en) * | 2001-02-02 | 2003-10-14 | Motorola, Inc. | Method and apparatus for speech reconstruction in a distributed speech recognition system |
US20020178004A1 (en) * | 2001-05-23 | 2002-11-28 | Chienchung Chang | Method and apparatus for voice recognition |
ES2190342B1 (es) * | 2001-06-25 | 2004-11-16 | Universitat Pompeu Fabra | Method for identifying audio sequences |
US8495002B2 (en) * | 2003-05-06 | 2013-07-23 | International Business Machines Corporation | Software tool for training and testing a knowledge base |
US20050187913A1 (en) | 2003-05-06 | 2005-08-25 | Yoram Nelken | Web-based customer service interface |
US20060271368A1 (en) * | 2005-05-25 | 2006-11-30 | Yishay Carmiel | Voice interface for consumer products |
CN101165779B (zh) * | 2006-10-20 | 2010-06-02 | Sony Corporation | Information processing apparatus and method, program, and recording medium |
US20100057452A1 (en) * | 2008-08-28 | 2010-03-04 | Microsoft Corporation | Speech interfaces |
EP2328363B1 (en) * | 2009-09-11 | 2016-05-18 | Starkey Laboratories, Inc. | Sound classification system for hearing aids |
US8670980B2 (en) * | 2009-10-26 | 2014-03-11 | Panasonic Corporation | Tone determination device and method |
US8719019B2 (en) * | 2011-04-25 | 2014-05-06 | Microsoft Corporation | Speaker identification |
US20160283864A1 (en) * | 2015-03-27 | 2016-09-29 | Qualcomm Incorporated | Sequential image sampling and storage of fine-tuned features |
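DE102017109736A1 (de) | 2017-05-05 | 2018-11-08 | Storopack Hans Reichenecker Gmbh | Device and method for cushioning at least one object in a container |
CN108154883A (zh) * | 2018-03-23 | 2018-06-12 | Nanchang Hangkong University | Compact shelving management system with voice control function |
CN108694951B (zh) * | 2018-05-22 | 2020-05-22 | South China University of Technology | Speaker identification method based on multi-stream hierarchical fusion transform features and a long short-term memory network |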
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA1232686A (en) * | 1985-01-30 | 1988-02-09 | Northern Telecom Limited | Speech recognition |
JP2760096B2 (ja) * | 1989-10-31 | 1998-05-28 | NEC Corporation | Speech recognition system |
US5097509A (en) * | 1990-03-28 | 1992-03-17 | Northern Telecom Limited | Rejection method for speech recognition |
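JP2973805B2 (ja) * | 1993-12-10 | 1999-11-08 | NEC Corporation | Standard pattern creation device |
JP3537949B2 (ja) * | 1996-03-06 | 2004-06-14 | Toshiba Corporation | Pattern recognition device and dictionary correction method for the same device |
JPH10149190A (ja) * | 1996-11-19 | 1998-06-02 | Matsushita Electric Ind Co Ltd | Speech recognition method and speech recognition device |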
US6029124A (en) * | 1997-02-21 | 2000-02-22 | Dragon Systems, Inc. | Sequential, nonparametric speech recognition and speaker identification |
GB9706174D0 (en) * | 1997-03-25 | 1997-11-19 | Secr Defence | Recognition system |
1999
- 1999-02-23 US US09/256,280 (US6182036B1), not active, Expired - Lifetime
2000
- 2000-02-15 JP JP2000036104A (JP4912518B2), not active, Expired - Lifetime
- 2000-02-18 GB GB0003949A (GB2347775B), not active, Expired - Lifetime
- 2000-02-22 KR KR1020000008392A (KR100321464B1), active, IP Right Grant
- 2000-02-23 CN CN00102407A (CN1132147C), not active, Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
KR100321464B1 (ko) | 2002-03-18 |
KR20000071366A (ko) | 2000-11-25 |
US6182036B1 (en) | 2001-01-30 |
CN1264889A (zh) | 2000-08-30 |
GB2347775B (en) | 2001-08-08 |
JP4912518B2 (ja) | 2012-04-11 |
GB2347775A (en) | 2000-09-13 |
JP2000250576A (ja) | 2000-09-14 |
GB0003949D0 (en) | 2000-04-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1132147C (zh) | | Method of extracting features in a voice recognition system |
EP1301922B1 (en) | | System and method for voice recognition with a plurality of voice recognition engines |
Hermansky et al. | | RASTA processing of speech |
Shrawankar et al. | | Techniques for feature extraction in speech recognition system: A comparative study |
CN1160698C (zh) | | Endpoint location of speech in a noisy signal |
JP4202124B2 (ja) | | Method and apparatus for constructing voice templates for a speaker-independent voice recognition system |
US6836758B2 (en) | | System and method for hybrid voice recognition |
CN100527224C (zh) | | System and method for efficient storage of voice recognition models |
CN103377651B (zh) | | Automatic speech synthesis device and method |
CN1352787A (zh) | | Distributed voice recognition system |
WO2002095729A1 (en) | | Method and apparatus for adapting voice recognition templates |
US5943647A (en) | | Speech recognition based on HMMs |
KR20040038419A (ko) | | Emotion recognition system and emotion recognition method using speech |
Kaur et al. | | Optimizing feature extraction techniques constituting phone based modelling on connected words for Punjabi automatic speech recognition |
KR20010093325A (ko) | | Method and apparatus for testing user interface integrity of speech-enabled devices |
Li et al. | | An auditory system-based feature for robust speech recognition |
Gemello et al. | | Multi-source neural networks for speech recognition: a review of recent results |
Chakraborty et al. | | An automatic speaker recognition system |
Marković et al. | | Recognition of normal and whispered speech based on RASTA filtering and DTW algorithm |
CN108986794B (zh) | | Speaker compensation method based on power-function frequency transformation |
Marković et al. | | Recognition of Whispered Speech Based on PLP Features and DTW Algorithm |
Marković et al. | | Whispered Speech Recognition Based on DTW algorithm and µFCC feature |
Leibman et al. | | Perceptual time-varying modelling of speech signals for ASR and compression application |
Takagi et al. | | Speech recognition with rapid environment adaptation by spectrum equalization |
O'Shaughnessy | | Improving analysis techniques for automatic speech recognition |
Legal Events
Code | Title | Description |
---|---|---|
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C06 | Publication | |
PB01 | Publication | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
ASS | Succession or assignment of patent right | Owner name: MOTOROLA MOBILITY, INC.; former owner: MOTOROLA INC.; effective date: 2011-01-26 |
C41 | Transfer of patent application or patent right or utility model | |
TR01 | Transfer of patent right | Effective date of registration: 2011-01-26; address after: Illinois; patentee after: MOTOROLA MOBILITY, Inc.; address before: Illinois; patentee before: Motorola, Inc. |
C41 | Transfer of patent application or patent right or utility model | |
C56 | Change in the name or address of the patentee | |
CP01 | Change in the name or title of a patent holder | Address after: Illinois; patentee after: MOTOROLA MOBILITY LLC; address before: Illinois; patentee before: MOTOROLA MOBILITY, Inc. |
CP02 | Change in the address of a patent holder | Address after: Illinois; patentee after: MOTOROLA MOBILITY, Inc.; address before: Illinois; patentee before: MOTOROLA MOBILITY, Inc. |
TR01 | Transfer of patent right | Effective date of registration: 2016-06-12; address after: California, USA; patentee after: Google Technology Holdings LLC; address before: Illinois; patentee before: MOTOROLA MOBILITY LLC |
CX01 | Expiry of patent term | Granted publication date: 2003-12-24 |