KR100699622B1 - 음성 신호를 구분 및 인식하기 위한 시스템 및 방법 - Google Patents
음성 신호를 구분 및 인식하기 위한 시스템 및 방법 Download PDFInfo
- Publication number
- KR100699622B1 KR100699622B1 KR1020017008529A KR20017008529A KR100699622B1 KR 100699622 B1 KR100699622 B1 KR 100699622B1 KR 1020017008529 A KR1020017008529 A KR 1020017008529A KR 20017008529 A KR20017008529 A KR 20017008529A KR 100699622 B1 KR100699622 B1 KR 100699622B1
- Authority
- KR
- South Korea
- Prior art keywords
- cluster
- speech
- signal
- clusters
- forming
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 230000011218 segmentation Effects 0.000 title description 2
- 230000003595 spectral effect Effects 0.000 claims abstract description 43
- 230000010354 integration Effects 0.000 claims description 12
- 238000012545 processing Methods 0.000 claims description 2
- 239000006185 dispersion Substances 0.000 claims 1
- 238000001228 spectrum Methods 0.000 description 9
- 238000010586 diagram Methods 0.000 description 6
- IERHLVCPSMICTF-XVFCMESISA-N CMP group Chemical group P(=O)(O)(O)OC[C@@H]1[C@H]([C@H]([C@@H](O1)N1C(=O)N=C(N)C=C1)O)O IERHLVCPSMICTF-XVFCMESISA-N 0.000 description 4
- 239000013317 conjugated microporous polymer Substances 0.000 description 4
- 210000003643 myeloid progenitor cell Anatomy 0.000 description 4
- 238000012549 training Methods 0.000 description 4
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 241000970807 Thermoanaerobacterales Species 0.000 description 1
- 238000007596 consolidation process Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Machine Translation (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Mobile Radio Communication Systems (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/225,891 | 1999-01-04 | ||
US09/225,891 US6278972B1 (en) | 1999-01-04 | 1999-01-04 | System and method for segmentation and recognition of speech signals |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20010089769A KR20010089769A (ko) | 2001-10-08 |
KR100699622B1 true KR100699622B1 (ko) | 2007-03-23 |
Family
ID=22846699
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020017008529A KR100699622B1 (ko) | 1999-01-04 | 1999-12-29 | 음성 신호를 구분 및 인식하기 위한 시스템 및 방법 |
Country Status (10)
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013085613A1 (en) * | 2011-12-08 | 2013-06-13 | Noguar, L.C. | Apparatus, system, and method for distinguishing voice in a communication stream |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6735563B1 (en) * | 2000-07-13 | 2004-05-11 | Qualcomm, Inc. | Method and apparatus for constructing voice templates for a speaker-independent voice recognition system |
US20030154181A1 (en) * | 2002-01-25 | 2003-08-14 | Nec Usa, Inc. | Document clustering with cluster refinement and model selection capabilities |
US7299173B2 (en) * | 2002-01-30 | 2007-11-20 | Motorola Inc. | Method and apparatus for speech detection using time-frequency variance |
KR100880480B1 (ko) * | 2002-02-21 | 2009-01-28 | 엘지전자 주식회사 | 디지털 오디오 신호의 실시간 음악/음성 식별 방법 및시스템 |
KR100435440B1 (ko) * | 2002-03-18 | 2004-06-10 | 정희석 | 화자간 변별력 향상을 위한 가변 길이 코드북 생성 장치및 그 방법, 그를 이용한 코드북 조합 방식의 화자 인식장치 및 그 방법 |
US7050973B2 (en) * | 2002-04-22 | 2006-05-23 | Intel Corporation | Speaker recognition using dynamic time warp template spotting |
DE10220524B4 (de) * | 2002-05-08 | 2006-08-10 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache |
DE10220521B4 (de) * | 2002-05-08 | 2005-11-24 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen |
DE10220520A1 (de) * | 2002-05-08 | 2003-11-20 | Sap Ag | Verfahren zur Erkennung von Sprachinformation |
DE10220522B4 (de) * | 2002-05-08 | 2005-11-17 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse |
EP1361740A1 (de) * | 2002-05-08 | 2003-11-12 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachinformationen eines Dialogs |
EP1363271A1 (de) * | 2002-05-08 | 2003-11-19 | Sap Ag | Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs |
US7509257B2 (en) * | 2002-12-24 | 2009-03-24 | Marvell International Ltd. | Method and apparatus for adapting reference templates |
US8219391B2 (en) * | 2005-02-15 | 2012-07-10 | Raytheon Bbn Technologies Corp. | Speech analyzing system with speech codebook |
BRPI0707135A2 (pt) * | 2006-01-18 | 2011-04-19 | Lg Electronics Inc. | aparelho e método para codificação e decodificação de sinal |
US20080189109A1 (en) * | 2007-02-05 | 2008-08-07 | Microsoft Corporation | Segmentation posterior based boundary point determination |
CN101998289B (zh) * | 2009-08-19 | 2015-01-28 | 中兴通讯股份有限公司 | 一种集群终端呼叫过程中控制声音播放设备的方法及装置 |
CA2898677C (en) * | 2013-01-29 | 2017-12-05 | Stefan Dohla | Low-frequency emphasis for lpc-based coding in frequency domain |
CN105989849B (zh) * | 2015-06-03 | 2019-12-03 | 乐融致新电子科技(天津)有限公司 | 一种语音增强方法、语音识别方法、聚类方法及装置 |
CN105161094A (zh) * | 2015-06-26 | 2015-12-16 | 徐信 | 一种语音音频切分手动调整切分点的系统及方法 |
CN111785296B (zh) * | 2020-05-26 | 2022-06-10 | 浙江大学 | 基于重复旋律的音乐分段边界识别方法 |
CN115580682B (zh) * | 2022-12-07 | 2023-04-28 | 北京云迹科技股份有限公司 | 机器人拨打电话的接通挂断时刻的确定的方法及装置 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0831455A2 (en) * | 1996-09-20 | 1998-03-25 | Digital Equipment Corporation | Clustering-based signal segmentation |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL8503304A (nl) * | 1985-11-29 | 1987-06-16 | Philips Nv | Werkwijze en inrichting voor het segmenteren van een uit een akoestisch signaal, bij voorbeeld een spraaksignaal, afgeleid elektrisch signaal. |
CN1013525B (zh) | 1988-11-16 | 1991-08-14 | 中国科学院声学研究所 | 认人与不认人实时语音识别的方法和装置 |
EP0706172A1 (en) * | 1994-10-04 | 1996-04-10 | Hughes Aircraft Company | Low bit rate speech encoder and decoder |
-
1999
- 1999-01-04 US US09/225,891 patent/US6278972B1/en not_active Expired - Lifetime
- 1999-12-29 CN CNB998153230A patent/CN1173333C/zh not_active Expired - Fee Related
- 1999-12-29 WO PCT/US1999/031308 patent/WO2000041164A1/en active IP Right Grant
- 1999-12-29 JP JP2000592818A patent/JP4391701B2/ja not_active Expired - Fee Related
- 1999-12-29 EP EP99967799A patent/EP1141939B1/en not_active Expired - Lifetime
- 1999-12-29 AT AT99967799T patent/ATE323932T1/de not_active IP Right Cessation
- 1999-12-29 DE DE69930961T patent/DE69930961T2/de not_active Expired - Lifetime
- 1999-12-29 AU AU24015/00A patent/AU2401500A/en not_active Abandoned
- 1999-12-29 KR KR1020017008529A patent/KR100699622B1/ko not_active IP Right Cessation
-
2002
- 2002-07-31 HK HK02105630.3A patent/HK1044063B/zh not_active IP Right Cessation
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0831455A2 (en) * | 1996-09-20 | 1998-03-25 | Digital Equipment Corporation | Clustering-based signal segmentation |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013085613A1 (en) * | 2011-12-08 | 2013-06-13 | Noguar, L.C. | Apparatus, system, and method for distinguishing voice in a communication stream |
Also Published As
Publication number | Publication date |
---|---|
JP4391701B2 (ja) | 2009-12-24 |
US6278972B1 (en) | 2001-08-21 |
KR20010089769A (ko) | 2001-10-08 |
JP2002534718A (ja) | 2002-10-15 |
HK1044063B (zh) | 2005-05-20 |
DE69930961D1 (de) | 2006-05-24 |
CN1348580A (zh) | 2002-05-08 |
AU2401500A (en) | 2000-07-24 |
HK1044063A1 (en) | 2002-10-04 |
EP1141939A1 (en) | 2001-10-10 |
CN1173333C (zh) | 2004-10-27 |
ATE323932T1 (de) | 2006-05-15 |
DE69930961T2 (de) | 2007-01-04 |
EP1141939B1 (en) | 2006-04-19 |
WO2000041164A1 (en) | 2000-07-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100699622B1 (ko) | 음성 신호를 구분 및 인식하기 위한 시스템 및 방법 | |
US5327521A (en) | Speech transformation system | |
Chapaneri | Spoken digits recognition using weighted MFCC and improved features for dynamic time warping | |
JP2003514263A (ja) | マッピング・マトリックスを用いた広帯域音声合成 | |
Thakur et al. | Speech recognition using euclidean distance | |
US5963904A (en) | Phoneme dividing method using multilevel neural network | |
US5307442A (en) | Method and apparatus for speaker individuality conversion | |
Pang | Spectrum energy based voice activity detection | |
JPH07334184A (ja) | 音響カテゴリ平均値計算装置及び適応化装置 | |
JPH0612089A (ja) | 音声認識方法 | |
JP3189598B2 (ja) | 信号合成方法および信号合成装置 | |
JPS634200B2 (US07585860-20090908-C00083.png) | ||
JPS6128998B2 (US07585860-20090908-C00083.png) | ||
Ziółko et al. | Wavelet method of speech segmentation | |
KR20170088165A (ko) | 심층 신경망 기반 음성인식 방법 및 그 장치 | |
CN112466276A (zh) | 一种语音合成系统训练方法、装置以及可读存储介质 | |
Singh et al. | Novel feature extraction algorithm using DWT and temporal statistical techniques for word dependent speaker’s recognition | |
JP2017520016A (ja) | パラメトリック音声合成システムに基づく声門パルスモデルの励磁信号形成方法 | |
JPS63502304A (ja) | 高雑音環境における言語認識のためのフレ−ム比較法 | |
US11270721B2 (en) | Systems and methods of pre-processing of speech signals for improved speech recognition | |
JP4603727B2 (ja) | 音響信号分析方法及び装置 | |
Wang et al. | An implementation of multi-microphone dereverbera-tion approach as a preprocessor to the word recogni-tion system | |
JPH0247758B2 (US07585860-20090908-C00083.png) | ||
JPH11352982A (ja) | 音声認識システムにおける単語学習および認識方法 | |
JPH05508242A (ja) | 話者認識方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A201 | Request for examination | ||
E902 | Notification of reason for refusal | ||
E701 | Decision to grant or registration of patent right | ||
GRNT | Written decision to grant | ||
FPAY | Annual fee payment |
Payment date: 20100122 Year of fee payment: 4 |
|
LAPS | Lapse due to unpaid annual fee |