HK1044063B - 分段和識別語音信號的系統和方法 - Google Patents
分段和識別語音信號的系統和方法Info
- Publication number
- HK1044063B HK1044063B HK02105630.3A HK02105630A HK1044063B HK 1044063 B HK1044063 B HK 1044063B HK 02105630 A HK02105630 A HK 02105630A HK 1044063 B HK1044063 B HK 1044063B
- Authority
- HK
- Hong Kong
- Prior art keywords
- cluster
- domain signal
- merged
- frequency domain
- pair
- Prior art date
Links
- 230000011218 segmentation Effects 0.000 title 1
- 230000003595 spectral effect Effects 0.000 abstract 5
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Machine Translation (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Mobile Radio Communication Systems (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/225,891 US6278972B1 (en) | 1999-01-04 | 1999-01-04 | System and method for segmentation and recognition of speech signals |
PCT/US1999/031308 WO2000041164A1 (en) | 1999-01-04 | 1999-12-29 | System and method for segmentation and recognition of speech signals |
Publications (2)
Publication Number | Publication Date |
---|---|
HK1044063A1 HK1044063A1 (en) | 2002-10-04 |
HK1044063B true HK1044063B (zh) | 2005-05-20 |
Family
ID=22846699
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
HK02105630.3A HK1044063B (zh) | 1999-01-04 | 2002-07-31 | 分段和識別語音信號的系統和方法 |
Country Status (10)
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6735563B1 (en) * | 2000-07-13 | 2004-05-11 | Qualcomm, Inc. | Method and apparatus for constructing voice templates for a speaker-independent voice recognition system |
US20030154181A1 (en) * | 2002-01-25 | 2003-08-14 | Nec Usa, Inc. | Document clustering with cluster refinement and model selection capabilities |
US7299173B2 (en) * | 2002-01-30 | 2007-11-20 | Motorola Inc. | Method and apparatus for speech detection using time-frequency variance |
KR100880480B1 (ko) * | 2002-02-21 | 2009-01-28 | 엘지전자 주식회사 | 디지털 오디오 신호의 실시간 음악/음성 식별 방법 및시스템 |
KR100435440B1 (ko) * | 2002-03-18 | 2004-06-10 | 정희석 | 화자간 변별력 향상을 위한 가변 길이 코드북 생성 장치및 그 방법, 그를 이용한 코드북 조합 방식의 화자 인식장치 및 그 방법 |
US7050973B2 (en) * | 2002-04-22 | 2006-05-23 | Intel Corporation | Speaker recognition using dynamic time warp template spotting |
DE10220524B4 (de) * | 2002-05-08 | 2006-08-10 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache |
DE10220521B4 (de) * | 2002-05-08 | 2005-11-24 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen |
DE10220520A1 (de) * | 2002-05-08 | 2003-11-20 | Sap Ag | Verfahren zur Erkennung von Sprachinformation |
DE10220522B4 (de) * | 2002-05-08 | 2005-11-17 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse |
EP1361740A1 (de) * | 2002-05-08 | 2003-11-12 | Sap Ag | Verfahren und System zur Verarbeitung von Sprachinformationen eines Dialogs |
EP1363271A1 (de) * | 2002-05-08 | 2003-11-19 | Sap Ag | Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs |
US7509257B2 (en) * | 2002-12-24 | 2009-03-24 | Marvell International Ltd. | Method and apparatus for adapting reference templates |
US8219391B2 (en) * | 2005-02-15 | 2012-07-10 | Raytheon Bbn Technologies Corp. | Speech analyzing system with speech codebook |
BRPI0707135A2 (pt) * | 2006-01-18 | 2011-04-19 | Lg Electronics Inc. | aparelho e método para codificação e decodificação de sinal |
US20080189109A1 (en) * | 2007-02-05 | 2008-08-07 | Microsoft Corporation | Segmentation posterior based boundary point determination |
CN101998289B (zh) * | 2009-08-19 | 2015-01-28 | 中兴通讯股份有限公司 | 一种集群终端呼叫过程中控制声音播放设备的方法及装置 |
US20130151248A1 (en) * | 2011-12-08 | 2013-06-13 | Forrest Baker, IV | Apparatus, System, and Method For Distinguishing Voice in a Communication Stream |
CA2898677C (en) * | 2013-01-29 | 2017-12-05 | Stefan Dohla | Low-frequency emphasis for lpc-based coding in frequency domain |
CN105989849B (zh) * | 2015-06-03 | 2019-12-03 | 乐融致新电子科技(天津)有限公司 | 一种语音增强方法、语音识别方法、聚类方法及装置 |
CN105161094A (zh) * | 2015-06-26 | 2015-12-16 | 徐信 | 一种语音音频切分手动调整切分点的系统及方法 |
CN111785296B (zh) * | 2020-05-26 | 2022-06-10 | 浙江大学 | 基于重复旋律的音乐分段边界识别方法 |
CN115580682B (zh) * | 2022-12-07 | 2023-04-28 | 北京云迹科技股份有限公司 | 机器人拨打电话的接通挂断时刻的确定的方法及装置 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL8503304A (nl) * | 1985-11-29 | 1987-06-16 | Philips Nv | Werkwijze en inrichting voor het segmenteren van een uit een akoestisch signaal, bij voorbeeld een spraaksignaal, afgeleid elektrisch signaal. |
CN1013525B (zh) | 1988-11-16 | 1991-08-14 | 中国科学院声学研究所 | 认人与不认人实时语音识别的方法和装置 |
EP0706172A1 (en) * | 1994-10-04 | 1996-04-10 | Hughes Aircraft Company | Low bit rate speech encoder and decoder |
US6314392B1 (en) | 1996-09-20 | 2001-11-06 | Digital Equipment Corporation | Method and apparatus for clustering-based signal segmentation |
-
1999
- 1999-01-04 US US09/225,891 patent/US6278972B1/en not_active Expired - Lifetime
- 1999-12-29 CN CNB998153230A patent/CN1173333C/zh not_active Expired - Fee Related
- 1999-12-29 WO PCT/US1999/031308 patent/WO2000041164A1/en active IP Right Grant
- 1999-12-29 JP JP2000592818A patent/JP4391701B2/ja not_active Expired - Fee Related
- 1999-12-29 EP EP99967799A patent/EP1141939B1/en not_active Expired - Lifetime
- 1999-12-29 AT AT99967799T patent/ATE323932T1/de not_active IP Right Cessation
- 1999-12-29 DE DE69930961T patent/DE69930961T2/de not_active Expired - Lifetime
- 1999-12-29 AU AU24015/00A patent/AU2401500A/en not_active Abandoned
- 1999-12-29 KR KR1020017008529A patent/KR100699622B1/ko not_active IP Right Cessation
-
2002
- 2002-07-31 HK HK02105630.3A patent/HK1044063B/zh not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
JP4391701B2 (ja) | 2009-12-24 |
US6278972B1 (en) | 2001-08-21 |
KR20010089769A (ko) | 2001-10-08 |
JP2002534718A (ja) | 2002-10-15 |
DE69930961D1 (de) | 2006-05-24 |
CN1348580A (zh) | 2002-05-08 |
AU2401500A (en) | 2000-07-24 |
HK1044063A1 (en) | 2002-10-04 |
EP1141939A1 (en) | 2001-10-10 |
CN1173333C (zh) | 2004-10-27 |
ATE323932T1 (de) | 2006-05-15 |
DE69930961T2 (de) | 2007-01-04 |
KR100699622B1 (ko) | 2007-03-23 |
EP1141939B1 (en) | 2006-04-19 |
WO2000041164A1 (en) | 2000-07-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
HK1044063B (zh) | 分段和識別語音信號的系統和方法 | |
Assmann et al. | Modeling the perception of concurrent vowels: Vowels with the same fundamental frequency | |
HK40596A (en) | Optimal method of data reduction in a speech recognition system | |
CA2349944A1 (en) | Speech coding with comfort noise variability feature for increased fidelity | |
ATE133529T1 (de) | Verfahren zum herstellen von individuell an die konturen eines ohrkanals angepassten otoplastiken oder ohrpassstücken | |
WO2004114193A3 (en) | Adaptive prediction of changes of physiological/pathological states using processing of biomedical signals | |
SE9903215L (sv) | Metod vid produktion av mekanisk massa | |
FI990033A (fi) | Menetelmä ja laite puheenkoodausparametrien määrittämiseksi | |
EP0240329A2 (en) | Noise compensation in speech recognition | |
EP0781833A3 (en) | Noise-robust speech processing | |
MX2022011501A (es) | Sistema, metodo y producto de programa de computadora para optimizar un proceso de manufactura. | |
EP1129537B8 (en) | Processing received data in a distributed speech recognition process | |
WO2003107225A3 (de) | Verfahren zum verändern von entwurfsdaten für die herstellung eines bauteils sowie zugehörige einheiten | |
Akaishi et al. | Harmonic and percussive sound separation based on mixed partial derivative of phase spectrogram | |
DE59104347D1 (de) | Verfahren zum übertragen digitalisierter, blockcodierter tonsignale unter verwendung von skalenfaktoren. | |
Itahashi et al. | A discrimination method between Japanese dialects | |
CN105761657A (zh) | 一种采用彩色点阵显示音乐频谱或动画的方法和系统 | |
TW325542B (en) | Phrase speech input method | |
JPH04181298A (ja) | 参照ベクトル更新方法 | |
Hu et al. | On amplitude modulation for monaural speech segregation | |
KR970024549A (ko) | 전압동조튜닝방식의 채널선국방법 | |
Brown et al. | An Oscillatory Correlation Frame work for Computational Auditory Scene Analysis | |
JPH0465399B2 (US07585860-20090908-C00083.png) | ||
GB2202667B (en) | Voice recognition | |
CN116168715A (zh) | 一种基于双滤波器的回声抑制方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PC | Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee) |
Effective date: 20101229 |