HK1044063B - 分段和識別語音信號的系統和方法 - Google Patents

分段和識別語音信號的系統和方法

Info

Publication number
HK1044063B
HK1044063B HK02105630.3A HK02105630A HK1044063B HK 1044063 B HK1044063 B HK 1044063B HK 02105630 A HK02105630 A HK 02105630A HK 1044063 B HK1044063 B HK 1044063B
Authority
HK
Hong Kong
Prior art keywords
cluster
domain signal
merged
frequency domain
pair
Prior art date
Application number
HK02105630.3A
Other languages
English (en)
Chinese (zh)
Other versions
HK1044063A1 (en
Inventor
畢寧
張承純
Original Assignee
高通股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 高通股份有限公司 filed Critical 高通股份有限公司
Publication of HK1044063A1 publication Critical patent/HK1044063A1/xx
Publication of HK1044063B publication Critical patent/HK1044063B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Mobile Radio Communication Systems (AREA)
HK02105630.3A 1999-01-04 2002-07-31 分段和識別語音信號的系統和方法 HK1044063B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/225,891 US6278972B1 (en) 1999-01-04 1999-01-04 System and method for segmentation and recognition of speech signals
PCT/US1999/031308 WO2000041164A1 (en) 1999-01-04 1999-12-29 System and method for segmentation and recognition of speech signals

Publications (2)

Publication Number Publication Date
HK1044063A1 HK1044063A1 (en) 2002-10-04
HK1044063B true HK1044063B (zh) 2005-05-20

Family

ID=22846699

Family Applications (1)

Application Number Title Priority Date Filing Date
HK02105630.3A HK1044063B (zh) 1999-01-04 2002-07-31 分段和識別語音信號的系統和方法

Country Status (10)

Country Link
US (1) US6278972B1 (US07585860-20090908-C00083.png)
EP (1) EP1141939B1 (US07585860-20090908-C00083.png)
JP (1) JP4391701B2 (US07585860-20090908-C00083.png)
KR (1) KR100699622B1 (US07585860-20090908-C00083.png)
CN (1) CN1173333C (US07585860-20090908-C00083.png)
AT (1) ATE323932T1 (US07585860-20090908-C00083.png)
AU (1) AU2401500A (US07585860-20090908-C00083.png)
DE (1) DE69930961T2 (US07585860-20090908-C00083.png)
HK (1) HK1044063B (US07585860-20090908-C00083.png)
WO (1) WO2000041164A1 (US07585860-20090908-C00083.png)

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6735563B1 (en) * 2000-07-13 2004-05-11 Qualcomm, Inc. Method and apparatus for constructing voice templates for a speaker-independent voice recognition system
US20030154181A1 (en) * 2002-01-25 2003-08-14 Nec Usa, Inc. Document clustering with cluster refinement and model selection capabilities
US7299173B2 (en) * 2002-01-30 2007-11-20 Motorola Inc. Method and apparatus for speech detection using time-frequency variance
KR100880480B1 (ko) * 2002-02-21 2009-01-28 엘지전자 주식회사 디지털 오디오 신호의 실시간 음악/음성 식별 방법 및시스템
KR100435440B1 (ko) * 2002-03-18 2004-06-10 정희석 화자간 변별력 향상을 위한 가변 길이 코드북 생성 장치및 그 방법, 그를 이용한 코드북 조합 방식의 화자 인식장치 및 그 방법
US7050973B2 (en) * 2002-04-22 2006-05-23 Intel Corporation Speaker recognition using dynamic time warp template spotting
DE10220524B4 (de) * 2002-05-08 2006-08-10 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache
DE10220521B4 (de) * 2002-05-08 2005-11-24 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen
DE10220520A1 (de) * 2002-05-08 2003-11-20 Sap Ag Verfahren zur Erkennung von Sprachinformation
DE10220522B4 (de) * 2002-05-08 2005-11-17 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse
EP1361740A1 (de) * 2002-05-08 2003-11-12 Sap Ag Verfahren und System zur Verarbeitung von Sprachinformationen eines Dialogs
EP1363271A1 (de) * 2002-05-08 2003-11-19 Sap Ag Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs
US7509257B2 (en) * 2002-12-24 2009-03-24 Marvell International Ltd. Method and apparatus for adapting reference templates
US8219391B2 (en) * 2005-02-15 2012-07-10 Raytheon Bbn Technologies Corp. Speech analyzing system with speech codebook
BRPI0707135A2 (pt) * 2006-01-18 2011-04-19 Lg Electronics Inc. aparelho e método para codificação e decodificação de sinal
US20080189109A1 (en) * 2007-02-05 2008-08-07 Microsoft Corporation Segmentation posterior based boundary point determination
CN101998289B (zh) * 2009-08-19 2015-01-28 中兴通讯股份有限公司 一种集群终端呼叫过程中控制声音播放设备的方法及装置
US20130151248A1 (en) * 2011-12-08 2013-06-13 Forrest Baker, IV Apparatus, System, and Method For Distinguishing Voice in a Communication Stream
CA2898677C (en) * 2013-01-29 2017-12-05 Stefan Dohla Low-frequency emphasis for lpc-based coding in frequency domain
CN105989849B (zh) * 2015-06-03 2019-12-03 乐融致新电子科技(天津)有限公司 一种语音增强方法、语音识别方法、聚类方法及装置
CN105161094A (zh) * 2015-06-26 2015-12-16 徐信 一种语音音频切分手动调整切分点的系统及方法
CN111785296B (zh) * 2020-05-26 2022-06-10 浙江大学 基于重复旋律的音乐分段边界识别方法
CN115580682B (zh) * 2022-12-07 2023-04-28 北京云迹科技股份有限公司 机器人拨打电话的接通挂断时刻的确定的方法及装置

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL8503304A (nl) * 1985-11-29 1987-06-16 Philips Nv Werkwijze en inrichting voor het segmenteren van een uit een akoestisch signaal, bij voorbeeld een spraaksignaal, afgeleid elektrisch signaal.
CN1013525B (zh) 1988-11-16 1991-08-14 中国科学院声学研究所 认人与不认人实时语音识别的方法和装置
EP0706172A1 (en) * 1994-10-04 1996-04-10 Hughes Aircraft Company Low bit rate speech encoder and decoder
US6314392B1 (en) 1996-09-20 2001-11-06 Digital Equipment Corporation Method and apparatus for clustering-based signal segmentation

Also Published As

Publication number Publication date
JP4391701B2 (ja) 2009-12-24
US6278972B1 (en) 2001-08-21
KR20010089769A (ko) 2001-10-08
JP2002534718A (ja) 2002-10-15
DE69930961D1 (de) 2006-05-24
CN1348580A (zh) 2002-05-08
AU2401500A (en) 2000-07-24
HK1044063A1 (en) 2002-10-04
EP1141939A1 (en) 2001-10-10
CN1173333C (zh) 2004-10-27
ATE323932T1 (de) 2006-05-15
DE69930961T2 (de) 2007-01-04
KR100699622B1 (ko) 2007-03-23
EP1141939B1 (en) 2006-04-19
WO2000041164A1 (en) 2000-07-13

Similar Documents

Publication Publication Date Title
HK1044063B (zh) 分段和識別語音信號的系統和方法
Assmann et al. Modeling the perception of concurrent vowels: Vowels with the same fundamental frequency
HK40596A (en) Optimal method of data reduction in a speech recognition system
CA2349944A1 (en) Speech coding with comfort noise variability feature for increased fidelity
ATE133529T1 (de) Verfahren zum herstellen von individuell an die konturen eines ohrkanals angepassten otoplastiken oder ohrpassstücken
WO2004114193A3 (en) Adaptive prediction of changes of physiological/pathological states using processing of biomedical signals
SE9903215L (sv) Metod vid produktion av mekanisk massa
FI990033A (fi) Menetelmä ja laite puheenkoodausparametrien määrittämiseksi
EP0240329A2 (en) Noise compensation in speech recognition
EP0781833A3 (en) Noise-robust speech processing
MX2022011501A (es) Sistema, metodo y producto de programa de computadora para optimizar un proceso de manufactura.
EP1129537B8 (en) Processing received data in a distributed speech recognition process
WO2003107225A3 (de) Verfahren zum verändern von entwurfsdaten für die herstellung eines bauteils sowie zugehörige einheiten
Akaishi et al. Harmonic and percussive sound separation based on mixed partial derivative of phase spectrogram
DE59104347D1 (de) Verfahren zum übertragen digitalisierter, blockcodierter tonsignale unter verwendung von skalenfaktoren.
Itahashi et al. A discrimination method between Japanese dialects
CN105761657A (zh) 一种采用彩色点阵显示音乐频谱或动画的方法和系统
TW325542B (en) Phrase speech input method
JPH04181298A (ja) 参照ベクトル更新方法
Hu et al. On amplitude modulation for monaural speech segregation
KR970024549A (ko) 전압동조튜닝방식의 채널선국방법
Brown et al. An Oscillatory Correlation Frame work for Computational Auditory Scene Analysis
JPH0465399B2 (US07585860-20090908-C00083.png)
GB2202667B (en) Voice recognition
CN116168715A (zh) 一种基于双滤波器的回声抑制方法

Legal Events

Date Code Title Description
PC Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee)

Effective date: 20101229