KR100699622B1 - 음성 신호를 구분 및 인식하기 위한 시스템 및 방법 - Google Patents

음성 신호를 구분 및 인식하기 위한 시스템 및 방법 Download PDF

Info

Publication number
KR100699622B1
KR100699622B1 KR1020017008529A KR20017008529A KR100699622B1 KR 100699622 B1 KR100699622 B1 KR 100699622B1 KR 1020017008529 A KR1020017008529 A KR 1020017008529A KR 20017008529 A KR20017008529 A KR 20017008529A KR 100699622 B1 KR100699622 B1 KR 100699622B1
Authority
KR
South Korea
Prior art keywords
cluster
speech
signal
clusters
forming
Prior art date
Application number
KR1020017008529A
Other languages
English (en)
Korean (ko)
Other versions
KR20010089769A (ko
Inventor
닝 비
치엔충 창
Original Assignee
콸콤 인코포레이티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 콸콤 인코포레이티드 filed Critical 콸콤 인코포레이티드
Publication of KR20010089769A publication Critical patent/KR20010089769A/ko
Application granted granted Critical
Publication of KR100699622B1 publication Critical patent/KR100699622B1/ko

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Mobile Radio Communication Systems (AREA)
KR1020017008529A 1999-01-04 1999-12-29 음성 신호를 구분 및 인식하기 위한 시스템 및 방법 KR100699622B1 (ko)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/225,891 1999-01-04
US09/225,891 US6278972B1 (en) 1999-01-04 1999-01-04 System and method for segmentation and recognition of speech signals

Publications (2)

Publication Number Publication Date
KR20010089769A KR20010089769A (ko) 2001-10-08
KR100699622B1 true KR100699622B1 (ko) 2007-03-23

Family

ID=22846699

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020017008529A KR100699622B1 (ko) 1999-01-04 1999-12-29 음성 신호를 구분 및 인식하기 위한 시스템 및 방법

Country Status (10)

Country Link
US (1) US6278972B1 (US07585860-20090908-C00083.png)
EP (1) EP1141939B1 (US07585860-20090908-C00083.png)
JP (1) JP4391701B2 (US07585860-20090908-C00083.png)
KR (1) KR100699622B1 (US07585860-20090908-C00083.png)
CN (1) CN1173333C (US07585860-20090908-C00083.png)
AT (1) ATE323932T1 (US07585860-20090908-C00083.png)
AU (1) AU2401500A (US07585860-20090908-C00083.png)
DE (1) DE69930961T2 (US07585860-20090908-C00083.png)
HK (1) HK1044063B (US07585860-20090908-C00083.png)
WO (1) WO2000041164A1 (US07585860-20090908-C00083.png)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013085613A1 (en) * 2011-12-08 2013-06-13 Noguar, L.C. Apparatus, system, and method for distinguishing voice in a communication stream

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6735563B1 (en) * 2000-07-13 2004-05-11 Qualcomm, Inc. Method and apparatus for constructing voice templates for a speaker-independent voice recognition system
US20030154181A1 (en) * 2002-01-25 2003-08-14 Nec Usa, Inc. Document clustering with cluster refinement and model selection capabilities
US7299173B2 (en) * 2002-01-30 2007-11-20 Motorola Inc. Method and apparatus for speech detection using time-frequency variance
KR100880480B1 (ko) * 2002-02-21 2009-01-28 엘지전자 주식회사 디지털 오디오 신호의 실시간 음악/음성 식별 방법 및시스템
KR100435440B1 (ko) * 2002-03-18 2004-06-10 정희석 화자간 변별력 향상을 위한 가변 길이 코드북 생성 장치및 그 방법, 그를 이용한 코드북 조합 방식의 화자 인식장치 및 그 방법
US7050973B2 (en) * 2002-04-22 2006-05-23 Intel Corporation Speaker recognition using dynamic time warp template spotting
DE10220524B4 (de) * 2002-05-08 2006-08-10 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und zur Erkennung einer Sprache
DE10220521B4 (de) * 2002-05-08 2005-11-24 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten und Klassifizierung von Gesprächen
DE10220520A1 (de) * 2002-05-08 2003-11-20 Sap Ag Verfahren zur Erkennung von Sprachinformation
DE10220522B4 (de) * 2002-05-08 2005-11-17 Sap Ag Verfahren und System zur Verarbeitung von Sprachdaten mittels Spracherkennung und Frequenzanalyse
EP1361740A1 (de) * 2002-05-08 2003-11-12 Sap Ag Verfahren und System zur Verarbeitung von Sprachinformationen eines Dialogs
EP1363271A1 (de) * 2002-05-08 2003-11-19 Sap Ag Verfahren und System zur Verarbeitung und Speicherung von Sprachinformationen eines Dialogs
US7509257B2 (en) * 2002-12-24 2009-03-24 Marvell International Ltd. Method and apparatus for adapting reference templates
US8219391B2 (en) * 2005-02-15 2012-07-10 Raytheon Bbn Technologies Corp. Speech analyzing system with speech codebook
BRPI0707135A2 (pt) * 2006-01-18 2011-04-19 Lg Electronics Inc. aparelho e método para codificação e decodificação de sinal
US20080189109A1 (en) * 2007-02-05 2008-08-07 Microsoft Corporation Segmentation posterior based boundary point determination
CN101998289B (zh) * 2009-08-19 2015-01-28 中兴通讯股份有限公司 一种集群终端呼叫过程中控制声音播放设备的方法及装置
CA2898677C (en) * 2013-01-29 2017-12-05 Stefan Dohla Low-frequency emphasis for lpc-based coding in frequency domain
CN105989849B (zh) * 2015-06-03 2019-12-03 乐融致新电子科技(天津)有限公司 一种语音增强方法、语音识别方法、聚类方法及装置
CN105161094A (zh) * 2015-06-26 2015-12-16 徐信 一种语音音频切分手动调整切分点的系统及方法
CN111785296B (zh) * 2020-05-26 2022-06-10 浙江大学 基于重复旋律的音乐分段边界识别方法
CN115580682B (zh) * 2022-12-07 2023-04-28 北京云迹科技股份有限公司 机器人拨打电话的接通挂断时刻的确定的方法及装置

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0831455A2 (en) * 1996-09-20 1998-03-25 Digital Equipment Corporation Clustering-based signal segmentation

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NL8503304A (nl) * 1985-11-29 1987-06-16 Philips Nv Werkwijze en inrichting voor het segmenteren van een uit een akoestisch signaal, bij voorbeeld een spraaksignaal, afgeleid elektrisch signaal.
CN1013525B (zh) 1988-11-16 1991-08-14 中国科学院声学研究所 认人与不认人实时语音识别的方法和装置
EP0706172A1 (en) * 1994-10-04 1996-04-10 Hughes Aircraft Company Low bit rate speech encoder and decoder

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0831455A2 (en) * 1996-09-20 1998-03-25 Digital Equipment Corporation Clustering-based signal segmentation

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013085613A1 (en) * 2011-12-08 2013-06-13 Noguar, L.C. Apparatus, system, and method for distinguishing voice in a communication stream

Also Published As

Publication number Publication date
JP4391701B2 (ja) 2009-12-24
US6278972B1 (en) 2001-08-21
KR20010089769A (ko) 2001-10-08
JP2002534718A (ja) 2002-10-15
HK1044063B (zh) 2005-05-20
DE69930961D1 (de) 2006-05-24
CN1348580A (zh) 2002-05-08
AU2401500A (en) 2000-07-24
HK1044063A1 (en) 2002-10-04
EP1141939A1 (en) 2001-10-10
CN1173333C (zh) 2004-10-27
ATE323932T1 (de) 2006-05-15
DE69930961T2 (de) 2007-01-04
EP1141939B1 (en) 2006-04-19
WO2000041164A1 (en) 2000-07-13

Similar Documents

Publication Publication Date Title
KR100699622B1 (ko) 음성 신호를 구분 및 인식하기 위한 시스템 및 방법
US5327521A (en) Speech transformation system
Chapaneri Spoken digits recognition using weighted MFCC and improved features for dynamic time warping
JP2003514263A (ja) マッピング・マトリックスを用いた広帯域音声合成
Thakur et al. Speech recognition using euclidean distance
US5963904A (en) Phoneme dividing method using multilevel neural network
US5307442A (en) Method and apparatus for speaker individuality conversion
Pang Spectrum energy based voice activity detection
JPH07334184A (ja) 音響カテゴリ平均値計算装置及び適応化装置
JPH0612089A (ja) 音声認識方法
JP3189598B2 (ja) 信号合成方法および信号合成装置
JPS634200B2 (US07585860-20090908-C00083.png)
JPS6128998B2 (US07585860-20090908-C00083.png)
Ziółko et al. Wavelet method of speech segmentation
KR20170088165A (ko) 심층 신경망 기반 음성인식 방법 및 그 장치
CN112466276A (zh) 一种语音合成系统训练方法、装置以及可读存储介质
Singh et al. Novel feature extraction algorithm using DWT and temporal statistical techniques for word dependent speaker’s recognition
JP2017520016A (ja) パラメトリック音声合成システムに基づく声門パルスモデルの励磁信号形成方法
JPS63502304A (ja) 高雑音環境における言語認識のためのフレ−ム比較法
US11270721B2 (en) Systems and methods of pre-processing of speech signals for improved speech recognition
JP4603727B2 (ja) 音響信号分析方法及び装置
Wang et al. An implementation of multi-microphone dereverbera-tion approach as a preprocessor to the word recogni-tion system
JPH0247758B2 (US07585860-20090908-C00083.png)
JPH11352982A (ja) 音声認識システムにおける単語学習および認識方法
JPH05508242A (ja) 話者認識方法

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant
FPAY Annual fee payment

Payment date: 20100122

Year of fee payment: 4

LAPS Lapse due to unpaid annual fee