JP2002358095A - 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体 - Google Patents
音声処理装置および音声処理方法、並びにプログラムおよび記録媒体Info
- Publication number
- JP2002358095A JP2002358095A JP2002069603A JP2002069603A JP2002358095A JP 2002358095 A JP2002358095 A JP 2002358095A JP 2002069603 A JP2002069603 A JP 2002069603A JP 2002069603 A JP2002069603 A JP 2002069603A JP 2002358095 A JP2002358095 A JP 2002358095A
- Authority
- JP
- Japan
- Prior art keywords
- cluster
- unit
- speech
- dictionary
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Manipulator (AREA)
Priority Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2002069603A JP2002358095A (ja) | 2001-03-30 | 2002-03-14 | 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体 |
| EP02708744A EP1376536A1 (en) | 2001-03-30 | 2002-04-01 | Sound processing apparatus |
| KR1020027016297A KR20030007793A (ko) | 2001-03-30 | 2002-04-01 | 음성 처리 장치 |
| PCT/JP2002/003248 WO2002080141A1 (en) | 2001-03-30 | 2002-04-01 | Sound processing apparatus |
| CN02801646A CN1462428A (zh) | 2001-03-30 | 2002-04-01 | 语音处理装置 |
| US10/296,797 US7228276B2 (en) | 2001-03-30 | 2002-04-01 | Sound processing registering a word in a dictionary |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2001097843 | 2001-03-30 | ||
| JP2001-97843 | 2001-03-30 | ||
| JP2002069603A JP2002358095A (ja) | 2001-03-30 | 2002-03-14 | 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2002358095A true JP2002358095A (ja) | 2002-12-13 |
| JP2002358095A5 JP2002358095A5 (enExample) | 2005-09-02 |
Family
ID=26612647
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2002069603A Abandoned JP2002358095A (ja) | 2001-03-30 | 2002-03-14 | 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US7228276B2 (enExample) |
| EP (1) | EP1376536A1 (enExample) |
| JP (1) | JP2002358095A (enExample) |
| KR (1) | KR20030007793A (enExample) |
| CN (1) | CN1462428A (enExample) |
| WO (1) | WO2002080141A1 (enExample) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2004252121A (ja) * | 2003-02-20 | 2004-09-09 | Sony Corp | 言語処理装置および言語処理方法、並びにプログラムおよび記録媒体 |
| WO2005122144A1 (ja) * | 2004-06-10 | 2005-12-22 | Matsushita Electric Industrial Co., Ltd. | 音声認識装置、音声認識方法、及びプログラム |
| JP2006171710A (ja) * | 2004-12-10 | 2006-06-29 | Microsoft Corp | 音響情報から意味的な意図を識別するためのシステムおよび方法 |
| WO2007138875A1 (ja) * | 2006-05-31 | 2007-12-06 | Nec Corporation | 音声認識用単語辞書・言語モデル作成システム、方法、プログラムおよび音声認識システム |
| JP2009157119A (ja) * | 2007-12-27 | 2009-07-16 | Univ Of Ryukyus | 音声単語自動獲得方法 |
| US8423354B2 (en) | 2008-05-09 | 2013-04-16 | Fujitsu Limited | Speech recognition dictionary creating support device, computer readable medium storing processing program, and processing method |
| KR20160014465A (ko) * | 2014-07-29 | 2016-02-11 | 삼성전자주식회사 | 전자 장치 및 이의 음성 인식 방법 |
Families Citing this family (51)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070265834A1 (en) * | 2001-09-06 | 2007-11-15 | Einat Melnick | In-context analysis |
| US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
| US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
| US7110949B2 (en) * | 2004-09-13 | 2006-09-19 | At&T Knowledge Ventures, L.P. | System and method for analysis and adjustment of speech-enabled systems |
| US7729478B1 (en) * | 2005-04-12 | 2010-06-01 | Avaya Inc. | Change speed of voicemail playback depending on context |
| EP1884923A4 (en) * | 2005-05-27 | 2009-06-03 | Panasonic Corp | DEVICE, METHOD AND PROGRAM FOR EDITING VOICES |
| US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
| US7620549B2 (en) | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
| US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
| EP1934971A4 (en) | 2005-08-31 | 2010-10-27 | Voicebox Technologies Inc | DYNAMIC LANGUAGE SCRIPTURE |
| KR100717385B1 (ko) * | 2006-02-09 | 2007-05-11 | 삼성전자주식회사 | 인식 후보의 사전적 거리를 이용한 인식 신뢰도 측정 방법및 인식 신뢰도 측정 시스템 |
| JP2007286356A (ja) * | 2006-04-17 | 2007-11-01 | Funai Electric Co Ltd | 電子機器 |
| JP4181590B2 (ja) * | 2006-08-30 | 2008-11-19 | 株式会社東芝 | インタフェース装置及びインタフェース処理方法 |
| US8073681B2 (en) | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
| US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
| DE102007033472A1 (de) * | 2007-07-18 | 2009-01-29 | Siemens Ag | Verfahren zur Spracherkennung |
| US8868410B2 (en) * | 2007-08-31 | 2014-10-21 | National Institute Of Information And Communications Technology | Non-dialogue-based and dialogue-based learning apparatus by substituting for uttered words undefined in a dictionary with word-graphs comprising of words defined in the dictionary |
| US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
| US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
| US9305548B2 (en) | 2008-05-27 | 2016-04-05 | Voicebox Technologies Corporation | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
| US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
| US8064290B2 (en) * | 2009-04-28 | 2011-11-22 | Luidia, Inc. | Digital transcription system utilizing small aperture acoustical sensors |
| US9171541B2 (en) | 2009-11-10 | 2015-10-27 | Voicebox Technologies Corporation | System and method for hybrid processing in a natural language voice services environment |
| US9502025B2 (en) | 2009-11-10 | 2016-11-22 | Voicebox Technologies Corporation | System and method for providing a natural language content dedication service |
| US8645136B2 (en) | 2010-07-20 | 2014-02-04 | Intellisist, Inc. | System and method for efficiently reducing transcription error using hybrid voice transcription |
| CN103229233B (zh) * | 2010-12-10 | 2015-11-25 | 松下电器(美国)知识产权公司 | 用于识别说话人的建模设备和方法、以及说话人识别系统 |
| US9064491B2 (en) * | 2012-05-29 | 2015-06-23 | Nuance Communications, Inc. | Methods and apparatus for performing transformation techniques for data clustering and/or classification |
| CN103219007A (zh) * | 2013-03-27 | 2013-07-24 | 谢东来 | 语音识别方法及装置 |
| US9697828B1 (en) * | 2014-06-20 | 2017-07-04 | Amazon Technologies, Inc. | Keyword detection modeling using contextual and environmental information |
| EP3195145A4 (en) | 2014-09-16 | 2018-01-24 | VoiceBox Technologies Corporation | Voice commerce |
| WO2016044321A1 (en) | 2014-09-16 | 2016-03-24 | Min Tang | Integration of domain information into state transitions of a finite state transducer for natural language processing |
| US9747896B2 (en) | 2014-10-15 | 2017-08-29 | Voicebox Technologies Corporation | System and method for providing follow-up responses to prior natural language inputs of a user |
| US10431214B2 (en) | 2014-11-26 | 2019-10-01 | Voicebox Technologies Corporation | System and method of determining a domain and/or an action related to a natural language input |
| US10614799B2 (en) | 2014-11-26 | 2020-04-07 | Voicebox Technologies Corporation | System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance |
| JP6109451B2 (ja) * | 2014-12-24 | 2017-04-05 | 三菱電機株式会社 | 音声認識装置及び音声認識方法 |
| US10515150B2 (en) * | 2015-07-14 | 2019-12-24 | Genesys Telecommunications Laboratories, Inc. | Data driven speech enabled self-help systems and methods of operating thereof |
| US10382623B2 (en) | 2015-10-21 | 2019-08-13 | Genesys Telecommunications Laboratories, Inc. | Data-driven dialogue enabled self-help systems |
| US10455088B2 (en) | 2015-10-21 | 2019-10-22 | Genesys Telecommunications Laboratories, Inc. | Dialogue flow optimization and personalization |
| CN106935239A (zh) * | 2015-12-29 | 2017-07-07 | 阿里巴巴集团控股有限公司 | 一种发音词典的构建方法及装置 |
| US10331784B2 (en) | 2016-07-29 | 2019-06-25 | Voicebox Technologies Corporation | System and method of disambiguating natural language processing requests |
| US20180254054A1 (en) * | 2017-03-02 | 2018-09-06 | Otosense Inc. | Sound-recognition system based on a sound language and associated annotations |
| US20180268844A1 (en) * | 2017-03-14 | 2018-09-20 | Otosense Inc. | Syntactic system for sound recognition |
| JP6711343B2 (ja) * | 2017-12-05 | 2020-06-17 | カシオ計算機株式会社 | 音声処理装置、音声処理方法及びプログラム |
| JP7000268B2 (ja) * | 2018-07-18 | 2022-01-19 | 株式会社東芝 | 情報処理装置、情報処理方法、およびプログラム |
| US10977872B2 (en) | 2018-10-31 | 2021-04-13 | Sony Interactive Entertainment Inc. | Graphical style modification for video games using machine learning |
| US10854109B2 (en) | 2018-10-31 | 2020-12-01 | Sony Interactive Entertainment Inc. | Color accommodation for on-demand accessibility |
| US11636673B2 (en) | 2018-10-31 | 2023-04-25 | Sony Interactive Entertainment Inc. | Scene annotation using machine learning |
| US11375293B2 (en) | 2018-10-31 | 2022-06-28 | Sony Interactive Entertainment Inc. | Textual annotation of acoustic effects |
| KR20220094400A (ko) * | 2020-12-29 | 2022-07-06 | 현대자동차주식회사 | 대화 시스템, 그를 가지는 차량 및 대화 시스템의 제어 방법 |
| CN115171702B (zh) * | 2022-05-30 | 2024-09-24 | 青岛海尔科技有限公司 | 数字孪生声纹特征处理方法、存储介质及电子装置 |
| CN119495304B (zh) * | 2024-11-06 | 2025-12-12 | 深圳前海微众银行股份有限公司 | 语音识别模型微调方法、电子设备、存储介质及程序产品 |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS5745680A (en) | 1980-08-30 | 1982-03-15 | Fujitsu Ltd | Pattern recognition device |
| JPS6125199A (ja) | 1984-07-14 | 1986-02-04 | 日本電気株式会社 | 音声認識方式 |
| US6243680B1 (en) * | 1998-06-15 | 2001-06-05 | Nortel Networks Limited | Method and apparatus for obtaining a transcription of phrases through text and spoken utterances |
| KR100277694B1 (ko) * | 1998-11-11 | 2001-01-15 | 정선종 | 음성인식시스템에서의 발음사전 자동생성 방법 |
| JP2002160185A (ja) * | 2000-03-31 | 2002-06-04 | Sony Corp | ロボット装置、ロボット装置の行動制御方法、外力検出装置及び外力検出方法 |
-
2002
- 2002-03-14 JP JP2002069603A patent/JP2002358095A/ja not_active Abandoned
- 2002-04-01 KR KR1020027016297A patent/KR20030007793A/ko not_active Withdrawn
- 2002-04-01 EP EP02708744A patent/EP1376536A1/en not_active Withdrawn
- 2002-04-01 CN CN02801646A patent/CN1462428A/zh active Pending
- 2002-04-01 US US10/296,797 patent/US7228276B2/en not_active Expired - Fee Related
- 2002-04-01 WO PCT/JP2002/003248 patent/WO2002080141A1/ja not_active Ceased
Cited By (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2004252121A (ja) * | 2003-02-20 | 2004-09-09 | Sony Corp | 言語処理装置および言語処理方法、並びにプログラムおよび記録媒体 |
| WO2005122144A1 (ja) * | 2004-06-10 | 2005-12-22 | Matsushita Electric Industrial Co., Ltd. | 音声認識装置、音声認識方法、及びプログラム |
| US7813928B2 (en) | 2004-06-10 | 2010-10-12 | Panasonic Corporation | Speech recognition device, speech recognition method, and program |
| JP2006171710A (ja) * | 2004-12-10 | 2006-06-29 | Microsoft Corp | 音響情報から意味的な意図を識別するためのシステムおよび方法 |
| WO2007138875A1 (ja) * | 2006-05-31 | 2007-12-06 | Nec Corporation | 音声認識用単語辞書・言語モデル作成システム、方法、プログラムおよび音声認識システム |
| JP2009157119A (ja) * | 2007-12-27 | 2009-07-16 | Univ Of Ryukyus | 音声単語自動獲得方法 |
| US8423354B2 (en) | 2008-05-09 | 2013-04-16 | Fujitsu Limited | Speech recognition dictionary creating support device, computer readable medium storing processing program, and processing method |
| KR20160014465A (ko) * | 2014-07-29 | 2016-02-11 | 삼성전자주식회사 | 전자 장치 및 이의 음성 인식 방법 |
| KR102246900B1 (ko) | 2014-07-29 | 2021-04-30 | 삼성전자주식회사 | 전자 장치 및 이의 음성 인식 방법 |
Also Published As
| Publication number | Publication date |
|---|---|
| US20040030552A1 (en) | 2004-02-12 |
| EP1376536A1 (en) | 2004-01-02 |
| CN1462428A (zh) | 2003-12-17 |
| WO2002080141A1 (en) | 2002-10-10 |
| KR20030007793A (ko) | 2003-01-23 |
| US7228276B2 (en) | 2007-06-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2002358095A (ja) | 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体 | |
| JP4296714B2 (ja) | ロボット制御装置およびロボット制御方法、記録媒体、並びにプログラム | |
| US7065490B1 (en) | Voice processing method based on the emotion and instinct states of a robot | |
| JP6550068B2 (ja) | 音声認識における発音予測 | |
| JP4510953B2 (ja) | 音声認識におけるノンインタラクティブ方式のエンロールメント | |
| JP2001188555A (ja) | 情報処理装置および方法、並びに記録媒体 | |
| US20230186905A1 (en) | System and method for tone recognition in spoken languages | |
| JP2001188553A (ja) | 音声合成装置および方法、並びに記録媒体 | |
| KR101153078B1 (ko) | 음성 분류 및 음성 인식을 위한 은닉 조건부 랜덤 필드모델 | |
| JP2001154685A (ja) | 音声認識装置および音声認識方法、並びに記録媒体 | |
| JP2002318594A (ja) | 言語処理装置および言語処理方法、並びにプログラムおよび記録媒体 | |
| EP1906386A1 (en) | Using child directed speech to bootstrap a model based speech segmentation and recognition system | |
| KR20030007866A (ko) | 단어열 출력 장치 | |
| JP2002116792A (ja) | ロボット制御装置およびロボット制御方法、並びに記録媒体 | |
| JP4600736B2 (ja) | ロボット制御装置および方法、記録媒体、並びにプログラム | |
| JP4587009B2 (ja) | ロボット制御装置およびロボット制御方法、並びに記録媒体 | |
| JP2001154693A (ja) | ロボット制御装置およびロボット制御方法、並びに記録媒体 | |
| JP2002268663A (ja) | 音声合成装置および音声合成方法、並びにプログラムおよび記録媒体 | |
| JP2003271172A (ja) | 音声合成方法、音声合成装置、プログラム及び記録媒体、並びにロボット装置 | |
| JP2001188782A (ja) | 情報処理装置および方法、並びに記録媒体 | |
| JP2002258886A (ja) | 音声合成装置および音声合成方法、並びにプログラムおよび記録媒体 | |
| JP2002318590A (ja) | 音声合成装置および音声合成方法、並びにプログラムおよび記録媒体 | |
| JP4178777B2 (ja) | ロボット装置、記録媒体、並びにプログラム | |
| JP2003271181A (ja) | 情報処理装置および情報処理方法、並びに記録媒体およびプログラム | |
| JP2002311981A (ja) | 自然言語処理装置および自然言語処理方法、並びにプログラムおよび記録媒体 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20050301 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20050301 |
|
| A762 | Written abandonment of application |
Free format text: JAPANESE INTERMEDIATE CODE: A762 Effective date: 20080826 |