CN104008752B - Speech recognition device and method, and semiconductor integrated circuit device - Google Patents
- Publication number
- CN104008752B (application CN201410065495.6A)
- Authority
- CN
- China
- Prior art keywords
- speech recognition
- sentence
- voice signal
- voice
- word
- Prior art date
- Legal status
- Active
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L13/00—Speech synthesis; Text to speech systems
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
- G10L2015/027—Syllables being the recognition units
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0635—Training updating or merging of old and new templates; Mean values; Weighting
- G10L2015/085—Methods for reducing search complexity, pruning
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Artificial Intelligence (AREA)
Claims (14)
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2013034257A JP6221253B2 (ja) | 2013-02-25 | 2013-02-25 | Speech recognition device and method, and semiconductor integrated circuit device |
JP2013-034257 | 2013-02-25 | ||
JP2013-042664 | 2013-03-05 | ||
JP2013042664A JP6221267B2 (ja) | 2013-03-05 | 2013-03-05 | Speech recognition device and method, and semiconductor integrated circuit device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104008752A (zh) | 2014-08-27 |
CN104008752B (zh) | 2018-08-28 |
Family
ID=51369379
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410065495.6A (Active; CN104008752B (zh)) | 2013-02-25 | 2014-02-25 | Speech recognition device and method, and semiconductor integrated circuit device |
Country Status (2)
Country | Link |
---|---|
US (1) | US9886947B2 (zh) |
CN (1) | CN104008752B (zh) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10134424B2 (en) * | 2015-06-25 | 2018-11-20 | VersaMe, Inc. | Wearable word counter |
US10789939B2 (en) | 2015-06-25 | 2020-09-29 | The University Of Chicago | Wearable word counter |
US10959648B2 (en) | 2015-06-25 | 2021-03-30 | The University Of Chicago | Wearable word counter |
US20170076626A1 (en) * | 2015-09-14 | 2017-03-16 | Seashells Education Software, Inc. | System and Method for Dynamic Response to User Interaction |
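CN105679318A (zh) * | 2015-12-23 | 2016-06-15 | 珠海格力电器股份有限公司 | Display method and device based on speech recognition, display system, and air conditioner |
CN111384051B (zh) * | 2016-03-07 | 2022-09-27 | 杭州海存信息技术有限公司 | Memory with a speech recognition function |
CN106781013A (zh) * | 2017-01-18 | 2017-05-31 | 广东美基沃得科技有限公司 | Automatic vending equipment and automatic vending method |
CN107274891A (zh) * | 2017-05-23 | 2017-10-20 | 武汉秀宝软件有限公司 | AR interface interaction method and system based on a speech recognition engine |
DE102017216571B4 (de) | 2017-09-19 | 2022-10-06 | Volkswagen Aktiengesellschaft | Motor vehicle |
CN109378005A (zh) * | 2017-11-30 | 2019-02-22 | 金超 | Multi-voice discrimination system for an unmanned convenience store |
WO2020103008A1 (zh) * | 2018-11-21 | 2020-05-28 | 深圳市欢太科技有限公司 | Audio detection method, computer-readable storage medium, and electronic device |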
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1543640A (zh) * | 2001-06-14 | 2004-11-03 | 高通股份有限公司 | Method and apparatus for transmitting speech activity in a distributed speech recognition system |
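CN101185115A (zh) * | 2005-05-27 | 2008-05-21 | 松下电器产业株式会社 | Voice editing device, voice editing method, and voice editing program |
CN101625864A (zh) * | 2008-07-10 | 2010-01-13 | 富士通株式会社 | Voice recognition device and voice recognition method |
CN102687197A (zh) * | 2010-01-22 | 2012-09-19 | 三菱电机株式会社 | Recognition dictionary creation device, voice recognition device, and voice synthesis device |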
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH02106800A (ja) | 1988-10-17 | 1990-04-18 | Matsushita Refrig Co Ltd | Speech recognition system |
JPH03231297A (ja) | 1990-02-06 | 1991-10-15 | Matsushita Refrig Co Ltd | Speech recognition system |
JP3006496B2 (ja) | 1996-03-21 | 2000-02-07 | 日本電気株式会社 | Speech recognition device |
JP2001154685A (ja) | 1999-11-30 | 2001-06-08 | Sony Corp | Speech recognition device, speech recognition method, and recording medium |
JP2002182687A (ja) | 2000-12-15 | 2002-06-26 | Alpine Electronics Inc | Data distribution system for an in-vehicle speech recognition noise reduction device, in-vehicle speech recognition noise reduction device, and server |
US20050004788A1 (en) * | 2003-07-03 | 2005-01-06 | Lee Hang Shun Raymond | Multi-level confidence measures for task modeling and its application to task-oriented multi-modal dialog management |
JP2005085433A (ja) | 2003-09-11 | 2005-03-31 | Xanavi Informatics Corp | Playback device and playback method using speech recognition |
JP2008015209A (ja) | 2006-07-05 | 2008-01-24 | Kddi Corp | Speech recognition device, recognition dictionary update method therefor, program, and storage medium |
JP2008064885A (ja) | 2006-09-05 | 2008-03-21 | Honda Motor Co Ltd | Speech recognition device, speech recognition method, and speech recognition program |
JP4471128B2 (ja) | 2006-11-22 | 2010-06-02 | セイコーエプソン株式会社 | Semiconductor integrated circuit device and electronic apparatus |
US8056070B2 (en) * | 2007-01-10 | 2011-11-08 | Goller Michael D | System and method for modifying and updating a speech recognition program |
JP2011039202A (ja) | 2009-08-07 | 2011-02-24 | Aisin Aw Co Ltd | In-vehicle information processing device |
US8775177B1 (en) * | 2012-03-08 | 2014-07-08 | Google Inc. | Speech recognition process |
US9159319B1 (en) * | 2012-12-03 | 2015-10-13 | Amazon Technologies, Inc. | Keyword spotting with competitor models |
- 2014-02-14: US application US14/180,672, patent US9886947B2 (en), status Active
- 2014-02-25: CN application CN201410065495.6A, patent CN104008752B (zh), status Active
Non-Patent Citations (1)
Title |
---|
《信頼度基準による解探索打ち切りに基づ》; 小島弘 et al.; 《電子情報通信学会技術研究報告:信学技報》 (IEICE Technical Report); 2009-01-31; Vol. 108, No. 422; pp. 13-18 * |
Also Published As
Publication number | Publication date |
---|---|
US20140244255A1 (en) | 2014-08-28 |
CN104008752A (zh) | 2014-08-27 |
US9886947B2 (en) | 2018-02-06 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN104008752B (zh) | Speech recognition device and method, and semiconductor integrated circuit device | |
Fagherazzi et al. | Voice for health: the use of vocal biomarkers from research to clinical practice | |
Taylor | Analysis and synthesis of intonation using the tilt model | |
US6839667B2 (en) | Method of speech recognition by presenting N-best word candidates | |
Zue et al. | An expert spectrogram reader: a knowledge-based approach to speech recognition | |
Pao et al. | Mandarin emotional speech recognition based on SVM and NN | |
US9190060B2 (en) | Speech recognition device and method, and semiconductor integrated circuit device | |
JP5824829B2 (ja) | Speech recognition device, speech recognition method, and speech recognition program | |
CN110782875B (zh) | Speech prosody processing method and device based on artificial intelligence | |
CN107086040A (zh) | Method and device for testing speech recognition capability | |
CN105210147B (zh) | Method, device, and computer-readable recording medium for improving at least one set of semantic units | |
US9390709B2 (en) | Voice recognition device and method, and semiconductor integrated circuit device | |
CN111370024B (zh) | Audio adjustment method, device, and computer-readable storage medium | |
CN112908308B (zh) | Audio processing method, apparatus, device, and medium | |
Jacobi | On variation and change in diphthongs and long vowels of spoken Dutch | |
CN110111778A (zh) | Speech processing method and apparatus, storage medium, and electronic device | |
CN106782503A (zh) | Automatic speech recognition method based on physiological information during articulation | |
MacIntyre et al. | Pushing the envelope: Evaluating speech rhythm with different envelope extraction techniques | |
CN108364655A (zh) | Speech processing method, medium, apparatus, and computing device | |
JP2015055653A (ja) | Speech recognition device and method, and electronic apparatus | |
Lee et al. | Acoustic voice variation in spontaneous speech | |
CN107251137B (zh) | Method, device, and computer-readable recording medium for improving a set of at least one semantic unit using speech | |
CN111091810A (zh) | VR game character expression control method based on voice information, and storage medium | |
JP2010060846A (ja) | Synthesized speech evaluation system and synthesized speech evaluation method | |
Whitfield | Exploration of metrics for quantifying formant space: Implications for clinical assessment of Parkinson disease |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2024-01-11. Address after: 15 Adindere Street, Ulanjer, Hungary. Patentee after: Crystal Leap LLC. Address before: Tokyo. Patentee before: Seiko Epson Corp. |
TR01 | Transfer of patent right | Effective date of registration: 2024-06-03. Address after: No.8, Lixing 6th Road, Xinzhu City, Xinzhu Science Industrial Park, Taiwan, China. Patentee after: Taiwan Semiconductor Manufacturing Co.,Ltd. Country or region after: Taiwan, China. Address before: 15 Adindere Street, Ulanjer, Hungary. Patentee before: Crystal Leap LLC. Country or region before: Hungary. |