JP4468264B2 - 多言語による名称の音声認識のための方法とシステム - Google Patents
多言語による名称の音声認識のための方法とシステム Download PDFInfo
- Publication number
- JP4468264B2 JP4468264B2 JP2005228583A JP2005228583A JP4468264B2 JP 4468264 B2 JP4468264 B2 JP 4468264B2 JP 2005228583 A JP2005228583 A JP 2005228583A JP 2005228583 A JP2005228583 A JP 2005228583A JP 4468264 B2 JP4468264 B2 JP 4468264B2
- Authority
- JP
- Japan
- Prior art keywords
- chinese
- speech recognition
- character
- feature vector
- characters
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims description 29
- 239000013598 vector Substances 0.000 claims description 32
- 239000000203 mixture Substances 0.000 claims description 7
- 238000004422 calculation algorithm Methods 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 claims 7
- 241001672694 Citrus reticulata Species 0.000 description 16
- 230000006870 function Effects 0.000 description 15
- 230000001419 dependent effect Effects 0.000 description 9
- 238000004891 communication Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 238000012549 training Methods 0.000 description 5
- 230000003068 static effect Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 238000013515 script Methods 0.000 description 2
- 238000010845 search algorithm Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000001755 vocal effect Effects 0.000 description 2
- 125000002066 L-histidyl group Chemical group [H]N1C([H])=NC(C([H])([H])[C@](C(=O)[*])([H])N([H])[H])=C1[H] 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005315 distribution function Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/081—Search algorithms, e.g. Baum-Welch or Viterbi
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200410056515A CN100592385C (zh) | 2004-08-06 | 2004-08-06 | 用于对多语言的姓名进行语音识别的方法和系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2006048058A JP2006048058A (ja) | 2006-02-16 |
JP4468264B2 true JP4468264B2 (ja) | 2010-05-26 |
Family
ID=35963852
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2005228583A Active JP4468264B2 (ja) | 2004-08-06 | 2005-08-05 | 多言語による名称の音声認識のための方法とシステム |
Country Status (4)
Country | Link |
---|---|
JP (1) | JP4468264B2 (zh) |
KR (1) | KR100769029B1 (zh) |
CN (1) | CN100592385C (zh) |
SG (1) | SG119358A1 (zh) |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5343744B2 (ja) * | 2009-07-24 | 2013-11-13 | 富士通株式会社 | 音声翻訳装置及び音声翻訳方法 |
JP2011033874A (ja) * | 2009-08-03 | 2011-02-17 | Alpine Electronics Inc | 多言語音声認識装置及び多言語音声認識辞書作成方法 |
KR101250897B1 (ko) * | 2009-08-14 | 2013-04-04 | 한국전자통신연구원 | 전자사전에서 음성인식을 이용한 단어 탐색 장치 및 그 방법 |
CN101826325B (zh) * | 2010-03-10 | 2012-04-18 | 华为终端有限公司 | 对中英文语音信号进行识别的方法和装置 |
US10134385B2 (en) * | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
CN102780653B (zh) * | 2012-08-09 | 2016-03-09 | 上海量明科技发展有限公司 | 即时通信中快捷通信的方法、客户端及系统 |
CN103853779A (zh) * | 2012-12-04 | 2014-06-11 | 联想(北京)有限公司 | 一种信息处理方法及电子设备 |
CN103928024B (zh) * | 2013-01-14 | 2017-11-28 | 联想(北京)有限公司 | 一种语音查询方法及电子设备 |
KR101579533B1 (ko) | 2014-10-16 | 2015-12-22 | 현대자동차주식회사 | 차량 및 그 제어 방법 |
CN104900235B (zh) * | 2015-05-25 | 2019-05-28 | 重庆大学 | 基于基音周期混合特征参数的声纹识别方法 |
KR101664080B1 (ko) * | 2015-07-28 | 2016-10-10 | 현대자동차 주식회사 | 음성 다이얼링 시스템 및 방법 |
CN105095509B (zh) * | 2015-09-06 | 2019-01-25 | 百度在线网络技术(北京)有限公司 | 语音搜索方法及装置 |
CN106935239A (zh) * | 2015-12-29 | 2017-07-07 | 阿里巴巴集团控股有限公司 | 一种发音词典的构建方法及装置 |
CN106856091A (zh) * | 2016-12-21 | 2017-06-16 | 北京智能管家科技有限公司 | 一种多语言文本的自动播报方法及系统 |
DE102017200976B4 (de) * | 2017-01-23 | 2018-08-23 | Audi Ag | Verfahren zum Betreiben eines Kraftfahrzeugs mit einer Bedienvorrichtung |
CN109192202B (zh) * | 2018-09-21 | 2023-05-16 | 平安科技(深圳)有限公司 | 语音安全识别方法、装置、计算机设备及存储介质 |
CN112397051B (zh) * | 2019-08-16 | 2024-02-02 | 武汉Tcl集团工业研究院有限公司 | 语音识别方法、装置及终端设备 |
CN110808034A (zh) * | 2019-10-31 | 2020-02-18 | 北京大米科技有限公司 | 语音转换方法、装置、存储介质及电子设备 |
CN112153206B (zh) * | 2020-09-23 | 2022-08-09 | 阿波罗智联(北京)科技有限公司 | 一种联系人匹配方法、装置、电子设备及存储介质 |
CN112652311B (zh) * | 2020-12-01 | 2021-09-03 | 北京百度网讯科技有限公司 | 中英文混合语音识别方法、装置、电子设备和存储介质 |
CN112669841B (zh) * | 2020-12-18 | 2024-07-02 | 平安科技(深圳)有限公司 | 多语种语音的生成模型的训练方法、装置及计算机设备 |
CN113536776B (zh) * | 2021-06-22 | 2024-06-14 | 深圳价值在线信息科技股份有限公司 | 混淆语句的生成方法、终端设备及计算机可读存储介质 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR0136425B1 (ko) * | 1995-01-26 | 1998-05-15 | 조백제 | 의존문법을 후향 언어모델로 사용하는 한국어 연속음성 인식장치 및 방법과 그를 이용한 자동통역시스템 |
CA2185262C (en) * | 1995-09-12 | 2006-08-29 | Michele B. Gammel | Method and system for enrolling addresses in a speech recognition database |
JP3447521B2 (ja) * | 1997-08-25 | 2003-09-16 | Necエレクトロニクス株式会社 | 音声認識ダイアル装置 |
US6314165B1 (en) * | 1998-04-30 | 2001-11-06 | Matsushita Electric Industrial Co., Ltd. | Automated hotel attendant using speech recognition |
JP2000047684A (ja) * | 1998-07-28 | 2000-02-18 | Nec Corp | 音声認識方法および音声サービス装置 |
JP4053151B2 (ja) * | 1998-09-01 | 2008-02-27 | 富士通株式会社 | 放流警報システム |
US6502075B1 (en) * | 1999-03-26 | 2002-12-31 | Koninklijke Philips Electronics, N.V. | Auto attendant having natural names database library |
JP2000352990A (ja) * | 1999-06-14 | 2000-12-19 | Nippon Telegr & Teleph Corp <Ntt> | 外国語音声合成装置 |
JP2001085233A (ja) * | 1999-09-10 | 2001-03-30 | Concorde Denshi Kogyo:Kk | 半閉磁路インダクタおよびその製造法。 |
JP3539548B2 (ja) * | 1999-09-20 | 2004-07-07 | Jfeスチール株式会社 | 加工用高張力熱延鋼板の製造方法 |
KR100423460B1 (ko) * | 2001-07-19 | 2004-03-18 | 한국전자통신연구원 | 주제어 인식이 가능한 음성인식시스템 및 방법 |
US7496498B2 (en) * | 2003-03-24 | 2009-02-24 | Microsoft Corporation | Front-end architecture for a multi-lingual text-to-speech system |
US7684988B2 (en) * | 2004-10-15 | 2010-03-23 | Microsoft Corporation | Testing and tuning of automatic speech recognition systems using synthetic inputs generated from its acoustic models |
-
2004
- 2004-08-06 CN CN200410056515A patent/CN100592385C/zh not_active Expired - Lifetime
-
2005
- 2005-08-01 SG SG200504797A patent/SG119358A1/en unknown
- 2005-08-05 JP JP2005228583A patent/JP4468264B2/ja active Active
- 2005-08-05 KR KR1020050071867A patent/KR100769029B1/ko active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
KR20060050277A (ko) | 2006-05-19 |
SG119358A1 (en) | 2006-02-28 |
JP2006048058A (ja) | 2006-02-16 |
CN1731511A (zh) | 2006-02-08 |
CN100592385C (zh) | 2010-02-24 |
KR100769029B1 (ko) | 2007-10-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4468264B2 (ja) | 多言語による名称の音声認識のための方法とシステム | |
KR100679042B1 (ko) | 음성인식 방법 및 장치, 이를 이용한 네비게이션 시스템 | |
JP3962763B2 (ja) | 対話支援装置 | |
JP5480760B2 (ja) | 端末装置、音声認識方法および音声認識プログラム | |
KR101109265B1 (ko) | 텍스트 입력 방법 | |
US20050049870A1 (en) | Open vocabulary speech recognition | |
JP2007500367A (ja) | 音声認識方法およびコミュニケーション機器 | |
JP5703491B2 (ja) | 言語モデル・音声認識辞書作成装置及びそれらにより作成された言語モデル・音声認識辞書を用いた情報処理装置 | |
JPH11119791A (ja) | 音声感情認識システムおよび方法 | |
JP2007538278A (ja) | 音声認識システム | |
JP2003308090A (ja) | 音声認識装置、音声認識方法および音声認識プログラム | |
CN111916062B (zh) | 语音识别方法、装置和系统 | |
US20070016420A1 (en) | Dictionary lookup for mobile devices using spelling recognition | |
JP2002116793A (ja) | データ入力システム及びその方法 | |
US20080270128A1 (en) | Text Input System and Method Based on Voice Recognition | |
KR102069697B1 (ko) | 자동 통역 장치 및 방법 | |
JP4230142B2 (ja) | 悪環境下でのキーパッド/音声を用いたハイブリッドな東洋文字認識技術 | |
KR101250897B1 (ko) | 전자사전에서 음성인식을 이용한 단어 탐색 장치 및 그 방법 | |
Mittal et al. | Speaker-independent automatic speech recognition system for mobile phone applications in Punjabi | |
JP2004170466A (ja) | 音声認識方法と電子装置 | |
JP2003108551A (ja) | 携帯型機械翻訳装置、翻訳方法及び翻訳プログラム | |
KR20030010979A (ko) | 의미어단위 모델을 이용한 연속음성인식방법 및 장치 | |
JP2002073081A (ja) | 音声認識方法と電子装置 | |
KR100777569B1 (ko) | 멀티모달을 이용한 음성 인식 방법 및 그 장치 | |
JP2000056796A (ja) | 音声入力装置および方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20090127 |
|
A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20090427 |
|
A602 | Written permission of extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A602 Effective date: 20090501 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20090727 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20100202 |
|
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20100224 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 4468264 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130305 Year of fee payment: 3 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130305 Year of fee payment: 3 |
|
S111 | Request for change of ownership or part of ownership |
Free format text: JAPANESE INTERMEDIATE CODE: R313111 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130305 Year of fee payment: 3 |
|
R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
S533 | Written request for registration of change of name |
Free format text: JAPANESE INTERMEDIATE CODE: R313533 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130305 Year of fee payment: 3 |
|
R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20130305 Year of fee payment: 3 |
|
FPAY | Renewal fee payment (event date is renewal date of database) |
Free format text: PAYMENT UNTIL: 20140305 Year of fee payment: 4 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
S111 | Request for change of ownership or part of ownership |
Free format text: JAPANESE INTERMEDIATE CODE: R313113 |
|
S531 | Written request for registration of change of domicile |
Free format text: JAPANESE INTERMEDIATE CODE: R313531 |
|
R350 | Written notification of registration of transfer |
Free format text: JAPANESE INTERMEDIATE CODE: R350 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |