CN1731511A - Method and system for voice recognition of names in multiple languages - Google Patents
Method and system for voice recognition of names in multiple languages
- Publication number
- CN1731511A
- Authority
- CN
- China
- Prior art keywords
- name
- language
- speech recognition
- orderly
- voice unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/005—Language recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L2015/081—Search algorithms, e.g. Baum-Welch or Viterbi
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
Abstract
Description
Syllable | Initial | Final |
---|---|---|
Nei | n_e | Ei |
Tuo | t_u | Uo |
Fa | f_a | A |
Ya | zero_I | Ia |
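The table above pairs each syllable with an initial unit and a final unit. As a minimal sketch of how such a table could be used, the Python below stores the four listed rows and expands a sequence of syllables into an ordered sequence of units. The names `SYLLABLE_UNITS` and `to_unit_sequence` are hypothetical, the table is limited to the rows shown here, and nothing in this record states that the patented system performs the lookup this way.

```python
# Rows copied from the syllable table above: syllable -> (initial, final).
# Only the four syllables listed in this record are included.
SYLLABLE_UNITS = {
    "Nei": ("n_e", "Ei"),
    "Tuo": ("t_u", "Uo"),
    "Fa": ("f_a", "A"),
    "Ya": ("zero_I", "Ia"),
}


def to_unit_sequence(syllables):
    """Expand syllables into an ordered list of initial/final units.

    Hypothetical helper for illustration; raises KeyError for any
    syllable that is not in the illustrative table.
    """
    units = []
    for syllable in syllables:
        if syllable not in SYLLABLE_UNITS:
            raise KeyError(f"syllable not in the illustrative table: {syllable!r}")
        initial, final = SYLLABLE_UNITS[syllable]
        units.extend([initial, final])
    return units


print(to_unit_sequence(["Ya", "Fa"]))  # ['zero_I', 'Ia', 'f_a', 'A']
```

A plain dictionary keeps the sketch short; a fuller inventory would need every syllable of the language, which this record does not list.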
Accuracy | Monolingual | Mixed-language | Cross error |
---|---|---|---|
Mandarin | 98.55% | 96.77% | 1.78% |
English | 95.01% | 94.04% | 0.97% |
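The last column of the accuracy table matches the gap between the first two columns in both rows. The sketch below checks that arithmetic, assuming the cross error is defined as monolingual accuracy minus mixed-language accuracy. That definition is an inference from the numbers, not something this record states, and the names `ACCURACY` and `cross_error` are hypothetical.

```python
from decimal import Decimal

# Accuracy figures copied from the table above, in percent:
# language -> (monolingual accuracy, mixed-language accuracy).
ACCURACY = {
    "Mandarin": (Decimal("98.55"), Decimal("96.77")),
    "English": (Decimal("95.01"), Decimal("94.04")),
}


def cross_error(monolingual, mixed):
    """Accuracy lost when moving from monolingual to mixed-language recognition.

    Assumed definition: the plain difference of the two accuracy figures.
    """
    return monolingual - mixed


for language, (mono, mixed) in ACCURACY.items():
    print(language, cross_error(mono, mixed))
# Mandarin 1.78
# English 0.97
```

`Decimal` is used instead of `float` so the differences come out as exact two-digit values.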
Claims (18)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200410056515A CN100592385C (zh) | 2004-08-06 | 2004-08-06 | Method and system for voice recognition of names in multiple languages |
SG200504797A SG119358A1 (en) | 2004-08-06 | 2005-08-01 | Method and system for voice recognition of names in multiple languages |
KR1020050071867A KR100769029B1 (ko) | 2004-08-06 | 2005-08-05 | Method and system for voice recognition of names in multiple languages |
JP2005228583A JP4468264B2 (ja) | 2004-08-06 | 2005-08-05 | Method and system for voice recognition of names in multiple languages |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200410056515A CN100592385C (zh) | 2004-08-06 | 2004-08-06 | Method and system for voice recognition of names in multiple languages |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1731511A (zh) | 2006-02-08 |
CN100592385C (zh) | 2010-02-24 |
Family
ID=35963852
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200410056515A (Expired - Lifetime) CN100592385C (zh) | 2004-08-06 | 2004-08-06 | Method and system for voice recognition of names in multiple languages |
Country Status (4)
Country | Link |
---|---|
JP (1) | JP4468264B2 (zh) |
KR (1) | KR100769029B1 (zh) |
CN (1) | CN100592385C (zh) |
SG (1) | SG119358A1 (zh) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
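JP5343744B2 (ja) * | 2009-07-24 | 2013-11-13 | 富士通株式会社 | Speech translation apparatus and speech translation method |
JP2011033874A (ja) * | 2009-08-03 | 2011-02-17 | Alpine Electronics Inc | Multilingual speech recognition device and method for creating a multilingual speech recognition dictionary |
KR101250897B1 (ko) * | 2009-08-14 | 2013-04-04 | 한국전자통신연구원 | Apparatus and method for word search using speech recognition in an electronic dictionary |
CN101826325B (zh) * | 2010-03-10 | 2012-04-18 | 华为终端有限公司 | Method and device for recognizing Chinese and English speech signals |
CN102780653B (zh) * | 2012-08-09 | 2016-03-09 | 上海量明科技发展有限公司 | Method, client and system for quick communication in instant messaging |
KR101579533B1 (ko) | 2014-10-16 | 2015-12-22 | 현대자동차주식회사 | Vehicle and control method thereof |
CN104900235B (zh) * | 2015-05-25 | 2019-05-28 | 重庆大学 | Voiceprint recognition method based on pitch-period hybrid feature parameters |
KR101664080B1 (ko) * | 2015-07-28 | 2016-10-10 | 현대자동차 주식회사 | Voice dialing system and method |
CN112652311B (zh) * | 2020-12-01 | 2021-09-03 | 北京百度网讯科技有限公司 | Chinese-English mixed speech recognition method and apparatus, electronic device and storage medium |
CN112669841B (zh) * | 2020-12-18 | 2024-07-02 | 平安科技(深圳)有限公司 | Training method and apparatus for a multilingual speech generation model, and computer device |
CN113536776B (zh) * | 2021-06-22 | 2024-06-14 | 深圳价值在线信息科技股份有限公司 | Method for generating confusable sentences, terminal device and computer-readable storage medium |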
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR0136425B1 (ko) * | 1995-01-26 | 1998-05-15 | 조백제 | Korean continuous speech recognition apparatus and method using dependency grammar as a backward language model, and automatic interpretation system using the same |
MY119374A (en) * | 1995-09-12 | 2005-05-31 | Texas Instruments Inc | Method and system for enrolling addresses in a speech recognition database |
JP3447521B2 (ja) * | 1997-08-25 | 2003-09-16 | Necエレクトロニクス株式会社 | Speech recognition dialing device |
US6314165B1 (en) * | 1998-04-30 | 2001-11-06 | Matsushita Electric Industrial Co., Ltd. | Automated hotel attendant using speech recognition |
JP2000047684A (ja) * | 1998-07-28 | 2000-02-18 | Nec Corp | Speech recognition method and voice service device |
JP4053151B2 (ja) * | 1998-09-01 | 2008-02-27 | 富士通株式会社 | Water discharge warning system |
US6502075B1 (en) * | 1999-03-26 | 2002-12-31 | Koninklijke Philips Electronics, N.V. | Auto attendant having natural names database library |
JP2000352990A (ja) * | 1999-06-14 | 2000-12-19 | Nippon Telegr & Teleph Corp <Ntt> | Foreign-language speech synthesis device |
JP2001085233A (ja) * | 1999-09-10 | 2001-03-30 | Concorde Denshi Kogyo:Kk | Semi-closed magnetic circuit inductor and method of manufacturing the same |
JP3539548B2 (ja) * | 1999-09-20 | 2004-07-07 | Jfeスチール株式会社 | Method for manufacturing high-tensile hot-rolled steel sheet for working |
KR100423460B1 (ko) * | 2001-07-19 | 2004-03-18 | 한국전자통신연구원 | Speech recognition system and method capable of keyword recognition |
US7496498B2 (en) * | 2003-03-24 | 2009-02-24 | Microsoft Corporation | Front-end architecture for a multi-lingual text-to-speech system |
US7684988B2 (en) * | 2004-10-15 | 2010-03-23 | Microsoft Corporation | Testing and tuning of automatic speech recognition systems using synthetic inputs generated from its acoustic models |
2004
- 2004-08-06 CN: application CN200410056515A, patent CN100592385C, status Expired - Lifetime
2005
- 2005-08-01 SG: application SG200504797A, patent SG119358A1, status unknown
- 2005-08-05 JP: application JP2005228583A, patent JP4468264B2, status Active
- 2005-08-05 KR: application KR1020050071867A, patent KR100769029B1, status IP Right Grant
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11069336B2 (en) | 2012-03-02 | 2021-07-20 | Apple Inc. | Systems and methods for name pronunciation |
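CN107680581A (zh) * | 2012-03-02 | 2018-02-09 | 苹果公司 | Systems and methods for name pronunciation |
CN103853779A (zh) * | 2012-12-04 | 2014-06-11 | 联想(北京)有限公司 | Information processing method and electronic device |
CN103928024A (zh) * | 2013-01-14 | 2014-07-16 | 联想(北京)有限公司 | Voice query method and electronic device |
CN105095509B (zh) * | 2015-09-06 | 2019-01-25 | 百度在线网络技术(北京)有限公司 | Voice search method and apparatus |
CN105095509A (zh) * | 2015-09-06 | 2015-11-25 | 百度在线网络技术(北京)有限公司 | Voice search method and apparatus |
CN106935239A (zh) * | 2015-12-29 | 2017-07-07 | 阿里巴巴集团控股有限公司 | Method and apparatus for constructing a pronunciation dictionary |
CN106856091A (zh) * | 2016-12-21 | 2017-06-16 | 北京智能管家科技有限公司 | Method and system for automatic broadcasting of multilingual text |
CN110199349A (zh) * | 2017-01-23 | 2019-09-03 | 奥迪股份公司 | Method for operating a motor vehicle having an operating device |
CN110199349B (zh) * | 2017-01-23 | 2023-03-21 | 奥迪股份公司 | Method for operating a motor vehicle having an operating device |
CN109192202A (zh) * | 2018-09-21 | 2019-01-11 | 平安科技(深圳)有限公司 | Voice security recognition method and apparatus, computer device and storage medium |
CN112397051A (zh) * | 2019-08-16 | 2021-02-23 | 武汉Tcl集团工业研究院有限公司 | Speech recognition method and apparatus, and terminal device |
CN112397051B (zh) * | 2019-08-16 | 2024-02-02 | 武汉Tcl集团工业研究院有限公司 | Speech recognition method and apparatus, and terminal device |
CN110808034A (zh) * | 2019-10-31 | 2020-02-18 | 北京大米科技有限公司 | Voice conversion method and apparatus, storage medium and electronic device |
CN112153206A (zh) * | 2020-09-23 | 2020-12-29 | 北京百度网讯科技有限公司 | Contact matching method and apparatus, electronic device and storage medium |
CN112153206B (zh) * | 2020-09-23 | 2022-08-09 | 阿波罗智联(北京)科技有限公司 | Contact matching method and apparatus, electronic device and storage medium |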
Also Published As
Publication number | Publication date |
---|---|
CN100592385C (zh) | 2010-02-24 |
KR100769029B1 (ko) | 2007-10-22 |
JP2006048058A (ja) | 2006-02-16 |
JP4468264B2 (ja) | 2010-05-26 |
KR20060050277A (ko) | 2006-05-19 |
SG119358A1 (en) | 2006-02-28 |
Similar Documents
Publication | Title |
---|---|
KR100769029B1 (ko) | Method and system for voice recognition of names in multiple languages |
US8364487B2 (en) | Speech recognition system with display information |
US8290775B2 (en) | Pronunciation correction of text-to-speech systems between different spoken languages |
US8229747B2 (en) | System and method for spelling recognition using speech and non-speech input |
EP1291848B1 (en) | Multilingual pronunciations for speech recognition |
US7818166B2 (en) | Method and apparatus for intention based communications for mobile communication devices |
US20030023426A1 (en) | Japanese language entry mechanism for small keypads |
US20020198715A1 (en) | Artificial language generation |
KR20060043845A (ko) | Method and system for improving acquisition of new word pronunciations using a pronunciation graph |
CN101681365A (zh) | Method and apparatus for distributed voice search |
KR20050071334A (ko) | Text input method |
CN1742273A (zh) | Multimodal speech-to-speech language translation and display |
JP2007538278A (ja) | Speech recognition system |
CN112580335B (zh) | Method and apparatus for disambiguating polyphonic characters |
CN1758211A (zh) | Efficient multimodal method for providing input to a computing device |
CN1359514A (zh) | Multimodal data input device |
CN112489634A (zh) | Acoustic model training method and apparatus for a language, electronic device and computer medium |
EP3185132A1 (en) | Method for writing a foreign language in a pseudo language phonetically resembling native language of the speaker |
Mittal et al. | Speaker-independent automatic speech recognition system for mobile phone applications in Punjabi |
JP2002268680A (ja) | Hybrid oriental character recognition technique using keypad/voice in adverse environments |
KR20110017600A (ko) | Apparatus and method for word search using speech recognition in an electronic dictionary |
KR100910302B1 (ko) | Multimodal-based information retrieval apparatus and method |
CN111489742A (zh) | Acoustic model training method, speech recognition method and apparatus, and electronic device |
EP1187431A1 (en) | Portable terminal with voice dialing minimizing memory usage |
CN1979636A (zh) | Method for converting phonetic symbols to speech |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
ASS | Succession or assignment of patent right | Owner name: MOTOROLA MOBILE CO., LTD; Free format text: FORMER OWNER: MOTOROLA INC.; Effective date: 20110120 |
C41 | Transfer of patent application or patent right or utility model | |
TR01 | Transfer of patent right | Effective date of registration: 20110120; Address after: Illinois State; Patentee after: MOTOROLA MOBILITY, Inc.; Address before: Illinois, USA; Patentee before: Motorola, Inc. |
C41 | Transfer of patent application or patent right or utility model | |
C56 | Change in the name or address of the patentee | |
CP01 | Change in the name or title of a patent holder | Address after: Illinois State; Patentee after: MOTOROLA MOBILITY LLC; Address before: Illinois State; Patentee before: MOTOROLA MOBILITY, Inc. |
TR01 | Transfer of patent right | Effective date of registration: 20160311; Address after: California, USA; Patentee after: Google Technology Holdings LLC; Address before: Illinois State; Patentee before: MOTOROLA MOBILITY LLC |
CX01 | Expiry of patent term | Granted publication date: 20100224 |