JP2022515048A5 - Google Patents
Info
- Publication number
- JP2022515048A5 (application JP2021533448A)
- Authority
- JP
- Japan
- Prior art keywords
- script
- words
- speech recognition
- recognition model
- transcribing
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201862778431P | 2018-12-12 | 2018-12-12 | |
| US62/778,431 | 2018-12-12 | | |
| PCT/US2019/017258 WO2020122974A1 (en) | 2018-12-12 | 2019-02-08 | Transliteration for speech recognition training and scoring |
Publications (4)
| Publication Number | Publication Date |
|---|---|
| JP2022515048A (ja) | 2022-02-17 |
| JP2022515048A5 | 2022-08-25 |
| JPWO2020122974A5 | 2022-08-25 |
| JP7208399B2 (ja) | 2023-01-18 |
Family
ID=65520451
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021533448A (Active) JP7208399B2 (ja) | 2018-12-12 | 2019-02-08 | Transliteration for speech recognition training and scoring |
Country Status (5)
| Country | Link |
|---|---|
| EP (1) | EP3877973B1 |
| JP (1) | JP7208399B2 |
| KR (1) | KR102731583B1 |
| CN (1) | CN113396455B |
| WO (1) | WO2020122974A1 |
Families Citing this family (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
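| CN114420159B (zh) * | 2020-10-12 | 2025-04-18 | 苏州声通信息科技有限公司 | Audio evaluation method and device, and non-transitory storage medium |
| US11568858B2 (en) * | 2020-10-17 | 2023-01-31 | International Business Machines Corporation | Transliteration based data augmentation for training multilingual ASR acoustic models in low resource settings |
| CN113626563A (zh) * | 2021-08-30 | 2021-11-09 | 京东方科技集团股份有限公司 | Method for training a natural language processing model, natural language processing method, and electronic device |
| CN113889105B (zh) * | 2021-09-29 | 2025-07-04 | 北京搜狗科技发展有限公司 | Speech translation method and device, and device for speech translation |
| CN114118108A (zh) * | 2021-11-11 | 2022-03-01 | 支付宝(杭州)信息技术有限公司 | Method for building a transliteration model, transliteration method, and corresponding devices |
| CN114299930B (zh) * | 2021-12-21 | 2025-03-14 | 广州虎牙科技有限公司 | End-to-end speech recognition model processing method, speech recognition method, and related device |
| CN114520001B (zh) * | 2022-03-22 | 2025-08-01 | 科大讯飞股份有限公司 | Speech recognition method, apparatus, device, and storage medium |
| KR102616598B1 (ko) * | 2023-05-30 | 2023-12-22 | 주식회사 엘솔루 | Method for generating parallel data of original-text subtitles using translated subtitles |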
Family Cites Families (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8335688B2 (en) * | 2004-08-20 | 2012-12-18 | Multimodal Technologies, Llc | Document transcription system training |
| US20080221866A1 (en) * | 2007-03-06 | 2008-09-11 | Lalitesh Katragadda | Machine Learning For Transliteration |
| JP2009157888A (ja) | 2007-12-28 | 2009-07-16 | National Institute Of Information & Communication Technology | Transliteration model creation device, transliteration device, and computer programs therefor |
| US7472061B1 (en) * | 2008-03-31 | 2008-12-30 | International Business Machines Corporation | Systems and methods for building a native language phoneme lexicon having native pronunciations of non-native words derived from non-native pronunciations |
| WO2009129315A1 (en) | 2008-04-15 | 2009-10-22 | Mobile Technologies, Llc | System and methods for maintaining speech-to-speech translation in the field |
| US9176936B2 (en) * | 2012-09-28 | 2015-11-03 | International Business Machines Corporation | Transliteration pair matching |
| US10540957B2 (en) * | 2014-12-15 | 2020-01-21 | Baidu Usa Llc | Systems and methods for speech transcription |
| JP2018028848A (ja) | 2016-08-19 | 2018-02-22 | 日本放送協会 | Conversion processing device, transliteration processing device, and program |
| US10255909B2 (en) * | 2017-06-29 | 2019-04-09 | Intel IP Corporation | Statistical-analysis-based reset of recurrent neural networks for automatic speech recognition |
2019
- 2019-02-08: KR application KR1020217017741A, patent KR102731583B1 (ko), status Active
- 2019-02-08: EP application EP19707226.7A, patent EP3877973B1 (en), status Active
- 2019-02-08: CN application CN201980082043.XA, patent CN113396455B (zh), status Active
- 2019-02-08: JP application JP2021533448A, patent JP7208399B2 (ja), status Active
- 2019-02-08: WO application PCT/US2019/017258, publication WO2020122974A1 (en), status Ceased
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2022515048A5 | | |
| US11664020B2 (en) | | Speech recognition method and apparatus |
| US10573296B1 (en) | | Reconciliation between simulator and speech recognition output using sequence-to-sequence mapping |
| US20230230576A1 (en) | | Text-to-speech synthesis method and system, and a method of training a text-to-speech synthesis system |
| US9721561B2 (en) | | Method and apparatus for speech recognition using neural networks with speaker adaptation |
| WO2020068790A1 (en) | | Conversational agent pipeline trained on synthetic data |
| US10235991B2 (en) | | Hybrid phoneme, diphone, morpheme, and word-level deep neural networks |
| US20160071512A1 (en) | | Multilingual prosody generation |
| CN106297800B (zh) | | Adaptive speech recognition method and device |
| JP7055630B2 (ja) | | Learning method, learning device, computer program, and storage medium for speech recognition |
| WO2017136016A1 (en) | | Re-recognizing speech with external data sources |
| US8447603B2 (en) | | Rating speech naturalness of speech utterances based on a plurality of human testers |
| JP2007279744A5 | | |
| CN112634866A (zh) | | Speech synthesis model training and speech synthesis method, apparatus, device, and medium |
| KR20240122776A (ko) | | Adaptation and training of neural speech synthesis |
| CN111933121B (zh) | | Acoustic model training method and device |
| CN108039168B (zh) | | Acoustic model optimization method and device |
| US20130030808A1 (en) | | Computer-Implemented Systems and Methods for Scoring Concatenated Speech Responses |
| KR20190012419A (ko) | | System and method for automatic evaluation of speech fluency |
| WO2015025788A1 (ja) | | Quantitative F0 pattern generation device and method, and model learning device and method for F0 pattern generation |
| JPWO2020122974A5 | | |
| CN112259084A (zh) | | Speech recognition method, device, and storage medium |
| CN119993196A (zh) | | Method, apparatus, device, and medium for acquiring speech training data |
| JP2013214016A (ja) | | Acoustic model performance evaluation device, method, and program |
| CN114267339A (zh) | | Speech recognition processing method and system, device, and storage medium |