JPWO2022185437A5 - - Google Patents
Download PDFInfo
- Publication number
- JPWO2022185437A5 JPWO2022185437A5 JP2023503251A JP2023503251A JPWO2022185437A5 JP WO2022185437 A5 JPWO2022185437 A5 JP WO2022185437A5 JP 2023503251 A JP2023503251 A JP 2023503251A JP 2023503251 A JP2023503251 A JP 2023503251A JP WO2022185437 A5 JPWO2022185437 A5 JP WO2022185437A5
- Authority
- JP
- Japan
- Prior art keywords
- probability
- sequence
- data
- voice
- phoneme
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000013528 artificial neural network Methods 0.000 claims 5
- 238000000034 method Methods 0.000 claims 4
- 230000001537 neural effect Effects 0.000 claims 3
- 238000004590 computer program Methods 0.000 claims 2
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/JP2021/008106 WO2022185437A1 (ja) | 2021-03-03 | 2021-03-03 | 音声認識装置、音声認識方法、学習装置、学習方法、及び、記録媒体 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JPWO2022185437A1 JPWO2022185437A1 (https=) | 2022-09-09 |
| JPWO2022185437A5 true JPWO2022185437A5 (https=) | 2023-11-10 |
| JP7605289B2 JP7605289B2 (ja) | 2024-12-24 |
Family
ID=83153997
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023503251A Active JP7605289B2 (ja) | 2021-03-03 | 2021-03-03 | 音声認識装置、音声認識方法、学習装置、学習方法、及び、記録媒体 |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20240144915A1 (https=) |
| JP (1) | JP7605289B2 (https=) |
| WO (1) | WO2022185437A1 (https=) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118891636A (zh) * | 2023-02-20 | 2024-11-01 | 株式会社日立高新技术 | 模型生成系统以及模型生成方法 |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2013072974A (ja) | 2011-09-27 | 2013-04-22 | Toshiba Corp | 音声認識装置、方法及びプログラム |
| JP6876543B2 (ja) | 2017-06-29 | 2021-05-26 | 日本放送協会 | 音素認識辞書生成装置および音素認識装置ならびにそれらのプログラム |
| US10210860B1 (en) * | 2018-07-27 | 2019-02-19 | Deepgram, Inc. | Augmented generalized deep learning with special vocabulary |
-
2021
- 2021-03-03 JP JP2023503251A patent/JP7605289B2/ja active Active
- 2021-03-03 WO PCT/JP2021/008106 patent/WO2022185437A1/ja not_active Ceased
- 2021-03-03 US US18/279,134 patent/US20240144915A1/en not_active Abandoned
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU2019395322B2 (en) | Reconciliation between simulated data and speech recognition output using sequence-to-sequence mapping | |
| CN107871496B (zh) | 语音识别方法和装置 | |
| CN115116443B (zh) | 语音识别模型的训练方法、装置、电子设备及存储介质 | |
| CN111179917B (zh) | 语音识别模型训练方法、系统、移动终端及存储介质 | |
| CN113555006B (zh) | 一种语音信息识别方法、装置、电子设备及存储介质 | |
| CN116778967B (zh) | 基于预训练模型的多模态情感识别方法及装置 | |
| CN113326367A (zh) | 基于端到端文本生成的任务型对话方法和系统 | |
| CN115050351B (zh) | 生成时间戳的方法、装置及计算机设备 | |
| CN112700796B (zh) | 一种基于交互式注意力模型的语音情感识别方法 | |
| KR20220090171A (ko) | 음성 인식 장치, 프로그램 및 그것의 학습 제어 방법 | |
| CN112397056A (zh) | 语音评测方法及计算机存储介质 | |
| CN115240710B (zh) | 基于神经网络的多尺度融合的发音评测模型优化方法 | |
| CN109754784A (zh) | 训练滤波模型的方法和语音识别的方法 | |
| CN110598208A (zh) | Ai/ml增强发音课程设计和个性化练习计划方法 | |
| CN114333778A (zh) | 一种语音识别方法、装置、存储介质及设备 | |
| Lakshminarayanan et al. | Automated speech therapy through personalized pronunciation correction using reinforcement learning and large language models | |
| Minematsu et al. | Speech structure and its application to robust speech processing | |
| JPWO2022185437A5 (https=) | ||
| CN116787437A (zh) | 基于语音处理的辩论机器人的控制方法、装置、介质 | |
| CN115691481B (zh) | 一种基于门控卷积的老年方言语音识别方法及介质 | |
| CN112015921A (zh) | 一种基于学习辅助知识图谱的自然语言处理方法 | |
| CN118535683A (zh) | 人工智能驱动的多功能英语语言学习和评估方法及其应用 | |
| KR102363955B1 (ko) | 녹음의 품질을 평가하는 방법 및 시스템 | |
| CN113096646B (zh) | 音频识别方法、装置、电子设备及存储介质 | |
| CN114519104A (zh) | 动作标签标注方法及装置 |