JPWO2022185437A5 - - Google Patents

Download PDF

Info

Publication number
JPWO2022185437A5
JPWO2022185437A5 JP2023503251A JP2023503251A JPWO2022185437A5 JP WO2022185437 A5 JPWO2022185437 A5 JP WO2022185437A5 JP 2023503251 A JP2023503251 A JP 2023503251A JP 2023503251 A JP2023503251 A JP 2023503251A JP WO2022185437 A5 JPWO2022185437 A5 JP WO2022185437A5
Authority
JP
Japan
Prior art keywords
probability
sequence
data
voice
phoneme
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2023503251A
Other languages
English (en)
Japanese (ja)
Other versions
JP7605289B2 (ja
JPWO2022185437A1 (https=
Filing date
Publication date
Application filed filed Critical
Priority claimed from PCT/JP2021/008106 external-priority patent/WO2022185437A1/ja
Publication of JPWO2022185437A1 publication Critical patent/JPWO2022185437A1/ja
Publication of JPWO2022185437A5 publication Critical patent/JPWO2022185437A5/ja
Application granted granted Critical
Publication of JP7605289B2 publication Critical patent/JP7605289B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2023503251A 2021-03-03 2021-03-03 音声認識装置、音声認識方法、学習装置、学習方法、及び、記録媒体 Active JP7605289B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2021/008106 WO2022185437A1 (ja) 2021-03-03 2021-03-03 音声認識装置、音声認識方法、学習装置、学習方法、及び、記録媒体

Publications (3)

Publication Number Publication Date
JPWO2022185437A1 JPWO2022185437A1 (https=) 2022-09-09
JPWO2022185437A5 true JPWO2022185437A5 (https=) 2023-11-10
JP7605289B2 JP7605289B2 (ja) 2024-12-24

Family

ID=83153997

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023503251A Active JP7605289B2 (ja) 2021-03-03 2021-03-03 音声認識装置、音声認識方法、学習装置、学習方法、及び、記録媒体

Country Status (3)

Country Link
US (1) US20240144915A1 (https=)
JP (1) JP7605289B2 (https=)
WO (1) WO2022185437A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118891636A (zh) * 2023-02-20 2024-11-01 株式会社日立高新技术 模型生成系统以及模型生成方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013072974A (ja) 2011-09-27 2013-04-22 Toshiba Corp 音声認識装置、方法及びプログラム
JP6876543B2 (ja) 2017-06-29 2021-05-26 日本放送協会 音素認識辞書生成装置および音素認識装置ならびにそれらのプログラム
US10210860B1 (en) * 2018-07-27 2019-02-19 Deepgram, Inc. Augmented generalized deep learning with special vocabulary

Similar Documents

Publication Publication Date Title
AU2019395322B2 (en) Reconciliation between simulated data and speech recognition output using sequence-to-sequence mapping
CN107871496B (zh) 语音识别方法和装置
CN115116443B (zh) 语音识别模型的训练方法、装置、电子设备及存储介质
CN111179917B (zh) 语音识别模型训练方法、系统、移动终端及存储介质
CN113555006B (zh) 一种语音信息识别方法、装置、电子设备及存储介质
CN116778967B (zh) 基于预训练模型的多模态情感识别方法及装置
CN113326367A (zh) 基于端到端文本生成的任务型对话方法和系统
CN115050351B (zh) 生成时间戳的方法、装置及计算机设备
CN112700796B (zh) 一种基于交互式注意力模型的语音情感识别方法
KR20220090171A (ko) 음성 인식 장치, 프로그램 및 그것의 학습 제어 방법
CN112397056A (zh) 语音评测方法及计算机存储介质
CN115240710B (zh) 基于神经网络的多尺度融合的发音评测模型优化方法
CN109754784A (zh) 训练滤波模型的方法和语音识别的方法
CN110598208A (zh) Ai/ml增强发音课程设计和个性化练习计划方法
CN114333778A (zh) 一种语音识别方法、装置、存储介质及设备
Lakshminarayanan et al. Automated speech therapy through personalized pronunciation correction using reinforcement learning and large language models
Minematsu et al. Speech structure and its application to robust speech processing
JPWO2022185437A5 (https=)
CN116787437A (zh) 基于语音处理的辩论机器人的控制方法、装置、介质
CN115691481B (zh) 一种基于门控卷积的老年方言语音识别方法及介质
CN112015921A (zh) 一种基于学习辅助知识图谱的自然语言处理方法
CN118535683A (zh) 人工智能驱动的多功能英语语言学习和评估方法及其应用
KR102363955B1 (ko) 녹음의 품질을 평가하는 방법 및 시스템
CN113096646B (zh) 音频识别方法、装置、电子设备及存储介质
CN114519104A (zh) 动作标签标注方法及装置