JPWO2023166557A5 - - Google Patents

Download PDF

Info

Publication number
JPWO2023166557A5
JPWO2023166557A5 JP2024504041A JP2024504041A JPWO2023166557A5 JP WO2023166557 A5 JPWO2023166557 A5 JP WO2023166557A5 JP 2024504041 A JP2024504041 A JP 2024504041A JP 2024504041 A JP2024504041 A JP 2024504041A JP WO2023166557 A5 JPWO2023166557 A5 JP WO2023166557A5
Authority
JP
Japan
Prior art keywords
speech
data
voice
synthetic
real
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2024504041A
Other languages
English (en)
Japanese (ja)
Other versions
JP7691027B2 (ja
JPWO2023166557A1 (https=
Filing date
Publication date
Application filed filed Critical
Priority claimed from PCT/JP2022/008597 external-priority patent/WO2023166557A1/ja
Publication of JPWO2023166557A1 publication Critical patent/JPWO2023166557A1/ja
Publication of JPWO2023166557A5 publication Critical patent/JPWO2023166557A5/ja
Application granted granted Critical
Publication of JP7691027B2 publication Critical patent/JP7691027B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2024504041A 2022-03-01 2022-03-01 音声認識システム、音声認識方法、及び記録媒体 Active JP7691027B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2022/008597 WO2023166557A1 (ja) 2022-03-01 2022-03-01 音声認識システム、音声認識方法、及び記録媒体

Publications (3)

Publication Number Publication Date
JPWO2023166557A1 JPWO2023166557A1 (https=) 2023-09-07
JPWO2023166557A5 true JPWO2023166557A5 (https=) 2024-10-23
JP7691027B2 JP7691027B2 (ja) 2025-06-11

Family

ID=87883147

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2024504041A Active JP7691027B2 (ja) 2022-03-01 2022-03-01 音声認識システム、音声認識方法、及び記録媒体

Country Status (3)

Country Link
US (1) US20250061884A1 (https=)
JP (1) JP7691027B2 (https=)
WO (1) WO2023166557A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20230386446A1 (en) * 2022-05-25 2023-11-30 AuthenticVoice Inc. Modifying an audio signal to incorporate a natural-sounding intonation

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003522978A (ja) * 2000-02-10 2003-07-29 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 手話を音声へ変換する方法及び装置
JP2019008120A (ja) * 2017-06-23 2019-01-17 株式会社日立製作所 声質変換システム、声質変換方法、及び声質変換プログラム

Similar Documents

Publication Publication Date Title
CN111785261B (zh) 基于解纠缠和解释性表征的跨语种语音转换方法及系统
CN111048064B (zh) 基于单说话人语音合成数据集的声音克隆方法及装置
CN105304080B (zh) 语音合成装置及方法
KR102505927B1 (ko) 생성 모델 기반 데이터 증강 기법을 활용한 딥러닝 기반 감정음성합성 장치 및 방법
US20060229876A1 (en) Method, apparatus and computer program providing a multi-speaker database for concatenative text-to-speech synthesis
JP2018146803A (ja) 音声合成装置及びプログラム
CN106128450A (zh) 一种汉藏双语跨语言语音转换的方法及其系统
JP7393585B2 (ja) テキスト読み上げのためのWaveNetの自己トレーニング
ATE374991T1 (de) Verfahren und system für die umsetzung von text- zu-sprache
CN102543081A (zh) 可调控式韵律重估测系统与方法及计算机程序产品
US20220156552A1 (en) Data conversion learning device, data conversion device, method, and program
Wu et al. Multilingual text-to-speech training using cross language voice conversion and self-supervised learning of speech representations
JPWO2023166557A5 (https=)
CN115346512B (zh) 一种基于数字人的多情感语音合成方法
Tsiakoulis et al. Dialogue context sensitive HMM-based speech synthesis
Cao et al. VNet: A GAN-based Multi-Tier Discriminator Network for Speech Synthesis Vocoders
JPWO2023022206A5 (https=)
JP6864322B2 (ja) 音声処理装置、音声処理プログラムおよび音声処理方法
CN112634861B (zh) 数据处理方法、装置、电子设备和可读存储介质
KR20210025295A (ko) 이미지를 통한 학습을 이용하여 합성 음원을 생성하는 장치, 방법 및 컴퓨터 프로그램
US20150149181A1 (en) Method and system for voice synthesis
JP7847733B1 (ja) 発言データ提供装置、発言推定システム、発言データ提供システム、発言データ提供方法及びプログラム
WO2022144851A1 (en) System and method of automated audio output
Anumanchipalli et al. A style capturing approach to F0 transformation in voice conversion
CN113178186A (zh) 一种方言语音合成方法、装置、电子设备和存储介质