JPWO2023022206A5 - - Google Patents

Download PDF

Info

Publication number
JPWO2023022206A5
JPWO2023022206A5 JP2023542446A JP2023542446A JPWO2023022206A5 JP WO2023022206 A5 JPWO2023022206 A5 JP WO2023022206A5 JP 2023542446 A JP2023542446 A JP 2023542446A JP 2023542446 A JP2023542446 A JP 2023542446A JP WO2023022206 A5 JPWO2023022206 A5 JP WO2023022206A5
Authority
JP
Japan
Prior art keywords
speech
information
image
data
book
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2023542446A
Other languages
English (en)
Japanese (ja)
Other versions
JP7603948B2 (ja
JPWO2023022206A1 (https=
Filing date
Publication date
Application filed filed Critical
Priority claimed from PCT/JP2022/031276 external-priority patent/WO2023022206A1/ja
Publication of JPWO2023022206A1 publication Critical patent/JPWO2023022206A1/ja
Publication of JPWO2023022206A5 publication Critical patent/JPWO2023022206A5/ja
Application granted granted Critical
Publication of JP7603948B2 publication Critical patent/JP7603948B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2023542446A 2021-08-18 2022-08-18 音声合成装置、音声合成方法及び音声合成プログラム Active JP7603948B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2021133713 2021-08-18
JP2021133713 2021-08-18
PCT/JP2022/031276 WO2023022206A1 (ja) 2021-08-18 2022-08-18 音声合成装置、音声合成方法及び音声合成プログラム

Publications (3)

Publication Number Publication Date
JPWO2023022206A1 JPWO2023022206A1 (https=) 2023-02-23
JPWO2023022206A5 true JPWO2023022206A5 (https=) 2024-05-13
JP7603948B2 JP7603948B2 (ja) 2024-12-23

Family

ID=85240853

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023542446A Active JP7603948B2 (ja) 2021-08-18 2022-08-18 音声合成装置、音声合成方法及び音声合成プログラム

Country Status (3)

Country Link
US (1) US20240347039A1 (https=)
JP (1) JP7603948B2 (https=)
WO (1) WO2023022206A1 (https=)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240203418A1 (en) * 2022-12-20 2024-06-20 Jpmorgan Chase Bank, N.A. Method and system for automatically visualizing a transcript
US12548589B1 (en) 2025-09-24 2026-02-10 CNTXT FZCo Systems and methods for generating audio descriptions

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003044072A (ja) * 2001-07-30 2003-02-14 Seiko Epson Corp 音声読み上げ設定装置、音声読み上げ装置、音声読み上げ設定方法、音声読み上げ設定プログラム及び記録媒体
JP2005249880A (ja) * 2004-03-01 2005-09-15 Xing Inc 携帯式通信端末によるディジタル絵本システム
JP2005321706A (ja) 2004-05-11 2005-11-17 Nippon Telegr & Teleph Corp <Ntt> 電子書籍の再生方法及びその装置
US20080070199A1 (en) * 2006-08-28 2008-03-20 Sommer Sandra R Coloring book composed of digital images converted to black and white outlines
WO2016103652A1 (ja) * 2014-12-24 2016-06-30 日本電気株式会社 音声処理装置、音声処理方法、および記録媒体
US20180133900A1 (en) * 2016-11-15 2018-05-17 JIBO, Inc. Embodied dialog and embodied speech authoring tools for use with an expressive social robot
CN108885614B (zh) * 2017-02-06 2020-12-15 华为技术有限公司 一种文本和语音信息的处理方法以及终端
US10607595B2 (en) * 2017-08-07 2020-03-31 Lenovo (Singapore) Pte. Ltd. Generating audio rendering from textual content based on character models
US10540445B2 (en) * 2017-11-03 2020-01-21 International Business Machines Corporation Intelligent integration of graphical elements into context for screen reader applications
US11226673B2 (en) * 2018-01-26 2022-01-18 Institute Of Software Chinese Academy Of Sciences Affective interaction systems, devices, and methods based on affective computing user interface
WO2020235696A1 (ko) * 2019-05-17 2020-11-26 엘지전자 주식회사 스타일을 고려하여 텍스트와 음성을 상호 변환하는 인공 지능 장치 및 그 방법
KR20210011844A (ko) * 2019-07-23 2021-02-02 삼성전자주식회사 전자 장치 및 그 제어 방법
US11270684B2 (en) * 2019-09-11 2022-03-08 Artificial Intelligence Foundation, Inc. Generation of speech with a prosodic characteristic
CN110717498A (zh) * 2019-09-16 2020-01-21 腾讯科技(深圳)有限公司 图像描述生成方法、装置及电子设备
JP7339151B2 (ja) * 2019-12-23 2023-09-05 株式会社 ディー・エヌ・エー 音声合成装置、音声合成プログラム及び音声合成方法
US20220269870A1 (en) * 2021-02-18 2022-08-25 Meta Platforms, Inc. Readout of Communication Content Comprising Non-Latin or Non-Parsable Content Items for Assistant Systems
JP2024516664A (ja) * 2021-04-27 2024-04-16 フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ デコーダ

Similar Documents

Publication Publication Date Title
JP7500020B2 (ja) 多言語テキスト音声合成方法
JP5323212B2 (ja) 複数言語音声認識
CN112599113B (zh) 方言语音合成方法、装置、电子设备和可读存储介质
JP2018146803A (ja) 音声合成装置及びプログラム
KR20150076128A (ko) 3차원 멀티미디어 활용 발음 학습 지원 시스템 및 그 시스템의 발음 학습 지원 방법
Aryal et al. Reduction of non-native accents through statistical parametric articulatory synthesis
JPWO2023022206A5 (https=)
Kayte et al. Di-phone-based concatenative speech synthesis systems for marathi language
KR102418465B1 (ko) 동화 낭독 서비스를 제공하는 서버, 방법 및 컴퓨터 프로그램
JP7357518B2 (ja) 音声合成装置及びプログラム
JP5334716B2 (ja) 文字情報提示制御装置及びプログラム
JP6475572B2 (ja) 発話リズム変換装置、方法及びプログラム
Souza et al. An automatic phonetic aligner for Brazilian Portuguese with a Praat interface
KR101246287B1 (ko) 음가의 강세를 이용한 발음기관 애니메이션 생성 장치 및 방법
Rashad et al. Diphone speech synthesis system for Arabic using MARY TTS
Ai Perceptual feedback in computer assisted pronunciation training: A survey
Iyanda et al. Development of a yorúbà texttospeech system using festival
Alsabaan Pronunciation support for Arabic learners
Azab et al. Masry: A Text-to-Speech System for the Egyptian Arabic.
Winarti et al. Enhancing Indonesian Speech Synthesis: Embracing Naturalness and Expressiveness with Hidden Markov Models
Ma et al. The SCUT Text-To-Speech System for the Blizzard Challenge 2023.
Lobanov et al. Computer-based system of analysis and interpretation of speech intonation
Лобанов et al. Компьютерная система анализа и интерпретации интонации речи
Ekwonwune et al. Analysis of Leveraging Fastspeech 2 and Hifi-Gan Models for Speech Synthesis Adapted for Nigerian Languages
Truong et al. Building a mixed Vietnamese-English speech recognition solution