JPWO2023022206A5 - - Google Patents
Download PDFInfo
- Publication number
- JPWO2023022206A5 JPWO2023022206A5 JP2023542446A JP2023542446A JPWO2023022206A5 JP WO2023022206 A5 JPWO2023022206 A5 JP WO2023022206A5 JP 2023542446 A JP2023542446 A JP 2023542446A JP 2023542446 A JP2023542446 A JP 2023542446A JP WO2023022206 A5 JPWO2023022206 A5 JP WO2023022206A5
- Authority
- JP
- Japan
- Prior art keywords
- speech
- information
- image
- data
- book
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000015572 biosynthetic process Effects 0.000 claims 11
- 238000003786 synthesis reaction Methods 0.000 claims 11
- 230000000007 visual effect Effects 0.000 claims 3
- 238000006243 chemical reaction Methods 0.000 claims 2
- 238000000034 method Methods 0.000 claims 1
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021133713 | 2021-08-18 | ||
| JP2021133713 | 2021-08-18 | ||
| PCT/JP2022/031276 WO2023022206A1 (ja) | 2021-08-18 | 2022-08-18 | 音声合成装置、音声合成方法及び音声合成プログラム |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JPWO2023022206A1 JPWO2023022206A1 (https=) | 2023-02-23 |
| JPWO2023022206A5 true JPWO2023022206A5 (https=) | 2024-05-13 |
| JP7603948B2 JP7603948B2 (ja) | 2024-12-23 |
Family
ID=85240853
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023542446A Active JP7603948B2 (ja) | 2021-08-18 | 2022-08-18 | 音声合成装置、音声合成方法及び音声合成プログラム |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20240347039A1 (https=) |
| JP (1) | JP7603948B2 (https=) |
| WO (1) | WO2023022206A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20240203418A1 (en) * | 2022-12-20 | 2024-06-20 | Jpmorgan Chase Bank, N.A. | Method and system for automatically visualizing a transcript |
| US12548589B1 (en) | 2025-09-24 | 2026-02-10 | CNTXT FZCo | Systems and methods for generating audio descriptions |
Family Cites Families (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2003044072A (ja) * | 2001-07-30 | 2003-02-14 | Seiko Epson Corp | 音声読み上げ設定装置、音声読み上げ装置、音声読み上げ設定方法、音声読み上げ設定プログラム及び記録媒体 |
| JP2005249880A (ja) * | 2004-03-01 | 2005-09-15 | Xing Inc | 携帯式通信端末によるディジタル絵本システム |
| JP2005321706A (ja) | 2004-05-11 | 2005-11-17 | Nippon Telegr & Teleph Corp <Ntt> | 電子書籍の再生方法及びその装置 |
| US20080070199A1 (en) * | 2006-08-28 | 2008-03-20 | Sommer Sandra R | Coloring book composed of digital images converted to black and white outlines |
| WO2016103652A1 (ja) * | 2014-12-24 | 2016-06-30 | 日本電気株式会社 | 音声処理装置、音声処理方法、および記録媒体 |
| US20180133900A1 (en) * | 2016-11-15 | 2018-05-17 | JIBO, Inc. | Embodied dialog and embodied speech authoring tools for use with an expressive social robot |
| CN108885614B (zh) * | 2017-02-06 | 2020-12-15 | 华为技术有限公司 | 一种文本和语音信息的处理方法以及终端 |
| US10607595B2 (en) * | 2017-08-07 | 2020-03-31 | Lenovo (Singapore) Pte. Ltd. | Generating audio rendering from textual content based on character models |
| US10540445B2 (en) * | 2017-11-03 | 2020-01-21 | International Business Machines Corporation | Intelligent integration of graphical elements into context for screen reader applications |
| US11226673B2 (en) * | 2018-01-26 | 2022-01-18 | Institute Of Software Chinese Academy Of Sciences | Affective interaction systems, devices, and methods based on affective computing user interface |
| WO2020235696A1 (ko) * | 2019-05-17 | 2020-11-26 | 엘지전자 주식회사 | 스타일을 고려하여 텍스트와 음성을 상호 변환하는 인공 지능 장치 및 그 방법 |
| KR20210011844A (ko) * | 2019-07-23 | 2021-02-02 | 삼성전자주식회사 | 전자 장치 및 그 제어 방법 |
| US11270684B2 (en) * | 2019-09-11 | 2022-03-08 | Artificial Intelligence Foundation, Inc. | Generation of speech with a prosodic characteristic |
| CN110717498A (zh) * | 2019-09-16 | 2020-01-21 | 腾讯科技(深圳)有限公司 | 图像描述生成方法、装置及电子设备 |
| JP7339151B2 (ja) * | 2019-12-23 | 2023-09-05 | 株式会社 ディー・エヌ・エー | 音声合成装置、音声合成プログラム及び音声合成方法 |
| US20220269870A1 (en) * | 2021-02-18 | 2022-08-25 | Meta Platforms, Inc. | Readout of Communication Content Comprising Non-Latin or Non-Parsable Content Items for Assistant Systems |
| JP2024516664A (ja) * | 2021-04-27 | 2024-04-16 | フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | デコーダ |
-
2022
- 2022-08-18 JP JP2023542446A patent/JP7603948B2/ja active Active
- 2022-08-18 WO PCT/JP2022/031276 patent/WO2023022206A1/ja not_active Ceased
- 2022-08-18 US US18/683,786 patent/US20240347039A1/en active Pending
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7500020B2 (ja) | 多言語テキスト音声合成方法 | |
| JP5323212B2 (ja) | 複数言語音声認識 | |
| CN112599113B (zh) | 方言语音合成方法、装置、电子设备和可读存储介质 | |
| JP2018146803A (ja) | 音声合成装置及びプログラム | |
| KR20150076128A (ko) | 3차원 멀티미디어 활용 발음 학습 지원 시스템 및 그 시스템의 발음 학습 지원 방법 | |
| Aryal et al. | Reduction of non-native accents through statistical parametric articulatory synthesis | |
| JPWO2023022206A5 (https=) | ||
| Kayte et al. | Di-phone-based concatenative speech synthesis systems for marathi language | |
| KR102418465B1 (ko) | 동화 낭독 서비스를 제공하는 서버, 방법 및 컴퓨터 프로그램 | |
| JP7357518B2 (ja) | 音声合成装置及びプログラム | |
| JP5334716B2 (ja) | 文字情報提示制御装置及びプログラム | |
| JP6475572B2 (ja) | 発話リズム変換装置、方法及びプログラム | |
| Souza et al. | An automatic phonetic aligner for Brazilian Portuguese with a Praat interface | |
| KR101246287B1 (ko) | 음가의 강세를 이용한 발음기관 애니메이션 생성 장치 및 방법 | |
| Rashad et al. | Diphone speech synthesis system for Arabic using MARY TTS | |
| Ai | Perceptual feedback in computer assisted pronunciation training: A survey | |
| Iyanda et al. | Development of a yorúbà texttospeech system using festival | |
| Alsabaan | Pronunciation support for Arabic learners | |
| Azab et al. | Masry: A Text-to-Speech System for the Egyptian Arabic. | |
| Winarti et al. | Enhancing Indonesian Speech Synthesis: Embracing Naturalness and Expressiveness with Hidden Markov Models | |
| Ma et al. | The SCUT Text-To-Speech System for the Blizzard Challenge 2023. | |
| Lobanov et al. | Computer-based system of analysis and interpretation of speech intonation | |
| Лобанов et al. | Компьютерная система анализа и интерпретации интонации речи | |
| Ekwonwune et al. | Analysis of Leveraging Fastspeech 2 and Hifi-Gan Models for Speech Synthesis Adapted for Nigerian Languages | |
| Truong et al. | Building a mixed Vietnamese-English speech recognition solution |