JPWO2023090419A1 - - Google Patents
Info
- Publication number
- JPWO2023090419A1 JPWO2023090419A1 JP2023562416A JP2023562416A JPWO2023090419A1 JP WO2023090419 A1 JPWO2023090419 A1 JP WO2023090419A1 JP 2023562416 A JP2023562416 A JP 2023562416A JP 2023562416 A JP2023562416 A JP 2023562416A JP WO2023090419 A1 JPWO2023090419 A1 JP WO2023090419A1
- Authority
- JP
- Japan
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—Three-dimensional [3D] animation
- G06T13/205—Three-dimensional [3D] animation driven by audio data
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—Three-dimensional [3D] animation
- G06T13/40—Three-dimensional [3D] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
- G10L13/047—Architecture of speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- General Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Electrically Operated Instructional Devices (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021188791 | 2021-11-19 | ||
| PCT/JP2022/042847 WO2023090419A1 (ja) | 2021-11-19 | 2022-11-18 | コンテンツ生成装置、コンテンツ生成方法、及びプログラム |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JPWO2023090419A1 true JPWO2023090419A1 (https=) | 2023-05-25 |
| JPWO2023090419A5 JPWO2023090419A5 (https=) | 2024-08-05 |
Family
ID=86396966
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2023562416A Pending JPWO2023090419A1 (https=) | 2021-11-19 | 2022-11-18 |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US12608864B2 (https=) |
| JP (1) | JPWO2023090419A1 (https=) |
| WO (1) | WO2023090419A1 (https=) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12561876B2 (en) * | 2022-11-28 | 2026-02-24 | Constructor Technology Ag | System and method for an audio-visual avatar creation |
| JP7794515B1 (ja) * | 2025-07-14 | 2026-01-06 | 株式会社バリューアップデート | 動画生成装置及び動画生成方法並びにプログラム |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001014307A (ja) * | 1999-07-02 | 2001-01-19 | Sony Corp | 文書処理装置、文書処理方法、及び記録媒体 |
| JP2003108502A (ja) * | 2001-09-28 | 2003-04-11 | Interrobot Inc | 身体性メディア通信システム |
| JP2006163871A (ja) * | 2004-12-08 | 2006-06-22 | Sony Corp | 画像処理装置、画像処理方法、およびプログラム |
| US20100082345A1 (en) * | 2008-09-26 | 2010-04-01 | Microsoft Corporation | Speech and text driven hmm-based body animation synthesis |
| WO2015092936A1 (ja) * | 2013-12-20 | 2015-06-25 | 株式会社東芝 | 音声合成装置、音声合成方法およびプログラム |
| JP2020006482A (ja) * | 2018-07-09 | 2020-01-16 | 株式会社国際電気通信基礎技術研究所 | アンドロイドのジェスチャ生成装置及びコンピュータプログラム |
| WO2020204000A1 (ja) * | 2019-04-01 | 2020-10-08 | 住友電気工業株式会社 | コミュニケーション支援システム、コミュニケーション支援方法、コミュニケーション支援プログラム、および画像制御プログラム |
| US20210034976A1 (en) * | 2019-08-02 | 2021-02-04 | Google Llc | Framework for Learning to Transfer Learn |
| JP2021177647A (ja) * | 2020-12-22 | 2021-11-11 | ベイジン バイドゥ ネットコム サイエンス アンド テクノロジー カンパニー リミテッド | ビデオシーケンス編成方法、装置、電子設備、記憶媒体、及びプログラム |
Family Cites Families (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH11312160A (ja) | 1998-02-13 | 1999-11-09 | Fuji Xerox Co Ltd | 自律的パ―ソナルアバタ―による文書注釈方法及び装置 |
| KR102407132B1 (ko) * | 2021-02-05 | 2022-06-10 | 장건 | 고인을 모사하는 가상 인물과 대화를 수행하는 서비스를 제공하는 방법 및 시스템 |
| US12417762B2 (en) * | 2022-04-13 | 2025-09-16 | International Business Machines Corporation | Speech-to-text voice visualization |
| US12039653B1 (en) * | 2023-05-30 | 2024-07-16 | Roku, Inc. | Video-content system with narrative-based video content generation feature |
| CN119126980A (zh) * | 2024-09-04 | 2024-12-13 | 中国矿业大学 | 一种交互式虚拟专家形象生成方法和系统 |
| KR102832018B1 (ko) * | 2024-09-25 | 2025-07-10 | 주식회사 에이아이트릭스 | 얼굴 이미지에 기초한 tts 모델 기반 음성 합성 시스템 및 그것의 합성 방법 |
-
2022
- 2022-11-18 WO PCT/JP2022/042847 patent/WO2023090419A1/ja not_active Ceased
- 2022-11-18 JP JP2023562416A patent/JPWO2023090419A1/ja active Pending
-
2024
- 2024-05-17 US US18/667,096 patent/US12608864B2/en active Active
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001014307A (ja) * | 1999-07-02 | 2001-01-19 | Sony Corp | 文書処理装置、文書処理方法、及び記録媒体 |
| JP2003108502A (ja) * | 2001-09-28 | 2003-04-11 | Interrobot Inc | 身体性メディア通信システム |
| JP2006163871A (ja) * | 2004-12-08 | 2006-06-22 | Sony Corp | 画像処理装置、画像処理方法、およびプログラム |
| US20100082345A1 (en) * | 2008-09-26 | 2010-04-01 | Microsoft Corporation | Speech and text driven hmm-based body animation synthesis |
| WO2015092936A1 (ja) * | 2013-12-20 | 2015-06-25 | 株式会社東芝 | 音声合成装置、音声合成方法およびプログラム |
| JP2020006482A (ja) * | 2018-07-09 | 2020-01-16 | 株式会社国際電気通信基礎技術研究所 | アンドロイドのジェスチャ生成装置及びコンピュータプログラム |
| WO2020204000A1 (ja) * | 2019-04-01 | 2020-10-08 | 住友電気工業株式会社 | コミュニケーション支援システム、コミュニケーション支援方法、コミュニケーション支援プログラム、および画像制御プログラム |
| US20210034976A1 (en) * | 2019-08-02 | 2021-02-04 | Google Llc | Framework for Learning to Transfer Learn |
| JP2021177647A (ja) * | 2020-12-22 | 2021-11-11 | ベイジン バイドゥ ネットコム サイエンス アンド テクノロジー カンパニー リミテッド | ビデオシーケンス編成方法、装置、電子設備、記憶媒体、及びプログラム |
Non-Patent Citations (4)
| Title |
|---|
| "LOGOSWARE STORM XE 操作マニュアル", vol. 第8版, JPN6022041147, 15 April 2021 (2021-04-15), pages 1 - 165, ISSN: 0005657510 * |
| 斉藤典明: ""個人適用型メディア講義の提案"", 情報処理学会シンポジウム グループウェアとネットワークサービスワークショップ2018, JPN6023004142, 8 November 2018 (2018-11-08), ISSN: 0005760316 * |
| 籠嶋岳彦 他: ""音声合成の多様性向上の取り組み"", IN:情報処理学会研究報告 [CD-ROM], vol. Vol.2012-SLP-93, No.7, JPN6025051492, 15 December 2012 (2012-12-15), pages 1 - 4, ISSN: 0005760317 * |
| 藤井祐介 他: ""放送におけるアンドロイド活用 〜テレビ朝日におけるロボットの活用〜"", IN:映像情報メディア学会誌, vol. 73, no. 4, JPN6025051491, 1 July 2019 (2019-07-01), pages 60 - 64, ISSN: 0005760318 * |
Also Published As
| Publication number | Publication date |
|---|---|
| US12608864B2 (en) | 2026-04-21 |
| US20240303892A1 (en) | 2024-09-12 |
| WO2023090419A1 (ja) | 2023-05-25 |
Similar Documents
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20240515 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20250602 |
|
| A871 | Explanation of circumstances concerning accelerated examination |
Free format text: JAPANESE INTERMEDIATE CODE: A871 Effective date: 20250602 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20250805 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20251003 |
|
| A02 | Decision of refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A02 Effective date: 20251223 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20260323 |