JP2023024987A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2023024987A5 JP2023024987A5 JP2022176503A JP2022176503A JP2023024987A5 JP 2023024987 A5 JP2023024987 A5 JP 2023024987A5 JP 2022176503 A JP2022176503 A JP 2022176503A JP 2022176503 A JP2022176503 A JP 2022176503A JP 2023024987 A5 JP2023024987 A5 JP 2023024987A5
- Authority
- JP
- Japan
- Prior art keywords
- component object
- digital component
- processing system
- data processing
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 claims 9
- 230000003993 interaction Effects 0.000 claims 8
- 230000001755 vocal effect Effects 0.000 claims 6
- 230000004044 response Effects 0.000 claims 5
- 230000000007 visual effect Effects 0.000 claims 5
- 238000009877 rendering Methods 0.000 claims 3
- 238000010801 machine learning Methods 0.000 claims 1
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2022176503A JP7525575B2 (ja) | 2020-06-09 | 2022-11-02 | ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2021520598A JP7171911B2 (ja) | 2020-06-09 | 2020-06-09 | ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 |
| PCT/US2020/036749 WO2021251953A1 (en) | 2020-06-09 | 2020-06-09 | Generation of interactive audio tracks from visual content |
| JP2022176503A JP7525575B2 (ja) | 2020-06-09 | 2022-11-02 | ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021520598A Division JP7171911B2 (ja) | 2020-06-09 | 2020-06-09 | ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2023024987A JP2023024987A (ja) | 2023-02-21 |
| JP2023024987A5 true JP2023024987A5 (https=) | 2023-08-07 |
| JP7525575B2 JP7525575B2 (ja) | 2024-07-30 |
Family
ID=71465407
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021520598A Active JP7171911B2 (ja) | 2020-06-09 | 2020-06-09 | ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 |
| JP2022176503A Active JP7525575B2 (ja) | 2020-06-09 | 2022-11-02 | ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021520598A Active JP7171911B2 (ja) | 2020-06-09 | 2020-06-09 | ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US12230252B2 (https=) |
| EP (2) | EP4478338B1 (https=) |
| JP (2) | JP7171911B2 (https=) |
| KR (1) | KR102765838B1 (https=) |
| CN (2) | CN114080817B (https=) |
| WO (1) | WO2021251953A1 (https=) |
Families Citing this family (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11010428B2 (en) * | 2018-01-16 | 2021-05-18 | Google Llc | Systems, methods, and apparatuses for providing assistant deep links to effectuate third-party dialog session transfers |
| CN111753553B (zh) * | 2020-07-06 | 2022-07-05 | 北京世纪好未来教育科技有限公司 | 语句类型识别方法、装置、电子设备和存储介质 |
| US12045717B2 (en) * | 2020-12-09 | 2024-07-23 | International Business Machines Corporation | Automatic creation of difficult annotated data leveraging cues |
| US20240153488A1 (en) * | 2021-03-17 | 2024-05-09 | Pioneer Corporation | Sound output control device, sound output control method, and sound output control program |
| US20230005486A1 (en) * | 2021-07-02 | 2023-01-05 | Pindrop Security, Inc. | Speaker embedding conversion for backward and cross-channel compatability |
| US20230098356A1 (en) * | 2021-09-30 | 2023-03-30 | Meta Platforms, Inc. | Systems and methods for identifying candidate videos for audio experiences |
| KR20230056923A (ko) * | 2021-10-21 | 2023-04-28 | 주식회사 캐스트유 | 음원을 위한 키워드 생성방법 |
| US12230278B1 (en) * | 2022-02-22 | 2025-02-18 | Amazon Technologies, Inc. | Output of visual supplemental content |
| EP4420751A1 (en) * | 2023-02-23 | 2024-08-28 | Sony Interactive Entertainment Inc. | Generating a musical score for a video game |
| JP7497502B1 (ja) | 2023-08-14 | 2024-06-10 | 株式会社コロプラ | プログラム及びシステム |
| US20250118287A1 (en) * | 2023-10-06 | 2025-04-10 | Google Llc | Sonifying Visual Content For Vision-Impaired Users |
| US12604062B2 (en) * | 2024-04-03 | 2026-04-14 | Nec Corporation Of America | Enhancing media consumption experience through generative AI-powered interactive companions |
| US12561348B2 (en) * | 2024-05-29 | 2026-02-24 | Microsoft Technology Licensing, Llc | Semantic-tree-based AI content management platform |
Family Cites Families (28)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6876968B2 (en) * | 2001-03-08 | 2005-04-05 | Matsushita Electric Industrial Co., Ltd. | Run time synthesizer adaptation to improve intelligibility of synthesized speech |
| JP4383690B2 (ja) * | 2001-04-27 | 2009-12-16 | 株式会社日立製作所 | デジタルコンテンツ出力方法およびシステム |
| US20060229877A1 (en) * | 2005-04-06 | 2006-10-12 | Jilei Tian | Memory usage in a text-to-speech system |
| US8996376B2 (en) * | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
| US20110047163A1 (en) * | 2009-08-24 | 2011-02-24 | Google Inc. | Relevance-Based Image Selection |
| US9043474B2 (en) * | 2010-01-20 | 2015-05-26 | Microsoft Technology Licensing, Llc | Communication sessions among devices and interfaces with mixed capabilities |
| US8862985B2 (en) * | 2012-06-08 | 2014-10-14 | Freedom Scientific, Inc. | Screen reader with customizable web page output |
| US9536528B2 (en) * | 2012-07-03 | 2017-01-03 | Google Inc. | Determining hotword suitability |
| CN104795066A (zh) * | 2014-01-17 | 2015-07-22 | 株式会社Ntt都科摩 | 语音识别方法和装置 |
| US20160048561A1 (en) * | 2014-08-15 | 2016-02-18 | Chacha Search, Inc. | Method, system, and computer readable storage for podcasting and video training in an information search system |
| US20160379638A1 (en) * | 2015-06-26 | 2016-12-29 | Amazon Technologies, Inc. | Input speech quality matching |
| US9792835B2 (en) * | 2016-02-05 | 2017-10-17 | Microsoft Technology Licensing, Llc | Proxemic interfaces for exploring imagery |
| US10049670B2 (en) * | 2016-06-06 | 2018-08-14 | Google Llc | Providing voice action discoverability example for trigger term |
| CN107516511B (zh) * | 2016-06-13 | 2021-05-25 | 微软技术许可有限责任公司 | 意图识别和情绪的文本到语音学习系统 |
| US10141006B1 (en) * | 2016-06-27 | 2018-11-27 | Amazon Technologies, Inc. | Artificial intelligence system for improving accessibility of digitized speech |
| US20180082679A1 (en) * | 2016-09-18 | 2018-03-22 | Newvoicemedia, Ltd. | Optimal human-machine conversations using emotion-enhanced natural speech using hierarchical neural networks and reinforcement learning |
| US10311856B2 (en) | 2016-10-03 | 2019-06-04 | Google Llc | Synthesized voice selection for computational agents |
| US10387488B2 (en) | 2016-12-07 | 2019-08-20 | At7T Intellectual Property I, L.P. | User configurable radio |
| US11722571B1 (en) * | 2016-12-20 | 2023-08-08 | Amazon Technologies, Inc. | Recipient device presence activity monitoring for a communications session |
| US10559309B2 (en) * | 2016-12-22 | 2020-02-11 | Google Llc | Collaborative voice controlled devices |
| US10708313B2 (en) * | 2016-12-30 | 2020-07-07 | Google Llc | Multimodal transmission of packetized data |
| KR102622356B1 (ko) * | 2017-04-20 | 2024-01-08 | 구글 엘엘씨 | 장치에 대한 다중 사용자 인증 |
| EP4060476B1 (en) * | 2017-06-13 | 2025-08-06 | Google LLC | Establishment of audio-based network sessions with non-registered resources |
| KR102389331B1 (ko) * | 2018-05-07 | 2022-04-21 | 구글 엘엘씨 | 컴퓨팅 디바이스간의 액세스 제어 동기화 |
| CN108615526B (zh) * | 2018-05-08 | 2020-07-07 | 腾讯科技(深圳)有限公司 | 语音信号中关键词的检测方法、装置、终端及存储介质 |
| US11094324B2 (en) * | 2019-05-14 | 2021-08-17 | Motorola Mobility Llc | Accumulative multi-cue activation of domain-specific automatic speech recognition engine |
| JP7191792B2 (ja) * | 2019-08-23 | 2022-12-19 | 株式会社東芝 | 情報処理装置、情報処理方法およびプログラム |
| US11361759B2 (en) * | 2019-11-18 | 2022-06-14 | Streamingo Solutions Private Limited | Methods and systems for automatic generation and convergence of keywords and/or keyphrases from a media |
-
2020
- 2020-06-09 EP EP24210566.6A patent/EP4478338B1/en active Active
- 2020-06-09 US US17/282,135 patent/US12230252B2/en active Active
- 2020-06-09 KR KR1020217011130A patent/KR102765838B1/ko active Active
- 2020-06-09 EP EP20736811.9A patent/EP3948516B1/en active Active
- 2020-06-09 JP JP2021520598A patent/JP7171911B2/ja active Active
- 2020-06-09 CN CN202080005699.4A patent/CN114080817B/zh active Active
- 2020-06-09 CN CN202410825424.5A patent/CN118714395A/zh active Pending
- 2020-06-09 WO PCT/US2020/036749 patent/WO2021251953A1/en not_active Ceased
-
2022
- 2022-11-02 JP JP2022176503A patent/JP7525575B2/ja active Active
-
2024
- 2024-11-05 US US18/938,126 patent/US20250061892A1/en active Pending
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2023024987A5 (https=) | ||
| CN108288468B (zh) | 语音识别方法及装置 | |
| CN111090727B (zh) | 语言转换处理方法、装置及方言语音交互系统 | |
| CN113948062B (zh) | 数据转换方法及计算机存储介质 | |
| US10236017B1 (en) | Goal segmentation in speech dialogs | |
| US11417339B1 (en) | Detection of plagiarized spoken responses using machine learning | |
| US11762451B2 (en) | Methods and apparatus to add common sense reasoning to artificial intelligence in the context of human machine interfaces | |
| CN116434731A (zh) | 语音编辑方法、装置、存储介质及电子装置 | |
| CN116187292A (zh) | 对话模板生成方法、装置及计算机可读存储介质 | |
| CN113761268A (zh) | 音频节目内容的播放控制方法、装置、设备和存储介质 | |
| CN112837688B (zh) | 语音转写方法、装置、相关系统及设备 | |
| CN117221656A (zh) | 题目讲解视频的生成方法、装置、电子设备及存储介质 | |
| CN116787437A (zh) | 基于语音处理的辩论机器人的控制方法、装置、介质 | |
| CN119993196B (zh) | 一种语音训练数据的获取方法、装置、设备及介质 | |
| JPWO2021059968A5 (https=) | ||
| US20250259006A1 (en) | Generating communication summaries using artificial intelligence models, summary templates, and enriched transcripts | |
| CN118609542A (zh) | 文本转语音方法、装置、计算机设备、可读存储介质和程序产品 | |
| KR20230019188A (ko) | 개인화된 음성 콘텐츠를 생성하는 방법 | |
| JP2023035787A5 (https=) | ||
| WO2021017302A1 (zh) | 一种数据提取方法、装置、计算机系统及可读存储介质 | |
| CN119272880B (zh) | 模型转换方法、设备、存储介质及程序产品 | |
| CN119920234B (zh) | 基于自回归模型的语音克隆方法、装置、设备及存储介质 | |
| JP5383608B2 (ja) | 解説放送文作成支援装置及びプログラム | |
| CN114661941B (zh) | 一种点击率预测模型构建方法、装置、计算机设备和存储介质 | |
| CN118737151A (zh) | 一种会议对话记录要点生成方法、装置、设备及存储介质 |