JP2023024987A5 - - Google Patents

Download PDF

Info

Publication number
JP2023024987A5
JP2023024987A5 JP2022176503A JP2022176503A JP2023024987A5 JP 2023024987 A5 JP2023024987 A5 JP 2023024987A5 JP 2022176503 A JP2022176503 A JP 2022176503A JP 2022176503 A JP2022176503 A JP 2022176503A JP 2023024987 A5 JP2023024987 A5 JP 2023024987A5
Authority
JP
Japan
Prior art keywords
component object
digital component
processing system
data processing
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2022176503A
Other languages
English (en)
Japanese (ja)
Other versions
JP7525575B2 (ja
JP2023024987A (ja
Filing date
Publication date
Priority claimed from JP2021520598A external-priority patent/JP7171911B2/ja
Application filed filed Critical
Priority to JP2022176503A priority Critical patent/JP7525575B2/ja
Publication of JP2023024987A publication Critical patent/JP2023024987A/ja
Publication of JP2023024987A5 publication Critical patent/JP2023024987A5/ja
Application granted granted Critical
Publication of JP7525575B2 publication Critical patent/JP7525575B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2022176503A 2020-06-09 2022-11-02 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成 Active JP7525575B2 (ja)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2022176503A JP7525575B2 (ja) 2020-06-09 2022-11-02 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2021520598A JP7171911B2 (ja) 2020-06-09 2020-06-09 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成
PCT/US2020/036749 WO2021251953A1 (en) 2020-06-09 2020-06-09 Generation of interactive audio tracks from visual content
JP2022176503A JP7525575B2 (ja) 2020-06-09 2022-11-02 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
JP2021520598A Division JP7171911B2 (ja) 2020-06-09 2020-06-09 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成

Publications (3)

Publication Number Publication Date
JP2023024987A JP2023024987A (ja) 2023-02-21
JP2023024987A5 true JP2023024987A5 (https=) 2023-08-07
JP7525575B2 JP7525575B2 (ja) 2024-07-30

Family

ID=71465407

Family Applications (2)

Application Number Title Priority Date Filing Date
JP2021520598A Active JP7171911B2 (ja) 2020-06-09 2020-06-09 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成
JP2022176503A Active JP7525575B2 (ja) 2020-06-09 2022-11-02 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成

Family Applications Before (1)

Application Number Title Priority Date Filing Date
JP2021520598A Active JP7171911B2 (ja) 2020-06-09 2020-06-09 ビジュアルコンテンツからのインタラクティブなオーディオトラックの生成

Country Status (6)

Country Link
US (2) US12230252B2 (https=)
EP (2) EP4478338B1 (https=)
JP (2) JP7171911B2 (https=)
KR (1) KR102765838B1 (https=)
CN (2) CN114080817B (https=)
WO (1) WO2021251953A1 (https=)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11010428B2 (en) * 2018-01-16 2021-05-18 Google Llc Systems, methods, and apparatuses for providing assistant deep links to effectuate third-party dialog session transfers
CN111753553B (zh) * 2020-07-06 2022-07-05 北京世纪好未来教育科技有限公司 语句类型识别方法、装置、电子设备和存储介质
US12045717B2 (en) * 2020-12-09 2024-07-23 International Business Machines Corporation Automatic creation of difficult annotated data leveraging cues
US20240153488A1 (en) * 2021-03-17 2024-05-09 Pioneer Corporation Sound output control device, sound output control method, and sound output control program
US20230005486A1 (en) * 2021-07-02 2023-01-05 Pindrop Security, Inc. Speaker embedding conversion for backward and cross-channel compatability
US20230098356A1 (en) * 2021-09-30 2023-03-30 Meta Platforms, Inc. Systems and methods for identifying candidate videos for audio experiences
KR20230056923A (ko) * 2021-10-21 2023-04-28 주식회사 캐스트유 음원을 위한 키워드 생성방법
US12230278B1 (en) * 2022-02-22 2025-02-18 Amazon Technologies, Inc. Output of visual supplemental content
EP4420751A1 (en) * 2023-02-23 2024-08-28 Sony Interactive Entertainment Inc. Generating a musical score for a video game
JP7497502B1 (ja) 2023-08-14 2024-06-10 株式会社コロプラ プログラム及びシステム
US20250118287A1 (en) * 2023-10-06 2025-04-10 Google Llc Sonifying Visual Content For Vision-Impaired Users
US12604062B2 (en) * 2024-04-03 2026-04-14 Nec Corporation Of America Enhancing media consumption experience through generative AI-powered interactive companions
US12561348B2 (en) * 2024-05-29 2026-02-24 Microsoft Technology Licensing, Llc Semantic-tree-based AI content management platform

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6876968B2 (en) * 2001-03-08 2005-04-05 Matsushita Electric Industrial Co., Ltd. Run time synthesizer adaptation to improve intelligibility of synthesized speech
JP4383690B2 (ja) * 2001-04-27 2009-12-16 株式会社日立製作所 デジタルコンテンツ出力方法およびシステム
US20060229877A1 (en) * 2005-04-06 2006-10-12 Jilei Tian Memory usage in a text-to-speech system
US8996376B2 (en) * 2008-04-05 2015-03-31 Apple Inc. Intelligent text-to-speech conversion
US20110047163A1 (en) * 2009-08-24 2011-02-24 Google Inc. Relevance-Based Image Selection
US9043474B2 (en) * 2010-01-20 2015-05-26 Microsoft Technology Licensing, Llc Communication sessions among devices and interfaces with mixed capabilities
US8862985B2 (en) * 2012-06-08 2014-10-14 Freedom Scientific, Inc. Screen reader with customizable web page output
US9536528B2 (en) * 2012-07-03 2017-01-03 Google Inc. Determining hotword suitability
CN104795066A (zh) * 2014-01-17 2015-07-22 株式会社Ntt都科摩 语音识别方法和装置
US20160048561A1 (en) * 2014-08-15 2016-02-18 Chacha Search, Inc. Method, system, and computer readable storage for podcasting and video training in an information search system
US20160379638A1 (en) * 2015-06-26 2016-12-29 Amazon Technologies, Inc. Input speech quality matching
US9792835B2 (en) * 2016-02-05 2017-10-17 Microsoft Technology Licensing, Llc Proxemic interfaces for exploring imagery
US10049670B2 (en) * 2016-06-06 2018-08-14 Google Llc Providing voice action discoverability example for trigger term
CN107516511B (zh) * 2016-06-13 2021-05-25 微软技术许可有限责任公司 意图识别和情绪的文本到语音学习系统
US10141006B1 (en) * 2016-06-27 2018-11-27 Amazon Technologies, Inc. Artificial intelligence system for improving accessibility of digitized speech
US20180082679A1 (en) * 2016-09-18 2018-03-22 Newvoicemedia, Ltd. Optimal human-machine conversations using emotion-enhanced natural speech using hierarchical neural networks and reinforcement learning
US10311856B2 (en) 2016-10-03 2019-06-04 Google Llc Synthesized voice selection for computational agents
US10387488B2 (en) 2016-12-07 2019-08-20 At7T Intellectual Property I, L.P. User configurable radio
US11722571B1 (en) * 2016-12-20 2023-08-08 Amazon Technologies, Inc. Recipient device presence activity monitoring for a communications session
US10559309B2 (en) * 2016-12-22 2020-02-11 Google Llc Collaborative voice controlled devices
US10708313B2 (en) * 2016-12-30 2020-07-07 Google Llc Multimodal transmission of packetized data
KR102622356B1 (ko) * 2017-04-20 2024-01-08 구글 엘엘씨 장치에 대한 다중 사용자 인증
EP4060476B1 (en) * 2017-06-13 2025-08-06 Google LLC Establishment of audio-based network sessions with non-registered resources
KR102389331B1 (ko) * 2018-05-07 2022-04-21 구글 엘엘씨 컴퓨팅 디바이스간의 액세스 제어 동기화
CN108615526B (zh) * 2018-05-08 2020-07-07 腾讯科技(深圳)有限公司 语音信号中关键词的检测方法、装置、终端及存储介质
US11094324B2 (en) * 2019-05-14 2021-08-17 Motorola Mobility Llc Accumulative multi-cue activation of domain-specific automatic speech recognition engine
JP7191792B2 (ja) * 2019-08-23 2022-12-19 株式会社東芝 情報処理装置、情報処理方法およびプログラム
US11361759B2 (en) * 2019-11-18 2022-06-14 Streamingo Solutions Private Limited Methods and systems for automatic generation and convergence of keywords and/or keyphrases from a media

Similar Documents

Publication Publication Date Title
JP2023024987A5 (https=)
CN108288468B (zh) 语音识别方法及装置
CN111090727B (zh) 语言转换处理方法、装置及方言语音交互系统
CN113948062B (zh) 数据转换方法及计算机存储介质
US10236017B1 (en) Goal segmentation in speech dialogs
US11417339B1 (en) Detection of plagiarized spoken responses using machine learning
US11762451B2 (en) Methods and apparatus to add common sense reasoning to artificial intelligence in the context of human machine interfaces
CN116434731A (zh) 语音编辑方法、装置、存储介质及电子装置
CN116187292A (zh) 对话模板生成方法、装置及计算机可读存储介质
CN113761268A (zh) 音频节目内容的播放控制方法、装置、设备和存储介质
CN112837688B (zh) 语音转写方法、装置、相关系统及设备
CN117221656A (zh) 题目讲解视频的生成方法、装置、电子设备及存储介质
CN116787437A (zh) 基于语音处理的辩论机器人的控制方法、装置、介质
CN119993196B (zh) 一种语音训练数据的获取方法、装置、设备及介质
JPWO2021059968A5 (https=)
US20250259006A1 (en) Generating communication summaries using artificial intelligence models, summary templates, and enriched transcripts
CN118609542A (zh) 文本转语音方法、装置、计算机设备、可读存储介质和程序产品
KR20230019188A (ko) 개인화된 음성 콘텐츠를 생성하는 방법
JP2023035787A5 (https=)
WO2021017302A1 (zh) 一种数据提取方法、装置、计算机系统及可读存储介质
CN119272880B (zh) 模型转换方法、设备、存储介质及程序产品
CN119920234B (zh) 基于自回归模型的语音克隆方法、装置、设备及存储介质
JP5383608B2 (ja) 解説放送文作成支援装置及びプログラム
CN114661941B (zh) 一种点击率预测模型构建方法、装置、计算机设备和存储介质
CN118737151A (zh) 一种会议对话记录要点生成方法、装置、设备及存储介质