JP2025504748A - Automatic in-game subtitles and closed captions - Google Patents
Automatic in-game subtitles and closed captions
- Publication number
- JP2025504748A (application JP2024535349A)
- Authority
- JP
- Japan
- Prior art keywords
- game
- subtitle
- overlay
- audio
- audio stream
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/30—Interconnection arrangements between game servers and game devices; Interconnection arrangements between game devices; Interconnection arrangements between game servers
- A63F13/35—Details of game servers
- A63F13/355—Performing operations on behalf of clients with restricted processing capabilities, e.g. servers transform changing game scene into an encoded video stream for transmitting to a mobile phone or a thin client
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/40—Processing input control signals of video game devices, e.g. signals generated by the player or derived from the environment
- A63F13/42—Processing input control signals of video game devices, e.g. signals generated by the player or derived from the environment by mapping the input signals into game commands, e.g. mapping the displacement of a stylus on a touch screen to the steering angle of a virtual vehicle
- A63F13/424—Processing input control signals of video game devices, e.g. signals generated by the player or derived from the environment by mapping the input signals into game commands, e.g. mapping the displacement of a stylus on a touch screen to the steering angle of a virtual vehicle involving acoustic input signals, e.g. by using the results of pitch or rhythm extraction or voice recognition
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/50—Controlling the output signals based on the game progress
- A63F13/53—Controlling the output signals based on the game progress involving additional visual information provided to the game scene, e.g. by overlay to simulate a head-up display [HUD] or displaying a laser sight in a shooting game
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F13/00—Video games, i.e. games using an electronically generated display having two or more dimensions
- A63F13/85—Providing additional services to players
- A63F13/87—Communicating with other players during game play, e.g. by e-mail or chat
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/26—Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/478—Supplemental services, e.g. displaying phone caller identification, shopping application
- H04N21/4781—Games
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/488—Data services, e.g. news ticker
- H04N21/4884—Data services, e.g. news ticker for displaying subtitles
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F2300/00—Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
- A63F2300/30—Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game characterized by output arrangements for receiving control signals generated by the game device
- A63F2300/303—Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game characterized by output arrangements for receiving control signals generated by the game device for displaying additional data, e.g. simulating a Head Up Display
-
- A—HUMAN NECESSITIES
- A63—SPORTS; GAMES; AMUSEMENTS
- A63F—CARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
- A63F2300/00—Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
- A63F2300/50—Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game characterized by details of game servers
- A63F2300/57—Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game characterized by details of game servers details of game services offered to the player
- A63F2300/572—Communication between players during game play of non game information, e.g. e-mail, chat, file transfer, streaming of audio and streaming of video
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Optics & Photonics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Business, Economics & Management (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- Processing Or Creating Images (AREA)
- Studio Devices (AREA)
- User Interface Of Digital Computer (AREA)
- Studio Circuits (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/561,477 US11857877B2 (en) | 2021-12-23 | 2021-12-23 | Automatic in-game subtitles and closed captions |
| US17/561,477 | 2021-12-23 | | |
| PCT/US2022/051581 WO2023121850A1 (en) | 2021-12-23 | 2022-12-01 | Automatic in-game subtitles and closed captions |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2025504748A (ja) | 2025-02-19 |
| JP2025504748A5 | 2025-10-03 |
Family
ID=86898719
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2024535349A (Pending), published as JP2025504748A (ja) | 2021-12-23 | 2022-12-01 | Automatic in-game subtitles and closed captions |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US11857877B2 |
| EP (1) | EP4452432A4 |
| JP (1) | JP2025504748A |
| KR (1) | KR20240131376A |
| CN (1) | CN118414200A |
| WO (1) | WO2023121850A1 |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11857877B2 (en) * | 2021-12-23 | 2024-01-02 | Ati Technologies Ulc | Automatic in-game subtitles and closed captions |
| US20240022682A1 (en) * | 2022-07-13 | 2024-01-18 | Sony Interactive Entertainment LLC | Systems and methods for communicating audio data |
| GB2622405A (en) * | 2022-09-15 | 2024-03-20 | Sony Interactive Entertainment Inc | Systems and methods for controlling dialogue complexity in video games |
| TWI891080B (zh) * | 2023-10-05 | 2025-07-21 | 宏碁股份有限公司 | Electronic device and video clip extraction method thereof |
Family Cites Families (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10987597B2 (en) * | 2002-12-10 | 2021-04-27 | Sony Interactive Entertainment LLC | System and method for managing audio and video channels for video game players and spectators |
| US8620139B2 (en) * | 2011-04-29 | 2013-12-31 | Microsoft Corporation | Utilizing subtitles in multiple languages to facilitate second-language learning |
| EP2525568B1 (en) * | 2011-05-19 | 2017-11-15 | EchoStar Technologies L.L.C. | Automatic subtitle resizing |
| US8839292B1 (en) * | 2011-12-13 | 2014-09-16 | Google Inc. | Systems and methods for rendering multiple applications on television screens |
| US10304458B1 (en) * | 2014-03-06 | 2019-05-28 | Board of Trustees of the University of Alabama and the University of Alabama in Huntsville | Systems and methods for transcribing videos using speaker identification |
| EP3220374A4 (en) * | 2014-11-12 | 2018-07-18 | Fujitsu Limited | Wearable device, display control method, and display control program |
| KR102202576B1 (ko) * | 2014-12-12 | 2021-01-13 | 삼성전자주식회사 | Device and method for controlling sound output |
| US9922095B2 (en) * | 2015-06-02 | 2018-03-20 | Microsoft Technology Licensing, Llc | Automated closed captioning using temporal data |
| US10332506B2 (en) * | 2015-09-02 | 2019-06-25 | Oath Inc. | Computerized system and method for formatted transcription of multimedia content |
| KR20170035502A (ko) * | 2015-09-23 | 2017-03-31 | 삼성전자주식회사 | Display apparatus and control method thereof |
| US10179291B2 (en) * | 2016-12-09 | 2019-01-15 | Microsoft Technology Licensing, Llc | Session speech-to-text conversion |
| KR20180087009A (ko) * | 2017-01-24 | 2018-08-01 | 주식회사 소리자바 | Subtitle providing system, terminal, and subtitle server using real-time audio streaming analysis |
| US10299008B1 (en) * | 2017-11-21 | 2019-05-21 | International Business Machines Corporation | Smart closed caption positioning system for video content |
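| CN108491127B (zh) * | 2018-03-12 | 2020-02-07 | Oppo广东移动通信有限公司 | Input method interface display method, apparatus, terminal, and storage medium |
| CN112154658B (zh) * | 2018-05-29 | 2024-07-23 | 索尼公司 | Image processing apparatus, image processing method, and storage medium |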
| US12451154B2 (en) * | 2018-08-08 | 2025-10-21 | Comcast Cable Communications, Llc | Generating and/or displaying synchronized captions |
| EP3719613B1 (en) * | 2019-04-01 | 2026-05-06 | Nokia Technologies Oy | Rendering captions for media content |
| KR102261597B1 (ko) * | 2019-04-23 | 2021-06-07 | 주식회사 비포에이 | Subtitle processing device for VR video content |
| EP3963580B1 (en) * | 2019-05-02 | 2025-10-15 | Google LLC | Automatically captioning audible parts of content on a computing device |
| US11094324B2 (en) * | 2019-05-14 | 2021-08-17 | Motorola Mobility Llc | Accumulative multi-cue activation of domain-specific automatic speech recognition engine |
| US10885893B2 (en) * | 2019-06-06 | 2021-01-05 | Sony Corporation | Textual display of aural information broadcast via frequency modulated signals |
| US20210074298A1 (en) * | 2019-09-11 | 2021-03-11 | Soundhound, Inc. | Video conference captioning |
| US11172266B2 (en) * | 2019-11-04 | 2021-11-09 | Sling Media, L.L.C. | System to correct closed captioning display using context from audio/video |
| US11295497B2 (en) * | 2019-11-25 | 2022-04-05 | International Business Machines Corporation | Dynamic subtitle enhancement |
| CN111556372A (zh) * | 2020-04-20 | 2020-08-18 | 北京甲骨今声科技有限公司 | Method and apparatus for adding subtitles to audio and video programs in real time |
| US11557121B2 (en) * | 2020-04-26 | 2023-01-17 | Cloudinary Ltd. | System, device, and method for generating and utilizing content-aware metadata |
| US11475895B2 (en) * | 2020-07-06 | 2022-10-18 | Meta Platforms, Inc. | Caption customization and editing |
| US20230055421A1 (en) * | 2020-09-16 | 2023-02-23 | Meta Platforms, Inc. | Caption customization and editing |
| US11418849B2 (en) * | 2020-10-22 | 2022-08-16 | Rovi Guides, Inc. | Systems and methods for inserting emoticons within a media asset |
| US20240064485A1 (en) * | 2020-11-30 | 2024-02-22 | The Regents Of The University Of California | Systems and methods for sound-enhanced meeting platforms |
| US12342102B2 (en) * | 2021-11-19 | 2025-06-24 | Apple Inc. | Systems and methods for managing captions |
| US11857877B2 (en) * | 2021-12-23 | 2024-01-02 | Ati Technologies Ulc | Automatic in-game subtitles and closed captions |
2021
- 2021-12-23: US application US17/561,477, published as US11857877B2 (status: Active)
2022
- 2022-12-01: EP application EP22912270.0A, published as EP4452432A4 (status: Pending)
- 2022-12-01: WO application PCT/US2022/051581, published as WO2023121850A1 (status: Ceased)
- 2022-12-01: CN application CN202280084788.1A, published as CN118414200A (status: Pending)
- 2022-12-01: KR application KR1020247024720A, published as KR20240131376A (status: Pending)
- 2022-12-01: JP application JP2024535349A, published as JP2025504748A (status: Pending)
2023
- 2023-11-28: US application US18/520,717, published as US12427413B2 (status: Active)
Also Published As
| Publication number | Publication date |
|---|---|
| US20230201717A1 (en) | 2023-06-29 |
| EP4452432A4 (en) | 2025-12-31 |
| WO2023121850A1 (en) | 2023-06-29 |
| US12427413B2 (en) | 2025-09-30 |
| EP4452432A1 (en) | 2024-10-30 |
| US11857877B2 (en) | 2024-01-02 |
| US20240091640A1 (en) | 2024-03-21 |
| CN118414200A (zh) | 2024-07-30 |
| KR20240131376A (ko) | 2024-08-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11857877B2 (en) | | Automatic in-game subtitles and closed captions |
| US11386903B2 (en) | | Methods and systems for speech presentation based on simulated binaural audio signals |
| CN110473525B (zh) | | Method and apparatus for obtaining speech training samples |
| WO2009104564A1 (ja) | | Conversation server in virtual space, method for conversation, and computer program |
| JP2008500573A (ja) | | Method and system for modifying messages |
| US20240046914A1 (en) | | Assisted speech |
| US12141902B2 (en) | | System and methods for resolving audio conflicts in extended reality environments |
| JP2025504748A5 | | |
| US20250356842A1 (en) | | Voice chat translation |
| US20230412766A1 (en) | | Information processing system, information processing method, and computer program |
| US12293759B2 (en) | | Method and device for presenting a CGR environment based on audio data and lyric data |
| JP7225642B2 (ja) | | Communication robot, control method, and control program |
| WO2010140254A1 (ja) | | Video and audio output device and sound localization method |
| WO2025025564A1 (zh) | | Avatar control method, apparatus, and related device |
| CN119450163A (zh) | | Avatar control method, apparatus, and related device |
| US20110311059A1 (en) | | Method of navigating in a sound content |
| EP4290516A1 (en) | | Audio processing system and method |
| US20250365549A1 (en) | | Audio streams in mixed voice chat in a virtual environment |
| JP7425243B1 (ja) | | Information processing apparatus and information processing method |
| JP7809998B2 (ja) | | Analysis system, information processing apparatus, analysis method, and program |
| US20240267572A1 (en) | | Content modification system and method |
| US12119021B1 (en) | | Situational awareness for head mounted devices |
| de Magalhães | | Spatial identification of surround sound in games |
| US20220351727A1 (en) | | Conversaton method, conversation system, conversation apparatus, and program |
| GB2621873A (en) | | Content display system and method |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; effective date: 2024-08-15 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; effective date: 2025-09-25 |
| | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; effective date: 2025-09-25 |