JP7798901B2 - Systems and methods of handling speech audio stream interruptions - Google Patents
Info
- Publication number
- JP7798901B2 (application JP2023546311A)
- Authority
- JP
- Japan
- Prior art keywords
- stream
- voice
- interruption
- text
- audio stream
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/56—Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
- H04M3/568—Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
- H04N7/157—Conference systems defining a virtual conference space and using avatars or agents
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/39—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech synthesis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/40—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/60—Medium conversion
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2203/00—Aspects of automatic or semi-automatic exchanges
- H04M2203/20—Aspects of automatic or semi-automatic exchanges related to features of supplementary services
- H04M2203/2088—Call or conference reconnect, e.g. resulting from isdn terminal portability
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Telephonic Communication Services (AREA)
- Telephone Function (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US17/166,250 | 2021-02-03 | ||
| US17/166,250 US11580954B2 (en) | 2021-02-03 | 2021-02-03 | Systems and methods of handling speech audio stream interruptions |
| PCT/US2021/072831 WO2022169534A1 (en) | 2021-02-03 | 2021-12-09 | Systems and methods of handling speech audio stream interruptions |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| JP2024505944A (ja) | 2024-02-08 |
| JP2024505944A5 | 2024-12-03 |
| JP7798901B2 (ja) | 2026-01-14 |
Family
ID=79283143
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2023546311A Active JP7798901B2 (ja) | 2021-02-03 | 2021-12-09 | Systems and methods of handling speech audio stream interruptions |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US11580954B2 |
| EP (1) | EP4289129B1 |
| JP (1) | JP7798901B2 |
| KR (1) | KR20230133864A |
| CN (1) | CN116830559A |
| BR (1) | BR112023014966A2 |
| WO (1) | WO2022169534A1 |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20220303152A1 (en) * | 2021-03-18 | 2022-09-22 | Lenovo (Singapore) Pte. Ltd. | Recordation of video conference based on bandwidth issue(s) |
| US11895263B2 (en) * | 2021-05-25 | 2024-02-06 | International Business Machines Corporation | Interpreting conference call interruptions |
| US20240062750A1 (en) * | 2022-08-18 | 2024-02-22 | Avaya Management L.P. | Speech transmission from a telecommunication endpoint using phonetic characters |
| CN118018137A (zh) | 2022-11-08 | 2024-05-10 | MediaTek Singapore Pte. Ltd. (联发科技(新加坡)私人有限公司) | Audio playback method and apparatus |
| US20240428774A1 (en) * | 2023-06-21 | 2024-12-26 | International Business Machines Corporation | Cognitive assistant voice amelioration model |
| WO2026007057A1 (en) * | 2024-07-04 | 2026-01-08 | Ringcentral, Inc. | Systems and methods for recreating lost or inaudible speech in a conversation |
| US20260073903A1 (en) * | 2024-09-12 | 2026-03-12 | Cisco Technology, Inc. | Augmenting audio of communication sessions with transcribed visual content |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2001230801A (ja) | 2000-02-14 | 2001-08-24 | Sony Corp | Communication system and method therefor, communication service server, and communication terminal device |
| JP2008021058A (ja) | 2006-07-12 | 2008-01-31 | Nec Corp | Mobile phone device with translation function, voice data translation method, voice data translation program, and program recording medium |
| JP2016529839A (ja) | 2013-08-29 | 2016-09-23 | Unify GmbH & Co. KG | Method for maintaining voice communication over congested communication channels |
| US20180218727A1 (en) | 2017-02-02 | 2018-08-02 | Microsoft Technology Licensing, Llc | Artificially generated speech for a communication session |
| US20180226073A1 (en) | 2017-02-06 | 2018-08-09 | International Business Machines Corporation | Context-based cognitive speech to text engine |
Family Cites Families (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9922641B1 (en) * | 2012-10-01 | 2018-03-20 | Google Llc | Cross-lingual speaker adaptation for multi-lingual speech synthesis |
| US10187433B2 (en) * | 2013-03-15 | 2019-01-22 | Swyme Ip Bv | Methods and systems for dynamic adjustment of session parameters for effective video collaboration among heterogenous devices |
| DE102014018205A1 (de) * | 2014-12-09 | 2016-06-09 | Unify Gmbh & Co. Kg | Conference system and method for controlling the conference system |
| US9883144B2 (en) * | 2016-05-12 | 2018-01-30 | Fuji Xerox Co., Ltd. | System and method for replacing user media streams with animated avatars in live videoconferences |
| US9843673B1 (en) | 2016-11-14 | 2017-12-12 | Motorola Mobility Llc | Managing calls |
| US20180358003A1 (en) * | 2017-06-09 | 2018-12-13 | Qualcomm Incorporated | Methods and apparatus for improving speech communication and speech interface quality using neural networks |
| CN107393544B (zh) | 2017-06-19 | 2019-03-05 | Vivo Mobile Communication Co., Ltd. (维沃移动通信有限公司) | Voice signal repair method and mobile terminal |
| US10372298B2 (en) * | 2017-09-29 | 2019-08-06 | Apple Inc. | User interface for multi-user communication session |
| US20200090648A1 (en) | 2018-09-14 | 2020-03-19 | International Business Machines Corporation | Maintaining voice conversation continuity |
| US10971161B1 (en) * | 2018-12-12 | 2021-04-06 | Amazon Technologies, Inc. | Techniques for loss mitigation of audio streams |
| KR102740698B1 (ko) * | 2019-08-22 | 2024-12-11 | LG Electronics Inc. (엘지전자 주식회사) | Method and apparatus for speech synthesis based on emotion information |
| US11889128B2 (en) * | 2021-01-05 | 2024-01-30 | Qualcomm Incorporated | Call audio playback speed adjustment |
-
2021
- 2021-02-03 US US17/166,250 patent/US11580954B2/en active Active
- 2021-12-09 WO PCT/US2021/072831 patent/WO2022169534A1/en not_active Ceased
- 2021-12-09 CN CN202180092238.XA patent/CN116830559A/zh active Pending
- 2021-12-09 BR BR112023014966A patent/BR112023014966A2/pt unknown
- 2021-12-09 KR KR1020237025451A patent/KR20230133864A/ko active Pending
- 2021-12-09 EP EP21839806.3A patent/EP4289129B1/en active Active
- 2021-12-09 JP JP2023546311A patent/JP7798901B2/ja active Active
Also Published As
| Publication number | Publication date |
|---|---|
| BR112023014966A2 (pt) | 2024-01-23 |
| JP2024505944A (ja) | 2024-02-08 |
| WO2022169534A1 (en) | 2022-08-11 |
| TW202236084A (zh) | 2022-09-16 |
| US20220246133A1 (en) | 2022-08-04 |
| EP4289129A1 (en) | 2023-12-13 |
| US11580954B2 (en) | 2023-02-14 |
| EP4289129B1 (en) | 2025-09-03 |
| CN116830559A (zh) | 2023-09-29 |
| EP4289129C0 (en) | 2025-09-03 |
| KR20230133864A (ko) | 2023-09-19 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| JP7798901B2 (ja) | Systems and methods of handling speech audio stream interruptions | |
| US10680995B1 (en) | Continuous multimodal communication and recording system with automatic transmutation of audio and textual content | |
| EP2663064B1 (en) | Method and system for operating communication service | |
| US7822050B2 (en) | Buffering, pausing and condensing a live phone call | |
| US10228899B2 (en) | Monitoring environmental noise and data packets to display a transcription of call audio | |
| US20130211826A1 (en) | Audio Signals as Buffered Streams of Audio Signals and Metadata | |
| US20240029755A1 (en) | Intelligent speech or dialogue enhancement | |
| CN108920128B (zh) | Method and system for operating a presentation | |
| US20240087597A1 (en) | Source speech modification based on an input speech characteristic | |
| CN108288469A (zh) | Speaker and interaction method | |
| JP7842767B2 (ja) | Call audio playback speed adjustment | |
| CN105898486A (zh) | Bluetooth remote control and signal processing method for a Bluetooth remote control | |
| US12603957B2 (en) | Conference calls | |
| TWI914456B (zh) | Systems and methods of handling speech audio stream interruptions | |
| US20240402980A1 (en) | Disabling audio coding of media content when a no volume condition of a device is detected | |
| CN118918915A (zh) | Audio and video processing method, apparatus, storage medium, and terminal | |
| TWI917501B (zh) | Call audio playback speed adjustment | |
| US20250372081A1 (en) | Personalized nearby voice detection system | |
| JP4531013B2 (ja) | Audio-video conference system and terminal device | |
| CN119545295A (zh) | Integrated cockpit intercom control method, system, device, and medium | |
| CN118887944A (zh) | Voice data processing method, electronic device, service device, and readable storage medium | |
| CN121415765A (zh) | Streaming speech simultaneous interpretation method, related device, and computer program product | |
| WO2020177483A1 (zh) | Audio and video processing method, apparatus, electronic device, and storage medium | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20241125 |
| | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621; Effective date: 20241125 |
| | A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007; Effective date: 20250630 |
| | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131; Effective date: 20250701 |
| | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523; Effective date: 20251001 |
| | TRDD | Decision of grant or rejection written | |
| | A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01; Effective date: 20251202 |
| | A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61; Effective date: 20251225 |
| | R150 | Certificate of patent or registration of utility model | Ref document number: 7798901; Country of ref document: JP; Free format text: JAPANESE INTERMEDIATE CODE: R150 |