JP7798901B2 - 音声オーディオストリーム中断を処理するシステムおよび方法 - Google Patents

音声オーディオストリーム中断を処理するシステムおよび方法

Info

Publication number
JP7798901B2
JP7798901B2 JP2023546311A JP2023546311A JP7798901B2 JP 7798901 B2 JP7798901 B2 JP 7798901B2 JP 2023546311 A JP2023546311 A JP 2023546311A JP 2023546311 A JP2023546311 A JP 2023546311A JP 7798901 B2 JP7798901 B2 JP 7798901B2
Authority
JP
Japan
Prior art keywords
stream
voice
interruption
text
audio stream
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2023546311A
Other languages
English (en)
Japanese (ja)
Other versions
JP2024505944A (ja
JP2024505944A5 (https=
Inventor
フェルディナンド・オリヴィエリ
リード・ウェストバーグ
シャンカール・タガドゥル・シヴァッパ
Original Assignee
クアルコム,インコーポレイテッド
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by クアルコム,インコーポレイテッド filed Critical クアルコム,インコーポレイテッド
Publication of JP2024505944A publication Critical patent/JP2024505944A/ja
Publication of JP2024505944A5 publication Critical patent/JP2024505944A5/ja
Application granted granted Critical
Publication of JP7798901B2 publication Critical patent/JP7798901B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/568Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/027Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/157Conference systems defining a virtual conference space and using avatars or agents
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/39Electronic components, circuits, software, systems or apparatus used in telephone systems using speech synthesis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/60Medium conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/20Aspects of automatic or semi-automatic exchanges related to features of supplementary services
    • H04M2203/2088Call or conference reconnect, e.g. resulting from isdn terminal portability

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)
JP2023546311A 2021-02-03 2021-12-09 音声オーディオストリーム中断を処理するシステムおよび方法 Active JP7798901B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/166,250 2021-02-03
US17/166,250 US11580954B2 (en) 2021-02-03 2021-02-03 Systems and methods of handling speech audio stream interruptions
PCT/US2021/072831 WO2022169534A1 (en) 2021-02-03 2021-12-09 Systems and methods of handling speech audio stream interruptions

Publications (3)

Publication Number Publication Date
JP2024505944A JP2024505944A (ja) 2024-02-08
JP2024505944A5 JP2024505944A5 (https=) 2024-12-03
JP7798901B2 true JP7798901B2 (ja) 2026-01-14

Family

ID=79283143

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023546311A Active JP7798901B2 (ja) 2021-02-03 2021-12-09 音声オーディオストリーム中断を処理するシステムおよび方法

Country Status (7)

Country Link
US (1) US11580954B2 (https=)
EP (1) EP4289129B1 (https=)
JP (1) JP7798901B2 (https=)
KR (1) KR20230133864A (https=)
CN (1) CN116830559A (https=)
BR (1) BR112023014966A2 (https=)
WO (1) WO2022169534A1 (https=)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220303152A1 (en) * 2021-03-18 2022-09-22 Lenovo (Singapore) Pte. Ltd. Recordation of video conference based on bandwidth issue(s)
US11895263B2 (en) * 2021-05-25 2024-02-06 International Business Machines Corporation Interpreting conference call interruptions
US20240062750A1 (en) * 2022-08-18 2024-02-22 Avaya Management L.P. Speech transmission from a telecommunication endpoint using phonetic characters
CN118018137A (zh) 2022-11-08 2024-05-10 联发科技(新加坡)私人有限公司 音频播放方法及装置
US20240428774A1 (en) * 2023-06-21 2024-12-26 International Business Machines Corporation Cognitive assistant voice amelioration model
WO2026007057A1 (en) * 2024-07-04 2026-01-08 Ringcentral, Inc. Systems and methods for recreating lost or inaudible speech in a conversation
US20260073903A1 (en) * 2024-09-12 2026-03-12 Cisco Technology, Inc. Augmenting audio of communication sessions with transcribed visual content

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001230801A (ja) 2000-02-14 2001-08-24 Sony Corp 通信システムとその方法、通信サービスサーバおよび通信端末装置
JP2008021058A (ja) 2006-07-12 2008-01-31 Nec Corp 翻訳機能付き携帯電話装置、音声データ翻訳方法、音声データ翻訳プログラムおよびプログラム記録媒体
JP2016529839A (ja) 2013-08-29 2016-09-23 ユニファイ ゲゼルシャフト ミット ベシュレンクテル ハフツング ウント コンパニー コマンディートゲゼルシャフトUnify GmbH & Co. KG 混雑した通信チャネルでの音声通信の維持方法
US20180218727A1 (en) 2017-02-02 2018-08-02 Microsoft Technology Licensing, Llc Artificially generated speech for a communication session
US20180226073A1 (en) 2017-02-06 2018-08-09 International Business Machines Corporation Context-based cognitive speech to text engine

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9922641B1 (en) * 2012-10-01 2018-03-20 Google Llc Cross-lingual speaker adaptation for multi-lingual speech synthesis
US10187433B2 (en) * 2013-03-15 2019-01-22 Swyme Ip Bv Methods and systems for dynamic adjustment of session parameters for effective video collaboration among heterogenous devices
DE102014018205A1 (de) * 2014-12-09 2016-06-09 Unify Gmbh & Co. Kg Konferenzsystem und Verfahren zum Steuern des Konferenzsystems
US9883144B2 (en) * 2016-05-12 2018-01-30 Fuji Xerox Co., Ltd. System and method for replacing user media streams with animated avatars in live videoconferences
US9843673B1 (en) 2016-11-14 2017-12-12 Motorola Mobility Llc Managing calls
US20180358003A1 (en) * 2017-06-09 2018-12-13 Qualcomm Incorporated Methods and apparatus for improving speech communication and speech interface quality using neural networks
CN107393544B (zh) 2017-06-19 2019-03-05 维沃移动通信有限公司 一种语音信号修复方法及移动终端
US10372298B2 (en) * 2017-09-29 2019-08-06 Apple Inc. User interface for multi-user communication session
US20200090648A1 (en) 2018-09-14 2020-03-19 International Business Machines Corporation Maintaining voice conversation continuity
US10971161B1 (en) * 2018-12-12 2021-04-06 Amazon Technologies, Inc. Techniques for loss mitigation of audio streams
KR102740698B1 (ko) * 2019-08-22 2024-12-11 엘지전자 주식회사 감정 정보 기반의 음성 합성 방법 및 장치
US11889128B2 (en) * 2021-01-05 2024-01-30 Qualcomm Incorporated Call audio playback speed adjustment

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001230801A (ja) 2000-02-14 2001-08-24 Sony Corp 通信システムとその方法、通信サービスサーバおよび通信端末装置
JP2008021058A (ja) 2006-07-12 2008-01-31 Nec Corp 翻訳機能付き携帯電話装置、音声データ翻訳方法、音声データ翻訳プログラムおよびプログラム記録媒体
JP2016529839A (ja) 2013-08-29 2016-09-23 ユニファイ ゲゼルシャフト ミット ベシュレンクテル ハフツング ウント コンパニー コマンディートゲゼルシャフトUnify GmbH & Co. KG 混雑した通信チャネルでの音声通信の維持方法
US20180218727A1 (en) 2017-02-02 2018-08-02 Microsoft Technology Licensing, Llc Artificially generated speech for a communication session
US20180226073A1 (en) 2017-02-06 2018-08-09 International Business Machines Corporation Context-based cognitive speech to text engine

Also Published As

Publication number Publication date
BR112023014966A2 (pt) 2024-01-23
JP2024505944A (ja) 2024-02-08
WO2022169534A1 (en) 2022-08-11
TW202236084A (zh) 2022-09-16
US20220246133A1 (en) 2022-08-04
EP4289129A1 (en) 2023-12-13
US11580954B2 (en) 2023-02-14
EP4289129B1 (en) 2025-09-03
CN116830559A (zh) 2023-09-29
EP4289129C0 (en) 2025-09-03
KR20230133864A (ko) 2023-09-19

Similar Documents

Publication Publication Date Title
JP7798901B2 (ja) 音声オーディオストリーム中断を処理するシステムおよび方法
US10680995B1 (en) Continuous multimodal communication and recording system with automatic transmutation of audio and textual content
EP2663064B1 (en) Method and system for operating communication service
US7822050B2 (en) Buffering, pausing and condensing a live phone call
US10228899B2 (en) Monitoring environmental noise and data packets to display a transcription of call audio
US20130211826A1 (en) Audio Signals as Buffered Streams of Audio Signals and Metadata
US20240029755A1 (en) Intelligent speech or dialogue enhancement
CN108920128B (zh) 演示文稿的操作方法及系统
US20240087597A1 (en) Source speech modification based on an input speech characteristic
CN108288469A (zh) 一种音箱及交互方法
JP7842767B2 (ja) 通話オーディオ再生速度調整
CN105898486A (zh) 蓝牙遥控器及蓝牙遥控器的信号处理方法
US12603957B2 (en) Conference calls
TWI914456B (zh) 處理語音音頻流中斷的系統和方法
US20240402980A1 (en) Disabling audio coding of media content when a no volume condition of a device is detected
CN118918915A (zh) 音视频处理方法、装置、存储介质以及终端
TWI917501B (zh) 通話音頻回放速度調整
US20250372081A1 (en) Personalized nearby voice detection system
JP4531013B2 (ja) 映像音声会議システムおよび端末装置
CN119545295A (zh) 一种座舱对讲集成控制方法、系统、设备及介质
CN118887944A (zh) 一种语音数据的处理方法、电子设备、服务设备及可读存储介质
CN121415765A (zh) 流式语音同传方法、相关设备及计算机程序产品
WO2020177483A1 (zh) 音视频处理方法、装置、电子设备及存储介质

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241125

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20241125

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20250630

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20250701

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20251001

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20251202

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20251225

R150 Certificate of patent or registration of utility model

Ref document number: 7798901

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150