BR112023014966A2 - Sistemas e métodos de tratamento de interrupções de fluxo de áudio de fala - Google Patents

Sistemas e métodos de tratamento de interrupções de fluxo de áudio de fala

Info

Publication number
BR112023014966A2
BR112023014966A2 BR112023014966A BR112023014966A BR112023014966A2 BR 112023014966 A2 BR112023014966 A2 BR 112023014966A2 BR 112023014966 A BR112023014966 A BR 112023014966A BR 112023014966 A BR112023014966 A BR 112023014966A BR 112023014966 A2 BR112023014966 A2 BR 112023014966A2
Authority
BR
Brazil
Prior art keywords
speech audio
audio stream
systems
methods
processors
Prior art date
Application number
BR112023014966A
Other languages
English (en)
Portuguese (pt)
Inventor
Ferdinando Olivieri
Reid Westburg
Thagadur Shivappa Shankar
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of BR112023014966A2 publication Critical patent/BR112023014966A2/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/568Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/027Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems
    • H04N7/157Conference systems defining a virtual conference space and using avatars or agents
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/39Electronic components, circuits, software, systems or apparatus used in telephone systems using speech synthesis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/60Medium conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/20Aspects of automatic or semi-automatic exchanges related to features of supplementary services
    • H04M2203/2088Call or conference reconnect, e.g. resulting from isdn terminal portability

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)
BR112023014966A 2021-02-03 2021-12-09 Sistemas e métodos de tratamento de interrupções de fluxo de áudio de fala BR112023014966A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/166,250 US11580954B2 (en) 2021-02-03 2021-02-03 Systems and methods of handling speech audio stream interruptions
PCT/US2021/072831 WO2022169534A1 (en) 2021-02-03 2021-12-09 Systems and methods of handling speech audio stream interruptions

Publications (1)

Publication Number Publication Date
BR112023014966A2 true BR112023014966A2 (pt) 2024-01-23

Family

ID=79283143

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023014966A BR112023014966A2 (pt) 2021-02-03 2021-12-09 Sistemas e métodos de tratamento de interrupções de fluxo de áudio de fala

Country Status (7)

Country Link
US (1) US11580954B2 (https=)
EP (1) EP4289129B1 (https=)
JP (1) JP7798901B2 (https=)
KR (1) KR20230133864A (https=)
CN (1) CN116830559A (https=)
BR (1) BR112023014966A2 (https=)
WO (1) WO2022169534A1 (https=)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220303152A1 (en) * 2021-03-18 2022-09-22 Lenovo (Singapore) Pte. Ltd. Recordation of video conference based on bandwidth issue(s)
US11895263B2 (en) * 2021-05-25 2024-02-06 International Business Machines Corporation Interpreting conference call interruptions
US20240062750A1 (en) * 2022-08-18 2024-02-22 Avaya Management L.P. Speech transmission from a telecommunication endpoint using phonetic characters
CN118018137A (zh) 2022-11-08 2024-05-10 联发科技(新加坡)私人有限公司 音频播放方法及装置
US20240428774A1 (en) * 2023-06-21 2024-12-26 International Business Machines Corporation Cognitive assistant voice amelioration model
WO2026007057A1 (en) * 2024-07-04 2026-01-08 Ringcentral, Inc. Systems and methods for recreating lost or inaudible speech in a conversation
US20260073903A1 (en) * 2024-09-12 2026-03-12 Cisco Technology, Inc. Augmenting audio of communication sessions with transcribed visual content

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001230801A (ja) * 2000-02-14 2001-08-24 Sony Corp 通信システムとその方法、通信サービスサーバおよび通信端末装置
JP2008021058A (ja) * 2006-07-12 2008-01-31 Nec Corp 翻訳機能付き携帯電話装置、音声データ翻訳方法、音声データ翻訳プログラムおよびプログラム記録媒体
US9922641B1 (en) * 2012-10-01 2018-03-20 Google Llc Cross-lingual speaker adaptation for multi-lingual speech synthesis
US10187433B2 (en) * 2013-03-15 2019-01-22 Swyme Ip Bv Methods and systems for dynamic adjustment of session parameters for effective video collaboration among heterogenous devices
KR101787594B1 (ko) 2013-08-29 2017-10-18 유니파이 게엠베하 운트 코. 카게 혼잡한 통신 채널에서 오디오 통신의 유지
DE102014018205A1 (de) * 2014-12-09 2016-06-09 Unify Gmbh & Co. Kg Konferenzsystem und Verfahren zum Steuern des Konferenzsystems
US9883144B2 (en) * 2016-05-12 2018-01-30 Fuji Xerox Co., Ltd. System and method for replacing user media streams with animated avatars in live videoconferences
US9843673B1 (en) 2016-11-14 2017-12-12 Motorola Mobility Llc Managing calls
US10147415B2 (en) * 2017-02-02 2018-12-04 Microsoft Technology Licensing, Llc Artificially generated speech for a communication session
US20180226073A1 (en) * 2017-02-06 2018-08-09 International Business Machines Corporation Context-based cognitive speech to text engine
US20180358003A1 (en) * 2017-06-09 2018-12-13 Qualcomm Incorporated Methods and apparatus for improving speech communication and speech interface quality using neural networks
CN107393544B (zh) 2017-06-19 2019-03-05 维沃移动通信有限公司 一种语音信号修复方法及移动终端
US10372298B2 (en) * 2017-09-29 2019-08-06 Apple Inc. User interface for multi-user communication session
US20200090648A1 (en) 2018-09-14 2020-03-19 International Business Machines Corporation Maintaining voice conversation continuity
US10971161B1 (en) * 2018-12-12 2021-04-06 Amazon Technologies, Inc. Techniques for loss mitigation of audio streams
KR102740698B1 (ko) * 2019-08-22 2024-12-11 엘지전자 주식회사 감정 정보 기반의 음성 합성 방법 및 장치
US11889128B2 (en) * 2021-01-05 2024-01-30 Qualcomm Incorporated Call audio playback speed adjustment

Also Published As

Publication number Publication date
JP2024505944A (ja) 2024-02-08
WO2022169534A1 (en) 2022-08-11
TW202236084A (zh) 2022-09-16
US20220246133A1 (en) 2022-08-04
EP4289129A1 (en) 2023-12-13
US11580954B2 (en) 2023-02-14
EP4289129B1 (en) 2025-09-03
JP7798901B2 (ja) 2026-01-14
CN116830559A (zh) 2023-09-29
EP4289129C0 (en) 2025-09-03
KR20230133864A (ko) 2023-09-19

Similar Documents

Publication Publication Date Title
BR112023014966A2 (pt) Sistemas e métodos de tratamento de interrupções de fluxo de áudio de fala
EP4057279A3 (en) Natural assistant interaction
MX2021013237A (es) Salida personalizada para optimizar segun las preferencias de usuario en un sistema distribuido.
SG10201707702YA (en) Collaborative Voice Controlled Devices
EP4332961A3 (en) Voice interaction services
CO2019013913A2 (es) Sistemas y métodos para proporcionar audio y datos en tiempo real
WO2019221851A3 (en) Building management hvac control using human sensors
EP3862931A3 (en) Gesture feedback in distributed neural network system
WO2019118469A3 (en) Methods and systems for management of media content associated with message context on mobile computing devices
EP4617851A3 (en) Dynamic computation of system response volume
WO2004100638A3 (en) Source-dependent text-to-speech system
EP3920181A3 (en) Text independent speaker recognition
WO2014022306A3 (en) Dynamic context-based language determination
MX2018001498A (es) Control de una nube de dispositivos.
CL2017001397A1 (es) Anuncio de tráfico en trayectoria de datos de red con concocimiento de vecinos (nan)
EP3751561A3 (en) Hotword recognition
MX2018015642A (es) Dispositivo de procesamiento de informacion, dispositivo de recepcion, y metodo de procesamiento de informacion.
EP4235648A3 (en) Language model biasing
CA3002470A1 (en) Systems and methods for media production and editing
MX2018015248A (es) Sistema y metodo para incorporar contenido creativo de marca en servicios de mensajeria.
PH12022553483A1 (en) Context-aware hardware-based voice activity detection
PH12019000353A1 (en) Natural language processing based sign language generation
WO2019234486A8 (en) Speech recognition system, information processing device and server
WO2016064155A3 (ko) Sns를 이용한 감성 조명 제어 시스템 및 방법
SG155161A1 (en) Devices and methods for routeing a unit of data in a network