PH12021552746A1 - Customized output to optimize for user preference in a distributed system - Google Patents

Customized output to optimize for user preference in a distributed system

Info

Publication number
PH12021552746A1
PH12021552746A1 PH1/2021/552746A PH12021552746A PH12021552746A1 PH 12021552746 A1 PH12021552746 A1 PH 12021552746A1 PH 12021552746 A PH12021552746 A PH 12021552746A PH 12021552746 A1 PH12021552746 A1 PH 12021552746A1
Authority
PH
Philippines
Prior art keywords
user
distributed
transcript
meeting
user preference
Prior art date
Application number
PH1/2021/552746A
Other languages
English (en)
Inventor
Zhuo Chen
Dimitrios Basile Dimitriadis
William Isaac Hinthorn
Xuedong Huang
Lijuan Qin
Andreas Stolcke
Takuya Yoshioka
Nanshan Zeng
Original Assignee
Microsoft Technology Licensing Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing Llc filed Critical Microsoft Technology Licensing Llc
Publication of PH12021552746A1 publication Critical patent/PH12021552746A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies
    • G10L17/08Use of distortion metrics or a particular distance between probe pattern and reference templates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • H04L12/1813Arrangements for providing special services to substations for broadcast or conference, e.g. multicast for computer conferences, e.g. chat rooms
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/403Arrangements for multi-party communication, e.g. for conferences
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Biomedical Technology (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
PH1/2021/552746A 2019-04-30 2020-03-17 Customized output to optimize for user preference in a distributed system PH12021552746A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/398,836 US11023690B2 (en) 2019-04-30 2019-04-30 Customized output to optimize for user preference in a distributed system
PCT/US2020/023054 WO2020222925A1 (en) 2019-04-30 2020-03-17 Customized output to optimize for user preference in a distributed system

Publications (1)

Publication Number Publication Date
PH12021552746A1 true PH12021552746A1 (en) 2022-07-11

Family

ID=70277494

Family Applications (1)

Application Number Title Priority Date Filing Date
PH1/2021/552746A PH12021552746A1 (en) 2019-04-30 2020-03-17 Customized output to optimize for user preference in a distributed system

Country Status (14)

Country Link
US (1) US11023690B2 (https=)
EP (1) EP3963574B1 (https=)
JP (1) JP7536789B2 (https=)
KR (1) KR20220002326A (https=)
CN (1) CN113874936A (https=)
AU (1) AU2020265509A1 (https=)
BR (1) BR112021020017A2 (https=)
CA (1) CA3132837A1 (https=)
IL (1) IL287494A (https=)
MX (1) MX2021013237A (https=)
PH (1) PH12021552746A1 (https=)
SG (1) SG11202109373TA (https=)
WO (1) WO2020222925A1 (https=)
ZA (1) ZA202106434B (https=)

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11810575B2 (en) * 2019-06-12 2023-11-07 Lg Electronics Inc. Artificial intelligence robot for providing voice recognition function and method of operating the same
US11783135B2 (en) * 2020-02-25 2023-10-10 Vonage Business, Inc. Systems and methods for providing and using translation-enabled multiparty communication sessions
US11521636B1 (en) 2020-05-13 2022-12-06 Benjamin Slotznick Method and apparatus for using a test audio pattern to generate an audio signal transform for use in performing acoustic echo cancellation
US11107490B1 (en) 2020-05-13 2021-08-31 Benjamin Slotznick System and method for adding host-sent audio streams to videoconferencing meetings, without compromising intelligibility of the conversational components
US11373425B2 (en) 2020-06-02 2022-06-28 The Nielsen Company (U.S.), Llc Methods and apparatus for monitoring an audience of media based on thermal imaging
US11915716B2 (en) * 2020-07-16 2024-02-27 International Business Machines Corporation Audio modifying conferencing system
CN111935111B (zh) * 2020-07-27 2023-04-07 北京字节跳动网络技术有限公司 交互方法、装置和电子设备
US11553247B2 (en) 2020-08-20 2023-01-10 The Nielsen Company (Us), Llc Methods and apparatus to determine an audience composition based on thermal imaging and facial recognition
US11595723B2 (en) 2020-08-20 2023-02-28 The Nielsen Company (Us), Llc Methods and apparatus to determine an audience composition based on voice recognition
US11763591B2 (en) * 2020-08-20 2023-09-19 The Nielsen Company (Us), Llc Methods and apparatus to determine an audience composition based on voice recognition, thermal imaging, and facial recognition
WO2022101890A1 (en) * 2020-11-16 2022-05-19 Vocal Power-House Systems, LLC Responsive communication system
US20220311764A1 (en) * 2021-03-24 2022-09-29 Daniel Oke Device for and method of automatically disabling access to a meeting via computer
CN112966096B (zh) * 2021-04-07 2022-05-24 重庆大学 一种基于多任务学习的云服务发现方法
JP2024516815A (ja) * 2021-04-30 2024-04-17 ドルビー ラボラトリーズ ライセンシング コーポレイション エピソード的コンテンツをサポートする話者ダイアライゼーション
WO2022235918A1 (en) * 2021-05-05 2022-11-10 Deep Media Inc. Audio and video translator
WO2022271295A1 (en) * 2021-06-22 2022-12-29 Microsoft Technology Licensing, Llc Distributed processing of communication session services
WO2023048747A1 (en) * 2021-09-24 2023-03-30 Intel Corporation Systems, apparatus, articles of manufacture, and methods for cross training and collaborative artificial intelligence for proactive data management and analytics
US11330229B1 (en) * 2021-09-28 2022-05-10 Atlassian Pty Ltd. Apparatuses, computer-implemented methods, and computer program products for generating a collaborative contextual summary interface in association with an audio-video conferencing interface service
US12555565B2 (en) * 2021-09-30 2026-02-17 Samsung Electronics Co., Ltd. Device and method with target speaker identification
WO2023100374A1 (ja) * 2021-12-03 2023-06-08 日本電信電話株式会社 信号処理装置、信号処理方法及び信号処理プログラム
FR3130437B1 (fr) * 2021-12-14 2024-09-27 Orange Procédé et dispositif de sélection d’un capteur audio parmi une pluralité de capteurs audio
US11722536B2 (en) 2021-12-27 2023-08-08 Atlassian Pty Ltd. Apparatuses, computer-implemented methods, and computer program products for managing a shared dynamic collaborative presentation progression interface in association with an audio-video conferencing interface service
JP7804283B2 (ja) * 2022-04-18 2026-01-22 国立研究開発法人情報通信研究機構 同時通訳装置、同時通訳システム、同時通訳処理方法、および、プログラム
US20230351123A1 (en) * 2022-04-29 2023-11-02 Zoom Video Communications, Inc. Providing multistream machine translation during virtual conferences
US12457378B2 (en) 2022-04-29 2025-10-28 MIXHalo Corp. Synchronized audio streams for live broadcasts
US20230385740A1 (en) * 2022-05-27 2023-11-30 Microsoft Technology Licensing, Llc Meeting Analysis and Coaching System
CN115240693A (zh) * 2022-06-28 2022-10-25 安睿杰翻译(上海)有限公司 基于云的远程多声道翻译系统
US20240127818A1 (en) * 2022-10-12 2024-04-18 discourse.ai, Inc. Structuring and Displaying Conversational Voice Transcripts in a Message-style Format
US12581256B2 (en) * 2022-12-29 2026-03-17 Sonos, Inc. Systems and methods for coordinated playback of analog and digital media content
CN120201152A (zh) * 2023-12-21 2025-06-24 英济股份有限公司 智能会议辅助系统及生成会议纪录的方法
KR102772943B1 (ko) * 2024-03-27 2025-02-27 주식회사 플리토 음성-텍스트 변환을 활용한 번역 서비스 제공 방법, 서버 및 컴퓨터 프로그램
JP7806128B2 (ja) * 2024-05-22 2026-01-26 Nttドコモビジネス株式会社 処理装置、処理方法及び処理プログラム
KR102798832B1 (ko) * 2024-12-24 2025-04-18 권도완 참석자의 인증 및 식별 정보 기반 채널 별 모니터링을 통한 참석자 별 녹취 관리가 가능한 다자간 회의록 자동 생성 솔루션 제공 방법, 장치 및 시스템

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8498725B2 (en) * 2008-11-14 2013-07-30 8X8, Inc. Systems and methods for distributed conferencing
US20110246172A1 (en) 2010-03-30 2011-10-06 Polycom, Inc. Method and System for Adding Translation in a Videoconference
US8797380B2 (en) * 2010-04-30 2014-08-05 Microsoft Corporation Accelerated instant replay for co-present and distributed meetings
US8482593B2 (en) * 2010-05-12 2013-07-09 Blue Jeans Network, Inc. Systems and methods for scalable composition of media streams for real-time multimedia communication
JP2012208630A (ja) 2011-03-29 2012-10-25 Mizuho Information & Research Institute Inc 発言管理システム、発言管理方法及び発言管理プログラム
JP5677901B2 (ja) 2011-06-29 2015-02-25 みずほ情報総研株式会社 議事録作成システム及び議事録作成方法
US9245254B2 (en) 2011-12-01 2016-01-26 Elwha Llc Enhanced voice conferencing with history, language translation and identification
US9110891B2 (en) * 2011-12-12 2015-08-18 Google Inc. Auto-translation for multi user audio and video
EP2795884A4 (en) * 2011-12-20 2015-07-29 Nokia Corp Audio conferencing
US20150314454A1 (en) * 2013-03-15 2015-11-05 JIBO, Inc. Apparatus and methods for providing a persistent companion device
US9774743B2 (en) * 2013-03-29 2017-09-26 Hewlett-Packard Development Company, L.P. Silence signatures of audio signals
NO341316B1 (no) * 2013-05-31 2017-10-09 Pexip AS Fremgangsmåte og system for å assosiere en ekstern enhet til en videokonferansesesjon.
US9094453B2 (en) * 2013-11-06 2015-07-28 Google Technology Holdings LLC Method and apparatus for associating mobile devices using audio signature detection
JP6580362B2 (ja) 2014-04-24 2019-09-25 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカPanasonic Intellectual Property Corporation of America 会議決定方法およびサーバ装置
EP3198855A1 (en) * 2014-09-26 2017-08-02 Intel Corporation Techniques for enhancing user experience in video conferencing
US10468051B2 (en) * 2015-05-09 2019-11-05 Sugarcrm Inc. Meeting assistant
US9947364B2 (en) * 2015-09-16 2018-04-17 Google Llc Enhancing audio using multiple recording devices
US20180054507A1 (en) * 2016-08-19 2018-02-22 Circle River, Inc. Artificial Intelligence Communication with Caller and Real-Time Transcription and Manipulation Thereof
KR102257181B1 (ko) * 2016-09-13 2021-05-27 매직 립, 인코포레이티드 감각 안경류
US10417349B2 (en) 2017-06-14 2019-09-17 Microsoft Technology Licensing, Llc Customized multi-device translated and transcribed conversations
JP7046546B2 (ja) 2017-09-28 2022-04-04 株式会社野村総合研究所 会議支援システムおよび会議支援プログラム
US10552546B2 (en) * 2017-10-09 2020-02-04 Ricoh Company, Ltd. Speech-to-text conversion for interactive whiteboard appliances in multi-language electronic meetings
US11030585B2 (en) * 2017-10-09 2021-06-08 Ricoh Company, Ltd. Person detection, person identification and meeting start for interactive whiteboard appliances
US10553208B2 (en) * 2017-10-09 2020-02-04 Ricoh Company, Ltd. Speech-to-text conversion for interactive whiteboard appliances using multiple services
US10956875B2 (en) * 2017-10-09 2021-03-23 Ricoh Company, Ltd. Attendance tracking, presentation files, meeting services and agenda extraction for interactive whiteboard appliances
US10743107B1 (en) * 2019-04-30 2020-08-11 Microsoft Technology Licensing, Llc Synchronization of audio signals from distributed devices

Also Published As

Publication number Publication date
KR20220002326A (ko) 2022-01-06
ZA202106434B (en) 2022-11-30
US20200349230A1 (en) 2020-11-05
AU2020265509A1 (en) 2021-10-07
BR112021020017A2 (pt) 2021-12-14
CA3132837A1 (en) 2020-11-05
JP7536789B2 (ja) 2024-08-20
MX2021013237A (es) 2021-11-17
SG11202109373TA (en) 2021-09-29
JP2022532313A (ja) 2022-07-14
WO2020222925A1 (en) 2020-11-05
CN113874936A (zh) 2021-12-31
IL287494A (en) 2021-12-01
EP3963574A1 (en) 2022-03-09
US11023690B2 (en) 2021-06-01
EP3963574B1 (en) 2025-08-20

Similar Documents

Publication Publication Date Title
PH12021552746A1 (en) Customized output to optimize for user preference in a distributed system
WO2021118531A8 (en) Systems and methods for local automated speech-to-text processing
EP3373292A3 (en) Method for controlling artificial intelligence system that performs multilingual processing
WO2019014591A8 (en) Blockchain-based data processing method and device
MX2016015432A (es) Traduccion durante una llamada.
SG11201901441QA (en) Information processing apparatus, speech recognition system, and information processing method
EP4668097A3 (en) Implementations for voice assistant on devices
EP4617851A3 (en) Dynamic computation of system response volume
DE602004022787D1 (de) Verteiltes spracherkennungsverfahren
EP4654184A3 (en) Hub device, multi-device system including the hub device and plurality of devices, and method of operating the same
MX2019013630A (es) Sistemas y metodos para proporcionar audio y datos en tiempo real con referencia reciproca a aplicaciones relacionadas.
MX2015013070A (es) Sistemas y metodos para dialogo de personajes sinteticos interactivos.
EP4586657A3 (en) Secure provisioning and management of devices
EP4333481A3 (en) Apparatus and method
WO2018217165A3 (en) SYSTEM AND METHOD FOR IMPLEMENTING CENTRALIZED PERSONALIZABLE OPERATING SOLUTION
EP3862931A3 (en) Gesture feedback in distributed neural network system
MY174199A (en) Data processor and transport of user control data to audio decoders and renderers
WO2016009444A3 (en) Music performance system and method thereof
EP3793134A3 (en) Satellite volume control
MX2019010344A (es) Sistema de control de acceso con ajuste dinamico del funcionamiento.
MX2021004983A (es) Sistemas y metodos para modificacion de busqueda.
EP4243013A3 (en) Method, apparatus and computer-readable media for touch and speech interface with audio location
WO2020084587A3 (en) Passive fitting techniques
BR112023014966A2 (pt) Sistemas e métodos de tratamento de interrupções de fluxo de áudio de fala
EP3982358A3 (en) Whisper conversion for private conversations