TW202232362A - 基於上下文的模型選擇 - Google Patents

基於上下文的模型選擇 Download PDF

Info

Publication number
TW202232362A
TW202232362A TW110143343A TW110143343A TW202232362A TW 202232362 A TW202232362 A TW 202232362A TW 110143343 A TW110143343 A TW 110143343A TW 110143343 A TW110143343 A TW 110143343A TW 202232362 A TW202232362 A TW 202232362A
Authority
TW
Taiwan
Prior art keywords
model
context
data
sec
processors
Prior art date
Application number
TW110143343A
Other languages
English (en)
Chinese (zh)
Inventor
法特梅 薩基
郭銀怡
艾里克 維瑟
Original Assignee
美商高通公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 美商高通公司 filed Critical 美商高通公司
Publication of TW202232362A publication Critical patent/TW202232362A/zh

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/34Network arrangements or protocols for supporting network services or applications involving the movement of software or configuration parameters 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/025Services making use of location information using location based information parameters
    • H04W4/027Services making use of location information using location based information parameters using movement velocity, acceleration information

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • General Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • User Interface Of Digital Computer (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Electrotherapy Devices (AREA)
TW110143343A 2020-11-24 2021-11-22 基於上下文的模型選擇 TW202232362A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US17/102,748 US12198057B2 (en) 2020-11-24 2020-11-24 Context-based model selection
US17/102,748 2020-11-24

Publications (1)

Publication Number Publication Date
TW202232362A true TW202232362A (zh) 2022-08-16

Family

ID=79024726

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110143343A TW202232362A (zh) 2020-11-24 2021-11-22 基於上下文的模型選擇

Country Status (7)

Country Link
US (2) US12198057B2 (https=)
EP (1) EP4252230A1 (https=)
JP (1) JP7840941B2 (https=)
KR (1) KR20230110518A (https=)
CN (1) CN116601703A (https=)
TW (1) TW202232362A (https=)
WO (1) WO2022115839A1 (https=)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI887855B (zh) * 2023-11-18 2025-06-21 鴻海精密工業股份有限公司 車輛、車輛降噪方法及系統
TWI902207B (zh) * 2024-04-08 2025-10-21 律芯科技股份有限公司 基於虛擬麥克風的車用降噪系統及車用降噪系統訓練方法

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20220111078A (ko) * 2021-02-01 2022-08-09 삼성전자주식회사 전자 장치, 사운드 입출력 기기를 포함하는 시스템 및 그 제어 방법
US20220391685A1 (en) * 2021-06-02 2022-12-08 Arm Limited System, devices and/or processes for augmenting artificial intelligence agent and computing devices
JP7548147B2 (ja) * 2021-07-19 2024-09-10 トヨタ自動車株式会社 配送車両
JP7666430B2 (ja) * 2022-07-15 2025-04-22 トヨタ自動車株式会社 車両用情報処理装置、車両用情報処理システム及び車両用情報処理方法
US12293773B2 (en) * 2022-11-03 2025-05-06 Robert Bosch Gmbh Automatically selecting a sound recognition model for an environment based on audio data and image data associated with the environment
WO2024259213A1 (en) * 2023-06-15 2024-12-19 Convida Wireless, Llc Inferencing model selection and management
WO2025023469A1 (ko) * 2023-07-26 2025-01-30 삼성전자주식회사 시각 정보를 고려해서 인공지능 에이전트를 동작하는 장치 및 방법
DE102023132751A1 (de) * 2023-11-23 2025-05-28 Audi Aktiengesellschaft Verfahren zum Betreiben eines Sprachdialogsystems sowie Sprachdialogsystem
US12542707B2 (en) * 2024-02-22 2026-02-03 Dell Products L.P. Facilitating intelligent concept drift mitigation in advanced communication networks
US12608422B2 (en) * 2024-07-24 2026-04-21 Robert Bosch Gmbh Video management system and method for audio event search and classification
WO2026053065A1 (en) * 2024-09-03 2026-03-12 Cochlear Limited Linguistic context in hearing device systems

Family Cites Families (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4827521A (en) 1986-03-27 1989-05-02 International Business Machines Corporation Training of markov models used in a speech recognition system
CN100472500C (zh) 1998-10-02 2009-03-25 联想(新加坡)私人有限公司 会话浏览器和会话系统
EP1282113B1 (en) 2001-08-02 2005-01-12 Sony International (Europe) GmbH Method for detecting emotions from speech using speaker identification
US7620547B2 (en) 2002-07-25 2009-11-17 Sony Deutschland Gmbh Spoken man-machine interface with speaker identification
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US20070183604A1 (en) 2006-02-09 2007-08-09 St-Infonox Response to anomalous acoustic environments
US8762143B2 (en) * 2007-05-29 2014-06-24 At&T Intellectual Property Ii, L.P. Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition
US7877335B2 (en) 2007-10-18 2011-01-25 Yahoo! Inc. System and method for learning a network of categories using prediction
US8788270B2 (en) 2009-06-16 2014-07-22 University Of Florida Research Foundation, Inc. Apparatus and method for determining an emotion state of a speaker
US8600743B2 (en) 2010-01-06 2013-12-03 Apple Inc. Noise profile determination for voice-related feature
US8381107B2 (en) 2010-01-13 2013-02-19 Apple Inc. Adaptive audio feedback system and method
US9165556B1 (en) 2012-02-01 2015-10-20 Predictive Business Intelligence, LLC Methods and systems related to audio data processing to provide key phrase notification and potential cost associated with the key phrase
US9575963B2 (en) 2012-04-20 2017-02-21 Maluuba Inc. Conversational agent
US8463648B1 (en) 2012-05-04 2013-06-11 Pearl.com LLC Method and apparatus for automated topic extraction used for the creation and promotion of new categories in a consultation system
US20140074466A1 (en) 2012-09-10 2014-03-13 Google Inc. Answering questions using environmental context
US10134401B2 (en) 2012-11-21 2018-11-20 Verint Systems Ltd. Diarization using linguistic labeling
US9449613B2 (en) 2012-12-06 2016-09-20 Audeme Llc Room identification using acoustic features in a recording
US10013483B2 (en) 2014-01-30 2018-07-03 Microsoft Technology Licensing, Llc System and method for identifying trending topics in a social network
WO2015120184A1 (en) 2014-02-06 2015-08-13 Otosense Inc. Instant real time neuro-compatible imaging of signals
WO2015196063A1 (en) 2014-06-19 2015-12-23 Robert Bosch Gmbh System and method for speech-enabled personalized operation of devices and services in multiple operating environments
US10073673B2 (en) 2014-07-14 2018-09-11 Samsung Electronics Co., Ltd. Method and system for robust tagging of named entities in the presence of source or translation errors
US9412361B1 (en) 2014-09-30 2016-08-09 Amazon Technologies, Inc. Configuring system operation using image data
US9643511B2 (en) 2014-12-17 2017-05-09 Samsung Electronics Co., Ltd. Method and apparatus for estimating state of charge (SOC) of battery in electric vehicle
JP5956624B1 (ja) 2015-02-02 2016-07-27 西日本高速道路エンジニアリング四国株式会社 異常音の検出方法及びその検出値を用いた構造物の異常判定方法、並びに、振動波の類似度検出方法及びその検出値を用いた音声認識方法
WO2016135069A1 (en) * 2015-02-26 2016-09-01 Koninklijke Philips N.V. Context detection for medical monitoring
US10482184B2 (en) 2015-03-08 2019-11-19 Google Llc Context-based natural language processing
JP6556575B2 (ja) 2015-09-15 2019-08-07 株式会社東芝 音声処理装置、音声処理方法及び音声処理プログラム
US9847000B2 (en) 2015-10-29 2017-12-19 Immersion Corporation Ambient triggered notifications for rendering haptic effects
US9946862B2 (en) 2015-12-01 2018-04-17 Qualcomm Incorporated Electronic device generating notification based on context data in response to speech phrase from user
US10026401B1 (en) 2015-12-28 2018-07-17 Amazon Technologies, Inc. Naming devices via voice commands
US10902043B2 (en) * 2016-01-03 2021-01-26 Gracenote, Inc. Responding to remote media classification queries using classifier models and context parameters
US10373612B2 (en) 2016-03-21 2019-08-06 Amazon Technologies, Inc. Anchored speech detection and speech recognition
US10304444B2 (en) 2016-03-23 2019-05-28 Amazon Technologies, Inc. Fine-grained natural language understanding
WO2017187712A1 (ja) 2016-04-26 2017-11-02 株式会社ソニー・インタラクティブエンタテインメント 情報処理装置
US10026405B2 (en) 2016-05-03 2018-07-17 SESTEK Ses velletisim Bilgisayar Tekn. San. Ve Tic A.S. Method for speaker diarization
JP7006592B2 (ja) * 2016-06-16 2022-01-24 日本電気株式会社 信号処理装置、信号処理方法および信号処理プログラム
CN109997186B (zh) * 2016-09-09 2021-10-15 华为技术有限公司 一种用于分类声环境的设备和方法
US10705683B2 (en) 2016-10-31 2020-07-07 Microsoft Technology Licensing, Llc Changing visual aspects of a graphical user interface to bring focus to a message
EP3545374B1 (en) 2016-11-23 2024-11-06 Alarm.com Incorporated Detection of authorized user presence and handling of unauthenticated monitoring system commands
US10713703B2 (en) 2016-11-30 2020-07-14 Apple Inc. Diversity in media item recommendations
EP3905242A1 (en) * 2017-05-12 2021-11-03 Apple Inc. User-specific acoustic models
DK179496B1 (en) * 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
US10235128B2 (en) * 2017-05-19 2019-03-19 Intel Corporation Contextual sound filter
US10311454B2 (en) 2017-06-22 2019-06-04 NewVoiceMedia Ltd. Customer interaction and experience system using emotional-semantic computing
WO2019070328A1 (en) * 2017-10-04 2019-04-11 Google Llc METHODS AND SYSTEMS FOR AUTOMATICALLY EQUALIZING AUDIO OUTPUT BASED ON THE CHARACTERISTICS OF THE PART
US10360482B1 (en) * 2017-12-04 2019-07-23 Amazon Technologies, Inc. Crowd-sourced artificial intelligence image processing services
US10481858B2 (en) 2017-12-06 2019-11-19 Harman International Industries, Incorporated Generating personalized audio content based on mood
US10832009B2 (en) 2018-01-02 2020-11-10 International Business Machines Corporation Extraction and summarization of decision elements from communications
CN110276235B (zh) 2018-01-25 2023-06-16 意法半导体公司 通过感测瞬态事件和连续事件的智能装置的情境感知
US11011162B2 (en) * 2018-06-01 2021-05-18 Soundhound, Inc. Custom acoustic models
US11195534B1 (en) * 2020-03-30 2021-12-07 Amazon Technologies, Inc. Permissioning for natural language processing systems

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI887855B (zh) * 2023-11-18 2025-06-21 鴻海精密工業股份有限公司 車輛、車輛降噪方法及系統
TWI902207B (zh) * 2024-04-08 2025-10-21 律芯科技股份有限公司 基於虛擬麥克風的車用降噪系統及車用降噪系統訓練方法

Also Published As

Publication number Publication date
KR20230110518A (ko) 2023-07-24
WO2022115839A1 (en) 2022-06-02
JP7840941B2 (ja) 2026-04-06
US20250103888A1 (en) 2025-03-27
JP2023550336A (ja) 2023-12-01
US20220164662A1 (en) 2022-05-26
CN116601703A (zh) 2023-08-15
EP4252230A1 (en) 2023-10-04
US12198057B2 (en) 2025-01-14

Similar Documents

Publication Publication Date Title
US12198057B2 (en) Context-based model selection
EP4252231B1 (en) Adaptive sound event classification
US10869154B2 (en) Location-based personal audio
US11995561B2 (en) Universal client API for AI services
US9842490B2 (en) System and method of controlling external apparatus connected with device
WO2020029906A1 (zh) 一种多人语音的分离方法和装置
US11096112B2 (en) Electronic device for setting up network of external device and method for operating same
KR20050007429A (ko) 이동 장치의 음성 인식 개선
US20220164667A1 (en) Transfer learning for sound event classification
US11874876B2 (en) Electronic device and method for predicting an intention of a user
EP4396654B1 (en) Building augmented reality experiences with iot devices
US20250371824A1 (en) Two-way control of iot devices using ar camera
US20250348267A1 (en) Machine learning based voice control for audio device
US20200349960A1 (en) Method for embedding and executing audio semantics
US12175746B2 (en) Controlling IoT devices through AR object interaction
WO2020180590A1 (en) Systems and methods for augmented reality content harvesting and information extraction
US11941231B2 (en) Camera interfaces to interact with IoT devices