KR20230110518A - 콘텍스트-기반 모델 선택 - Google Patents

콘텍스트-기반 모델 선택 Download PDF

Info

Publication number
KR20230110518A
KR20230110518A KR1020237016581A KR20237016581A KR20230110518A KR 20230110518 A KR20230110518 A KR 20230110518A KR 1020237016581 A KR1020237016581 A KR 1020237016581A KR 20237016581 A KR20237016581 A KR 20237016581A KR 20230110518 A KR20230110518 A KR 20230110518A
Authority
KR
South Korea
Prior art keywords
model
context
data
sec
processors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
KR1020237016581A
Other languages
English (en)
Korean (ko)
Inventor
파테메 사키
인이 궈
에릭 비저
Original Assignee
퀄컴 인코포레이티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 퀄컴 인코포레이티드 filed Critical 퀄컴 인코포레이티드
Publication of KR20230110518A publication Critical patent/KR20230110518A/ko
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/34Network arrangements or protocols for supporting network services or applications involving the movement of software or configuration parameters 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/025Services making use of location information using location based information parameters
    • H04W4/027Services making use of location information using location based information parameters using movement velocity, acceleration information

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • General Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • User Interface Of Digital Computer (AREA)
  • Electrotherapy Devices (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
KR1020237016581A 2020-11-24 2021-11-19 콘텍스트-기반 모델 선택 Pending KR20230110518A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/102,748 2020-11-24
US17/102,748 US12198057B2 (en) 2020-11-24 2020-11-24 Context-based model selection
PCT/US2021/072521 WO2022115839A1 (en) 2020-11-24 2021-11-19 Context-based model selection

Publications (1)

Publication Number Publication Date
KR20230110518A true KR20230110518A (ko) 2023-07-24

Family

ID=79024726

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020237016581A Pending KR20230110518A (ko) 2020-11-24 2021-11-19 콘텍스트-기반 모델 선택

Country Status (7)

Country Link
US (2) US12198057B2 (https=)
EP (1) EP4252230A1 (https=)
JP (1) JP7840941B2 (https=)
KR (1) KR20230110518A (https=)
CN (1) CN116601703A (https=)
TW (1) TW202232362A (https=)
WO (1) WO2022115839A1 (https=)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2025023469A1 (ko) * 2023-07-26 2025-01-30 삼성전자주식회사 시각 정보를 고려해서 인공지능 에이전트를 동작하는 장치 및 방법

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20220111078A (ko) * 2021-02-01 2022-08-09 삼성전자주식회사 전자 장치, 사운드 입출력 기기를 포함하는 시스템 및 그 제어 방법
US20220391685A1 (en) * 2021-06-02 2022-12-08 Arm Limited System, devices and/or processes for augmenting artificial intelligence agent and computing devices
JP7548147B2 (ja) * 2021-07-19 2024-09-10 トヨタ自動車株式会社 配送車両
JP7666430B2 (ja) * 2022-07-15 2025-04-22 トヨタ自動車株式会社 車両用情報処理装置、車両用情報処理システム及び車両用情報処理方法
US12293773B2 (en) * 2022-11-03 2025-05-06 Robert Bosch Gmbh Automatically selecting a sound recognition model for an environment based on audio data and image data associated with the environment
EP4728443A1 (en) * 2023-06-15 2026-04-22 InterDigital Patent Holdings, Inc. Inferencing model selection and management
TWI887855B (zh) * 2023-11-18 2025-06-21 鴻海精密工業股份有限公司 車輛、車輛降噪方法及系統
DE102023132751A1 (de) * 2023-11-23 2025-05-28 Audi Aktiengesellschaft Verfahren zum Betreiben eines Sprachdialogsystems sowie Sprachdialogsystem
US12542707B2 (en) * 2024-02-22 2026-02-03 Dell Products L.P. Facilitating intelligent concept drift mitigation in advanced communication networks
TWI902207B (zh) * 2024-04-08 2025-10-21 律芯科技股份有限公司 基於虛擬麥克風的車用降噪系統及車用降噪系統訓練方法
US12608422B2 (en) * 2024-07-24 2026-04-21 Robert Bosch Gmbh Video management system and method for audio event search and classification
WO2026053065A1 (en) * 2024-09-03 2026-03-12 Cochlear Limited Linguistic context in hearing device systems

Family Cites Families (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4827521A (en) 1986-03-27 1989-05-02 International Business Machines Corporation Training of markov models used in a speech recognition system
CA2345661A1 (en) 1998-10-02 2000-04-13 International Business Machines Corporation Conversational browser and conversational systems
DE60108373T2 (de) 2001-08-02 2005-12-22 Sony International (Europe) Gmbh Verfahren zur Detektion von Emotionen in Sprachsignalen unter Verwendung von Sprecheridentifikation
US7620547B2 (en) 2002-07-25 2009-11-17 Sony Deutschland Gmbh Spoken man-machine interface with speaker identification
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US20070183604A1 (en) 2006-02-09 2007-08-09 St-Infonox Response to anomalous acoustic environments
US8762143B2 (en) * 2007-05-29 2014-06-24 At&T Intellectual Property Ii, L.P. Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition
US7877335B2 (en) 2007-10-18 2011-01-25 Yahoo! Inc. System and method for learning a network of categories using prediction
US8788270B2 (en) 2009-06-16 2014-07-22 University Of Florida Research Foundation, Inc. Apparatus and method for determining an emotion state of a speaker
US8600743B2 (en) 2010-01-06 2013-12-03 Apple Inc. Noise profile determination for voice-related feature
US8381107B2 (en) 2010-01-13 2013-02-19 Apple Inc. Adaptive audio feedback system and method
US9165556B1 (en) 2012-02-01 2015-10-20 Predictive Business Intelligence, LLC Methods and systems related to audio data processing to provide key phrase notification and potential cost associated with the key phrase
US9575963B2 (en) 2012-04-20 2017-02-21 Maluuba Inc. Conversational agent
US8463648B1 (en) 2012-05-04 2013-06-11 Pearl.com LLC Method and apparatus for automated topic extraction used for the creation and promotion of new categories in a consultation system
US20140074466A1 (en) 2012-09-10 2014-03-13 Google Inc. Answering questions using environmental context
US10134401B2 (en) 2012-11-21 2018-11-20 Verint Systems Ltd. Diarization using linguistic labeling
US9449613B2 (en) 2012-12-06 2016-09-20 Audeme Llc Room identification using acoustic features in a recording
US10013483B2 (en) 2014-01-30 2018-07-03 Microsoft Technology Licensing, Llc System and method for identifying trending topics in a social network
US9466316B2 (en) 2014-02-06 2016-10-11 Otosense Inc. Device, method and system for instant real time neuro-compatible imaging of a signal
US10410630B2 (en) 2014-06-19 2019-09-10 Robert Bosch Gmbh System and method for speech-enabled personalized operation of devices and services in multiple operating environments
US10073673B2 (en) 2014-07-14 2018-09-11 Samsung Electronics Co., Ltd. Method and system for robust tagging of named entities in the presence of source or translation errors
US9412361B1 (en) 2014-09-30 2016-08-09 Amazon Technologies, Inc. Configuring system operation using image data
US9643511B2 (en) 2014-12-17 2017-05-09 Samsung Electronics Co., Ltd. Method and apparatus for estimating state of charge (SOC) of battery in electric vehicle
JP5956624B1 (ja) 2015-02-02 2016-07-27 西日本高速道路エンジニアリング四国株式会社 異常音の検出方法及びその検出値を用いた構造物の異常判定方法、並びに、振動波の類似度検出方法及びその検出値を用いた音声認識方法
US10720240B2 (en) * 2015-02-26 2020-07-21 Koninklijke Philips N.V. Context detection for medical monitoring
US10482184B2 (en) 2015-03-08 2019-11-19 Google Llc Context-based natural language processing
JP6556575B2 (ja) 2015-09-15 2019-08-07 株式会社東芝 音声処理装置、音声処理方法及び音声処理プログラム
US9847000B2 (en) 2015-10-29 2017-12-19 Immersion Corporation Ambient triggered notifications for rendering haptic effects
US9946862B2 (en) 2015-12-01 2018-04-17 Qualcomm Incorporated Electronic device generating notification based on context data in response to speech phrase from user
US10026401B1 (en) 2015-12-28 2018-07-17 Amazon Technologies, Inc. Naming devices via voice commands
US10902043B2 (en) * 2016-01-03 2021-01-26 Gracenote, Inc. Responding to remote media classification queries using classifier models and context parameters
US10373612B2 (en) 2016-03-21 2019-08-06 Amazon Technologies, Inc. Anchored speech detection and speech recognition
US10304444B2 (en) 2016-03-23 2019-05-28 Amazon Technologies, Inc. Fine-grained natural language understanding
WO2017187712A1 (ja) 2016-04-26 2017-11-02 株式会社ソニー・インタラクティブエンタテインメント 情報処理装置
US10026405B2 (en) 2016-05-03 2018-07-17 SESTEK Ses velletisim Bilgisayar Tekn. San. Ve Tic A.S. Method for speaker diarization
JP7006592B2 (ja) * 2016-06-16 2022-01-24 日本電気株式会社 信号処理装置、信号処理方法および信号処理プログラム
CN109997186B (zh) * 2016-09-09 2021-10-15 华为技术有限公司 一种用于分类声环境的设备和方法
US10705683B2 (en) 2016-10-31 2020-07-07 Microsoft Technology Licensing, Llc Changing visual aspects of a graphical user interface to bring focus to a message
EP3545374B1 (en) 2016-11-23 2024-11-06 Alarm.com Incorporated Detection of authorized user presence and handling of unauthenticated monitoring system commands
US10713703B2 (en) 2016-11-30 2020-07-14 Apple Inc. Diversity in media item recommendations
DK179496B1 (en) * 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
EP3424046B1 (en) * 2017-05-12 2020-07-08 Apple Inc. User-specific acoustic models
US10235128B2 (en) * 2017-05-19 2019-03-19 Intel Corporation Contextual sound filter
US10311454B2 (en) 2017-06-22 2019-06-04 NewVoiceMedia Ltd. Customer interaction and experience system using emotional-semantic computing
WO2019070328A1 (en) * 2017-10-04 2019-04-11 Google Llc METHODS AND SYSTEMS FOR AUTOMATICALLY EQUALIZING AUDIO OUTPUT BASED ON THE CHARACTERISTICS OF THE PART
US10360482B1 (en) * 2017-12-04 2019-07-23 Amazon Technologies, Inc. Crowd-sourced artificial intelligence image processing services
US10481858B2 (en) 2017-12-06 2019-11-19 Harman International Industries, Incorporated Generating personalized audio content based on mood
US10832009B2 (en) 2018-01-02 2020-11-10 International Business Machines Corporation Extraction and summarization of decision elements from communications
CN116738284A (zh) 2018-01-25 2023-09-12 意法半导体公司 通过感测瞬态事件和连续事件的智能装置的情境感知
US11011162B2 (en) * 2018-06-01 2021-05-18 Soundhound, Inc. Custom acoustic models
US11195534B1 (en) * 2020-03-30 2021-12-07 Amazon Technologies, Inc. Permissioning for natural language processing systems

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2025023469A1 (ko) * 2023-07-26 2025-01-30 삼성전자주식회사 시각 정보를 고려해서 인공지능 에이전트를 동작하는 장치 및 방법

Also Published As

Publication number Publication date
US12198057B2 (en) 2025-01-14
JP7840941B2 (ja) 2026-04-06
US20250103888A1 (en) 2025-03-27
JP2023550336A (ja) 2023-12-01
WO2022115839A1 (en) 2022-06-02
US20220164662A1 (en) 2022-05-26
TW202232362A (zh) 2022-08-16
CN116601703A (zh) 2023-08-15
EP4252230A1 (en) 2023-10-04

Similar Documents

Publication Publication Date Title
US20250103888A1 (en) Context-based model selection
JP7757405B2 (ja) 適応型サウンドイベント分類
KR102932652B1 (ko) 추론 연산을 수행하는 전자 장치
US11991253B2 (en) Intelligent layer to power cross platform, edge-cloud hybrid artificial intelligence services
US10657963B2 (en) Method and system for processing user command to provide and adjust operation of electronic device by analyzing presentation of user speech
US10692492B2 (en) Techniques for client-side speech domain detection using gyroscopic data and a system using the same
US11995561B2 (en) Universal client API for AI services
EP4066243B1 (en) Sound event detection learning
US10573299B2 (en) Digital assistant and associated methods for a transportation vehicle
KR20230110512A (ko) 사운드 이벤트 분류를 위한 전이 학습
JP6619488B2 (ja) 人工知能機器における連続会話機能
CN105573128B (zh) 用户装置及其驱动方法、提供服务的设备及其驱动方法
KR102795306B1 (ko) 학습 처리 시스템, 로컬 파라미터 개수 결정 장치 및 방법
CN115515068B (zh) 电子设备及其分布式实现方法和介质
CN115698949A (zh) 用于ai服务的通用客户端api
US11296942B1 (en) Relative device placement configuration
CN110400561A (zh) 用于交通工具的方法和系统
WO2020207316A1 (zh) 设备资源配置方法、装置、存储介质及电子设备
US12229343B2 (en) System, device and method for real time gesture prediction

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20230516

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20241104

Comment text: Request for Examination of Application