JP7840941B2 - コンテキストベースのモデル選択 - Google Patents

コンテキストベースのモデル選択

Info

Publication number
JP7840941B2
JP7840941B2 JP2023528472A JP2023528472A JP7840941B2 JP 7840941 B2 JP7840941 B2 JP 7840941B2 JP 2023528472 A JP2023528472 A JP 2023528472A JP 2023528472 A JP2023528472 A JP 2023528472A JP 7840941 B2 JP7840941 B2 JP 7840941B2
Authority
JP
Japan
Prior art keywords
model
context
data
sec
processors
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2023528472A
Other languages
English (en)
Japanese (ja)
Other versions
JP2023550336A (ja
JP2023550336A5 (https=
Inventor
サキ、ファテメー
グオ、インイー
ビッサー、エリック
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of JP2023550336A publication Critical patent/JP2023550336A/ja
Publication of JP2023550336A5 publication Critical patent/JP2023550336A5/ja
Application granted granted Critical
Publication of JP7840941B2 publication Critical patent/JP7840941B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/082Learning methods modifying the architecture, e.g. adding, deleting or silencing nodes or connections
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/34Network arrangements or protocols for supporting network services or applications involving the movement of software or configuration parameters 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02Services making use of location information
    • H04W4/025Services making use of location information using location based information parameters
    • H04W4/027Services making use of location information using location based information parameters using movement velocity, acceleration information

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • General Physics & Mathematics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • User Interface Of Digital Computer (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Electrotherapy Devices (AREA)
JP2023528472A 2020-11-24 2021-11-19 コンテキストベースのモデル選択 Active JP7840941B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US17/102,748 US12198057B2 (en) 2020-11-24 2020-11-24 Context-based model selection
US17/102,748 2020-11-24
PCT/US2021/072521 WO2022115839A1 (en) 2020-11-24 2021-11-19 Context-based model selection

Publications (3)

Publication Number Publication Date
JP2023550336A JP2023550336A (ja) 2023-12-01
JP2023550336A5 JP2023550336A5 (https=) 2024-10-29
JP7840941B2 true JP7840941B2 (ja) 2026-04-06

Family

ID=79024726

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2023528472A Active JP7840941B2 (ja) 2020-11-24 2021-11-19 コンテキストベースのモデル選択

Country Status (7)

Country Link
US (2) US12198057B2 (https=)
EP (1) EP4252230A1 (https=)
JP (1) JP7840941B2 (https=)
KR (1) KR20230110518A (https=)
CN (1) CN116601703A (https=)
TW (1) TW202232362A (https=)
WO (1) WO2022115839A1 (https=)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20220111078A (ko) * 2021-02-01 2022-08-09 삼성전자주식회사 전자 장치, 사운드 입출력 기기를 포함하는 시스템 및 그 제어 방법
US20220391685A1 (en) * 2021-06-02 2022-12-08 Arm Limited System, devices and/or processes for augmenting artificial intelligence agent and computing devices
JP7548147B2 (ja) * 2021-07-19 2024-09-10 トヨタ自動車株式会社 配送車両
JP7666430B2 (ja) * 2022-07-15 2025-04-22 トヨタ自動車株式会社 車両用情報処理装置、車両用情報処理システム及び車両用情報処理方法
US12293773B2 (en) * 2022-11-03 2025-05-06 Robert Bosch Gmbh Automatically selecting a sound recognition model for an environment based on audio data and image data associated with the environment
WO2024259213A1 (en) * 2023-06-15 2024-12-19 Convida Wireless, Llc Inferencing model selection and management
WO2025023469A1 (ko) * 2023-07-26 2025-01-30 삼성전자주식회사 시각 정보를 고려해서 인공지능 에이전트를 동작하는 장치 및 방법
TWI887855B (zh) * 2023-11-18 2025-06-21 鴻海精密工業股份有限公司 車輛、車輛降噪方法及系統
DE102023132751A1 (de) * 2023-11-23 2025-05-28 Audi Aktiengesellschaft Verfahren zum Betreiben eines Sprachdialogsystems sowie Sprachdialogsystem
US12542707B2 (en) * 2024-02-22 2026-02-03 Dell Products L.P. Facilitating intelligent concept drift mitigation in advanced communication networks
TWI902207B (zh) * 2024-04-08 2025-10-21 律芯科技股份有限公司 基於虛擬麥克風的車用降噪系統及車用降噪系統訓練方法
US12608422B2 (en) * 2024-07-24 2026-04-21 Robert Bosch Gmbh Video management system and method for audio event search and classification
WO2026053065A1 (en) * 2024-09-03 2026-03-12 Cochlear Limited Linguistic context in hearing device systems

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140303972A1 (en) 2007-05-29 2014-10-09 At&T Intellectual Property Ii, L.P. Method and Apparatus for Identifying Acoustic Background Environments Based on Time and Speed to Enhance Automatic Speech Recognition
WO2017217412A1 (ja) 2016-06-16 2017-12-21 日本電気株式会社 信号処理装置、信号処理方法およびコンピュータ読み取り可能記録媒体
US20180330737A1 (en) 2017-05-12 2018-11-15 Apple Inc. User-specific acoustic models
US20180336000A1 (en) 2017-05-19 2018-11-22 Intel Corporation Contextual sound filter
US20190206418A1 (en) 2016-09-09 2019-07-04 Huawei Technologies Co., Ltd. Device and a method for classifying an acoustic environment

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4827521A (en) 1986-03-27 1989-05-02 International Business Machines Corporation Training of markov models used in a speech recognition system
CN100472500C (zh) 1998-10-02 2009-03-25 联想(新加坡)私人有限公司 会话浏览器和会话系统
EP1282113B1 (en) 2001-08-02 2005-01-12 Sony International (Europe) GmbH Method for detecting emotions from speech using speaker identification
US7620547B2 (en) 2002-07-25 2009-11-17 Sony Deutschland Gmbh Spoken man-machine interface with speaker identification
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
US20070183604A1 (en) 2006-02-09 2007-08-09 St-Infonox Response to anomalous acoustic environments
US7877335B2 (en) 2007-10-18 2011-01-25 Yahoo! Inc. System and method for learning a network of categories using prediction
US8788270B2 (en) 2009-06-16 2014-07-22 University Of Florida Research Foundation, Inc. Apparatus and method for determining an emotion state of a speaker
US8600743B2 (en) 2010-01-06 2013-12-03 Apple Inc. Noise profile determination for voice-related feature
US8381107B2 (en) 2010-01-13 2013-02-19 Apple Inc. Adaptive audio feedback system and method
US9165556B1 (en) 2012-02-01 2015-10-20 Predictive Business Intelligence, LLC Methods and systems related to audio data processing to provide key phrase notification and potential cost associated with the key phrase
US9575963B2 (en) 2012-04-20 2017-02-21 Maluuba Inc. Conversational agent
US8463648B1 (en) 2012-05-04 2013-06-11 Pearl.com LLC Method and apparatus for automated topic extraction used for the creation and promotion of new categories in a consultation system
US20140074466A1 (en) 2012-09-10 2014-03-13 Google Inc. Answering questions using environmental context
US10134401B2 (en) 2012-11-21 2018-11-20 Verint Systems Ltd. Diarization using linguistic labeling
US9449613B2 (en) 2012-12-06 2016-09-20 Audeme Llc Room identification using acoustic features in a recording
US10013483B2 (en) 2014-01-30 2018-07-03 Microsoft Technology Licensing, Llc System and method for identifying trending topics in a social network
WO2015120184A1 (en) 2014-02-06 2015-08-13 Otosense Inc. Instant real time neuro-compatible imaging of signals
WO2015196063A1 (en) 2014-06-19 2015-12-23 Robert Bosch Gmbh System and method for speech-enabled personalized operation of devices and services in multiple operating environments
US10073673B2 (en) 2014-07-14 2018-09-11 Samsung Electronics Co., Ltd. Method and system for robust tagging of named entities in the presence of source or translation errors
US9412361B1 (en) 2014-09-30 2016-08-09 Amazon Technologies, Inc. Configuring system operation using image data
US9643511B2 (en) 2014-12-17 2017-05-09 Samsung Electronics Co., Ltd. Method and apparatus for estimating state of charge (SOC) of battery in electric vehicle
JP5956624B1 (ja) 2015-02-02 2016-07-27 西日本高速道路エンジニアリング四国株式会社 異常音の検出方法及びその検出値を用いた構造物の異常判定方法、並びに、振動波の類似度検出方法及びその検出値を用いた音声認識方法
WO2016135069A1 (en) * 2015-02-26 2016-09-01 Koninklijke Philips N.V. Context detection for medical monitoring
US10482184B2 (en) 2015-03-08 2019-11-19 Google Llc Context-based natural language processing
JP6556575B2 (ja) 2015-09-15 2019-08-07 株式会社東芝 音声処理装置、音声処理方法及び音声処理プログラム
US9847000B2 (en) 2015-10-29 2017-12-19 Immersion Corporation Ambient triggered notifications for rendering haptic effects
US9946862B2 (en) 2015-12-01 2018-04-17 Qualcomm Incorporated Electronic device generating notification based on context data in response to speech phrase from user
US10026401B1 (en) 2015-12-28 2018-07-17 Amazon Technologies, Inc. Naming devices via voice commands
US10902043B2 (en) * 2016-01-03 2021-01-26 Gracenote, Inc. Responding to remote media classification queries using classifier models and context parameters
US10373612B2 (en) 2016-03-21 2019-08-06 Amazon Technologies, Inc. Anchored speech detection and speech recognition
US10304444B2 (en) 2016-03-23 2019-05-28 Amazon Technologies, Inc. Fine-grained natural language understanding
WO2017187712A1 (ja) 2016-04-26 2017-11-02 株式会社ソニー・インタラクティブエンタテインメント 情報処理装置
US10026405B2 (en) 2016-05-03 2018-07-17 SESTEK Ses velletisim Bilgisayar Tekn. San. Ve Tic A.S. Method for speaker diarization
US10705683B2 (en) 2016-10-31 2020-07-07 Microsoft Technology Licensing, Llc Changing visual aspects of a graphical user interface to bring focus to a message
EP3545374B1 (en) 2016-11-23 2024-11-06 Alarm.com Incorporated Detection of authorized user presence and handling of unauthenticated monitoring system commands
US10713703B2 (en) 2016-11-30 2020-07-14 Apple Inc. Diversity in media item recommendations
EP3905242A1 (en) * 2017-05-12 2021-11-03 Apple Inc. User-specific acoustic models
US10311454B2 (en) 2017-06-22 2019-06-04 NewVoiceMedia Ltd. Customer interaction and experience system using emotional-semantic computing
WO2019070328A1 (en) * 2017-10-04 2019-04-11 Google Llc METHODS AND SYSTEMS FOR AUTOMATICALLY EQUALIZING AUDIO OUTPUT BASED ON THE CHARACTERISTICS OF THE PART
US10360482B1 (en) * 2017-12-04 2019-07-23 Amazon Technologies, Inc. Crowd-sourced artificial intelligence image processing services
US10481858B2 (en) 2017-12-06 2019-11-19 Harman International Industries, Incorporated Generating personalized audio content based on mood
US10832009B2 (en) 2018-01-02 2020-11-10 International Business Machines Corporation Extraction and summarization of decision elements from communications
CN110276235B (zh) 2018-01-25 2023-06-16 意法半导体公司 通过感测瞬态事件和连续事件的智能装置的情境感知
US11011162B2 (en) * 2018-06-01 2021-05-18 Soundhound, Inc. Custom acoustic models
US11195534B1 (en) * 2020-03-30 2021-12-07 Amazon Technologies, Inc. Permissioning for natural language processing systems

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140303972A1 (en) 2007-05-29 2014-10-09 At&T Intellectual Property Ii, L.P. Method and Apparatus for Identifying Acoustic Background Environments Based on Time and Speed to Enhance Automatic Speech Recognition
WO2017217412A1 (ja) 2016-06-16 2017-12-21 日本電気株式会社 信号処理装置、信号処理方法およびコンピュータ読み取り可能記録媒体
US20190206418A1 (en) 2016-09-09 2019-07-04 Huawei Technologies Co., Ltd. Device and a method for classifying an acoustic environment
US20180330737A1 (en) 2017-05-12 2018-11-15 Apple Inc. User-specific acoustic models
US20180336000A1 (en) 2017-05-19 2018-11-22 Intel Corporation Contextual sound filter

Also Published As

Publication number Publication date
KR20230110518A (ko) 2023-07-24
WO2022115839A1 (en) 2022-06-02
US20250103888A1 (en) 2025-03-27
JP2023550336A (ja) 2023-12-01
US20220164662A1 (en) 2022-05-26
CN116601703A (zh) 2023-08-15
EP4252230A1 (en) 2023-10-04
TW202232362A (zh) 2022-08-16
US12198057B2 (en) 2025-01-14

Similar Documents

Publication Publication Date Title
JP7840941B2 (ja) コンテキストベースのモデル選択
JP7757405B2 (ja) 適応型サウンドイベント分類
US10869154B2 (en) Location-based personal audio
CN113377899A (zh) 意图识别方法及电子设备
EP4066243B1 (en) Sound event detection learning
JP6619488B2 (ja) 人工知能機器における連続会話機能
KR20230110512A (ko) 사운드 이벤트 분류를 위한 전이 학습
US11822889B2 (en) Personal conversationalist system
US12114075B1 (en) Object selection in computer vision
CN105573128B (zh) 用户装置及其驱动方法、提供服务的设备及其驱动方法
JP7501523B2 (ja) 情報処理装置、情報処理方法、およびプログラム
US20190163436A1 (en) Electronic device and method for controlling the same
CN117897676A (zh) 与iot设备构建增强现实体验
US11997445B2 (en) Systems and methods for live conversation using hearing devices
CN117916692A (zh) 使用ar摄像头对iot设备的双向控制
US12596961B1 (en) Local device embeddings for automation
KR20250138610A (ko) 맥락에 기반하여 사용자 입력에 대한 응답을 생성하는 방법, 장치 및 기록 매체
KR20240049831A (ko) IoT 디바이스들과 상호 작용하는 카메라 인터페이스들

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20241021

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20241021

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20250926

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20251007

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20251219

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20260303

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20260325

R150 Certificate of patent or registration of utility model

Ref document number: 7840941

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150